About me

I'm Pijush Pal, a Data Scientist with a Master's in Data Science from the University of New Haven and a Bachelor's in Computer Science from the West Bengal University of Technology. Fueled by a passion for machine learning and AI, I specialize in transforming complex challenges into innovative solutions. My expertise spans supervised and unsupervised learning, as well as Generative AI (GenAI), enabling me to develop and integrate advanced software systems that drive impactful results.

I enjoy leveraging AI and machine learning to solve real-world problems. I am particularly fascinated by Large Language Models (LLMs) and their potential to transform industries, from medical diagnostics to autonomous vehicles. My interests include working on projects involving Retrieval-Augmented Generation (RAG), watermarking algorithms for text and images, and semantic segmentation for self-driving cars. Staying updated with the latest advancements in AI/ML and continuously learning new techniques drives my passion for collaborative research and pushing the boundaries of technology.

What I'm interested in

  • GenAI Engineer

    I design and optimize Retrieval-Augmented Generation (RAG) models to enhance information retrieval and generation processes, driving innovation in AI solutions.

  • Data Scientist

    As a Data Scientist at Welspot.inc, I developed scalable machine learning pipelines, enhancing decision-making processes by improving accuracy and efficiency.

  • ML Engineer

    Experienced Machine Learning Engineer adept at developing scalable ML pipelines and collaborating on AI solutions for diverse applications.

Recommendation

  • Tom John - (CIO / CTO | Hands-On Leader | Strategic Visionary | Technology Architect | Fintech Guru)


    Pijush is a knowledgeable and capable Data Scientist. I hand-selected him for our internship program and placed him into our AI/NLP and Data Science/ML project teams. Pijush was able to contribute and excel in his areas of strength and contribute to both individual and team tasks. Pijush was able to assess problems and overcome challenges by finding effective solutions to the tasks at hand. He will be a valuable asset to any company that he works for in the future.

  • Michael Lederman - (CRO | Strategy | AI/ML | FinTech-Global Banking-Financial Services |)

    Michael Montenegro Lederman - (Chief Risk Officer | Strategy | AI/ML Data Science | Digital Transformation | FinTech-Global Banking-Financial Services |) LinkedIn

    Pijush is a well-rounded data scientist. While completing his internship at WelSpot (FinTech), where I serve as Chief Strategic Risk Officer, as a requirement for the Master's in Data Science at the University of New Haven, Pijush demonstrated strong knowledge and business applicability skills: he built the AI/ML Framework for the firm while understanding the strategic and operational implications that this document has for culture evolution. Pijush has good professional acumen, listening skills, and the initiative to take on AI/ML projects, and he shows keen interest in learning and applying AI/ML techniques in support of the firm's objectives.

Resume

PIJUSH PAL


pijushpl2023@gmail.com | New Haven, CT, USA | linkedin.com/in/-pijush-pal/ | github.com/pijush2022

Summary

Accomplished Data Scientist proficient in machine learning, particularly supervised and unsupervised learning and Generative AI (GenAI). Skilled in Python and C++, with a successful history of creating and integrating sophisticated software systems. Effective communicator dedicated to fostering innovation in AI/ML solutions.

Education

  1. University of New Haven

    2023 — 2024

    Master of Science in Data Science

    GPA - (3.60/4.00)

  2. West Bengal University of Technology

    2014 — 2018

    Bachelor of Technology in Computer Science and Engineering

    GPA - (3.33/4.00)

Experience

  1. AI Engineer - Mangoes.ai (New York)

    April 2024 — Present

    Design, implement, and optimize Retrieval-Augmented Generation (RAG) models.
    • Enhance information retrieval and generation processes, integrate machine learning techniques, and fine-tune models for high performance.
    • Handle data preprocessing, model training, evaluation, and deployment, and collaborate with cross-functional teams to deliver cutting-edge AI solutions. The role calls for expertise in natural language processing and experience with large-scale data systems.

  2. Data Scientist - Welspot.inc (Miami, Florida)

    January 2024 — May 2024

    Developed machine learning pipelines for the Credit Engine Model, from data preprocessing to training and inference, contributing to a 30% efficiency improvement in data processing.
    • Enhanced decision-making through ML models, including A/B testing and performance optimization, resulting in a 25% increase in accuracy and a 20% reduction in decision-making time.
    • Stayed updated with the latest ML research, applying insights to address business challenges effectively.

  3. ML Engineer - University of New Haven (West Haven)

    January 2023 — July 2023

    Engineered a decision-making and assessment platform for global postgraduate admissions, slashing applicant evaluation time from 20 minutes to just 2 minutes.
    • Designed bespoke machine learning algorithms and oversaw a 15-person data science unit to enhance performance metrics tailored to industry standards.
    • Spearheaded the development of a Streamlit application and predictive model within the ML team, achieving

Skills

    1. Programming Languages: Python, C/C++.
    2. MLOps Tools: MLflow, GitHub Actions, CI/CD, DVC, BentoML.
    3. Frameworks: PyTorch, Keras, TensorFlow.
    4. Domain: ML Algorithms, Deep Learning, Statistics, NLP, EDA.
    5. Scraping Tools: Selenium, Beautiful Soup.
    6. Audio: TTS.
    7. NLP Framework: RAG, GAN, Transformers.
    8. Database: SQL, MongoDB, Chroma DB, Pinecone, Neo4j.
    9. Cloud Platforms: AWS SageMaker, Elastic Beanstalk, EC2, S3, ECR, EMR, Snowflake, Lambda.
    10. Deployment Framework: Docker, LangChain, Hugging Face, Kubernetes.
    11. LLMs: GPT, LLaMA, Claude, Gemini Pro, Mistral, Whisper, Prompt Engineering.
    12. Robotics: Autonomous Vehicle.
    13. Microsoft Excel.

Research Work

  • Artificial Intelligence Institute of South Carolina, Columbia, SC, USA (April 2024 - Present)

    Research Intern


    Designing and implementing a watermarking model tailored for text and images.

    • Leveraging state-of-the-art Large Language Models (LLMs) to analyze and extract meaningful patterns from textual content.
    • Collaborating with a multidisciplinary team to refine and optimize the watermarking algorithm for real-world applications.



Achievements

    1. Solar Eclipse Research Team Member


    • Collaborated with NASA at UNH on atmospheric research, developing altitude-controlled weather balloons.

    2. Teaching Assistant (TA) for Machine Learning Coursework


    • As a Teaching Assistant for a Machine Learning course, I assist in lectures, tutorials, and labs, and hold office hours to provide additional support and answer student questions. My responsibilities include grading assignments and exams, developing course materials, guiding students on their projects, and ensuring effective communication between the instructor and students.

Portfolio (Project)