Welcome to my GitHub! I'm Neha—a data enthusiast and applied AI/ML engineer who loves building things - LLM applications, RAG-style retrieval systems, automation workflows, and end-to-end deployable ML pipelines. With a background in Computer Science & Engineering and hands-on experience in data science, machine learning, and cloud technologies, I’m passionate about designing workflows, scaling data pipelines in Python, and turning messy data into intelligent, user-facing systems that solve real problems.
Currently, I’m pursuing my Master’s in Business Analytics at BU Questrom, specializing in Data & Methods, where I focus on applied machine learning, deep learning, NLP, and experimentation. I’m fortunate to be supported by the Director’s Achievement Scholarship and am actively engaged in case competitions and collaborative analytics projects. Most of my work sits at the intersection of Python, LLMs, Airflow, GCP, and scalable ML pipelines, always centered around creating reliable, deployable systems
What I’m working on now:
- LLM-powered RAG pipelines (summarization, semantic search, retrieval, feature extraction)
- Airflow + GCP + BigQuery ETL pipelines for credit-risk intelligence (daily records, idempotent DAGs, automated validation)
I love roles where I can experiment, automate, and build cool things - whether it’s running controlled experiments, designing ML-powered features, or creating intelligent workflows with LLMs. If you're working in the space of experimentation, applied ML, AI systems, or data-driven automation, I’d be happy to connect.
- 🌟 Programming: Python (Pandas, Scikit-learn, TensorFlow, PyTorch, Statsmodels, HuggingFace), SQL, Git
- ML & Product Analytics: A/B Testing, Causal Inference, Experimentation Design, Customer Segmentation, Churn Prediction, Recommender Systems
- LLM & GenAI: Transformers, BERT, RoBERTa, Prompt Engineering, Fine-tuning, Retrieval-Augmented Generation (RAG), LangChain
- Big Data & Cloud: Hadoop, Apache Spark, AWS, GCP, Google BigQuery
- Production & Monitoring: MLflow, Model A/B Testing, Feature Stores, Data Validation, Pipeline Orchestration (Airflow, Prefect)
- Tools: Tableau, Power BI, Git, Jupyter Notebooks, Excel
Outside the world of data:
- I’m a trained singer and was part of a rock band
- I have a huge sweet tooth (chocolate and ice cream lover!)
- Love exploring new places and cultures
📧 Email: nehags@bu.edu
💼 LinkedIn: linkedin.com/in/nehagshastri/
Thanks for stopping by! Feel free to explore my repositories or drop a message if you'd like to collaborate!



