Hi there! I'm Ashhar Farooqui, a passionate Data Scientist & ML Engineer based in Bengaluru, India. My journey is one of curiosity-driven discovery, where I transform complex datasets into compelling narratives that drive business decisions and solve real-world problems.
"Data is the new oil, but insights are the engine."
My career has been defined by one core mission: extracting meaning from chaos. Whether it's predicting market trends, uncovering patterns in time series data, or building intelligent systems that learn and adapt, I believe that every dataset has a story to tell. My role is to be that storyteller, armed with Python, machine learning, and a relentless curiosity.
- Solving Complex Problems with machine learning and statistical rigor
- Telling Data Stories that influence strategic decisions
- Building Scalable Solutions that create real-world impact
- Continuous Learning in the ever-evolving AI/ML landscape
- Collaborating with Teams to deliver data-driven products
Oil Price Prediction
Forecasting crude oil prices using machine learning and time series models
The Challenge: Can we predict volatile commodity prices with acceptable accuracy?
The Solution:
- Built an ensemble of 5 models (ARIMA, SARIMA, Prophet, LSTM, XGBoost)
- Achieved 97% R² score with ±$2-3/barrel prediction accuracy
- Created an interactive Streamlit dashboard for real-time insights
- Implemented automated retraining pipelines for model freshness
Key Metrics:
- RMSE: 1.65 | MAE: 1.23 | MAPE: 1.72%
- Successfully captures 89% of price direction changes
- Backtested on 12 months with 93% out-of-sample accuracy
Tech Stack: Python, TensorFlow/Keras, XGBoost, ARIMA, Prophet, Streamlit, Plotly
View Repository | Read Full Documentation
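For readers unfamiliar with the metrics quoted above, here is a minimal sketch of how RMSE, MAE, MAPE, R², and directional accuracy are commonly computed for a price forecast. It is not the repository's actual evaluation code: the function name `forecast_metrics`, the sample prices, and the definition of directional accuracy (whether the forecast moves the same way from the previous actual price as the real price did) are all assumptions made for illustration.

```python
import math


def forecast_metrics(actual, predicted):
    """Return common forecast error metrics for two equal-length price series.

    Hypothetical helper for illustration; assumes all actual prices are
    non-zero (needed for MAPE) and not all identical (needed for R^2).
    """
    if len(actual) != len(predicted) or len(actual) < 2:
        raise ValueError("need two equal-length series with at least 2 points")

    n = len(actual)
    errors = [a - p for a, p in zip(actual, predicted)]

    # Root mean squared error and mean absolute error, in price units.
    ss_res = sum(e * e for e in errors)
    rmse = math.sqrt(ss_res / n)
    mae = sum(abs(e) for e in errors) / n

    # Mean absolute percentage error, as a percentage of the actual price.
    mape = 100.0 * sum(abs(e / a) for e, a in zip(errors, actual)) / n

    # Coefficient of determination: 1 - residual variance / total variance.
    mean_actual = sum(actual) / n
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    r2 = 1.0 - ss_res / ss_tot

    # Directional accuracy (assumed definition): share of steps where the
    # forecast and the real price move the same way from the previous actual.
    hits = 0
    for t in range(1, n):
        actual_up = actual[t] - actual[t - 1] > 0
        predicted_up = predicted[t] - actual[t - 1] > 0
        if actual_up == predicted_up:
            hits += 1
    directional_accuracy = 100.0 * hits / (n - 1)

    return {
        "rmse": rmse,
        "mae": mae,
        "mape": mape,
        "r2": r2,
        "directional_accuracy": directional_accuracy,
    }


# Made-up prices in $/barrel, purely to exercise the function.
sample_actual = [70.0, 72.0, 71.0, 74.0]
sample_predicted = [71.0, 71.0, 73.0, 73.0]
metrics = forecast_metrics(sample_actual, sample_predicted)
```

Note that an R² near 1 on raw price levels can coexist with weak direction calls, which is why a directional measure is usually reported alongside the error metrics.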
- Supervised Learning: Regression, Classification, Ensemble Methods
- Unsupervised Learning: Clustering, Dimensionality Reduction, Anomaly Detection
- Time Series Analysis: ARIMA, SARIMA, Exponential Smoothing, Prophet, LSTM
- Deep Learning: Neural Networks, CNNs, RNNs, LSTMs, Transformers
- ETL Pipelines: Design, development, and optimization
- Big Data Processing: Spark, Hadoop, distributed computing
- Statistical Analysis: Hypothesis testing, A/B testing, Bayesian methods
- Data Quality: Validation, profiling, and governance
- Text Classification: Sentiment analysis, toxicity detection, topic modeling
- Language-specific NLP: Multilingual support (English, Hindi, Bengali)
- Sequence-to-sequence Models: Translation, summarization
- Information Extraction: Named entity recognition, relation extraction
- Dashboard Development: Tableau, Power BI, Streamlit
- KPI Monitoring: Real-time analytics and alerting
- Data Storytelling: Translating insights into business narratives
- Predictive Analytics: Forecasting and trend analysis
| Achievement | Details |
|---|---|
| Prediction Accuracy | 97% R² score on complex financial forecasting |
| Performance Optimization | 70% reduction in analysis time through automation |
| Data Quality | 99.2% detection rate for data anomalies |
| NLP Excellence | 99.41% accuracy on multi-class text classification |
| Research Impact | Contributed to epidemiological modeling for policy decisions |
| Scalability | Built systems handling 100M+ records with sub-second latency |
I'm passionate about continuous learning and staying at the forefront of data science innovation:
- Regular reader of arXiv papers on ML/AI
- Coursework in Advanced Statistics, Deep Learning, Time Series Analysis
- Active contributor to open-source projects
- Enthusiastic about technical blogging and knowledge sharing
- Mentor in ML communities, helping aspiring data scientists
I'm always excited to discuss:
- Novel approaches to complex data problems
- Building scalable ML systems in production
- Data storytelling and visualization techniques
- Latest research in AI and machine learning
- Real-world applications of data science
| Platform | Link |
|---|---|
| Email | ashhar.farooqui07@gmail.com |
| LinkedIn | linkedin.com/in/ashhar-farooqui |
| GitHub | github.com/ashharfarooqui |
| Twitter | @ashhar_farooqui |
| Portfolio | ashharfarooqui.com |
I'm currently exploring:
- Large Language Models and their applications
- Generative AI for data synthesis and augmentation
- Federated Learning for privacy-preserving ML
- Causal Inference for understanding root causes
- Edge AI for real-time, on-device predictions
- Graph Neural Networks for relational data
I believe in giving back to the community:
- Active contributor to scikit-learn, TensorFlow, and Pandas projects
- Maintainer of several ML utility libraries
- Creator of data science templates and boilerplates
- Author of tutorials on advanced ML topics
┌────────────────────────────────────────────────────┐
│  Data → Questions → Exploration → Modeling         │
│                                          ↓         │
│  Validation ← Interpretation ← Results ← Training  │
│     ↓                                              │
│  Insights → Storytelling → Action → Impact         │
└────────────────────────────────────────────────────┘
- Curiosity First: Always ask "why" before jumping to solutions
- Data Integrity: Garbage in, garbage out; prioritize quality
- Interpretability: Black boxes are the enemy; explain your models
- Reproducibility: Make your work auditable and repeatable
- Impact-Driven: Every model should solve a real problem
- Continuous Iteration: V1 is never the final version
- ML Best Practices Guide: Industry-standard approaches
- Time Series Playbook: Comprehensive forecasting guide
- Data Visualization Cookbook: Creating impactful visuals
- NLP Starter Kit: Text analysis from scratch
- Production ML Checklist: Deploying models reliably
- Specialization: Time Series Analysis & Forecasting
- Experience: 5+ years in Data Science & ML Engineering
- Location: Bengaluru, India
- Goal: Building intelligent systems that solve real problems
- Growing Interest: Generative AI, MLOps, Causal Inference
- Fun Fact: I debug code with coffee and classical music!
If you're interested in:
- Collaboration on data science projects
- Discussing job opportunities
- Learning from my experience
- Building something amazing together
Feel free to reach out! I'm always up for interesting conversations and exciting challenges.
