Skip to content
View reethj-07's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report reethj-07

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
reethj-07/README.md

Reeth Jain

ML Engineer focused on production GenAI, LLM applications, and MLOps.

I build reliable AI systems that move from experimentation to real-world deployment, with emphasis on retrieval quality, agent reliability, and observable ML infrastructure.

Professional Summary

  • Build and deploy end-to-end LLM systems: ingestion, retrieval, orchestration, serving, monitoring
  • Design RAG pipelines with hybrid retrieval, reranking, and evaluation-driven optimization
  • Develop agentic workflows with tool use, memory, planning, and guardrails
  • Productionize ML with CI/CD, containerized deployment, and cloud-native operations
  • Apply strong engineering standards: testing, observability, reproducibility, and security

Core Competencies

GenAI and LLM Engineering

  • Prompt engineering, model adaptation, and response-quality optimization
  • RAG architecture design with chunking, indexing, reranking, and query routing
  • Agent frameworks for multi-step task execution and tool orchestration
  • LLM evaluation pipelines for quality, latency, and cost trade-offs
  • Multi-modal pipelines for text, image, and audio use cases

Machine Learning and Applied AI

  • NLP: Transformers, embedding systems, semantic search, information extraction
  • Computer vision: detection, segmentation, and classification workflows
  • Model lifecycle: training, validation, packaging, and deployment
  • Inference optimization: batching, quantization, caching, and distillation

MLOps and Platform Engineering

  • Containerized deployment with autoscaling and service reliability practices
  • CI/CD for ML and backend systems with automated checks and rollout safety
  • Observability: logs, metrics, traces, and model/data drift monitoring
  • Infrastructure as code with reproducible environments and versioned pipelines
  • Data and experiment management for auditable model development

Technical Skills

Languages

Python SQL C++

AI/ML Frameworks

PyTorch TensorFlow Scikit-learn Hugging Face OpenCV

LLM and Agent Stack

LangChain LangGraph LlamaIndex OpenAI API Mistral FAISS Whisper RAG Evaluation

Backend and APIs

FastAPI Flask Streamlit REST APIs

Data and Messaging

PostgreSQL Kafka Airflow DVC

DevOps and Infrastructure

Docker Kubernetes Terraform Helm GitHub Actions

Cloud and Observability

AWS Azure GCP MLflow Prometheus Grafana OpenTelemetry

What I Build

Intelligent Agent Systems

  • Multi-agent orchestration with dynamic tool selection and task planning
  • Context-aware agents with conversation memory and stateful workflows
  • Tool-augmented LLM systems for automation, analytics, and developer productivity

Production RAG Platforms

  • End-to-end pipelines for ingestion, indexing, retrieval, reranking, and response synthesis
  • Hybrid retrieval stacks combining dense and sparse strategies
  • Evaluation loops to improve answer quality, grounding, and hallucination resistance

End-to-End ML Applications

  • Real-time inference APIs for latency-sensitive workloads
  • Batch prediction pipelines for high-volume offline processing
  • Vision and NLP applications deployed as reliable services

MLOps Enablement

  • Automated training, validation, and release workflows
  • Model and data monitoring with drift and performance tracking
  • Scalable serving systems with reliability, rollback, and cost controls

Engineering Principles

  • Reliability: strong testing, robust error handling, graceful degradation
  • Scalability: horizontal scaling, caching, efficient resource utilization
  • Observability: actionable metrics, structured logging, distributed tracing
  • Reproducibility: versioned code/data/models and deterministic workflows
  • Security: authentication, rate limiting, and secure secret management
  • Cost awareness: right-sized infrastructure and efficient inference patterns

Contact

Building AI systems that scale from prototype to production.

Pinned Loading

  1. ai-loyalty-service ai-loyalty-service Public

    Python 1

  2. LargeDS_QnA LargeDS_QnA Public

    E-commerce analytics agent on McAuley Amazon-Reviews-2023 (JSONL): CPU embeddings (MiniLM), FAISS vector search, BM25, DuckDB SQL, LangChain/LangGraph multi-agent pipeline, Groq. Includes eval scri…

    Python 1

  3. legal-knowledge-graph legal-knowledge-graph Public

    Python 1

  4. JSO-Agent JSO-Agent Public

    TypeScript