Hi, I'm |
ML Platform Engineer & AI Systems Architect
I design, build scalable and resilient machine learning and LLM platforms that power intelligent applications at enterprise scale. With experience across ML lifecycles, RAG apps, and agentic tools, I transform prototypes into production-ready AI systems.

About Me
With 5 years of experience in ML/DL model deployment, RESTful APIs, Docker, Kubernetes, and cloud platforms (Azure, GCP, Databricks), I am a proactive and results-driven machine learning engineer passionate about building highly scalable and resilient ML infrastructure. My expertise lies in CI/CD automation, container orchestration, Infrastructure as Code, and developing low-latency, high-performance inference systems to enable data scientists with seamless deployments and robust, production-grade workflows.
Azure
GCP
Databricks
Docker
Kubernetes
GitHub
Professional Experience
5+ years of building scalable ML systems and platforms
Machine Learning Engineer (MLOps)
Key Achievements
- Haleon GenAI Assistant: Architected and developed an enterprise-grade, multi-team GenAI platform integrating FastAPI, React, Azure OpenAI and Azure Cloud Services to enable secure, intelligent data access across Finance, Procurement, and Help Center from scratch.
- Built team-specific AI agents and modular tool frameworks using LangChain and LangGraph for contextual data retrieval.
- Enhanced retrieval quality through semantic ranker with multilingual support and hybrid search using text and vector search in Azure AI Search.
- Engineered a scalable, structure-aware document processing pipeline using Azure AI Document Intelligence with automated GitHub Actions workflows, extracting structured layouts and entities, performing chunking, ingesting and managing documents in Azure Blob Storage, and updating vector and keyword indexes in Azure AI Search for real-time, high-relevance retrieval and RAG-ready content delivery.
- Experimented with MCP (Model Context Protocol) client–server architecture for distributed tool orchestration across databases and AI services.
- Optimized chat architecture for sub-second latency through asynchronous processing, containerized deployment, and scalable microservice design.
- Prometheus: Implemented end-to-end MLOps framework using Databricks Bundles, Unity Catalog, and GitHub Actions. Built robust CI/CD workflows with linting, bundle validation, pre and post semantic versioning, and automated Databricks job orchestration across Dev/UAT/Prod. Enabled dynamic, market and brand specific task generation at runtime, versioned model and data persistence in Unity Catalog, reproducible deployments via explicit semantic release tags, with outputs seamlessly refreshed in Power BI for stakeholders.
- Federated Help Center Search: Architected and delivered a federated search solution across 15+ brand help center URLs, providing search results from multiple brand websites and help centers across different markets, languages, and brands.
- Large-scale web crawling from 500+ brand websites (including help centers) across multiple countries, markets, domains, and languages. Automated scraping using Scrapy, structure-aware information extraction from leaflets using Azure AI Document Intelligence, data ingestion to Azure Blob Storage, and updating vector and keyword indexes in Azure AI Search for real-time, high-relevance retrieval and RAG-ready content delivery.
- Providing robust search, autocomplete, and suggestion APIs from Azure AI Search directly to the frontend team for seamless integration.
- For internal testing using GenAI assistant: Added Playground mode with custom model selection, temperature control, and system prompt configuration, including full prompt visibility alongside search results for enhanced experimentation and debugging.
Federated Search API DemoInteractive demo of Autocomplete, Suggestions, and Search APIs Integrated across 15+ help centers providing results from brand websites
Federated Search DemoScroll into view to see animation - Procurement Contract Analysis: Led ML-Ops for a RAG-based (Agentic-RAG) system by productionizing modular workflows with Databricks Asset Bundles, integrating Azure Document Intelligence, Unity Catalog, and model serving endpoints along with automated deployment of Databricks App. Built CI/CD pipelines with GitHub Actions for automated, rollback-ready workflows, deployments, and GitOps-based orchestration.
- Asset Vision: Designed, implemented a robust CI/CD pipeline with GitHub Actions to build and publish Docker images to GitHub Container Registry (GHCR) and Azure Container Registry (ACR), enabling automated deployment to Azure Web App and microservices on Kubernetes cluster for scalable, secure and reliable application management across environments. Applied same approach to streamline deployments in other RAG based project.
- MLOps Template for Databricks: Improved and implemented a comprehensive MLOps template for Databricks leveraging GitHub Actions to automate CI/CD pipelines, including code linting, testing, environment setup, and seamless deployment of Spark jobs, such as model training, validation, and deployment, resulting in significantly enhanced workflow efficiency and reduced deployment times.
🏆Awards & Recognition
Global Employee Recognition
BronzeUnlock Value at PaceDec 2025Rajesh has 1168 GitHub contributions this year, which is impeccable. He led MLE efforts and delivered successful deployments for Help Centre Federated Search, Prometheus, and Contract Analyzer projects. He presented the Contract Analyzer project on AI Day, receiving significant recognition across the Data Office and CDO.
Global Employee Recognition
BronzeGo BeyondNov 2024Outstanding ML Engineer with major contributions to GenAI initiatives including Help Centre Federated Search and RAG-based chatbots using Azure and Databricks. Significant impact on MLOps through Prometheus and MLOps template contributions. Recognized as a strong team player, actively supporting deployments and leading discussions across MLOps forums.
Global Employee Recognition
Collaborate for ImpactSep 2025Exceptional contribution and energy during Career Day and Technovation Day. Presented Contract Analyzer and Help Centre Federated Search projects, receiving widespread appreciation across the Data Office and CDO.
2025 Excellence Award – India Capability Centre
Simplify for ImpactWin as OneDec 2025Helped bring Haleon’s “Win as One” strategy to life. 2025 Excellence Award – India Capability Centre for Build Contract AI – A synergy of Procurement Analytics and Data Office, revolutionising contract intelligence and management using Agentic-RAG.
⚙️Technologies Used
Machine Learning Engineer (MLOps)
Key Achievements
- CI/CD Infrastructure: Robust CI/CD pipelines using Cloud Build for seamless and automated deployment of microservices and batch services on GKE, Cloud Functions, and Cloud Run, significantly reducing deployment errors, manual effort, and turnaround times across environments. Collaborated with DS team to optimise code and model inference for higher throughput.
- EdgeDB Inference Store: Architected, deployed and upgraded Inference store (EdgeDB on Google Kubernetes Engine with provisioned PostgreSQL instance as backend) with FastAPI endpoints deployed in GKE. API employed across various DS Services for efficient caching and retrieval of inferences in real time (latency around 5ms), as well as in batch processing jobs with traffic coming from Queue (PubSub) or an Orchestrator (Airflow). Integrated CI/CD pipelines for automated deployments in GCP (Upgrade Flow).
- Feast Feature Store: Efficiently deployed Feast feature-server on Google Kubernetes Engine, leveraging an automated materialization pipeline for real-time data updates in Redis (Online Store) and BigQuery (Offline Store) via Cloud Functions triggered by a diverse range of team data uploads in a GCS bucket. API effectively addresses the challenge of accessing up-to-date features from CSV files and currently employed in 10+ DS services, delivering features with minimal latency (<10ms). CI/CD enabled for robust rollouts.
- Data Flow Pipeline: Built data flow pipeline to optimize the timely data sync between Snowflake and Google Sheets using Airflow, Pub-Sub, EdgeDB and GKE. Syncing data for approx 2500 Sheets with roughly 15000 Jobs/day. (blog) (citation by Modern Data Stack, citation by Data Engineering Weekly).
- Airflow Orchestration Platform: Deployed, administered, upgraded a robust Airflow infrastructure on Google Kubernetes Engine, overseeing 70+ diverse git synchronised DAGs running approx 1000+ times daily, covering a total of 3800+ tasks executions to support various DS-BI teams. Efficiently managed the execution of tasks, encompassing Cloud Function triggering, job scheduling, PubSub messaging, EdgeDB integration for jobs status update, and Snowflake-to-Google Sheets data synchronization and seamless integration with diverse GCP services. Implemented RBAC to enhance team visibility, established automated log cleanup with ensuring a streamlined, well-maintained system.
- NBFC Infrastructure Migration: Spearheaded migration of NBFC DS infrastructure from AWS to GCP, breaking down monolith into scalable sub-monoliths and GKE-based microservices. Decoupled DB operations, slashing response times from 3 seconds to 503ms (99th percentile), eliminated overall timeouts, previously at 8-9%. Implemented streamlined data flow via Pub/Sub and Cloud Functions, enabling efficient EdgeDB updates and hourly Snowflake synchronization. CI/CD pipelines orchestrated for seamless deployment of microservices on GKE and respective cloud functions.
- Triton Python SDK: Developed a pip-installable, reusable Python wheel package for seamless integration with Triton Inference Server, supporting high-performance model inference by handling both HTTP and GRPC requests in synchronous and asynchronous modes, with built-in logging, error handling, and configuration flexibility.
- Request-Response Store: Designed, deployed high-performance APIs Request-Response Store using FastAPI, featuring client authentication, rate limiting and low 5ms latency. Orchestrated storage of request-response in EdgeDB, synchronized to Snowflake based on team-defined frequencies, and leveraged by 10+ DS services.
- Customer Interaction Analytics: Designed and implemented a cloud-based workflow for end-to-end processing of customer interactions data. The workflow encompasses data ingestion from Snowflake, audio download, transcription generation, information extraction, and automated customer summary generation. This system is orchestrated through Airflow and leverages Google Cloud Functions, Cloud Storage, Kserve InferenceService, Triton, MongoDB and EdgeDB.
- Data Platform R&D: Experimenting with deploying Doris, StarRocks, and associated open-source tools on GKE to construct a scalable and high-performance data platform, incorporating S3-compatible object storage via MINIO/GCS, implementing access control and data retention policies, and ensuring effective Kubernetes cluster monitoring and alerting through Grafana's dashboards utilizing Prometheus as a primary time-series data source.
- Audio Embeddings Search: Experimented Minio and Milvus integration to extract audio embeddings from Minio, enabling efficient search for similar content, Redis for metadata storage.
🏆Awards & Recognition
Rookie Award – Appreciation Certificate
Special RecognitionQ3 2022This certificate is proudly awarded to Rajesh Dhanda (ML Engg – Global) for outstanding performance in Quarter 3, 2022.
Signatories: VP – DS and BI, COO
⚙️Technologies Used
Computer Vision Software Engineer
Key Achievements
- Implemented a reference direction-based entry-exit system and Object Counting Feature using DeepSort, showcased in video.
- Conducted TensorFlow Lite model conversion of DeepSort and executed its deployment on Raspberry Pi hardware.
⚙️Technologies Used
Artificial Intelligence Engineer Intern
Key Achievements
- Developed algorithm for many-to-many mapping of symptoms to diseases, enabling disease diagnosis based on symptom sets.
- Created mathematical formulation to facilitate symptom recommendation, improving user experience and healthcare decision-making.
- Generated random datasets to rigorously test and optimize ML algorithms for recommendation systems.
⚙️Technologies Used
Career Highlights
5+ years
Total Experience
4
Companies Worked
5+
Team Members Led
Skills & Technologies
Comprehensive expertise in modern cloud technologies, ML platforms, and scalable system architecture
Cloud Platforms & Services
Enterprise cloud infrastructure and AI services
Container & Orchestration
Containerization and deployment automation
Data & ML Platforms
Machine learning and data processing platforms
API & Development
Backend development and API architecture
Competition Achievements
Proven track record in machine learning competitions and hackathons
HackerEarth ML Challenge - Marketing Optimization
Achieved Rank 1 in HackerEarth Machine Learning Challenge focused on optimizing marketing expenditure by leveraging predictive analytics.
HackerEarth ML Challenge - Wind Power Prediction
Secured Rank 5 in HackerEarth Machine Learning Challenge centered around predicting power generation on windy days using data-driven approaches.
GHF Hackathon - Credit Risk Prediction
Attained the 21st position in the GHF Hackathon organized by Univ.AI, demonstrating proficiency in credit risk prediction and financial modeling.
Excellence in competitive programming and machine learning challenges
Published Application
Production-ready native Linux application on Ubuntu Snap Store
CanvasNote
A comprehensive native Linux note-taking and drawing application optimized for 2-in-1/tablet devices with stylus support, featuring intelligent palm rejection, multiple input methods, and professional organization tools.
Why I Built This
I use an ASUS ROG Flow Z13 (2022) with touchscreen and stylus on Linux. Since I couldn't find a Linux note-taking app that handled stylus input and palm rejection reliably, I built one tailored to my needs and shared it on the Snap Store for others to use.
Ubuntu Snap Store
Key Features
Technical Highlights
Architecture
Built with GTK4 and libadwaita for native Linux desktop integration with Cairo graphics rendering
Input Handling
Advanced evdev-based palm rejection with automatic stylus detection and pressure sensitivity
Deployment
Packaged with Snapcraft for secure, sandboxed distribution with automatic updates
Academic Projects
Research and development projects from IIT Kanpur
Sparse Linear Regression with Advanced Optimization
Implemented sparse linear regression utilizing Accelerated Proximal Gradient Descent and Stochastic Coordinate Gradient Descent optimization techniques.
Technologies Used
Multiclass Code Repair Classification
Developed multiclass classification system for code repair using Bag of Words representations with Learning with Prototypes, One vs All, and Decision Tree methods.
Technologies Used
8 Programming Languages in 8 Weeks
Comprehensive programming challenge covering 8 different programming languages in 8 weeks, exploring various paradigms and developing foundational understanding.
Technologies Used
ANITA Experiment Data Analysis
Analyzed radio pulses and surface topography in ANITA experiment, creating 3D visualizations of mathematical formulations and experimental data using Python.
Technologies Used
Projects completed at Indian Institute of Technology, Kanpur
Academic Journey
From foundational learning to advanced research, a comprehensive educational pathway spanning prestigious institutions and competitive achievements.
Educational Background
| Year | Degree/Certificate | Institute | Description |
|---|---|---|---|
| 2012 | CBSE Class X | Jawahar Navodaya Vidyalaya, Bhiwani | Secondary education with Science and Mathematics |
| 2014 | CBSE Class XII | Jawahar Navodaya Vidyalaya, Bhiwani | Higher secondary with Science and Mathematics |
| 2015-2018 | B.Sc Physics Honours | Maharshi Dayanand University, Rohtak | Undergraduate Programme Specialised in Physics |
| 2018-2020 | Master's in Physics | Indian Institute of Technology Kanpur | Postgraduate Specialised in Computational Physics |
Academic Excellence
Research Entrance Exams
JEST 2018
CSIR-NET 2018
CSIR-NET 2019
Competitive Exams
IIT JAM 2018
IIT JEE Advanced 2015
GATE CSE 2021
Professional Certifications
Machine Learning & AI Certifications
Industry-recognized specialized courses
Collection of ML/AI certifications covering deep learning, data science, cloud platforms from leading tech companies and educational institutions.
Leadership & Responsibilities
Orientation Team Member
2019Assisted in coordinating with the Counseling Service and facilitating the Hall Allocation process and Orientation Program of 2019.
Hall Election Officer
2019-20Successfully organized and executed hall elections in Hall of Residence X, resulting in the formation of HEC for academic year 2019-20.
Get In Touch
Ready to transform ideas into reality? I'm passionate about collaborating on innovative ML projects, building scalable infrastructure, and solving complex technical challenges. Let's create something extraordinary together.
