Hi, I'm |

ML Platform Engineer & AI Systems Architect

I design, build scalable and resilient machine learning and LLM platforms that power intelligent applications at enterprise scale. With experience across ML lifecycles, RAG apps, and agentic tools, I transform prototypes into production-ready AI systems.

Rajesh Dhanda - ML Platform Engineer

About Me

With 5 years of experience in ML/DL model deployment, RESTful APIs, Docker, Kubernetes, and cloud platforms (Azure, GCP, Databricks), I am a proactive and results-driven machine learning engineer passionate about building highly scalable and resilient ML infrastructure. My expertise lies in CI/CD automation, container orchestration, Infrastructure as Code, and developing low-latency, high-performance inference systems to enable data scientists with seamless deployments and robust, production-grade workflows.

Microsoft Azure

Azure

Google Cloud Platform

GCP

Databricks

Databricks

Docker

Docker

Kubernetes

Kubernetes

GitHub

Professional Experience

5+ years of building scalable ML systems and platforms

Machine Learning Engineer (MLOps)

HALEONApril 2024 - PresentBengaluru, India
Productionizing DS Services, Infrastructure, and Pipelines on Azure and Databricks for scalable and reliable deployment.

Key Achievements

  • Haleon GenAI Assistant: Architected and developed an enterprise-grade, multi-team GenAI platform integrating FastAPI, React, Azure OpenAI and Azure Cloud Services to enable secure, intelligent data access across Finance, Procurement, and Help Center from scratch.
    • Built team-specific AI agents and modular tool frameworks using LangChain and LangGraph for contextual data retrieval.
    • Enhanced retrieval quality through semantic ranker with multilingual support and hybrid search using text and vector search in Azure AI Search.
    • Engineered a scalable, structure-aware document processing pipeline using Azure AI Document Intelligence with automated GitHub Actions workflows, extracting structured layouts and entities, performing chunking, ingesting and managing documents in Azure Blob Storage, and updating vector and keyword indexes in Azure AI Search for real-time, high-relevance retrieval and RAG-ready content delivery.
    • Experimented with MCP (Model Context Protocol) client–server architecture for distributed tool orchestration across databases and AI services.
    • Optimized chat architecture for sub-second latency through asynchronous processing, containerized deployment, and scalable microservice design.
  • Prometheus: Implemented end-to-end MLOps framework using Databricks Bundles, Unity Catalog, and GitHub Actions. Built robust CI/CD workflows with linting, bundle validation, pre and post semantic versioning, and automated Databricks job orchestration across Dev/UAT/Prod. Enabled dynamic, market and brand specific task generation at runtime, versioned model and data persistence in Unity Catalog, reproducible deployments via explicit semantic release tags, with outputs seamlessly refreshed in Power BI for stakeholders.
  • Federated Help Center Search: Architected and delivered a federated search solution across 15+ brand help center URLs, providing search results from multiple brand websites and help centers across different markets, languages, and brands.
    • Large-scale web crawling from 500+ brand websites (including help centers) across multiple countries, markets, domains, and languages. Automated scraping using Scrapy, structure-aware information extraction from leaflets using Azure AI Document Intelligence, data ingestion to Azure Blob Storage, and updating vector and keyword indexes in Azure AI Search for real-time, high-relevance retrieval and RAG-ready content delivery.
    • Providing robust search, autocomplete, and suggestion APIs from Azure AI Search directly to the frontend team for seamless integration.
    • For internal testing using GenAI assistant: Added Playground mode with custom model selection, temperature control, and system prompt configuration, including full prompt visibility alongside search results for enhanced experimentation and debugging.
    Federated Search API Demo

    Interactive demo of Autocomplete, Suggestions, and Search APIs Integrated across 15+ help centers providing results from brand websites

    Federated Search Demo
    Scroll into view to see animation
  • Procurement Contract Analysis: Led ML-Ops for a RAG-based (Agentic-RAG) system by productionizing modular workflows with Databricks Asset Bundles, integrating Azure Document Intelligence, Unity Catalog, and model serving endpoints along with automated deployment of Databricks App. Built CI/CD pipelines with GitHub Actions for automated, rollback-ready workflows, deployments, and GitOps-based orchestration.
  • Asset Vision: Designed, implemented a robust CI/CD pipeline with GitHub Actions to build and publish Docker images to GitHub Container Registry (GHCR) and Azure Container Registry (ACR), enabling automated deployment to Azure Web App and microservices on Kubernetes cluster for scalable, secure and reliable application management across environments. Applied same approach to streamline deployments in other RAG based project.
  • MLOps Template for Databricks: Improved and implemented a comprehensive MLOps template for Databricks leveraging GitHub Actions to automate CI/CD pipelines, including code linting, testing, environment setup, and seamless deployment of Spark jobs, such as model training, validation, and deployment, resulting in significantly enhanced workflow efficiency and reduced deployment times.

🏆Awards & Recognition

  • Global Employee Recognition
    BronzeUnlock Value at PaceDec 2025

    Rajesh has 1168 GitHub contributions this year, which is impeccable. He led MLE efforts and delivered successful deployments for Help Centre Federated Search, Prometheus, and Contract Analyzer projects. He presented the Contract Analyzer project on AI Day, receiving significant recognition across the Data Office and CDO.

  • Global Employee Recognition
    BronzeGo BeyondNov 2024

    Outstanding ML Engineer with major contributions to GenAI initiatives including Help Centre Federated Search and RAG-based chatbots using Azure and Databricks. Significant impact on MLOps through Prometheus and MLOps template contributions. Recognized as a strong team player, actively supporting deployments and leading discussions across MLOps forums.

  • Global Employee Recognition
    Collaborate for ImpactSep 2025

    Exceptional contribution and energy during Career Day and Technovation Day. Presented Contract Analyzer and Help Centre Federated Search projects, receiving widespread appreciation across the Data Office and CDO.

  • 2025 Excellence Award – India Capability Centre
    Simplify for ImpactWin as OneDec 2025

    Helped bring Haleon’s “Win as One” strategy to life. 2025 Excellence Award – India Capability Centre for Build Contract AI – A synergy of Procurement Analytics and Data Office, revolutionising contract intelligence and management using Agentic-RAG.

⚙️Technologies Used

AzureDatabricksGitHub ActionsDockerKubernetesUnity CatalogAzure AI SearchAzure OpenAISparkPythonFastAPIReactLangChainAzure Cognitive SearchMCPAzure Document Intelligence

Machine Learning Engineer (MLOps)

CARS24Sept 2021 - April 2024Gurgaon, India
Productionizing DS/BI/Data-engg/Infras/Pipelines on GCP (GKE, Kserve, Triton Inference Server, PostgreSQL, Airflow and VertexAI).

Key Achievements

  • CI/CD Infrastructure: Robust CI/CD pipelines using Cloud Build for seamless and automated deployment of microservices and batch services on GKE, Cloud Functions, and Cloud Run, significantly reducing deployment errors, manual effort, and turnaround times across environments. Collaborated with DS team to optimise code and model inference for higher throughput.
  • EdgeDB Inference Store: Architected, deployed and upgraded Inference store (EdgeDB on Google Kubernetes Engine with provisioned PostgreSQL instance as backend) with FastAPI endpoints deployed in GKE. API employed across various DS Services for efficient caching and retrieval of inferences in real time (latency around 5ms), as well as in batch processing jobs with traffic coming from Queue (PubSub) or an Orchestrator (Airflow). Integrated CI/CD pipelines for automated deployments in GCP (Upgrade Flow).
  • Feast Feature Store: Efficiently deployed Feast feature-server on Google Kubernetes Engine, leveraging an automated materialization pipeline for real-time data updates in Redis (Online Store) and BigQuery (Offline Store) via Cloud Functions triggered by a diverse range of team data uploads in a GCS bucket. API effectively addresses the challenge of accessing up-to-date features from CSV files and currently employed in 10+ DS services, delivering features with minimal latency (<10ms). CI/CD enabled for robust rollouts.
  • Data Flow Pipeline: Built data flow pipeline to optimize the timely data sync between Snowflake and Google Sheets using Airflow, Pub-Sub, EdgeDB and GKE. Syncing data for approx 2500 Sheets with roughly 15000 Jobs/day. (blog) (citation by Modern Data Stack, citation by Data Engineering Weekly).
  • Airflow Orchestration Platform: Deployed, administered, upgraded a robust Airflow infrastructure on Google Kubernetes Engine, overseeing 70+ diverse git synchronised DAGs running approx 1000+ times daily, covering a total of 3800+ tasks executions to support various DS-BI teams. Efficiently managed the execution of tasks, encompassing Cloud Function triggering, job scheduling, PubSub messaging, EdgeDB integration for jobs status update, and Snowflake-to-Google Sheets data synchronization and seamless integration with diverse GCP services. Implemented RBAC to enhance team visibility, established automated log cleanup with ensuring a streamlined, well-maintained system.
  • NBFC Infrastructure Migration: Spearheaded migration of NBFC DS infrastructure from AWS to GCP, breaking down monolith into scalable sub-monoliths and GKE-based microservices. Decoupled DB operations, slashing response times from 3 seconds to 503ms (99th percentile), eliminated overall timeouts, previously at 8-9%. Implemented streamlined data flow via Pub/Sub and Cloud Functions, enabling efficient EdgeDB updates and hourly Snowflake synchronization. CI/CD pipelines orchestrated for seamless deployment of microservices on GKE and respective cloud functions.
  • Triton Python SDK: Developed a pip-installable, reusable Python wheel package for seamless integration with Triton Inference Server, supporting high-performance model inference by handling both HTTP and GRPC requests in synchronous and asynchronous modes, with built-in logging, error handling, and configuration flexibility.
  • Request-Response Store: Designed, deployed high-performance APIs Request-Response Store using FastAPI, featuring client authentication, rate limiting and low 5ms latency. Orchestrated storage of request-response in EdgeDB, synchronized to Snowflake based on team-defined frequencies, and leveraged by 10+ DS services.
  • Customer Interaction Analytics: Designed and implemented a cloud-based workflow for end-to-end processing of customer interactions data. The workflow encompasses data ingestion from Snowflake, audio download, transcription generation, information extraction, and automated customer summary generation. This system is orchestrated through Airflow and leverages Google Cloud Functions, Cloud Storage, Kserve InferenceService, Triton, MongoDB and EdgeDB.
  • Data Platform R&D: Experimenting with deploying Doris, StarRocks, and associated open-source tools on GKE to construct a scalable and high-performance data platform, incorporating S3-compatible object storage via MINIO/GCS, implementing access control and data retention policies, and ensuring effective Kubernetes cluster monitoring and alerting through Grafana's dashboards utilizing Prometheus as a primary time-series data source.
  • Audio Embeddings Search: Experimented Minio and Milvus integration to extract audio embeddings from Minio, enabling efficient search for similar content, Redis for metadata storage.

🏆Awards & Recognition

  • Rookie Award – Appreciation Certificate
    Special RecognitionQ3 2022

    This certificate is proudly awarded to Rajesh Dhanda (ML Engg – Global) for outstanding performance in Quarter 3, 2022.

    Signatories: VP – DS and BI, COO

⚙️Technologies Used

GCPGKEKserveTriton Inference ServerPostgreSQLAirflowVertexAIEdgeDBFastAPIFeastRedisBigQueryCloud BuildPub/SubSnowflakeMongoDBDorisStarRocksMINIOMilvusGrafanaPrometheus

Computer Vision Software Engineer

Deep Sight AI LabsJune 2021 - Sept 2021Remote
Developed solutions for object detection, object tracking, and video surveillance utilizing Deep Learning techniques.

Key Achievements

  • Implemented a reference direction-based entry-exit system and Object Counting Feature using DeepSort, showcased in video.
  • Conducted TensorFlow Lite model conversion of DeepSort and executed its deployment on Raspberry Pi hardware.

⚙️Technologies Used

Computer VisionDeep LearningDeepSortTensorFlow LiteRaspberry PiObject DetectionObject TrackingVideo Surveillance

Artificial Intelligence Engineer Intern

DocBot+March 2021 - May 2021Remote
Developed intelligent healthcare solutions focusing on disease diagnosis and symptom analysis using machine learning algorithms.

Key Achievements

  • Developed algorithm for many-to-many mapping of symptoms to diseases, enabling disease diagnosis based on symptom sets.
  • Created mathematical formulation to facilitate symptom recommendation, improving user experience and healthcare decision-making.
  • Generated random datasets to rigorously test and optimize ML algorithms for recommendation systems.

⚙️Technologies Used

Machine LearningData AnalysisAlgorithm DevelopmentHealthcare AIPythonRecommendation Systems

Career Highlights

5+ years

Total Experience

4

Companies Worked

5+

Team Members Led

Skills & Technologies

Comprehensive expertise in modern cloud technologies, ML platforms, and scalable system architecture

☁️

Cloud Platforms & Services

Enterprise cloud infrastructure and AI services

AzureAzure OpenAIGCPGitHubDatabricks
🐳

Container & Orchestration

Containerization and deployment automation

DockerKubernetesGKEGitHub ActionsCloud Build
🤖

Data & ML Platforms

Machine learning and data processing platforms

DatabricksUnity CatalogSparkAirflowMLOpsFeastEdgeDBSnowflake

API & Development

Backend development and API architecture

FastAPIPythonRESTful APIsMicroservicesGitOpsSemantic VersioningKserveTriton Inference Server

Competition Achievements

Proven track record in machine learning competitions and hackathons

CompetitionApr 2021

HackerEarth ML Challenge - Marketing Optimization

Achieved Rank 1 in HackerEarth Machine Learning Challenge focused on optimizing marketing expenditure by leveraging predictive analytics.

Machine LearningPredictive AnalyticsPythonData Science
Rank 1 🏆
View Results
CompetitionMay 2021

HackerEarth ML Challenge - Wind Power Prediction

Secured Rank 5 in HackerEarth Machine Learning Challenge centered around predicting power generation on windy days using data-driven approaches.

Machine LearningTime SeriesPredictive ModelingPython
Rank 5 🏆
View Results
HackathonApr 2021

GHF Hackathon - Credit Risk Prediction

Attained the 21st position in the GHF Hackathon organized by Univ.AI, demonstrating proficiency in credit risk prediction and financial modeling.

Credit RiskFinancial ModelingMachine LearningPython
Rank 21 🏆
View Results

Excellence in competitive programming and machine learning challenges

Published Application

Production-ready native Linux application on Ubuntu Snap Store

CanvasNote Logo

CanvasNote

A comprehensive native Linux note-taking and drawing application optimized for 2-in-1/tablet devices with stylus support, featuring intelligent palm rejection, multiple input methods, and professional organization tools.

Why I Built This

I use an ASUS ROG Flow Z13 (2022) with touchscreen and stylus on Linux. Since I couldn't find a Linux note-taking app that handled stylus input and palm rejection reliably, I built one tailored to my needs and shared it on the Snap Store for others to use.

Published on

Ubuntu Snap Store

Get it from the Snap StoreView on GitHub
sudo snap install canvasnote

Key Features

Pressure-sensitive stylus with palm rejection
Text input with formatting support
Multi-page A4 notes with templates
Shape tools and highlighter
Subject-based library organization
Export to PNG/PDF formats

Technical Highlights

Architecture

Built with GTK4 and libadwaita for native Linux desktop integration with Cairo graphics rendering

Input Handling

Advanced evdev-based palm rejection with automatic stylus detection and pressure sensitivity

Deployment

Packaged with Snapcraft for secure, sandboxed distribution with automatic updates

Academic Projects

Research and development projects from IIT Kanpur

ML - IIT Kanpur

Sparse Linear Regression with Advanced Optimization

Implemented sparse linear regression utilizing Accelerated Proximal Gradient Descent and Stochastic Coordinate Gradient Descent optimization techniques.

Technologies Used
Machine LearningOptimizationPythonMathematical Modeling
Advanced ML Algorithms
ML - IIT Kanpur

Multiclass Code Repair Classification

Developed multiclass classification system for code repair using Bag of Words representations with Learning with Prototypes, One vs All, and Decision Tree methods.

Technologies Used
ClassificationNLPDecision TreesMachine Learning
Multiple ML Approaches
Programming Club - IIT Kanpur

8 Programming Languages in 8 Weeks

Comprehensive programming challenge covering 8 different programming languages in 8 weeks, exploring various paradigms and developing foundational understanding.

Technologies Used
Multi-LanguageProgramming ParadigmsSoftware DevelopmentProblem Solving
8 Languages Mastered
Academic Project - IIT Kanpur

ANITA Experiment Data Analysis

Analyzed radio pulses and surface topography in ANITA experiment, creating 3D visualizations of mathematical formulations and experimental data using Python.

Technologies Used
Data Analysis3D VisualizationPythonScientific Computing
3D Data Visualization

Academic Journey

From foundational learning to advanced research, a comprehensive educational pathway spanning prestigious institutions and competitive achievements.

Educational Background

YearDegree/CertificateInstituteDescription
2012
CBSE Class X
Jawahar Navodaya Vidyalaya, Bhiwani

Secondary education with Science and Mathematics

2014
CBSE Class XII
Jawahar Navodaya Vidyalaya, Bhiwani

Higher secondary with Science and Mathematics

2015-2018
B.Sc Physics Honours
Maharshi Dayanand University, Rohtak

Undergraduate Programme Specialised in Physics

2018-2020
Master's in Physics
Indian Institute of Technology Kanpur

Postgraduate Specialised in Computational Physics

Academic Excellence

Research Entrance Exams

JEST 2018
AIR 65
CSIR-NET 2018
AIR 70
CSIR-NET 2019
AIR 51

Competitive Exams

IIT JAM 2018
AIR 22
IIT JEE Advanced 2015
Qualified
GATE CSE 2021
Qualified

Professional Certifications

Machine Learning & AI Certifications

Industry-recognized specialized courses

Collection of ML/AI certifications covering deep learning, data science, cloud platforms from leading tech companies and educational institutions.

Leadership & Responsibilities

Orientation Team Member

2019

Assisted in coordinating with the Counseling Service and facilitating the Hall Allocation process and Orientation Program of 2019.

Hall Election Officer

2019-20

Successfully organized and executed hall elections in Hall of Residence X, resulting in the formation of HEC for academic year 2019-20.

Get In Touch

Ready to transform ideas into reality? I'm passionate about collaborating on innovative ML projects, building scalable infrastructure, and solving complex technical challenges. Let's create something extraordinary together.

Contact Information

Email

rajeshkrdhanda@gmail.com

Location

Bengaluru, India

Start a Conversation

Choose your preferred discussion topic and I'll help you craft the perfect message. Each option opens Gmail with a professionally structured email template.