Designing resilient data architectures and agentic AI systems that power the modern enterprise.
For over 15 years, I've engineered data platforms that don't just scale; they think. These platforms process massive streams with sub-second latency and bridge extreme-scale data architecture with agentic AI.
Building hyperscale infrastructure across industry leaders.
Engineered a petabyte-scale real-time analytics platform on Apache Flink, Kafka, and Iceberg to process enterprise streaming workloads.
Rewrote Iceberg table procedures, achieving an 80% reduction in cloud compute costs and a 4x query performance boost.
Architected an LLM- and MCP-powered operational interface for Apache Pinot, enabling natural language querying over complex streaming data.
Engineered and automated a petabyte-scale cloud data lake platform that reliably supports 10,000+ analytical workflows.
Spearheaded the migration of monolithic backends to an auto-scaling AWS EKS environment running Spark and Iceberg on S3.
Designed a custom, zero-downtime S3 active-active replication engine, saving millions annually on native replication fees.
Led the real-time integration of India's Aadhaar National Biometric Identity system with legacy core banking infrastructure.
Architected secure, high-throughput microservices using Java and Spring Boot, exposing robust REST APIs to process millions of daily eKYC authentication requests with strict high-availability SLAs.
Built a low-latency trading pipeline using Apache Flink and Kafka. Engineered an ML inference engine using five regime learners.
Foundational tools for modern data platforms.
Authored an open-source agent workflow for an iterative worker-reviewer cycle with subagent critiques.
Fixed non-daemon threads in RenewableTlsUtils that blocked JVM shutdown.
Merged configured and discovered provider models, unifying the API compatibility layer.