Data and Infrastructure Engineer with 3+ years of experience bridging traditional data engineering with modern AI systems. I specialize in building scalable architectures through event-driven design, containerization, and agentic LLM systems.
- π Performance Champion: Reduced a 27-hour SQL procedure to 5 seconds using Python-based solutions for 80M+ rows
- π€ AI/LLM Expert: Building agentic architectures with fine-tuned models achieving 95%+ accuracy
- βοΈ Multi-Cloud Specialist: Experienced across AWS, Azure, and GCP ecosystems
- π Data Pipeline Architect: Designed Bronze-Silver-Gold data layers processing millions of records
- π¬ Published Researcher: Co-authored paper on quantum computing simulation systems
- π Currently based in Gurugram, India
- Data Engineering: Designed optimized pipelines using Python, PySpark, SQL, and Airflow
- AI Integration: Built AI-powered applications with FastAPI, OpenAI, and LangChain
- Performance: Achieved 99.9% runtime reduction (27.5 hours β 5 seconds) on 80M+ row processing
- Impact: Improved campaign outcomes by 16% through real-time dashboards and APIs
- Platform Development: Built cloud-native tools for 50k+ daily users
- Migration: Successfully migrated PostgreSQL to MongoDB systems
- DevOps: Implemented CI/CD pipelines with Azure DevOps
- Built scalable Bronze-Silver-Gold data layers in Treasure Data
- Engineered 12-scenario truth table for multichannel consent processing
- Integrated data across CDP, S3, Treasure Data, and SFMC platforms
- Built agent-based LLM system with 95%+ accuracy using fine-tuned Mistral7B and Llama 13B
- Developed advanced CLI tool processing 10B+ tokens in minutes
- Deployed FastAPI on Docker with GPU-accelerated inference
- Reduced 27.5-hour SQL procedure to 5 seconds using Python/Pandas/DuckDB
- Processed 80M+ rows for 150 clients on a single machine
- Showcased Python's capability for high-performance data processing
- Reduced manual prospecting effort by 60%
- Improved client response rates by 45% with AI-generated communications
- Integrated multiple APIs (OpenAI, Perplexity, LinkedIn) for profile validation
π¬ Published Research: "QuDiet: A Classical Simulation Platform for Qubit-Qudit Hybrid Quantum Systems" - IET Quantum Communication (2023)
- π§ Agentic AI Systems: Building sophisticated multi-agent LLM architectures
- β‘ Performance Engineering: Optimizing data processing at massive scale
- βοΈ Multi-Cloud Architecture: Designing platform-agnostic solutions
- π Real-time Data Streaming: Event-driven architectures with Kafka
- π€ LLM Fine-tuning: Custom model optimization for domain-specific tasks
"Transforming complex business requirements into scalable technical reality, one optimized pipeline at a time." πβ¨