I'm a data scientist who builds end-to-end ML and data engineering systems that ship in real healthcare and public health environments. I focus on making models usable and trustworthy by pairing strong modeling with reproducible pipelines, validation, and workflow integration.
At the CDC, I worked on surveillance informatics in Palantir Foundry (1CDP). I designed modular ingestion and transformation pipelines, implemented schema validation and data quality checks, used lineage to debug upstream issues, and built tools that reduced manual burden for epidemiologists and state partners. I also developed structured, auditable workflows for semi-automated tasks like schema mapping, with human review, versioned configurations, and clear traceability.
What I work on most:
ML engineering: training and inference pipelines, distributed processing, monitoring, reproducibility
Data engineering: schema management, validation gates, lineage-driven debugging, scalable transforms
Applied healthcare AI: interpretable models, uncertainty-aware decisions, clinical workflow fit
Tools: Python, SQL, PyTorch, Spark/PySpark, Git, containers, Palantir Foundry
I like practical problems where correctness, traceability, and maintainability matter as much as model performance.
Pinned
- Length-of-Stay-Prediction-in-Hospitals-Using-MIMIC-IV-Dataset (Public)
  This study investigates the relationship between patient demographics, hospitalization factors, and length of stay (LOS) in hospitals. Statistical methods, including Kruskal Wallis tests and post-h…
- iupui-soic/dhis2-opencpu (Public)
  Creating an Open Web App for DHIS2 that can run any R code for statistical analysis
- NEDSS-DataReporting (Public, forked from CDCgov/NEDSS-DataReporting)
  Data Near Real Time Reporting micro services for Modernized NBS System
  TSQL
- litellm (Public, forked from BerriAI/litellm)
  Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
  Python

