Skip to content
View Amorfati123's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Amorfati123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Amorfati123/README.md

πŸ’« About Me:

I’m a data scientist who builds end-to-end ML and data engineering systems that ship in real healthcare and public health environments. I focus on making models usable and trustworthy by pairing strong modeling with reproducible pipelines, validation, and workflow integration.

At the CDC, I worked on surveillance informatics in Palantir Foundry (1CDP). I designed modular ingestion and transformation pipelines, implemented schema validation and data quality checks, used lineage to debug upstream issues, and built tools that reduced manual burden for epidemiologists and state partners. I also developed structured, auditable workflows for semi-automated tasks like schema mapping, with human review, versioned configurations, and clear traceability.

What I work on most:

ML engineering: training and inference pipelines, distributed processing, monitoring, reproducibility

Data engineering: schema management, validation gates, lineage-driven debugging, scalable transforms

Applied healthcare AI: interpretable models, uncertainty-aware decisions, clinical workflow fit

Tools: Python, SQL, PyTorch, Spark/PySpark, Git, containers, Palantir Foundry

I like practical problems where correctness, traceability, and maintainability matter as much as model performance.

πŸ’» Tech Stack:

Python R PowerShell Bash Script TypeScript Windows Terminal Markdown Azure AWS Apache Spark Apache Kafka Chart.js Apache Apache Tomcat MicrosoftSQLServer MongoDB MySQL Postgres Adobe Adobe Acrobat Reader Adobe Lightroom Adobe Photoshop Keras Matplotlib mlflow NumPy Pandas Plotly PyTorch scikit-learn Scipy TensorFlow GitLab CI GitHub Actions Git GitHub GitLab

πŸ“Š GitHub Stats:



πŸ† GitHub Trophies

✍️ Random Dev Quote


Pinned Loading

  1. Length-of-Stay-Prediction-in-Hospitals-Using-MIMIC-IV-Dataset Length-of-Stay-Prediction-in-Hospitals-Using-MIMIC-IV-Dataset Public

    This study investigates the relationship between patient demographics, hospitalization factors, and length of stay (LOS) in hospitals. Statistical methods, including Kruskal Wallis tests and post-h…

  2. wound-forecast wound-forecast Public

    Forecasting wound trajectory using ML

    Jupyter Notebook 1

  3. iupui-soic/dhis2-opencpu iupui-soic/dhis2-opencpu Public

    Creating an Open Web App for DHIS2 that can run any R code for statistical analysis

    JavaScript 1 2

  4. CheXNet CheXNet Public

    Forked from arnoweng/CheXNet

    A pytorch reimplementation of CheXNet

    Python 1

  5. NEDSS-DataReporting NEDSS-DataReporting Public

    Forked from CDCgov/NEDSS-DataReporting

    Data Near Real Time Reporting micro services for Modernized NBS System

    TSQL

  6. litellm litellm Public

    Forked from BerriAI/litellm

    Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

    Python