Skip to content
View BDepanfilis's full-sized avatar

Block or report BDepanfilis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
BDepanfilis/README.md

Bradley DePanfilis — Data Analytics & Data Science

Gen-Z CPI (2020–2025): Fixed vs. Chained Index vs BLS CPI-U

  • Developed a reproducible ETL pipeline using BigQuery SQL to compute two consumer price indices (fixed-weight Laspeyres and chained Laspeyres) tailored to Gen-Z spending, validated against BLS CPI-U.
  • Deliverables: versioned CSVs, GitHub Pages site with Plotly visualizations, and a Power BI dashboard.
  • Highlights: Transparent methodology, normalized base (2020=1.00), cost-aware query design, and automated data processing for easy re-runs.
  • Skills: SQL (BigQuery), data pipeline development, data visualization, time-series analysis, index construction.
  • Repo: REPO_URL • Live: LIVE_URL • Power BI: LIVE_URL

Gender Wage Gap (2019–2024): Sequential Specs & Robust Inference

  • Analyzed CPS March ASEC microdata to estimate the U.S. career gender wage gap using sequential regression models (baseline, education, personal/geo, household, singles).
  • Implemented HC1 robust standard errors, year-by-year trends, and publication-quality visualizations with R/ggplot2.
  • Deliverables: Modular R scripts (wrangle/analysis), CSV outputs, GitHub Pages site, and Power BI dashboard.
  • Highlights: Reproducible data wrangling, robust statistical modeling, and clear visualizations showing a persistent 21–24% gap (14% for singles).
  • Skills: Statistical modeling, R (tidyverse, broom, sandwich/lmtest), data visualization, data wrangling, regression analysis.
  • Repo: REPO_URL • Live: LIVE_URL • Power BI: LIVE_URL

Olist Analytics — Streamlit Dashboard

  • Built an interactive e-commerce analytics dashboard using Streamlit, DuckDB, Altair, and Pandas on the Olist dataset, delivering metrics on revenue, AOV, customer cohorts, LTV, retention heatmaps, returns, and marketing ROI (ROAS).
  • Highlights: Optimized query caching, defensive data handling, and production-ready secrets management; serves ~180 days of data with snappy performance.
  • Skills: Analytics engineering, Python (Pandas, Altair), Streamlit, DuckDB, data visualization, ETL pipelines, cohort analysis, LTV modeling.
  • Repo: REPO_URL • Live: LIVE_URL

ecom-analytics-olist (dbt Project)

  • Engineered a curated analytics warehouse using dbt Core to transform raw Olist e-commerce data into a star schema with staging, core dimensions/facts, and marts for sales, marketing ROI, cohorts, and returns quality.
  • Implemented data quality tests (dbt_utils, dbt_expectations), seeds for enrichment, and automated dbt Docs publishing to GitHub Pages for lineage and observability.
  • Highlights: Warehouse-agnostic workflow with DuckDB for local/CI and optional BigQuery execution with cost guards; demonstrates robust ELT and CI/CD integration.
  • Skills: Data engineering, dbt Core, SQL, DuckDB, BigQuery, data modeling, ELT pipelines, data quality testing, CI/CD.
  • Repo: REPO_URL • Live Docs: LIVE_URL

AB Uplift Analysis (Python)

  • Created an end-to-end A/B testing pipeline for uplift modeling using the Criteo Uplift dataset, featuring power checks, causal effect estimates (diff-in-proportions, CUPAC, IPW/AIPW), and targeting via Qini/AUUC with a Top-K policy.
  • Engineered as a reproducible Python package with CLI entry point, deterministic seeds, and auto-generated reports/figures; includes covariate balance (SMDs) and propensity diagnostics.
  • Highlights: Narrative Jupyter Notebook with ROI appendix; supports data-driven marketing and product decisions.
  • Skills: Data analytics, Python (pandas, NumPy, statsmodels, scikit-learn, matplotlib), causal inference, A/B testing, statistical modeling, data visualization.
  • Repo: REPO_URL

Bayesian Elo (Valorant)

  • Developed a production-ready Bayesian team rating system in for Valorant teams displayed on Vlr.gg, aggregating player stats into team skills with hierarchical priors for uncertainty, incorporating margin-of-win scaling, map adjustments, meta updates, roster resets, time decay, and regional biases to enable accurate 1:1 cross-regional predictions for Tier 1 & 2 teams across global regions. .
  • Deployed for 2024–2025 VCT/VCL matches with >28M total visitors; supports dynamic leaderboards and real-time updates.
  • Highlights: Modular design with FastAPI, SQLAlchemy, and PostgreSQL; deployed via Docker on AWS; integrates Chart.js for visualizations.
  • Skills: Data engineering, Docker, AWS, Python (pandas, NumPy, FastAPI, SQLAlchemy, Pydantic), PostgreSQL, API development, Bayesian modeling, data visualization.
  • Repo: REPO_URL

Cake Chooser (Elo micro-app)

  • Built a lightweight GitHub Pages app using JavaScript and a simple Elo system to rank cake preferences, demonstrating decision-making tools with a playful dataset.
  • Highlights: Clean UI, reproducible logic, and rapid deployment for stakeholder interaction.
  • Skills: Data analytics, JavaScript, data visualization, decision support systems.
  • Repo: REPO_URL • Live: LIVE_URL

Contact


Notes for Reviewers

Repos are structured for quick digestion. If you’d like a private walkthrough of any project or a short Loom overview, I’m happy to share.

Pinned Loading

  1. Gender-Wage-Gap-2019-2024 Gender-Wage-Gap-2019-2024 Public

    R and Power BI analysis of CPS ASEC data (2019–2024) to estimate the gender wage gap over careers with demographic and household controls

    R

  2. BUSN-5000-Gender-Wage-Gap-March-2022-CPS-Project-Public BUSN-5000-Gender-Wage-Gap-March-2022-CPS-Project-Public Public

    Data-driven investigation of gender-based earnings inequality using CPS ASEC 2022

  3. Elo-Cake-Birthday Elo-Cake-Birthday Public

    Fun side project in which I built an elo system around different cakes and had my girlfriend choose which one she wanted for her birthday!

    JavaScript

  4. Valorant-Bayesian-Elo-System Valorant-Bayesian-Elo-System Public

    Bayesian Elo system for Valorant with player stat weighting, regional bias, time decay, and API design

  5. BDepanfilis BDepanfilis Public

    About me, my projects, and my journey

  6. ab-uplift-analysis ab-uplift-analysis Public

    End-to-end A/B uplift analysis in Python. Reproducible CLI runs power & balance checks, estimates effects with CUPAC and IPW/AIPW, and produces Qini/AUUC plus a Top-K targeting policy. Includes an …

    HTML