I'm Jay. I'm a data scientist, and I currently live in Washington, DC. I primarily do machine learning and data analysis, with some web development and devops on the side. Nowadays, I mostly program in Python, though I also sometimes use R. In a former life, I was an aerospace engineer and wrote a lot of MATLAB.
✨ Open Source Projects
Some of the open source projects I maintain include:
- cloudpathlib: pathlib-style classes for cloud storage services
- deon: a data science ethics checklist framework and tool
- erdantic: entity-relationship diagrams for Python data classes like Pydantic
- pkgnet: an R package for network analysis of R package dependencies and structure
- quickhttp: lightweight http server with automatic port-finding and shutdown
- reprexlite: render reproducible examples of Python code for sharing
- spongebob: a collection of tools for creating text in moCkInG SPoNGeboB cAsE
☀️ My Day Job
I'm a Lead Data Scientist at DrivenData, where I do full-stack data science and data engineering consulting for social impact organizations.
📭 Talk to Me!
Feel free to direct-message me on Bluesky if you'd like to chat about open source, data for good, civic tech, data ethics, puzzle hunts, or anything else.