SnehaDharne/README.md

👋 Hi there! I'm Sneha Dharne

🚀 Data Engineer | 🧠 ML Engineer | 🏥 Health Tech Innovator

I'm passionate about leveraging data to drive insights and innovation, especially in the healthcare sector. Currently pursuing my MS in Computer Science at Stevens Institute of Technology, I'm on a mission to transform complex data into actionable intelligence.

🔬 What I'm Up To

  • 🎗️ Interning at Oncology Reference Inc., where I'm building ETL pipelines and automating data processes
  • 📊 Developing AI-powered tools for financial report generation and medical data analysis
  • 📚 Studying for GCP and AWS Solution Architect certifications
  • 🤝 Seeking connections with fellow data engineers and health tech enthusiasts

🛠️ Tech Stack

  • Languages: Python, SQL, Java, JavaScript
  • Data Science: Pandas, NumPy, Scikit-learn, TensorFlow, PyTorch
  • Big Data: Spark, Kafka, PySpark
  • Cloud: AWS (S3, EC2, Lambda), Google Cloud Platform
  • Visualization: Power BI, Matplotlib, Seaborn

πŸ† Featured Projects

📈 Data Engineering Capstone (https://github.com/SnehaDharne/StockAnalyticswithAWS)

Developed a big data pipeline using AWS, Spark, and Kafka for a capstone project at Chubb.

Built an AI tool using LangChain for automated financial report generation, cutting report creation time from 8 hours to 10 minutes.

🚗 NYC Collision Risk Prediction (https://github.com/SnehaDharne/BigDataAnalytics-MVCollisions)

Led ETL on 5M records using PySpark, built a prediction model with 84% accuracy, and scaled the solution on GCP.

🤖 Guided ML - An Assistant for Sr. Data Scientists

Automates manual fine-tuning, hyperparameter testing, and feature engineering using LLMs and agentic RAG.

🩺 Early Ocular Disease Diagnosis (https://github.com/SnehaDharne/OcularDiseaseRecognition)

Used CNN models (ResNet50, VGG16) to classify fundus images, achieving an F1 score of 0.8 and AUC of 0.9.

🩺 Early GDM Diagnosis in Women (https://github.com/SnehaDharne/GDM-diagnosis)

Used ML models to diagnose early-stage gestational diabetes mellitus (GDM) in pregnant women. The research was featured in the Journal of Ayurveda and Integrative Medicine.

📫 Let's Connect!

I'm always excited to collaborate on innovative projects or discuss the latest in data engineering and AI. Feel free to reach out!

Looking forward to connecting with fellow data enthusiasts and health tech innovators! Let's build something amazing together! 🚀

Pinned

  1. Azure/PyRIT Public

    The Python Risk Identification Tool for generative AI (PyRIT) is an open source framework built to empower security professionals and engineers to proactively identify risks in generative AI systems.

    Python 2.3k 450

  2. StockAnalyticswithAWS Public

    Capstone Project with Chubb.

    Python 1

  3. FinancialReportGeneratorGenAI Public

    Report Generator ChatBot designed to analyze financial metrics, generate reports, and create graphs based on user preferences. It leverages LangChain for AI agent workflows and provides a suite of …

    Jupyter Notebook 2

  4. BigDataAnalytics-MVCollisions Public

    Leveraging NYC Open Data, this repository contains Databricks notebooks for analyzing motor vehicle collisions. We perform EDA, spatial clustering, and predictive modeling on collision, vehicle, an…

    Jupyter Notebook 1

  5. OcularDiseaseRecognition Public

    Fundus image analysis for ocular disease recognition using deep learning. This repository implements image preprocessing, MIRNet enhancement, and transfer learning with ResNet50 and VGG16 for class…

    Jupyter Notebook 1

  6. VAERS-SymptomExtractionwithAI Public

    VAERS Adverse Event Analysis for COVID-19 Vaccine: A hybrid approach combining LLMs (Gemini 1.5 Flash) and statistical methods for enhanced vaccine safety signal detection. Analyzes temporal and a…

    Jupyter Notebook 1 1