Skip to content
View shreyamalogi's full-sized avatar
:octocat:
Keep Hustling, Keep Shining!!
:octocat:
Keep Hustling, Keep Shining!!
  • CodeMacrocosm
  • Berlin, Germany
  • 06:22 (UTC +01:00)

Block or report shreyamalogi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shreyamalogi/README.md

👋 Hallo! Ich bin SHREYA MALOGI

Data Scientist | ML Engineer | MSc Data Analytics @ BSBI Berlin

Former Founder & Tech Director at CodeMacrocosm

total stars followers

Profile views

GIF


🚀 Career Objective & Availability

🎯 Actively Seeking: Werkstudent (Working Student) Roles (Immediate Start)

🎓 Full-Time Data Science/ML Roles (From April 2026)

📍 Location: Berlin, Germany (Open to Hybrid/Remote)

🇩🇪 Language: German (A2 Elementary – Currently advancing to B1)


📊 Professional Highlights

  • 🔭 Current Project: Architecting an Industrial Demand Forecasting Pipeline for 15.2M records.
  • ⚙️ Resource Optimization: Expert in memory-safe data processing (achieved 70% RAM reduction via downcasting).
  • 🎓 Academics: Advancing research in Sales Analytics and Predictive Modeling at BSBI.
  • 👥 Leadership: Former Technical Director of a global open-source community (scaled to 500+ devs).

🛠️ Technical Toolbox

Category Tools & Technologies
Data Science / ML Python LightGBM XGBoost Scikit-learn
Big Data / Cloud PySpark GCP PostgreSQL
Vision / Real-Time OpenCV Flask HOG/Dlib

🏆 Featured Project: Industrial-Scale Forecasting

Scale: 15.2 Million Transactions | Optimization: 70% RAM Reduction A production-grade pipeline solving zero-inflation in retail demand using Tweedie-LightGBM. Explore Repository →


📫 Connect with Me

LinkedIn

Email

Other Tools:

aws bootstrap c cplusplus css3 dart django docker express figma firebase flask flutter git graphql heroku html5 java javascript kotlin mongodb mysql nextjs nodejs opencv php postman python react redux spring sqlite tailwind typescript

An image of @5hre9a's Holopin badges, which is a link to view their full Holopin profile

+@ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @+
@@       o o                                           @@
@@       | |                                           @@
@@      _L_L_                                          @@
@@   ❮\/__-__\/❯ Programming isn't about what you know @@
@@   ❮(|~o.o~|)❯  It's about what you can figure out   @@
@@   ❮/ \`-'/ \❯                                       @@
@@     _/`U'\_                                         @@
@@    ( .   . )     .----------------------------.     @@
@@   / /     \ \    | while( ! (succed=try() ) ) |     @@
@@   \ |  ,  | /    '----------------------------'     @@
@@    \|=====|/                                        @@
@@     |_.^._|                                         @@
@@     | |"| |                                         @@
@@     ( ) ( )   Testing leads to failure              @@
@@     |_| |_|   and failure leads to understanding    @@
@@ _.-' _j L_ '-._                                     @@
@@(___.'     '.___)                                    @@
+@ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @+

Pinned Loading

  1. Industrial-Demand-Forecasting-Pipeline Industrial-Demand-Forecasting-Pipeline Public

    Architected a high-performance predictive pipeline processing 15 Million transactions. Optimized memory by 70% via custom downcasting and implemented Tweedie-LightGBM to solve zero-inflation in ret…

    Jupyter Notebook 14

  2. Amazon-BigData-Verified-Review-Classifier Amazon-BigData-Verified-Review-Classifier Public

    Scalable Trust-Signal Detection: A Big Data pipeline using PySpark and GCP Dataproc to classify 8GB+ of Amazon reviews with high-precision Random Forest modeling. Engineered for horizontal scalabil…

    Python 3

  3. Multi-Domain-CV-Intelligence-Workspace Multi-Domain-CV-Intelligence-Workspace Public

    A high-precision Computer Vision workspace featuring ReUNet for medical segmentation and MobileNetV2 for AgTech classification. Demonstrating cross-domain AI expertise in Healthcare Diagnostics and…

    Jupyter Notebook 1

  4. Retail-Data-Engineering-Pipeline Retail-Data-Engineering-Pipeline Public

    Scalable ETL Pipeline: Processing 5M+ retail records with PySpark on GCP Dataproc. Automated the extraction of global business KPIs and consumer trends. Includes an Ethical Data Framework to ensure…

    Python 14

  5. Biometric-Attendance-Engine Biometric-Attendance-Engine Public

    Real-time face recognition system using HOG encodings and Dlib landmarks. Features a high-speed Flask/OpenCV pipeline for live video processing and automated SQL database logging

    HTML 16 1

  6. Intelligent-Travel-Recommendation-Engine Intelligent-Travel-Recommendation-Engine Public

    An Intelligent Travel Recommendation Engine using TF-IDF Vectorization and KNN to predict optimal tourist destinations. Features a modular Python/Tkinter architecture and mathematical similarity sc…

    Python 8