sreejabethu/README.md

👋 Hi there! I'm Sreeja Bethu

🎯 Data Analyst | AI-Powered Automation Builder | LLMs + BI + Strategy

I turn data into actionable insights that drive strategic decisions and measurable outcomes. I'm passionate about data storytelling, AI automation, and solving business problems with large language models, dashboards, and ETL workflows. With a strong foundation in both data analytics and business understanding, I specialize in uncovering trends, solving real-world problems, and helping organizations make data-backed decisions.

I enjoy working with large datasets, building dashboards, and generating reports that speak the language of business. From cleaning raw data to visualizing key metrics, I make sure every data point counts.

🧰 Skills & Tools

👩‍💻 Programming & Query Languages

Python – Data manipulation (Pandas, NumPy), analysis, automation, LLM integration (Gemini Pro, OpenAI), scripting

SQL – Advanced querying, joins, CTEs, window functions, optimization, data validation

Excel – Functions, pivot tables, dashboards, and VBA basics

📊 Data Visualization

Power BI – Interactive dashboards, KPIs, and data storytelling

Tableau – Dynamic reports and visual analytics

Matplotlib / Seaborn / Plotly / Streamlit – Custom visuals and interactive web dashboards

🤖 AI & Automation

LLMs & GenAI – Google Gemini, OpenAI GPT, RAG-style prompting

Prompt Engineering – Structured JSON output, task chaining

NLP & NLG – Resume matching, cover letter generation, summarization

🛠️ Data Engineering & Workflow

ETL Pipelines – Extracting, cleaning, transforming, and loading structured data via Python/SQL

APIs & Web Scraping – Requests, BeautifulSoup, Gemini API, OpenAI API

Version Control – Git and GitHub for version control and collaborative development

Notebooks – Jupyter Notebook and Google Colab for exploratory data analysis

💼 Featured Projects

Here are a few projects that reflect my skills and problem-solving capabilities:

🤖 AI Job Application Assistant – Google GenAI Capstone (Finalist)

AI agent that tailors resumes, matches job descriptions, and writes personalized cover letters.

  • Tools: Python, Google Gemini Pro, Prompt Engineering
  • Outputs: Match scoring, bullet suggestions, JSON-structured output
  • Featured on Kaggle, GitHub, and YouTube

🔗 GitHub Repo | Kaggle Notebook | YouTube Demo
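
To give a rough idea of what "match scoring" with "JSON-structured output" can look like, here is a minimal sketch. It is not the project's actual method (the project uses Google Gemini Pro); it swaps in a simple keyword-overlap score so it runs without any API key. The function names `tokenize` and `match_report` and the JSON field names are assumptions made for illustration.

```python
import json
import re


def tokenize(text: str) -> set:
    """Lowercase the text and return its distinct tokens (letters, digits, + and #)."""
    return set(re.findall(r"[a-z0-9+#]+", text.lower()))


def match_report(resume: str, job_description: str) -> str:
    """Return a JSON string scoring how many job-description terms appear in the resume.

    Hypothetical stand-in for an LLM-based matcher: plain keyword overlap only.
    """
    resume_terms = tokenize(resume)
    job_terms = tokenize(job_description)
    matched = sorted(job_terms & resume_terms)
    missing = sorted(job_terms - resume_terms)
    # Score is the percentage of job-description terms found in the resume.
    score = round(100 * len(matched) / len(job_terms)) if job_terms else 0
    return json.dumps(
        {"match_score": score, "matched_terms": matched, "missing_terms": missing},
        indent=2,
    )


if __name__ == "__main__":
    # Made-up inputs, for illustration only.
    print(match_report("Python SQL Tableau", "Python SQL Power BI"))
```

Returning a fixed JSON shape keeps the result easy to feed into later steps such as bullet suggestions or cover letter generation.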


Smart Report Analyzer

Uses LLMs and AI agents to automatically analyze reports (PDF/Excel/CSV) and generate actionable summaries, charts, and insights.

🔍 Automated insight extraction using Python & OpenAI APIs

📊 Visualizations using Plotly and Matplotlib

🤖 Intelligent summarization & natural language generation


Cost of Living Index Globally

An interactive Streamlit application that visualizes and compares cost of living indices across countries.

Technologies Used: Python, Streamlit, Pandas, Plotly, Seaborn

Features:

  • 🗺️ Compare indices by country using visual charts

  • 📊 Built with Plotly, Seaborn, Streamlit

  • 🧮 Focus on rent, groceries, utilities, etc.

    Outcome: Helps users make informed decisions when comparing costs across countries.


Analyzes sales data and builds time-series models to forecast future trends.

  • 🧼 Data wrangling and preprocessing with Pandas
  • 📈 Time-series forecasting with ARIMA & statsmodels
  • 📉 Actionable sales insights for business planning
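
As a rough illustration of the forecasting step, the sketch below hand-rolls a simplified ARIMA(1,1,0)-style model in plain Python: it differences the series once, estimates a single autoregressive coefficient by ordinary least squares, and rolls the forecast forward. This is a stand-in, not the project's code; statsmodels fits ARIMA by maximum likelihood and supports far more than this. The function names and the sample figures are assumptions for illustration.

```python
def fit_ar1_on_differences(series):
    """Difference the series once and fit d[t] = phi * d[t-1] by least squares.

    Returns (phi, last_difference). No constant term is estimated.
    """
    if len(series) < 3:
        raise ValueError("need at least 3 observations")
    diffs = [b - a for a, b in zip(series, series[1:])]
    numerator = sum(prev * cur for prev, cur in zip(diffs, diffs[1:]))
    denominator = sum(prev * prev for prev in diffs[:-1])
    phi = numerator / denominator if denominator else 0.0
    return phi, diffs[-1]


def forecast(series, steps):
    """Forecast `steps` future values by projecting differences and summing them back."""
    phi, last_diff = fit_ar1_on_differences(series)
    level = series[-1]
    predictions = []
    for _ in range(steps):
        last_diff = phi * last_diff  # next expected change
        level += last_diff           # undo the differencing
        predictions.append(level)
    return predictions


if __name__ == "__main__":
    # Made-up monthly sales figures, for illustration only.
    monthly_sales = [100, 110, 120, 130]
    print(forecast(monthly_sales, steps=2))
```

For real sales data, seasonality and model order selection matter, which is where a library such as statsmodels becomes the better choice.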

🏅 Full Set of Kaggle Badges

🏷️ Kaggle Badges

View on Kaggle: Profile | Competitions | Datasets | Notebooks | Discussions

📬 Let's Connect

I'm always excited to collaborate, learn, or just chat about data!

🔗 LinkedIn

📧 Email: [email protected]

🧠 Portfolio Website: https://sreejabethu.github.io/

📍 Location: United States (Open to Remote & Hybrid Roles)

Let's make data work smarter with AI 🚀

Pinned

  1. GEN-AI-CAPSTONE-PROJECT (Public)

    This project demonstrates a Generative AI-powered assistant that streamlines the job application process using Google Gemini Pro. It analyzes a user's resume against a job description, calculates a…

    Jupyter Notebook 1

  2. Smart-Report-Analyzer (Public)

    An AI-powered LLM app to analyze and summarize Excel, CSV, and PDF reports using Hugging Face language models. Built with Streamlit.

    Python 1

  3. Cost-Of-Living-Index-Globally (Public)

    This Streamlit app is a data visualization tool that allows users to explore and compare the cost of living indices across different countries. The app takes in a dataset of cost of living indices …

    Python 1

  4. Forecasting-Weather (Public)

    Weather forecasting using the OpenWeatherMap API and a Random Forest Regressor in Python. Converts temperature data to Fahrenheit and provides visualizations of actual vs. predicted temperatures.

    Python 1

  5. Paris-Olympics-2024-Medals-List (Public)

    Python 1

  6. Realtime-Stock-Market-Analysis-Visualization (Public)

    Python 1