This project analyzes retail sales data from a Superstore to uncover performance trends, high-profit categories, and growth opportunities. Using SQL for business-driven queries and Python for visual analytics, we extract actionable insights across product lines, customer segments, and geographic regions.
- Analyze sales and profit trends across segments, categories, and regions
- Perform SQL-based subgroup analysis to classify products and customers
- Visualize key metrics to support strategic retail decisions
- SQL β Business queries, segment analysis, profitability logic
- Python β Data cleaning & visualization
- Libraries:
pandas
,matplotlib
,seaborn
- PowerPoint β Final presentation and summary
- Canon imageCLASS 2200 is the top-selling product (βΉ61,000+)
- West region leads in total sales; South lags behind
- Copiers and Phones are highly profitable; Tables incur losses
- Consumers drive the most revenue among segments
- High discounts don't always lead to high sales or profit
Cleaned_Superstore.csv
β Cleaned and preprocessed datasetSuperstore_sales_data_analysis.ipynb
β Python notebook with insights and graphsCustomer_And_Products.sql
β SQL queries for business analysisSuperstore_Sales_Analysis_ppt.pdf
β Presentation slide deck
This project provides a clear breakdown of what drives Superstore sales and profitability. By combining SQL logic with data visualization, we identified top-performing products, regions needing improvement, and the impact of discounting strategies. These findings can support smarter inventory planning, customer targeting, and promotional strategies.