Welcome to my submission for Week 1 of Summer Analytics 2024 by C&A. This assignment focused on applying basic data analysis skills using Python libraries such as Pandas, NumPy, Seaborn, and Matplotlib.
The dataset used is Cars.csv, which contains specifications of cars from the 1970s and 1980s, including:
mpg(Miles per gallon)cylinders,displacement,horsepowerweight,acceleration,model_year,origin,name
- Imported the dataset and handled missing values (e.g., filled missing
horsepowervalues with the median) - Generated descriptive statistics
- Visualized relationships using:
- Pairplot
- Correlation heatmap
- Line plot of MPG trend by model year
- Bar plot of average MPG by origin
- Count plot for car distribution by origin
- Japanese cars have the highest average fuel efficiency.
- MPG increased steadily from 1970 to 1982.
- Negative correlation found between MPG and horsepower, weight, and displacement.
- Python 3
- pandas
- numpy
- seaborn
- matplotlib
- Clone this repo or download the notebook and dataset.
- Open
Summer_Analytics_Week1.ipynbin Jupyter Notebook. - Make sure
Cars.csvis in the same directory. - Run all cells.
- Name: Shivangi Sharma
- Institute: JECRC University
- Program: Summer Analytics 2024
- Week: 1 – Data Analysis
- Assignment Title: Data Grand Prix!
Thank you C&A for this opportunity!