This project was completed as a part of assessment for Data Science with Python module. We used different Python libraries such as NumPy, SciPy, Pandas, scikit-learn and matplotlib to complete the given analysis and visualization tasks
To perform these tasks, you can use any of the different Python libraries such as NumPy, SciPy, Pandas, scikit-learn and matplotlib.
- Import data into Python environment.
- Provide the trend chart for the number of complaints at monthly and daily granularity levels.
- Provide a table with the frequency of complaint types.
- Which complaint types are maximum i.e., around internet, network issues, or across any other domains.
- Create a new categorical variable with value as Open and Closed. Open & Pending is to be categorized as Open and Closed & Solved is to be categorized as Closed.
- Provide state wise status of complaints in a stacked bar chart. Use the categorized variable from Q5. Provide insights on:
- Which state has the maximum complaints
- Which state has the highest percentage of unresolved complaints
- Provide the percentage of complaints resolved till date, which were received through the Internet and customer care calls.