Skip to content

cydneyghamel/excel-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Background

Crowdfunding platforms like Kickstarter and Indiegogo have been growing in success and popularity since the late 2000s. From independent content creators to famous celebrities, more and more people are using crowdfunding to launch new products and generate buzz, but not every project has found success.

To receive funding, the project must meet or exceed an initial goal, so many organizations dedicate considerable resources looking through old projects in an attempt to discover “the trick” to finding success. For this challenge, I organized and analyzed a database of 1,000 sample projects to uncover any hidden trends.

Goal of exploration

Using the Excel workbook in the .zip file, I modified and analyzed the sample-project data and attempted to assess market trends.

Data for this dataset was generated by edX Boot Camps LLC, and is intended for educational purposes only.

Approach

I used conditional formatting to fill each cell in the outcome column with a different color, depending on whether the associated campaign was successful, failed, canceled, or is currently live.

I created a new column called Percent Funded that uses a formula to find how much money a campaign made relative to its initial funding goal.

I used conditional formatting to fill each cell in the Percent Funded column according to a three-color scale. The scale started at 0 with a dark shade of red, and transitioned to green at 100 and blue at 200.

I created a new column called Average Donation that uses a formula to find how much each project backer paid on average.

I created two new columns, one called Parent Category and another called Sub-Category, that use formulas to split the Category and Sub-Category column into the two new, separate columns.

Category Stats

I created:

  1. a new sheet with a pivot table that analyzes your initial worksheet to count how many campaigns were successful, failed, canceled, or are currently live per category.

  2. a stacked-column pivot chart that can be filtered by country based on the table that you created.

  3. a new sheet with a pivot table that analyzes your initial sheet to count how many campaigns were successful, failed, or canceled, or are currently live per sub-category.

  4. a stacked-column pivot chart that can be filtered by country and parent category based on the table that you created.

The dates in the deadline and launched_at columns use Unix timestamps. Fortunately for us, this formulaLinks to an external site that can be used to convert these timestamps to a normal date.

I also:

  • Created a new column named Date Created Conversion that will use this formulaLinks to an external site. to convert the data contained in launched_at into Excel's date format.

  • Created a new column named Date Ended Conversion that will use this formulaLinks to an external site. to convert the data contained in deadline into Excel's date format.

  • Created a new sheet with a pivot table that has a column of outcome, rows of Date Created Conversion, values based on the count of outcome, and filters based on parent category and Years.

  • Then, I created a pivot-chart line graph that visualizes this new table.

  • And finally, I wrote a report in Microsoft Word to answer the following questions:

    1. Given the provided data, what are three conclusions that we can draw about crowdfunding campaigns?

    2. What are some limitations of this dataset?

    3. What are some other possible tables and/or graphs that we could create, and what additional value would they provide?

Crowfunding Goal Analysis

I created a new sheet with 8 columns:

  • Goal

  • Number Successful

  • Number Failed

  • Number Canceled

  • Total Projects

  • Percentage Successful

  • Percentage Failed

  • Percentage Canceled

In the Goal column, create 12 rows with the following headers:

  • Less than 1000

  • 1000 to 4999

  • 5000 to 9999

  • 10000 to 14999

  • 15000 to 19999

  • 20000 to 24999

  • 25000 to 29999

  • 30000 to 34999

  • 35000 to 39999

  • 40000 to 44999

  • 45000 to 49999

  • Greater than or equal to 50000

A table and corresponding graph showing the percentage of projects that are successful, failed, and canceled based on their crowdfunding goal.

Using the COUNTIFS() formula, I counted how many successful, failed, and canceled projects were created with goals within the ranges listed above. Next, I populated the Number Successful, Number Failed, and Number Canceled columns with these data points.

I added up each of the values in the Number Successful, Number Failed, and Number Canceled columns to populate the Total Projects column. Then, using a mathematical formula, I found the percentage of projects that were successful, failed, or canceled per goal range.

Next, I createe a line chart that graphs the relationship between a goal amount and its chances of success, failure, or cancellation.

Statistical Analysis

Most people would use the number of campaign backers to assess the success of a crowdfunding campaign. Creating a summary statistics table is one of the most efficient ways that data scientists can characterize quantitative metrics, such as the number of campaign backers.

For gaining an in-depth understanding of campaign backers, I evaluated the number of backers of successful and unsuccessful campaigns by creating your own summary statistics table.

I created a new worksheet in the workbook, and created one column for the number of backers of successful campaigns and one column for unsuccessful campaigns.

I also made a table containing a column for the number of backers of successful campaigns and a column for unsuccessful campaigns.

Finally, I used Excel to evaluate the following values for successful campaigns, and then did the same for unsuccessful campaigns:

  • The mean number of backers

  • The median number of backers

  • The minimum number of backers

  • The maximum number of backers

  • The variance of the number of backers

  • The standard deviation of the number of backers

I used the data to determine whether the mean or the median better summarizes the data and to determine if there is more variability with successful or unsuccessful campaigns.

About

Excel data analysis

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published