1101-datascience/finalproject-finalproject_group9

[Group9] Bike Sharing Demand

Groups

  • 王靖文, 105703057
  • 鄭以湉, 106304003
  • 黃大瑋, 107207438

Goal

Our goal is to forecast bike rental demand in the Capital Bikeshare program in Washington, D.C. based on different conditions.

Demo

Run the following command to reproduce our results:

Rscript code/data_science_final.R
  • any on-line visualization

Folder organization and its related information

docs

data

  • Source
  • Input format
    • CSV file
  • Any preprocessing?
    • method1 : convert "datetime" variable into four variables (year, month, day, hour)
    • method2 : method1 + remove outliers in "count" variable
    • method3 : method2 + group the hours of the day into four categories, ranging from peak to off-peak periods
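
The three preprocessing methods above could be sketched roughly as follows. This is a minimal illustration in base R on toy data, not the project's actual script: the 1.5 * IQR outlier rule, the hour-to-group assignment, and the helper name `hour_group` are all assumptions, since the README does not state the exact thresholds. The project itself lists `lubridate`, whose `year()`, `month()`, `day()` and `hour()` accessors would do the same job as the `format()` calls here.

```r
# Toy data in the assumed layout: a "datetime" string and a "count" column.
train <- data.frame(
  datetime = c("2011-01-01 00:00:00", "2011-01-01 08:00:00",
               "2011-06-15 17:00:00", "2012-03-10 13:00:00",
               "2012-12-19 23:00:00"),
  count = c(16, 210, 450, 120, 9000),
  stringsAsFactors = FALSE
)

# method1: split "datetime" into year, month, day and hour.
ts <- as.POSIXct(train$datetime, format = "%Y-%m-%d %H:%M:%S", tz = "UTC")
train$year  <- as.integer(format(ts, "%Y"))
train$month <- as.integer(format(ts, "%m"))
train$day   <- as.integer(format(ts, "%d"))
train$hour  <- as.integer(format(ts, "%H"))
train$datetime <- NULL

# method2: remove outliers in "count".
# ASSUMPTION: the 1.5 * IQR rule; the project may use a different criterion.
q <- quantile(train$count, c(0.25, 0.75))
iqr <- q[2] - q[1]
keep <- train$count >= q[1] - 1.5 * iqr & train$count <= q[2] + 1.5 * iqr
clean <- train[keep, ]

# method3: map each hour to one of four groups, from peak to off-peak.
# ASSUMPTION: this hour assignment is hypothetical and only illustrates the idea.
hour_group <- function(hour) {
  group <- rep("off_peak", length(hour))
  group[hour %in% c(10:15, 20, 21)] <- "medium"
  group[hour %in% c(7, 9, 16, 19)]  <- "high"
  group[hour %in% c(8, 17, 18)]     <- "peak"
  factor(group, levels = c("peak", "high", "medium", "off_peak"))
}
clean$hour_group <- hour_group(clean$hour)

print(clean)
```

In practice the group boundaries would be chosen by looking at the average `count` per hour in the training data rather than fixed by hand.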

code

  • Which method do you use?
    • Lasso, Xgboost, Random Forest
  • What is a null model for comparison?
    • Our null model is the mean of the count from training data.
  • How do you perform evaluation? i.e. cross-validation, or an additional independent data set
    • We use cross-validation to select the optimal hyperparameters, then train the final models with the selected values.
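
The null model and the cross-validation step described above can be illustrated with a short base R sketch. It is not the project's code: the data are synthetic, and an `lm()` fit with a polynomial degree stands in for the real models and their hyperparameters (for example the Lasso penalty or the XGBoost tree depth), so that the example runs without extra packages. The names `rmsle`, `cv_error` and `best_degree` are illustrative only.

```r
set.seed(1)

# Synthetic stand-in data (NOT the project's data): count varies with hour.
n <- 200
dat <- data.frame(hour = sample(0:23, n, replace = TRUE))
dat$count <- round(pmax(0, 150 + 80 * sin(dat$hour / 24 * 2 * pi) + rnorm(n, sd = 15)))

train_idx <- sample(n, 150)
train <- dat[train_idx, ]
test  <- dat[-train_idx, ]

# Root Mean Squared Logarithmic Error; predictions are clamped at 0 first.
rmsle <- function(actual, predicted) {
  sqrt(mean((log1p(pmax(predicted, 0)) - log1p(actual))^2))
}

# Null model: predict the training-set mean of "count" for every test row.
null_pred  <- rep(mean(train$count), nrow(test))
null_rmsle <- rmsle(test$count, null_pred)

# K-fold cross-validation over a small hyperparameter grid.
# ASSUMPTION: polynomial degree is a stand-in for the real hyperparameters.
k <- 5
folds <- sample(rep(seq_len(k), length.out = nrow(train)))
degrees <- 1:4
cv_error <- sapply(degrees, function(d) {
  mean(sapply(seq_len(k), function(f) {
    fit <- lm(count ~ poly(hour, d), data = train[folds != f, ])
    held_out <- train[folds == f, ]
    rmsle(held_out$count, predict(fit, newdata = held_out))
  }))
})
best_degree <- degrees[which.min(cv_error)]

# Refit on all training data with the selected value, then score on the test set.
final_fit   <- lm(count ~ poly(hour, best_degree), data = train)
model_rmsle <- rmsle(test$count, predict(final_fit, newdata = test))

cat("null RMSLE:", null_rmsle, "\n")
cat("model RMSLE:", model_rmsle, "(degree", best_degree, ")\n")
```

The same pattern applies to the listed packages, where the hand-written loop would typically be replaced by the package's own cross-validation helper.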

results

  • Which metric do you use
    • Root Mean Squared Logarithmic Error(RMSLE)
  • Is your improvement significant?
    • Yes. We created several versions of the training data, each using a different data-cleaning method, and the testing RMSLE decreased significantly when the models were trained on those versions.
  • What is the most challenging part of your project?
    • data cleaning
  • Score on Kaggle
    score
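
For reference, the RMSLE metric named above is commonly defined as follows, where $n$ is the number of test rows, $p_i$ the predicted count and $a_i$ the actual count (these symbols are introduced here for illustration and do not appear in the project's code):

```latex
\mathrm{RMSLE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \bigl( \log(p_i + 1) - \log(a_i + 1) \bigr)^2}
```

Because the error is taken on a log scale, the metric measures relative rather than absolute differences, so a miss of 10 rentals matters more in a quiet hour than in a peak hour.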

References

  • Code/implementation which you include/reference (You should indicate in your presentation if you use code for others. Otherwise, cheating will result in 0 score for final project.)
  • Packages you use
    • library(lubridate)
    • library(randomForest)
    • library(glmnet)
    • library(ModelMetrics)
    • library(xgboost)
  • Related publications

About

finalproject-finalproject_group9 created by GitHub Classroom
