Identify missing csv data preprocessing methods and implement the methods with a test case | Generic Issue - Not to be assigned #12

Ask149 · 2021-03-09T18:54:47Z

Description

Identify the CSV data preprocessing methods that are missing from this repository.
Find suitable kinds of data and machine learning problems for which each method should be used.
Implement the method only for those cases.

Please Note -
This is a generic issue, and multiple students can work on it. Notify the mentors once you identify a method (as described above). A mentor will then create a separate issue and assign it to you.

Contribution guidelines will be updated soon. Please refer to them for guidance before committing any development work.

rubyruins · 2021-03-10T05:26:29Z

Hi, I would like to work on this for GSSOC. Could you give an example of what kinds of missing data visualization methods you are looking to implement?

Ask149 · 2021-03-10T06:37:58Z

Hi @rubyruins, thank you for your interest. If you take a look at csv_preprocess.py, there are methods such as filling numerical NA values, normalizing numerical columns, label-encoding categorical columns, etc. Identify any methods that we have missed and that should be included. One I can think of for now: identify the format of a date column, then extract the month, day of the week, date, year, etc. from it and append them to the column list.
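
A minimal sketch of the date-column idea, using only the standard library. The function names `detect_date_format` and `add_date_features`, the candidate format list, and the list-of-dicts row layout are all assumptions made for illustration; none of them come from csv_preprocess.py.

```python
from datetime import datetime

# Formats to try, in order. This list is an illustrative assumption.
CANDIDATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%m/%d/%Y", "%d-%b-%Y")


def detect_date_format(values, formats=CANDIDATE_FORMATS):
    """Return the first format that parses every non-empty value, else None."""
    present = [v for v in values if v]
    if not present:
        return None
    for fmt in formats:
        try:
            for value in present:
                datetime.strptime(value, fmt)
        except ValueError:
            continue  # at least one value did not match; try the next format
        return fmt
    return None


def add_date_features(rows, column):
    """Return new row dicts with year, month, day and weekday fields appended."""
    fmt = detect_date_format([row.get(column) for row in rows])
    if fmt is None:
        raise ValueError(f"could not identify a date format for column {column!r}")
    out = []
    for row in rows:
        new_row = dict(row)  # leave the input rows untouched
        raw = row.get(column)
        parsed = datetime.strptime(raw, fmt) if raw else None
        new_row[f"{column}_year"] = parsed.year if parsed else None
        new_row[f"{column}_month"] = parsed.month if parsed else None
        new_row[f"{column}_day"] = parsed.day if parsed else None
        # weekday() counts from Monday = 0 to Sunday = 6
        new_row[f"{column}_weekday"] = parsed.weekday() if parsed else None
        out.append(new_row)
    return out
```

Note that a column such as `01/02/2021` matches both day-first and month-first formats; this sketch simply takes whichever appears first in the candidate list, so a real implementation would probably let the caller pass the format explicitly.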

rubyruins · 2021-03-11T06:26:51Z

@Ask149 sounds good. Do let me know if there are any other examples you can think of. If you can create an issue for those, I can start working on them. Hopefully, we can discuss it this evening!

ashish-hacker · 2021-03-11T12:10:00Z

@Ask149 I checked out csv_preprocess.py and noticed there is only one method for scaling the features, i.e., min-max normalisation. I think it would be better to add more scaling methods, such as mean normalisation and standardisation (for Gaussian-distributed features), to make it more flexible.
May I work on this?
I am a participant in GSSOC'21.
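
For reference, the two scaling methods mentioned above could look roughly like this. It is a standard-library sketch only: the function names `mean_normalize` and `standardize`, the choice of population standard deviation, and the handling of constant columns are assumptions, not the API of csv_preprocess.py.

```python
from statistics import fmean, pstdev


def mean_normalize(values):
    """Scale each value to (x - mean) / (max - min); results lie in [-1, 1]."""
    mean = fmean(values)
    spread = max(values) - min(values)
    if spread == 0:
        # Constant column: every value equals the mean, so map all to 0.
        return [0.0 for _ in values]
    return [(x - mean) / spread for x in values]


def standardize(values):
    """Scale each value to (x - mean) / std, giving mean 0 and unit std."""
    mean = fmean(values)
    std = pstdev(values)  # population standard deviation (assumed choice)
    if std == 0:
        return [0.0 for _ in values]
    return [(x - mean) / std for x in values]
```

Min-max normalisation fixes the output range, whereas standardisation fixes the mean and variance, which is why the latter is usually suggested for roughly Gaussian features.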

Ask149 · 2021-03-11T12:46:17Z

Thank you for your interest, @ashish-hacker! Could you please refer to issue #15 and add a similar short description before you start the implementation? We can discuss it beforehand and make it precise. I am assigning an issue under your name, Issue #16; use it to add the details.

ashish-hacker · 2021-03-11T13:02:30Z

Sure @Ask149 : )

Ask149 added the enhancement (New feature or request), good first issue (Good for newcomers), and gssoc21 (GirlScript Summer of Code'21) labels on Mar 9, 2021

Ask149 mentioned this issue on Mar 10, 2021 in #13 (Open): Identify missing csv data visualization methods and implement the methods with a test case | Generic Issue - Not to be assigned

Ask149 mentioned this issue on Mar 10, 2021 in #14 (Open): Identify missing image data preprocessing methods and implement the methods with a test case | Generic Issue - Not to be assigned

Ask149 added the Level3 (Very Hard Level Difficulty) label and removed the good first issue (Good for newcomers) label on Apr 3, 2021
