Data Cleaning & Merging Assignment #4
Replies: 14 comments
-
Here is my filtered Index of DH abstracts dataset. |
Beta Was this translation helpful? Give feedback.
-
Here is my filtered dataset and subsequent explanation here: Sanchita's Filtered Dataset By Sanchita S. Kamath |
Beta Was this translation helpful? Give feedback.
-
My filtered dataset is here: https://github.com/rschneider98/is578-introduction-dh/blob/master/limited_dh_works.csv |
Beta Was this translation helpful? Give feedback.
-
Here is my link: https://github.com/Amoura23/DH-Class/blob/main/dh-conferences-works-csv.csv I was not successful in this assignment. Everything went as it should until I went to upload to GitHub. Even though my .csv only has 71 lines, what is uploaded to Github has 7065. I have no idea why this is. Any thoughts would be appreciated! |
Beta Was this translation helpful? Give feedback.
-
Hi all! Here is my link: https://github.com/kfata2/is578-introduction-to-dh/blob/main/DH-filtered-dataset.csv I filtered it down to entries that had the keyword "Voyant." The only issue I had was actually in GitHub! I completely forgot that the file type is necessary to communicate to GitHub what kind of data is and how you want it to be displayed, so I was super confused when it was displaying as lines rather than a CSV! Renamed it with csv at the end and everything was solved! |
Beta Was this translation helpful? Give feedback.
-
Hello! Here is my filtered dataset and my explanation I'm looking forward to learning how to use OpenRefine in more ways! |
Beta Was this translation helpful? Give feedback.
-
Here is the link to my data set and explanation! |
Beta Was this translation helpful? Give feedback.
-
Hi, here is my link: https://drive.google.com/file/d/1lUWx-6QOhf6e6K1alIkLMk41EvjT7hbP/view?usp=sharing |
Beta Was this translation helpful? Give feedback.
-
Here is my link: https://github.com/yoonsuh3/Datasetsample/blob/main/dh_conferences_works1.csv Sadly, I had the same trouble as [Amoura23]. I think the original dataset was "not clean" and the cell contained more than 500 letters so it wouldn't export properly. |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Here's my file. I don't believe I did this right. I attempted to export the file as a CSV and didn't appear to be successful. I also tried Google Sheets, but received an error message as well. |
Beta Was this translation helpful? Give feedback.
-
Here is my markdown file with a link to the google spreadsheet I was working on! - https://github.com/rubylm2/is578-introduction-to-dh/blob/main/IS578_DH_Tools_merged.md I am going to try again to completely merge the data. |
Beta Was this translation helpful? Give feedback.
-
Hi, Once again I had trouble getting another new tool to work on my computer. This tool did not work well for me. I followed the instructions and took the time to read the tutorials, but I spent a lot of time just trying to get OpenRefine to download and install for me to use. I think I was finally able to get OpenRefine to install successfully on the 4th attempt. Once OpenRefine was installed, I was able to make use of the tool to explore and experiment with the data cleaning tasks. I was able to complete the tasks to transform the data I had loaded into OpenRefine and then filter the data into one tool. While the tool's functionality was certainly quite useful working with filtering and organizing the data into a merged dataset and easier to work with, the process took a great deal of guidance from the aids and tutorials, and I did not find it very intuitive. The OpenRefine tool provided me with an Excel file. The assignment asked for a link to Google Sheets of my merged and filtered dataset. However, Google Sheets did not handle the conversion from Excel spreadsheet to Google Sheets well. My attempts at the time of this assignment to get a link to a Google Sheets with my dataset didn't work. Taking some additional time, I have made some new attempts at getting a readable Google Sheets version of my dataset. I believe I finally have a csv file where the data will at least be viewable. https://drive.google.com/file/d/1729NpfvzncRehuhvPxgLydu0pyum05OX/view?usp=sharing Even opening this file with Google Sheets shows empty cells. I think the data is actually in the cells, however, the formatting is so out of whack that it looks empty, and is impossible to view the data in a Google Sheets spreadsheet. The dataset can be seen in the Excel file. |
Beta Was this translation helpful? Give feedback.
-
You can find my datasets and markdown files for the second option of the assignment here: https://github.com/valarbonies/is578-intro-to-DH/tree/aa19b00df3babc9f4f9958a498f2fc570d369386/DH-DataCleaningMerging |
Beta Was this translation helpful? Give feedback.
-
Post your links to your filtered Index of DH abstracts dataset and if completed your merged datasets
Beta Was this translation helpful? Give feedback.
All reactions