Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chore: Create validation dataset from LSST community forum #21

Open
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

parvatijay2901
Copy link
Contributor

Issue #1

This pull request introduces a new script for scraping data from the LSST community forum. The most important changes include the addition of the scraping script, updates to the README file, and the removal of outdated notebook code.

New Features and Documentation:

  • Scraping Script:

    • eval/data_extraction/scrape_lsst.py: Added a new script to scrape discussions from the LSST community forum, store the extracted data into a CSV file, and support incremental scraping.
  • README Update:

    • eval/data_extraction/README.md: Updated the README file to include an overview of the new scraping script, its file structure, usage instructions, and details about the output data fields.

Code Cleanup:

  • Notebook Removal:

@parvatijay2901 parvatijay2901 self-assigned this Mar 13, 2025
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@parvatijay2901 parvatijay2901 requested a review from vanitech March 13, 2025 02:31
@parvatijay2901
Copy link
Contributor Author

TO Do: Add the script that takes the raw .csv file and remove HTML tags from question and answer.

@parvatijay2901
Copy link
Contributor Author

@vanitech I have made sure to remove the author name from the scripts/dataset

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant