Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Python scraping files #6

Closed
wants to merge 1 commit into from
Closed

Python scraping files #6

wants to merge 1 commit into from

Conversation

venvis
Copy link
Collaborator

@venvis venvis commented Feb 8, 2024

There are three python files :

1.operative_extractions.py- This is a python script with a class that takes a Celex id when initializing the class and returns the operative part , scraped from the web link of the respective Celex ID.The purpose of the different functions within the class is to overcome the different html page structures and syntax of different web pages , accordingly looping the respective page of the given celex id through each structure until it gets the operative part from the website. All the functions are of return type ->list.It doesn't take care of scraping of operative id from old cases since function for that already exists

2.para.py-This python script extracts the Paragraph (P) numbers of the citations and returns them in a list only if they are found else it returns an empty list.

3.Citations.py-This python script extracts the citations(CELEX ID) and returns them in a list only if they are found else it returns an empty list.

@shashankmc
Copy link
Member

Rejecting because of formatting, documentation and structure issues.

@shashankmc shashankmc closed this Mar 6, 2024
@shashankmc shashankmc deleted the new_branch branch July 4, 2024 13:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants