Skip to content

College Counseling Ranklist Evaluator is a Python tool for analyzing rank lists during the college counseling process. It extracts and filters data from PDF rank lists based on criteria like ranks, colleges, and search terms, and outputs the results as Word documents for easy review.

License

Notifications You must be signed in to change notification settings

agnivadas/Ranklist_Evaluator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

College Counseling Rank Evaluator

GPL License Image_Categorizer Maintenance Static Badge

College Counseling Rank Evaluator is a Python-based tool designed to analyze PDF file of rank lists for the college counseling process. It allows users to filter ranks, evaluate eligibility for specific colleges, and extract relevant data into user-friendly formats Word document. This tool could be used for filtering pdf for other purpose also as per user.

Features

  • PDF Rank List Extraction: Extracts tabular data from PDF files.
  • Search and Filter: Filters ranks based on specific criteria such as search terms, rank ranges, and college preferences.
  • Customizable Output: Saves filtered data as CSV and generates Word tables for easy review.
  • Automated Workflow: Simplifies rank evaluation for efficient college counseling.

Requirements

Ensure you have the following dependencies installed:

  • Python 3.7 or later
  • pdfplumber
  • python-docx Install dependencies using::
pip install pdfplumber python-docx

Usage

1.Input Requirements:

Provide a rank list in PDF format. Specify relevant parameters like column indices, rank ranges, and search terms.

2.Run the Script:

python ranklist_evaluator.py

3.Output:

A filtered rank list saved as a DOCX file. A Word document containing the extracted data in a tabular format.

Example

# Input parameters
pdf_path = 'sample.pdf' 
min_rank = 200    
max_rank = 1000  
main_term = 'All India' 
main_term_index = 2  

# Extract data
temp_csv_path = extract_rows_with_optional_filters(
    pdf_path,
    sc_column_index,
    rank_column_index=1,
    college_column_index=None,
    min_rank=min_rank,
    max_rank=max_rank,
    target_colleges=['M.D. (PAEDIATRICS)'],
    search_term=search_term
)

# Save as Word document
output_docx_path = 'output.docx'
save_to_docx_table(temp_csv_path, output_docx_path)

Demo

Suppose the tables of ranklist pdf look like this below table structure.

  • To filter all rows with alloted quota 'All India' main_term = 'All India'
  • Put main_term column number in main_term_index(column number starts with 0) main_term_index = 2
  • For rank filtering min and max rank and the rank column index
  • For target college or subject use college_column_index and target_colleges.
       # For example 
        college_column_index= 4
        target_colleges=['M.D. (PAEDIATRICS)']
    Multiple target_colleges could be used .
      # For example 
       college_column_index= 3
       target_colleges=['Rajasthan','West Bengal']

Contributing

Contributions are welcome! Feel free to submit a pull request or open an issue for suggestions and bug reports.

Acknowledgements

Special thanks to the open-source community for tools like pdfplumber and python-docx, which make this project possible.

About

College Counseling Ranklist Evaluator is a Python tool for analyzing rank lists during the college counseling process. It extracts and filters data from PDF rank lists based on criteria like ranks, colleges, and search terms, and outputs the results as Word documents for easy review.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages