Text Summarization Model Comparison Project

graph TD;
  Start --> LoadData;
  LoadData --> PreprocessText;
  PreprocessText --> ChooseMetrics;
  ChooseMetrics --> ApplyTOPSIS;
  ApplyTOPSIS --> RankModels;
  RankModels --> VisualizeResults;
  VisualizeResults --> End;

Overview

Text summarization is a crucial natural language processing task that involves condensing large documents into concise and informative summaries. This project focuses on comparing the performance of various text summarization models to help users choose the most suitable model for their specific needs.

Key Features:

  1. Metrics Considered:

    • The comparison is based on three criteria: ROUGE scores, summary length, and training time. ROUGE scores assess the quality of the generated summaries, while summary length and training time capture efficiency and resource requirements. (A short sketch of how ROUGE scores can be computed follows this list.)
  2. Methodology - TOPSIS:

    • The Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) is used for the comparison. Each model is scored by its distance to the ideal solution (the best value on every criterion) and to the negative-ideal solution (the worst value on every criterion); the relative closeness to the ideal determines the final ranking. (A minimal implementation sketch follows this list.)
  3. Models Evaluated:

    • Real-world pretrained models, such as BERTSumExt, GPT-3, T5, XLNet, BART, and Pegasus, are included in the comparison. These models are widely used in text summarization tasks.
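The ROUGE values used as inputs here are assumed to be precomputed. As an illustration of where such numbers could come from (not how this repository produced them), the following minimal sketch uses the third-party rouge-score package; the reference and candidate summaries are placeholder strings, not data from this project:

```python
# pip install rouge-score
from rouge_score import rouge_scorer

# Hypothetical reference and model-generated summaries (not project data).
reference = "The committee approved the budget after a long debate."
candidate = "After a long debate, the committee approved the budget."

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)
scores = scorer.score(reference, candidate)

for name, score in scores.items():
    # Each entry holds precision, recall and F-measure for that ROUGE variant.
    print(f"{name}: F1 = {score.fmeasure:.3f}")
```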
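To make the TOPSIS step concrete, here is a minimal, self-contained sketch of the method in Python with NumPy. It is not the repository's own implementation; the weights and the benefit/cost direction of each criterion are left as parameters and are illustrative assumptions:

```python
import numpy as np

def topsis(matrix, weights, benefit):
    """Rank alternatives with TOPSIS.

    matrix  -- shape (n_models, n_criteria), raw metric values
    weights -- one weight per criterion (normalised to sum to 1 below)
    benefit -- True where larger is better (e.g. ROUGE), False where
               smaller is better (e.g. training time)
    """
    matrix = np.asarray(matrix, dtype=float)
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()
    benefit = np.asarray(benefit, dtype=bool)

    # Vector-normalise each criterion column, then apply the weights.
    norm = matrix / np.sqrt((matrix ** 2).sum(axis=0))
    weighted = norm * weights

    # Ideal and negative-ideal solutions per criterion.
    ideal = np.where(benefit, weighted.max(axis=0), weighted.min(axis=0))
    anti_ideal = np.where(benefit, weighted.min(axis=0), weighted.max(axis=0))

    # Euclidean distance of every model to both reference points.
    d_pos = np.sqrt(((weighted - ideal) ** 2).sum(axis=1))
    d_neg = np.sqrt(((weighted - anti_ideal) ** 2).sum(axis=1))

    # Relative closeness: 1.0 would be the ideal solution, 0.0 the worst.
    closeness = d_neg / (d_pos + d_neg)
    ranks = closeness.argsort()[::-1].argsort() + 1  # rank 1 = best
    return closeness, ranks
```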

Project Structure:

  • data.csv: CSV file containing evaluation metrics for each model.
  • result.csv: CSV file with the ranked results; also used as the data source for the bar chart.
  • barchart.png: Bar chart visualizing the model comparison.

How to Run:

  1. Clone the Repository:
    git clone https://github.com/Shivam-Verma1/Pretrained-model-Comparison-using-Topsis.git
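The repository layout above does not name a driver script, so the exact entry point may differ. As a rough sketch of how the ranking could be reproduced from data.csv using the topsis() helper sketched earlier, assuming the column names from the results table below and illustrative weights and benefit/cost directions:

```python
import pandas as pd
# `topsis` is the helper sketched in the Key Features section above.

df = pd.read_csv("data.csv")

# Assumed column names, mirroring the results table in this README.
criteria = ["Rouge Scores", "Length of Summary", "Training Time"]
matrix = df[criteria].to_numpy()

# Illustrative choices: ROUGE is a benefit criterion; summary length and
# training time are treated as cost criteria (smaller is better).
closeness, ranks = topsis(matrix,
                          weights=[0.5, 0.25, 0.25],
                          benefit=[True, False, False])

df["Topsis Score"] = closeness
df["Rank"] = ranks
df.sort_values("Rank").to_csv("result.csv", index=False)
print(df.sort_values("Rank"))
```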
    

Results and Analysis:

  1. Ranked Table:

  • Explore the detailed ranked results in summarization_table_result.csv:

| Model      | Rouge Scores | Length of Summary | Training Time |
|------------|--------------|-------------------|---------------|
| BERTSumExt | 0.75         | 130               | 9             |
| GPT-3      | 0.82         | 150               | 12            |
| T5         | 0.78         | 140               | 10            |
| XLNet      | 0.76         | 135               | 11            |
| BART       | 0.80         | 145               | 8             |
| Pegasus    | 0.79         | 138               | 13            |
  2. Bar Chart:

The bar chart visually represents the performance metrics of each model, providing an easy-to-understand comparison. ROUGE scores, summary length, training time, and normalized ranks are all included for a comprehensive evaluation.

![Model comparison bar chart](barchart.png)
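The shipped chart is barchart.png. If you want to regenerate something similar from result.csv (or whichever CSV holds the ranked metrics), a rough matplotlib sketch, again assuming the column names above and min-max normalising each metric so the differently scaled columns fit on one axis, might look like this:

```python
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv("result.csv").set_index("Model")  # assumed column names
criteria = ["Rouge Scores", "Length of Summary", "Training Time"]

# Min-max normalise each metric so values on very different scales
# (e.g. 0.82 vs. 150) can share one axis.
scaled = (df[criteria] - df[criteria].min()) / (df[criteria].max() - df[criteria].min())

ax = scaled.plot(kind="bar", figsize=(10, 5),
                 title="Text summarization model comparison (normalised metrics)")
ax.set_ylabel("Normalised value")
plt.tight_layout()
plt.savefig("barchart.png")
```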

Analysis:

  • Model Performance: GPT-3 outperforms the other models on ROUGE score (0.82), securing the top rank; BART (0.80) and Pegasus (0.79) follow closely, showcasing competitive performance.
  • Efficiency Consideration: BART has the lowest training time (8), closely followed by BERTSumExt (9); combined with its strong ROUGE score, BART offers the best trade-off between quality and efficiency, with T5 as a reasonable middle ground.
  • Next Steps: Analyze the provided CSV files for further insights, adjust the evaluation metrics or add new models to suit your specific use case, and use the project as a foundation for ongoing research and development in text summarization.

About

TOPSIS for ranking pretrained models, demonstrated on text summarization in particular but applicable to any pretrained model in general, using the parameters that affect them.
