```mermaid
graph TD;
    Start --> LoadData;
    LoadData --> PreprocessText;
    PreprocessText --> ChooseMetrics;
    ChooseMetrics --> ApplyTOPSIS;
    ApplyTOPSIS --> RankModels;
    RankModels --> VisualizeResults;
    VisualizeResults --> End;
```
Text summarization is a crucial natural language processing task that involves condensing large documents into concise and informative summaries. This project focuses on comparing the performance of various text summarization models to help users choose the most suitable model for their specific needs.
Metrics Considered:
- The comparison is based on three metrics: ROUGE scores, summary length, and training time. ROUGE scores assess the quality of the generated summaries, while summary length and training time capture efficiency and resource requirements (a scoring sketch follows below).
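As a quick illustration, ROUGE can be computed with Google's `rouge-score` package. This is a minimal sketch; the two strings are made-up placeholders, not data from this project:

```python
# Minimal sketch: scoring one candidate summary against a reference
# using the rouge-score package (pip install rouge-score).
# The reference/candidate strings are illustrative placeholders.
from rouge_score import rouge_scorer

reference = "The cat sat on the mat and watched the birds outside."
candidate = "The cat sat on the mat watching birds."

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)
scores = scorer.score(reference, candidate)

for name, score in scores.items():
    # Each entry holds precision, recall, and F-measure.
    print(f"{name}: F1 = {score.fmeasure:.3f}")
```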
Methodology - TOPSIS:
- The Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) is used for the comparison. It ranks each model by its distance to an ideal solution (the best value on every criterion) and to a negative-ideal solution (the worst value on every criterion), combining both into a single closeness score (see the sketch below).
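Here is a minimal NumPy sketch of the TOPSIS procedure. The weights, the treatment of summary length and training time as cost criteria, and the two example rows are illustrative assumptions, not values taken from this repository:

```python
import numpy as np

def topsis(matrix, weights, benefit):
    """Return TOPSIS closeness scores (higher = better).

    matrix  : (n_alternatives, n_criteria) raw decision matrix
    weights : criterion weights, summing to 1
    benefit : True where higher is better, False where lower is better
    """
    # Vector-normalize each criterion column, then apply the weights.
    v = matrix / np.sqrt((matrix ** 2).sum(axis=0)) * weights
    # Ideal / negative-ideal points depend on benefit vs. cost criteria.
    ideal = np.where(benefit, v.max(axis=0), v.min(axis=0))
    anti = np.where(benefit, v.min(axis=0), v.max(axis=0))
    # Euclidean distances to both reference points.
    d_pos = np.sqrt(((v - ideal) ** 2).sum(axis=1))
    d_neg = np.sqrt(((v - anti) ** 2).sum(axis=1))
    return d_neg / (d_pos + d_neg)

# Illustrative example: two rows from the table further below, with assumed
# weights; ROUGE is a benefit criterion, length and training time are costs.
matrix = np.array([[0.75, 130.0, 9.0],    # BERTSumExt
                   [0.82, 150.0, 12.0]])  # GPT-3
print(topsis(matrix, np.array([0.5, 0.25, 0.25]), np.array([True, False, False])))
```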
Models Evaluated:
- The comparison covers widely used real-world pretrained models: BERTSumExt, GPT-3, T5, XLNet, BART, and Pegasus (a loading example follows below).
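For reference, several of these models can be run through the Hugging Face `transformers` summarization pipeline. The checkpoint name below is one common public choice, not necessarily the weights or settings used in this project:

```python
# Sketch: generating a summary with a pretrained BART checkpoint via
# Hugging Face transformers (pip install transformers torch).
# "facebook/bart-large-cnn" is a common public checkpoint; this repo
# may have used different weights or generation settings.
from transformers import pipeline

summarizer = pipeline("summarization", model="facebook/bart-large-cnn")
article = "Text summarization condenses long documents into short summaries. " * 5
result = summarizer(article, max_length=60, min_length=20, do_sample=False)
print(result[0]["summary_text"])
```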
Project Files:
- `data.csv`: CSV file containing evaluation metrics for each model.
- `result.csv`: CSV file with ranked results in tabular format.
- `result.csv`: CSV file with data used for creating the bar chart.
- `barchart.png`: Bar chart visualizing the model comparison.
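The following sketch ties these files to the `topsis()` helper defined above. The column names are assumptions about the layout of `data.csv`, not confirmed by the repository:

```python
# Sketch: load the metrics, score them with the topsis() helper defined
# in the earlier sketch, and save ranked results.
# Column names ("Rouge", "Length", "TrainTime") are assumed, not verified.
import numpy as np
import pandas as pd

df = pd.read_csv("data.csv")
scores = topsis(df[["Rouge", "Length", "TrainTime"]].to_numpy(dtype=float),
                np.array([0.5, 0.25, 0.25]),       # illustrative weights
                np.array([True, False, False]))    # only ROUGE is a benefit
df["TopsisScore"] = scores
df["Rank"] = df["TopsisScore"].rank(ascending=False).astype(int)
df.sort_values("Rank").to_csv("result.csv", index=False)
```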
- Clone the Repository:

```bash
git clone https://github.com/Shivam-Verma1/Pretrained-model-Comparison-using-Topsis.git
```
- Ranked Table:
- Explore the detailed ranked results in `summarization_table_result.csv`:
| Model      | ROUGE Score | Summary Length | Training Time |
|------------|-------------|----------------|---------------|
| BERTSumExt | 0.75        | 130            | 9             |
| GPT-3      | 0.82        | 150            | 12            |
| T5         | 0.78        | 140            | 10            |
| XLNet      | 0.76        | 135            | 11            |
| BART       | 0.80        | 145            | 8             |
| Pegasus    | 0.79        | 138            | 13            |
- Bar Chart:
The bar chart visually compares the performance metrics of each model, providing an easy-to-understand overview. ROUGE score, summary length, training time, and normalized rank are included for a comprehensive evaluation; a minimal plotting sketch follows.
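Here is a minimal matplotlib sketch for reproducing such a chart. It assumes the metrics are min-max normalized so they share one axis; the actual `barchart.png` may use a different normalization or layout:

```python
# Sketch: grouped bar chart of per-model metrics from the table above,
# min-max normalized for display. The real barchart.png may differ.
import numpy as np
import matplotlib.pyplot as plt

models = ["BERTSumExt", "GPT-3", "T5", "XLNet", "BART", "Pegasus"]
raw = np.array([[0.75, 130, 9], [0.82, 150, 12], [0.78, 140, 10],
                [0.76, 135, 11], [0.80, 145, 8], [0.79, 138, 13]])
norm = (raw - raw.min(axis=0)) / (raw.max(axis=0) - raw.min(axis=0))

x = np.arange(len(models))
width = 0.25
for i, label in enumerate(["ROUGE", "Summary Length", "Training Time"]):
    # Offset each metric's bars so the three groups sit side by side.
    plt.bar(x + (i - 1) * width, norm[:, i], width, label=label)
plt.xticks(x, models, rotation=30)
plt.ylabel("Normalized value")
plt.legend()
plt.tight_layout()
plt.savefig("barchart.png")
```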
Model Performance:
- GPT-3 outperforms the other models on ROUGE score (0.82), securing the top rank, with BART (0.80) and Pegasus (0.79) following closely.

Efficiency Consideration:
- Per the table above, BART has the lowest training time (8), with BERTSumExt (9) and T5 (10) close behind; BART and T5 offer a balanced trade-off between ROUGE score and efficiency.

Next Steps:
- Analyze the provided CSV files for further insights.
- Adjust the evaluation metrics or add new models based on your specific use case.
- Use the project as a foundation for ongoing research and development in text summarization.