The repository include the evaluation code fot the SumTO summarization system proposed for the FNS 2020 Shared Task.
Summarizer.pyinclude the code for the Summarizer python object. It is able to initialize the model and perform the summarization using the.summarize()functionsummarize.pycontains the code to initialize and apply the model to pre-parsed input data collections.- In
summarize.py:DATA_DIRandTEST_DIRshould be set according to your environment configuration - In
summarize.py:YourSystemIDshould be set according to your output folder (it will contain the summarized documents at the end of the summarization process) components/Dataset.pycontains the Dataset class exploited by the summarization algorithm to predict the summaries.create_dataset.pycontains the instructions to create and store the Dataset object (this version is intended explictly for the test set).
Available at https://huggingface.co/morenolq/SumTO_FNS2020 or using the transformers python library with the tag morenolq/SumTO_FNS2020
La Quatra, M., & Cagliero, L. (2020, December). End-to-end Training For Financial Report Summarization. In Proceedings of the 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation (pp. 118-123).