Nepali Extractive Text Summarization using TextRank algorithm

Working of this model follows these steps

Read the text.
Remove useless characters from the text.
Splitting text into sentences and words as array.
Tokenizing individual words present.
Calculating influence factor of each words.
Measuring average influence of the sentence using word influence.
Ranking sentences based on that influences.
Re-sequencing top N influencial sentences sentences with
Displaying summarized text and saving it to the output.txt file.

Installing the dependencies

This model barely uses any complicated libraries, rather it uses numpy. Version I used is given in requirements.txt. But it is more likely latest version of numpy works just fine.

pip install numpy

If you are facing any compatibility issues try:

pip install -r requirements.txt

Inferencing the model

This model can be inferenced by various ways. First step is to clone the repo.

git clone https://github.com/AnjaanKhadka/Extractive-text-summarization-Nepali.git

Then execute following code to get summary from text in sample.txt file

python main.py

This inferencing will give summarized text as:

Sample text

Summarized text

To use this model on custom text file, execute following code.

python main.py -i <path_to_your_text_file> -o <path_to_your_text_file>

Or you can execute

python main.py -t

Then CLI asks for the text input and you can get summary that way as well.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
difference.txt		difference.txt
fix_kriyapads.py		fix_kriyapads.py
kriyapad.txt		kriyapad.txt
kriyapad2.txt		kriyapad2.txt
kriyapad_backup.txt		kriyapad_backup.txt
main.py		main.py
minimal_kriyapad.txt		minimal_kriyapad.txt
old_kriyapad.txt		old_kriyapad.txt
out.txt		out.txt
output.txt		output.txt
output_screen.txt		output_screen.txt
ranker.py		ranker.py
readme.md		readme.md
sample.txt		sample.txt
samyojak.txt		samyojak.txt
screen.txt		screen.txt
stopwords.txt		stopwords.txt
tokenizer.py		tokenizer.py
valid_chars.json		valid_chars.json
word_endings.txt		word_endings.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nepali Extractive Text Summarization using TextRank algorithm

Working of this model follows these steps

Installing the dependencies

Inferencing the model

Sample text

Summarized text

About

Releases

Packages

Languages

AnjaanKhadka/Extractive-text-summarization-Nepali

Folders and files

Latest commit

History

Repository files navigation

Nepali Extractive Text Summarization using TextRank algorithm

Working of this model follows these steps

Installing the dependencies

Inferencing the model

Sample text

Summarized text

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages