WikiContext Streamlit App

Wikipedia articles are rarely easy reading. They often rely on jargon that is hard for a layperson to follow. This project tries to make any Wikipedia article easier to understand by summarizing some of its key concepts. We provide the context, hence the name WikiContext.

Streamlit UI Animation

Beneath the hood

WikiContext uses extractive text summarization based on TextRank. The first step is to build a keyword list from the hyperlinks present in the article. The keywords are then ranked to identify the most relevant ones in the article. Finally, the data for those keywords is fetched and summarized.
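The pipeline described above can be illustrated with a minimal, standard-library-only sketch. It is not the project's actual code: the function names (`rank_keywords`, `textrank_summary`), the frequency-based keyword ranking, and the naive sentence splitter are all assumptions made for illustration. The summarizer follows the usual TextRank idea of scoring sentences by a PageRank-style iteration over a word-overlap similarity graph.

```python
import math
import re


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def rank_keywords(article_text, link_titles, top_n=3):
    """Hypothetical ranking: order hyperlink titles by how often they occur in the text."""
    lowered = article_text.lower()
    ranked = sorted(link_titles, key=lambda t: lowered.count(t.lower()), reverse=True)
    return [t for t in ranked if lowered.count(t.lower()) > 0][:top_n]


def similarity(a, b):
    """TextRank-style similarity: shared words normalised by log sentence lengths."""
    if len(a) < 2 or len(b) < 2:
        return 0.0  # avoids a zero denominator for one-word sentences
    return len(set(a) & set(b)) / (math.log(len(a)) + math.log(len(b)))


def textrank_summary(text, num_sentences=2, damping=0.85, iterations=50):
    """Return the top-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    n = len(sentences)
    if n <= num_sentences:
        return sentences
    tokens = [tokenize(s) for s in sentences]
    # Weighted, undirected sentence graph (no self-loops).
    weights = [
        [similarity(tokens[i], tokens[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    out_sum = [sum(row) for row in weights]
    scores = [1.0] * n
    # Fixed number of PageRank-style update rounds.
    for _ in range(iterations):
        scores = [
            (1 - damping) + damping * sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n) if out_sum[j] > 0
            )
            for i in range(n)
        ]
    top = sorted(range(n), key=lambda i: scores[i], reverse=True)[:num_sentences]
    return [sentences[i] for i in sorted(top)]
```

In this sketch, `rank_keywords(article_text, link_titles)` would pick the hyperlink titles to look up, and `textrank_summary` would then be applied to the text fetched for each of them. A sentence that shares no words with any other sentence keeps only the baseline score and is therefore the first to be dropped.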

What's next?

  1. Use Wikipedia2Vec to obtain the embeddings of a Wikipedia subject and use them to identify prerequisite sub-topics.

  2. Add a BERT extractive text summarizer as an additional summarization algorithm. We are exploring the option of using the Serverless Framework to deploy this service.

Contributing

This project is under active development. Feel free to raise a pull request.

Link 1: Streamlit App (Hugging Face Spaces)

Link 2: Streamlit App (Streamlit Cloud)