Skip to content

ankit2104/Web-Search-Engine

Repository files navigation

Web-Search-Engine

Web Search Engine developed in Java

Features Used

  1. Web Crawler
  2. Pattern Matching
  3. HTML to Text
  4. Searching Word
  5. Word Suggestion
  6. Page Ranking

Concepts Used (till now):

  1. KMP algorithm --- for efficient word searching
  2. Edit Distance --- for alternative word suggestion
  3. Html to Text --- to convert html files to text files
  4. jsoup -- java library to fetch URLs and extract data
  5. Merge Sort -- for ranking web pages on basis of word occurence
  6. Hashtable -- indexing the file

Project Flow
Project flow