
lehai2909/learn-web-scraping


Welcome to my Github.io

This is a repository for learning to scrape web data from Wikipedia.

Why I created this project: as a first step, I wanted to scrape data and handle it myself. So I chose Wikipedia, with support from the BeautifulSoup library (Python). I also use the Streamlit library to build a tiny app.

I know scraping manually seems complex when Wikipedia already has a bunch of convenient APIs for this, but I want to start simply and stupidly 😁.
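As a rough illustration of the manual approach described above, here is a minimal sketch of parsing a page with BeautifulSoup. It is not the code from my_app.py: the `SAMPLE_HTML` string and the `extract_title_and_paragraphs` helper are made up for this example, and the HTML is an inline stand-in so the snippet runs without a network connection. In a real run the HTML would be downloaded from a Wikipedia page first.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the HTML of a downloaded Wikipedia page.
# A real page is much larger; this only mimics a heading and two paragraphs.
SAMPLE_HTML = """
<html><body>
<h1>Web scraping</h1>
<div>
  <p>Web scraping is data extraction from websites.</p>
  <p>See also <a href="/wiki/Data_mining">data mining</a>.</p>
</div>
</body></html>
"""


def extract_title_and_paragraphs(html):
    """Return the first <h1> text and the text of every <p> in the page."""
    # "html.parser" is the parser bundled with Python, so no extra install is needed.
    soup = BeautifulSoup(html, "html.parser")

    heading = soup.find("h1")
    title = heading.get_text().strip() if heading else ""

    # get_text() flattens nested tags (such as links) into plain text.
    paragraphs = [p.get_text().strip() for p in soup.find_all("p")]
    return title, paragraphs


title, paragraphs = extract_title_and_paragraphs(SAMPLE_HTML)
print(title)
for text in paragraphs:
    print("-", text)
```

Keeping the parsing in a function that takes an HTML string makes it easy to try on saved pages before pointing it at live ones.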

You can try it by cloning the repo and installing the necessary dependencies with pip (I provided the requirements.txt, you're very welcome). Or you can use the virtual environment that I included (more about this here).

Run the app with the command:

streamlit run my_app.py

Then, head to http://localhost:8501 in your browser and check the app.

Have a nice day!

About

An app built with Streamlit to scrape data from Wikipedia
