GitHub - lehai2909/learn-web-scraping: Build app with Streamlit to scrape data from Wiki

Welcome to my Github.io

This is a repository for learning to scrape web data from Wikipedia.

The reason I created this project: as a first step, I want to scrape data and handle it myself. So I chose Wikipedia, with support from the BeautifulSoup library (Python). I also use the Streamlit library to build a tiny app.

I know it seems complex to do web scraping manually when Wiki has a bunch of convenient APIs for this, but I want to start simply and stupidly 😁.
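To give a feel for the manual approach described above, here is a minimal sketch that parses Wikipedia HTML with BeautifulSoup. It is not the code from my_app.py: the function names (fetch_html, extract_headings, extract_first_paragraph), the User-Agent string, and the choice of the English Wikipedia URL are all assumptions made for illustration.

```python
from urllib.parse import quote
from urllib.request import Request, urlopen

from bs4 import BeautifulSoup


def fetch_html(title: str) -> str:
    """Download the raw HTML of an English Wikipedia article (needs network access).

    Hypothetical helper: the URL pattern and User-Agent are assumptions.
    """
    url = "https://en.wikipedia.org/wiki/" + quote(title.replace(" ", "_"))
    # Send a descriptive User-Agent; this value is only a placeholder.
    request = Request(url, headers={"User-Agent": "learn-web-scraping-example/0.1"})
    with urlopen(request, timeout=10) as response:
        return response.read().decode("utf-8")


def extract_headings(html: str) -> list[str]:
    """Return the text of every <h2> heading, in document order."""
    soup = BeautifulSoup(html, "html.parser")
    return [h2.get_text(strip=True) for h2 in soup.find_all("h2")]


def extract_first_paragraph(html: str) -> str:
    """Return the first non-empty <p> text, or an empty string if there is none."""
    soup = BeautifulSoup(html, "html.parser")
    for p in soup.find_all("p"):
        text = p.get_text().strip()
        if text:
            return text
    return ""
```

The parsing functions take an HTML string rather than a URL, so they can be tried on a small hand-written snippet before pointing fetch_html at a real page, for example extract_headings(fetch_html("Web scraping")).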

You can try it by cloning the repo and installing the necessary dependencies with pip (I provided a requirements.txt, you're very welcome). Or you can use the virtual environment that I included (more about this here).

Run the app with this command:

streamlit run my_app.py

Then head to http://localhost:8501 and check out the app.

Have a nice day!

Folders and files (44 commits):

Lib/site-packages
Scripts
.gitignore
README.md
_config.yml
my_app.py
pyvenv.cfg
requirements.txt
