PublicPages: FB OpenGraph API data collection

A simple API downloader for FB's public pages data

This script calls the FB API (v3.3) on a public page.
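As a rough illustration of what such a call looks like, here is a minimal sketch that builds a request URL for the feed of a public page. The helper name `build_feed_url`, the `limit` default and the selected `fields` are assumptions made for this example and are not taken from `public_pages.py`.

```python
from urllib.parse import urlencode

GRAPH_BASE = "https://graph.facebook.com"
API_VERSION = "v3.3"  # the version named in this README


def build_feed_url(page_id, access_token, limit=100):
    """Return the URL for one page of posts from a public page's feed.

    Hypothetical helper: the field selection below is an assumption,
    adjust it to whatever your approved app is allowed to read.
    """
    query = urlencode({
        "access_token": access_token,
        "limit": limit,
        "fields": "message,created_time,shares",
    })
    return f"{GRAPH_BASE}/{API_VERSION}/{page_id}/feed?{query}"


if __name__ == "__main__":
    # "YOUR_TOKEN" is a placeholder, not a working credential.
    print(build_feed_url("repubblica", "YOUR_TOKEN"))
```

The URL is only built here, not requested, so the sketch runs without network access or a valid token.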

You can define a page_id (or a list of IDs) to target. You can find the page_id in the URL of the public page you are interested in. For example, for the Italian newspaper Repubblica, https://www.facebook.com/repubblica gives page_id = 'repubblica'.

The resulting dataset contains these fields:

title	description	message	status_type	url	date	post_id	comments	shares	likes	love	wow	haha	sad	angry
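To make the page_id convention and the column layout above concrete, the following sketch extracts a page_id from a page URL and writes rows to a tab-separated file with these columns. The function names `page_id_from_url` and `write_tsv` are hypothetical and only illustrate the idea; the actual script may organise this differently.

```python
import csv
import io
from urllib.parse import urlparse

# Column order as listed in this README.
FIELDS = [
    "title", "description", "message", "status_type", "url", "date",
    "post_id", "comments", "shares", "likes", "love", "wow", "haha",
    "sad", "angry",
]


def page_id_from_url(page_url):
    """Return the first path segment of a public page URL."""
    path = urlparse(page_url).path
    return path.strip("/").split("/")[0]


def write_tsv(rows, fileobj):
    """Write dict rows as TSV; missing columns are left empty."""
    writer = csv.DictWriter(
        fileobj,
        fieldnames=FIELDS,
        delimiter="\t",
        restval="",             # fill columns absent from a row
        extrasaction="ignore",  # drop keys that are not in FIELDS
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)


if __name__ == "__main__":
    print(page_id_from_url("https://www.facebook.com/repubblica"))
    buffer = io.StringIO()
    # Made-up row, only to show the layout.
    write_tsv([{"post_id": "1", "message": "hello", "likes": 3}], buffer)
    print(buffer.getvalue())
```

Passing an open file instead of the `io.StringIO` buffer writes the same layout to disk.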

You can find an example dataset in 'repubblica_example.tsv'.

The idea is based on minimaxir's facebook scraper, which was built before FB closed access to public pages.

Now you need to submit your app to the App Review process in order to obtain Page Public Content Access.

After approval, you will be able to retrieve data from the endpoint /{page-id}/feed. The app will still be subject to rate limiting. I have not implemented any throttling in the script, and suggestions are more than welcome. The same goes for exception handling while parsing the JSON response.
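As one possible starting point for the two gaps mentioned above, here is a minimal sketch of a request helper that retries with exponential backoff, plus a tolerant lookup for nested JSON fields. Neither is part of the script: `fetch_json`, `safe_get`, the retry count and the delays are all assumptions, and the retry policy is deliberately crude because it retries on any network or decoding error.

```python
import json
import time
import urllib.error
import urllib.request


def fetch_json(url, max_retries=5, base_delay=2.0,
               opener=urllib.request.urlopen, sleep=time.sleep):
    """GET a URL and decode the JSON body, backing off between retries.

    Hypothetical helper: it waits base_delay, then 2x, 4x, ... seconds
    after each failure and re-raises the last error when retries run out.
    """
    for attempt in range(max_retries):
        try:
            with opener(url) as response:
                return json.loads(response.read().decode("utf-8"))
        except (urllib.error.URLError, json.JSONDecodeError):
            if attempt == max_retries - 1:
                raise
            sleep(base_delay * 2 ** attempt)


def safe_get(data, *keys, default=""):
    """Walk nested dicts and return default when any key is missing."""
    current = data
    for key in keys:
        if not isinstance(current, dict) or key not in current:
            return default
        current = current[key]
    return current


if __name__ == "__main__":
    # Made-up post, shaped like a feed entry with a share count.
    post = {"id": "1_2", "shares": {"count": 4}}
    print(safe_get(post, "shares", "count", default=0))
    print(repr(safe_get(post, "message")))
```

The `opener` and `sleep` parameters exist only so the helper can be exercised without network access or real waiting. A more careful version would inspect the error returned by the API and retry only on rate-limit responses.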

For the time being, the script is far from perfect, but it could help you quickly retrieve similar datasets once the App Review process is complete.

ebergam/PublicPages_FB_api
