Skip to content

Latest commit

 

History

History
61 lines (43 loc) · 3.63 KB

README.md

File metadata and controls

61 lines (43 loc) · 3.63 KB

What is this?

I thought it would be cool to see some metadata about this past year's Facebook group conversation with some friends. After playing with the ~160k messages, I came back with a few cool stats in the form of an infographic.

Below is a README on using the actual scrapper to download your own Facebook messages.

Facebook Message Scraper

A simple python script to download the entire conversation from Facebook, not limited like the one in the data dump provided by Facebook

Outputs the conversation in a JSON format, as well as the JSON for each individual chunk.

This is a fork of the following repository: https://github.com/RaghavSood/FBMessageScraper. As of December 21, 2015, this version works again.

Initial Setup

Run for both dumper.py and group_dumper.py

  1. In Chrome, open facebook.com/messages and open any conversation with a fair number of messages
  2. Open the network tab of the Chrome Developer tools
  3. Scroll up in the conversation until the page attempts to load previous messages
  4. Look for the POST request to thread_info.php
  5. You need to copy certain parameters from this request into the python script to complete the setup:
  6. Set the cookie value to the value you see in Chrome under Request Headers
  7. Set the __user value to the value you see in Chrome under Form Data
  8. Set the __a value to the value you see in Chrome under Form Data
  9. Set the __dyn value to the value you see in Chrome under Form Data
  10. Set the __req value to the value you see in Chrome under Form Data
  11. Set the fb_dtsg value to the value you see in Chrome under Form Data
  12. Set the ttstamp value to the value you see in Chrome under Form Data
  13. Set the __rev value to the value you see in Chrome under Form Data

You're now all set to start downloading messages.

Downloading Messages

  1. Get the conversation ID for those messages by opening http://graph.facebook.com/{username-of-chat-partner}. Edit. This method no longer works. Click on yout partner's profile picture, and check the URL. The last sequence of numbers (10 digits) is their Facebook ID.
  2. Copy the id value from there
  3. For group conversations, the ID can be retrieved from the messages tab, as part of the URL. You must use group_dumper.py instead.
  4. Run the command python dumper.py {id} 2000, and put the value you retrieved for ID earlier. Edit: Also specify a configuration file: the command will be: python dumper.py {configuration file} {id} 2000 Fill out the sample configuration file with your own values.
  5. To use text_printer.py, do: python text_printer.py {configuration_file}, {id}. This will print your message on the terminal screen to redirect the output to a .txt file, do : python text_printer.py {configuration_file}, {id} > output.txt.

Messages are saved by default to Messages/{id}/

Disclaimer

This is a fork of the following repository: https://github.com/RaghavSood/FBMessageScraper.

Here's a changelog, as compared to that repository's commit 4e3f268:

  • This version now uses configuration files instead of hardcoded values
  • Fixed a bug detecting the end_of_history mark
  • Fixed a bug downloading the latest messages (these were probably caused by Facebook adjusting the format of their response JSONs)
  • Added a file text_printer.py to print the contents of the JSON dump.