Fetching Bitcoin Core Blockchain data

Jörn Franke edited this page Nov 12, 2017 · 2 revisions

You need a tool to download and verify the blockchain data. This project uses Bitcoin Core (use the most recent version!). Once it is installed, start it and let it download the whole blockchain, which can take several hours. You need at least 100-150 GB of free space, and more as the blockchain grows. Once the download has finished, you will find all the Bitcoin block data in your user directory under .bitcoin/blocks/blk*.dat.

Note: do NOT copy the rev*.dat files from this folder. They are not relevant for analysis purposes.

You can put it on your HDFS cluster by executing the following commands:

hadoop fs -mkdir -p /user/bitcoin/input

hadoop fs -put ~/.bitcoin/blocks/blk*.dat /user/bitcoin/input

After it has been copied you are ready to analyze it.
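
Before starting the analysis, it can be worth checking that every block file actually arrived on HDFS. The following is a minimal sketch, not part of the original project: the function names count_local_blocks and count_hdfs_blocks are hypothetical, and the second one assumes the hadoop client is on your PATH and that file names appear at the end of each line of the hadoop fs -ls output.

```shell
#!/bin/sh
# Hypothetical helper: count the blk*.dat files in a local blocks directory.
# rev*.dat files are ignored because they do not match the pattern.
count_local_blocks() {
    dir="$1"
    count=0
    for f in "$dir"/blk*.dat; do
        # An unmatched glob stays literal, so only count real files.
        if [ -f "$f" ]; then
            count=$((count + 1))
        fi
    done
    echo "$count"
}

# Hypothetical helper: count the blk*.dat entries in an HDFS directory.
# Assumes the hadoop client is installed and the directory exists.
count_hdfs_blocks() {
    hadoop fs -ls "$1" | grep -c 'blk[0-9]*\.dat$'
}
```

With the paths used above, count_local_blocks ~/.bitcoin/blocks and count_hdfs_blocks /user/bitcoin/input should print the same number once the upload has finished. If Bitcoin Core is still running, the local count may grow in the meantime.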
