Baselight

Bitcoin Blockchain Historical Data

Complete data collection of the Bitcoin Blockchain for researchers & developers.

@kaggle.jesusgraterol_bitcoin_blockchain_dataset

About this Dataset

Bitcoin Blockchain Historical Data

A blockchain is a distributed ledger with growing lists of records (blocks) that are securely linked together via cryptographic hashes. Each block contains a cryptographic hash of the previous block, a timestamp, and transaction data (generally represented as a Merkle tree, where data nodes are represented by leaves). Since each block contains information about the previous block, they effectively form a chain (compare linked list data structure), with each additional block linking to the ones before it. Consequently, blockchain transactions are irreversible in that, once they are recorded, the data in any given block cannot be altered retroactively without altering all subsequent blocks.

Inspiration

Bitcoin's Blockchain is public. Anybody can run a node locally and get access to all the blocks, since genesis to the current height. However, this process can take some time as the current size exceeds 400 gigabytes and managing the data can be complex as most implementations require the data to be decoded.

The purpose of this dataset is to aid individual researches and developers to be able to analyze the blockchain deeply or even train machine learning models in just a few minutes, instead of several hours/days.

Content

dataset.csv

In this dataset, you will get access to the most relevant information about the Bitcoin Blockchain to kick off your data science project or machine learning model. It is also very easy to subset the data if you're looking to focus on a specific date range.

If you found this dataset useful, please up-vote it to show your appreciation.

Share link

Anyone who has the link will be able to view this.