Baselight

Bitcoin BTC, 7 Exchanges, 1h Full Historical Data

The Most Complete, Continuous and Clean BTCUSD 1h Dataset for ML Engineers

@kaggle.imranbukhari_comprehensive_btcusd_1h_data

About this Dataset

Bitcoin BTC, 7 Exchanges, 1h Full Historical Data

I am a new developer and I would greatly appreciate your support. If you find this dataset helpful, please consider giving it an upvote!

Key Features:

Complete 1h Data: Raw 1h historical data from multiple exchanges, covering the entire trading history of BTCUSD available through their API endpoints. This dataset is updated daily to ensure up-to-date coverage.

Combined Index Dataset: A unique feature of this dataset is the combined index, which is derived by averaging all other datasets into one, please see attached notebook. This creates the longest continuous, unbroken BTCUSD dataset available on Kaggle, with no gaps and no erroneous values. It gives a much more comprehensive view of the market i.e. total volume across multiple exchanges.

Superior Performance: The combined index dataset has demonstrated superior 'mean average error' (MAE) metric performance when training machine learning models, compared to single-source datasets by a whole order of MAE magnitude.

Unbroken History: The combined dataset's continuous history is a valuable asset for researchers and traders who require accurate and uninterrupted time series data for modeling or back-testing.

This plot illustrates the continuity of the dataset over time, with no gaps in data, making it ideal for time series analysis.

Included Resources:

Two Notebooks:

Dataset Usage and Diagnostics: This notebook demonstrates how to use the dataset and includes a powerful data diagnostics function, which is useful for all time series analyses.

Aggregating Multiple Data Sources: This notebook walks you through the process of combining multiple exchange datasets into a single, clean dataset. (Currently unavailable, will be added shortly)

Share link

Anyone who has the link will be able to view this.