Time Series Practice Dataset
Synthetic time series data
@kaggle.samuelcortinhas_time_series_practice_dataset
Synthetic time series data
@kaggle.samuelcortinhas_time_series_practice_dataset
This dataset is intended for anybody who wants to practice and improve their time series skills.
This dataset contains simulated time series data covering 10 years (2010-2019). The features include date, store id, product id and number sold. The train.csv covers the years 2010-2018 and the test.csv covers 2019 only. The are 7 unique stores and 10 unique products. The are no null values. The objective is to predict the number sold feature in the test.csv.
I created this time series data using multiple features including various long term trends, year-long seasonality patterns, weekday/weekend effects and noise. Moreover, the products and the stores are supposed to be weakly correlated.
To compare with other peoples solutions, I recommend using the MAPE (Mean Absolute Percentage Error) metric.
CREATE TABLE test (
"date" TIMESTAMP,
"store" BIGINT,
"product" BIGINT,
"number_sold" BIGINT
);CREATE TABLE train (
"date" TIMESTAMP,
"store" BIGINT,
"product" BIGINT,
"number_sold" BIGINT
);Anyone who has the link will be able to view this.