Baselight

Stock Market Predictions

Contains daily news for stock market predictions

@kaggle.tanishqdublish_stock_market_predictions


About this Dataset

Stock Market Predictions

I prepared this dataset for students in my Deep Learning and NLP course.

I am also very happy to see Kagglers play around with it.

Have fun!

Description:

There are two channels of data provided in this dataset:

News data: I crawled historical news headlines from Reddit WorldNews Channel (/r/worldnews). They are ranked by reddit users' votes, and only the top 25 headlines are considered for a single date.
(Range: 2008-06-08 to 2016-07-01)

Stock data: Dow Jones Industrial Average (DJIA) is used to "prove the concept".
(Range: 2008-08-08 to 2016-07-01)

I provided three data files in .csv format:

RedditNews.csv: two columns
The first column is the "date", and the second column is the "news headlines".
Headlines are ranked from top to bottom by how hot they are.
Since only the top 25 are kept, there are 25 lines for each date.

DJIA_table.csv:
Downloaded directly from Yahoo Finance; see the Yahoo Finance web page for more information.

Combined_News_DJIA.csv:
To make things easier for my students, I provide this combined dataset with 27 columns.
The first column is "Date", the second is "Label", and the following ones are news headlines ranging from "Top1" to "Top25".
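
The relationship between the long RedditNews.csv layout (one headline per line) and the wide "Top1" to "Top25" layout can be sketched as below. This is only an illustration, not necessarily how the author built the combined file. The helper name `pivot_headlines`, the `top_n` parameter, and the sample rows are invented placeholders. The input column names "date" and "news" follow the redditnews table.

```python
import csv
import io
from collections import defaultdict

# Tiny made-up stand-in for RedditNews.csv (placeholder headlines, not real data).
SAMPLE = """date,news
2016-07-01,placeholder headline A
2016-07-01,placeholder headline B
2016-06-30,placeholder headline C
2016-06-30,placeholder headline D
"""

def pivot_headlines(rows, top_n=25):
    """Group rows by date and spread them into Top1..TopN columns.

    Rows are assumed to arrive already ranked from hottest to coolest
    within each date, as the description states.
    """
    by_date = defaultdict(list)
    for row in rows:
        by_date[row["date"]].append(row["news"])

    wide = []
    for date in sorted(by_date):  # ISO dates sort chronologically as strings
        headlines = by_date[date][:top_n]
        record = {"Date": date}
        for rank in range(1, top_n + 1):
            # Pad with None when a date has fewer than top_n headlines.
            record[f"Top{rank}"] = headlines[rank - 1] if rank <= len(headlines) else None
        wide.append(record)
    return wide

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
wide_rows = pivot_headlines(rows, top_n=2)
print(wide_rows[0])
```

The "Label" column is not produced here, because it comes from the stock data rather than from the headlines.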

Tables

Combined News Djia

@kaggle.tanishqdublish_stock_market_predictions.combined_news_djia
  • 3.96 MB
  • 1989 rows
  • 27 columns

CREATE TABLE combined_news_djia (
  "date" TIMESTAMP,
  "label" BIGINT,
  "top1" VARCHAR,
  "top2" VARCHAR,
  "top3" VARCHAR,
  "top4" VARCHAR,
  "top5" VARCHAR,
  "top6" VARCHAR,
  "top7" VARCHAR,
  "top8" VARCHAR,
  "top9" VARCHAR,
  "top10" VARCHAR,
  "top11" VARCHAR,
  "top12" VARCHAR,
  "top13" VARCHAR,
  "top14" VARCHAR,
  "top15" VARCHAR,
  "top16" VARCHAR,
  "top17" VARCHAR,
  "top18" VARCHAR,
  "top19" VARCHAR,
  "top20" VARCHAR,
  "top21" VARCHAR,
  "top22" VARCHAR,
  "top23" VARCHAR,
  "top24" VARCHAR,
  "top25" VARCHAR
);
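
For text classification, one common first step is to merge the 25 headline columns of a row into a single string and pair it with the label. The following is a minimal sketch of that step, assuming each row is available as a dictionary keyed by the column names in the schema above. `row_to_example` and the sample row are hypothetical.

```python
HEADLINE_COLUMNS = [f"top{i}" for i in range(1, 26)]  # top1 .. top25, as in the schema

def row_to_example(row):
    """Return (text, label) for one combined_news_djia row given as a dict."""
    # Skip missing or blank headlines so that "None" never ends up in the text.
    values = (row.get(col) for col in HEADLINE_COLUMNS)
    parts = [value.strip() for value in values if value and value.strip()]
    return " ".join(parts), int(row["label"])

# Made-up row with placeholder headlines; top3..top25 are left out on purpose.
example_row = {
    "date": "2016-07-01",
    "label": "1",
    "top1": "placeholder headline A",
    "top2": "placeholder headline B",
}
text, label = row_to_example(example_row)
print(text, label)
```

Joining with a single space keeps the example simple. A separator token between headlines may work better for some models.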

Redditnews

@kaggle.tanishqdublish_stock_market_predictions.redditnews
  • 5.56 MB
  • 73608 rows
  • 2 columns

CREATE TABLE redditnews (
  "date" TIMESTAMP,
  "news" VARCHAR
);
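
The listed row count, 73608, is not a multiple of 25, so some dates may carry fewer than 25 headlines. A quick check along the following lines can find them. It is a sketch only: `dates_with_unexpected_counts` is a hypothetical helper and the sample rows are invented.

```python
from collections import Counter

def dates_with_unexpected_counts(rows, expected=25):
    """Map each date whose headline count differs from `expected` to its count."""
    counts = Counter(row["date"] for row in rows)
    return {date: n for date, n in counts.items() if n != expected}

# Invented rows: one date with two headlines, one date with a single headline.
sample_rows = [
    {"date": "2016-07-01", "news": "placeholder headline A"},
    {"date": "2016-07-01", "news": "placeholder headline B"},
    {"date": "2016-06-30", "news": "placeholder headline C"},
]
odd_dates = dates_with_unexpected_counts(sample_rows, expected=2)
print(odd_dates)
```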

Upload Djia Table

@kaggle.tanishqdublish_stock_market_predictions.upload_djia_table
  • 99.11 KB
  • 1989 rows
  • 7 columns

CREATE TABLE upload_djia_table (
  "date" TIMESTAMP,
  "open" DOUBLE,
  "high" DOUBLE,
  "low" DOUBLE,
  "close" DOUBLE,
  "volume" BIGINT,
  "adj_close" DOUBLE
);
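
This page does not say how the "Label" column of the combined table is defined. One plausible reading, stated here purely as an assumption, is that a day is labelled 1 when the adjusted close rose or stayed the same compared with the previous trading day, and 0 when it fell. A minimal sketch under that assumption, with invented prices and a hypothetical helper `up_down_labels`:

```python
def up_down_labels(rows):
    """Label each day 1 if adj_close did not fall versus the previous row, else 0.

    `rows` is a list of dicts with "date" and "adj_close" keys. The earliest
    day has no predecessor, so it receives no label.
    ASSUMPTION: this labelling rule is not confirmed by the dataset page.
    """
    ordered = sorted(rows, key=lambda r: r["date"])  # oldest first
    labels = {}
    for prev, cur in zip(ordered, ordered[1:]):
        labels[cur["date"]] = 1 if cur["adj_close"] >= prev["adj_close"] else 0
    return labels

# Invented prices for illustration only, deliberately given newest first.
sample = [
    {"date": "2016-07-01", "adj_close": 101.0},
    {"date": "2016-06-30", "adj_close": 100.0},
    {"date": "2016-06-29", "adj_close": 100.5},
]
labels = up_down_labels(sample)
print(labels)
```

Comparing the derived values with the "label" column of combined_news_djia on matching dates would show whether the assumed rule holds.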
