High-Quality Financial News Dataset For NLP Tasks
Financial Dataset for SFT Task
@kaggle.sayelabualigah_high_quality_financial_news_for_nlp_tasks
This repository contains a dataset scraped from various financial websites. The extraction process is designed to produce clean, accurate text and covers both the web pages themselves and the PDFs embedded in them.
We applied the Mixtral 8x7B model to generate the following additional fields: paraphrasedsubject, compactedsummary, detailedsummary, and impact.
The prompt used to generate the additional fields was refined through extensive discussions and collaboration with the Mistral AI team, and it proved highly effective. As a result, the generated fields are intended to be directly usable for further analysis and model training.
This dataset can be used for various applications, including but not limited to:
CREATE TABLE dataset (
"date" VARCHAR,
"subject" VARCHAR,
"content" VARCHAR,
"paraphrasedsubject" VARCHAR,
"compactedsummary" VARCHAR,
"detailedsummary" VARCHAR,
"impact" VARCHAR
);
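As an illustration of how this schema could feed an SFT task, the following is a minimal sketch rather than the authors' pipeline. It assumes the table has been loaded into SQLite (the catalog's own engine may differ), pairs each article's content with its compactedsummary, and uses a hypothetical helper name (to_sft_records), prompt wording, and placeholder row that are not part of the dataset.

```python
import json
import sqlite3

# Same columns as the schema above; SQLite treats VARCHAR as text.
SCHEMA = """
CREATE TABLE dataset (
    "date" VARCHAR,
    "subject" VARCHAR,
    "content" VARCHAR,
    "paraphrasedsubject" VARCHAR,
    "compactedsummary" VARCHAR,
    "detailedsummary" VARCHAR,
    "impact" VARCHAR
);
"""


def to_sft_records(conn):
    """Build prompt/completion pairs for a summarization SFT task.

    Hypothetical helper: the prompt template is an assumption, not the
    prompt used by the dataset authors.
    """
    rows = conn.execute(
        'SELECT "subject", "content", "compactedsummary" FROM dataset '
        'WHERE "content" IS NOT NULL AND "compactedsummary" IS NOT NULL'
    )
    records = []
    for subject, content, summary in rows:
        prompt = (
            "Summarize the following financial news article.\n\n"
            f"Title: {subject}\n\n{content}"
        )
        records.append({"prompt": prompt, "completion": summary})
    return records


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Placeholder row invented purely for illustration (not real dataset content).
conn.execute(
    "INSERT INTO dataset VALUES (?, ?, ?, ?, ?, ?, ?)",
    (
        "2024-01-01",
        "Example headline",
        "Example article body.",
        "Example paraphrase",
        "Example short summary.",
        "Example long summary.",
        "Example impact note",
    ),
)

records = to_sft_records(conn)
print(json.dumps(records[0], indent=2))
```

The same pattern extends to the other generated columns, for example by using detailedsummary or impact as the completion instead of compactedsummary.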