All GPT-4 Conversations
All chat datasets generated by GPT-4 from Huggingface in the same format
@kaggle.thedevastator_all_gpt_4_synthetic_chat_datasets
All chat datasets generated by GPT-4 from Huggingface in the same format
@kaggle.thedevastator_all_gpt_4_synthetic_chat_datasets
The dataset includes all chat conversations generated by GPT-4 that are hosted on open Huggingface datasets.
Everything is converted to the same format so the datasets can be easily merged and used for large scale training of LLMs.
This dataset is a collection of several single chat datasets.
If you use this dataset in your research, please credit the original authors of the internal datasets.
Data Source
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
CREATE TABLE share_gpt_vicuna_unfiltered (
"message" VARCHAR,
"message_type" VARCHAR,
"message_id" BIGINT,
"conversation_id" BIGINT
);CREATE TABLE wizard_vicuna_dataset_unfiltered (
"message" VARCHAR,
"message_type" VARCHAR,
"message_id" BIGINT,
"conversation_id" BIGINT
);Anyone who has the link will be able to view this.