All chat datasets generated by GPT-4 from Huggingface in the same format
Dataset Description
All GPT-4 Generated Datasets
Every chat dataset generated by GPT-4 from Huggingface at the same format
About this dataset
How to use the dataset
The dataset includes all chat conversations generated by GPT-4 that are hosted on open Huggingface datasets.
Everything is converted to the same format so the datasets can be easily merged and used for large scale training of LLMs.
Acknowledgements
This dataset is a collection of several single chat datasets.
If you use this dataset in your research, please credit the original authors of the internal datasets.
Data Source
License
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
Related Datasets
-
Synthetic Therapy Conversations
@kaggle
-
Dummy Monster
@owid
-
Eucalyptus Growth And Environmental Data
@euremarkable