Baselight

LLM - Detect AI Generated Text Dataset

LLM - Detect AI Generated Text Training Essay Dataset

@kaggle.sunilthite_llm_detect_ai_generated_text_dataset

Loading...
Loading...

About this Dataset

LLM - Detect AI Generated Text Dataset

In this Dataset contains both AI Generated Essay and Human Written Essay for Training Purpose
This dataset challenge is to to develop a machine learning model that can accurately detect whether an essay was written by a student or an LLM. The competition dataset comprises a mix of student-written essays and essays generated by a variety of LLMs.

Dataset contains more than 28,000 essay written by student and AI generated.

Features :

  1. text : Which contains essay text
  2. generated : This is target label . 0 - Human Written Essay , 1 - AI Generated Essay

Tables

Training Essay Data

@kaggle.sunilthite_llm_detect_ai_generated_text_dataset.training_essay_data
  • 31.12 MB
  • 29145 rows
  • 2 columns
Loading...

CREATE TABLE training_essay_data (
  "text" VARCHAR,
  "generated" BIGINT
);