Baselight

LLM: Mistral-7B Instruct Texts

(4900 LLM texts for the "Detect AI Generated Text" competition)

@kaggle.carlmcbrideellis_llm_mistral_7b_instruct_texts

About this Dataset

This dataset (specifically the file Mistral7B_CME_v7.csv) consists of 4900 LLM-generated texts.
(Note: versions 1 to 6 are redundant and are kept only so as not to break notebooks that use them.)

Update: The new file Mistral7B_CME_v7_15_percent_corruption.csv has also been added as per the discussion "Alternative approach - Simulating hidden dataset".

v1: 700 LLM texts for prompt 6 "Exploring Venus" for use in the LLM - Detect AI Generated Text competition.

v2: + 700 LLM texts for prompt 8 "The Face on Mars"

v3: + 700 LLM texts for prompt 4 "A Cowboy Who Rode the Waves"

v4: + 700 LLM texts for prompt 11 "Driverless cars"

v5: + 700 LLM texts for prompt 7 "Facial action coding system"

v6: + 700 LLM texts for prompt 2 "Car-free cities"

v7: + 700 LLM texts for prompt 12 "Does the electoral college work?"
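
Because each version adds 700 texts to the previous one, the expected row count of every file follows directly from the list above. The sketch below works through that arithmetic. The names `PROMPTS_BY_VERSION`, `TEXTS_PER_PROMPT`, and `cumulative_rows` are illustrative, not part of the dataset, and treating the prompt number as the value stored in the `prompt_id` column is an assumption.

```python
# Prompt added in each version, in release order (v1..v7), as listed above.
# Assumption: the prompt number matches the value in the prompt_id column.
PROMPTS_BY_VERSION = [
    (6, "Exploring Venus"),
    (8, "The Face on Mars"),
    (4, "A Cowboy Who Rode the Waves"),
    (11, "Driverless cars"),
    (7, "Facial action coding system"),
    (2, "Car-free cities"),
    (12, "Does the electoral college work?"),
]
TEXTS_PER_PROMPT = 700


def cumulative_rows(prompts=PROMPTS_BY_VERSION, per_prompt=TEXTS_PER_PROMPT):
    """Return {version label: expected row count} for v1..vN."""
    return {f"v{i}": i * per_prompt for i in range(1, len(prompts) + 1)}


if __name__ == "__main__":
    for version, rows in cumulative_rows().items():
        print(version, rows)
```

The v7 total of 7 × 700 = 4900 agrees with the figure quoted for Mistral7B_CME_v7.csv.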

Photo credit: Image of Venus by NASA.

Tables

Mistral7b Cme V1

@kaggle.carlmcbrideellis_llm_mistral_7b_instruct_texts.mistral7b_cme_v1
  • 738.77 KB
  • 700 rows
  • 4 columns

CREATE TABLE mistral7b_cme_v1 (
  "prompt_id" BIGINT,
  "text" VARCHAR,
  "prompt_name" VARCHAR,
  "generated" BIGINT
);

Mistral7b Cme V2

@kaggle.carlmcbrideellis_llm_mistral_7b_instruct_texts.mistral7b_cme_v2
  • 1.4 MB
  • 1400 rows
  • 4 columns

CREATE TABLE mistral7b_cme_v2 (
  "prompt_id" BIGINT,
  "text" VARCHAR,
  "prompt_name" VARCHAR,
  "generated" BIGINT
);

Mistral7b Cme V3

@kaggle.carlmcbrideellis_llm_mistral_7b_instruct_texts.mistral7b_cme_v3
  • 2.16 MB
  • 2100 rows
  • 4 columns

CREATE TABLE mistral7b_cme_v3 (
  "prompt_id" BIGINT,
  "text" VARCHAR,
  "prompt_name" VARCHAR,
  "generated" BIGINT
);

Mistral7b Cme V4

@kaggle.carlmcbrideellis_llm_mistral_7b_instruct_texts.mistral7b_cme_v4
  • 2.84 MB
  • 2800 rows
  • 4 columns

CREATE TABLE mistral7b_cme_v4 (
  "prompt_id" BIGINT,
  "text" VARCHAR,
  "prompt_name" VARCHAR,
  "generated" BIGINT
);

Mistral7b Cme V5

@kaggle.carlmcbrideellis_llm_mistral_7b_instruct_texts.mistral7b_cme_v5
  • 3.57 MB
  • 3500 rows
  • 4 columns

CREATE TABLE mistral7b_cme_v5 (
  "prompt_id" BIGINT,
  "text" VARCHAR,
  "prompt_name" VARCHAR,
  "generated" BIGINT
);

Mistral7b Cme V6

@kaggle.carlmcbrideellis_llm_mistral_7b_instruct_texts.mistral7b_cme_v6
  • 4.3 MB
  • 4200 rows
  • 4 columns

CREATE TABLE mistral7b_cme_v6 (
  "prompt_id" BIGINT,
  "text" VARCHAR,
  "prompt_name" VARCHAR,
  "generated" BIGINT
);

Mistral7b Cme V7

@kaggle.carlmcbrideellis_llm_mistral_7b_instruct_texts.mistral7b_cme_v7
  • 4.98 MB
  • 4900 rows
  • 4 columns

CREATE TABLE mistral7b_cme_v7 (
  "prompt_id" BIGINT,
  "text" VARCHAR,
  "prompt_name" VARCHAR,
  "generated" BIGINT
);
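
As a rough illustration of how the four columns in this schema might be read from the CSV, here is a minimal sketch using only the Python standard library. The sample rows are placeholders, not real dataset content. The CSV header order and the value `1` in `generated` are assumptions, since the page gives only column names and types. `load_rows` and `rows_per_prompt` are hypothetical helper names.

```python
import csv
import io
from collections import Counter

# Placeholder rows standing in for Mistral7B_CME_v7.csv.
# Assumptions: header names match the schema, and generated=1 marks LLM text.
SAMPLE_CSV = """prompt_id,text,prompt_name,generated
6,"Placeholder essay text, with a comma.",Exploring Venus,1
12,Placeholder essay text.,Does the electoral college work?,1
6,Another placeholder essay.,Exploring Venus,1
"""


def load_rows(handle):
    """Parse CSV rows into dicts, casting the two BIGINT columns to int."""
    rows = []
    for record in csv.DictReader(handle):
        rows.append({
            "prompt_id": int(record["prompt_id"]),
            "text": record["text"],
            "prompt_name": record["prompt_name"],
            "generated": int(record["generated"]),
        })
    return rows


def rows_per_prompt(rows):
    """Count how many texts belong to each prompt_name."""
    return Counter(row["prompt_name"] for row in rows)


if __name__ == "__main__":
    rows = load_rows(io.StringIO(SAMPLE_CSV))
    print(rows_per_prompt(rows))
```

To read a downloaded copy instead, pass `open("Mistral7B_CME_v7.csv", newline="", encoding="utf-8")` to `load_rows`. If the version list holds, each of the seven prompts should then count 700 texts.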

Mistral7b Cme V7 15 Percent Corruption

@kaggle.carlmcbrideellis_llm_mistral_7b_instruct_texts.mistral7b_cme_v7_15_percent_corruption
  • 5.02 MB
  • 4900 rows
  • 4 columns

CREATE TABLE mistral7b_cme_v7_15_percent_corruption (
  "prompt_id" BIGINT,
  "text" VARCHAR,
  "prompt_name" VARCHAR,
  "generated" BIGINT
);
