Baselight

Esophageal Cancer Dataset

Comprehensive Esophageal Cancer Dataset for AI-Driven Early Detection & Research

@kaggle.abhinaba1biswas_esophageal_cancer_dataset

Esophageal Dataset
@kaggle.abhinaba1biswas_esophageal_cancer_dataset.esophageal_dataset

  • 286.77 KB
  • 3985 rows
  • 85 columns
unnamed_0
patient_barcode
tissue_source_site
patient_id
bcr_patient_uuid
informed_consent_verified
icd_o_3_site
icd_o_3_histology
icd_10
tissue_prospective_collection_indicator
tissue_retrospective_collection_indicator
days_to_birth
country_of_birth
gender
height
weight
country_of_procurement
state_province_of_procurement
city_of_procurement
race_list
ethnicity
other_dx
history_of_neoadjuvant_treatment
person_neoplasm_cancer_status
vital_status
days_to_last_followup
days_to_death
tobacco_smoking_history
age_began_smoking_in_years
stopped_smoking_year
number_pack_years_smoked
alcohol_history_documented
frequency_of_alcohol_consumption
amount_of_alcohol_consumption_per_day
reflux_history
antireflux_treatment_types
h_pylori_infection
initial_diagnosis_by
barretts_esophagus
goblet_cells_present
history_of_esophageal_cancer
number_of_relatives_diagnosed
has_new_tumor_events_information
day_of_form_completion
month_of_form_completion
year_of_form_completion
has_follow_ups_information
has_drugs_information
has_radiations_information
project
stage_event_system_version
stage_event_clinical_stage
stage_event_pathologic_stage
stage_event_tnm_categories
stage_event_psa
stage_event_gleason_grading
stage_event_ann_arbor
stage_event_serum_markers
stage_event_igcccg_stage
stage_event_masaoka_stage
primary_pathology_tumor_tissue_site
primary_pathology_esophageal_tumor_cental_location
primary_pathology_esophageal_tumor_involvement_sites
primary_pathology_histological_type
primary_pathology_columnar_metaplasia_present
primary_pathology_columnar_mucosa_goblet_cell_present
primary_pathology_columnar_mucosa_dysplasia
primary_pathology_neoplasm_histologic_grade
primary_pathology_days_to_initial_pathologic_diagnosis
primary_pathology_age_at_initial_pathologic_diagnosis
primary_pathology_year_of_initial_pathologic_diagnosis
primary_pathology_initial_pathologic_diagnosis_method
primary_pathology_init_pathology_dx_method_other
primary_pathology_lymph_node_metastasis_radiographic_evidence
primary_pathology_primary_lymph_node_presentation_assessment
primary_pathology_lymph_node_examined_count
primary_pathology_number_of_lymphnodes_positive_by_he
primary_pathology_number_of_lymphnodes_positive_by_ihc
primary_pathology_planned_surgery_status
primary_pathology_treatment_prior_to_surgery
primary_pathology_residual_tumor
primary_pathology_karnofsky_performance_score
primary_pathology_eastern_cancer_oncology_group
primary_pathology_radiation_therapy
primary_pathology_postoperative_rx_tx

TCGA-2H-A9GF2HA9GF0500F1A6-A528-43F3-B035-12D3B7C99C0FYESC15.58140/3C15.5NOYES-24487nanMALE18395NetherlandsZHRotterdamnannanNoNoWITH TUMORDead784NOnannannanSymptomaticNonannanYES2522014NONONOTCGA-ESCA5thnanStage IIIT3N1M0EsophagusDistalDistalEsophagus Adenocarcinoma, NOSNONONegative/no dysplasiaG3672001Other method, specify:surgical resectionYESYES87nannanR1NONO
1TCGA-2H-A9GG2HA9GG70084008-697D-442D-8F74-C12F8F598570YESC15.58140/3C15.5NOYES-24328nanMALE17874NetherlandsZHRotterdamnannanNoNoWITH TUMORDead610NOnannannanSymptomaticYes-UKNOnanYES2522014NONONOTCGA-ESCA5thnanStage IIIT3N1M0EsophagusDistalDistalEsophagus Adenocarcinoma, NOSYESNOLow grade dysplasiaG2661999Other method, specify:surgical resectionNOYES194nannanR1NONO
2TCGA-2H-A9GH2HA9GH606DC5B8-7625-42A6-A936-504EF25623A4YESC15.58140/3C15.5NOYES-16197nanMALE18391NetherlandsZHRotterdamnannanNoNoWITH TUMORDead951NONOnannanSymptomaticYes-UKnannanYES2522014NONONOTCGA-ESCA5thnanStage IIBT1N1M0EsophagusDistalDistalEsophagus Adenocarcinoma, NOSYESnannanG2441998Other method, specify:surgical resectionNOYES301nannanR0NONO
3TCGA-2H-A9GI2HA9GICEAF98F8-517E-457A-BF29-ACFE22893D49YESC15.58140/3C15.5NOYES-25097nanMALE188100NetherlandsZHRotterdamnannanNoNoWITH TUMORDead435NOnannannanSymptomaticYes-UKnannanYES2522014NONONOTCGA-ESCA5thnanStage IIIT3N1M0EsophagusDistalDistalEsophagus Adenocarcinoma, NOSYESnannanG2681999Other method, specify:surgical resectionNOYES84nannanR0NONO
4TCGA-2H-A9GJ2HA9GJEE47CD59-C8D8-4B1E-96DB-91C679E4106FYESC15.58140/3C15.5NOYES-21180nanMALE18970NetherlandsZHRotterdamnannanNoNoWITH TUMORDead1781NOYESnannanSymptomaticYes-UKnannanYES2522014NONONOTCGA-ESCA5thnanStage IT1N0M0EsophagusDistalDistalEsophagus Adenocarcinoma, NOSYESnanHigh grade dysplasiaG2572000Other method, specify:surgical resectionNOYES19nannanR0NONO
5TCGA-2H-A9GK2HA9GK61DF8A4B-95F8-40AB-A252-D00C4300C290YESC15.58140/3C15.5NOYES-16067nanMALE18080NetherlandsZHRotterdamnannanNoNoWITH TUMORDead232NOYESnannanSymptomaticYes-UKnannanYES2522014NONONOTCGA-ESCA5thnanStage IIIT3N1M0EsophagusDistalDistalEsophagus Adenocarcinoma, NOSYESnanHigh grade dysplasiaG3432000Other method, specify:surgical resectionYESYES53nannanR0NONO
6TCGA-2H-A9GL2HA9GLAB2755D2-5BD9-4E2F-B255-E813AFC8D268YESC15.58140/3C15.5NOYES-27115nanMALE17385NetherlandsZHRotterdamnannanNoNoWITH TUMORDead180NONOnannanSymptomaticYes-UKnannanYES2522014NONONOTCGA-ESCA5thnanStage IIIT3N1M0EsophagusDistalDistalEsophagus Adenocarcinoma, NOSYESnannanG3742000Other method, specify:surgical resectionNOYES72nannanR1NONO
7TCGA-2H-A9GM2HA9GM6584FA8F-B4F4-4844-A71E-3BFB731AD445YESC15.58140/3C15.5NOYES-19484nanMALE17977NetherlandsZHRotterdamnannanNoNoWITH TUMORDead424NOnannannanSymptomaticYes-UKnannanYES2522014NONONOTCGA-ESCA5thnanStage IIBT1N1M0EsophagusDistalDistalEsophagus Adenocarcinoma, NOSYESnannanG2532000Other method, specify:surgical resectionNOYES131nannanR0NONO
8TCGA-2H-A9GN2HA9GN078DC5F1-D2DC-4408-B785-27703C7813F1YESC15.58140/3C15.5NOYES-25664nanMALE18582NetherlandsZHRotterdamnannanNoNoWITH TUMORDead272NOnannannanSymptomaticNonannanYES2522014NONONOTCGA-ESCA5thnanStage IIIT3N1M0EsophagusDistalDistalEsophagus Adenocarcinoma, NOSNONONegative/no dysplasiaG3702000Other method, specify:surgical resectionNOYES225nannanR0NONO
9TCGA-2H-A9GO2HA9GOCA400EA1-3E4E-437E-A54C-446431B741DAYESC15.58140/3C15.5NOYES-21486nanMALE17083NetherlandsZHRotterdamnannanNoNoWITH TUMORDead494NOnannannanSymptomaticYes-UKnannanYES2522014NONONOTCGA-ESCA5thnanStage IVAT3N1M1aEsophagusDistalDistalEsophagus Adenocarcinoma, NOSYESnannanG3582000Other method, specify:surgical resectionNOYES139nannanR1NONO
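
Several fields in the preview rows above pack more than one value into a single string: patient_barcode (for example TCGA-2H-A9GF) appears to combine a project prefix, the tissue_source_site, and the patient_id, and stage_event_tnm_categories (for example T3N1M1a) combines the T, N, and M categories. The following is a minimal Python sketch of how such values could be split. The function names and the regular expression are illustrative assumptions, not part of the dataset or of any Baselight API, and the pattern only covers the simple forms seen in the preview.

```python
import re

def split_barcode(barcode):
    """Split a barcode such as 'TCGA-2H-A9GF' into its three hyphen-separated parts.

    Assumption: the parts are (project prefix, tissue_source_site, patient_id),
    as the preview rows suggest.
    """
    prefix, site, patient = barcode.split("-")
    return prefix, site, patient

# Assumed shape: T, N and M, each followed by a digit or X and an optional
# lowercase suffix (e.g. 'M1a'). Other notations are not handled.
TNM_PATTERN = re.compile(r"^(T[0-9X][a-z]?)(N[0-9X][a-z]?)(M[0-9X][a-z]?)$")

def split_tnm(value):
    """Return (T, N, M) for strings like 'T3N1M0', or None if the value does not match."""
    match = TNM_PATTERN.match(value)
    return match.groups() if match else None

if __name__ == "__main__":
    print(split_barcode("TCGA-2H-A9GF"))
    print(split_tnm("T3N1M1a"))
    print(split_tnm("nan"))
```

Returning None for unmatched strings keeps missing values such as nan from raising an error; callers can decide how to treat them.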

CREATE TABLE esophageal_dataset (
  "unnamed_0" BIGINT,
  "patient_barcode" VARCHAR,
  "tissue_source_site" VARCHAR,
  "patient_id" VARCHAR,
  "bcr_patient_uuid" VARCHAR,
  "informed_consent_verified" VARCHAR,
  "icd_o_3_site" VARCHAR,
  "icd_o_3_histology" VARCHAR,
  "icd_10" VARCHAR,
  "tissue_prospective_collection_indicator" VARCHAR,
  "tissue_retrospective_collection_indicator" VARCHAR,
  "days_to_birth" BIGINT,
  "country_of_birth" VARCHAR,
  "gender" VARCHAR,
  "height" DOUBLE,
  "weight" DOUBLE,
  "country_of_procurement" VARCHAR,
  "state_province_of_procurement" VARCHAR,
  "city_of_procurement" VARCHAR,
  "race_list" VARCHAR,
  "ethnicity" VARCHAR,
  "other_dx" VARCHAR,
  "history_of_neoadjuvant_treatment" VARCHAR,
  "person_neoplasm_cancer_status" VARCHAR,
  "vital_status" VARCHAR,
  "days_to_last_followup" DOUBLE,
  "days_to_death" DOUBLE,
  "tobacco_smoking_history" DOUBLE,
  "age_began_smoking_in_years" DOUBLE,
  "stopped_smoking_year" DOUBLE,
  "number_pack_years_smoked" DOUBLE,
  "alcohol_history_documented" VARCHAR,
  "frequency_of_alcohol_consumption" DOUBLE,
  "amount_of_alcohol_consumption_per_day" DOUBLE,
  "reflux_history" VARCHAR,
  "antireflux_treatment_types" VARCHAR,
  "h_pylori_infection" VARCHAR,
  "initial_diagnosis_by" VARCHAR,
  "barretts_esophagus" VARCHAR,
  "goblet_cells_present" VARCHAR,
  "history_of_esophageal_cancer" VARCHAR,
  "number_of_relatives_diagnosed" DOUBLE,
  "has_new_tumor_events_information" VARCHAR,
  "day_of_form_completion" BIGINT,
  "month_of_form_completion" BIGINT,
  "year_of_form_completion" BIGINT,
  "has_follow_ups_information" VARCHAR,
  "has_drugs_information" VARCHAR,
  "has_radiations_information" VARCHAR,
  "project" VARCHAR,
  "stage_event_system_version" VARCHAR,
  "stage_event_clinical_stage" VARCHAR,
  "stage_event_pathologic_stage" VARCHAR,
  "stage_event_tnm_categories" VARCHAR,
  "stage_event_psa" VARCHAR,
  "stage_event_gleason_grading" VARCHAR,
  "stage_event_ann_arbor" VARCHAR,
  "stage_event_serum_markers" VARCHAR,
  "stage_event_igcccg_stage" VARCHAR,
  "stage_event_masaoka_stage" VARCHAR,
  "primary_pathology_tumor_tissue_site" VARCHAR,
  "primary_pathology_esophageal_tumor_cental_location" VARCHAR,
  "primary_pathology_esophageal_tumor_involvement_sites" VARCHAR,
  "primary_pathology_histological_type" VARCHAR,
  "primary_pathology_columnar_metaplasia_present" VARCHAR,
  "primary_pathology_columnar_mucosa_goblet_cell_present" VARCHAR,
  "primary_pathology_columnar_mucosa_dysplasia" VARCHAR,
  "primary_pathology_neoplasm_histologic_grade" VARCHAR,
  "primary_pathology_days_to_initial_pathologic_diagnosis" BIGINT,
  "primary_pathology_age_at_initial_pathologic_diagnosis" BIGINT,
  "primary_pathology_year_of_initial_pathologic_diagnosis" DOUBLE,
  "primary_pathology_initial_pathologic_diagnosis_method" VARCHAR,
  "primary_pathology_init_pathology_dx_method_other" VARCHAR,
  "primary_pathology_lymph_node_metastasis_radiographic_evidence" VARCHAR,
  "primary_pathology_primary_lymph_node_presentation_assessment" VARCHAR,
  "primary_pathology_lymph_node_examined_count" DOUBLE,
  "primary_pathology_number_of_lymphnodes_positive_by_he" DOUBLE,
  "primary_pathology_number_of_lymphnodes_positive_by_ihc" DOUBLE,
  "primary_pathology_planned_surgery_status" VARCHAR,
  "primary_pathology_treatment_prior_to_surgery" VARCHAR,
  "primary_pathology_residual_tumor" VARCHAR,
  "primary_pathology_karnofsky_performance_score" DOUBLE,
  "primary_pathology_eastern_cancer_oncology_group" DOUBLE,
  "primary_pathology_radiation_therapy" VARCHAR,
  "primary_pathology_postoperative_rx_tx" VARCHAR
);
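
The days_to_birth column is stored as a negative day offset (for example -24487 in the first preview row). A rough way to turn it into an age in whole years is sketched below in Python. The helper name and the 365.25 days-per-year constant are assumptions made for illustration. The result seems to agree with primary_pathology_age_at_initial_pathologic_diagnosis in the preview rows, but because the preview cells are concatenated, that reading is itself an inference.

```python
import math

DAYS_PER_YEAR = 365.25  # assumption: average year length including leap years

def age_in_years(days_to_birth):
    """Convert a negative days_to_birth offset into completed years of age."""
    return math.floor(-days_to_birth / DAYS_PER_YEAR)

if __name__ == "__main__":
    # days_to_birth values taken from the first three preview rows
    for days in (-24487, -24328, -16197):
        print(days, age_in_years(days))
```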
