Baselight

Lung Cancer Prediction

A comprehensive dataset for predictive modeling of lung cancer prognosis.

@kaggle.rashadrmammadov_lung_cancer_prediction

Lung Cancer Data
@kaggle.rashadrmammadov_lung_cancer_prediction.lung_cancer_data

  • 3.91 MB
  • 23658 rows
  • 38 columns
patient_id

Patient ID

age

Age

gender

Gender

smoking_history

Smoking History

tumor_size_mm

Tumor Size Mm

tumor_location

Tumor Location

stage

Stage

treatment

Treatment

survival_months

Survival Months

ethnicity

Ethnicity

insurance_type

Insurance Type

family_history

Family History

comorbidity_diabetes

Comorbidity Diabetes

comorbidity_hypertension

Comorbidity Hypertension

comorbidity_heart_disease

Comorbidity Heart Disease

comorbidity_chronic_lung_disease

Comorbidity Chronic Lung Disease

comorbidity_kidney_disease

Comorbidity Kidney Disease

comorbidity_autoimmune_disease

Comorbidity Autoimmune Disease

comorbidity_other

Comorbidity Other

performance_status

Performance Status

blood_pressure_systolic

Blood Pressure Systolic

blood_pressure_diastolic

Blood Pressure Diastolic

blood_pressure_pulse

Blood Pressure Pulse

hemoglobin_level

Hemoglobin Level

white_blood_cell_count

White Blood Cell Count

platelet_count

Platelet Count

albumin_level

Albumin Level

alkaline_phosphatase_level

Alkaline Phosphatase Level

alanine_aminotransferase_level

Alanine Aminotransferase Level

aspartate_aminotransferase_level

Aspartate Aminotransferase Level

creatinine_level

Creatinine Level

ldh_level

LDH Level

calcium_level

Calcium Level

phosphorus_level

Phosphorus Level

glucose_level

Glucose Level

potassium_level

Potassium Level

sodium_level

Sodium Level

smoking_pack_years

Smoking Pack Years

Patient000068MaleCurrent Smoker81.67867747756685Lower LobeStage IIISurgery44HispanicMedicareNoYesYesYesNoYesYesYes3161999213.53799986330629.800707283777747321.7352658994583.56838317357492649.31004808172746627.9855707494940646.80121369153551.2458486726480742239.240255344747610.366307497008683.547734301889561113.919242538001094.968163296458876139.8228612755749517.006956111026494
Patient000158MaleNever Smoked78.44827194239302Lower LobeStage IRadiation Therapy101CaucasianPrivateYesYesYesNoNoYesYesNo4101919316.8003120380865744.378427902136247251.58153838735283.6993565701964513111.4216321173692830.1209563579046139.711531459438221.4632310298788858233.515237017174910.081731189647342.9450198927233777101.321578056557843.896794578388724135.4493613143372793.27089292439568
Patient000244MaleFormer Smoker67.71430467925421Lower LobeStage IChemotherapy69African AmericanOtherYesNoNoNoYesYesNoNo109748114.4734927851090776.157791530165437393.456532240540074.70838512471476676.648005935981395.88241845149967732.640601708702750.6301090575275732169.037459568097868.6608921007416024.63739949743354978.214176771609934.369050355351924143.377155194519970.3483762961798
Patient000372MaleCurrent Smoker70.80600837963757Lower LobeStage IIIChemotherapy95African AmericanMedicareYesYesNoYesNoYesYesNo1103856817.4420628801991746.259383054768449275.17789828289024.72767219205187981.9524864357317738.9081544369312744.319392719997050.5943416106583191213.967590111723448.8326687914508163.617098166594064127.895360779390064.348474077210295138.5860052804317519.82812752042046
Patient000437FemaleNever Smoked87.2724327432066Lower LobeStage IVRadiation Therapy105AsianMedicaidNoYesYesYesNoYesNoNo165699913.545170552580525.203515891066979381.70557214077944.605604018896203107.5134233141071626.3448768893582115.7469058870546481.478239027082041118.18754332502149.2476087236474534.773254868358881148.80118505353623.671975834333574141.230724282176381.04745606131432
Patient000550MaleNever Smoked72.1486560839287Lower LobeStage ISurgery49HispanicOtherYesNoYesNoYesNoYesYes41091036815.933717983431574.764999299816994175.79561431298463.05121192216290847.452013347450834.8138685375378429.769655491430690.825544006698517218.204614028651838.711924025428812.6610528218789358142.782619090355134.606624509454229135.4979443465054218.058524678606503
Patient000668FemaleCurrent Smoker19.122174980343733Middle LobeStage IRadiation Therapy63African AmericanOtherNoYesNoYesYesYesNoYes3137938916.7058652600083136.572108623069942190.849914622079444.3406526075127768.4917414633265531.01644601094215839.878953208268760.7995927921017624181.550727768834958.089885308865874.591885757771680575.377093762164044.800979942364532138.373412902201286.48233924788725
Patient000748MaleCurrent Smoker68.09505652070685Lower LobeStage IVChemotherapy101African AmericanMedicareNoYesNoYesYesNoNoYes3901019512.0157465876874925.261679595465527378.28557446181884.860352810042039103.5256963722003412.20826730463356223.908107125120251.4364531541299097119.057097235795249.3677657529427844.90935893894422599.511880849246984.061255316555459136.3471586774704568.23992040517143
Patient000852FemaleFormer Smoker25.29943999554186Lower LobeStage ITargeted Therapy35CaucasianOtherNoNoNoYesNoNoNoNo4157876716.2407490053182368.161061959857904248.14795222728484.68356210322021363.9183128276066736.8883575335176835.822952556851821.0891689830166689197.791756970455710.1880129596162343.326972618633935145.657153917728184.767092005123215141.1135025345896896.80888873695108
Patient000940MaleCurrent Smoker11.282766617549546Lower LobeStage ISurgery19OtherMedicaidYesNoNoYesNoYesYesYes1161957115.0273619080019046.330154812646519186.857251086803474.73163230103272892.9615039849885233.8360739521594744.230240131187241.0787938515579505227.048429560102768.2487178739012623.1734707621667324109.755477684712034.075269200733317139.1748550761947268.5958747306421

CREATE TABLE lung_cancer_data (
  "patient_id" VARCHAR,
  "age" BIGINT,
  "gender" VARCHAR,
  "smoking_history" VARCHAR,
  "tumor_size_mm" DOUBLE,
  "tumor_location" VARCHAR,
  "stage" VARCHAR,
  "treatment" VARCHAR,
  "survival_months" BIGINT,
  "ethnicity" VARCHAR,
  "insurance_type" VARCHAR,
  "family_history" VARCHAR,
  "comorbidity_diabetes" VARCHAR,
  "comorbidity_hypertension" VARCHAR,
  "comorbidity_heart_disease" VARCHAR,
  "comorbidity_chronic_lung_disease" VARCHAR,
  "comorbidity_kidney_disease" VARCHAR,
  "comorbidity_autoimmune_disease" VARCHAR,
  "comorbidity_other" VARCHAR,
  "performance_status" BIGINT,
  "blood_pressure_systolic" BIGINT,
  "blood_pressure_diastolic" BIGINT,
  "blood_pressure_pulse" BIGINT,
  "hemoglobin_level" DOUBLE,
  "white_blood_cell_count" DOUBLE,
  "platelet_count" DOUBLE,
  "albumin_level" DOUBLE,
  "alkaline_phosphatase_level" DOUBLE,
  "alanine_aminotransferase_level" DOUBLE,
  "aspartate_aminotransferase_level" DOUBLE,
  "creatinine_level" DOUBLE,
  "ldh_level" DOUBLE,
  "calcium_level" DOUBLE,
  "phosphorus_level" DOUBLE,
  "glucose_level" DOUBLE,
  "potassium_level" DOUBLE,
  "sodium_level" DOUBLE,
  "smoking_pack_years" DOUBLE
);

Share link

Anyone who has the link will be able to view this.