Baselight

Cancer-treatment-response

synthetic clinical data + outcomes

@kaggle.lijsbeth_cancer_treatment_response

Loading...
Loading...

About this Dataset

Cancer-treatment-response

Prediction of the pathologically complete response after neoadjuvant chemoradiotherapy for oesophageal cancer (Toxopeus, E. L. A., Nieboer, D., Shapiro, J., Biermann, K., van der Gaast, A., van Rij, C. M., ... & Wijnhoven, B. P. L. (2015). Nomogram for predicting pathologically complete response after neoadjuvant chemoradiotherapy for oesophageal cancer. Radiotherapy and Oncology, 115(3), 392-398.)

Columns:

For information
Cancer stages:

  • T: main tumor: 0 - not found, 1-4 - size and extend, X - cannot be measured,
  • N: nearby lymphnode affected: 0 - not found, 1-3 - number of lymph nodes, X - cannot be measured,
  • M: metastasis: 0 - did not spread, 1 - has spread, X - cannot be measured.

Differentiation grades: 1-4 from well differentiated to undifferentiated, X - cannot be measured.

We use an assumption that the input parameters are non-correlated, which is not biologically the case.

There are some distortions:

  • duplication of records,
  • duplication of IDs,
  • introducing missing values,
  • introducing non-realistic values (negative age),
  • introducing unexpected values,
  • introducing typos.
    It is important to deal with them wisely as in healthcare, every data point is precious.

Tables

Treatment Response Development

@kaggle.lijsbeth_cancer_treatment_response.treatment_response_development
  • 99.44 KB
  • 1235 rows
  • 16 columns
Loading...

CREATE TABLE treatment_response_development (
  "unnamed_0" BIGINT,
  "index" BIGINT,
  "id" VARCHAR,
  "age" DOUBLE,
  "sex" VARCHAR,
  "tumor_type" VARCHAR,
  "differentiation_grade" VARCHAR,
  "t_stage" VARCHAR,
  "n_stage" VARCHAR,
  "m_stage" VARCHAR,
  "survival_time_days" DOUBLE,
  "overall_stage" VARCHAR,
  "smoking" VARCHAR,
  "weight_loss_percent" DOUBLE,
  "tumor_location" VARCHAR,
  "complete_response_probability" DOUBLE
);

Treatment Response External

@kaggle.lijsbeth_cancer_treatment_response.treatment_response_external
  • 46.79 KB
  • 500 rows
  • 15 columns
Loading...

CREATE TABLE treatment_response_external (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "age" DOUBLE,
  "sex" VARCHAR,
  "tumor_type" VARCHAR,
  "differentiation_grade" VARCHAR,
  "t_stage" VARCHAR,
  "n_stage" VARCHAR,
  "m_stage" VARCHAR,
  "survival_time_days" DOUBLE,
  "overall_stage" VARCHAR,
  "smoking" VARCHAR,
  "weight_loss_percent" DOUBLE,
  "tumor_location" VARCHAR,
  "complete_response_probability" DOUBLE
);

Share link

Anyone who has the link will be able to view this.