Baselight
Sign In
kaggle

Cancer-treatment-response

Kaggle

@kaggle.lijsbeth_cancer_treatment_response

Loading...
Loading...

synthetic clinical data + outcomes

Dataset Description

Prediction of the pathologically complete response after neoadjuvant chemoradiotherapy for oesophageal cancer (Toxopeus, E. L. A., Nieboer, D., Shapiro, J., Biermann, K., van der Gaast, A., van Rij, C. M., ... & Wijnhoven, B. P. L. (2015). Nomogram for predicting pathologically complete response after neoadjuvant chemoradiotherapy for oesophageal cancer. Radiotherapy and Oncology, 115(3), 392-398.)

Columns:

For information
Cancer stages:

  • T: main tumor: 0 - not found, 1-4 - size and extend, X - cannot be measured,
  • N: nearby lymphnode affected: 0 - not found, 1-3 - number of lymph nodes, X - cannot be measured,
  • M: metastasis: 0 - did not spread, 1 - has spread, X - cannot be measured.

Differentiation grades: 1-4 from well differentiated to undifferentiated, X - cannot be measured.

We use an assumption that the input parameters are non-correlated, which is not biologically the case.

There are some distortions:

  • duplication of records,
  • duplication of IDs,
  • introducing missing values,
  • introducing non-realistic values (negative age),
  • introducing unexpected values,
  • introducing typos.
    It is important to deal with them wisely as in healthcare, every data point is precious.

Related Datasets

Share link

Anyone who has the link will be able to view this.