Baselight

Smoking Related Lung Cancers

Predict lung cancer based on demographic data

@kaggle.raddar_smoking_related_lung_cancers

Loading...
Loading...

About this Dataset

Smoking Related Lung Cancers

This is a subset of data available from the US National Lung Screening Trial (NLST). The data contains information about current and former smokers who were observed for 7 years and were tested for lung cancer each year. No non-smokers were involved in the trial.

Data contains:

  • pid - anonymous identifier of a person
  • age - age of a person at the start of the trial
  • gender - Male/Female
  • race - the race of a person
  • smoker - Former/Current (Former is defined as quit smoking in last 15 years)
  • days_to_cancer - number of days passed since the trial when the cancer was first observed
  • stage_of_cancer - the stage of cancer when the cancer was first observed

Full trial metadata is available at https://wiki.cancerimagingarchive.net/download/attachments/5800702/package-nlst-780.2021-05-28.zip?version=1&modificationDate=1633562878492&api=v2

Tables

Lung Cancer

@kaggle.raddar_smoking_related_lung_cancers.lung_cancer
  • 403.93 kB
  • 53,427 rows
  • 7 columns
Loading...
CREATE TABLE lung_cancer (
  "pid" BIGINT,
  "age" BIGINT,
  "gender" VARCHAR,
  "race" VARCHAR,
  "smoker" VARCHAR,
  "days_to_cancer" DOUBLE,
  "stage_of_cancer" VARCHAR
);

Share link

Anyone who has the link will be able to view this.