Youth Tobacco Dataset (2 Decades)
A comprehensive dataset of over two decades of data
@kaggle.sahirmaharajj_youth_tobacco_survey
This dataset was developed to provide states with comprehensive data on both middle school and high school students regarding tobacco use, exposure to environmental tobacco smoke, smoking cessation, school curriculum, minors' ability to purchase or otherwise obtain tobacco products, knowledge and attitudes about tobacco, and familiarity with pro-tobacco and anti-tobacco media messages. The dataset uses a two-stage cluster sample design to produce representative samples of students in middle schools (grades 6–8) and high schools (grades 9–12).
This dataset is valuable for data science because it covers youth tobacco use over nearly two decades. Its demographic detail and broad geographic coverage let researchers and policymakers identify trends, behaviors, and risk factors associated with tobacco use among young people.
For instance, it can show how tobacco use prevalence varies across age groups, genders, races, and education levels. Because the data is stratified by location and demographic characteristics, it supports targeted analysis that can inform public health strategies and educational campaigns aimed at reducing tobacco use among young people.
Some analysis of this dataset can include:
CREATE TABLE youth_tobacco_survey_yts_data (
"year" BIGINT,
"locationabbr" VARCHAR,
"locationdesc" VARCHAR,
"topictype" VARCHAR,
"topicdesc" VARCHAR,
"measuredesc" VARCHAR,
"datasource" VARCHAR,
"response" VARCHAR,
"data_value_unit" VARCHAR,
"data_value_type" VARCHAR,
"data_value" DOUBLE,
"data_value_footnote_symbol" VARCHAR,
"data_value_footnote" VARCHAR,
"data_value_std_err" DOUBLE,
"low_confidence_limit" DOUBLE,
"high_confidence_limit" DOUBLE,
"sample_size" DOUBLE,
"gender" VARCHAR,
"race" VARCHAR,
"age" VARCHAR,
"education" VARCHAR,
"geolocation" VARCHAR,
"topictypeid" VARCHAR,
"topicid" VARCHAR,
"measureid" VARCHAR,
"stratificationid1" VARCHAR,
"stratificationid2" VARCHAR,
"stratificationid3" VARCHAR,
"stratificationid4" VARCHAR,
"submeasureid" VARCHAR,
"displayorder" BIGINT
);
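As a rough illustration of the stratified analysis described above, the sketch below averages "data_value" by year and one demographic column. It is a minimal example under stated assumptions, not the dataset's official tooling: it uses Python's built-in sqlite3 module instead of the platform's own query engine, creates only a subset of the columns from the schema, and loads a few hypothetical rows (the location codes, topic label, and values are placeholders, not real survey figures). The helper names build_demo_db and mean_value_by_group are made up for this example.

```python
import sqlite3

# Hypothetical rows for illustration only; these are NOT values from the dataset.
# Columns: year, locationabbr, topicdesc, data_value, sample_size, gender, education
SAMPLE_ROWS = [
    (2015, "AA", "Example topic", 4.0, 1000.0, "Male", "Middle School"),
    (2015, "BB", "Example topic", 6.0, 800.0, "Male", "Middle School"),
    (2015, "AA", "Example topic", 3.0, 1200.0, "Female", "Middle School"),
    (2016, "AA", "Example topic", None, None, "Male", "Middle School"),  # missing value
    (2016, "AA", "Example topic", 2.0, 900.0, "Female", "Middle School"),
]

# Demographic columns from the schema that this sketch allows grouping on.
ALLOWED_GROUP_COLUMNS = {"gender", "race", "age", "education"}


def build_demo_db():
    """Create an in-memory table with a subset of the schema's columns."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE youth_tobacco_survey_yts_data (
            "year" BIGINT,
            "locationabbr" VARCHAR,
            "topicdesc" VARCHAR,
            "data_value" DOUBLE,
            "sample_size" DOUBLE,
            "gender" VARCHAR,
            "education" VARCHAR
        )
        """
    )
    conn.executemany(
        "INSERT INTO youth_tobacco_survey_yts_data VALUES (?, ?, ?, ?, ?, ?, ?)",
        SAMPLE_ROWS,
    )
    return conn


def mean_value_by_group(conn, group_col):
    """Return (year, group, mean data_value, row count), skipping NULL values."""
    # Column names cannot be bound as parameters, so validate against a whitelist.
    if group_col not in ALLOWED_GROUP_COLUMNS:
        raise ValueError(f"unsupported grouping column: {group_col!r}")
    query = f"""
        SELECT "year", "{group_col}", AVG("data_value"), COUNT(*)
        FROM youth_tobacco_survey_yts_data
        WHERE "data_value" IS NOT NULL
        GROUP BY "year", "{group_col}"
        ORDER BY "year", "{group_col}"
    """
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    demo_conn = build_demo_db()
    for row in mean_value_by_group(demo_conn, "gender"):
        print(row)
```

The average here is a plain unweighted mean of the reported values across rows, so it should be read as a quick descriptive summary and not as a population estimate. A real analysis would likely also filter on "topicdesc", "measuredesc", and "response" so that only comparable measures are averaged together.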