This dataset was developed to provide states with comprehensive data on both middle school and high school students regarding tobacco use, exposure to environmental tobacco smoke, smoking cessation, school curriculum, minors' ability to purchase or otherwise obtain tobacco products, knowledge and attitudes about tobacco, and familiarity with pro-tobacco and anti-tobacco media messages. The dataset uses a two-stage cluster sample design to produce representative samples of students in middle schools (grades 6–8) and high schools (grades 9–12)
This dataset is valuable for data science due to its coverage of youth tobacco use over nearly two decades. Its rich demographic details and broad geographical spread enable researchers and policymakers to identify trends, behaviors, and risk factors associated with tobacco use among the youth.
For instance, it can help in understanding how tobacco use prevalence varies across different age groups, genders, races, and educational backgrounds. The stratification of data by location and demographic characteristics allows for targeted analysis that can inform public health strategies and educational campaigns aimed at reducing tobacco use among young people.
Some analysis of this dataset can include:
- Statistical assessments of tobacco use trends, examining changes in attitudes towards tobacco, and identifying high-risk groups based on demographic characteristics.
- Performing time-series analyses to understand how tobacco use has evolved over the years or spatial analyses to identify geographical variations in tobacco use trends.
- Correlation studies can help uncover associations between tobacco use and factors like education levels, race, and gender.
- Advanced machine learning models could predict future trends in youth tobacco use or evaluate the potential impact of new tobacco control measures.