U.S. Healthcare Data
Population Health, Diseases, Drugs, Nutritions, Health-plans
@kaggle.maheshdadhich_us_healthcare_data
Health care in the United States is provided by many distinct organizations. Health care facilities are largely owned and operated by private-sector businesses: 58% of US community hospitals are non-profit, 21% are government-owned, and 21% are for-profit. According to the World Health Organization (WHO), in 2014 the United States spent more on health care per capita ($9,403), and more as a percentage of its GDP (17.1%), than any other nation. Many different datasets are needed to portray the different aspects of health care in the US, such as disease prevalence, pharmaceuticals and drugs, and nutritional data for the food products available in the US. Such data is collected through surveys (or otherwise) by the Centers for Disease Control and Prevention (CDC), the Food and Drug Administration (FDA), the Centers for Medicare & Medicaid Services (CMS), and the Agency for Healthcare Research and Quality (AHRQ). These datasets can be used to review demographics and diseases, to determine star ratings of health care providers, to look up drugs together with their compositions and package information, and to assess food quality. Such information is often needed, but finding and scraping it can be a huge hurdle. This dataset is an attempt to gather US healthcare data in one place, downloadable as CSV files.
NHANES (National Health and Nutrition Examination Survey) - NHANES is a program of studies designed to assess the health and nutritional status of adults and children in the United States. The survey is unique in that it combines interviews and physical examinations. NHANES is a major program of the National Center for Health Statistics (NCHS). NCHS is part of the Centers for Disease Control and Prevention (CDC) and is responsible for producing vital and health statistics for the nation. The NHANES interview includes demographic, socioeconomic, dietary, and health-related questions. The examination component consists of medical, dental, and physiological measurements, as well as laboratory tests administered by highly trained medical personnel. The diseases, medical conditions, and health indicators studied include: Anemia, Cardiovascular disease, Diabetes, Environmental exposures, Eye diseases, Hearing loss, Infectious diseases, Kidney disease, Nutrition, Obesity, Oral health, Osteoporosis, Physical fitness and physical functioning, Reproductive history and sexual behavior, Respiratory disease (asthma, chronic bronchitis, emphysema), Sexually transmitted diseases, Vision. About 10,000 individuals are surveyed to produce statistics representative of the US population.
Five files in this dataset hold recent NHANES data -
Nhanes_2005_2006.csv
Nhanes_2007_2008.csv
Nhanes_2009_2010.csv
Nhanes_2011_2012.csv
Nhanes_2013_2014.csv
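The five files share one naming pattern, so the survey cycle can be recovered from the file name when the files are combined. The following is a minimal sketch, assuming the files are stored under exactly the names listed above; the helper `cycle_from_filename` is a hypothetical name introduced here, not something shipped with the dataset.

```python
import re
from pathlib import Path

# File names as listed above; where they live on disk is an assumption.
NHANES_FILES = [
    "Nhanes_2005_2006.csv",
    "Nhanes_2007_2008.csv",
    "Nhanes_2009_2010.csv",
    "Nhanes_2011_2012.csv",
    "Nhanes_2013_2014.csv",
]

_CYCLE_RE = re.compile(r"^Nhanes_(\d{4})_(\d{4})\.csv$")


def cycle_from_filename(name):
    """Return (start_year, end_year) parsed from an NHANES file name."""
    match = _CYCLE_RE.match(Path(name).name)
    if match is None:
        raise ValueError(f"not an NHANES cycle file: {name!r}")
    return int(match.group(1)), int(match.group(2))


# Map each file to the two-year survey cycle it covers.
cycles = {name: cycle_from_filename(name) for name in NHANES_FILES}
```

Tagging every row with its cycle before concatenating the files keeps respondents from different cycles distinguishable in the combined table.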
US drugs datasets - The FDA provides a searchable database on its website covering all published and unpublished drugs. The database gives the package information, composition, and NDC (National Drug Code) codes of each drug. Descriptions of the variables in these datasets are as follows -
Drugs_product (current and unfinished)
Drugs Package (current and unfinished)
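Because the drug files carry NDC codes, those codes are a natural key for linking a package record to its product record. The sketch below assumes that a package code is written as three hyphen-separated numeric segments (labeler, product, package) and that the product code is the first two segments. The sample code used in the usage note and the function names are hypothetical; the actual column names and code formatting in the CSV files should be checked first.

```python
def split_package_ndc(code):
    """Split a hyphenated package NDC into (labeler, product, package)."""
    parts = code.strip().split("-")
    if len(parts) != 3 or not all(part.isdigit() for part in parts):
        raise ValueError(f"unexpected NDC package code: {code!r}")
    return parts[0], parts[1], parts[2]


def product_ndc(code):
    """Derive the two-segment product NDC from a package NDC."""
    labeler, product, _package = split_package_ndc(code)
    return f"{labeler}-{product}"
```

For a made-up package code such as `"1234-5678-90"`, `product_ndc` yields `"1234-5678"`, which could then be matched against the product file.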
Nutrition data from USDA - Whenever we buy a packaged food product, we find nutrition facts printed on it. This data comes from the United States Department of Agriculture (USDA) Agricultural Research Service's Food Composition Database, which covers the kinds of food products available in the US and describes their nutritional content. The dataset was web-scraped and converted into a CSV file. The variable names are self-explanatory, but descriptions can be found at this link - variables descriptions - (all values are per 100 grams) -
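Since every value is given per 100 grams, comparing against a real portion means rescaling by the portion weight. A minimal sketch of that arithmetic follows; the function name and the example numbers are illustrative assumptions, not values taken from the dataset.

```python
def per_serving(value_per_100g, serving_grams):
    """Scale a per-100-gram nutrient value to a serving of the given weight."""
    if serving_grams < 0:
        raise ValueError("serving weight must be non-negative")
    return value_per_100g * serving_grams / 100.0


# Hypothetical example: 250 kcal per 100 g, eaten as a 40 g serving.
kcal_in_serving = per_serving(250.0, 40.0)
```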
Star rating of health care plans with HOS-CAHPS measures - HOS and CAHPS survey measures are the basis for determining the star rating of a healthcare plan. The star rating files contain two types of measures used to determine the rating: Part C and Part D. Part C has three types of information: 1. Chronic conditions (diseases) 2. Tests and vaccines 3. Member experience with healthcare plans. All variables from C01 to C32 relate to Part C of the surveys. Similarly, Part D of the survey relates to drug plan customer service, and the variables from D01 to D15 relate to Part D. Surveys such as HOS and CAHPS contain questions whose results feed into the C01 to C32 and D01 to D15 measures. The dataset has two releases of star rating and measurement data, from Fall 2015 and Spring 2016. Files description -
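The C01 to C32 and D01 to D15 prefixes make it possible to separate Part C from Part D columns by name alone. The sketch below assumes that each measure column name begins with its two-digit code; the exact column naming in the CSV files is not specified here, and the function names are hypothetical.

```python
import re

# Part C measures run C01..C32 and Part D measures run D01..D15 (per the text).
_LIMITS = {"C": 32, "D": 15}
# A letter C or D followed by exactly two digits at the start of the name.
_MEASURE_RE = re.compile(r"^([CD])(\d{2})(?!\d)")


def measure_part(column):
    """Return "C" or "D" for a measure column, or None for anything else."""
    match = _MEASURE_RE.match(column.strip())
    if match is None:
        return None
    part, number = match.group(1), int(match.group(2))
    return part if 1 <= number <= _LIMITS[part] else None


def split_measures(columns):
    """Group column names into Part C and Part D lists, preserving order."""
    groups = {"C": [], "D": []}
    for column in columns:
        part = measure_part(column)
        if part is not None:
            groups[part].append(column)
    return groups
```

Columns that do not match either range, such as identifiers or plan names, are left out of both groups.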
I have collected these files from various data websites and data sources listed below -
NHANES - from CDC's National Health and Nutrition Examination Survey. Link
Drugs dataset - from FDA drug database. link
Nutrition dataset - USDA Food Composition Database. link
Star rating dataset - CMS website. link
These datasets are used in hundreds of publications per year worldwide. Link