Baselight

College Completion Dataset

Graduation Rates, Race, Efficiency Measures and More

@kaggle.thedevastator_boost_student_success_with_college_completion_da

Loading...
Loading...

About this Dataset

College Completion Dataset


College Completion Dataset

Graduation Rates, Race, Efficiency Measures and More

By Jonathan Ortiz [source]


About this dataset

This College Completion dataset provides an invaluable insight into the success and progress of college students in the United States. It contains graduation rates, race and other data to offer a comprehensive view of college completion in America. The data is sourced from two primary sources – the National Center for Education Statistics (NCES)’ Integrated Postsecondary Education System (IPEDS) and Voluntary System of Accountability’s Student Success and Progress rate.

At four-year institutions, the graduation figures come from IPEDS for first-time, full-time degree seeking students at the undergraduate level, who entered college six years earlier at four-year institutions or three years earlier at two-year institutions. Furthermore, colleges report how many students completed their program within 100 percent and 150 percent of normal time which corresponds with graduation within four years or six year respectively. Students reported as being of two or more races are included in totals but not shown separately

When analyzing race and ethnicity data NCES have classified student demographics since 2009 into seven categories; White non-Hispanic; Black non Hispanic; American Indian/ Alaskan native ; Asian/ Pacific Islander ; Unknown race or ethnicity ; Non resident with two new categorize Native Hawaiian or Other Pacific Islander combined with Asian plus students belonging to several races. Also worth noting is that different classifications for graduate data stemming from 2008 could be due to variations in time frame examined & groupings used by particular colleges – those who can’t be identified from National Student Clearinghouse records won’t be subjected to penalty by these locations .

When it comes down to efficiency measures parameters like “Awards per 100 Full Time Undergraduate Students which includes all undergraduate completions reported by a particular institution including associate degrees & certificates less than 4 year programme will assist us here while we also take into consideration measures like expenditure categories , Pell grant percentage , endowment values , average student aid amounts & full time faculty members contributing outstandingly towards instructional research / public service initiatives .

When trying to quantify outcomes back up Median Estimated SAT score metric helps us when it is derived either on 25th percentile basis / 75th percentile basis with all these factors further qualified by identifying required criteria meeting 90% threshold when incoming students are considered for relevance . Last but not least , Average Student Aid equalizes amount granted by institution dividing same over total sum received against what was allotted that particular year .

All this analysis gives an opportunity get a holistic overview about performance , potential deficits &

More Datasets

For more datasets, click here.

Featured Notebooks

  • 🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset contains data on student success, graduation rates, race and gender demographics, an efficiency measure to compare colleges across states and more. It is a great source of information to help you better understand college completion and student success in the United States.

In this guide we’ll explain how to use the data so that you can find out the best colleges for students with certain characteristics or focus on your target completion rate. We’ll also provide some useful tips for getting the most out of this dataset when seeking guidance on which institutions offer the highest graduation rates or have a good reputation for success in terms of completing programs within normal timeframes.

Before getting into specifics about interpreting this dataset, it is important that you understand that each row represents information about a particular institution – such as its state affiliation, level (two-year vs four-year), control (public vs private), name and website. Each column contains various demographic information such as rate of awarding degrees compared to other institutions in its sector; race/ethnicity Makeup; full-time faculty percentage; median SAT score among first-time students; awards/grants comparison versus national average/state average - all applicable depending on institution location — and more!

When using this dataset, our suggestion is that you begin by forming a hypothesis or research question concerning student completion at a given school based upon observable characteristics like financial resources invested per award or correlations between institutional control vs outcomes within five different sectors: public 2‑year schools, public 4‑year schools, private non‑profit 2‑year schools, private non‑profit 4‑year schools and for-profit postsecondary institutions. You may choose certain demographic groups as part of your hypothesis or study outcomes trends across entire population groups over time with these cohorts selected from 2000–2014 years listed in comparative chart below:

Cohort	Group	        Numeric Equivalent Code

    G12	Graduates 12th grade   	    1 
    FW14 Freshman who will graduate 14th grade   2  

FTUG14 First Time Undergraduate ‑ Graduates 14th year 3                                  
  TTUG14     Transfer Undergraduates ‑ Graduates 14th year 4               

Our recommendation would be using similar code method suggested above when combining “Group” & pressurizing corresponding numeric figure to specify cohort drop down menu under Statistical Filter tab formatting user experience accordingly). After forming initial

Research Ideas

  • Developing targeted programs to increase graduation rates and reduce the racial achievement gap: This dataset can be used to analyze college completion trends at both the state and institutional levels, focus on differential rates of student graduation that vary based on gender, race, or other factors, and measure performance against set benchmarks. This analysis could be used to create solutions tailored to individual institutions and states aimed at reducing disparities in college completion between different demographic groups.

  • Identifying financial efficiency measures for higher education institutions: Using this dataset, researchers can explore expenditure patterns among different types of schools (e.g., publics versus privates) and focus on total spending per award as a way of gauging financial efficiency among higher education institutions. By understanding the links between spending patterns and student outcomes such as graduation rates, policy makers can design strategies to optimize institutional resources while increasing overall collegiate success.

  • Developing a correlation model between SAT scores/test results and college completion: Researchers could use this dataset's estimated median SAT score information as well as its state-level comparison data to develop a model exploring the relationship between initial test scores (or other pre-college indications) with eventual college matriculation or graduation

Acknowledgements

If you use this dataset in your research, please credit the original authors.
Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: cc_state_sector_grads.csv

Column name Description
state The state in which the college is located. (String)
state_abbr The two-letter abbreviation of the state in which the college is located. (String)
control The type of college (public or private). (String)
level The type of institution (two-year or four-year). (String)
gender The gender of the student. (String)
race The race/ethnicity of the student. (String)
cohort The number of students in the cohort. (Integer)
grad_cohort The number of students who graduated from the cohort. (Integer)
grad_100 The number of students who graduated within 100% of normal time. (Integer)
grad_150 The number of students who graduated within 150% of normal time. (Integer)
grad_100_rate The graduation rate within 100% of normal time. (Float)
grad_150_rate The graduation rate within 150% of normal time. (Float)
grad_cohort_ct The total number of students in the cohort. (Integer)

File: cc_institution_grads.csv

Column name Description
level The type of institution (two-year or four-year). (String)
gender The gender of the student. (String)
race The race/ethnicity of the student. (String)
cohort The number of students in the cohort. (Integer)
grad_cohort The number of students who graduated from the cohort. (Integer)
grad_100 The number of students who graduated within 100% of normal time. (Integer)
grad_150 The number of students who graduated within 150% of normal time. (Integer)
grad_100_rate The graduation rate within 100% of normal time. (Float)
grad_150_rate The graduation rate within 150% of normal time. (Float)

File: cc_institution_details.csv

Column name Description
state The state in which the college is located. (String)
level The type of institution (two-year or four-year). (String)
control The type of college (public or private). (String)
chronname The name of the college or university. (String)
city The city in which the college is located. (String)
basic A flag indicating whether the college is a basic institution. (Boolean)
hbcu A flag indicating whether the college is a historically black college or university. (Boolean)
flagship A flag indicating whether the college is a flagship institution. (Boolean)
long_x The longitude of the college. (Float)
lat_y The latitude of the college. (Float)
site The website of the college. (String)
student_count The number of students enrolled at the college. (Integer)
awards_per_value The number of awards per 100 full-time undergraduates. (Float)
awards_per_state_value The number of awards per 100 full-time undergraduates compared to the state average. (Float)
awards_per_natl_value The number of awards per 100 full-time undergraduates compared to the national average. (Float)
exp_award_value The amount of money spent per award. (Float)
exp_award_state_value The amount of money spent per award compared to the state average. (Float)
exp_award_natl_value The amount of money spent per award compared to the national average. (Float)
exp_award_percentile The percentile of the amount of money spent per award compared to other colleges. (Float)
ft_pct The percentage of full-time students. (Float)
fte_value The number of full-time equivalent students. (Float)
fte_percentile The percentile of the number of full-time equivalent students compared to other colleges. (Float)
med_sat_value The median estimated SAT score. (Float)

File: cc_state_sector_details.csv

Column name Description
state The state in which the college is located. (String)
state_abbr The two-letter abbreviation of the state in which the college is located. (String)
level The type of institution (two-year or four-year). (String)
control The type of college (public or private). (String)
awards_per_state_value The number of awards per 100 full-time undergraduates compared to the state average. (Float)
awards_per_natl_value The number of awards per 100 full-time undergraduates compared to the national average. (Float)
exp_award_state_value The amount of money spent per award compared to the state average. (Float)
exp_award_natl_value The amount of money spent per award compared to the national average. (Float)

Acknowledgements

If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit Jonathan Ortiz.

Tables

Cc Institution Details

@kaggle.thedevastator_boost_student_success_with_college_completion_da.cc_institution_details
  • 834.79 KB
  • 3798 rows
  • 63 columns
Loading...

CREATE TABLE cc_institution_details (
  "index" BIGINT,
  "unitid" BIGINT,
  "chronname" VARCHAR,
  "city" VARCHAR,
  "state" VARCHAR,
  "level" VARCHAR,
  "control" VARCHAR,
  "basic" VARCHAR,
  "hbcu" VARCHAR,
  "flagship" VARCHAR,
  "long_x" DOUBLE,
  "lat_y" DOUBLE,
  "site" VARCHAR,
  "student_count" BIGINT,
  "awards_per_value" DOUBLE,
  "awards_per_state_value" DOUBLE,
  "awards_per_natl_value" DOUBLE,
  "exp_award_value" BIGINT,
  "exp_award_state_value" BIGINT,
  "exp_award_natl_value" BIGINT,
  "exp_award_percentile" BIGINT,
  "ft_pct" DOUBLE,
  "fte_value" BIGINT,
  "fte_percentile" BIGINT,
  "med_sat_value" DOUBLE,
  "med_sat_percentile" DOUBLE,
  "aid_value" DOUBLE,
  "aid_percentile" DOUBLE,
  "endow_value" DOUBLE,
  "endow_percentile" DOUBLE,
  "grad_100_value" DOUBLE,
  "grad_100_percentile" DOUBLE,
  "grad_150_value" DOUBLE,
  "grad_150_percentile" DOUBLE,
  "pell_value" DOUBLE,
  "pell_percentile" DOUBLE,
  "retain_value" DOUBLE,
  "retain_percentile" DOUBLE,
  "ft_fac_value" DOUBLE,
  "ft_fac_percentile" DOUBLE,
  "vsa_year" DOUBLE,
  "vsa_grad_after4_first" DOUBLE,
  "vsa_grad_elsewhere_after4_first" DOUBLE,
  "vsa_enroll_after4_first" DOUBLE,
  "vsa_enroll_elsewhere_after4_first" DOUBLE,
  "vsa_grad_after6_first" DOUBLE,
  "vsa_grad_elsewhere_after6_first" DOUBLE,
  "vsa_enroll_after6_first" DOUBLE,
  "vsa_enroll_elsewhere_after6_first" DOUBLE,
  "vsa_grad_after4_transfer" DOUBLE,
  "vsa_grad_elsewhere_after4_transfer" DOUBLE,
  "vsa_enroll_after4_transfer" DOUBLE,
  "vsa_enroll_elsewhere_after4_transfer" DOUBLE,
  "vsa_grad_after6_transfer" DOUBLE,
  "vsa_grad_elsewhere_after6_transfer" DOUBLE,
  "vsa_enroll_after6_transfer" DOUBLE,
  "vsa_enroll_elsewhere_after6_transfer" DOUBLE,
  "similar" VARCHAR,
  "state_sector_ct" BIGINT,
  "carnegie_ct" BIGINT,
  "counted_pct" VARCHAR,
  "nicknames" VARCHAR,
  "cohort_size" DOUBLE
);

Cc Institution Grads

@kaggle.thedevastator_boost_student_success_with_college_completion_da.cc_institution_grads
  • 10.32 MB
  • 1302102 rows
  • 11 columns
Loading...

CREATE TABLE cc_institution_grads (
  "index" BIGINT,
  "unitid" BIGINT,
  "year" BIGINT,
  "gender" VARCHAR,
  "race" VARCHAR,
  "cohort" VARCHAR,
  "grad_cohort" DOUBLE,
  "grad_100" DOUBLE,
  "grad_150" DOUBLE,
  "grad_100_rate" DOUBLE,
  "grad_150_rate" DOUBLE
);

Cc State Sector Details

@kaggle.thedevastator_boost_student_success_with_college_completion_da.cc_state_sector_details
  • 22.15 KB
  • 312 rows
  • 17 columns
Loading...

CREATE TABLE cc_state_sector_details (
  "index" BIGINT,
  "stateid" BIGINT,
  "state" VARCHAR,
  "state_abbr" VARCHAR,
  "state_post" VARCHAR,
  "level" VARCHAR,
  "control" VARCHAR,
  "schools_count" BIGINT,
  "counted_pct" DOUBLE,
  "awards_per_state_value" DOUBLE,
  "awards_per_natl_value" DOUBLE,
  "exp_award_state_value" DOUBLE,
  "exp_award_natl_value" BIGINT,
  "state_appr_value" DOUBLE,
  "state_appr_rank" DOUBLE,
  "grad_rate_rank" DOUBLE,
  "awards_per_rank" DOUBLE
);

Cc State Sector Grads

@kaggle.thedevastator_boost_student_success_with_college_completion_da.cc_state_sector_grads
  • 1.06 MB
  • 84942 rows
  • 16 columns
Loading...

CREATE TABLE cc_state_sector_grads (
  "index" BIGINT,
  "stateid" BIGINT,
  "state" VARCHAR,
  "state_abbr" VARCHAR,
  "control" VARCHAR,
  "level" VARCHAR,
  "year" BIGINT,
  "gender" VARCHAR,
  "race" VARCHAR,
  "cohort" VARCHAR,
  "grad_cohort" BIGINT,
  "grad_100" DOUBLE,
  "grad_150" BIGINT,
  "grad_100_rate" DOUBLE,
  "grad_150_rate" DOUBLE,
  "grad_cohort_ct" BIGINT
);

Share link

Anyone who has the link will be able to view this.