Baselight

Predict Students' Dropout And Academic Success

Investigating the Impact of Social and Economic Factors

@kaggle.thedevastator_higher_education_predictors_of_student_retention

Loading...
Loading...

About this Dataset

Predict Students' Dropout And Academic Success


Predict students' dropout and academic success

Investigating the Impact of Social and Economic Factors

By [source]


About this dataset

This dataset provides a comprehensive view of students enrolled in various undergraduate degrees offered at a higher education institution. It includes demographic data, social-economic factors and academic performance information that can be used to analyze the possible predictors of student dropout and academic success. This dataset contains multiple disjoint databases consisting of relevant information available at the time of enrollment, such as application mode, marital status, course chosen and more. Additionally, this data can be used to estimate overall student performance at the end of each semester by assessing curricular units credited/enrolled/evaluated/approved as well as their respective grades. Finally, we have unemployment rate, inflation rate and GDP from the region which can help us further understand how economic factors play into student dropout rates or academic success outcomes. This powerful analysis tool will provide valuable insight into what motivates students to stay in school or abandon their studies for a wide range of disciplines such as agronomy, design, education nursing journalism management social service or technologies

More Datasets

For more datasets, click here.

Featured Notebooks

  • 🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset can be used to understand and predict student dropouts and academic outcomes. The data includes a variety of demographic, social-economic and academic performance factors related to the students enrolled in higher education institutions. The dataset provides valuable insights into the factors that affect student success and could be used to guide interventions and policies related to student retention.

Using this dataset, researchers can investigate two key questions:

  • which specific predictive factors are linked with student dropout or completion?
  • how do different features interact with each other?
    For example, researchers could explore if there any demographic characteristics (e.g., gender, age at enrollment etc.) or immersion conditions (e.g., unemployment rate in region) are associated with higher student success rates, as well as understand what implications poverty has for educational outcomes. By answering these questions, research insight is generated which can provide critical information for administrators on formulating strategies that promote successful degree completion among students from diverse backgrounds in their institutions.

In order to use this dataset effectively it is important that scientists familiarize themselves with all variables provided in the dataset including categorical (qualitative) variables such as gender or application mode; numerical variables such as number of curricular units at the beginning of semesters or age at enrollment; ordinal data measurement type variables such as marital status; studied trends over time such as inflation rate or GDP; frequency measurements variables like percentage of scholarship holders; etc.. Additionally scientists should make sure they aware off all potential bias included in the data prior running analysis–for example understanding if one population is underrepresented compared another -as this phenomenon could lead unexpected results if not taken into consideration while conducting research undertaken using this data set.. Finally it would be important for practitioners realize that this current Kaggle Dataset contains only one semester-worth information on each admission intake whereas additional studies conducted for a longer time period might be able provide more accurate results related selected topic area due further deterioration retention achievement coefficients obtained from those gradually accurate experiments unfolding different year-long admissions seasons

Research Ideas

  • Prediction of Student Retention: This dataset can be used to develop predictive models that can identify student risk factors for dropout and take early interventions to improve student retention rate.
  • Improved Academic Performance: By using this data, higher education institutions could better understand their students' academic progress and identify areas of improvement from both an individual and institutional perspective. This will enable them to develop targeted courses, activities, or initiatives that enhance academic performance more effectively and efficiently.
  • Accessibility Assistance: Using the demographic information included in the dataset, institutions could develop specific initiatives designed to help certain groups more easily access higher education services or resources that may not typically be available in their area or for their social-economic class, helping close existing gaps in accessibility across different student populations

Acknowledgements

If you use this dataset in your research, please credit the original authors.
Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: dataset.csv

Column name Description
Marital status The marital status of the student. (Categorical)
Application mode The method of application used by the student. (Categorical)
Application order The order in which the student applied. (Numerical)
Course The course taken by the student. (Categorical)
Daytime/evening attendance Whether the student attends classes during the day or in the evening. (Categorical)
Previous qualification The qualification obtained by the student before enrolling in higher education. (Categorical)
Nacionality The nationality of the student. (Categorical)
Mother's qualification The qualification of the student's mother. (Categorical)
Father's qualification The qualification of the student's father. (Categorical)
Mother's occupation The occupation of the student's mother. (Categorical)
Father's occupation The occupation of the student's father. (Categorical)
Displaced Whether the student is a displaced person. (Categorical)
Educational special needs Whether the student has any special educational needs. (Categorical)
Debtor Whether the student is a debtor. (Categorical)
Tuition fees up to date Whether the student's tuition fees are up to date. (Categorical)
Gender The gender of the student. (Categorical)
Scholarship holder Whether the student is a scholarship holder. (Categorical)
Age at enrollment The age of the student at the time of enrollment. (Numerical)
International Whether the student is an international student. (Categorical)
Curricular units 1st sem (credited) The number of curricular units credited by the student in the first semester. (Numerical)
Curricular units 1st sem (enrolled) The number of curricular units enrolled by the student in the first semester. (Numerical)
Curricular units 1st sem (evaluations) The number of curricular units evaluated by the student in the first semester. (Numerical)
Curricular units 1st sem (approved) The number of curricular units approved by the student in the first semester. (Numerical)

Acknowledgements

If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit .

Tables

Dataset

@kaggle.thedevastator_higher_education_predictors_of_student_retention.dataset
  • 106.79 KB
  • 4424 rows
  • 35 columns
Loading...

CREATE TABLE dataset (
  "marital_status" BIGINT,
  "application_mode" BIGINT,
  "application_order" BIGINT,
  "course" BIGINT,
  "daytime_evening_attendance" BIGINT,
  "previous_qualification" BIGINT,
  "nacionality" BIGINT,
  "mother_s_qualification" BIGINT,
  "father_s_qualification" BIGINT,
  "mother_s_occupation" BIGINT,
  "father_s_occupation" BIGINT,
  "displaced" BIGINT,
  "educational_special_needs" BIGINT,
  "debtor" BIGINT,
  "tuition_fees_up_to_date" BIGINT,
  "gender" BIGINT,
  "scholarship_holder" BIGINT,
  "age_at_enrollment" BIGINT,
  "international" BIGINT,
  "curricular_units_1st_sem_credited" BIGINT,
  "curricular_units_1st_sem_enrolled" BIGINT,
  "curricular_units_1st_sem_evaluations" BIGINT,
  "curricular_units_1st_sem_approved" BIGINT,
  "curricular_units_1st_sem_grade" DOUBLE,
  "curricular_units_1st_sem_without_evaluations" BIGINT,
  "curricular_units_2nd_sem_credited" BIGINT,
  "curricular_units_2nd_sem_enrolled" BIGINT,
  "curricular_units_2nd_sem_evaluations" BIGINT,
  "curricular_units_2nd_sem_approved" BIGINT,
  "curricular_units_2nd_sem_grade" DOUBLE,
  "curricular_units_2nd_sem_without_evaluations" BIGINT,
  "unemployment_rate" DOUBLE,
  "inflation_rate" DOUBLE,
  "gdp" DOUBLE,
  "target" VARCHAR
);

Share link

Anyone who has the link will be able to view this.