Baselight

World University Rankings 2023 - Cleaned

World University Rankings 2023 include 1,799 universities across 104 countries.

@kaggle.samiatisha_world_university_rankings_2023_clean_dataset


About this Dataset

World University Rankings 2023 - Cleaned

Dataset Name: World University Rankings 2023 - Cleaned

Dataset Source

This dataset is a cleaned and preprocessed version of the "World University Rankings 2023" originally provided by Syed Ali Taqi on Kaggle. The original dataset included 13 features, covering information about universities worldwide.

Dataset Features

  1. University Rank
  2. Name of University
  3. Location
  4. No of Student
  5. No of Student per Staff
  6. International Student
  7. Female : Male Ratio
  8. Overall Score
  9. Teaching Score
  10. Research Score
  11. Citations Score
  12. Industry Income Score
  13. International Outlook Score

Description

This cleaned version of the dataset has been preprocessed by handling missing values and encoding categorical features. According to the original description it consists of 2,341 rows and 2,361 columns (the table listing on this page reports 2,362 columns), and it is intended for data analysis, machine learning, and research in the field of higher education.

Original vs. Cleaned Version Comparison

Original Version

The original version of the "World University Rankings 2023" dataset was a comprehensive collection of data on 1,799 universities across 104 countries and regions. While it provided valuable insights into higher education worldwide, it presented some challenges due to missing values, inconsistencies, and a mix of data types.

Original Dataset Source:
World University Rankings 2023

Cleaned Version

In this cleaned version of the dataset, significant efforts have been made to enhance its quality and usability. The following improvements were made:

Handling Missing Values:

  • Missing values, including NaN and null values, have been handled for every feature in the dataset.
  • Specifically, missing values in the "Name of University" and "Location" columns have been replaced with meaningful placeholders: "Unknown University" and "Unknown Location," respectively.
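The placeholder rule above can be sketched in a few lines of plain Python. This is an illustrative sketch, not the author's preprocessing code: the helper name `fill_missing_labels`, the dict-per-row layout, and the set of strings treated as missing are all assumptions.

```python
# Hypothetical sketch: each row is a plain dict keyed by the original
# column names. The helper name and the "missing" markers are assumptions.
PLACEHOLDERS = {
    "Name of University": "Unknown University",
    "Location": "Unknown Location",
}

def fill_missing_labels(row):
    """Return a copy of row with a missing name/location replaced by its placeholder."""
    cleaned = dict(row)
    for column, placeholder in PLACEHOLDERS.items():
        value = cleaned.get(column)
        # Treat None, float NaN, empty strings, and the text "nan"/"null" as missing.
        if value is None or str(value).strip().lower() in {"", "nan", "null"}:
            cleaned[column] = placeholder
    return cleaned
```

A row such as `{"Name of University": None, "Location": "Japan"}` keeps its location and receives "Unknown University" as its name.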

Encoding and Transformation:

  • One-hot encoding has been applied to the "Name of University" and "Location" columns, converting categorical data into a numerical format suitable for analysis and modeling.
  • The original "Female : Male Ratio" column has been split into separate "Female Ratio" and "Male Ratio" columns, which simplifies analysis of gender ratios.
  • "OverAll Score" has been split into "OverAll Score Min" and "OverAll Score Max" columns, capturing the range of scores.
  • "International Student" values have been encoded as fractions, which makes them easier to interpret and analyze.
  • Several features, including "Female Ratio," "Male Ratio," "OverAll Score Min," "OverAll Score Max," "No of Student," and "International Student," are stored as numerical values, so they can be used directly in data analysis and modeling.
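The string-to-number transformations listed above might look roughly like the following. The raw input formats ("54 : 46" for the ratio, "34.0–39.2" for a score range, "25%" for international students, "20,965" for student counts) are assumptions about the original data, and the function names are hypothetical; the actual preprocessing lives in the author's repository.

```python
def split_ratio(text):
    """Split a 'female : male' string (assumed format '54 : 46') into two floats."""
    female, male = (float(part) for part in text.split(":"))
    return female, male

def split_score_range(text):
    """Return (min, max) for a score given as a single value or as a range."""
    # Normalise an en dash to a hyphen so both range spellings are handled.
    parts = text.replace("\u2013", "-").split("-")
    # A single value yields one part, so min and max are the same number.
    return float(parts[0]), float(parts[-1])

def percent_to_fraction(text):
    """Convert a percentage string (assumed format '25%') to a fraction."""
    return float(text.strip().rstrip("%")) / 100

def parse_count(text):
    """Convert a count with thousands separators (assumed '20,965') to a float."""
    return float(text.replace(",", ""))
```

Returning the same value for both the minimum and the maximum when the score is a single number keeps the two output columns populated for every row.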

These changes make the dataset a cleaner, better-structured resource for data analysis, machine learning, and research in the field of higher education.

Whether you are performing exploratory data analysis, building predictive models, or doing research, this cleaned version of the dataset provides a solid starting point.

GitHub Repository

For more details on the data preprocessing steps and to access the cleaned dataset, you can visit the GitHub repository where the preprocessing was performed: GitHub Repository

If you find value in this "World University Rankings 2023 - Cleaned" dataset, please consider upvoting it on Kaggle to boost its visibility. Additionally, star our GitHub repository to show your support for the data preprocessing efforts. Your support is greatly appreciated!

Tables

Preprocessed World University Rankings 2023 Dataset

@kaggle.samiatisha_world_university_rankings_2023_clean_dataset.preprocessed_world_university_rankings_2023_dataset
  • 2.32 MB
  • 2341 rows
  • 2362 columns

CREATE TABLE preprocessed_world_university_rankings_2023_dataset (
  "no_of_student" DOUBLE,
  "no_of_student_per_staff" DOUBLE,
  "international_student" DOUBLE,
  "teaching_score" DOUBLE,
  "research_score" DOUBLE,
  "citations_score" DOUBLE,
  "industry_income_score" DOUBLE,
  "international_outlook_score" DOUBLE,
  "female_ratio" DOUBLE,
  "male_ratio" DOUBLE,
  "overall_score_min" DOUBLE,
  "overall_score_max" DOUBLE,
  "name_of_university_agh_university_of_krakow" BIGINT,
  "name_of_university_aalborg_university" BIGINT,
  "name_of_university_aalto_university" BIGINT,
  "name_of_university_aarhus_university" BIGINT,
  "name_of_university_abdelmalek_essa_di_university" BIGINT,
  "name_of_university_abdul_wali_khan_university_mardan" BIGINT,
  "name_of_university_abdullah_g_l_university" BIGINT,
  "name_of_university_abertay_university" BIGINT,
  "name_of_university_aberystwyth_university" BIGINT,
  "name_of_university_abu_dhabi_university" BIGINT,
  "name_of_university_academy_of_economic_studies_of_moldova" BIGINT,
  "name_of_university_acharya_nagarjuna_university" BIGINT,
  "name_of_university_ac_badem_university" BIGINT,
  "name_of_university_adam_mickiewicz_university_pozna" BIGINT,
  "name_of_university_adamawa_state_university_mubi" BIGINT,
  "name_of_university_addis_ababa_university" BIGINT,
  "name_of_university_adolfo_ib_ez_university" BIGINT,
  "name_of_university_afe_babalola_university" BIGINT,
  "name_of_university_aga_khan_university" BIGINT,
  "name_of_university_ahl_al_bayt_university" BIGINT,
  "name_of_university_ahvaz_jundishapur_university_of_med_787053b7" BIGINT,
  "name_of_university_aichi_medical_university" BIGINT,
  "name_of_university_aichi_prefectural_university" BIGINT,
  "name_of_university_ain_shams_university" BIGINT,
  "name_of_university_air_university" BIGINT,
  "name_of_university_aix_marseille_university" BIGINT,
  "name_of_university_ajeenkya_dy_patil_university" BIGINT,
  "name_of_university_ajman_university" BIGINT,
  "name_of_university_ajou_university" BIGINT,
  "name_of_university_akdeniz_university" BIGINT,
  "name_of_university_akita_prefectural_university" BIGINT,
  "name_of_university_akita_university" BIGINT,
  "name_of_university_aksaray_university" BIGINT,
  "name_of_university_akwa_ibom_state_university" BIGINT,
  "name_of_university_al_ahliyya_amman_university" BIGINT,
  "name_of_university_al_ayen_university" BIGINT,
  "name_of_university_al_azhar_university" BIGINT,
  "name_of_university_al_balqa_applied_university" BIGINT,
  "name_of_university_al_esraa_university_college" BIGINT,
  "name_of_university_al_farabi_kazakh_national_university" BIGINT,
  "name_of_university_al_farabi_university_college" BIGINT,
  "name_of_university_al_farahidi_university" BIGINT,
  "name_of_university_al_kitab_university" BIGINT,
  "name_of_university_al_maarif_university_college" BIGINT,
  "name_of_university_al_manara_college_for_medical_sciences" BIGINT,
  "name_of_university_al_mustaqbal_university" BIGINT,
  "name_of_university_al_muthanna_university" BIGINT,
  "name_of_university_al_nahrain_university" BIGINT,
  "name_of_university_al_noor_university_college" BIGINT,
  "name_of_university_al_quds_university" BIGINT,
  "name_of_university_alagappa_university" BIGINT,
  "name_of_university_alex_ekwueme_federal_university_ndufu_alike" BIGINT,
  "name_of_university_alexandria_university" BIGINT,
  "name_of_university_alexandru_ioan_cuza_university" BIGINT,
  "name_of_university_alfaisal_university" BIGINT,
  "name_of_university_aligarh_muslim_university" BIGINT,
  "name_of_university_alisher_navo_i_tashkent_state_unive_bc684697" BIGINT,
  "name_of_university_alt_nba_university" BIGINT,
  "name_of_university_alzahra_university" BIGINT,
  "name_of_university_amedeo_avogadro_university_of_easte_2e500f28" BIGINT,
  "name_of_university_american_international_university_bangladesh" BIGINT,
  "name_of_university_american_university" BIGINT,
  "name_of_university_american_university_in_cairo" BIGINT,
  "name_of_university_american_university_of_beirut" BIGINT,
  "name_of_university_american_university_of_madaba" BIGINT,
  "name_of_university_american_university_of_nigeria" BIGINT,
  "name_of_university_american_university_of_ras_al_khaimah" BIGINT,
  "name_of_university_american_university_of_sharjah" BIGINT,
  "name_of_university_amirkabir_university_of_technology" BIGINT,
  "name_of_university_amity_university" BIGINT,
  "name_of_university_amity_university_rajasthan_jaipur" BIGINT,
  "name_of_university_amity_university_gurugram" BIGINT,
  "name_of_university_amity_university_gwalior" BIGINT,
  "name_of_university_amity_university_mumbai" BIGINT,
  "name_of_university_an_najah_national_university" BIGINT,
  "name_of_university_anadolu_university" BIGINT,
  "name_of_university_andhra_university" BIGINT,
  "name_of_university_andijan_state_medical_institute" BIGINT,
  "name_of_university_andr_s_bello_catholic_university_ucab" BIGINT,
  "name_of_university_anglia_ruskin_university_aru" BIGINT,
  "name_of_university_anglo_american_university" BIGINT,
  "name_of_university_ankara_science_university" BIGINT,
  "name_of_university_ankara_university" BIGINT,
  "name_of_university_anna_university" BIGINT,
  "name_of_university_annamalai_university" BIGINT,
  "name_of_university_antonio_nari_o_university" BIGINT,
  "name_of_university_aoyama_gakuin_university" BIGINT,
  "name_of_university_arab_academy_for_science_technology_5c86ad58" BIGINT
  -- listing truncated: 100 of the 2362 columns are shown
);
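Because the university name is spread across many one-hot columns in this schema, recovering it for a given row means finding the `name_of_university_*` column that is set to 1. The sketch below assumes a row has already been loaded as a dict of column name to value; the function name `decode_university` and the sample row values are hypothetical.

```python
NAME_PREFIX = "name_of_university_"

def decode_university(row):
    """Return the slug of the one-hot name column set to 1, or None if none is set."""
    for column, value in row.items():
        if column.startswith(NAME_PREFIX) and value == 1:
            return column[len(NAME_PREFIX):]
    return None

# Illustrative row; the numeric values are made up for the example.
row = {
    "teaching_score": 50.0,
    "name_of_university_aalborg_university": 0,
    "name_of_university_aalto_university": 1,
}
print(decode_university(row))  # prints "aalto_university"
```

Note that the recovered slug is the normalised column suffix, not the original display name, so characters dropped during column naming (for example accented letters) are not restored.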
