A well-structured dataset combining GDP and population data for insights.
Dataset Description
This dataset merges global GDP and population data for 2023 into a structured and analysis-ready format. The raw data originally contained inconsistencies like currency symbols, commas, and percentage signs, which have been carefully cleaned and formatted.
Data Cleaning Steps:
Merged GDP and Population data using the country name as the key.
Removed unnecessary characters (e.g., $, %, and commas) from numeric columns.
Converted GDP figures from text (trillion/billion format) into absolute values.
Handled missing values to maintain data consistency.
Rearranged column order for better readability.
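The cleaning steps above can be sketched roughly as follows. This is a minimal illustration with pandas, not the exact script used for this dataset: the helper names `parse_gdp` and `parse_numeric`, the raw column layout, the `million` suffix, the left join, and all sample values (fictional countries) are assumptions made for the example.

```python
import re
import pandas as pd

# Assumed multipliers for the text suffixes found in raw GDP strings.
_SCALE = {"trillion": 1e12, "billion": 1e9, "million": 1e6}

def parse_gdp(text):
    """Convert a string such as '$2.5 trillion' into an absolute USD value."""
    if pd.isna(text):
        return float("nan")
    cleaned = str(text).replace("$", "").replace(",", "").strip().lower()
    match = re.fullmatch(r"([-+]?\d*\.?\d+)\s*(trillion|billion|million)?", cleaned)
    if match is None:
        return float("nan")  # unparseable text is treated as missing
    number, suffix = match.groups()
    return float(number) * _SCALE.get(suffix, 1.0)

def parse_numeric(text):
    """Strip $, % and thousands separators, then convert to float."""
    if pd.isna(text):
        return float("nan")
    cleaned = re.sub(r"[$%,\s]", "", str(text))
    try:
        return float(cleaned)
    except ValueError:
        return float("nan")

# Made-up raw inputs with fictional countries, for illustration only.
gdp_raw = pd.DataFrame({
    "Country": ["Country A", "Country B"],
    "GDP (2023)": ["$2.5 trillion", "$310.4 billion"],
    "GDP Growth": ["2.9%", "-1.2%"],
})
pop_raw = pd.DataFrame({
    "Country": ["Country A", "Country B", "Country C"],
    "Population": ["1,200,000", "45,000,000", "3,500"],
})

# Merge on the country name; a left join keeps countries without GDP data.
merged = pop_raw.merge(gdp_raw, on="Country", how="left")
merged["Population"] = merged["Population"].map(parse_numeric)
merged["GDP (2023)"] = merged["GDP (2023)"].map(parse_gdp)
merged["GDP Growth"] = merged["GDP Growth"].map(parse_numeric)
print(merged)
```

In this sketch, Country C has no GDP row, so its GDP columns end up as NaN. How such gaps were actually handled in the published dataset is not specified here.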
Columns in the Dataset:
Country: Name of the country.
Population: Total population in 2023.
Area: Total land area (sq km).
Languages: Official languages spoken.
Currencies: National currencies used.
Region & Sub-region: Geographical classification.
Capital: Country's capital city.
GDP (2023): Total nominal GDP in USD.
GDP per Capita: GDP divided by population.
GDP Growth: Percentage increase/decrease in GDP.
Share of World GDP: Country's contribution to global GDP.
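As a rough illustration of how the two derived columns relate to the base columns, the sketch below recomputes them with pandas. The column labels are assumed to match the names listed above, the values are made up, and the share is computed against the sum of the listed countries, which may differ from how the published figures were obtained.

```python
import pandas as pd

# Made-up values for three fictional countries.
df = pd.DataFrame({
    "Country": ["Country A", "Country B", "Country C"],
    "Population": [1_000_000, 4_000_000, 5_000_000],
    "GDP (2023)": [2.0e10, 6.0e10, 2.0e10],
})

# GDP per capita: total GDP divided by population.
df["GDP per Capita"] = df["GDP (2023)"] / df["Population"]

# Share of world GDP in percent, assuming "world" means the sum of all rows.
df["Share of World GDP"] = 100 * df["GDP (2023)"] / df["GDP (2023)"].sum()

print(df)
```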
Why Use This Dataset?
This dataset is perfect for data visualization, economic analysis, and machine learning models. You can:
Compare GDP & population across different regions.
Analyze economic growth trends.
Create stunning visualizations (bar charts, pie charts, choropleth maps).
Build predictive models on economic indicators.
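For example, a regional comparison might look something like the sketch below. The `Region` column name is assumed from the column list, the `gdp_per_capita` name is introduced only for this example, and the rows are made-up placeholders rather than values from the dataset.

```python
import pandas as pd

# Made-up rows; in practice these would be loaded from the dataset file.
df = pd.DataFrame({
    "Country": ["Country A", "Country B", "Country C"],
    "Region": ["Region X", "Region X", "Region Y"],
    "Population": [1_000_000, 4_000_000, 5_000_000],
    "GDP (2023)": [2.0e10, 6.0e10, 2.0e10],
})

# Sum population and GDP per region, then derive a regional GDP per capita.
by_region = (
    df.groupby("Region")[["Population", "GDP (2023)"]]
    .sum()
    .assign(gdp_per_capita=lambda t: t["GDP (2023)"] / t["Population"])
    .sort_values("GDP (2023)", ascending=False)
)
print(by_region)
```

Summing before dividing gives a population-weighted regional figure, rather than an average of per-country values.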
Credits & Source:
The original GDP and population data were collected from [reliable global sources] (add the actual sources here).
Cleaning and transformation were done using Python (Pandas, NumPy, Matplotlib/Seaborn for visual checks).