Baselight

Additional Resources For Kiva Crowdfunding

Region inclusion for Kiva locations with poverty decomposition

@kaggle.lucian18_mpi_on_regions

About this Dataset

Additional Resources For Kiva Crowdfunding

Context

This dataset contains the locations found in the Kiva datasets included in an administrative or geographical region. You can also find poverty data about this region. This facilitates answering some of the tough questions about a region's poverty.

Content

In the interest of preserving the original names and spelling for the locations/countries/regions all the data is in Excel format and has no preview (I think only the Kaggle recommended file types have preview - if anyone can show me how to do this for an xlsx file, it will be greatly appreciated)

The Tables datasets contain the most recent analysis of the MPI on countries and regions. These datasets are updated regularly.
In unique regions_names_from_google_api you will find 3 levels of inclusion for every geocode provided in Kiva datasets. (village/town, administrative region, sub-national region - which can be administrative or geographical). These are the results from the Google API Geocoding process.

Files:

  • all_kiva_loans.csv

Dropped multiple columns, kept all the rows from loans.csv with names, tags, descriptions and got a csv file of 390MB instead of 2.13 GB. Basically is a simplified version of loans.csv (originally included in the analysis by beluga)

  • country_stats.csv
  1. population source: https://en.wikipedia.org/wiki/List_of_countries_by_population_(United_Nations)
  2. population_below_poverty_line: Percentage
  3. hdi: Human Development Index
  4. life_expectancy: Life expectancy at birth
  5. expected_years_of_schooling: Expected years of schooling
  6. mean_years_of_schooling: Mean years of schooling
  7. gni: Gross national income (GNI) per capita
    This dataset was originally created by beluga.
  • all_loan_theme_merged_with_geo_mpi_regions.xlsx

This is the loan_themes_by_region left joined with Tables_5.3_Contribution_of_Deprivations. (all the original entries from loan_themes and only the entries that match from Tables_5; for the regions that lack MPI data, you will find Nan)

These are the columns in the database:

  1. Partner ID
  2. Field Partner
  3. Name
  4. sector
  5. Loan Theme ID
  6. Loan Theme Type
  7. Country
  8. forkiva
  9. number
  10. amount
  11. geo
  12. rural_pct
  13. City
  14. Administrative region
  15. Sub-national region
  16. ISO
  17. World region
  18. Population Share of the Region (%)
  19. region MPI
  20. Education (%)
  21. Health (%)
  22. Living standards (%)
  23. Schooling (%)
  24. Child school attendance (%)
  25. Child Mortality (%)
  26. Nutrition (%)
  27. Electricity (%)
  28. Improved sanitation (%)
  29. Drinking water (%)
  30. Floor (%)
  31. Cooking fuel (%)
  32. Asset ownership (%)
  • mpi_on_regions.xlsx

Matched the loans in loan_themes_by_region with the regions that have info regarding MPI. This dataset brings together the amount invested in a region and the biggest problems the said region has to deal with. It is a join between the loan_themes_by_region provided by Kiva and Tables 5.3 Contribution_of_Deprivations.

It is a subset of the all_loan_theme_merged_with_geo_mpi_regions.xlsx, which contains only the entries that I could match with poverty decomposition data. It has the same columns.

  • Tables_5_SubNational_Decomposition_MPI_2017-18.xlsx

Multidimensional poverty index decomposition for over 1000 regions part of 79 countries.

Table 5.3: Contribution of deprivations to the MPI, by sub-national regions
This table shows which dimensions and indicators contribute most to a region's MPI, which is useful for understanding the major source(s) of deprivation in a sub-national region.

Source: http://ophi.org.uk/multidimensional-poverty-index/global-mpi-2016/

  • Tables_7_MPI_estimations_country_levels.xlsx

MPI decomposition for 120 countries.

Table 7 All Published MPI Results since 2010
The table presents an archive of all MPI estimations published over the past 5 years, together with MPI, H, A and censored headcount ratios. For comparisons over time please use Table 6, which is strictly harmonised. The full set of data tables for each year published (Column A), is found on the 'data tables' page under 'Archive'.

The data in this file is shown in interactive plots on Oxford Poverty and Human Development Initiative website.
http://www.dataforall.org/dashboard/ophi/index.php/

  • unique_regions_from_kiva_loan_themes.xlsx

These are all the regions corresponding to the geocodes found in Kiva's loan_themes_by_region.
There are 718 unique entries, that you can join with any database from Kiva that has either a coordinates or region column.
Columns:

  • geo: pair of Lat, Lon (from loan_themes_by_region)

  • City: name of the city (has the most NaN's)

  • Administrative region: first level of administrative inclusion for the city/location;
    (the equivalent of county for US)

  • Sub-national region: second level of administrative inclusion for the geo pair. (like state for US)

  • Country: name of the country

Acknowledgements

Thanks to Shane Lynn for the batch geocoding and to Joseph Deferio for reverse geocoding:

https://www.shanelynn.ie/batch-geocoding-in-python-with-google-geocoding-api/

https://github.com/jdeferio/Reverse_Geocode

The MPI datasets you can find on the Oxford website (http://ophi.org.uk/) under Research.

"Citation: Alkire, S. and Kanagaratnam, U. (2018)

“Multidimensional Poverty Index Winter 2017-18: Brief methodological note and results.” Oxford Poverty and Human Development Initiative, University of Oxford, OPHI Methodological Notes 45."

Share link

Anyone who has the link will be able to view this.