Gender Gap In Spanish Wikipedia
Estimate number of women editors & their editing practices in Spanish Wikipedia
@kaggle.joebeachcapital_gender_gap_in_spanish_wikipedia
Estimate number of women editors & their editing practices in Spanish Wikipedia
@kaggle.joebeachcapital_gender_gap_in_spanish_wikipedia
Data set used to estimate the number of women editors and their editing practices in the Spanish Wikipedia.
Introductory Paper
Exploring the gender gap in the Spanish Wikipedia: Differences in engagement and editing practices
By J. Minguillón, J. Meneses, E. Aibar, Núria Ferran-Ferrer, Sergi Fàbregues. 2021
Published in PLoS ONE
Variable Information
gender: 0 (unknown), 1 (male), 2 (female)
C_api: gender extracted from WikiMedia API, codes as female / male / unknown
C_man: gender extracted from content coding, coded as 1 (male) / 2 (female) / 3 (unknown)
E_NEds: I index of stratum IJ (0,1,2,3)
E_Bpag: J index of stratum IJ (0,1,2,3)
firstDay: first edition in the Spanish Wikipedia (YYYYMMDDHHMMSS)
lastDay: last edition in the Spanish Wikipedia (YYYYMMDDHHMMSS)
NEds: total number of editions
NDays: number of days (lastDay-firstDay+1)
NActDays: number of days with editions
NPages: number of different pages edited
NPcreated: number of pages created
pagesWomen: number of edits in pages related to women
wikiprojWomen: number of edits in WikiProjects related to women
ns_user: number of edits in namespace user
ns_wikipedia: number of edits in namespace wikipedia
ns_talk: number of edits in namespace talk
ns_userTalk: number of edits in namespace user talk
ns_content: number of edits in content pages
weightIJ: correcting weight for stratum IJ
NIJ: number of elements in stratum IJ
CREATE TABLE data (
"gender" BIGINT,
"c_api" VARCHAR,
"c_man" BIGINT,
"e_neds" BIGINT,
"e_bpag" BIGINT,
"firstday" BIGINT,
"lastday" BIGINT,
"neds" BIGINT,
"ndays" BIGINT,
"nactdays" BIGINT,
"npages" BIGINT,
"npcreated" BIGINT,
"pageswomen" BIGINT,
"wikiprojwomen" BIGINT,
"ns_user" BIGINT,
"ns_wikipedia" BIGINT,
"ns_talk" BIGINT,
"ns_usertalk" BIGINT,
"ns_content" BIGINT,
"weightij" DOUBLE,
"nij" BIGINT
);Anyone who has the link will be able to view this.