Familiist (pro-natalist) communities in social net
Exploring Pregnancy, Childhood, Motherhood, and Paternity
By [source]
About this dataset
This dataset presents an unparalleled collection of comments from Russian-speaking Familiist pro-natalist communities on the social media platform VKontakte. With its focus on pregnancy, childbirth, motherhood, and fatherhood topics, this vast archive offers researchers a powerful data set in which to assess the perception of modern family life within these platforms.
The comprehensive dataset contains UTF-8 formatted .csv files with each entry containing unique identifiers for both the post and user as well as several pre-processed versions of each comment such as removing punctuation marks and lowercasing words, text stemming and lemmatization. Additional data includes datetime stamps and likes counts of every comment.
This cutting edge collection provides a window into contemporary debate over parenthood matters that were previously unobtainable by researchers – uncovering a valuable perspective into how mainstream views are expressed within these online communities. By delving deep into this dataset you can help shape our understanding of how modern online discussion forums perceive pregnancy, childhood, motherhood, paternity - furthering all academic research in these interconnected fields!
More Datasets
For more datasets, click here.
Featured Notebooks
- 🚨 Your notebook can be here! 🚨!
How to use the dataset
This dataset is includes comments from Familiist pro-natalist communities in the social network VKontakte and provides valuable insights into the different topics discussed in these communities such as pregnancy, childhood, motherhood, paternity and more. While this dataset is primarily intended for data analysis purposes by research laboratories, it can also be used by individuals wanting to learn more about these topics or social networks.
The data is provided in .csv format and consists of several columns: link_author (unique identifier of the user who posted the comment); gender (gender of the user); link_comment (unique identifier of the comment); date_time (date and time when the comment was posted); text (the text of the comment after pre-processing); likes (number of likes received by a comment); text_prep (the text after pre-processing), including removing punctuation marks; text stemmized, which includes stemming; text sw for stopwords removal;and finally lemmatization.
By using this data set you will have access to important information related to pregnancy, childhood, motherhood and fatherhood within Familiist pro-natalist social networks users’ discussions on VKontakte. Additionally you will be able to discover new trends on those topics as well as quantity metrics such as number of views comments receive. Use this data set creatively!
Research Ideas
- Analyzing trends in gender representation amongst users on Familiist Pro-Natalism communities, to understand the male/female conversation dynamics and differences in topics or engagement.
- Developing sentiment analysis models for automatically detecting the mood of comments posted, to gauge pro-natalism or anti-natalist sentiments.
- Tracking changes in the likes received for different comments over time, to study popular topics or themes within this domain and their associated engagement levels
Acknowledgements
If you use this dataset in your research, please credit the original authors.
Data Source
License
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
Columns
File: vk_posts_stem_lemm.csv
Column name |
Description |
link_author |
Unique identifier of the user who posted the comment. (String) |
gender |
Gender of user who posted comment. (String) |
link_comment |
Unique identifier of comment. (String) |
date_time |
Date and time when comment was posted. (String) |
text |
Text of comment before any preprocessing was done. (String) |
likes |
Number of likes that were received on this particular post. (Integer) |
text_prep |
Text after preprocessing was done. (String) |
text_stem |
Text after stemming. (String) |
text_sw |
Text after stopwords removal. (String) |
text_lemm |
Text after lemmatization. (String) |
Acknowledgements
If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit .