Baselight

Familiist (pro-natalist) Communities In Social Net

Exploring Pregnancy, Childhood, Motherhood, and Paternity

@kaggle.thedevastator_familiist_pro_natalist_vkontakte_comments_and_li

Loading...
Loading...

About this Dataset

Familiist (pro-natalist) Communities In Social Net


Familiist (pro-natalist) communities in social net

Exploring Pregnancy, Childhood, Motherhood, and Paternity

By [source]


About this dataset

This dataset presents an unparalleled collection of comments from Russian-speaking Familiist pro-natalist communities on the social media platform VKontakte. With its focus on pregnancy, childbirth, motherhood, and fatherhood topics, this vast archive offers researchers a powerful data set in which to assess the perception of modern family life within these platforms.

The comprehensive dataset contains UTF-8 formatted .csv files with each entry containing unique identifiers for both the post and user as well as several pre-processed versions of each comment such as removing punctuation marks and lowercasing words, text stemming and lemmatization. Additional data includes datetime stamps and likes counts of every comment.

This cutting edge collection provides a window into contemporary debate over parenthood matters that were previously unobtainable by researchers – uncovering a valuable perspective into how mainstream views are expressed within these online communities. By delving deep into this dataset you can help shape our understanding of how modern online discussion forums perceive pregnancy, childhood, motherhood, paternity - furthering all academic research in these interconnected fields!

More Datasets

For more datasets, click here.

Featured Notebooks

  • 🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset is includes comments from Familiist pro-natalist communities in the social network VKontakte and provides valuable insights into the different topics discussed in these communities such as pregnancy, childhood, motherhood, paternity and more. While this dataset is primarily intended for data analysis purposes by research laboratories, it can also be used by individuals wanting to learn more about these topics or social networks.

The data is provided in .csv format and consists of several columns: link_author (unique identifier of the user who posted the comment); gender (gender of the user); link_comment (unique identifier of the comment); date_time (date and time when the comment was posted); text (the text of the comment after pre-processing); likes (number of likes received by a comment); text_prep (the text after pre-processing), including removing punctuation marks; text stemmized, which includes stemming; text sw for stopwords removal;and finally lemmatization.

By using this data set you will have access to important information related to pregnancy, childhood, motherhood and fatherhood within Familiist pro-natalist social networks users’ discussions on VKontakte. Additionally you will be able to discover new trends on those topics as well as quantity metrics such as number of views comments receive. Use this data set creatively!

Research Ideas

  • Analyzing trends in gender representation amongst users on Familiist Pro-Natalism communities, to understand the male/female conversation dynamics and differences in topics or engagement.
  • Developing sentiment analysis models for automatically detecting the mood of comments posted, to gauge pro-natalism or anti-natalist sentiments.
  • Tracking changes in the likes received for different comments over time, to study popular topics or themes within this domain and their associated engagement levels

Acknowledgements

If you use this dataset in your research, please credit the original authors.
Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: vk_posts_stem_lemm.csv

Column name Description
link_author Unique identifier of the user who posted the comment. (String)
gender Gender of user who posted comment. (String)
link_comment Unique identifier of comment. (String)
date_time Date and time when comment was posted. (String)
text Text of comment before any preprocessing was done. (String)
likes Number of likes that were received on this particular post. (Integer)
text_prep Text after preprocessing was done. (String)
text_stem Text after stemming. (String)
text_sw Text after stopwords removal. (String)
text_lemm Text after lemmatization. (String)

Acknowledgements

If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit .

Tables

Vk Posts Stem Lemm

@kaggle.thedevastator_familiist_pro_natalist_vkontakte_comments_and_li.vk_posts_stem_lemm
  • 57.07 MB
  • 112900 rows
  • 11 columns
Loading...

CREATE TABLE vk_posts_stem_lemm (
  "unnamed_0" BIGINT,
  "link_author" VARCHAR,
  "gender" VARCHAR,
  "link_comment" VARCHAR,
  "date_time" TIMESTAMP,
  "text" VARCHAR,
  "likes" DOUBLE,
  "text_prep" VARCHAR,
  "text_stem" VARCHAR,
  "text_sw" VARCHAR,
  "text_lemm" VARCHAR
);

Share link

Anyone who has the link will be able to view this.