Baselight

Ukraine War

Reddit Posts and Comments Dataset on the Ukraine War

@kaggle.sinatavakoli_ukraine_war

Loading...
Loading...

About this Dataset

Ukraine War

This dataset offers a comprehensive collection of data pertaining to the Ukraine war, meticulously sourced from Reddit, a prominent online platform for discussions, news, and information sharing. It comprises two primary CSV files: 'ukrainewar_full.csv' and 'comments_data.csv', each offering a unique perspective on discussions and interactions surrounding this critical global issue.

Data Source: The data is derived from Reddit, a dynamic platform known for its diverse and active communities. Reddit provides a valuable lens through which to observe public sentiments, opinions, and conversations regarding events such as the Ukraine war.

Data Structure: 'ukrainewar_full.csv' encompasses an array of post-related attributes, including the subreddit, selftext (post content), author_fullname, title, upvote_ratio, ups, created (date of post creation), created_utc, num_comments, author (post author's username), and id (a unique post identifier). 'comments_data.csv' delves into the realm of comments on these posts, offering details such as comment_id, comment_body, and post_id, thereby enabling in-depth analysis of engagement within Reddit communities.

Data Preprocessing: To ensure data quality and prevent redundancy, we have meticulously removed duplicate posts based on their unique 'id' attribute. This allows for a dataset free from repetitive content and more conducive to meaningful analysis.

Collection Date: The dataset was meticulously collected during [mention the date or time frame], capturing discussions and responses related to the Ukraine war within that specific time period. This temporal focus provides valuable insights into how opinions and discussions evolve over time.

Use Cases: This dataset offers an abundance of analytical opportunities. Researchers and data analysts can utilize it for tasks such as sentiment analysis, topic modeling, trend analysis, and more. It provides a unique opportunity to gain insights into how the Reddit community engages with and reacts to events such as the Ukraine war.

Data Quality: While we have made diligent efforts to ensure data accuracy, it is important to acknowledge that Reddit data can reflect the unique biases and dynamics inherent in user-generated content. Researchers should be aware of these nuances when interpreting the data.

Dataset Purpose: We have made this dataset available to support research, analysis, and a broader understanding of how online communities respond to and discuss the Ukraine war. We hope it proves to be a valuable resource for a range of applications, shedding light on public sentiment and interactions surrounding this important global issue.

Tables

Comments Data

@kaggle.sinatavakoli_ukraine_war.comments_data
  • 3.09 MB
  • 18183 rows
  • 3 columns
Loading...

CREATE TABLE comments_data (
  "comment_id" VARCHAR,
  "comment_body" VARCHAR,
  "post_id" VARCHAR
);

Edgelist

@kaggle.sinatavakoli_ukraine_war.edgelist
  • 134.4 KB
  • 77722 rows
  • 3 columns
Loading...

CREATE TABLE edgelist (
  "source" VARCHAR,
  "target" VARCHAR,
  "weight" BIGINT
);

Nodes

@kaggle.sinatavakoli_ukraine_war.nodes
  • 19.72 KB
  • 1083 rows
  • 2 columns
Loading...

CREATE TABLE nodes (
  "id" VARCHAR,
  "label" VARCHAR
);

Original Data

@kaggle.sinatavakoli_ukraine_war.original_data
  • 170.3 KB
  • 10000 rows
  • 12 columns
Loading...

CREATE TABLE original_data (
  "unnamed_0" BIGINT,
  "subreddit" VARCHAR,
  "selftext" VARCHAR,
  "author_fullname" VARCHAR,
  "title" VARCHAR,
  "upvote_ratio" DOUBLE,
  "ups" BIGINT,
  "created" BIGINT,
  "created_utc" BIGINT,
  "num_comments" BIGINT,
  "author" VARCHAR,
  "id" VARCHAR
);

Text Filtered Data

@kaggle.sinatavakoli_ukraine_war.text_filtered_data
  • 125.25 KB
  • 4137 rows
  • 9 columns
Loading...

CREATE TABLE text_filtered_data (
  "unnamed_0" BIGINT,
  "id" VARCHAR,
  "author" VARCHAR,
  "subreddit" VARCHAR,
  "title" VARCHAR,
  "selftext" VARCHAR,
  "upvote_ratio" DOUBLE,
  "num_comments" BIGINT,
  "clean_text" VARCHAR
);

Words

@kaggle.sinatavakoli_ukraine_war.words
  • 18.1 KB
  • 1083 rows
  • 3 columns
Loading...

CREATE TABLE words (
  "id" BIGINT,
  "lable" VARCHAR,
  "weight" BIGINT
);

Share link

Anyone who has the link will be able to view this.