This dataset is a collection of news articles in the Bosnian language sourced from klix.ba, a prominent Bosnian online news portal. It covers a wide range of topics, including local and international news, politics, economics, sports, and entertainment.
Dataset Contents
The dataset has the following characteristics:
- Number of Articles: 786,755
- Language: Bosnian
- Source: klix.ba
- Topics: Various (news, politics, economics, sports, entertainment, etc.)
Potential Uses
This dataset can be used for natural language processing tasks such as text classification, sentiment analysis, and topic modeling. Additional metadata columns, such as the number of comments and shares, also make it possible to relate article content to reader engagement.
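As a rough illustration of how the comment and share counts could support such an analysis, the sketch below averages engagement per topic over a few made-up records. The field names `title`, `category`, `comments`, and `shares` are assumptions chosen for the example, as is the helper `mean_engagement_by_category`; check the actual column names in the dataset before adapting it.

```python
from collections import defaultdict
from statistics import mean

# Made-up records for illustration only. The field names ("title",
# "category", "comments", "shares") are assumptions, not the dataset's
# documented schema.
articles = [
    {"title": "Article A", "category": "sport", "comments": 10, "shares": 5},
    {"title": "Article B", "category": "politics", "comments": 40, "shares": 15},
    {"title": "Article C", "category": "sport", "comments": 30, "shares": 25},
    {"title": "Article D", "category": "politics", "comments": 20, "shares": 35},
]


def mean_engagement_by_category(rows):
    """Average the comment and share counts of the articles in each category."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["category"]].append(row)
    return {
        category: {
            "comments": mean(r["comments"] for r in group),
            "shares": mean(r["shares"] for r in group),
        }
        for category, group in grouped.items()
    }


summary = mean_engagement_by_category(articles)
for category, stats in sorted(summary.items()):
    print(category, stats)
```

With the real data, the same grouping could be applied after loading the articles into a list of dictionaries or a data frame, replacing the assumed field names with the actual ones.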
Accessing the Dataset
You can access and explore the dataset on GitHub, Kaggle, and Hugging Face.
This dataset is intended solely for research purposes and is not affiliated with or endorsed by klix.ba.