This dataset will contain Annotated NER data for multiple languages

Context

There's a story behind every dataset and here's your opportunity to share yours.
The main idea of this dataset os to perform NER on regional languages as well like Tamil,Telugu,Kannada,Malayalam,Hindi and more.

Content

For now have added for Tamil language and in upcoming days I will add more.

Acknowledgements

We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

Inspiration

I have done some Named Entity Recognition (NER) on English data , so why can't we do for our regional data. That's how I started this.

Related Datasets

Multilingual NER Dataset

@kaggle
ISO 639 Languages

@blt
Ethnic Power Relations Dataset (ETH, 2021)

@owid
Primary Written Language Of Applicants For Insurance Affordability Programs

@usgov
AI Performance On Language Tasks

@owid
Eucalyptus Growth And Environmental Data

@euremarkable

Multilingual NER Dataset

ISO 639 Languages

Ethnic Power Relations Dataset (ETH, 2021)

Primary Written Language Of Applicants For Insurance Affordability Programs

AI Performance On Language Tasks

Eucalyptus Growth And Environmental Data