Annotated NER For Indian Language
This dataset will contain Annotated NER data for multiple languages
@kaggle.vpkprasanna_annotated_indian_language_ner
This dataset will contain Annotated NER data for multiple languages
@kaggle.vpkprasanna_annotated_indian_language_ner
There's a story behind every dataset and here's your opportunity to share yours.
The main idea of this dataset os to perform NER on regional languages as well like Tamil,Telugu,Kannada,Malayalam,Hindi and more.
For now have added for Tamil language and in upcoming days I will add more.
We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.
I have done some Named Entity Recognition (NER) on English data , so why can't we do for our regional data. That's how I started this.
CREATE TABLE hindi_ner (
"tokens" VARCHAR,
"tags" VARCHAR
);
CREATE TABLE kannada_ner (
"tokens" VARCHAR,
"tags" VARCHAR
);
CREATE TABLE malayalam_ner (
"tokens" VARCHAR,
"tags" VARCHAR
);
CREATE TABLE tamil_ner (
"name" VARCHAR,
"tags" VARCHAR
);
CREATE TABLE telugu_ner (
"tokens" VARCHAR,
"tags" VARCHAR
);
Anyone who has the link will be able to view this.