Baselight

Language Detection

A Dataset for Language Detection

@kaggle.basilb2s_language_detection


About this Dataset

This is a small language detection dataset. It contains text samples in 17 different languages, so it can be used to build an NLP model that predicts which of these 17 languages a given text is written in.

Languages

  1. English
  2. Malayalam
  3. Hindi
  4. Tamil
  5. Kannada
  6. French
  7. Spanish
  8. Portuguese
  9. Italian
  10. Russian
  11. Swedish
  12. Dutch
  13. Arabic
  14. Turkish
  15. German
  16. Danish
  17. Greek
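
As an illustration of the kind of model this dataset could support, here is a minimal sketch of a character-trigram naive Bayes classifier written with the Python standard library only. This is not the dataset author's method. The class name `NgramLanguageClassifier`, the helper `char_ngrams`, and the six sample rows are all hypothetical and stand in for the real table contents.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n=3):
    """Return overlapping character n-grams of a lowercased, space-padded string."""
    padded = f" {text.lower()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


class NgramLanguageClassifier:
    """Multinomial naive Bayes over character n-grams with add-one smoothing."""

    def __init__(self, n=3):
        self.n = n
        self.ngram_counts = defaultdict(Counter)  # language -> n-gram counts
        self.doc_counts = Counter()               # language -> number of training rows
        self.vocabulary = set()                   # all n-grams seen during training

    def fit(self, texts, languages):
        for text, language in zip(texts, languages):
            grams = char_ngrams(text, self.n)
            self.ngram_counts[language].update(grams)
            self.doc_counts[language] += 1
            self.vocabulary.update(grams)
        return self

    def predict(self, text):
        grams = char_ngrams(text, self.n)
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocabulary)
        best_language, best_score = None, -math.inf
        for language, counts in self.ngram_counts.items():
            total = sum(counts.values())
            # Log prior plus smoothed log likelihood of each n-gram.
            score = math.log(self.doc_counts[language] / total_docs)
            for gram in grams:
                score += math.log((counts[gram] + 1) / (total + vocab_size))
            if score > best_score:
                best_language, best_score = language, score
        return best_language


# Hypothetical sample rows in the (text, language) shape of the table;
# they are NOT taken from the dataset.
sample_rows = [
    ("the weather is nice today", "English"),
    ("where is the train station", "English"),
    ("le temps est beau aujourd'hui", "French"),
    ("où est la gare s'il vous plaît", "French"),
    ("das wetter ist heute schön", "German"),
    ("wo ist der bahnhof bitte", "German"),
]
texts, languages = zip(*sample_rows)
model = NgramLanguageClassifier().fit(texts, languages)

print(model.predict("the station is nice"))
```

Character n-grams are a common choice for this task because they need no tokenizer and work across scripts, which matters for a label set that mixes Latin, Cyrillic, Arabic, Greek, and Indic scripts. With the full table, the same `fit` call would take the `text` and `language` columns in place of the sample rows.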

Tables

Language Detection

@kaggle.basilb2s_language_detection.language_detection
  • 896.34 KB
  • 10337 rows
  • 2 columns

CREATE TABLE language_detection (
  "text" VARCHAR,
  "language" VARCHAR
);
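
To show how the two-column schema above might be queried, the following sketch recreates it in an in-memory SQLite database and counts rows per language. SQLite is used here only as a local stand-in and is an assumption, not the platform's actual query engine. The three inserted rows are hypothetical and are not taken from the dataset.

```python
import sqlite3

# In-memory database used purely as a local stand-in (assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE language_detection ("text" VARCHAR, "language" VARCHAR)'
)

# Hypothetical rows in the same (text, language) shape as the real table.
rows = [
    ("the weather is nice today", "English"),
    ("where is the train station", "English"),
    ("le temps est beau aujourd'hui", "French"),
]
conn.executemany("INSERT INTO language_detection VALUES (?, ?)", rows)

# Count how many text samples each language has.
query = (
    'SELECT "language", COUNT(*) FROM language_detection '
    'GROUP BY "language" ORDER BY "language"'
)
counts = dict(conn.execute(query).fetchall())
print(counts)
```

Run against the full table, a per-language count like this is a quick way to check class balance before training a classifier.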
