NLP On Research Articles
Multi Label Classification using NLP on Research Articles
@kaggle.vetrirah_janatahack_independence_day_2020_ml_hackathon
Multi Label Classification using NLP on Research Articles
@kaggle.vetrirah_janatahack_independence_day_2020_ml_hackathon
Topic Modeling for Research Articles
Researchers have access to large online archives of scientific articles. As a consequence, finding relevant articles has become more difficult. Tagging or topic modelling provides a way to give token of identification to research articles which facilitates recommendation and search process.
Given the abstract and title for a set of research articles, predict the topics for each article included in the test set.
Note that a research article can possibly have more than 1 topic. The research article abstracts and titles are sourced from the following 6 topics:
CREATE TABLE sample_submission (
"id" BIGINT,
"computer_science" BIGINT,
"physics" BIGINT,
"mathematics" BIGINT,
"statistics" BIGINT,
"quantitative_biology" BIGINT,
"quantitative_finance" BIGINT
);CREATE TABLE test (
"id" BIGINT,
"title" VARCHAR,
"abstract" VARCHAR
);CREATE TABLE train (
"id" BIGINT,
"title" VARCHAR,
"abstract" VARCHAR,
"computer_science" BIGINT,
"physics" BIGINT,
"mathematics" BIGINT,
"statistics" BIGINT,
"quantitative_biology" BIGINT,
"quantitative_finance" BIGINT
);Anyone who has the link will be able to view this.