Resume Dataset
A collection of Resumes in PDF as well as String format for data extraction.
@kaggle.snehaanbhawal_resume_dataset
A collection of Resumes in PDF as well as String format for data extraction.
@kaggle.snehaanbhawal_resume_dataset
A collection of Resume Examples taken from livecareer.com for categorizing a given resume into any of the labels defined in the dataset.
Contains 2400+ Resumes in string as well as PDF format.
PDF stored in the data folder differentiated into their respective labels as folders with each resume residing inside the folder in pdf form with filename as the id defined in the csv.
Inside the CSV:
Present categories are
HR, Designer, Information-Technology, Teacher, Advocate, Business-Development, Healthcare, Fitness, Agriculture, BPO, Sales, Consultant, Digital-Media, Automobile, Chef, Finance, Apparel, Engineering, Accountant, Construction, Public-Relations, Banking, Arts, Aviation
Data was obtained by scrapping individual resume examples from www.livecareer.com website. Web Scrapping code present in my Github Repo.
CREATE TABLE resume (
"id" BIGINT,
"resume_str" VARCHAR,
"resume_html" VARCHAR,
"category" VARCHAR
);Anyone who has the link will be able to view this.