Baselight

How Much Does Data Scientists Earn In 2017-2020

The result of the developers having data-related job in Stack Overflow Survey

@kaggle.phuchuynguyen_datarelated_developers_survey_by_stack_overflow

About this Dataset

How Much Does Data Scientists Earn In 2017-2020

👉 Hope this helps, please upvote if you find this dataset useful 🙌 🔥 😊

Data-related jobs are considered the most sexiest career of 2021. Because of the increasing demand for data specialists, many professionals expect a
very high compensation for performing jobs in this field. Thus we decided to evaluate the impact different variables have on the expected salary, so we could adjust our expectations to the current international market. This is the dataset processed from the Stack overflow Annual Developers Survey.

Content: There are 2 files

👉 processed_data_toDummies.csv

This is the data pre-processed from the original dataset with all the Developer type converted into dummies variables:

  • Data Scientist or Machine Learning Specialist
  • Database Administrator
  • Data Analyst
  • Business Analyst and Data Engineer)

Together with other variables such as: Country, Education level, Employment, Job Satisfaction, Organization size, Undergraduate major, Year of coding as professionals

👉 survey_final.csv:

The original dataset comes from the Stack overflow Annual Developers Survey, and for this dataset we will only consider the respondents that considered themselves already in a data related job (Data Scientist, Machine Learning Specialist, Database Administrator, Data Analyst, Business Analyst and Data Engineer)

Acknowledgements

This dataset is processed from the Stack Overflow Annual Survey result from 2017 to 2020
The original datasets can be found at: https://insights.stackoverflow.com/survey

Tables

Processed Data Todummies

@kaggle.phuchuynguyen_datarelated_developers_survey_by_stack_overflow.processed_data_todummies
  • 261.83 KB
  • 33601 rows
  • 14 columns
Loading...

CREATE TABLE processed_data_todummies (
  "year" BIGINT,
  "hobbyist" VARCHAR,
  "convertedcomp" DOUBLE,
  "country" VARCHAR,
  "edlevel" VARCHAR,
  "employment" VARCHAR,
  "jobsat" DOUBLE,
  "orgsize" VARCHAR,
  "undergradmajor" VARCHAR,
  "yearscodepro" DOUBLE,
  "data_scientist_or_machine_learning_specialist" BIGINT,
  "database_administrator" BIGINT,
  "data_or_business_analyst" DOUBLE,
  "engineer_data" DOUBLE
);

Survey Final

@kaggle.phuchuynguyen_datarelated_developers_survey_by_stack_overflow.survey_final
  • 8.7 MB
  • 303591 rows
  • 15 columns
Loading...

CREATE TABLE survey_final (
  "year" BIGINT,
  "hobbyist" VARCHAR,
  "convertedcomp" DOUBLE,
  "country" VARCHAR,
  "databasedesirenextyear" VARCHAR,
  "databaseworkedwith" VARCHAR,
  "devtype" VARCHAR,
  "edlevel" VARCHAR,
  "employment" VARCHAR,
  "jobsat" VARCHAR,
  "languagedesirenextyear" VARCHAR,
  "languageworkedwith" VARCHAR,
  "orgsize" VARCHAR,
  "undergradmajor" VARCHAR,
  "yearscodepro" VARCHAR
);