Baselight

Kaggle Survey From 2017 To 2022

collection of consecutive 5 years of survey data by kaggle

@kaggle.shievam_kaggle_survey_from_2017_to_2022

Surveyschema
@kaggle.shievam_kaggle_survey_from_2017_to_2022.surveyschema

  • 50.38 KB
  • 12 rows
  • 52 columns
n_2018_kaggle_machine_learning_and_data_science_survey

2018 Kaggle Machine Learning And Data Science Survey

q1

Q1

q10

Q10

q11

Q11

q12

Q12

q13

Q13

q14

Q14

q15

Q15

q16

Q16

q17

Q17

q18

Q18

q19

Q19

q2

Q2

q20

Q20

q21

Q21

q22

Q22

q23

Q23

q24

Q24

q25

Q25

q26

Q26

q27

Q27

q28

Q28

q29

Q29

q3

Q3

q30

Q30

q31

Q31

q32

Q32

q33

Q33

q34

Q34

q35

Q35

q36

Q36

q37

Q37

q38

Q38

q39

Q39

q4

Q4

q40

Q40

q41

Q41

q42

Q42

q43

Q43

q44

Q44

q45

Q45

q46

Q46

q47

Q47

q48

Q48

q49

Q49

q5

Q5

q50

Q50

q6

Q6

q7

Q7

q8

Q8

q9

Q9

time_from_start_to_finish_seconds

Time From Start To Finish (seconds)

Question:What is your gender? - Selected ChoiceDoes your current employer incorporate machine learning methods into their business?Select any activities that make up an important part of your role at work: (Select all that apply) - Selected ChoiceWhat is the primary tool that you use at work or school to analyze data? (include text response) - Selected ChoiceWhich of the following integrated development environments (IDE's) have you used at work or school in the last 5 years? (Select all that apply) - Selected ChoiceWhich of the following hosted notebooks have you used at work or school in the last 5 years? (Select all that apply) - Selected ChoiceWhich of the following cloud computing services have you used at work or school in the last 5 years? (Select all that apply) - Selected ChoiceWhat programming languages do you use on a regular basis? (Select all that apply) - Selected ChoiceWhat specific programming language do you use most often? - Selected ChoiceWhat programming language would you recommend an aspiring data scientist to learn first? - Selected ChoiceWhat machine learning frameworks have you used in the past 5 years? (Select all that apply) - Selected ChoiceWhat is your age (# years)?Of the choices that you selected in the previous question, which ML library have you used the most? - Selected ChoiceWhat data visualization libraries or tools have you used in the past 5 years? (Select all that apply) - Selected ChoiceOf the choices that you selected in the previous question, which specific data visualization library or tool have you used the most? - Selected ChoiceApproximately what percent of your time at work or school is spent actively coding?How long have you been writing code to analyze data?For how many years have you used machine learning methods (at work or in school)?Do you consider yourself to be a data scientist?Which of the following cloud computing products have you used at work or school in the last 5 years (Select all that apply)? - Selected ChoiceWhich of the following machine learning products have you used at work or school in the last 5 years? (Select all that apply) - Selected ChoiceWhich of the following relational database products have you used at work or school in the last 5 years? (Select all that apply) - Selected ChoiceIn which country do you currently reside?Which of the following big data and analytics products have you used at work or school in the last 5 years? (Select all that apply) - Selected ChoiceWhich types of data do you currently interact with most often at work or school? (Select all that apply) - Selected ChoiceWhat is the type of data that you currently interact with most often at work or school? - Selected ChoiceWhere do you find public datasets? (Select all that apply) - Selected ChoiceDuring a typical data science project at work or school, approximately what proportion of your time is devoted to the following? (Answers must add up to 100%) - Cleaning dataWhat percentage of your current machine learning/data science training falls under each category? (Answers must add up to 100%) - Self-taughtOn which online platforms have you begun or completed data science courses? (Select all that apply) - Selected ChoiceOn which online platform have you spent the most amount of time? - Selected ChoiceWho/what are your favorite media sources that report on data science topics? (Select all that apply) - Selected ChoiceHow do you perceive the quality of online learning platforms and in-person bootcamps as compared to the quality of the education provided by traditional brick and mortar institutions? - Online learning platforms and MOOCs:What is the highest level of formal education that you have attained or plan to attain within the next 2 years?Which better demonstrates expertise in data science: academic achievements or independent projects? - Your views:How do you perceive the importance of the following topics? - Fairness and bias in ML algorithms:What metrics do you or your organization use to determine whether or not your models were successful? (Select all that apply) - Selected ChoiceApproximately what percent of your data projects involved exploring unfair bias in the dataset and/or algorithm?What do you find most difficult about ensuring that your algorithms are fair and unbiased? (Select all that apply)In what circumstances would you explore model insights and interpret your model's predictions? (Select all that apply)Approximately what percent of your data projects involve exploring model insights?What methods do you prefer for explaining and/or interpreting decisions that are made by ML models? (Select all that apply) - Selected ChoiceDo you consider ML models to be "black boxes" with outputs that are difficult or impossible to explain?What tools and methods do you use to make your work easy to reproduce? (Select all that apply) - Selected ChoiceWhich best describes your undergraduate major? - Selected ChoiceWhat barriers prevent you from making your work even easier to reuse and reproduce? (Select all that apply) - Selected ChoiceSelect the title most similar to your current role (or most recent title if retired): - Selected ChoiceIn what industry is your current employer/contract (or your most recent employer if retired)? - Selected ChoiceHow many years of experience do you have in your current role?What is your current yearly compensation (approximate $USD)?Duration (in seconds)
# of Respondents:2386020670195181919919117189711886418828152231878918697238601299018593121851854818534184921848111060108871071923860974616922138791681615938157461567296711633815980234391588014937135841312013344136531329013418133691289122948128142290121686211022018623860
Who was excluded? (0 = not excluded; 1 = excluded)000000000000000000000000000000000000000000000000000
If What is your age (# years)? 0-17 Is Selected Edit Condition011111111110111111111111111111111111111111111111110
If What is the highest level of formal education that you have attained or plan to attain within the... No formal education past high school Is Selected Edit Condition000000000000000000000000000000000000000000001000000
If Select the title most similar to your current role (or most recent title if retired): Not employed Is Selected Edit Condition011000000000000000000000000000000000000000000011110
Or How long have you been writing code to analyze data? I have never written code and I do not want to learn Is Selected Edit Condition000000000000000001111101111111100000111111110100000
If How do you perceive the importance of the following topics? Fairness and bias in ML algorithms: - No opinion; I do not know Is SelectedEdit Condition000000000000000000000000000000000000111000000000000
If How do you perceive the importance of the following topics? Being able to explain ML model outputs and/or predictions - No opinion; I do not know Is Selected Edit Condition000000000000000000000000000000000000000111100000000
If How do you perceive the importance of the following topics? Reproducibility in data science - No opinion; I do not know Is SelectedEdit Condition000000000000000000000000000000000000000000010100000

CREATE TABLE surveyschema (
  "n_2018_kaggle_machine_learning_and_data_science_survey" VARCHAR,
  "q1" VARCHAR,
  "q10" VARCHAR,
  "q11" VARCHAR,
  "q12" VARCHAR,
  "q13" VARCHAR,
  "q14" VARCHAR,
  "q15" VARCHAR,
  "q16" VARCHAR,
  "q17" VARCHAR,
  "q18" VARCHAR,
  "q19" VARCHAR,
  "q2" VARCHAR,
  "q20" VARCHAR,
  "q21" VARCHAR,
  "q22" VARCHAR,
  "q23" VARCHAR,
  "q24" VARCHAR,
  "q25" VARCHAR,
  "q26" VARCHAR,
  "q27" VARCHAR,
  "q28" VARCHAR,
  "q29" VARCHAR,
  "q3" VARCHAR,
  "q30" VARCHAR,
  "q31" VARCHAR,
  "q32" VARCHAR,
  "q33" VARCHAR,
  "q34" VARCHAR,
  "q35" VARCHAR,
  "q36" VARCHAR,
  "q37" VARCHAR,
  "q38" VARCHAR,
  "q39" VARCHAR,
  "q4" VARCHAR,
  "q40" VARCHAR,
  "q41" VARCHAR,
  "q42" VARCHAR,
  "q43" VARCHAR,
  "q44" VARCHAR,
  "q45" VARCHAR,
  "q46" VARCHAR,
  "q47" VARCHAR,
  "q48" VARCHAR,
  "q49" VARCHAR,
  "q5" VARCHAR,
  "q50" VARCHAR,
  "q6" VARCHAR,
  "q7" VARCHAR,
  "q8" VARCHAR,
  "q9" VARCHAR,
  "time_from_start_to_finish_seconds" VARCHAR
);

Share link

Anyone who has the link will be able to view this.