2021 PGA Tour Stats
Stats for the 2021 PGA Tour Season
@kaggle.lilcobeeee_2021_pga_tour_stats
Stats for the 2021 PGA Tour Season
@kaggle.lilcobeeee_2021_pga_tour_stats
I have seen some other datasets for various PGA Tour Stats and I noticed there wasn't one for 2021. I built a scraper to pull the information from the PGA Tour website and then did my own manipulation on the data to get it in the right format as well as make custom column names for reference.
Inside the data set you'll find the first column with the player names for 197 players on the PGA Tour. And then each column is named like the following:
"Data Title---Data Subcategory---Data Category." For example this is how the column "AVERAGE---sg: approach the green---APPROACH THE GREEN" would read, Average is the average value for SG on Approach the Green under the tab APPROACH THE GREEN on the PGA Tour Stats site. The columns are broken down like this because the Data Titles have shared names across the Data Subcategories, to avoid confusion or miss represented data I labeled them accordingly.
Original Site here: https://www.pgatour.com/stats.html
Have to acknowledge Steve Russo and his data set Here
The inspiration for this data set came from this webscraper created by Steve Russo. https://github.com/sjrusso8/pga-scraper
I will upload my version to my Github here shortly.
CREATE TABLE n_2021_pga_tour_filtered_stats (
"player_name" VARCHAR,
"rounds_sg_approach_the_green_approach_the_green" BIGINT -- ROUNDS---sg: Approach The Green---APPROACH THE GREEN,
"average_sg_approach_the_green_approach_the_green" DOUBLE -- AVERAGE---sg: Approach The Green---APPROACH THE GREEN,
"total_sg_app_sg_approach_the_green_approach_the_green" DOUBLE -- TOTAL SG:APP---sg: Approach The Green---APPROACH THE GREEN,
"measured_rounds_sg_approach_the_green_approach_the_green" BIGINT -- MEASURED ROUNDS---sg: Approach The Green---APPROACH THE GREEN,
"n__greens_in_regulation_percentage_approach_the_green" DOUBLE -- %---greens In Regulation Percentage---APPROACH THE GREEN,
"greens_hit_greens_in_regulation_percentage_approach_the_green" BIGINT -- GREENS HIT---greens In Regulation Percentage---APPROACH THE GREEN,
"n__holes_greens_in_regulation_percentage_approach_the_green" BIGINT -- # HOLES---greens In Regulation Percentage---APPROACH THE GREEN,
"relative_par_greens_in_regulation_percentage_approach_the_green" DOUBLE -- RELATIVE/PAR---greens In Regulation Percentage---APPROACH THE GREEN,
"avg_average_distance_of_birdie_putts_made_putting" VARCHAR -- AVG---average Distance Of Birdie Putts Made---PUTTING,
"tot_dist_ft_average_distance_of_birdie_putts_made_putting" DOUBLE -- TOT DIST. (FT)---average Distance Of Birdie Putts Made---PUTTING,
"tot_birdie_putts_made_average_distance_of_birdie_putts_9aac35d1" BIGINT -- TOT BIRDIE PUTTS MADE---average Distance Of Birdie Putts Made---PUTTING,
"longest_made_ft_average_distance_of_birdie_putts_made_putting" BIGINT -- LONGEST MADE (FT)---average Distance Of Birdie Putts Made---PUTTING,
"shortest_made_ft_average_distance_of_birdie_putts_made_putting" VARCHAR -- SHORTEST MADE (FT)---average Distance Of Birdie Putts Made---PUTTING,
"avg_average_putting_distance_all_1_putts_putting" VARCHAR -- AVG---average Putting Distance - All 1 Putts---PUTTING,
"distance_feet_average_putting_distance_all_1_putts_putting" DOUBLE -- DISTANCE (FEET)---average Putting Distance - All 1 Putts---PUTTING,
"total_putts_average_putting_distance_all_1_putts_putting" BIGINT -- TOTAL PUTTS---average Putting Distance - All 1 Putts---PUTTING,
"avg_putts_per_round_round_3_putting" DOUBLE -- AVG---putts Per Round - Round 3---PUTTING,
"total_putts_putts_per_round_round_3_putting" BIGINT -- TOTAL PUTTS---putts Per Round - Round 3---PUTTING,
"total_rounds_putts_per_round_round_3_putting" BIGINT -- TOTAL ROUNDS---putts Per Round - Round 3---PUTTING,
"avg_overall_putting_average_putting" DOUBLE -- AVG---overall Putting Average---PUTTING,
"n__of_putts_overall_putting_average_putting" BIGINT -- # OF PUTTS---overall Putting Average---PUTTING,
"n__of_holes_overall_putting_average_putting" BIGINT -- # OF HOLES---overall Putting Average---PUTTING,
"avg_putts_per_round_round_2_putting" DOUBLE -- AVG---putts Per Round - Round 2---PUTTING,
"total_putts_putts_per_round_round_2_putting" BIGINT -- TOTAL PUTTS---putts Per Round - Round 2---PUTTING,
"total_rounds_putts_per_round_round_2_putting" BIGINT -- TOTAL ROUNDS---putts Per Round - Round 2---PUTTING,
"avg_putts_per_round_round_1_putting" DOUBLE -- AVG---putts Per Round - Round 1---PUTTING,
"total_putts_putts_per_round_round_1_putting" BIGINT -- TOTAL PUTTS---putts Per Round - Round 1---PUTTING,
"total_rounds_putts_per_round_round_1_putting" BIGINT -- TOTAL ROUNDS---putts Per Round - Round 1---PUTTING,
"avg_putts_per_round_putting" DOUBLE -- AVG---putts Per Round---PUTTING,
"total_putts_putts_per_round_putting" BIGINT -- TOTAL PUTTS---putts Per Round---PUTTING,
"total_rounds_putts_per_round_putting" BIGINT -- TOTAL ROUNDS---putts Per Round---PUTTING,
"low_total_putts_putts_per_round_putting" BIGINT -- LOW TOTAL PUTTS---putts Per Round---PUTTING,
"avg_putts_per_round_round_4_putting" DOUBLE -- AVG---putts Per Round - Round 4---PUTTING,
"total_putts_putts_per_round_round_4_putting" BIGINT -- TOTAL PUTTS---putts Per Round - Round 4---PUTTING,
"total_rounds_putts_per_round_round_4_putting" BIGINT -- TOTAL ROUNDS---putts Per Round - Round 4---PUTTING,
"avg_3_putts_per_round_putting" DOUBLE -- AVG---3-putts Per Round---PUTTING,
"total_3_putts_3_putts_per_round_putting" BIGINT -- TOTAL 3 PUTTS---3-putts Per Round---PUTTING,
"total_rounds_3_putts_per_round_putting" BIGINT -- TOTAL ROUNDS---3-putts Per Round---PUTTING,
"n__3_putt_avoidance_putting" DOUBLE -- %---3-putt Avoidance---PUTTING,
"total_3_putts_3_putt_avoidance_putting" BIGINT -- TOTAL 3 PUTTS---3-putt Avoidance---PUTTING,
"total_holes_overall_3_putt_avoidance_putting" BIGINT -- TOTAL HOLES (OVERALL)---3-putt Avoidance---PUTTING,
"total_total_1_putts_inside_5_putting" BIGINT -- TOTAL---total 1 Putts - Inside 5\u0027---PUTTING,
"total_for_all_distances_total_1_putts_inside_5_putting" BIGINT -- TOTAL (FOR ALL DISTANCES)---total 1 Putts - Inside 5\u0027---PUTTING,
"pct_of_all_1_putts_total_1_putts_inside_5_putting" DOUBLE -- PCT OF ALL 1-PUTTS---total 1 Putts - Inside 5\u0027---PUTTING,
"average_sg_putting_putting" DOUBLE -- AVERAGE---sg: Putting---PUTTING,
"total_sg_putting_sg_putting_putting" DOUBLE -- TOTAL SG:PUTTING---sg: Putting---PUTTING,
"measured_rounds_sg_putting_putting" BIGINT -- MEASURED ROUNDS---sg: Putting---PUTTING,
"avg_1_putts_per_round_putting" DOUBLE -- AVG---1-putts Per Round---PUTTING,
"total_1_putts_1_putts_per_round_putting" BIGINT -- TOTAL 1 PUTTS---1-putts Per Round---PUTTING,
"total_rounds_1_putts_per_round_putting" BIGINT -- TOTAL ROUNDS---1-putts Per Round---PUTTING,
"n__one_putt_percentage_putting" DOUBLE -- %---one-putt Percentage---PUTTING,
"n__of_1_putts_one_putt_percentage_putting" BIGINT -- # OF 1 PUTTS---one-putt Percentage---PUTTING,
"n__of_holes_one_putt_percentage_putting" BIGINT -- # OF HOLES---one-putt Percentage---PUTTING,
"avg_dtp_proximity_to_hole_arg_around_the_green" VARCHAR -- AVG DTP---proximity To Hole (arg)---AROUND THE GREEN,
"total_distance_feet_proximity_to_hole_arg_around_the_green" DOUBLE -- TOTAL DISTANCE (FEET)---proximity To Hole (arg)---AROUND THE GREEN,
"n__of_shots_proximity_to_hole_arg_around_the_green" BIGINT -- # OF SHOTS---proximity To Hole (arg)---AROUND THE GREEN,
"scrambling_rank_proximity_to_hole_arg_around_the_green" BIGINT -- SCRAMBLING RANK---proximity To Hole (arg)---AROUND THE GREEN,
"n__scrambling_around_the_green" DOUBLE -- %---scrambling---AROUND THE GREEN,
"par_or_better_scrambling_around_the_green" BIGINT -- PAR OR BETTER---scrambling---AROUND THE GREEN,
"missed_gir_scrambling_around_the_green" BIGINT -- MISSED GIR---scrambling---AROUND THE GREEN,
"average_sg_around_the_green_around_the_green" DOUBLE -- AVERAGE---sg: Around-the-green---AROUND THE GREEN,
"total_sg_arg_sg_around_the_green_around_the_green" DOUBLE -- TOTAL SG:ARG---sg: Around-the-green---AROUND THE GREEN,
"measured_rounds_sg_around_the_green_around_the_green" BIGINT -- MEASURED ROUNDS---sg: Around-the-green---AROUND THE GREEN,
"n__sand_save_percentage_around_the_green" DOUBLE -- %---sand Save Percentage---AROUND THE GREEN,
"n__saves_sand_save_percentage_around_the_green" BIGINT -- # SAVES---sand Save Percentage---AROUND THE GREEN,
"n__bunkers_sand_save_percentage_around_the_green" BIGINT -- # BUNKERS---sand Save Percentage---AROUND THE GREEN,
"total_o_u_par_sand_save_percentage_around_the_green" DOUBLE -- TOTAL O/U PAR---sand Save Percentage---AROUND THE GREEN,
"avg_round_4_scoring_average_scoring" DOUBLE -- AVG---round 4 Scoring Average---SCORING,
"total_strokes_round_4_scoring_average_scoring" BIGINT -- TOTAL STROKES---round 4 Scoring Average---SCORING,
"total_rounds_round_4_scoring_average_scoring" BIGINT -- TOTAL ROUNDS---round 4 Scoring Average---SCORING,
"avg_round_3_scoring_average_scoring" DOUBLE -- AVG---round 3 Scoring Average---SCORING,
"total_strokes_round_3_scoring_average_scoring" BIGINT -- TOTAL STROKES---round 3 Scoring Average---SCORING,
"total_rounds_round_3_scoring_average_scoring" BIGINT -- TOTAL ROUNDS---round 3 Scoring Average---SCORING,
"avg_round_2_scoring_average_scoring" DOUBLE -- AVG---round 2 Scoring Average---SCORING,
"total_strokes_round_2_scoring_average_scoring" BIGINT -- TOTAL STROKES---round 2 Scoring Average---SCORING,
"total_rounds_round_2_scoring_average_scoring" BIGINT -- TOTAL ROUNDS---round 2 Scoring Average---SCORING,
"avg_round_1_scoring_average_scoring" DOUBLE -- AVG---round 1 Scoring Average---SCORING,
"total_strokes_round_1_scoring_average_scoring" BIGINT -- TOTAL STROKES---round 1 Scoring Average---SCORING,
"total_rounds_round_1_scoring_average_scoring" BIGINT -- TOTAL ROUNDS---round 1 Scoring Average---SCORING,
"avg_final_round_scoring_average_scoring" DOUBLE -- AVG---final Round Scoring Average---SCORING,
"total_strokes_final_round_scoring_average_scoring" BIGINT -- TOTAL STROKES---final Round Scoring Average---SCORING,
"total_rounds_final_round_scoring_average_scoring" BIGINT -- TOTAL ROUNDS---final Round Scoring Average---SCORING,
"n__birdie_or_better_percentage_scoring" DOUBLE -- %---birdie Or Better Percentage---SCORING,
"total_birdies_birdie_or_better_percentage_scoring" BIGINT -- TOTAL BIRDIES---birdie Or Better Percentage---SCORING,
"total_holes_birdie_or_better_percentage_scoring" BIGINT -- TOTAL HOLES---birdie Or Better Percentage---SCORING,
"gir_rank_birdie_or_better_percentage_scoring" BIGINT -- GIR RANK---birdie Or Better Percentage---SCORING,
"total_total_birdies_scoring" BIGINT -- TOTAL---total Birdies---SCORING,
"total_total_eagles_scoring" BIGINT -- TOTAL---total Eagles---SCORING,
"avg_scoring_average_actual_scoring" DOUBLE -- AVG---scoring Average (actual)---SCORING,
"total_strokes_scoring_average_actual_scoring" BIGINT -- TOTAL STROKES---scoring Average (actual)---SCORING,
"total_rounds_scoring_average_actual_scoring" BIGINT -- TOTAL ROUNDS---scoring Average (actual)---SCORING,
"average_sg_total_scoring" DOUBLE -- AVERAGE---sg: Total---SCORING,
"total_sg_t_sg_total_scoring" DOUBLE -- TOTAL SG:T---sg: Total---SCORING,
"total_sg_t2g_sg_total_scoring" DOUBLE -- TOTAL SG:T2G---sg: Total---SCORING,
"total_sg_p_sg_total_scoring" DOUBLE -- TOTAL SG:P---sg: Total---SCORING,
"measured_rounds_sg_total_scoring" BIGINT -- MEASURED ROUNDS---sg: Total---SCORING,
"avg_scoring_average_scoring" DOUBLE -- AVG---scoring Average---SCORING,
"total_strokes_scoring_average_scoring" BIGINT -- TOTAL STROKES---scoring Average---SCORING,
"total_adjustment_scoring_average_scoring" VARCHAR -- TOTAL ADJUSTMENT---scoring Average---SCORING
);
Anyone who has the link will be able to view this.