Baselight

Canadian Baseball

Examining Player Performance by Division, State and School

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl

Loading...
Loading...

About this Dataset

Canadian Baseball


Canadian Baseball

Examining Player Performance by Division, State and School

By [source]


About this dataset

This dataset contains extensive and detailed information about Canadian baseball players and coaches who have participated in the United States college athletics system between 2021 to 2023. Through combining the datasets we are able to analyze the individual performance of each player or coach, their playing style, as well as their changes in performance according to division, state and school. By exploring these data we are able to understand how different levels of competition affect athlete performance as well as determine which player has achieved the highest levels of success within their fields. The columns included in this dataset include name, position, batting/throwing preference, class, school, division state hometown and more which provides us with a detailed understanding of each athlete's playing style from a statistical standpoint

More Datasets

For more datasets, click here.

Featured Notebooks

  • 🚨 Your notebook can be here! 🚨!

How to use the dataset

How to use this Dataset:

This dataset offers a comprehensive look at the performance of Canadian baseball players and coaches in U.S. college athletics from 2021-2023. By focusing on individual player performances across multiple factors (e.g., batting average, number of home runs, etc.), we can get a better understanding of how Canadian athletes have done in this competitive environment.

The first step is to become familiar with the data fields present in the dataset – name, position, batting/throwing preference, class, school, division, state location/hometown/stats link as well as several columns related to performance statistics such as hits and innings pitched - and understand what each field means so that you can determine which statistical measures are most relevant for your analysis.

Once you have identified the relevant variables for your study, you can begin conducting exploratory analysis on these variables by creating basic summary statistics (such as mean or median) or visualizations (like line graphs or histograms). This way you can quickly identify patterns across different divisions and schools while also uncovering any outliers that might exist within individual datasets or between two separate datasets when compared side-by-side.

To further expand upon these preliminary insights obtained from exploratory analysis it is important to also consider more advanced statistical techniques such as regressions models in order to identify any causal relationships existing between player performance characteristics vs different geographic locations such as states or divisions mentioned earlier - something which could prove crucial in understanding their successes more holistically than just looking at one factor alone. It should also be noted that these above steps are only applicable if there’s enough data available for said analysis; so it may be necessary to combine different datasets available within this repository before proceeding with more complicated methods mentioned above if needed

Research Ideas

  • Analyzing which position of players and coaches are more successful in U.S. college athletics, broken down by division, state, and school located in Canada.
  • Examine the performance of Canadian baseball players across different schools and divisions based on their batting average, on-base percentage or slugging rate to identify areas for improvement in certain metrics or fields of playe
  • Compare player stats across teams within the same division and state to identify team strengths and weaknesses for possible recruitment targets

Acknowledgements

If you use this dataset in your research, please credit the original authors.
Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: canadians_2021.csv

Column name Description
name The name of the player or coach. (String)
position The position the player or coach plays or coaches. (String)
b The player's batting preference (right or left). (String)
class The player's class (Freshman, Sophomore, Junior, Senior). (String)
school The school the player or coach is associated with. (String)
division The division the school is in (NCAA D1, NCAA D2, etc). (String)
state The state the school is located in. (String)
hometown The player's hometown. (String)
stats_link A link to the player's stats page. (URL)

File: canadians_2022.csv

Column name Description
name The name of the player or coach. (String)
position The position the player or coach plays or coaches. (String)
b The player's batting preference (right or left). (String)
class The player's class (Freshman, Sophomore, Junior, Senior). (String)
school The school the player or coach is associated with. (String)
division The division the school is in (NCAA D1, NCAA D2, etc). (String)
state The state the school is located in. (String)
hometown The player's hometown. (String)
stats_link A link to the player's stats page. (URL)

File: canadians_manual_2021.csv

Column name Description
name The name of the player or coach. (String)
position The position the player or coach plays or coaches. (String)
b The player's batting preference (right or left). (String)
class The player's class (Freshman, Sophomore, Junior, Senior). (String)
school The school the player or coach is associated with. (String)
division The division the school is in (NCAA D1, NCAA D2, etc). (String)
state The state the school is located in. (String)
hometown The player's hometown. (String)
stats_link A link to the player's stats page. (URL)

File: canadians_manual_2022.csv

Column name Description
name The name of the player or coach. (String)
position The position the player or coach plays or coaches. (String)
b The player's batting preference (right or left). (String)
class The player's class (Freshman, Sophomore, Junior, Senior). (String)
school The school the player or coach is associated with. (String)
division The division the school is in (NCAA D1, NCAA D2, etc). (String)
state The state the school is located in. (String)
hometown The player's hometown. (String)

File: roster_pages_2021.csv

Column name Description
division The division the school is in (NCAA D1, NCAA D2, etc). (String)
state The state the school is located in. (String)

File: roster_pages_2022.csv

Column name Description
division The division the school is in (NCAA D1, NCAA D2, etc). (String)
state The state the school is located in. (String)
title The title of the dataset. (String)
roster_link The link to the roster of the school. (URL)

File: stats.csv

Column name Description
Name The name of the player or coach. (String)
Position The position the player or coach plays or coaches. (String)
School The school the player or coach is associated with. (String)
Division The division the school is in. (String)
Type The type of player or coach. (String)
Games Played (G) The number of games the player or coach has played. (Integer)
At Bats (AB) The number of times the player has been at bat. (Integer)
Runs Scored (R) The number of runs the player has scored. (Integer)
Hits (H) The number of hits the player has made. (Integer)
Doubles (2B) The number of doubles the player has hit. (Integer)
Triples (3B) The number of triples the player has hit. (Integer)
Home Runs (HR) The number of home runs the player has hit. (Integer)
Runs Batted In (RBI) The number of runs the player has batted in. (Integer)
Stolen Bases (SB) The number of bases the player has stolen. (Integer)
Batting Average (AVG) The player's batting average. (Float)
On-Base Percentage (OBP) The player's on-base percentage. (Float)
Slugging Percentage (SLG) The player's slugging percentage. (Float)
On-Base plus Slugging (OPS) The player's on-base plus slugging percentage. (Float)
Appearances (G) The number of appearances the player has made. (Integer)
Games Started (GS) The number of games the player has started. (Integer)
Innings Pitched (IP) The number of innings the player has pitched. (Integer)
Wins (W) The number of wins the player has earned. (Integer)
Losses (L) The number of losses the player has suffered. (Integer)
Earned Runs (ER) The number of earned runs the player has allowed
Hits Allowed (H) The number of hits the player has allowed. (Integer)
Walks Allowed (BB) The number of walks the player has allowed. (Integer)
Earned Run Average (ERA) The player's earned run

File: stats_2021.csv

Column name Description
Name The name of the player or coach. (String)
Position The position the player or coach plays or coaches. (String)
School The school the player or coach is associated with. (String)
Division The division the school is in. (String)
Type The type of player or coach. (String)
Games Played (G) The number of games the player or coach has played. (Integer)
At Bats (AB) The number of times the player has been at bat. (Integer)
Runs Scored (R) The number of runs the player has scored. (Integer)
Hits (H) The number of hits the player has made. (Integer)
Doubles (2B) The number of doubles the player has hit. (Integer)
Triples (3B) The number of triples the player has hit. (Integer)
Home Runs (HR) The number of home runs the player has hit. (Integer)
Runs Batted In (RBI) The number of runs the player has batted in. (Integer)
Stolen Bases (SB) The number of bases the player has stolen. (Integer)
Batting Average (AVG) The player's batting average. (Float)
On-Base Percentage (OBP) The player's on-base percentage. (Float)
Slugging Percentage (SLG) The player's slugging percentage. (Float)
On-Base plus Slugging (OPS) The player's on-base plus slugging percentage. (Float)
Appearances (G) The number of appearances the player has made. (Integer)
Games Started (GS) The number of games the player has started. (Integer)
Innings Pitched (IP) The number of innings the player has pitched. (Integer)
Wins (W) The number of wins the player has earned. (Integer)
Losses (L) The number of losses the player has suffered. (Integer)
Earned Runs (ER) The number of earned runs the player has allowed
Hits Allowed (H) The number of hits the player has allowed. (Integer)
Walks Allowed (BB) The number of walks the player has allowed. (Integer)
Earned Run Average (ERA) The player's earned run

File: stats_2022.csv

Column name Description
Name The name of the player or coach. (String)
Position The position the player or coach plays or coaches. (String)
School The school the player or coach is associated with. (String)
Division The division the school is in. (String)
Type The type of player or coach. (String)
Games Played (G) The number of games the player or coach has played. (Integer)
At Bats (AB) The number of times the player has been at bat. (Integer)
Runs Scored (R) The number of runs the player has scored. (Integer)
Hits (H) The number of hits the player has made. (Integer)
Doubles (2B) The number of doubles the player has hit. (Integer)
Triples (3B) The number of triples the player has hit. (Integer)
Home Runs (HR) The number of home runs the player has hit. (Integer)
Runs Batted In (RBI) The number of runs the player has batted in. (Integer)
Stolen Bases (SB) The number of bases the player has stolen. (Integer)
Batting Average (AVG) The player's batting average. (Float)
On-Base Percentage (OBP) The player's on-base percentage. (Float)
Slugging Percentage (SLG) The player's slugging percentage. (Float)
On-Base plus Slugging (OPS) The player's on-base plus slugging percentage. (Float)
Appearances (G) The number of appearances the player has made. (Integer)
Games Started (GS) The number of games the player has started. (Integer)
Innings Pitched (IP) The number of innings the player has pitched. (Integer)
Wins (W) The number of wins the player has earned. (Integer)
Losses (L) The number of losses the player has suffered. (Integer)
Earned Runs (ER) The number of earned runs the player has allowed
Hits Allowed (H) The number of hits the player has allowed. (Integer)
Walks Allowed (BB) The number of walks the player has allowed. (Integer)
Earned Run Average (ERA) The player's earned run

Acknowledgements

If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit .

Tables

Canadians 2021

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.canadians_2021
  • 46.73 KB
  • 814 rows
  • 10 columns
Loading...

CREATE TABLE canadians_2021 (
  "name" VARCHAR,
  "position" VARCHAR,
  "b" VARCHAR,
  "t" VARCHAR,
  "class" VARCHAR,
  "school" VARCHAR,
  "division" VARCHAR,
  "state" VARCHAR,
  "hometown" VARCHAR,
  "stats_link" VARCHAR
);

Canadians 2022

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.canadians_2022
  • 55.8 KB
  • 996 rows
  • 10 columns
Loading...

CREATE TABLE canadians_2022 (
  "name" VARCHAR,
  "position" VARCHAR,
  "b" VARCHAR,
  "t" VARCHAR,
  "class" VARCHAR,
  "school" VARCHAR,
  "division" VARCHAR,
  "state" VARCHAR,
  "hometown" VARCHAR,
  "stats_link" VARCHAR
);

Canadians 2023

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.canadians_2023
  • 27.15 KB
  • 660 rows
  • 11 columns
Loading...

CREATE TABLE canadians_2023 (
  "name" VARCHAR,
  "position" VARCHAR,
  "b" VARCHAR,
  "t" VARCHAR,
  "class" VARCHAR,
  "school" VARCHAR,
  "league" VARCHAR,
  "division" DOUBLE,
  "state" VARCHAR,
  "hometown" VARCHAR,
  "stats_link" VARCHAR
);

Canadians Manual 2021

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.canadians_manual_2021
  • 9.87 KB
  • 70 rows
  • 10 columns
Loading...

CREATE TABLE canadians_manual_2021 (
  "name" VARCHAR,
  "position" VARCHAR,
  "b" VARCHAR,
  "t" VARCHAR,
  "class" VARCHAR,
  "school" VARCHAR,
  "division" VARCHAR,
  "state" VARCHAR,
  "hometown" VARCHAR,
  "stats_link" VARCHAR
);

Canadians Manual 2022

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.canadians_manual_2022
  • 8.6 KB
  • 56 rows
  • 9 columns
Loading...

CREATE TABLE canadians_manual_2022 (
  "name" VARCHAR,
  "position" VARCHAR,
  "b" VARCHAR,
  "t" VARCHAR,
  "class" VARCHAR,
  "school" VARCHAR,
  "division" VARCHAR,
  "state" VARCHAR,
  "hometown" VARCHAR
);

Canadians Manual 2023

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.canadians_manual_2023
  • 5.59 KB
  • 11 columns
Loading...

CREATE TABLE canadians_manual_2023 (
  "name" VARCHAR,
  "position" VARCHAR,
  "b" VARCHAR,
  "t" VARCHAR,
  "class" VARCHAR,
  "school" VARCHAR,
  "league" VARCHAR,
  "division" VARCHAR,
  "state" VARCHAR,
  "hometown" VARCHAR,
  "stats_link" VARCHAR
);

Coaches

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.coaches
  • 6.72 KB
  • 28 rows
  • 7 columns
Loading...

CREATE TABLE coaches (
  "name" VARCHAR,
  "position" VARCHAR,
  "school" VARCHAR,
  "league" VARCHAR,
  "division" DOUBLE,
  "state" VARCHAR,
  "hometown" VARCHAR
);

Roster Pages 2021

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.roster_pages_2021
  • 84.44 KB
  • 1681 rows
  • 7 columns
Loading...

CREATE TABLE roster_pages_2021 (
  "title" VARCHAR,
  "division" VARCHAR,
  "conference" VARCHAR,
  "state" VARCHAR,
  "location" VARCHAR,
  "link" VARCHAR,
  "roster_link" VARCHAR
);

Roster Pages 2022

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.roster_pages_2022
  • 52.97 KB
  • 1681 rows
  • 4 columns
Loading...

CREATE TABLE roster_pages_2022 (
  "title" VARCHAR,
  "division" VARCHAR,
  "state" VARCHAR,
  "roster_link" VARCHAR
);

Roster Pages 2023

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.roster_pages_2023
  • 53.45 KB
  • 1687 rows
  • 5 columns
Loading...

CREATE TABLE roster_pages_2023 (
  "school" VARCHAR,
  "league" VARCHAR,
  "division" DOUBLE,
  "state" VARCHAR,
  "roster_link" VARCHAR
);

Stats

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.stats
  • 29.07 KB
  • 197 rows
  • 29 columns
Loading...

CREATE TABLE stats (
  "name" VARCHAR,
  "position" VARCHAR,
  "school" VARCHAR,
  "division" VARCHAR,
  "type" VARCHAR,
  "games_played_g" BIGINT,
  "at_bats_ab" BIGINT,
  "runs_scored_r" BIGINT,
  "hits_h" BIGINT,
  "doubles_2b" BIGINT,
  "triples_3b" BIGINT,
  "home_runs_hr" BIGINT,
  "runs_batted_in_rbi" BIGINT,
  "stolen_bases_sb" BIGINT,
  "batting_average_avg" DOUBLE,
  "on_base_percentage_obp" DOUBLE,
  "slugging_percentage_slg" DOUBLE,
  "on_base_plus_slugging_ops" DOUBLE,
  "appearances_g" BIGINT,
  "games_started_gs" BIGINT,
  "innings_pitched_ip" DOUBLE,
  "wins_w" BIGINT,
  "losses_l" BIGINT,
  "earned_runs_er" BIGINT,
  "hits_allowed_h" BIGINT,
  "walks_allowed_bb" BIGINT,
  "earned_run_average_era" DOUBLE,
  "saves_sv" BIGINT,
  "strikeouts_k" BIGINT
);

Stats 2021

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.stats_2021
  • 57.79 KB
  • 720 rows
  • 29 columns
Loading...

CREATE TABLE stats_2021 (
  "name" VARCHAR,
  "position" VARCHAR,
  "school" VARCHAR,
  "division" VARCHAR,
  "type" VARCHAR,
  "games_played_g" BIGINT,
  "at_bats_ab" BIGINT,
  "runs_scored_r" BIGINT,
  "hits_h" BIGINT,
  "doubles_2b" BIGINT,
  "triples_3b" BIGINT,
  "home_runs_hr" BIGINT,
  "runs_batted_in_rbi" BIGINT,
  "stolen_bases_sb" BIGINT,
  "batting_average_avg" DOUBLE,
  "on_base_percentage_obp" DOUBLE,
  "slugging_percentage_slg" DOUBLE,
  "on_base_plus_slugging_ops" DOUBLE,
  "appearances_g" BIGINT,
  "games_started_gs" BIGINT,
  "innings_pitched_ip" DOUBLE,
  "wins_w" BIGINT,
  "losses_l" BIGINT,
  "earned_runs_er" BIGINT,
  "hits_allowed_h" DOUBLE,
  "walks_allowed_bb" DOUBLE,
  "earned_run_average_era" DOUBLE,
  "saves_sv" BIGINT,
  "strikeouts_k" BIGINT
);

Stats 2022

@kaggle.thedevastator_canadian_baseball_performance_in_us_college_athl.stats_2022
  • 64.82 KB
  • 863 rows
  • 29 columns
Loading...

CREATE TABLE stats_2022 (
  "name" VARCHAR,
  "position" VARCHAR,
  "school" VARCHAR,
  "division" VARCHAR,
  "type" VARCHAR,
  "games_played_g" BIGINT,
  "at_bats_ab" BIGINT,
  "runs_scored_r" BIGINT,
  "hits_h" BIGINT,
  "doubles_2b" BIGINT,
  "triples_3b" BIGINT,
  "home_runs_hr" BIGINT,
  "runs_batted_in_rbi" BIGINT,
  "stolen_bases_sb" BIGINT,
  "batting_average_avg" DOUBLE,
  "on_base_percentage_obp" DOUBLE,
  "slugging_percentage_slg" DOUBLE,
  "on_base_plus_slugging_ops" DOUBLE,
  "appearances_g" BIGINT,
  "games_started_gs" BIGINT,
  "innings_pitched_ip" DOUBLE,
  "wins_w" BIGINT,
  "losses_l" BIGINT,
  "earned_runs_er" BIGINT,
  "hits_allowed_h" DOUBLE,
  "walks_allowed_bb" DOUBLE,
  "earned_run_average_era" DOUBLE,
  "saves_sv" BIGINT,
  "strikeouts_k" BIGINT
);

Share link

Anyone who has the link will be able to view this.