Canadian Baseball
Examining Player Performance by Division, State and School
By [source]
About this dataset
This dataset contains extensive and detailed information about Canadian baseball players and coaches who have participated in the United States college athletics system between 2021 to 2023. Through combining the datasets we are able to analyze the individual performance of each player or coach, their playing style, as well as their changes in performance according to division, state and school. By exploring these data we are able to understand how different levels of competition affect athlete performance as well as determine which player has achieved the highest levels of success within their fields. The columns included in this dataset include name, position, batting/throwing preference, class, school, division state hometown and more which provides us with a detailed understanding of each athlete's playing style from a statistical standpoint
More Datasets
For more datasets, click here.
Featured Notebooks
- 🚨 Your notebook can be here! 🚨!
How to use the dataset
How to use this Dataset:
This dataset offers a comprehensive look at the performance of Canadian baseball players and coaches in U.S. college athletics from 2021-2023. By focusing on individual player performances across multiple factors (e.g., batting average, number of home runs, etc.), we can get a better understanding of how Canadian athletes have done in this competitive environment.
The first step is to become familiar with the data fields present in the dataset – name, position, batting/throwing preference, class, school, division, state location/hometown/stats link as well as several columns related to performance statistics such as hits and innings pitched - and understand what each field means so that you can determine which statistical measures are most relevant for your analysis.
Once you have identified the relevant variables for your study, you can begin conducting exploratory analysis on these variables by creating basic summary statistics (such as mean or median) or visualizations (like line graphs or histograms). This way you can quickly identify patterns across different divisions and schools while also uncovering any outliers that might exist within individual datasets or between two separate datasets when compared side-by-side.
To further expand upon these preliminary insights obtained from exploratory analysis it is important to also consider more advanced statistical techniques such as regressions models in order to identify any causal relationships existing between player performance characteristics vs different geographic locations such as states or divisions mentioned earlier - something which could prove crucial in understanding their successes more holistically than just looking at one factor alone. It should also be noted that these above steps are only applicable if there’s enough data available for said analysis; so it may be necessary to combine different datasets available within this repository before proceeding with more complicated methods mentioned above if needed
Research Ideas
- Analyzing which position of players and coaches are more successful in U.S. college athletics, broken down by division, state, and school located in Canada.
- Examine the performance of Canadian baseball players across different schools and divisions based on their batting average, on-base percentage or slugging rate to identify areas for improvement in certain metrics or fields of playe
- Compare player stats across teams within the same division and state to identify team strengths and weaknesses for possible recruitment targets
Acknowledgements
If you use this dataset in your research, please credit the original authors.
Data Source
License
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
Columns
File: canadians_2021.csv
Column name |
Description |
name |
The name of the player or coach. (String) |
position |
The position the player or coach plays or coaches. (String) |
b |
The player's batting preference (right or left). (String) |
class |
The player's class (Freshman, Sophomore, Junior, Senior). (String) |
school |
The school the player or coach is associated with. (String) |
division |
The division the school is in (NCAA D1, NCAA D2, etc). (String) |
state |
The state the school is located in. (String) |
hometown |
The player's hometown. (String) |
stats_link |
A link to the player's stats page. (URL) |
File: canadians_2022.csv
Column name |
Description |
name |
The name of the player or coach. (String) |
position |
The position the player or coach plays or coaches. (String) |
b |
The player's batting preference (right or left). (String) |
class |
The player's class (Freshman, Sophomore, Junior, Senior). (String) |
school |
The school the player or coach is associated with. (String) |
division |
The division the school is in (NCAA D1, NCAA D2, etc). (String) |
state |
The state the school is located in. (String) |
hometown |
The player's hometown. (String) |
stats_link |
A link to the player's stats page. (URL) |
File: canadians_manual_2021.csv
Column name |
Description |
name |
The name of the player or coach. (String) |
position |
The position the player or coach plays or coaches. (String) |
b |
The player's batting preference (right or left). (String) |
class |
The player's class (Freshman, Sophomore, Junior, Senior). (String) |
school |
The school the player or coach is associated with. (String) |
division |
The division the school is in (NCAA D1, NCAA D2, etc). (String) |
state |
The state the school is located in. (String) |
hometown |
The player's hometown. (String) |
stats_link |
A link to the player's stats page. (URL) |
File: canadians_manual_2022.csv
Column name |
Description |
name |
The name of the player or coach. (String) |
position |
The position the player or coach plays or coaches. (String) |
b |
The player's batting preference (right or left). (String) |
class |
The player's class (Freshman, Sophomore, Junior, Senior). (String) |
school |
The school the player or coach is associated with. (String) |
division |
The division the school is in (NCAA D1, NCAA D2, etc). (String) |
state |
The state the school is located in. (String) |
hometown |
The player's hometown. (String) |
File: roster_pages_2021.csv
Column name |
Description |
division |
The division the school is in (NCAA D1, NCAA D2, etc). (String) |
state |
The state the school is located in. (String) |
File: roster_pages_2022.csv
Column name |
Description |
division |
The division the school is in (NCAA D1, NCAA D2, etc). (String) |
state |
The state the school is located in. (String) |
title |
The title of the dataset. (String) |
roster_link |
The link to the roster of the school. (URL) |
File: stats.csv
Column name |
Description |
Name |
The name of the player or coach. (String) |
Position |
The position the player or coach plays or coaches. (String) |
School |
The school the player or coach is associated with. (String) |
Division |
The division the school is in. (String) |
Type |
The type of player or coach. (String) |
Games Played (G) |
The number of games the player or coach has played. (Integer) |
At Bats (AB) |
The number of times the player has been at bat. (Integer) |
Runs Scored (R) |
The number of runs the player has scored. (Integer) |
Hits (H) |
The number of hits the player has made. (Integer) |
Doubles (2B) |
The number of doubles the player has hit. (Integer) |
Triples (3B) |
The number of triples the player has hit. (Integer) |
Home Runs (HR) |
The number of home runs the player has hit. (Integer) |
Runs Batted In (RBI) |
The number of runs the player has batted in. (Integer) |
Stolen Bases (SB) |
The number of bases the player has stolen. (Integer) |
Batting Average (AVG) |
The player's batting average. (Float) |
On-Base Percentage (OBP) |
The player's on-base percentage. (Float) |
Slugging Percentage (SLG) |
The player's slugging percentage. (Float) |
On-Base plus Slugging (OPS) |
The player's on-base plus slugging percentage. (Float) |
Appearances (G) |
The number of appearances the player has made. (Integer) |
Games Started (GS) |
The number of games the player has started. (Integer) |
Innings Pitched (IP) |
The number of innings the player has pitched. (Integer) |
Wins (W) |
The number of wins the player has earned. (Integer) |
Losses (L) |
The number of losses the player has suffered. (Integer) |
Earned Runs (ER) |
The number of earned runs the player has allowed |
Hits Allowed (H) |
The number of hits the player has allowed. (Integer) |
Walks Allowed (BB) |
The number of walks the player has allowed. (Integer) |
Earned Run Average (ERA) |
The player's earned run |
File: stats_2021.csv
Column name |
Description |
Name |
The name of the player or coach. (String) |
Position |
The position the player or coach plays or coaches. (String) |
School |
The school the player or coach is associated with. (String) |
Division |
The division the school is in. (String) |
Type |
The type of player or coach. (String) |
Games Played (G) |
The number of games the player or coach has played. (Integer) |
At Bats (AB) |
The number of times the player has been at bat. (Integer) |
Runs Scored (R) |
The number of runs the player has scored. (Integer) |
Hits (H) |
The number of hits the player has made. (Integer) |
Doubles (2B) |
The number of doubles the player has hit. (Integer) |
Triples (3B) |
The number of triples the player has hit. (Integer) |
Home Runs (HR) |
The number of home runs the player has hit. (Integer) |
Runs Batted In (RBI) |
The number of runs the player has batted in. (Integer) |
Stolen Bases (SB) |
The number of bases the player has stolen. (Integer) |
Batting Average (AVG) |
The player's batting average. (Float) |
On-Base Percentage (OBP) |
The player's on-base percentage. (Float) |
Slugging Percentage (SLG) |
The player's slugging percentage. (Float) |
On-Base plus Slugging (OPS) |
The player's on-base plus slugging percentage. (Float) |
Appearances (G) |
The number of appearances the player has made. (Integer) |
Games Started (GS) |
The number of games the player has started. (Integer) |
Innings Pitched (IP) |
The number of innings the player has pitched. (Integer) |
Wins (W) |
The number of wins the player has earned. (Integer) |
Losses (L) |
The number of losses the player has suffered. (Integer) |
Earned Runs (ER) |
The number of earned runs the player has allowed |
Hits Allowed (H) |
The number of hits the player has allowed. (Integer) |
Walks Allowed (BB) |
The number of walks the player has allowed. (Integer) |
Earned Run Average (ERA) |
The player's earned run |
File: stats_2022.csv
Column name |
Description |
Name |
The name of the player or coach. (String) |
Position |
The position the player or coach plays or coaches. (String) |
School |
The school the player or coach is associated with. (String) |
Division |
The division the school is in. (String) |
Type |
The type of player or coach. (String) |
Games Played (G) |
The number of games the player or coach has played. (Integer) |
At Bats (AB) |
The number of times the player has been at bat. (Integer) |
Runs Scored (R) |
The number of runs the player has scored. (Integer) |
Hits (H) |
The number of hits the player has made. (Integer) |
Doubles (2B) |
The number of doubles the player has hit. (Integer) |
Triples (3B) |
The number of triples the player has hit. (Integer) |
Home Runs (HR) |
The number of home runs the player has hit. (Integer) |
Runs Batted In (RBI) |
The number of runs the player has batted in. (Integer) |
Stolen Bases (SB) |
The number of bases the player has stolen. (Integer) |
Batting Average (AVG) |
The player's batting average. (Float) |
On-Base Percentage (OBP) |
The player's on-base percentage. (Float) |
Slugging Percentage (SLG) |
The player's slugging percentage. (Float) |
On-Base plus Slugging (OPS) |
The player's on-base plus slugging percentage. (Float) |
Appearances (G) |
The number of appearances the player has made. (Integer) |
Games Started (GS) |
The number of games the player has started. (Integer) |
Innings Pitched (IP) |
The number of innings the player has pitched. (Integer) |
Wins (W) |
The number of wins the player has earned. (Integer) |
Losses (L) |
The number of losses the player has suffered. (Integer) |
Earned Runs (ER) |
The number of earned runs the player has allowed |
Hits Allowed (H) |
The number of hits the player has allowed. (Integer) |
Walks Allowed (BB) |
The number of walks the player has allowed. (Integer) |
Earned Run Average (ERA) |
The player's earned run |
Acknowledgements
If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit .