Formula E Championship
Formula E race data from seasons 1-7
@kaggle.mlandry_formula_e_championship
A single table of the prominent data regarding all Formula E races, derived from Wikipedia race reports.
This data set is merely a single CSV file, backed with all the files I used to create it. This is taken purely from Wikipedia race reports, with some R code to parse the relevant results tables and clean things up.
So while 57 files are available (as of Version 1), the main output file, as shown in the preview, is the intended data set to use.
It has not been normalized, so the one table holds race, driver, team, and results information:
Race: season, race number, race date, and race name
Driver: name
Team: car number, team name from Wikipedia, continuity-based team name
Results: two forms of rank, grid start, number of laps, report time/retirement message, the points awarded, and the three categories of points
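Because every row carries its race, driver, team, and result together, season totals can be computed without any joins. The following is a minimal sketch of that idea, not the parsing code used to build the data set. The column names (`season`, `race_num`, `driver`, `team`, `rank`, `points`) and the sample rows are placeholders made up for illustration; check the actual CSV header before adapting it.

```python
import csv
import io
from collections import defaultdict

# Placeholder rows with hypothetical column names; these are not real results.
SAMPLE_CSV = """season,race_num,driver,team,rank,points
1,1,Driver A,Team X,1,25
1,1,Driver B,Team Y,2,18
1,2,Driver A,Team X,3,15
1,2,Driver B,Team Y,1,25
"""


def season_points(csv_file):
    """Sum points per (season, driver) from a one-row-per-result table."""
    totals = defaultdict(float)
    for row in csv.DictReader(csv_file):
        key = (int(row["season"]), row["driver"])
        totals[key] += float(row["points"])
    return dict(totals)


totals = season_points(io.StringIO(SAMPLE_CSV))
for (season, driver), pts in sorted(totals.items()):
    print(season, driver, pts)
```

To run this against the real file, pass an open file handle in place of the `io.StringIO` object and swap in the real column names.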
Wikipedia's race reports are consistent enough that a couple hours of cleanup was all that was needed to derive this data set. A big thanks is owed to the contributors there.
Motorsports Stats information is a bit more expansive and possibly simpler to parse, but I used Wikipedia to keep licensing as simple as possible.
The inspiration for adding this to Kaggle was that it invites a comparison to Formula 1. @vopani has posted the ergast.com data set, and its accessibility let me work with the data enough to make some simple predictions. I have not found a Formula E data set that provides the results in one place. Unfortunately, I don't know of a source for lap times at all. But with Formula E continually branding itself as one of the most unpredictable championships in racing, putting this data on Kaggle seemed useful. It's my first true data set, and it's nice to give back to a community I've been part of for so long.
So I aim to add a few notebooks here soon to start this out.
I also aim to manually keep it updated through the flurry of Berlin races to finish Season 6, ideally the night following each race using hand-entered results.
Data that is available but that I have chosen not to use includes more detailed qualifying results and, potentially, practice times. The qualifying results are already in the HTML pages I've posted here; they just need to be parsed. But even with that data in hand in the F1 data set, I have yet to use anything from it other than pre-penalty grid positions. For those who don't know, Formula E's qualifying introduces a negative feedback loop: the top 6 in the Championship are forced to qualify in the first group, when the track is frequently very dirty/dusty and has less grip. It is rare that a driver from Group 1 makes it to super pole. Commentators also frequently mention who "looked fast in practice," so having that information might help predict race finish.
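Even without parsed qualifying results, the Group 1 handicap could be approximated as a feature from the championship standings going into each race. The sketch below assumes only what is described above (the top 6 in the standings run in the first group). The function name, the standings dictionary, and the driver labels are hypothetical, and ties are broken simply by input order.

```python
def first_group(standings, size=6):
    """Return the drivers in the top `size` places by championship points."""
    # sorted() is stable, so drivers level on points keep their input order.
    ranked = sorted(standings.items(), key=lambda item: item[1], reverse=True)
    return [driver for driver, _ in ranked[:size]]


# Placeholder standings (driver -> points before the race); not real data.
standings = dict(zip(
    ["Driver A", "Driver B", "Driver C", "Driver D",
     "Driver E", "Driver F", "Driver G", "Driver H"],
    [80, 75, 60, 58, 40, 33, 20, 5],
))

group_one = first_group(standings)
print(group_one)
```

A boolean "qualified in Group 1" column built this way could then sit alongside grid start as an input to a race-finish model.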