Chess Game Dataset (Lichess)
20,000+ Lichess Games, including moves, victor, rating, opening details and more
@kaggle.datasnaek_chess
20,000+ Lichess Games, including moves, victor, rating, opening details and more
@kaggle.datasnaek_chess
General Info
This is a set of just over 20,000 games collected from a selection of users on the site Lichess.org, and how to collect more. I will also upload more games in the future as I collect them. This set contains the:
For each of these separate games from Lichess. I collected this data using the Lichess API, which enables collection of any given users game history. The difficult part was collecting usernames to use, however the API also enables dumping of all users in a Lichess team. There are several teams on Lichess with over 1,500 players, so this proved an effective way to get users to collect games from.
Possible Uses
Lots of information is contained within a single chess game, let alone a full dataset of multiple games. It is primarily a game of patterns, and data science is all about detecting patterns in data, which is why chess has been one of the most invested in areas of AI in the past. This dataset collects all of the information available from 20,000 games and presents it in a format that is easy to process for analysis of, for example, what allows a player to win as black or white, how much meta (out-of-game) factors affect a game, the relationship between openings and victory for black and white and more.
CREATE TABLE games (
"id" VARCHAR,
"rated" BOOLEAN,
"created_at" DOUBLE,
"last_move_at" DOUBLE,
"turns" BIGINT,
"victory_status" VARCHAR,
"winner" VARCHAR,
"increment_code" VARCHAR,
"white_id" VARCHAR,
"white_rating" BIGINT,
"black_id" VARCHAR,
"black_rating" BIGINT,
"moves" VARCHAR,
"opening_eco" VARCHAR,
"opening_name" VARCHAR,
"opening_ply" BIGINT
);Anyone who has the link will be able to view this.