This dataset lists video games with sales of more than 100,000 copies. The data was scraped from vgchartz.com and covers sales figures for games across multiple regions and platforms.
Key Features of the Dataset:
Rank: Indicates the global sales ranking of each game.
Name: Specifies the title of the game.
Platform: Identifies the platform on which the game was released (e.g., PC, PS4, Xbox One).
Year: Records the year of the game's release.
Genre: Categorizes the game based on its genre (e.g., Action, Adventure, RPG).
Publisher: Lists the company responsible for publishing the game.
Regional Sales Data:
NA_Sales: Sales in North America, in millions of copies.
EU_Sales: Sales in Europe, in millions of copies.
JP_Sales: Sales in Japan, in millions of copies.
Other_Sales: Sales in the rest of the world, in millions of copies.
Global_Sales: Total worldwide sales, in millions of copies, aggregating all regional sales.
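As a quick sanity check on the column definitions above, the following sketch parses a few rows with Python's standard csv module and flags any row where Global_Sales does not match the sum of the four regional columns. The sample rows are made-up placeholders, not real records from the dataset, and the function names and the 0.02 tolerance (meant to absorb rounding in the published figures) are assumptions for illustration.

```python
import csv
import io

# Hypothetical sample rows that reuse the dataset's column names.
# The values are placeholders, not real sales figures.
SAMPLE_CSV = """Rank,Name,Platform,Year,Genre,Publisher,NA_Sales,EU_Sales,JP_Sales,Other_Sales,Global_Sales
1,Example Game A,PC,2010,Action,Example Publisher,1.50,1.00,0.25,0.25,3.00
2,Example Game B,PS4,2015,RPG,Example Publisher,0.40,0.30,0.20,0.10,1.00
"""

REGION_COLUMNS = ("NA_Sales", "EU_Sales", "JP_Sales", "Other_Sales")


def regional_total(row):
    """Sum the four regional sales columns (millions) for one row."""
    return sum(float(row[col]) for col in REGION_COLUMNS)


def find_mismatches(rows, tolerance=0.02):
    """Return names of rows whose Global_Sales differs from the regional sum.

    The tolerance is an assumed allowance for rounding, not a documented value.
    """
    return [
        row["Name"]
        for row in rows
        if abs(regional_total(row) - float(row["Global_Sales"])) > tolerance
    ]


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
mismatches = find_mismatches(rows)
print(mismatches)
```

To run the same check on the real file, replace the in-memory sample with an open file handle passed to csv.DictReader.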
The dataset contains 16,598 records. Two entries were excluded because their information was incomplete.
Data Collection Method:
The dataset was generated using a Python script available on GitHub. The script uses the BeautifulSoup library to scrape the data.
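The general technique can be sketched as follows. This is not the original script: the HTML fragment, the table layout, and the function name parse_sales_table are all hypothetical, since the actual markup of vgchartz.com is not described here. The sketch only shows how BeautifulSoup (the third-party bs4 package) can turn table rows into records, skipping rows whose cell count does not match the header.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML fragment for illustration only.
# The real vgchartz.com page structure may look quite different.
SAMPLE_HTML = """
<table>
  <tr><th>Rank</th><th>Name</th><th>Platform</th><th>Global_Sales</th></tr>
  <tr><td>1</td><td>Example Game A</td><td>PC</td><td>3.00</td></tr>
  <tr><td>2</td><td>Example Game B</td><td>PS4</td><td>1.00</td></tr>
  <tr><td>3</td><td>Incomplete Row</td></tr>
</table>
"""


def parse_sales_table(html):
    """Parse an HTML table into a list of dicts keyed by header text."""
    soup = BeautifulSoup(html, "html.parser")
    table_rows = soup.find_all("tr")
    headers = [th.get_text(strip=True) for th in table_rows[0].find_all("th")]

    records = []
    for tr in table_rows[1:]:
        cells = [td.get_text(strip=True) for td in tr.find_all("td")]
        # Skip rows that do not have a value for every header column.
        if len(cells) == len(headers):
            records.append(dict(zip(headers, cells)))
    return records


records = parse_sales_table(SAMPLE_HTML)
print(records)
```

In a real scraper the HTML string would come from an HTTP response rather than a constant, and the selectors would need to match the site's actual markup.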
This dataset can be used to analyze video game sales trends, evaluate the performance of specific platforms or genres, and compare regional preferences in the gaming industry.
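As one example of such an analysis, the sketch below totals sales by genre and by platform using only the Python standard library. The sample rows are made-up placeholders rather than real records, and the helper name total_sales_by is an assumption for illustration.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows that reuse the dataset's column names.
# The values are placeholders, not real sales figures.
SAMPLE_CSV = """Rank,Name,Platform,Year,Genre,Publisher,NA_Sales,EU_Sales,JP_Sales,Other_Sales,Global_Sales
1,Example Game A,PC,2010,Action,Example Publisher,1.50,1.00,0.25,0.25,3.00
2,Example Game B,PS4,2015,RPG,Example Publisher,0.40,0.30,0.20,0.10,1.00
3,Example Game C,Xbox One,2016,Action,Example Publisher,0.25,0.125,0.00,0.125,0.50
"""


def total_sales_by(rows, key, sales_column="Global_Sales"):
    """Sum one sales column (millions) grouped by another column such as Genre."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += float(row[sales_column])
    return dict(totals)


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))

# Global sales per genre, and Japanese sales per platform.
by_genre = total_sales_by(rows, "Genre")
jp_by_platform = total_sales_by(rows, "Platform", sales_column="JP_Sales")

for genre, total in sorted(by_genre.items(), key=lambda item: item[1], reverse=True):
    print(f"{genre}: {total:.2f}")
```

Passing a regional column such as JP_Sales as sales_column gives a per-region view, which is one simple way to compare regional preferences.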