Lottery Draws (Euro-millions)
Exploring Features, Targets, and Possibilities
By Jeffrey Mvutu Mabilama [source]
About this dataset
This dataset brings you the past results of two popular European lotteries, Euromillions and TrioMagic, in a format that allows for easy analysis and feature engineering. With this data, you can build models to predict future lottery draws accurately as well as to glean insights into what causes certain draws to be more likely than others. Through preprocessing historic draws like a signal or time series in order to craft higher-level features, we hope empower those curious about the lottery with an accessible dataset that encourages further exploration and discussion.
We have several features based on our newly developed concept of Universe Length, or the number of different numbers drawn in a given time frame. For instance, in a 1/10 lottery, it would be 6 since there are 6 different numbers drawn over ten draws. We also provide labels such as parity - the number of even numbers in both current and running window frames (e.g 10) - which can be used as targets for either classification or regression problems. Additionally, we include several other features depending on said running window frames which measure changes between single ball draws regarding things like Universe length drops/ increases , mean frequencies of each draw number's appearance , gaps between when each draw is pulled etc., all of which allow us an unprecedented level of detail into how lotteries behave over time.
We offer not just theoretical statistics but potential real world applications through these special columns: “date” simply showcases when the draw was taken along with specs such as “draw-id”; while “running-window-frame” demonstrates what frame size is being used for all other approximations built upon it; lastly Target T related columns give you numerous options for predicting high level concepts - for example target_universe_length_willFollowIncrease gives you an indicator on whether Universe Length will follow increase or decrease from previous result ones based solely off predetermined calculated feats from its respective table block .
The possibilities are indeed abundant! Try your hand at predicting upcoming money making outcomes or delve deep into hunting complex patterns relevant only to your own recognized criteria with this Open datasource today!
More Datasets
For more datasets, click here.
Featured Notebooks
- 🚨 Your notebook can be here! 🚨!
How to use the dataset
Introduction
This dataset contains historic draws for the Euromillions lottery draw,as well as various predictive features and labels to facilitate further analysis. The aim of this guide is to provide an overview on how best to utilize this dataset.
Research Ideas
- Building models to predict the frequency and universe length of future draws. This could be used by lottery players to help them decide which numbers they should pick, or by data scientists to analyze the probability of certain draws.
- Applying machine learning algorithms such as random forests or neural networks to discover patterns in the dataset and draw conclusions about the underlying probability distribution of a particular lottery and its outcomes.
- Investigating different approaches for manipulating running windows frames in order to determine more accurate predictions regarding target features (such as the parity of future draws). This could potentially lead to more accurate predictions on a particular lottery's outcome, allowing players and data scientists alike to gain an advantage when playing or analyzing a certain lottery game
Acknowledgements
If you use this dataset in your research, please credit the original authors.
Data Source
License
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
Columns
File: euromillions.pool-1.features.csv
Column name |
Description |
universe-length |
The number of unique numbers drawn in a specified time frame. (Numeric) |
parity |
The number of even numbers in a draw. (Numeric) |
parity-over-frame |
The probability of an even number being drawn in a given frame. (Numeric) |
last-moving-direction-of-universe-length |
The direction of the last movement of the universe length. (Categorical) |
move-balance-of-universe-length |
The mean or median frequency of numbers appearing in previous draws. (Numeric) |
move-balance-of-universe-length_mean-early |
The mean frequency of numbers appearing in early draws. (Numeric) |
universe-length-drop |
The probability of a universe length drop in the next draw. (Numeric) |
universe-length-increase |
The probability of a universe length increase in the next draw. (Numeric) |
universe-length-repetition-same |
The probability of a universe length repetition in the next draw. (Numeric) |
greater-universe-length-than-repetition |
The probability of a universe length greater than the repetition in the next draw. (Numeric) |
mean-frequency-of-drawn-numbers |
The mean frequency of numbers appearing in the draw. (Numeric) |
median-frequency-of-drawn-numbers |
The median frequency of numbers appearing in the draw. (Numeric) |
universe-length-willFollowIncrease |
The probability of a universe length increase in the next draw. (Numeric) |
date |
The date of the draw. (Date) |
File: euromillions.pool-2.features.csv
Column name |
Description |
universe-length |
The number of unique numbers drawn in a specified time frame. (Numeric) |
parity |
The number of even numbers in a draw. (Numeric) |
parity-over-frame |
The probability of an even number being drawn in a given frame. (Numeric) |
last-moving-direction-of-universe-length |
The direction of the last movement of the universe length. (Categorical) |
move-balance-of-universe-length |
The mean or median frequency of numbers appearing in previous draws. (Numeric) |
move-balance-of-universe-length_mean-early |
The mean frequency of numbers appearing in early draws. (Numeric) |
universe-length-drop |
The probability of a universe length drop in the next draw. (Numeric) |
universe-length-increase |
The probability of a universe length increase in the next draw. (Numeric) |
universe-length-repetition-same |
The probability of a universe length repetition in the next draw. (Numeric) |
greater-universe-length-than-repetition |
The probability of a universe length greater than the repetition in the next draw. (Numeric) |
mean-frequency-of-drawn-numbers |
The mean frequency of numbers appearing in the draw. (Numeric) |
median-frequency-of-drawn-numbers |
The median frequency of numbers appearing in the draw. (Numeric) |
universe-length-willFollowIncrease |
The probability of a universe length increase in the next draw. (Numeric) |
date |
The date of the draw. (Date) |
Acknowledgements
If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit Jeffrey Mvutu Mabilama.