Baselight

Lottery Draws (Euro-millions)

Exploring Features, Targets, and Possibilities

@kaggle.thedevastator_predictive_analysis_for_euromillions_lottery_dra

Loading...
Loading...

About this Dataset

Lottery Draws (Euro-millions)


Lottery Draws (Euro-millions)

Exploring Features, Targets, and Possibilities

By Jeffrey Mvutu Mabilama [source]


About this dataset

This dataset brings you the past results of two popular European lotteries, Euromillions and TrioMagic, in a format that allows for easy analysis and feature engineering. With this data, you can build models to predict future lottery draws accurately as well as to glean insights into what causes certain draws to be more likely than others. Through preprocessing historic draws like a signal or time series in order to craft higher-level features, we hope empower those curious about the lottery with an accessible dataset that encourages further exploration and discussion.

We have several features based on our newly developed concept of Universe Length, or the number of different numbers drawn in a given time frame. For instance, in a 1/10 lottery, it would be 6 since there are 6 different numbers drawn over ten draws. We also provide labels such as parity - the number of even numbers in both current and running window frames (e.g 10) - which can be used as targets for either classification or regression problems. Additionally, we include several other features depending on said running window frames which measure changes between single ball draws regarding things like Universe length drops/ increases , mean frequencies of each draw number's appearance , gaps between when each draw is pulled etc., all of which allow us an unprecedented level of detail into how lotteries behave over time.

We offer not just theoretical statistics but potential real world applications through these special columns: “date” simply showcases when the draw was taken along with specs such as “draw-id”; while “running-window-frame” demonstrates what frame size is being used for all other approximations built upon it; lastly Target T related columns give you numerous options for predicting high level concepts - for example target_universe_length_willFollowIncrease gives you an indicator on whether Universe Length will follow increase or decrease from previous result ones based solely off predetermined calculated feats from its respective table block .

The possibilities are indeed abundant! Try your hand at predicting upcoming money making outcomes or delve deep into hunting complex patterns relevant only to your own recognized criteria with this Open datasource today!

More Datasets

For more datasets, click here.

Featured Notebooks

  • 🚨 Your notebook can be here! 🚨!

How to use the dataset

Introduction

This dataset contains historic draws for the Euromillions lottery draw,as well as various predictive features and labels to facilitate further analysis. The aim of this guide is to provide an overview on how best to utilize this dataset.

Research Ideas

  • Building models to predict the frequency and universe length of future draws. This could be used by lottery players to help them decide which numbers they should pick, or by data scientists to analyze the probability of certain draws.
  • Applying machine learning algorithms such as random forests or neural networks to discover patterns in the dataset and draw conclusions about the underlying probability distribution of a particular lottery and its outcomes.
  • Investigating different approaches for manipulating running windows frames in order to determine more accurate predictions regarding target features (such as the parity of future draws). This could potentially lead to more accurate predictions on a particular lottery's outcome, allowing players and data scientists alike to gain an advantage when playing or analyzing a certain lottery game

Acknowledgements

If you use this dataset in your research, please credit the original authors.
Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: euromillions.pool-1.features.csv

Column name Description
universe-length The number of unique numbers drawn in a specified time frame. (Numeric)
parity The number of even numbers in a draw. (Numeric)
parity-over-frame The probability of an even number being drawn in a given frame. (Numeric)
last-moving-direction-of-universe-length The direction of the last movement of the universe length. (Categorical)
move-balance-of-universe-length The mean or median frequency of numbers appearing in previous draws. (Numeric)
move-balance-of-universe-length_mean-early The mean frequency of numbers appearing in early draws. (Numeric)
universe-length-drop The probability of a universe length drop in the next draw. (Numeric)
universe-length-increase The probability of a universe length increase in the next draw. (Numeric)
universe-length-repetition-same The probability of a universe length repetition in the next draw. (Numeric)
greater-universe-length-than-repetition The probability of a universe length greater than the repetition in the next draw. (Numeric)
mean-frequency-of-drawn-numbers The mean frequency of numbers appearing in the draw. (Numeric)
median-frequency-of-drawn-numbers The median frequency of numbers appearing in the draw. (Numeric)
universe-length-willFollowIncrease The probability of a universe length increase in the next draw. (Numeric)
date The date of the draw. (Date)

File: euromillions.pool-2.features.csv

Column name Description
universe-length The number of unique numbers drawn in a specified time frame. (Numeric)
parity The number of even numbers in a draw. (Numeric)
parity-over-frame The probability of an even number being drawn in a given frame. (Numeric)
last-moving-direction-of-universe-length The direction of the last movement of the universe length. (Categorical)
move-balance-of-universe-length The mean or median frequency of numbers appearing in previous draws. (Numeric)
move-balance-of-universe-length_mean-early The mean frequency of numbers appearing in early draws. (Numeric)
universe-length-drop The probability of a universe length drop in the next draw. (Numeric)
universe-length-increase The probability of a universe length increase in the next draw. (Numeric)
universe-length-repetition-same The probability of a universe length repetition in the next draw. (Numeric)
greater-universe-length-than-repetition The probability of a universe length greater than the repetition in the next draw. (Numeric)
mean-frequency-of-drawn-numbers The mean frequency of numbers appearing in the draw. (Numeric)
median-frequency-of-drawn-numbers The median frequency of numbers appearing in the draw. (Numeric)
universe-length-willFollowIncrease The probability of a universe length increase in the next draw. (Numeric)
date The date of the draw. (Date)

Acknowledgements

If you use this dataset in your research, please credit the original authors.
If you use this dataset in your research, please credit Jeffrey Mvutu Mabilama.

Tables

Euromillions Results Ch

@kaggle.thedevastator_predictive_analysis_for_euromillions_lottery_dra.euromillions_results_ch
  • 34.93 KB
  • 1060 rows
  • 5 columns
Loading...

CREATE TABLE euromillions_results_ch (
  "index" BIGINT,
  "date" VARCHAR,
  "numeros" VARCHAR,
  "etoiles" VARCHAR,
  "super_star" VARCHAR
);

Euromillions Pool 1 Features

@kaggle.thedevastator_predictive_analysis_for_euromillions_lottery_dra.euromillions_pool_1_features
  • 43.94 KB
  • 1050 rows
  • 17 columns
Loading...

CREATE TABLE euromillions_pool_1_features (
  "index" BIGINT,
  "universe_length" DOUBLE,
  "parity" DOUBLE,
  "parity_over_frame" DOUBLE,
  "last_moving_direction_of_universe_length" DOUBLE,
  "move_balance_of_universe_length" DOUBLE,
  "move_balance_of_universe_length_mean_early" DOUBLE,
  "universe_length_drop" DOUBLE,
  "universe_length_increase" DOUBLE,
  "universe_length_repetition_same" DOUBLE,
  "greater_universe_length_than_repetition" DOUBLE,
  "universe_length_didfollowincrease" DOUBLE,
  "mean_frequency_of_drawn_numbers" DOUBLE,
  "median_frequency_of_drawn_numbers" DOUBLE,
  "universe_length_willfollowincrease" DOUBLE,
  "date" VARCHAR,
  "draw" VARCHAR
);

Euromillions Pool 2 Features

@kaggle.thedevastator_predictive_analysis_for_euromillions_lottery_dra.euromillions_pool_2_features
  • 33.14 KB
  • 1050 rows
  • 17 columns
Loading...

CREATE TABLE euromillions_pool_2_features (
  "index" BIGINT,
  "universe_length" DOUBLE,
  "parity" DOUBLE,
  "parity_over_frame" DOUBLE,
  "last_moving_direction_of_universe_length" DOUBLE,
  "move_balance_of_universe_length" DOUBLE,
  "move_balance_of_universe_length_mean_early" DOUBLE,
  "universe_length_drop" DOUBLE,
  "universe_length_increase" DOUBLE,
  "universe_length_repetition_same" DOUBLE,
  "greater_universe_length_than_repetition" DOUBLE,
  "universe_length_didfollowincrease" DOUBLE,
  "mean_frequency_of_drawn_numbers" DOUBLE,
  "median_frequency_of_drawn_numbers" DOUBLE,
  "universe_length_willfollowincrease" DOUBLE,
  "date" VARCHAR,
  "draw" VARCHAR
);

Univ Length Over10 Short Col Names

@kaggle.thedevastator_predictive_analysis_for_euromillions_lottery_dra.univ_length_over10_short_col_names
  • 30.42 KB
  • 594 rows
  • 21 columns
Loading...

CREATE TABLE univ_length_over10_short_col_names (
  "index" BIGINT,
  "targettrend" BIGINT,
  "pred2ndnext" BIGINT,
  "pred1rstnext" BIGINT,
  "predwillfollowincreasecapacity" VARCHAR,
  "drawid" BIGINT,
  "ft_ulen_over10" BIGINT,
  "ft_ulen_over10_didfollowincreasecapacity" BOOLEAN,
  "ft_ulen_over10_lastmovingdirection" BIGINT,
  "ft_ulen_over10_shortmovingdirectionbalance" BIGINT,
  "ft_ulen_over10_previouslastmovingdirection" BIGINT,
  "ft_ulen_over10_largermovingdirectionbalance" BIGINT,
  "ft_ulen_over10_samevalueserie" BIGINT,
  "ft_ulen_over10_greaterthanserie" BIGINT,
  "ft_ulen_over10_lowerthanserie" BIGINT,
  "ft_ulen_over10_canincreaseof" BIGINT,
  "ft_ulen_over10_candecreaseof" BIGINT,
  "ft_effectifs_over10_andsupa20_meaneffsin" DOUBLE,
  "ft_effectifs_over10_andsupa20_meaneffsout" DOUBLE,
  "ft_effectifs_over10_andsupa20_medianeffsin" DOUBLE,
  "ft_effectifs_over10_andsupa20_medianeffsout" DOUBLE
);

Share link

Anyone who has the link will be able to view this.