Baselight

L 77/2020–2020-07-17 - Word Frequencies, Timeframe

L 77/2020 of 2020-07-17 converting D 34/2020 a.k.a. "Decreto Rilancio"

@kaggle.robertolofaro_l_772020_20200717_word_frequencies_timeframe

About this Dataset

L 77/2020–2020-07-17 - Word Frequencies, Timeframe

Context

Along with the dataset on UN SDG and the World Bank selected indicators previously released , in May 2020 released a dataset containing the whole list of articles and subdivisions within the "Decreto Rilancio", D.L. 34, as issued on 2020-05-19.

On July 17th, the Government Decree 34/2020 was converted into a law, and expanded in scope and content, as L. 77/2020, [published on the Gazzetta Ufficiale on 2020-07-18] (https://www.gazzettaufficiale.it/atto/serie_generale/caricaDettaglioAtto/originario?atto.dataPubblicazioneGazzetta=2020-07-18&atto.codiceRedazionale=20G00095&elenco30giorni=true)

This is the baseline dataset.

This implies that it does not yet contain any further amendments that might be introduced by further laws etc.

Content

The dataset used online contains all the textual content of the Decreto Legge, in Italian.

The tag cloud list on the right-hand side is from a local application using the same tag cloud search framework I already used for the ECB Speeches tag cloud search that since October 2019 update weekly on Sundays.

The webapp Legge 77/2020 is available online; for historical reference, the webapp Decreto 34/2020 will be still online with the original tag cloud, but, in order to avoid misunderstandings, the links to the actual articles reference the new text, as the aim of both webapps is to enable easier access to the content for those who need to reference the law (previously, decree).

By choice, instead of filtering out common Italian words, this dataset:

  1. lists each article within the Italian Government decree with the hierarchical structure that was most common within the decree:
    | Column in the dataset | Equivalent in the law | Contents |
    | --- | --- | --- |
    | item_id | none | a unique key (to allow future traceability should there be further amendments) |
    | what_chapter | Titolo | the main aggregations within the law |
    | what_section | Capo | most common subdivision - in some case there was a further subdivision, sezione, but was inserted within the description of the Capo to avoid adding a columns that would be mostly unused |
    | what_title | Articolo | one of the articles within the law |
    | what_frequencies | none | the computed frequencies for all the words within each article |
    | what_end | none | year maximum impacts according to the text |
    | what_timeframe | none | using the what_end column, a clusterization (label) in four categories |

  2. contains no filtering, as the purpose was to have the law converted into a format that would be useful for various data analysis and search/extraction/processing purposes, as well as tracking future evolution.

  3. these are the clusters adopted within the what_timeframe column: 2020, 2021, Multiyear, Structural

Locally, the database contains also additional categorizations, and links to the full text, published following the what_chapter and what_title on the same GitHub repository used for the Government Decree, updated.

PLEASE NOTE: I have no affiliation whatsoever with the Italian Government- I selected these data (along with others from other sources, e.g. Eurostat, OECD, World Bank, UN) just to support my publishing purposes on the use of Open Data for business and social projects and initiatives

More information on the concept, and associated past and future datasets or publications, please visit Data Democracy

Acknowledgements

Thanks to the Gazzetta Ufficiale for releasing on 2020-07-18 the searchable PDF version (used to load the database).

Inspiration

Connecting different data points to identify potential correlations, as part of my knowledge update/learning process (and to complement my other publication activities)

Share link

Anyone who has the link will be able to view this.