Baselight

Paradise-Panama-Papers

Data Scientists United Against Corruption

@kaggle.zusmani_paradisepanamapapers

Loading...
Loading...

About this Dataset

Paradise-Panama-Papers

Context

The Paradise Papers is a cache of some 13GB of data that contains 13.4 million confidential records of offshore investment by 120,000 people and companies in 19 tax jurisdictions (Tax Heavens - an awesome video to understand this); that was published by the International Consortium of Investigative Journalists (ICIJ) on November 5, 2017. Here is a brief video about the leak. The people include Queen Elizabeth II, the President of Columbia (Juan Manuel Santos), Former Prime Minister of Pakistan (Shaukat Aziz), U.S Secretary of Commerce (Wilbur Ross) and many more. According to an estimate by the Boston Consulting Group, the amount of money involved is around $10 trillion. The leak contains many famous companies, including Facebook, Apple, Uber, Nike, Walmart, Allianz, Siemens, McDonald’s and Yahoo.

It also contains a lot of U. S President Donald Trump allies including Rax Tillerson, Wilbur Ross, Koch Brothers, Paul Singer, Sheldon Adelson, Stephen Schwarzman, Thomas Barrack and Steve Wynn etc. The complete list of Politicians involve is avaiable here.

The Panama Papers in the cache of 38GB of data from the national corporate registry of Bahamas. It contains world’s top politicians and influential persons as head and director of offshore companies registered in Bahamas.

Offshore Leaks details 13,000 offshore accounts in a report.

I am calling all data scientists to help me stop the corruption and reveal the patterns and linkages invisible for the untrained eye.

Content

The data is the effort of more than 100 journalists from 60+ countries

The original data is available under creative common license and can be downloaded from this link.

I will keep updating the datasets with more leaks and data as it’s available

Acknowledgements

International Consortium of Investigative Journalists (ICIJ)

Paradise Papers Update

Paradise Papers data has been uploaded as released by ICIJ on Nov 21, 2017. You can find Paradise Papers zip file and six extracted files in CSV format, all starting with a prefix of Paradise. Happy Coding!

Inspiration

Some ideas worth exploring:

  1. How many companies and individuals are there in all of the leaks data

  2. How many countries involved

  3. Total money involved

  4. What is the biggest best tax heaven

  5. Can we compare the corruption with human development index and make an argument that would correlate corruption with bad conditions in that country

  6. Who are the biggest cheaters and where they live

  7. What role Fortune 500 companies play in this game

I need your help to make this world corruption free in the age of NLP and Big Data

Tables

Addresses

@kaggle.zusmani_paradisepanamapapers.addresses
  • 9.79 MB
  • 151605 rows
  • 8 columns
Loading...

CREATE TABLE addresses (
  "address" VARCHAR,
  "icij_id" VARCHAR,
  "valid_until" VARCHAR,
  "country_codes" VARCHAR,
  "countries" VARCHAR,
  "node_id" BIGINT,
  "sourceid" VARCHAR,
  "note" VARCHAR
);

All Edges

@kaggle.zusmani_paradisepanamapapers.all_edges
  • 11.96 MB
  • 1535552 rows
  • 7 columns
Loading...

CREATE TABLE all_edges (
  "node_1" BIGINT,
  "rel_type" VARCHAR,
  "node_2" BIGINT,
  "sourceid" VARCHAR,
  "valid_until" VARCHAR,
  "start_date" VARCHAR,
  "end_date" VARCHAR
);

Bahamas Leaks Edges

@kaggle.zusmani_paradisepanamapapers.bahamas_leaks_edges
  • 1.62 MB
  • 249190 rows
  • 7 columns
Loading...

CREATE TABLE bahamas_leaks_edges (
  "node_1" BIGINT,
  "rel_type" VARCHAR,
  "node_2" BIGINT,
  "r_sourceid" VARCHAR,
  "r_valid_until" VARCHAR,
  "r_start_date" VARCHAR,
  "r_end_date" VARCHAR
);

Bahamas Leaks Nodes Address

@kaggle.zusmani_paradisepanamapapers.bahamas_leaks_nodes_address
  • 27.91 KB
  • 551 rows
  • 18 columns
Loading...

CREATE TABLE bahamas_leaks_nodes_address (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Bahamas Leaks Nodes Entity

@kaggle.zusmani_paradisepanamapapers.bahamas_leaks_nodes_entity
  • 4.5 MB
  • 175888 rows
  • 18 columns
Loading...

CREATE TABLE bahamas_leaks_nodes_entity (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" TIMESTAMP,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Bahamas Leaks Nodes Intermediary

@kaggle.zusmani_paradisepanamapapers.bahamas_leaks_nodes_intermediary
  • 24.7 KB
  • 541 rows
  • 18 columns
Loading...

CREATE TABLE bahamas_leaks_nodes_intermediary (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Bahamas Leaks Nodes Officer

@kaggle.zusmani_paradisepanamapapers.bahamas_leaks_nodes_officer
  • 305.66 KB
  • 25262 rows
  • 18 columns
Loading...

CREATE TABLE bahamas_leaks_nodes_officer (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Entities

@kaggle.zusmani_paradisepanamapapers.entities
  • 26.77 MB
  • 495038 rows
  • 21 columns
Loading...

CREATE TABLE entities (
  "name" VARCHAR,
  "original_name" VARCHAR,
  "former_name" VARCHAR,
  "jurisdiction" VARCHAR,
  "jurisdiction_description" VARCHAR,
  "company_type" VARCHAR,
  "address" VARCHAR,
  "internal_id" DOUBLE,
  "incorporation_date" TIMESTAMP,
  "inactivation_date" TIMESTAMP,
  "struck_off_date" VARCHAR,
  "dorm_date" TIMESTAMP,
  "status" VARCHAR,
  "service_provider" VARCHAR,
  "ibcruc" VARCHAR,
  "country_codes" VARCHAR,
  "countries" VARCHAR,
  "note" VARCHAR,
  "valid_until" VARCHAR,
  "node_id" BIGINT,
  "sourceid" VARCHAR
);

Intermediaries

@kaggle.zusmani_paradisepanamapapers.intermediaries
  • 1.27 MB
  • 24177 rows
  • 10 columns
Loading...

CREATE TABLE intermediaries (
  "name" VARCHAR,
  "internal_id" VARCHAR,
  "address" VARCHAR,
  "valid_until" VARCHAR,
  "country_codes" VARCHAR,
  "countries" VARCHAR,
  "status" VARCHAR,
  "node_id" BIGINT,
  "sourceid" VARCHAR,
  "note" VARCHAR
);

Officers

@kaggle.zusmani_paradisepanamapapers.officers
  • 14.08 MB
  • 370854 rows
  • 8 columns
Loading...

CREATE TABLE officers (
  "name" VARCHAR,
  "icij_id" VARCHAR,
  "valid_until" VARCHAR,
  "country_codes" VARCHAR,
  "countries" VARCHAR,
  "node_id" BIGINT,
  "sourceid" VARCHAR,
  "note" VARCHAR
);

Offshore Leaks Edges

@kaggle.zusmani_paradisepanamapapers.offshore_leaks_edges
  • 3.87 MB
  • 561393 rows
  • 7 columns
Loading...

CREATE TABLE offshore_leaks_edges (
  "node_1" BIGINT,
  "rel_type" VARCHAR,
  "node_2" BIGINT,
  "r_sourceid" VARCHAR,
  "r_valid_until" VARCHAR,
  "r_start_date" TIMESTAMP,
  "r_end_date" TIMESTAMP
);

Offshore Leaks Nodes Address

@kaggle.zusmani_paradisepanamapapers.offshore_leaks_nodes_address
  • 2.72 MB
  • 57600 rows
  • 18 columns
Loading...

CREATE TABLE offshore_leaks_nodes_address (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Offshore Leaks Nodes Entity

@kaggle.zusmani_paradisepanamapapers.offshore_leaks_nodes_entity
  • 3.61 MB
  • 105516 rows
  • 18 columns
Loading...

CREATE TABLE offshore_leaks_nodes_entity (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" TIMESTAMP,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Offshore Leaks Nodes Intermediary

@kaggle.zusmani_paradisepanamapapers.offshore_leaks_nodes_intermediary
  • 248.89 KB
  • 9526 rows
  • 18 columns
Loading...

CREATE TABLE offshore_leaks_nodes_intermediary (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Offshore Leaks Nodes Officer

@kaggle.zusmani_paradisepanamapapers.offshore_leaks_nodes_officer
  • 2.37 MB
  • 107190 rows
  • 18 columns
Loading...

CREATE TABLE offshore_leaks_nodes_officer (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Panama Papers Edges

@kaggle.zusmani_paradisepanamapapers.panama_papers_edges
  • 6.01 MB
  • 674102 rows
  • 7 columns
Loading...

CREATE TABLE panama_papers_edges (
  "node_1" BIGINT,
  "rel_type" VARCHAR,
  "node_2" BIGINT,
  "r_sourceid" VARCHAR,
  "r_valid_until" VARCHAR,
  "r_start_date" VARCHAR,
  "r_end_date" VARCHAR
);

Panama Papers Nodes Address

@kaggle.zusmani_paradisepanamapapers.panama_papers_nodes_address
  • 4.14 MB
  • 93454 rows
  • 18 columns
Loading...

CREATE TABLE panama_papers_nodes_address (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Panama Papers Nodes Entity

@kaggle.zusmani_paradisepanamapapers.panama_papers_nodes_entity
  • 9.62 MB
  • 213634 rows
  • 18 columns
Loading...

CREATE TABLE panama_papers_nodes_entity (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" TIMESTAMP,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Panama Papers Nodes Intermediary

@kaggle.zusmani_paradisepanamapapers.panama_papers_nodes_intermediary
  • 960.38 KB
  • 14110 rows
  • 18 columns
Loading...

CREATE TABLE panama_papers_nodes_intermediary (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Panama Papers Nodes Officer

@kaggle.zusmani_paradisepanamapapers.panama_papers_nodes_officer
  • 3.97 MB
  • 238402 rows
  • 18 columns
Loading...

CREATE TABLE panama_papers_nodes_officer (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Paradise Papers Edges

@kaggle.zusmani_paradisepanamapapers.paradise_papers_edges
  • 2.69 MB
  • 364456 rows
  • 7 columns
Loading...

CREATE TABLE paradise_papers_edges (
  "node_1" BIGINT,
  "rel_type" VARCHAR,
  "node_2" BIGINT,
  "r_sourceid" VARCHAR,
  "r_valid_until" VARCHAR,
  "r_start_date" VARCHAR,
  "r_end_date" VARCHAR
);

Paradise Papers Nodes Address

@kaggle.zusmani_paradisepanamapapers.paradise_papers_nodes_address
  • 3.93 MB
  • 59228 rows
  • 18 columns
Loading...

CREATE TABLE paradise_papers_nodes_address (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Paradise Papers Nodes Entity

@kaggle.zusmani_paradisepanamapapers.paradise_papers_nodes_entity
  • 887.58 KB
  • 24957 rows
  • 18 columns
Loading...

CREATE TABLE paradise_papers_nodes_entity (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Paradise Papers Nodes Intermediary

@kaggle.zusmani_paradisepanamapapers.paradise_papers_nodes_intermediary
  • 17.65 KB
  • 186 rows
  • 18 columns
Loading...

CREATE TABLE paradise_papers_nodes_intermediary (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Paradise Papers Nodes Officer

@kaggle.zusmani_paradisepanamapapers.paradise_papers_nodes_officer
  • 1.83 MB
  • 77012 rows
  • 18 columns
Loading...

CREATE TABLE paradise_papers_nodes_officer (
  "labels_n" VARCHAR,
  "n_valid_until" VARCHAR,
  "n_country_codes" VARCHAR,
  "n_countries" VARCHAR,
  "n_node_id" BIGINT,
  "n_sourceid" VARCHAR,
  "n_address" VARCHAR,
  "n_name" VARCHAR,
  "n_jurisdiction_description" VARCHAR,
  "n_service_provider" VARCHAR,
  "n_jurisdiction" VARCHAR,
  "n_closed_date" VARCHAR,
  "n_incorporation_date" VARCHAR,
  "n_ibcruc" VARCHAR,
  "n_type" VARCHAR,
  "n_status" VARCHAR,
  "n_company_type" VARCHAR,
  "n_note" VARCHAR
);

Share link

Anyone who has the link will be able to view this.