Baselight
Sign In
kaggle

🚢 Pakistan Maritime Trade & Shipping Dataset

Kaggle

@kaggle.hammadansari7_pakistan_maritime_trade_and_shipping_dataset

Loading...
Loading...

Pakistan from January 2020 to April 2026 Pakistan National Shipping Corporation

Dataset Description

🚢 Pakistan Maritime Trade & Shipping Dataset (2020–2026)


📌 About the Dataset

This synthetic dataset simulates maritime trade and shipping activity in Pakistan from January 2020 to April 2026. It is inspired by the operations of the Pakistan National Shipping Corporation (PNSC) — Pakistan's national maritime enterprise — and reflects realistic patterns in seaborne trade across Pakistan's major ports (Karachi Port, Port Qasim, and Gwadar).

The dataset covers 5,000 shipment records spanning tanker voyages, bulk carrier trade, container shipping, and break-bulk cargo — providing a comprehensive view of Pakistan's import/export flows via sea.


📦 Data Source & Methodology

  • Inspiration: Pakistan National Shipping Corporation (PNSC)
  • Type: Synthetic (algorithmically generated using pandas & numpy)
  • Realism: Exchange rates, freight rates, cargo types, port names, HS codes, and vessel specifications reflect real-world Pakistan maritime trade conditions
  • Scope: 2020–2026 | Imports & Exports | All major cargo segments

🗂️ Feature Descriptions (47 Columns)

🔖 Identifiers & Time

Column Type Description
Shipment_ID String Unique identifier per shipment (e.g., PKT-2022-00123)
Year Integer Year of departure (2020–2026)
Month Integer Month of departure (1–12)
Quarter String Fiscal quarter (Q1–Q4)

📅 Dates

Column Type Description
Departure_Date Date (YYYY-MM-DD) Date vessel left the port of loading
Arrival_Date Date (YYYY-MM-DD) Date vessel arrived at port of discharge
Transit_Days Integer Sea transit duration in days
Customs_Clearance_Days Float Days taken for customs clearance at destination

🌍 Trade Information

Column Type Description
Trade_Direction Categorical Import or Export
Trade_Category Categorical Sector: Energy, Agri-commodities, Manufactured Goods, etc.
Cargo_Type Categorical Specific commodity (e.g., Crude Oil, Wheat, Coal, Textiles)
HS_Code String Harmonized System customs classification code
Incoterms Categorical Trade terms: CIF, FOB, CFR, EXW, DDP, FAS
Payment_Terms Categorical LC, DP, DA, Open Account, Advance Payment
Currency Categorical Transaction currency (USD, EUR, AED, PKR, etc.)

🏗️ Ports

Column Type Description
Port_of_Loading String Origin port name and country
Port_of_Discharge String Destination port name

🚢 Vessel Information

Column Type Description
Vessel_Name String Name of the vessel
Vessel_Type Categorical Vessel category (VLCC, Aframax, Panamax Bulk Carrier, etc.)
Vessel_Flag Categorical Country of vessel registration
Shipping_Line Categorical Operating carrier (PNSC, Maersk, CMA CGM, etc.)
Vessel_DWT_MT Integer Deadweight tonnage capacity (metric tons)
Gross_Tonnage_GT Integer Gross tonnage of the vessel
Vessel_Age_Years Float Age of the vessel in years
Average_Speed_Knots Float Average sailing speed during voyage

📦 Cargo

Column Type Description
Cargo_Volume_MT Float Cargo weight in metric tons
TEU_Count Float Container count (only for container ships; NaN otherwise)

💰 Financials

Column Type Description
Freight_Rate_USD_per_Unit Float Rate per MT (or per TEU for containers) in USD
Total_Freight_USD Float Total freight cost in USD
Cargo_Value_USD Float Estimated CIF/FOB value of goods in USD
PKR_USD_Exchange_Rate Float PKR/USD rate at time of shipment
Cargo_Value_PKR Float Cargo value converted to Pakistani Rupees
Port_Charges_USD Float Port dues, pilotage, berthing fees in USD
Demurrage_Days Float Excess days in port beyond allowed laytime
Demurrage_Cost_USD Float Financial penalty for excess port time
Import_Duty_Pct Float Customs import duty percentage
Import_Duty_USD Float Calculated import duty in USD
Insurance_Pct Float Cargo insurance rate (% of cargo value)
Insurance_Cost_USD Float Total cargo insurance cost in USD
Bunker_Fuel_Consumed_MT Float Total bunker fuel consumed on voyage (MT)
Bunker_Price_USD_per_MT Float Market price of bunker fuel per MT
Bunker_Cost_USD Float Total fuel cost in USD
Total_Voyage_Cost_USD Float Sum of freight + port + demurrage + bunker + insurance

✅ Customs & Compliance

Column Type Description
Customs_Cleared Binary (Yes/No) Whether customs clearance was successfully obtained
ISM_Certified Binary (Yes/No) International Safety Management certification status
Safety_Incident Categorical Incident severity: None, Minor, Moderate
Environmental_Flag Categorical Environmental compliance: Clean, Minor Violation

🤖 Potential Machine Learning Tasks

This dataset is well-suited for a variety of ML tasks:

1. 📈 Regression

  • Freight Rate Prediction: Predict Freight_Rate_USD_per_Unit based on vessel type, cargo type, route, and season
  • Total Voyage Cost Estimation: Predict Total_Voyage_Cost_USD using vessel specs, route, cargo volume, and market conditions
  • Demurrage Cost Forecasting: Predict Demurrage_Cost_USD from cargo type, vessel type, and port
  • Cargo Value Estimation: Predict Cargo_Value_USD from cargo type, volume, and trade direction

2. 🏷️ Classification

  • Customs Clearance Prediction: Predict whether a shipment will be cleared (Customs_Cleared = Yes/No)
  • Safety Incident Classification: Classify shipments by risk level (Safety_Incident: None/Minor/Moderate)
  • Environmental Compliance: Flag potential violations (Environmental_Flag)
  • Trade Direction Classification: Classify shipments as Import or Export from voyage characteristics

3. ⏱️ Time Series Analysis

  • Monthly Trade Volume Trends: Forecast Pakistan's seaborne trade volumes over time
  • Freight Rate Seasonality: Detect seasonal patterns in shipping costs
  • PKR Exchange Rate Impact Analysis: Study correlation between PKR/USD rate and trade volumes

4. 🔍 Clustering / Unsupervised Learning

  • Shipment Profiling: Cluster shipments by cost structure and route efficiency
  • Port Congestion Analysis: Group shipments by clearance delays and demurrage
  • Vessel Performance Segmentation: Cluster vessels by fuel efficiency and speed

5. 🔎 Anomaly Detection

  • Fraud / Undervaluation Detection: Detect shipments with unusually low declared cargo values
  • Outlier Voyage Cost Detection: Identify abnormally expensive or cheap voyages

⚠️ Disclaimer

This is a synthetic dataset generated for educational and research purposes. All shipment IDs, vessel names, and financial figures are algorithmically simulated. They do not represent actual PNSC transactions or real trade data. The dataset is intended for ML model development, data science practice, and exploratory analysis.



Related Datasets

Share link

Anyone who has the link will be able to view this.