Baselight

BigQuery Fintech Dataset

Comprehensive fintech data for loan and customer analysis.

@kaggle.mustafakeser4_bigquery_fintech_dataset

About this Dataset

BigQuery Fintech Dataset

Dataset: cloud-training-demos.fintech

This dataset, hosted on BigQuery, is designed for financial technology (fintech) training and analysis. It comprises six interconnected tables, each providing detailed insights into various aspects of customer loans, loan purposes, and regional distributions. The dataset is ideal for practicing SQL queries, building data models, and conducting financial analytics.

Tables:

  1. customer:
    Contains records of individual customers, including demographic details and unique customer IDs. This table serves as a primary reference for analyzing customer behavior and loan distribution.

  2. loan:
    Includes detailed information about each loan issued, such as the loan amount, interest rate, and tenure. The table is crucial for analyzing lending patterns and financial outcomes.

  3. loan_count_by_year:
    Provides aggregated loan data by year, offering insights into yearly lending trends. This table helps in understanding the temporal dynamics of loan issuance.

  4. loan_purposes:
    Lists various reasons or purposes for which loans were issued, along with corresponding loan counts. This data can be used to analyze customer needs and market demands.

  5. loan_with_region:
    Combines loan data with regional information, allowing for geographical analysis of lending activities. This table is key for regional market analysis and understanding how loan distribution varies across different areas.

  6. state_region:
    Maps state names to their respective regions, enabling a more granular geographical analysis when combined with other tables in the dataset.

Use Cases:

  • Customer Segmentation: Analyze customer data to identify distinct segments based on demographics and loan behaviors.
  • Loan Analysis: Explore loan issuance patterns, interest rates, and purposes to uncover trends and insights.
  • Regional Analysis: Combine loan and region data to understand how loan distributions vary by geography.
  • Temporal Trends: Utilize the loan_count_by_year table to observe how lending patterns evolve over time.

This dataset is ideal for those looking to enhance their skills in SQL, financial data analysis, and BigQuery, providing a comprehensive foundation for fintech-related projects and case studies.

Share link

Anyone who has the link will be able to view this.