Baselight

Clustering Exercises

Clustering Using Methods You Know

@kaggle.joonasyoon_clustering_exercises

About this Dataset

Overview

Context

Clustering is the task of grouping similar data points together. You can generate dummy data for clustering with functions from the sklearn package, but building good test cases that way takes effort.
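As a rough illustration of what generating dummy cluster data involves, the sketch below draws Gaussian blobs with only the standard library. It is a stand-in for helpers such as `sklearn.datasets.make_blobs`, not the method used to build this dataset; the function name `make_dummy_blobs`, the centers, and the spread are all assumptions chosen for the example.

```python
import random


def make_dummy_blobs(centers, points_per_center, spread=0.5, seed=0):
    """Return (x, y, color) rows drawn from a Gaussian blob around each center."""
    rng = random.Random(seed)  # fixed seed keeps the sample reproducible
    rows = []
    for color, (cx, cy) in enumerate(centers):
        for _ in range(points_per_center):
            rows.append((rng.gauss(cx, spread), rng.gauss(cy, spread), color))
    return rows


# Hypothetical centers; the color value is simply the index of the center.
rows = make_dummy_blobs(centers=[(0.0, 0.0), (5.0, 5.0), (0.0, 5.0)],
                        points_per_center=100)
print(len(rows), rows[0])
```

Round blobs like these are easy for most algorithms, which is why hand-drawn shapes such as the spirals and rings in this dataset make harder test cases.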

This dataset is meant to help users who need hard test cases for clustering examples.

Try to select a meaningful number of clusters and divide the data into them. Here are some exercises for you.
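One common way to approach this exercise is to run k-means for several values of k and look for the point where the within-cluster sum of squares (inertia) stops dropping sharply. The sketch below is a minimal, standard-library implementation of Lloyd's algorithm on made-up points; it is only one possible method, and the helper names (`kmeans`, `assign`, `squared_distance`) and the demo data are assumptions rather than part of the dataset.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two 2D points."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


def assign(points, centroids):
    """Label each point with the index of its nearest centroid."""
    return [min(range(len(centroids)),
                key=lambda c: squared_distance(p, centroids[c]))
            for p in points]


def kmeans(points, k, iterations=50, seed=0):
    """Plain Lloyd's algorithm; returns (centroids, labels, inertia)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    for _ in range(iterations):
        labels = assign(points, centroids)
        updated = []
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                updated.append((sum(p[0] for p in members) / len(members),
                                sum(p[1] for p in members) / len(members)))
            else:
                updated.append(centroids[c])  # leave an empty cluster in place
        if updated == centroids:
            break  # converged: centroids no longer move
        centroids = updated
    labels = assign(points, centroids)
    inertia = sum(squared_distance(p, centroids[label])
                  for p, label in zip(points, labels))
    return centroids, labels, inertia


# Made-up demo data: three well-separated blobs of 50 points each.
demo_rng = random.Random(1)
points = [(demo_rng.gauss(cx, 0.5), demo_rng.gauss(cy, 0.5))
          for cx, cy in [(0, 0), (10, 0), (5, 8)] for _ in range(50)]

# Inertia per k; a visible "elbow" hints at a reasonable number of clusters.
inertias = {k: kmeans(points, k)[2] for k in range(1, 6)}
for k, value in inertias.items():
    print(k, round(value, 1))
```

K-means assumes roughly round, similarly sized clusters, so it is likely to struggle on tables such as the spirals or rings; density-based or graph-based methods may be a better fit there.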

Dataset

All CSV files contain many rows of x, y, and color values, plotted in the figures above.

If you want integer positions, scale the values and round them to integers, for example x = round(x * 100).
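The scaling step can be sketched as follows. The sample rows are invented, the header names are assumed to match the x, y, color columns described above, and `to_integer_grid` is a hypothetical helper name.

```python
import csv
import io

# Hypothetical excerpt in the x, y, color layout; the values are made up.
SAMPLE_CSV = """x,y,color
0.1234,1.5678,0
-2.5011,3.0049,1
"""


def to_integer_grid(row, scale=100):
    """Scale float coordinates and round them to the nearest integer."""
    return (round(float(row["x"]) * scale),
            round(float(row["y"]) * scale),
            int(row["color"]))


rows = [to_integer_grid(r) for r in csv.DictReader(io.StringIO(SAMPLE_CSV))]
print(rows)
```

Note that Python's built-in `round` rounds exact halves to the nearest even integer, so a larger scale factor may be preferable if ties matter for your use case.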

There is also a GUI tool for generating 2D points for clustering, which you can use to make your own dataset: https://www.joonas.io/cluster-paint

Stay tuned for further updates! If you have any ideas, feel free to leave a comment.

Tables

Basic1

@kaggle.joonasyoon_clustering_exercises.basic1
  • 196.31 kB
  • 9,794 rows
  • 3 columns
CREATE TABLE basic1 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Basic2

@kaggle.joonasyoon_clustering_exercises.basic2
  • 64.56 kB
  • 3,192 rows
  • 3 columns
CREATE TABLE basic2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Basic3

@kaggle.joonasyoon_clustering_exercises.basic3
  • 114.11 kB
  • 5,710 rows
  • 3 columns
CREATE TABLE basic3 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Basic4

@kaggle.joonasyoon_clustering_exercises.basic4
  • 250.36 kB
  • 12,529 rows
  • 3 columns
CREATE TABLE basic4 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Basic5

@kaggle.joonasyoon_clustering_exercises.basic5
  • 79.8 kB
  • 4,000 rows
  • 3 columns
CREATE TABLE basic5 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Blob

@kaggle.joonasyoon_clustering_exercises.blob
  • 81.47 kB
  • 4,086 rows
  • 3 columns
CREATE TABLE blob (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Box

@kaggle.joonasyoon_clustering_exercises.box
  • 144.31 kB
  • 7,351 rows
  • 3 columns
CREATE TABLE box (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Boxes

@kaggle.joonasyoon_clustering_exercises.boxes
  • 182.12 kB
  • 8,901 rows
  • 3 columns
CREATE TABLE boxes (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Boxes2

@kaggle.joonasyoon_clustering_exercises.boxes2
  • 225.48 kB
  • 11,271 rows
  • 3 columns
CREATE TABLE boxes2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Boxes3

@kaggle.joonasyoon_clustering_exercises.boxes3
  • 440.38 kB
  • 21,600 rows
  • 3 columns
CREATE TABLE boxes3 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Chrome

@kaggle.joonasyoon_clustering_exercises.chrome
  • 221.98 kB
  • 11,093 rows
  • 3 columns
CREATE TABLE chrome (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Dart

@kaggle.joonasyoon_clustering_exercises.dart
  • 143.85 kB
  • 7,278 rows
  • 3 columns
CREATE TABLE dart (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Dart2

@kaggle.joonasyoon_clustering_exercises.dart2
  • 134.24 kB
  • 6,738 rows
  • 3 columns
CREATE TABLE dart2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Face

@kaggle.joonasyoon_clustering_exercises.face
  • 34.54 kB
  • 1,273 rows
  • 4 columns
CREATE TABLE face (
  "unnamed_0_1" BIGINT,  -- Unnamed: 0.1
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);
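Unlike the other tables, face carries a fourth column that looks like a leftover row index. A minimal sketch of dropping it while loading is shown below; the header name `Unnamed: 0.1` is taken from the comment in the table definition, while the sample rows and the helper name `load_points` are assumptions.

```python
import csv
import io

# Hypothetical excerpt of the face CSV; the values are made up.
SAMPLE_FACE_CSV = """Unnamed: 0.1,x,y,color
0,0.25,0.75,0
1,0.5,0.5,1
"""


def load_points(text):
    """Keep only x, y, and color, ignoring any extra index column."""
    points = []
    for row in csv.DictReader(io.StringIO(text)):
        points.append({"x": float(row["x"]),
                       "y": float(row["y"]),
                       "color": int(row["color"])})
    return points


face_points = load_points(SAMPLE_FACE_CSV)
print(face_points)
```

Selecting columns by name rather than by position lets the same loader work for the three-column tables as well.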

Hyperplane

@kaggle.joonasyoon_clustering_exercises.hyperplane
  • 36.7 kB
  • 1,796 rows
  • 3 columns
CREATE TABLE hyperplane (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Isolation

@kaggle.joonasyoon_clustering_exercises.isolation
  • 11.34 kB
  • 464 rows
  • 3 columns
CREATE TABLE isolation (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Lines

@kaggle.joonasyoon_clustering_exercises.lines
  • 81.59 kB
  • 4,065 rows
  • 3 columns
CREATE TABLE lines (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Lines2

@kaggle.joonasyoon_clustering_exercises.lines2
  • 124.27 kB
  • 6,195 rows
  • 3 columns
CREATE TABLE lines2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Network

@kaggle.joonasyoon_clustering_exercises.network
  • 53.86 kB
  • 2,634 rows
  • 3 columns
CREATE TABLE network (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Outliers

@kaggle.joonasyoon_clustering_exercises.outliers
  • 19.2 kB
  • 876 rows
  • 3 columns
CREATE TABLE outliers (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Ring

@kaggle.joonasyoon_clustering_exercises.ring
  • 22.57 kB
  • 1,056 rows
  • 3 columns
CREATE TABLE ring (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Sparse

@kaggle.joonasyoon_clustering_exercises.sparse
  • 11.54 kB
  • 474 rows
  • 3 columns
CREATE TABLE sparse (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Spiral

@kaggle.joonasyoon_clustering_exercises.spiral
  • 176.66 kB
  • 8,913 rows
  • 3 columns
CREATE TABLE spiral (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Spiral2

@kaggle.joonasyoon_clustering_exercises.spiral2
  • 185.89 kB
  • 9,325 rows
  • 3 columns
CREATE TABLE spiral2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Spirals

@kaggle.joonasyoon_clustering_exercises.spirals
  • 47.61 kB
  • 2,328 rows
  • 3 columns
CREATE TABLE spirals (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);
