Baselight

Clustering Exercises

Clustering Using Methods You Know

@kaggle.joonasyoon_clustering_exercises

Loading...
Loading...

About this Dataset

Clustering Exercises

Overview

Context

The method of disuniting similar data is called clustering. you can create dummy data for classifying clusters by method from sklearn package but it needs to put your effort into job.

For users who making hard test cases for example of clustering, I think this dataset helps them.

Try out to select a meaningful number of clusters, and dividing the data into clusters. Here are exercises for you.

Dataset

All csv files contain a lots of x, y and color, and you can see above figures.

If you want to use position as type of integer, scale it and round off to integer as like x = round(x * 100).

Furthermore, here is GUI Tool to generate 2D points for clustering. you can make your dataset with this tool. https://www.joonas.io/cluster-paint


Stay tuned for further updates! also if any idea, you can comment me.

Tables

Basic1

@kaggle.joonasyoon_clustering_exercises.basic1
  • 191.71 KB
  • 9794 rows
  • 3 columns
Loading...

CREATE TABLE basic1 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Basic2

@kaggle.joonasyoon_clustering_exercises.basic2
  • 63.05 KB
  • 3192 rows
  • 3 columns
Loading...

CREATE TABLE basic2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Basic3

@kaggle.joonasyoon_clustering_exercises.basic3
  • 111.44 KB
  • 5710 rows
  • 3 columns
Loading...

CREATE TABLE basic3 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Basic4

@kaggle.joonasyoon_clustering_exercises.basic4
  • 244.49 KB
  • 12529 rows
  • 3 columns
Loading...

CREATE TABLE basic4 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Basic5

@kaggle.joonasyoon_clustering_exercises.basic5
  • 77.93 KB
  • 4000 rows
  • 3 columns
Loading...

CREATE TABLE basic5 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Blob

@kaggle.joonasyoon_clustering_exercises.blob
  • 79.56 KB
  • 4086 rows
  • 3 columns
Loading...

CREATE TABLE blob (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Box

@kaggle.joonasyoon_clustering_exercises.box
  • 140.93 KB
  • 7351 rows
  • 3 columns
Loading...

CREATE TABLE box (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Boxes

@kaggle.joonasyoon_clustering_exercises.boxes
  • 177.85 KB
  • 8901 rows
  • 3 columns
Loading...

CREATE TABLE boxes (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Boxes2

@kaggle.joonasyoon_clustering_exercises.boxes2
  • 220.19 KB
  • 11271 rows
  • 3 columns
Loading...

CREATE TABLE boxes2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Boxes3

@kaggle.joonasyoon_clustering_exercises.boxes3
  • 430.06 KB
  • 21600 rows
  • 3 columns
Loading...

CREATE TABLE boxes3 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Chrome

@kaggle.joonasyoon_clustering_exercises.chrome
  • 216.78 KB
  • 11093 rows
  • 3 columns
Loading...

CREATE TABLE chrome (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Dart

@kaggle.joonasyoon_clustering_exercises.dart
  • 140.47 KB
  • 7278 rows
  • 3 columns
Loading...

CREATE TABLE dart (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Dart2

@kaggle.joonasyoon_clustering_exercises.dart2
  • 131.09 KB
  • 6738 rows
  • 3 columns
Loading...

CREATE TABLE dart2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Face

@kaggle.joonasyoon_clustering_exercises.face
  • 33.73 KB
  • 1273 rows
  • 4 columns
Loading...

CREATE TABLE face (
  "unnamed_0_1" BIGINT,
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Hyperplane

@kaggle.joonasyoon_clustering_exercises.hyperplane
  • 35.84 KB
  • 1796 rows
  • 3 columns
Loading...

CREATE TABLE hyperplane (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Isolation

@kaggle.joonasyoon_clustering_exercises.isolation
  • 11.08 KB
  • 464 rows
  • 3 columns
Loading...

CREATE TABLE isolation (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Lines

@kaggle.joonasyoon_clustering_exercises.lines
  • 79.68 KB
  • 4065 rows
  • 3 columns
Loading...

CREATE TABLE lines (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Lines2

@kaggle.joonasyoon_clustering_exercises.lines2
  • 121.36 KB
  • 6195 rows
  • 3 columns
Loading...

CREATE TABLE lines2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Network

@kaggle.joonasyoon_clustering_exercises.network
  • 52.6 KB
  • 2634 rows
  • 3 columns
Loading...

CREATE TABLE network (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Outliers

@kaggle.joonasyoon_clustering_exercises.outliers
  • 18.75 KB
  • 876 rows
  • 3 columns
Loading...

CREATE TABLE outliers (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Ring

@kaggle.joonasyoon_clustering_exercises.ring
  • 22.04 KB
  • 1056 rows
  • 3 columns
Loading...

CREATE TABLE ring (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Sparse

@kaggle.joonasyoon_clustering_exercises.sparse
  • 11.27 KB
  • 474 rows
  • 3 columns
Loading...

CREATE TABLE sparse (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Spiral

@kaggle.joonasyoon_clustering_exercises.spiral
  • 172.52 KB
  • 8913 rows
  • 3 columns
Loading...

CREATE TABLE spiral (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Spiral2

@kaggle.joonasyoon_clustering_exercises.spiral2
  • 181.53 KB
  • 9325 rows
  • 3 columns
Loading...

CREATE TABLE spiral2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Spirals

@kaggle.joonasyoon_clustering_exercises.spirals
  • 46.49 KB
  • 2328 rows
  • 3 columns
Loading...

CREATE TABLE spirals (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Supernova

@kaggle.joonasyoon_clustering_exercises.supernova
  • 209.48 KB
  • 10714 rows
  • 3 columns
Loading...

CREATE TABLE supernova (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Triangle

@kaggle.joonasyoon_clustering_exercises.triangle
  • 12.17 KB
  • 517 rows
  • 3 columns
Loading...

CREATE TABLE triangle (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Un

@kaggle.joonasyoon_clustering_exercises.un
  • 115.48 KB
  • 5957 rows
  • 3 columns
Loading...

CREATE TABLE un (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Un2

@kaggle.joonasyoon_clustering_exercises.un2
  • 120.87 KB
  • 6202 rows
  • 3 columns
Loading...

CREATE TABLE un2 (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);

Wave

@kaggle.joonasyoon_clustering_exercises.wave
  • 248.99 KB
  • 12762 rows
  • 3 columns
Loading...

CREATE TABLE wave (
  "x" DOUBLE,
  "y" DOUBLE,
  "color" BIGINT
);