Leukemia Gene Expression - CuMiDa

GSE9476 microarray experiment

@kaggle.brunogrisci_leukemia_gene_expression_cumida

About this Dataset

Dataset GSE9476 on leukemia gene expression from CuMiDa

  • 5 classes
  • 22284 genes
  • 64 samples

About

Here we present the Curated Microarray Database (CuMiDa), a repository containing 78 handpicked cancer microarray datasets, extensively curated from 30,000 studies from the Gene Expression Omnibus (GEO), solely for machine learning. The aim of CuMiDa is to offer homogeneous and state-of-the-art biological preprocessing of these datasets, together with numerous 3-fold cross-validation benchmark results to propel machine learning studies focused on cancer research. The database makes various download options available for use by other programs, as well as PCA and t-SNE results. CuMiDa differs from existing databases by offering newer datasets that are manually and carefully curated, from sample quality and unwanted probes to background correction and normalization, to create a more reliable source of data for computational research.

http://sbcb.inf.ufrgs.br/cumida

References

  • Feltes, B.C.; Chandelier, E.B.; Grisci, B.I.; Dorn, M. (2019) CuMiDa: An Extensively Curated Microarray Database for Benchmarking and Testing of Machine Learning Approaches in Cancer Research. Journal of Computational Biology, 26 (4), 376-386. [https://doi.org/10.1089/cmb.2018.0238]

  • Grisci, B.I.; Feltes, B.C.; Dorn, M. (2019) Neuroevolution as a tool for microarray gene expression pattern identification in cancer research. Journal of Biomedical Informatics, 89, 122-133. [https://doi.org/10.1016/j.jbi.2018.11.013]

Inspiration

  • How to deal with class imbalance for classification?
  • How to identify the most important genes for the classification of each cancer subtype?
  • Is it possible to discover subtypes?
  • How to beat the classification and clustering benchmarks for this dataset listed on the CuMiDa website?
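
As a starting point for the class imbalance question, the sketch below shows two common ingredients: inverse-frequency class weights and a stratified 3-fold split (3 folds to mirror the cross-validation setup mentioned in the About section). It is only an illustration under stated assumptions: the class names and per-class counts are hypothetical placeholders that match just the totals given above (5 classes, 64 samples), and the helper names `balanced_class_weights` and `stratified_folds` are invented for this example.

```python
from collections import Counter, defaultdict
import random


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * k) for cls, k in counts.items()}


def stratified_folds(labels, n_folds=3, seed=0):
    """Return n_folds lists of sample indices with a similar class mix in each."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    folds = [[] for _ in range(n_folds)]
    offset = 0
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Deal this class's samples round-robin, continuing where the
        # previous class stopped so that fold sizes stay even overall.
        for pos, idx in enumerate(indices):
            folds[(offset + pos) % n_folds].append(idx)
        offset += len(indices)
    return folds


# Hypothetical labels: 5 classes and 64 samples, with made-up class sizes.
labels = (
    ["class_0"] * 26
    + ["class_1"] * 10
    + ["class_2"] * 10
    + ["class_3"] * 10
    + ["class_4"] * 8
)

weights = balanced_class_weights(labels)
folds = stratified_folds(labels, n_folds=3)

for cls in sorted(weights):
    print(cls, round(weights[cls], 3))
print([len(fold) for fold in folds])
```

Rare classes receive larger weights, which a classifier's loss function can use to offset their small sample counts. Each fold can serve once as the test set while the other two are used for training.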

Tables

Leukemia Gse9476

@kaggle.brunogrisci_leukemia_gene_expression_cumida.leukemia_gse9476
  • 26.85 MB
  • 64 rows
  • 22285 columns

CREATE TABLE leukemia_gse9476 (
  "samples" BIGINT,
  "type" VARCHAR,
  "n_1007_s_at" DOUBLE,
  "n_1053_at" DOUBLE,
  "n_117_at" DOUBLE,
  "n_121_at" DOUBLE,
  "n_1255_g_at" DOUBLE,
  "n_1294_at" DOUBLE,
  "n_1316_at" DOUBLE,
  "n_1320_at" DOUBLE,
  "n_1405_i_at" DOUBLE,
  "n_1431_at" DOUBLE,
  "n_1438_at" DOUBLE,
  "n_1487_at" DOUBLE,
  "n_1494_f_at" DOUBLE,
  "n_1598_g_at" DOUBLE,
  "n_160020_at" DOUBLE,
  "n_1729_at" DOUBLE,
  "n_1773_at" DOUBLE,
  "n_177_at" DOUBLE,
  "n_179_at" DOUBLE,
  "n_1861_at" DOUBLE,
  "n_200000_s_at" DOUBLE,
  "n_200001_at" DOUBLE,
  "n_200002_at" DOUBLE,
  "n_200003_s_at" DOUBLE,
  "n_200004_at" DOUBLE,
  "n_200005_at" DOUBLE,
  "n_200006_at" DOUBLE,
  "n_200007_at" DOUBLE,
  "n_200008_s_at" DOUBLE,
  "n_200009_at" DOUBLE,
  "n_200010_at" DOUBLE,
  "n_200011_s_at" DOUBLE,
  "n_200012_x_at" DOUBLE,
  "n_200013_at" DOUBLE,
  "n_200014_s_at" DOUBLE,
  "n_200015_s_at" DOUBLE,
  "n_200016_x_at" DOUBLE,
  "n_200017_at" DOUBLE,
  "n_200018_at" DOUBLE,
  "n_200019_s_at" DOUBLE,
  "n_200020_at" DOUBLE,
  "n_200021_at" DOUBLE,
  "n_200022_at" DOUBLE,
  "n_200023_s_at" DOUBLE,
  "n_200024_at" DOUBLE,
  "n_200025_s_at" DOUBLE,
  "n_200026_at" DOUBLE,
  "n_200027_at" DOUBLE,
  "n_200028_s_at" DOUBLE,
  "n_200029_at" DOUBLE,
  "n_200030_s_at" DOUBLE,
  "n_200031_s_at" DOUBLE,
  "n_200032_s_at" DOUBLE,
  "n_200033_at" DOUBLE,
  "n_200034_s_at" DOUBLE,
  "n_200035_at" DOUBLE,
  "n_200036_s_at" DOUBLE,
  "n_200037_s_at" DOUBLE,
  "n_200038_s_at" DOUBLE,
  "n_200039_s_at" DOUBLE,
  "n_200040_at" DOUBLE,
  "n_200041_s_at" DOUBLE,
  "n_200042_at" DOUBLE,
  "n_200043_at" DOUBLE,
  "n_200044_at" DOUBLE,
  "n_200045_at" DOUBLE,
  "n_200046_at" DOUBLE,
  "n_200047_s_at" DOUBLE,
  "n_200048_s_at" DOUBLE,
  "n_200049_at" DOUBLE,
  "n_200050_at" DOUBLE,
  "n_200051_at" DOUBLE,
  "n_200052_s_at" DOUBLE,
  "n_200053_at" DOUBLE,
  "n_200054_at" DOUBLE,
  "n_200055_at" DOUBLE,
  "n_200056_s_at" DOUBLE,
  "n_200057_s_at" DOUBLE,
  "n_200058_s_at" DOUBLE,
  "n_200059_s_at" DOUBLE,
  "n_200060_s_at" DOUBLE,
  "n_200061_s_at" DOUBLE,
  "n_200062_s_at" DOUBLE,
  "n_200063_s_at" DOUBLE,
  "n_200064_at" DOUBLE,
  "n_200065_s_at" DOUBLE,
  "n_200066_at" DOUBLE,
  "n_200067_x_at" DOUBLE,
  "n_200068_s_at" DOUBLE,
  "n_200069_at" DOUBLE,
  "n_200070_at" DOUBLE,
  "n_200071_at" DOUBLE,
  "n_200072_s_at" DOUBLE,
  "n_200073_s_at" DOUBLE,
  "n_200074_s_at" DOUBLE,
  "n_200075_s_at" DOUBLE,
  "n_200076_s_at" DOUBLE,
  "n_200077_s_at" DOUBLE
);
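
To illustrate how a table with this layout might be queried, here is a minimal sketch that uses Python's built-in `sqlite3` module as a stand-in for whichever engine actually hosts the table. It recreates only the first four columns of the schema above and fills them with made-up rows, so none of the numbers are dataset values. The `probe_id` helper is hypothetical: it assumes the `n_` prefix was added on import so that column names do not start with a digit, which the page itself does not confirm.

```python
import sqlite3


def probe_id(column_name):
    """Strip the assumed 'n_' import prefix from an expression column name."""
    return column_name[2:] if column_name.startswith("n_") else column_name


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE leukemia_gse9476 ("
    '"samples" BIGINT, "type" VARCHAR, '
    '"n_1007_s_at" DOUBLE, "n_1053_at" DOUBLE)'
)

# Made-up rows for illustration only; these are not values from the dataset.
rows = [
    (1, "class_a", 7.5, 6.0),
    (2, "class_a", 8.5, 6.5),
    (3, "class_b", 7.0, 9.0),
]
conn.executemany("INSERT INTO leukemia_gse9476 VALUES (?, ?, ?, ?)", rows)

# Count samples per class and average one expression column per class.
query = (
    'SELECT "type", COUNT(*) AS n_samples, AVG("n_1007_s_at") AS mean_expr '
    'FROM leukemia_gse9476 GROUP BY "type" ORDER BY "type"'
)
summary = conn.execute(query).fetchall()

for sample_type, n_samples, mean_expr in summary:
    print(sample_type, n_samples, mean_expr)
print(probe_id("n_1007_s_at"))
```

The same `GROUP BY "type"` pattern gives a quick view of class balance before any modeling. Quoting the column names keeps the query valid in engines that treat `type` as a keyword.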
