Baselight

Brain Cancer Gene Expression - CuMiDa

GSE50161 microarray experiment

@kaggle.brunogrisci_brain_cancer_gene_expression_cumida

Loading...
Loading...

About this Dataset

Brain Cancer Gene Expression - CuMiDa

Dataset GSE50161 on brain cancer gene expression from CuMiDa

  • 5 classes
  • 54676 genes
  • 130 samples

About

Here we present the Curated Microarray Database (CuMiDa), a repository containing 78 handpicked cancer microarray datasets, extensively curated from 30.000 studies from the Gene Expression Omnibus (GEO), solely for machine learning. The aim of CuMiDa is to offer homogeneous and state-of-the-art biological preprocessing of these datasets, together with numerous 3-fold cross validation benchmark results to propel machine learning studies focused on cancer research. The database make available various download options to be employed by other programs, as well for PCA and t-SNE results. CuMiDa stands different from existing databases for offering newer datasets, manually and carefully curated, from samples quality, unwanted probes, background correction and normalization, to create a more reliable source of data for computational research.

http://sbcb.inf.ufrgs.br/cumida

References

  • Feltes, B.C.; Chandelier, E.B.; Grisci, B.I.; Dorn, M. (2019) CuMiDa: An Extensively Curated Microarray Database for Benchmarking and Testing of Machine Learning Approaches in Cancer Research. Journal of Computational Biology, 26 (4), 376-386. [https://doi.org/10.1089/cmb.2018.0238]

  • Grisci, B. I., Feltes, B. C., & Dorn, M. (2019). Neuroevolution as a tool for microarray gene expression pattern identification in cancer research. Journal of biomedical informatics, 89, 122-133. [https://doi.org/10.1016/j.jbi.2018.11.013]

Inspiration

  • How to deal with class imbalance for classification?
  • How to identify the most important genes for the classification of each cancer subtype?
  • Is it possible to discover subtypes?
  • How to beat the classification and clustering benchmarks for this dataset listed on the CuMiDa website?

Tables

Brain Gse50161

@kaggle.brunogrisci_brain_cancer_gene_expression_cumida.brain_gse50161
  • 102.85 MB
  • 130 rows
  • 54,677 columns
Loading...
CREATE TABLE brain_gse50161 (
  "samples" BIGINT,
  "type" VARCHAR,
  "n_1007_s_at" DOUBLE  -- 1007 S At,
  "n_1053_at" DOUBLE  -- 1053 At,
  "n_117_at" DOUBLE  -- 117 At,
  "n_121_at" DOUBLE  -- 121 At,
  "n_1255_g_at" DOUBLE  -- 1255 G At,
  "n_1294_at" DOUBLE  -- 1294 At,
  "n_1316_at" DOUBLE  -- 1316 At,
  "n_1320_at" DOUBLE  -- 1320 At,
  "n_1405_i_at" DOUBLE  -- 1405 I At,
  "n_1431_at" DOUBLE  -- 1431 At,
  "n_1438_at" DOUBLE  -- 1438 At,
  "n_1487_at" DOUBLE  -- 1487 At,
  "n_1494_f_at" DOUBLE  -- 1494 F At,
  "n_1552256_a_at" DOUBLE  -- 1552256 A At,
  "n_1552257_a_at" DOUBLE  -- 1552257 A At,
  "n_1552258_at" DOUBLE  -- 1552258 At,
  "n_1552261_at" DOUBLE  -- 1552261 At,
  "n_1552263_at" DOUBLE  -- 1552263 At,
  "n_1552264_a_at" DOUBLE  -- 1552264 A At,
  "n_1552266_at" DOUBLE  -- 1552266 At,
  "n_1552269_at" DOUBLE  -- 1552269 At,
  "n_1552271_at" DOUBLE  -- 1552271 At,
  "n_1552272_a_at" DOUBLE  -- 1552272 A At,
  "n_1552274_at" DOUBLE  -- 1552274 At,
  "n_1552275_s_at" DOUBLE  -- 1552275 S At,
  "n_1552276_a_at" DOUBLE  -- 1552276 A At,
  "n_1552277_a_at" DOUBLE  -- 1552277 A At,
  "n_1552278_a_at" DOUBLE  -- 1552278 A At,
  "n_1552279_a_at" DOUBLE  -- 1552279 A At,
  "n_1552280_at" DOUBLE  -- 1552280 At,
  "n_1552281_at" DOUBLE  -- 1552281 At,
  "n_1552283_s_at" DOUBLE  -- 1552283 S At,
  "n_1552286_at" DOUBLE  -- 1552286 At,
  "n_1552287_s_at" DOUBLE  -- 1552287 S At,
  "n_1552288_at" DOUBLE  -- 1552288 At,
  "n_1552289_a_at" DOUBLE  -- 1552289 A At,
  "n_1552291_at" DOUBLE  -- 1552291 At,
  "n_1552293_at" DOUBLE  -- 1552293 At,
  "n_1552295_a_at" DOUBLE  -- 1552295 A At,
  "n_1552296_at" DOUBLE  -- 1552296 At,
  "n_1552299_at" DOUBLE  -- 1552299 At,
  "n_1552301_a_at" DOUBLE  -- 1552301 A At,
  "n_1552302_at" DOUBLE  -- 1552302 At,
  "n_1552303_a_at" DOUBLE  -- 1552303 A At,
  "n_1552304_at" DOUBLE  -- 1552304 At,
  "n_1552306_at" DOUBLE  -- 1552306 At,
  "n_1552307_a_at" DOUBLE  -- 1552307 A At,
  "n_1552309_a_at" DOUBLE  -- 1552309 A At,
  "n_1552310_at" DOUBLE  -- 1552310 At,
  "n_1552311_a_at" DOUBLE  -- 1552311 A At,
  "n_1552312_a_at" DOUBLE  -- 1552312 A At,
  "n_1552314_a_at" DOUBLE  -- 1552314 A At,
  "n_1552315_at" DOUBLE  -- 1552315 At,
  "n_1552316_a_at" DOUBLE  -- 1552316 A At,
  "n_1552318_at" DOUBLE  -- 1552318 At,
  "n_1552319_a_at" DOUBLE  -- 1552319 A At,
  "n_1552320_a_at" DOUBLE  -- 1552320 A At,
  "n_1552321_a_at" DOUBLE  -- 1552321 A At,
  "n_1552322_at" DOUBLE  -- 1552322 At,
  "n_1552323_s_at" DOUBLE  -- 1552323 S At,
  "n_1552325_at" DOUBLE  -- 1552325 At,
  "n_1552326_a_at" DOUBLE  -- 1552326 A At,
  "n_1552327_at" DOUBLE  -- 1552327 At,
  "n_1552329_at" DOUBLE  -- 1552329 At,
  "n_1552330_at" DOUBLE  -- 1552330 At,
  "n_1552332_at" DOUBLE  -- 1552332 At,
  "n_1552334_at" DOUBLE  -- 1552334 At,
  "n_1552335_at" DOUBLE  -- 1552335 At,
  "n_1552337_s_at" DOUBLE  -- 1552337 S At,
  "n_1552338_at" DOUBLE  -- 1552338 At,
  "n_1552340_at" DOUBLE  -- 1552340 At,
  "n_1552343_s_at" DOUBLE  -- 1552343 S At,
  "n_1552344_s_at" DOUBLE  -- 1552344 S At,
  "n_1552347_at" DOUBLE  -- 1552347 At,
  "n_1552348_at" DOUBLE  -- 1552348 At,
  "n_1552349_a_at" DOUBLE  -- 1552349 A At,
  "n_1552354_at" DOUBLE  -- 1552354 At,
  "n_1552355_s_at" DOUBLE  -- 1552355 S At,
  "n_1552359_at" DOUBLE  -- 1552359 At,
  "n_1552360_a_at" DOUBLE  -- 1552360 A At,
  "n_1552362_a_at" DOUBLE  -- 1552362 A At,
  "n_1552364_s_at" DOUBLE  -- 1552364 S At,
  "n_1552365_at" DOUBLE  -- 1552365 At,
  "n_1552367_a_at" DOUBLE  -- 1552367 A At,
  "n_1552368_at" DOUBLE  -- 1552368 At,
  "n_1552370_at" DOUBLE  -- 1552370 At,
  "n_1552372_at" DOUBLE  -- 1552372 At,
  "n_1552373_s_at" DOUBLE  -- 1552373 S At,
  "n_1552375_at" DOUBLE  -- 1552375 At,
  "n_1552377_s_at" DOUBLE  -- 1552377 S At,
  "n_1552378_s_at" DOUBLE  -- 1552378 S At,
  "n_1552379_at" DOUBLE  -- 1552379 At,
  "n_1552381_at" DOUBLE  -- 1552381 At,
  "n_1552383_at" DOUBLE  -- 1552383 At,
  "n_1552384_a_at" DOUBLE  -- 1552384 A At,
  "n_1552386_at" DOUBLE  -- 1552386 At,
  "n_1552388_at" DOUBLE  -- 1552388 At,
  "n_1552389_at" DOUBLE  -- 1552389 At
);

Share link

Anyone who has the link will be able to view this.