Genome Information By Organism
genomes including sequences, maps, chromosomes, assemblies and annotations
@kaggle.lsind18_genome_information_by_organism
genomes including sequences, maps, chromosomes, assemblies and annotations
@kaggle.lsind18_genome_information_by_organism
organism_nameOrganism Name | organism_groupsOrganism Groups | strainStrain | biosampleBioSample | bioprojectBioProject | assemblyAssembly | levelLevel | size_mbSize(Mb) | gcGC% | repliconsReplicons | wgsWGS | scaffoldsScaffolds | cdsCDS | release_dateRelease Date | genbank_ftpGenBank FTP | refseq_ftpRefSeq FTP | genesGenes |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Pyropia yezoensis | Eukaryota;Other;Other | nan | SAMN13316713 | PRJNA589917 | GCA_009829735.1 | Chromosome | 107.591 | 64.8454 | chromosome 1:CM020618.1; chromosome 2:CM020619.1; chromosome 3:CM020620.1 | WMLA01 | 28 | Fri Jan 03 2020 00:00:00 GMT+0000 (Coordinated Universal Time) | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/009/829/735/GCA_009829735.1_ASM982973v1 | nan | ||
Emiliania huxleyi CCMP1516 | Eukaryota;Protists;Other Protists | CCMP1516 | SAMN02744062 | PRJNA77753 | GCA_000372725.1 | Scaffold | 167.676 | 64.5 | nan | AHAL01 | 7795 | 38554 | Fri Apr 19 2013 00:00:00 GMT+0000 (Coordinated Universal Time) | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/372/725/GCA_000372725.1_Emiliana_huxleyi_CCMP1516_main_genome_assembly_v1.0 | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/372/725/GCF_000372725.1_Emiliana_huxleyi_CCMP1516_main_genome_assembly_v1.0 | 38549 |
Arabidopsis thaliana | Eukaryota;Plants;Land Plants | nan | SAMN03081427 | PRJNA10719 | GCA_000001735.2 | Chromosome | 119.669 | 36.0529 | chromosome 1:NC_003070.9/CP002684.1; chromosome 2:NC_003071.7/CP002685.1; chromosome 3:NC_003074.8/CP002686.1; chromosome 4:NC_003075.7/CP002687.1; chromosome 5:NC_003076.8/CP002688.1; mitochondrion MT:NC_037304.1/BK010421.1; chloroplast Pltd:NC_000932.1/AP000423.1 | nan | 7 | 48265 | Mon Aug 13 2001 00:00:00 GMT+0000 (Coordinated Universal Time) | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/001/735/GCA_000001735.2_TAIR10.1 | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/001/735/GCF_000001735.4_TAIR10.1 | 38311 |
Glycine max | Eukaryota;Plants;Land Plants | nan | SAMN00002965 | PRJNA19861 | GCA_000004515.4 | Chromosome | 979.046 | 35.1153 | chromosome 1:NC_016088.3/CM000834.3; chromosome 2:NC_016089.3/CM000835.3; chromosome 3:NC_016090.3/CM000836.3; chromosome 4:NC_016091.3/CM000837.3; chromosome 5:NC_038241.1/CM000838.2; chromosome 6:NC_038242.1/CM000839.3; chromosome 7:NC_038243.1/CM000840.3; chromosome 8:NC_038244.1/CM000841.3; chro… | ACUP03 | 1579 | 71219 | Tue Jan 05 2010 00:00:00 GMT+0000 (Coordinated Universal Time) | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/004/515/GCA_000004515.4_Glycine_max_v2.1 | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/004/515/GCF_000004515.5_Glycine_max_v2.1 | 59847 |
Medicago truncatula | Eukaryota;Plants;Land Plants | A17 | SAMN02299339 | PRJNA10791 | GCA_000219495.2 | Chromosome | 412.924 | 34.047 | chromosome 1:NC_016407.2/CM001217.2; chromosome 2:NC_016408.2/CM001218.2; chromosome 3:NC_016409.2/CM001219.2; chromosome 4:NC_016410.2/CM001220.2; chromosome 5:NC_016411.2/CM001221.2; chromosome 6:NC_016412.2/CM001222.2; chromosome 7:NC_016413.2/CM001223.2; chromosome 8:NC_016414.2/CM001224.2; chlo… | APNO01 | 2187 | 41939 | Fri Aug 12 2011 00:00:00 GMT+0000 (Coordinated Universal Time) | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/219/495/GCA_000219495.2_MedtrA17_4.0 | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/219/495/GCF_000219495.3_MedtrA17_4.0 | 37603 |
Solanum lycopersicum | Eukaryota;Plants;Land Plants | nan | SAMN02981290 | PRJNA119 | GCA_000188115.3 | Chromosome | 828.349 | 35.6991 | chromosome 1:NC_015438.3/CM001064.3; chromosome 2:NC_015439.3/CM001065.3; chromosome 3:NC_015440.3/CM001066.3; chromosome 4:NC_015441.3/CM001067.3; chromosome 5:NC_015442.3/CM001068.3; chromosome 6:NC_015443.3/CM001069.3; chromosome 7:NC_015444.3/CM001070.3; chromosome 8:NC_015445.3/CM001071.3; chro… | AEKE03 | 3150 | 37660 | Fri Dec 10 2010 00:00:00 GMT+0000 (Coordinated Universal Time) | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/188/115/GCA_000188115.3_SL3.0 | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/188/115/GCF_000188115.4_SL3.0 | 31200 |
Oryza sativa Japonica Group | Eukaryota;Plants;Land Plants | nan | SAMD00000397 | PRJNA12269 | GCA_001433935.1 | Chromosome | 374.423 | 43.5769 | chromosome 1:NC_029256.1/AP014957.1; chromosome 2:NC_029257.1/AP014958.1; chromosome 3:NC_029258.1/AP014959.1; chromosome 4:NC_029259.1/AP014960.1; chromosome 5:NC_029260.1/AP014961.1; chromosome 6:NC_029261.1/AP014962.1; chromosome 7:NC_029262.1/AP014963.1; chromosome 8:NC_029263.1/AP014964.1; chro… | nan | 58 | 42578 | Sat Oct 10 2015 00:00:00 GMT+0000 (Coordinated Universal Time) | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/001/433/935/GCA_001433935.1_IRGSP-1.0 | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/001/433/935/GCF_001433935.1_IRGSP-1.0 | 35219 |
Triticum aestivum | Eukaryota;Plants;Land Plants | nan | SAMEA4791365 | PRJEB27788 | GCA_900519105.1 | Chromosome | 14547.3 | 46.0544 | chromosome 1A:LS992080.1; chromosome 1B:LS992081.1; chromosome 1D:LS992082.1; chromosome 2A:LS992083.1; chromosome 2B:LS992084.1; chromosome 2D:LS992085.1; chromosome 3A:LS992086.1; chromosome 3B:LS992087.1; chromosome 3D:LS992088.1; chromosome 4A:LS992089.1; chromosome 4B:LS992090.1; chromosome 4D:… | nan | 22 | Sun Aug 19 2018 00:00:00 GMT+0000 (Coordinated Universal Time) | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/900/519/105/GCA_900519105.1_iwgsc_refseqv1.0 | nan | ||
Zea mays | Eukaryota;Plants;Land Plants | nan | SAMN04296295 | PRJNA10769 | GCA_000005005.6 | Chromosome | 2135.08 | 46.9109 | chromosome 1:NC_024459.2/CM007647.1; chromosome 2:NC_024460.2/CM007648.1; chromosome 3:NC_024461.2/CM007649.1; chromosome 4:NC_024462.2/CM000780.4; chromosome 5:NC_024463.2/CM000781.4; chromosome 6:NC_024464.2/CM000782.4; chromosome 7:NC_024465.2/CM007650.1; chromosome 8:NC_024466.2/CM000784.4; chro… | LPUQ01 | 598 | 58411 | Fri Jan 29 2010 00:00:00 GMT+0000 (Coordinated Universal Time) | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/005/005/GCA_000005005.6_B73_RefGen_v4 | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/005/005/GCF_000005005.2_B73_RefGen_v4 | 49296 |
Pneumocystis carinii B80 | Eukaryota;Fungi;Ascomycetes | B80 | SAMN02380717 | PRJNA223511 | GCA_001477545.1 | Contig | 7.66146 | 27.8 | nan | LFVZ01 | 62 | 3646 | Tue Dec 22 2015 00:00:00 GMT+0000 (Coordinated Universal Time) | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/001/477/545/GCA_001477545.1_Pneu_cari_B80_V3 | ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/001/477/545/GCF_001477545.1_Pneu_cari_B80_V3 | 3695 |
CREATE TABLE eukaryotes (
"organism_name" VARCHAR,
"organism_groups" VARCHAR,
"strain" VARCHAR,
"biosample" VARCHAR,
"bioproject" VARCHAR,
"assembly" VARCHAR,
"level" VARCHAR,
"size_mb" DOUBLE,
"gc" DOUBLE,
"replicons" VARCHAR,
"wgs" VARCHAR,
"scaffolds" BIGINT,
"cds" BIGINT,
"release_date" TIMESTAMP,
"genbank_ftp" VARCHAR,
"refseq_ftp" VARCHAR,
"genes" BIGINT
);
Anyone who has the link will be able to view this.