Baselight

Structural Protein Sequences

Sequence and meta data for various protein structures

@kaggle.shahir_protein_data_set

Pdb Data No Dups
@kaggle.shahir_protein_data_set.pdb_data_no_dups

  • 7.13 MB
  • 141401 rows
  • 14 columns
structureid

StructureId

classification

Classification

experimentaltechnique

ExperimentalTechnique

macromoleculetype

MacromoleculeType

residuecount

ResidueCount

resolution

Resolution

structuremolecularweight

StructureMolecularWeight

crystallizationmethod

CrystallizationMethod

crystallizationtempk

CrystallizationTempK

densitymatthews

DensityMatthews

densitypercentsol

DensityPercentSol

pdbxdetails

PdbxDetails

phvalue

PhValue

publicationyear

PublicationYear

100DDNA-RNA HYBRIDX-RAY DIFFRACTIONDNA/RNA Hybrid201.96360.3VAPOR DIFFUSION, HANGING DROP1.7830.89pH 7.00, VAPOR DIFFUSION, HANGING DROP71994
101DDNAX-RAY DIFFRACTIONDNA242.257939.35nan238.45nan1995
101MOXYGEN TRANSPORTX-RAY DIFFRACTIONProtein1542.0718112.8nan3.0960.23.0 M AMMONIUM SULFATE, 20 MM TRIS, 1MM EDTA, PH 9.091999
102DDNAX-RAY DIFFRACTIONDNA242.27637.17VAPOR DIFFUSION, SITTING DROP2772.2846.06pH 7.00, VAPOR DIFFUSION, SITTING DROP, temperature 277.00K71995
102LHYDROLASE(O-GLYCOSYL)X-RAY DIFFRACTIONProtein1651.7418926.61nan2.7555.28nan1993
102MOXYGEN TRANSPORTX-RAY DIFFRACTIONProtein1541.8418010.64nan3.0960.23.0 M AMMONIUM SULFATE, 20 MM TRIS, 1MM EDTA, PH 9.091999
103DDNASOLUTION NMRDNA247502.93nannan1994
103LHYDROLASE(O-GLYCOSYL)X-RAY DIFFRACTIONProtein1671.919092.72nan2.754.46nan1993
103MOXYGEN TRANSPORTX-RAY DIFFRACTIONProtein1542.0718093.78nan3.0960.33.0 M AMMONIUM SULFATE, 20 MM TRIS, 1MM EDTA, PH 9.091999
104DDNA-RNA HYBRIDSOLUTION NMRDNA/RNA Hybrid247454.78nannan1995

CREATE TABLE pdb_data_no_dups (
  "structureid" VARCHAR,
  "classification" VARCHAR,
  "experimentaltechnique" VARCHAR,
  "macromoleculetype" VARCHAR,
  "residuecount" BIGINT,
  "resolution" DOUBLE,
  "structuremolecularweight" DOUBLE,
  "crystallizationmethod" VARCHAR,
  "crystallizationtempk" DOUBLE,
  "densitymatthews" DOUBLE,
  "densitypercentsol" DOUBLE,
  "pdbxdetails" VARCHAR,
  "phvalue" DOUBLE,
  "publicationyear" DOUBLE
);