ReLeaSE is a dataset, consisting of molecular structures and their corresponding

Read this article to get unlock the wonderful world Deep Reinforcement Learning for Drug Design

ReLeaSE is a public dataset, consisting of molecular structures and their corresponding binding affinity to proteins. The dataset was created for the purpose of evaluating and comparing machine learning models for the prediction of protein-ligand binding affinity.

The dataset contains a total of 10,000 molecules and their binding affinity to several target proteins, including thrombin, kinase, and protease. The molecular structures are represented using Simplified Molecular Input Line Entry System (SMILES) notation, which is a standardized method for representing molecular structures as a string of characters. The binding affinity is represented as a negative logarithm of the dissociation constant (pKd), which is a measure of the strength of the interaction between the molecule and the target protein.

The ReLeaSE dataset provides a standardized benchmark for evaluating machine learning models for protein-ligand binding affinity prediction. The dataset is publicly available and can be used for research purposes, making it an important resource for the drug discovery community.

Related Datasets

Optimism Blockchain

@blt
Emoticon Dataset

@kaggle
TGS SC2 Nasal Positivity

@cdc
Dhds Dataset

@cdc
Bioconcentration Factor (logBCF) Dataset Curated And Enriched Using The Enalos Tools And Enalos KNIME Nodes For Machine Learning Analysis (SCENARIOS Project)

@zenodo
Pl@ntNet-300K-v2 Image Dataset

@zenodo

Optimism Blockchain

Emoticon Dataset

TGS SC2 Nasal Positivity

Dhds Dataset

Bioconcentration Factor (logBCF) Dataset Curated And Enriched Using The Enalos Tools And Enalos KNIME Nodes For Machine Learning Analysis (SCENARIOS Project)

Pl@ntNet-300K-v2 Image Dataset