The ibm/biomed.omics.bl.sm.ma-ted-458m model family consists of a biomedical foundation model and its fine-tuned variants. The foundation model was trained on over 2 billion biological samples across multiple modalities, including proteins, small molecules, and single-cell gene data. Designed for robust performance, it achieves state-of-the-art results on a variety of tasks across the entire drug discovery pipeline and diverse biomedical domains.
The family is based on MAMMAL (Molecular Aligned Multi-Modal Architecture and Language), a flexible, multi-domain architecture with an adaptable task prompt syntax. The syntax allows dynamic combinations of tokens and scalars, enabling classification, regression, and generation tasks either within a single domain or across cross-domain entities.
This page demonstrates a variety of drug discovery and biomedical tasks for the model family. Select a task to open its demo.
Link to model card
MAMMAL-based Protein-Protein Interaction (PPI) demonstration
Given two protein sequences, estimate whether the proteins interact.
<SENTINEL_ID_0> contains the binding affinity class, which is <1> for interacting and <0> for non-interacting.
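As a rough illustration of reading that class token, the sketch below extracts the value that follows <SENTINEL_ID_0> from a decoded output string. The helper name parse_interaction_class and the exact shape of the decoded string (for example, a trailing <EOS>) are assumptions made for this example; they are not part of the model's published API.

```python
import re
from typing import Optional

# Hypothetical helper for illustration; the decoded-output layout is assumed.
# Matches the class token (<0> or <1>) that directly follows <SENTINEL_ID_0>.
_CLASS_AFTER_SENTINEL = re.compile(r"<SENTINEL_ID_0>\s*<([01])>")


def parse_interaction_class(decoded_output: str) -> Optional[int]:
    """Return 1 (interacting), 0 (non-interacting), or None if no class token is found."""
    match = _CLASS_AFTER_SENTINEL.search(decoded_output)
    return int(match.group(1)) if match else None


if __name__ == "__main__":
    # Example string is made up for the demo, not real model output.
    print(parse_interaction_class("<SENTINEL_ID_0><1><EOS>"))
```

Returning None rather than raising keeps the caller in control when the output does not contain a recognizable class token.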
MAMMAL-based Drug-Target binding affinity demonstration
Given a protein sequence and a drug (in SMILES), estimate the binding affinity.
MAMMAL-based protein solubility estimation
Given a protein sequence, estimate whether it is water-soluble.
MAMMAL-based TCRbeta-epitope binding affinity demonstration
Given a TCR beta chain and epitope amino acid sequences, estimate the binding affinity score.
MAMMAL-based small molecule blood-brain barrier penetration demonstration
Given a drug (in SMILES), estimate the likelihood that it will penetrate the blood-brain barrier.
MAMMAL-based small molecule toxicity trials failure estimation demonstration
Given a drug (in SMILES), estimate the likelihood that it will fail in clinical toxicity trials.
MAMMAL-based small molecule drug FDA approval demonstration
Given a drug (in SMILES), estimate the likelihood that it will be approved by the FDA.