BioLiP protein function database

BioLiP is a semi-manually curated database for high-quality, biologically relevant ligand-protein binding interactions. The structure data are collected primarily from the Protein Data Bank, with biological insights mined from literature and other specific databases. BioLiP aims to construct the most comprehensive and accurate database for serving the needs of ligand-protein docking, virtual ligand screening and protein function annotation. Questions about the BioLiP Database can be posted at the Service System Discussion Board.

Since ligand molecules (e.g., Glycerol, Ethylene glycol) are often used as additives (i.e., false positives) for solving the protein structures, not all ligands present in the PDB database are biologically relevant. BioLiP uses a composite automated and manual procedure for examining the biological relevance of ligands in the PDB database. Each entry in BioLiP contains a comprehensive list of annotations on: To annotate the function of uncharacterized proteins using the BioLiP database, we developed a new algorithm COACH to predict ligand-binding sites. The COACH server is available at http://zhanglab.ccmb.med.umich.edu/COACH/


BioLiP in numbers


BioLiP is updated weekly (every Friday) and the current version (Aug 01, 2014) contains:
  • Number of entries: 285,404
  • Number of proteins from PDB: 62,638
  • Number of DNA/RNA ligands: 33,979
  • Number of peptide ligands: 13,221
  • Number of metal ligands: 80,208
  • Number of regular ligands: 157,996
  • Number of entries with binding affinity data: 23,492
    (10,971 from Binding MOAD, 16,980 from PDBbind-CN, 7,331 from BindingDB, and 64 from manual survey of the original literature)


BioLiP news


  • Current version: Aug 01, 2014 with 285,404 entries.
  • 2013-08-19: A link was added at the search and browse pages to allow users to export searching results in CSV format.
  • 2013-03-15: BioLiP was rebuilt with the following changes:
    • All DNA/RNA-protein interactions from PDB were added, which should be useful for studying Nucleic Acids-protein interactions. In previous versions, only small DNA/RNA ligands were considered.
    • The k-mer ligand "UUU" was divided into component ligands and removed from the database.
  • 2012-12-14: BioLiP was updated with 211,382 entries.
    • 102 ligands were added to the artifact ligand list, which helped us reduce about 1000 false positive entries.
  • 2012-12-07: BioLiP was updated with 211,819 entries.
    • 642 false postive entries for the ligands RG1 and IMD were manually removed.
  • 2012-11-30: BioLiP was updated with 212,148 entries.
    • The COACH algorithm was redesigned and is available at here.
  • 2012-11-16: BioLiP was updated with 211,040 entries.
    • 233 false postive entries for the ligands FTT and MYR were manually removed.
  • 2012-11-09: BioLiP was updated with 210,894 entries.
    • The number of BioLiP entries with binding affinity data was increased from 20127 to 22091 with the latest realse of the PDBbind-CN database.
  • 2012-11-02: BioLiP was updated with 210,377 entries.
    • 108 false postive entries for the ligands TRD and UND were manually removed.

  • >> Read more about BioLiP news ...

Reference:
Jianyi Yang, Ambrish Roy, and Yang Zhang. BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions, Nucleic Acids Research, 41: D1096-D1103 (2013) (download the PDF file).
 


yangzhanglabumich.edu | (734) 647-1549 | 100 Washtenaw Avenue Ann Arbor, MI 48109-2218