'The BindingDB is a public, web-accessible database of measured binding affinities for biomolecules, genetically or chemically modified biomolecules, and synthetic compounds.The database currently contains data generated by isothermal titration calorimetry (ITC) and enzyme inhibition (Enz. Inhib.) methods; other techniques will be included in the future. ... [Information of the supplier]
ChemMine is a compound mining database that facilitates drug and agrochemical discovery and chemical genomics screens. Its web service is divided into three major functional components: (1) Compound Database, comprising Annotation Search and Structure Search; (2) Cheminformatics Workbench; and (3) Screening Database. A detailed tutorial for using ChemMine's online services is available on the ReadMe page. ... [Information of the supplier, modified]
Cell Circuits is an open-access database of molecular network models that: (a) Bridges the gap between databases of individual pair-wise molecular interactions and databases of validated pathways. (b) Contains functional network hypothesis produced by algorithms that screen molecular interaction networks based on their correspondence with expression or phenotypic data, their internal structure, or their conservation across species. (c) Is searchable using protein/gene names and Gene Ontology terms. Models are available either as images or in machine-readable formats. (d) Serves as a clearinghouse in which theorists may distribute or revise models in need of validation and experimentalists may search for models or specific hypotheses relevant to their interests. ... [Information of the supplier, modified]
The purpose of the BioSamples database is to provide unified storage and access to information about biological samples. These samples may have investigation information stored in other databases (e.g. sequencing, structure, expression). Physical samples may also be available for further study from suppliers. The definition of a sample is flexible. Most samples are straightforward, such as tissue from an individual organism. However, a sample can also be an environmental sample (e.g. for meta-genomic analysis), a hybrid between two species, infected tissue, etc. Within the BioSamples Database, collections of samples are managed as "submission". Each submission is one SampleTab format file emailed to biosamples@ebi.ac.uk. Some highly-curated and highly-referenced submissions may become "reference samples", for example the ENCODE cell lines. These are samples that are of particular interest and have datasets from multiple technologies associated with them. ... [Information of the supplier]
Today, vast amounts of protein-small molecule binding sites can be found in the Protein Data Bank (PDB). Exhaustive comparison of them is computationally demanding, but useful in the prediction of protein functions and drug discovery. We proposed a tremendously fast algorithm called "SketchSort" that enables the enumeration of similar pairs in a huge number of protein-ligand binding sites. We conducted all-pair similarity searches for 3.4 million known and potential binding sites using the proposed method and discovered over 24 million similar pairs of binding sites. We present the results as a relational database Pocket Similarity Search using Multiple-Sketches (PoSSuM), which includes all the discovered pairs with annotations of various types (e.g., CATH, SCOP, EC number, Gene ontology). PoSSuM enables rapid exploration of similar binding sites among structures with different global folds as well as similar ones. Moreover, PoSSuM is useful for predicting the binding ligand for unbound structures. Basically, the users can search similar binding pockets using two search modes: i) "Search K" is useful for finding similar binding sites for a known ligand-binding site. Post a known ligand-binding site (a pair of "PDB ID" and "HET code") in the PDB, and PoSSuM will search similar sites for the query site; ii) "Search P" is useful for predicting ligands that potentially bind to a structure of interest. Post a known protein structure (PDB ID) in the PDB, and PoSSuM will search similar known-ligand binding sites for the query structure. ... [Information of the supplier]
Rhea is a reaction database, where all reaction participants (reactants and products) are linked to the ChEBI database (Chemical Entities of Biological Interest) which provides detailed information about structure, formula and charge. Rhea provides built-in validations that ensure both elemental and charge balance of the reactions. We have populated the database with the reactions found in the EC list (and in the IntEnz and ENZYME databases), extending it with additional known reactions of biological interest. While the main focus of Rhea is enzyme-catalysed reactions, other biochemical reactions (including those that are often termed "spontaneous") also are included. Rhea is a manually annotated resource and it provides: a)Stable reaction identifiers for each of its reactions. These identifiers are independent of EC numbers but are linked to EC numbers via cross-references; b)Directionality information if the physiological direction of the reaction is known; c)The possibility to link several reactions together to form complex reactions. This feature can also be used to split reactions into partial reactions; d)Extensive cross-references to other resources including enzyme-catalysed and other metabolic reactions, such as the EC list (in IntEnz), KEGG, MetaCyc and UniPathway; e)Chemical substructure and similarity searches on compounds in Rhea, thereby allowing reactions to be found where similar compounds are involved. The database is extensively cross-referenced. Reactions are currently linked to the EC list, KEGG and MetaCyc, and the reactions will be used in the IntEnz database and in all relevant UniProtKB entries. Furthermore, the reactions will also be used in the UniPathway database to generate pathways and metabolic networks. ... [Information of the supplier]
The European Bioinformatics Institute (EBI) is home to a number of bioinformatics resources that hold data relevant to the biomedical field, and is thus in a unique position to enhance the annotation of biological data by cross-referencing and mapping between resources. The "Structure Integration with Function, Taxonomy and Sequence" (SIFTS) initiative is a close collaboration between UniProt and the Protein Data Bank in Europe (PDBe) and aims to improve the integration of structural databases like the PDB and sequence databases like UniProt. SIFTS is the authoritative source of up-to-date residue-level annotation of structures in the PDB with data available in UniProt, IntEnz, CATH, SCOP, GO, InterPro, Pfam and PubMed. SIFTS is used by major resources such as RCSB, PDBsum, Pfam, SCOP, InterPro, DAS server providers and many research and service groups around the world. The project was started in 2000 and has resulted in creation of a mechanism for enhancing annotation and exchanging data between these two primary data resources. ... [Information of the supplier]
We present the Homo Sapiens Comprehensive Model Collection (HOCOMOCO) of transcription factor (TF) binding models obtained by careful integration of data from different sources. HOCOMOCO contains 426 non-redundant curated binding models for 401 human TFs. [Information of the supplier]