JASPAR is a collection of transcription factor DNA-binding preferences, modeled as matrices. These can be converted into Position Weight Matrices (PWMs or PSSMs), used for scanning genomic sequences. JASPAR is the only database with this scope where the data can be used with no restrictions (open-source). For a comprehensive review of models and how they can be used, please see the following reviews: DNA binding sites: representation and discovery. In: Bioinformatics. 2000 Jan;16(1):16-23; Applied bioinformatics for the identification of regulatory elements. In: Nat Rev Genet. 2004 Apr;5(4):276-87. The JASPAR CORE database contains a curated, non-redundant set of 123 profiles, derived from published collections of experimentally defined transcription factor binding sites for multicellular eukaryotes. The prime difference to similar resources (TRANSFAC, TESS etc) consist of the open data acess, non-redundancy and quality: JASPAR CORE is a smaller set that is non-redundant and curated. ... [Information of the supplier]
The Database for Aligned Ribosomal Complexes (DARC) site provides a resource for directly comparing the structures of available ribosomal complexes. The site offers a collection of files deposited in the RCSB protein data bank and the Electron Microscopy Data Bank, which have been aligned so as to make direct comparison of the structures possible. ... [Information of the supplier]
MfunGD provides a resource for annotated mouse proteins and their occurence in protein networks. Protein function annotation is performed using the Functional Catalogue (FunCat) annotation scheme, which is a hierarchically structured classification system (Ruepp et al., NAR, 2004). To provide up-to-date similarity search results and InterPro domain analyses, the protein entries are interconnected with the SIMAP database (Rattei et al., NAR, 2006). The gene models are based on the RefSeq mouse cDNAs (Pruitt et al., NAR, 2007) The work of our group is focussed on the annotation of biological systems. Therefore, results from the Mammalian Protein-Protein Interaction Database ( MPPI, (Pagel et al., Bioinformatics, 2005 )) and the Comprehensive Resource of Mammalian Protein Complexes (CORUM, (Ruepp et al., NAR, 2007)) are linked to the MfunGD dataset. Links to external resources are also provided (e.g. Refseq, Uniprot, UCSC Genome Browser). MfunGD is implemented in GenRE, a J2EE based component oriented multi-tier architecture (Mewes et al., NAR, 2006). ... [Information of the supplier]
The Death domain (DD) superfamily is one of the largest classes of protein interaction modules and plays a pivotal role in the apoptosis, inflammation, necrosis, and immune cell signaling pathways. Critical caspase activating complexes in the apoptosis and inflammation signaling pathways are assembled via the DD superfamily. These domains are also involved in recruiting downstream effectors for immune cell receptor signaling, intracellular pathogen sensing, and response to DNA damage. To stimulate future researches among scientists who are interested in the DD superfamily mediated signaling pathway, we have developed the Death Domain Database, a manually curated database that aims to provide comprehensive information on PPIs of human DD superfamily. The current version of the Death Domain Database documents 175 PPI pairs among 99 DDS by curating 295 peer-reviewed publications. The Death Domain Database provides a detailed summary of PPI data, which fits into 3 categories: interaction, characterization, and functional role. Users can find in-depth information specified in the literature on relevant analytical methods, structural information. The Death Domain Database has a user-friendly interface with several helpful features, including a search engine, an interaction map, and a function for cross-referencing useful external databases. Our Death Domain Database will provide a valuable tool to assist in understanding and organizing the molecular interaction network of the DD superfamily. ... [Information of the supplier]
No other database than ESTHER holds all alpha/beta hydrolase fold proteins together: Interpro, Prosite, Pfam, have multiple entries for subsets of this structural superfamily. A table Synthese shows the correspondance between these database entries and the subfamilies in ESTHER. The ESTHER Table is now a little to big to be usefull. Each file contains one of the 31219 non redundant proteins/genes. The tables grouped in the family table, the syntheses table or the structure table may be more usefull. The Gene_locus nomenclature for these non-redundant entries is a name with 5 characters for the organisms (3 for genera, 2 for the species, except when a common 5 character name exists. ex: ratno is for Rattus norvegicus and human for man. This allows us to keep close to the Swiss-Prot nomenclature). The last characters define the protein, ex: human-acche represents human acetylcholinesterase. ... [Information of the supplier]
SUPERFAMILY is a database of structural and functional annotation for all proteins and genomes. The SUPERFAMILY annotation is based on a collection of hidden Markov models, which represent structural protein domains at the SCOP superfamily level. A superfamily groups together domains which have an evolutionary relationship. The annotation is produced by scanning protein sequences from over 2,414 completely sequenced genomes against the hidden Markov models. For each protein you can: a) Submit sequences for SCOP classification; b) View domain organisation, sequence alignments and protein sequence details For each genome you can: a) Examine superfamily assignments, phylogenetic trees, domain organisation lists and networks; b) Check for over- and under-represented superfamilies within a genome For each superfamily you can: a) Inspect SCOP classification, functional annotation, Gene Ontology annotation, InterPro abstract and genome assignments; b) Explore taxonomic distribution of a superfamily across the tree of life All annotation, models and the database dump are freely available for download to everyone. SUPERFAMILY is a member of the InterPro consortium of protein annotation databases, and has been integrated into the Ensembl eukaryotic genome project and The Arabidopsis Information Resource. To date, the SUPERFAMILY publications have been cited over 1,000 times. SUPERFAMILY has been used in structural, functional, evolutionary and phylogenetic research projects. ... [Information of the supplier]
M phase, also called as cell division, is the most crucial and fundamental affair of the eukaryotic cell cycle. After the chromosomes have been replicated during the S phase, the sister chromatids are separated and distributed into two daughter cells equally and faithfully. Also, each daughter cell receives the almost average and necessary intracellular constituents and organelles from the mother cell. Generally, cell division consists of six stages, including prophase, prometaphase, metaphase, anaphase, telophase and cytokinesis. And the first five stages constituent mitosis. During mitosis, numerous proteins organize protein super-complexes at the three distinct regions of centrosome, kinetochore/centromere and cleavage furrow/midbody. Although many proteins have been identified to be localized on centrosome, kinetochore and/or midbody, an integrated resource on this area still remains not to be available. In this work, we have collected all proteins identified to be localized on kinetochore, centrosome, and/or midbody from two fungi (S. cerevisiae and S. pombe) and five animals, including C. elegans, D. melanogaster, X. laevis, M. musculus and H. sapiens. From the related literature of PubMed, numerous proteins have been manually curated to be localized on at least one of the sub-cellular localizations of kinetochore, centrosome and midbody. And to promise the quality of data, based on the rationale of "Seeing is believing", these proteins have been unambiguously observed under fluorescent microscope as directly supportive evidences. Then an integrated and searchable database MiCroKit - Midbody, Centrosome and Kinetochore has been established. The MiCroKit database is the first integrative resource to pin point most of identified components and related scientific information of midbody, centrosome and kinetochore. The version 1.0 of MiCroKit database was set up on Nov. 2nd, 2005, containing 1,065 unique proteins. The MiCroKit version 2.0 was released on Jun. 5th, 2006, with 1,120 entries. Currently, the MiCroKit 3.0 database was updated on July 9, 2009, containing 1,489 unique protein entries. ... [Information of the supplier, modified]
SEVENS summarizes GPCR (G-protein coupled receptor) genes that are identified with high accuracy from 56 eukaryote genomes, by a pipeline integrating such software as a gene finder, a sequence alignment tool, a motif and domain assignment tool, and a transmembrane helix predictor. This treats a larger data space (than that in currently available other databases), which should include not only the expressed sequences but also the newly identified sequences that cannot be detected by in vivo experiments, although they definitely exist on the genome sequence and are just waiting for the opportunity to express their functions. SEVENS can provides the infrastructure of general information of "GPCR universe" for comparative genomics. ... [Information of the supplier]