The primary goal of computational molecular biology, like molecular biology itself, is to understand the meaning of the genomic information and how this information is expressed. We are interested in the problems of predicting biological function of genes and gene products from their primary sequence and structure (sometimes known as functional genomics). We are interested in predicting structure of protein and DNA from its sequence, and understanding how and when genes are expressed. ... [Information of the supplier]
BOXSHADE is a program for pretty-printing multiple alignment output. The program itself doesn't do any alignment, you have to use a multiple alignment program like ClustalW or Pileup and use the output of these programs as input for BOXSHADE. Of course, you can also use manually aligned sequences (in a proper format). [Information of the supplier]
The HIV databases contain data on HIV genetic sequences, immunological epitopes, drug resistance-associated mutations, and vaccine trials. The website also gives access to a large number of tools that can be used to analyze these data. This project is funded by the Division of AIDS of the National Institute of Allergy and Infectious Diseases (NIAID), a part of the National Institutes of Health (NIH). ... [Information of the supplier]
The new J. Craig Venter Institute was formed in October 2006 through the merger of several affiliated and legacy organizations--The Institute for Genomic Research (TIGR) and The Center for the Advancement of Genomics (TCAG), The J. Craig Venter Science Foundation, The Joint Technology Center, and the Institute for Biological Energy Alternatives (IBEA). Today all these organizations have become one large multidisciplinary genomic-focused organization. With more than 500 scientists and staff, more than 250, 000 square feet of laboratory space, and locations in Rockville, Maryland and La Jolla, California, the new JCVI is a world leader in genomic research. ... [Information of the supplier]
CATH is a hierarchical classification of protein domain structures, which clusters proteins at four major levels, Class(C), Architecture(A), Topology(T) and Homologous superfamily (H).Class, derived from secondary structure content, is assigned for more than 90% of protein structures automatically. Architecture, which describes the gross orientation of secondary structures, independent of connectivities, is currently assigned manually. The topology level clusters structures into fold groups according to their topological connections and numbers of secondary structures. The homologous superfamilies cluster proteins with highly similar structures and functions. The assignments of structures to fold groups and homologous superfamilies are made by sequence and structure comparisons. The boundaries and assignments for each protein domain are determined using a combination of automated and manual procedures. These include computational techniques, empirical and statistical evidence, literature review and expert analysis. ... [Information of the supplier]
The emap Atlas is a digital Atlas of mouse embryonic development. It is based on the definitive books of mouse embryonic development by Theiler (1989) and Kaufman (1992) yet extends these studies by creating a series of interactive three-dimensional computer models of mouse embryos at successive stages of development with defined anatomical domains linked to a stage-by-stage ontology of anatomical names. The three components of the spatial framework are the 3D models (3D grey-level voxel images) of the underlying histology, a standard mouse anatomical nomenclature (MAN) and a set of spatial regions or domains defined within the 3D models which link the geometric space of the 3D images with the textual description of the same space given by the nomenclature. These three parts form the Edinburgh Mouse Atlas and are maintained as an online database as well as a series of CD-ROMs. The atlas form the spatio-temporal framework for an image-mapped gene-expression database emage. The database is now ready for public access and submission. See the emage pages for more details. emage is complemetary to the text-based gene-expression database (GXD) at the Jackson Lab. The nomenclature is shared and the two database are interoperable. ... [Information of the supplier]
The Rat Genome Database RatMap is focused on presenting rat genes, DNA-markers, QTL:s etc that is localized to chromosome. The database is dedicated to rat gene nomenclature and should be consulted for queries in such. RatMap is formally sorting under the (RGNC) and is maintained at the Dept for Cell and Molecular Biology, Göteborg University, Sweden. Within RatMap you can find information on: rat gene nomenclature, chromosomal positions for genes, DNA-markers, QTL:s etc., predicted position for more than 6000 rat genes (see GAPP), gene function, literature references, DNA-sequences with links to DDBJ/EMBL/GenBank, unigene and Locus Link ID:s and links. ... [Information of the supplier]
MOLMOL is a molecular graphics program for displaying, analyzing, and manipulating the three-dimensional structure of biological macromolecules, with special emphasis on the study of protein or DNA structures determined by NMR. The program runs on UNIX and Windows NT/95/98/2000 and is freely available. [Information of the supplier]
The HUSAR Bioinformatics Lab at the German Cancer Research Center (DKFZ, Heidelberg) provides cutting-edge bioinformatics support and training to the scientist of the (post-) genomic era. This includes the Heidelberg Unix Sequence Analysis Resources (HUSAR), a large collection of essential sequence analysis tools. We provide the most up-to-date databases and software packages. Our main package - HUSAR - offers more than 260 applications for DNA and protein analysis. In addition to the latest, complete version of the GCG package (about 50% of our applications), you can use tools developed by us and by other ranking researchers in the bioinformatics field. We offer more than 200 databases, which are updated as soon as new versions become available. The major databases are updated every night (e.g. EMBL, GenBank) or weekly (Swissprot, Unigene). With HUSAR you can analyse the newest sequence data available. This is essential, as the DNA databases currently double their sizes in less than 9 months. ... [Information of the supplier]
We present a neural network based method (ChloroP) for identifying chloroplast transit peptides and their cleavage sites. Using cross-validation, 88% of the sequences in our homology reduced training set were correctly classified as transit peptides or nontransit peptides. This performance level is well above that of the so far only publicly available chloroplast localization predictor PSORT. Cleavage sites are predicted using a scoring matrix derived by an automatic motif-finding algorithm. Approximately 60% of the known cleavage sites in our sequence collection were predicted to within +- 2 residues from the cleavage sites given in SWISS-PROT. An analysis of 715 A. thaliana sequences from SWISS-PROT suggests that the ChloroP method should be useful for the identification of putative transit peptides in genome-wide sequence data. ... [Information of the supplier]