﻿<?xml version="1.0" encoding="utf-8"?><rss version="2.0"><channel><title>Advances in Bioinformatics</title><link>http://www.hindawi.com</link><description>The latest articles from Hindawi Publishing Corporation</description><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright><item><title>Metagenome Fragment Classification Using N-Mer  Frequency Profiles</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/205969</link><description>A vast amount of microbial sequencing data is being generated through large-scale projects in ecology, agriculture, and human health. Efficient high-throughput methods are needed to analyze the mass amounts of metagenomic data, all DNA present in an environmental sample. A major obstacle in metagenomics is the inability to obtain accuracy using technology that yields short reads. We construct the unique  N-mer frequency profiles of 635 microbial genomes publicly available as of February 2008. These profiles  are used to train a naive Bayes classifier (NBC) that can be used to identify the genome of any fragment. We show that our method is comparable to BLAST for small 25 bp fragments but does not have the ambiguity of BLAST&amp;#39;s tied top scores. We demonstrate that this approach is scalable to identify any fragment from hundreds of genomes. It also performs quite well at the strain, species, and genera levels and achieves strain resolution despite classifying ubiquitous genomic fragments (gene and nongene regions). Cross-validation analysis demonstrates that species-accuracy achieves 90&amp;#37;  for highly-represented species containing an average of 8 strains. We demonstrate that such a tool can be used on the Sargasso Sea dataset, and our analysis shows that NBC can be further enhanced.</description><Author>Gail Rosen, Elaine Garbarine, Diamantino Caseiro, Robi Polikar, and Bahrad Sokhansanj</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Genomic Promoter Analysis Predicts Functional Transcription Factor Binding</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/369830</link><description>Background. The computational identification of functional transcription factor binding sites (TFBSs) remains a major challenge of computational biology. 
                  Results. 
                  We have analyzed the conserved promoter sequences for the complete set of human RefSeq genes using our conserved transcription factor binding site (CONFAC) software.  CONFAC identified 16296 human-mouse ortholog gene pairs, and of those pairs, 9107 genes contained conserved TFBS in the 3&amp;#x02009;kb proximal promoter and first intron. To attempt to predict in vivo occupancy of transcription factor binding sites, we developed a novel marginal effect isolator algorithm that builds upon Bayesian methods for multigroup TFBS filtering and predicted the in vivo occupancy of two transcription factors with an overall accuracy of 84&amp;#37;.   
                  Conclusion. Our analyses show that integration of chromatin immunoprecipitation data with conserved TFBS analysis can be used to generate accurate predictions of functional TFBS.  They also show that TFBS cooccurrence can be used to predict transcription factor binding to promoters in vivo.</description><Author>J. Sunil Rao, Suresh Karanam, Colleen D. McCabe, and Carlos S. Moreno</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Comparing Quantitative Trait Loci and Gene Expression Data</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/719818</link><description>We develop methods to compare the positions of quantitative trait loci (QTL) with a set of genes selected by other methods, such as microarray experiments, from a sequenced genome. We apply our methods to QTL for addictive behavior in mouse, and a set of genes upregulated in a region of the brain associated with addictive behavior, the nucleus accumbens (NA). The association between the QTL and NA genes is not significantly stronger than expected by chance. However, chromosomes 2 and 16 do show strong associations suggesting that genes on these chromosomes might be associated with addictive behavior. The statistical methodology developed for this study can be applied to similar studies to assess the mutual information in microarray and QTL analyses.</description><Author>Bing Han, Naomi S. Altman, Jessica A. Mong, Laura Cousino Klein, Donald W. Pfaff, and David J. Vandenbergh</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Genevestigator V3: A Reference Expression Database for the Meta-Analysis of Transcriptomes</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/420747</link><description>The Web-based software tool Genevestigator provides powerful tools for biologists to explore gene
expression across a wide variety of biological contexts. Its first releases, however, were limited by the scaling
ability of the system architecture, multiorganism data storage and analysis capability, and availability of
computationally intensive analysis methods. Genevestigator V3 is a novel meta-analysis system resulting
from new algorithmic and software development using a client/server architecture, large-scale manual
curation and quality control of microarray data for several organisms, and curation of pathway data for mouse
and Arabidopsis. In addition to improved querying features, Genevestigator V3 provides new tools to analyze
the expression of genes in many different contexts, to identify biomarker genes, to cluster genes into
expression modules, and to model expression responses in the context of metabolic and regulatory networks.
Being a reference expression database with user-friendly tools, Genevestigator V3 facilitates discovery
research and hypothesis validation.</description><Author>Tomas Hruz, Oliver Laule, Gabor Szabo, Frans Wessendorp, Stefan Bleuler, Lukas Oertle, Peter Widmayer, Wilhelm Gruissem, and Philip Zimmermann</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>A Tutorial of the Poisson Random Field Model in Population Genetics</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/257864</link><description>Population genetics is the study of allele frequency changes driven by various evolutionary forces such as mutation, natural selection, and random genetic drift.  Although natural selection is widely recognized as a bona-fide phenomenon, the extent to which it drives evolution continues to remain unclear and controversial. Various qualitative techniques, or so-called &amp;#8220;tests of neutrality&amp;#8221;, have been introduced to detect signatures of natural selection. A decade and a half ago, Stanley Sawyer and Daniel Hartl provided a mathematical framework, referred to as the Poisson random field (PRF), with which to determine quantitatively the intensity of selection on a particular gene or genomic region. The recent availability of large-scale genetic polymorphism data has sparked widespread interest in genome-wide investigations of natural selection. To that end, the original PRF model is of particular interest for geneticists and evolutionary genomicists. In this article, we will provide a tutorial of the mathematical derivation of the original Sawyer and Hartl PRF model.</description><Author>Praveen Sethupathy and Sridhar Hannenhalli</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Automated Quantitative Assessment of Proteins&amp;#39; Biological Function in Protein Knowledge Bases</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/897019</link><description>Primary protein sequence data are archived in databases together with information regarding corresponding biological functions. In this respect, UniProt/Swiss-Prot is currently the most comprehensive collection and it is routinely cross-examined when trying to unravel the biological role of hypothetical proteins. Bioscientists frequently extract single entries and further evaluate those on a subjective basis. In lieu of a standardized procedure for scoring the existing knowledge regarding individual proteins, we here report about a computer-assisted method, which we applied to score the present knowledge about any given Swiss-Prot entry.   Applying this quantitative score allows the comparison of proteins with respect to their sequence yet highlights the comprehension of functional data. pfs analysis may be also applied for quality control of individual entries or for database management in order to rank entry listings.</description><Author>Gabriele Mayr, G&amp;#252;nter Lepperdinger, and Peter Lackner</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item></channel></rss>