Articolo in rivista, 2014, ENG, 10.1371/journal.pone.0085260

Genome-Wide Analysis of Promoters: Clustering by Alignment and Analysis of Regular Patterns

Lucia Pettinato (1,2); Elisa Calistri (3,4); Francesca Di Patti (1,3); Roberto Livi (1,2,3,5); Stefano Luccioli (5,6)

(1) Dipartimento di Fisica e Astronomia, Università degli Studi di Firenze, Sesto Fiorentino, Italy (2) Istituto Nazionale di Fisica Nucleare, Sesto Fiorentino, Italy (3) Centro Interdipartimentale per lo Studio delle Dinamiche Complesse, Sesto Fiorentino, Italy (4) Dipartimento di Biologia, Università degli Studi di Firenze, Sesto Fiorentino, Italy (5) Istituto dei Sistemi Complessi, Consiglio Nazionale delle Ricerche, Sesto Fiorentino, Italy (6) Joint Italian-Israeli Laboratory on Integrative Network Neuroscience, Tel Aviv University, Ramat Aviv, Israel

In this paper we perform a genome-wide analysis of H. sapiens promoters. To this aim, we developed and combined two mathematical methods that allow us to (i) classify promoters into groups characterized by specific global structural features, and (ii) recover, in full generality, any regular sequence in the different classes of promoters. One of the main findings of this analysis is that H. sapiens promoters can be classified into three main groups. Two of them are distinguished by the prevalence of weak or strong nucleotides and are characterized by short compositionally biased sequences, while the most frequent regular sequences in the third group are strongly correlated with transposons. Taking advantage of the generality of these mathematical procedures, we have compared the promoter database of H. sapiens with those of other species. We have found that the above-mentioned features characterize also the evolutionary content appearing in mammalian promoters, at variance with ancestral species in the phylogenetic tree, that exhibit a definitely lower level of differentiation among promoters.

PloS one 9 (1), pp. e85260–?

Keywords

DNA promoters, sequence alignment, clustering, regular sequences

CNR authors

Luccioli Stefano, Livi Roberto

CNR institutes

ISC – Istituto dei sistemi complessi

ID: 278573

Year: 2014

Type: Articolo in rivista

Creation: 2014-03-06 10:00:00.000

Last update: 2016-02-03 11:30:51.000

External IDs

CNR OAI-PMH: oai:it.cnr:prodotti:278573

DOI: 10.1371/journal.pone.0085260

ISI Web of Science (WOS): 000330283100032

Scopus: 2-s2.0-84899865528