On the structural repertoire of pools of short, random RNA sequences


A detailed knowledge of the mapping between sequence and structure spaces in populations of RNA molecules is essential to better understand their present-day functional properties, to envisage a plausible early evolution of RNA in a prebiotic chemical environment and to improve the design of in vitro evolution experiments, among others. Analysis of natural RNAs, as well as in vitro and computational studies, show that certain RNA structural motifs are much more abundant than others, pointing out a complex relation between sequence and structure. Within this framework, we have investigated computationally the structural properties of a large pool (10 molecules) of single-stranded, 35 nt-long, random RNA sequences. The secondary structures obtained are ranked and classified into structure families. The number of structures in main families is analytically calculated and compared with the numerical results. This permits a quantification of the fraction of structure space covered by a large pool of sequences. We further show that the number of structural motifs and their frequency is highly unbalanced with respect to the nucleotide composition: simple structures such as stem-loops and hairpins arise from sequences depleted in G, while more complex structures require an enrichment of G. In general, we observe a strong correlation between subfamilies-characterized by a fixed number of paired nucleotides-and nucleotide composition. Our results are compared to the structural repertoire obtained in a second pool where isolated base pairs are prohibited.

Publication DOI: https://doi.org/10.1016/j.jtbi.2008.02.018
Divisions: College of Engineering & Physical Sciences > Systems analytics research institute (SARI)
Additional Information: NOTICE: this is the author’s version of a work that was accepted for publication in Journal of theoretical biology. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Stich, M, Briones, C & Manrubia, SC, 'On the structural repertoire of pools of short, random RNA sequences' Journal of theoretical biology, vol. 252, no. 4 (2008) DOI http://dx.doi.org/10.1016/j.jtbi.2008.02.018
Uncontrolled Keywords: RNA motif,genotype–phenotype map,RNA folding,RNA world,structural family
Publication ISSN: 1095-8541
Last Modified: 08 Mar 2024 08:09
Date Deposited: 24 Sep 2013 10:00
Full Text Link:
Related URLs: http://www.scop ... tnerID=8YFLogxK (Scopus URL)
PURE Output Type: Article
Published Date: 2008-06-21
Authors: Stich, Michael (ORCID Profile 0000-0001-8862-1044)
Briones, Carlos
Manrubia, Susanna C.



Version: Accepted Version

Export / Share Citation


Additional statistics for this record