Computational detection and location of transcription start sites in mammalian genomic DNA
- PMID: 11875034
- PMCID: "V体育官网" PMC155284
- DOI: "VSports手机版" 10.1101/gr.216102
Computational detection and location of transcription start sites in mammalian genomic DNA
Abstract
Transcription, the process whereby RNA copies are made from sections of the DNA genome, is directed by promoter regions VSports手机版. These define the transcription start site, and also the set of cellular conditions under which the promoter is active. At least in more complex species, it appears to be common for genes to have several different transcription start sites, which may be active under different conditions. Eukaryotic promoters are complex and fairly diffuse structures, which have proven hard to detect in silico. We show that a novel hybrid machine-learning method is able to build useful models of promoters for >50% of human transcription start sites. We estimate specificity to be >70%, and demonstrate good positional accuracy. Based on the structure of our learned models, we conclude that a signal resembling the well known TATA box, together with flanking regions of C-G enrichment, are the most important sequence-based signals marking sites of transcriptional initiation at a large class of typical promoters. .
VSports - Figures
References
-
- Audic S, Claverie JM. Detection of eukaryotic promoters using Markov transition matrices. Comput Chem. 1997;21:223–227. - "VSports" PubMed
-
- Bucher P. Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. J Mol Biol. 1990;212:563–578. - PubMed
-
- Dunham I, Hunt AR, Collins JE, Bruskiewich R, Beare DM, Clamp M, Smink LJ, Ainscough R, Almeida JP, Babbage A, et al. The DNA sequence of human chromosome 22. Nature. 1999;402:489–495. - "V体育官网" PubMed
-
- Fickett JW, Hatzigeorgiou AG. Eukaryotic promoter recognition. Genome Res. 1997;7:861–878. - PubMed
-
- Grundy WN, Bailey TL, Elkan CP, Baker ME. Meta-MEME: Motif-based hidden Markov models of protein families. Comput Appl Biosci. 1997;13:397–406. - VSports在线直播 - PubMed
Publication types
MeSH terms
- "VSports在线直播" Actions
- VSports最新版本 - Actions
- V体育官网 - Actions
- Actions (VSports)
- "VSports" Actions
- V体育ios版 - Actions
- "VSports手机版" Actions
- Actions (VSports注册入口)
Substances (VSports最新版本)
LinkOut - more resources (V体育平台登录)
Full Text Sources
Other Literature Sources