Home > Research > Publications & Outputs > High-resolution human cytomegalovirus transcrip...

Links

Text available via DOI:

View graph of relations

High-resolution human cytomegalovirus transcriptome

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published
  • Derek Gatherer
  • Sepehr Seirafian
  • Charles Cunningham
  • Mary Holton
  • Derrick J. Dargan
  • Katarina Baluchova
  • Ralph D. Hector
  • Julie Galbraith
  • Pawel Herzyk
  • Gavin W. G. Wilkinson
  • Andrew J. Davison
Close
<mark>Journal publication date</mark>6/12/2011
<mark>Journal</mark>Proceedings of the National Academy of Sciences of the United States of America
Issue number49
Volume108
Number of pages6
Pages (from-to)19755-19760
Publication StatusPublished
<mark>Original language</mark>English

Abstract

Deep sequencing was used to bring high resolution to the human cytomegalovirus (HCMV) transcriptome at the stage when infectious virion production is under way, and major findings were confirmed by extensive experimentation using conventional techniques. The majority (65.1%) of polyadenylated viral RNA transcription is committed to producing four noncoding transcripts (RNA2.7, RNA1.2, RNA4.9, and RNA5.0) that do not substantially overlap designated protein-coding regions. Additional noncoding RNAs that are transcribed antisense to protein-coding regions map throughout the genome and account for 8.7% of transcription from these regions. RNA splicing is more common than recognized previously, which was evidenced by the identification of 229 potential donor and 132 acceptor sites, and it affects 58 protein-coding genes. The great majority (94) of 96 splice junctions most abundantly represented in the deep-sequencing data was confirmed by RT-PCR or RACE or supported by involvement in alternative splicing. Alternative splicing is frequent and particularly evident in four genes (RL8A, UL74A, UL124, and UL150A) that are transcribed by splicing from any one of many upstream exons. The analysis also resulted in the annotation of four previously unrecognized protein-coding regions (RL8A, RL9A, UL150A, and US33A), and expression of the UL150A protein was shown in the context of HCMV infection. The overall conclusion, that HCMV transcription is complex and multifaceted, has implications for the potential sophistication of virus functionality during infection. The study also illustrates the key contribution that deep sequencing can make to the genomics of nuclear DNA viruses.