Scientists reveal how cells correct errors made in gene transcription

RNA polymerase II (Pol II), a key enzyme in gene expression, is responsible for transcribing DNA into messenger RNA (mRNA). Errors in transcription can have deleterious effects, because an erroneous mRNA is translated into protein repeatedly. Transcription infidelity may result in aging and in human diseases such as cancer. During transcription, Pol II can detect a mis-incorporated RNA nucleotide and backtrack to correct the error, ensuring that each messenger RNA matches the template DNA. However, how Pol II controls the fidelity of gene transcription remains largely a mystery.

Scientists from the Hong Kong University of Science and Technology (HKUST) elucidated the dynamics of backtracking at the atomistic level by building a kinetic model (a Markov State Model) based on extensive molecular dynamics simulations. They discovered that Pol II backtracking occurs in a stepwise manner: the RNA first moves away from the active site, reaching a "frayed" state, before the enzyme can reach the backtracked state. They also found that the rate-limiting step of the process is the transition from the frayed state to the final backtracked state.
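
The kind of kinetic model described above can be illustrated with a toy example. The sketch below is not the researchers' model: the four-state transition matrix is invented for illustration, the lag time is an arbitrary unit, and the helper names (solve_linear, stationary_distribution, mfpt_to) are hypothetical. It only shows how, given the probabilities of hopping between metastable states in one time step, equilibrium populations and mean first passage times (MFPT) could be computed.

```python
# Toy Markov State Model: NOT the published model. All numbers are made up.

def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def stationary_distribution(T):
    """Equilibrium populations pi with pi T = pi and sum(pi) = 1."""
    n = len(T)
    # Rows encode (T^T - I) pi = 0; the last row is replaced by sum(pi) = 1.
    A = [[T[j][i] - (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    A[-1] = [1.0] * n
    b = [0.0] * (n - 1) + [1.0]
    return solve_linear(A, b)

def mfpt_to(T, target, lag=1.0):
    """Mean first passage time from every state to `target`, in units of `lag`.

    Solves m_i = lag + sum over k != target of T[i][k] * m_k, with m_target = 0.
    """
    n = len(T)
    others = [i for i in range(n) if i != target]
    A = [[(1.0 if i == k else 0.0) - T[i][k] for k in others] for i in others]
    m = solve_linear(A, [lag] * len(others))
    result = [0.0] * n
    for i, value in zip(others, m):
        result[i] = value
    return result

# Hypothetical transition probabilities between four states (rows sum to 1).
# Index 0 stands in for a starting state and index 3 for a final state.
T = [
    [0.900, 0.100, 0.000, 0.000],
    [0.050, 0.900, 0.050, 0.000],
    [0.000, 0.100, 0.890, 0.010],
    [0.000, 0.000, 0.005, 0.995],
]

pi = stationary_distribution(T)
mfpt = mfpt_to(T, target=3)
print("equilibrium populations:", [round(p, 3) for p in pi])
print("MFPT to state 3 (lag units):", [round(t, 1) for t in mfpt])
```

In a real study the transition probabilities are estimated from many simulation trajectories rather than written down by hand, and the MFPT values are converted to physical time using the lag time of the model.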

Their findings were published in Nature Communications on April 19, 2016 (DOI: 10.1038/ncomms11244).

Representative structures for each of the four metastable states identified by our Markov State Model (MSM; S1-S4), connected by the top pathway from the pre-translocation state (S1) to the backtracked state (S4): S1 → S2 → S3 → S4. The equilibrium population of each state (bottom left corner) and the Mean First Passage Time (MFPT) of each transition (beside the arrow, in μs) are presented, with their corresponding errors given as follows: 10.5±1.5%, 22.4±2.1%, 4.0±0.7% and 63.1±2.8% (equilibrium population); 0.1±0.0 μs, 0.8±0.1 μs, 5.1±3.0 μs, 1.0±0.7 μs, 95.9±42.3 μs and 191.8±69.0 μs (MFPT). The hybrid RNA/DNA chains (red/blue), the Trigger Loop (purple), the Bridge Helix (green), Rpb1 residue Y836 and Rpb2 residue Y769 (both in grey) are shown. The DNA TN (dG) and its mismatched RNA nucleotide (rG) are highlighted as cyan and orange stick models, respectively. Credit: Department of Chemistry, HKUST

"We found that a critical amino acid (Rpb1 Threonine 831) serves as a sensing probe that detects the weak interaction between the mis-incorporated RNA and the template DNA. The bending motion of the bridge helix, the protein motif where this threonine is located, promotes the transition of the RNA to the frayed state," said Xuhui Huang, associate professor of chemistry at HKUST and leader of the research group. "We further validated the predictions from our simulations by site-directed mutagenesis experiments and transcript cleavage assays in collaboration with Prof. Dong Wang's group from UCSD."

"This work requires large-scale high-performance computing resources: the ~50 microseconds of simulations contain 25 billion MD steps, which were computed on 10,000 CPU cores for 4 weeks on the Shaheen Supercomputer at KAUST (ranked #7 worldwide in July 2015). Moreover, the Markov State Model enables us to identify intermediate conformational states and the kinetics of backtracking from this massive MD simulation dataset," said Huang.
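
The figures in this quote can be put into perspective with a little arithmetic. The short sketch below uses only the numbers quoted above; the one assumption, flagged in the code, is that all 10,000 cores ran continuously for the full four weeks, so the core-hour figure is an upper bound rather than a reported value.

```python
# Figures quoted in the article (the simulation length is approximate).
total_simulation_fs = 50 * 10**9   # ~50 microseconds, expressed in femtoseconds
md_steps = 25 * 10**9              # 25 billion MD steps
cores = 10_000
weeks = 4

# Simulated time covered by a single MD step.
timestep_fs = total_simulation_fs / md_steps

# Assumption: every core was busy around the clock for all four weeks.
core_hours = cores * weeks * 7 * 24

print(f"implied time step: {timestep_fs} fs")
print(f"upper-bound compute: {core_hours:,} core-hours")
```

The implied step of about 2 femtoseconds is a common choice in molecular dynamics, and the upper bound works out to roughly 6.7 million core-hours.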

"Our results will provide insight into the fundamental mechanisms of transcription, and open up perspectives for understanding human diseases and aging problems related to transcription infidelity," said Huang. "We could also go beyond the one-nucleotide backtracking event and study the backtracking mechanisms for two or even more RNA 3′-end nucleotides, which may give insight into the molecular mechanisms of transcriptional pausing and arrest. It would also be important to investigate the backtracking mechanisms for different mutagenic DNA lesions, such as 8-oxoG and O6-methylguanine (O6-mdG), and how some of these lesions can escape Pol II proofreading."

source: Hong Kong University of Science and Technology