Ribonucleotides are important macromolecules in organisms, as well as carriers of genetic information of cells, some viruses, and viroids. RNA can be divided into many types according to its function, mainly including the following types: mRNA, tRNA, rRNA, miRNA, snRNA, etc. For RNA with DNA coding, the work in the cell can be completed by the production of the protein it carries. Under the premise that the actual genetic sequence does not change, the change of gene expression will be affected by the chemical modification of RNA. This kind of epigenetic modification affects the biological processes of many organisms. In 1951, in-depth research was carried out in the field of RNA modification. With the continuous discovery of RNA modification research, RNA plays an important role in regulating the body. RNA modification plays an important role in the occurrence and development of life activities and diseases. Many RNAs in nature have undergone post-transcriptional modifications. These post-transcriptional modifications play an important role in the structure, function, and metabolism of RNA. So far, more than 180 different types of RNA modifications have been identified. These RNA modifications exist in the RNA of most cells in the three major areas of life (see Figure 1). Generally, the use of RNA modification is mainly used to expand the chemical and topological properties of the four basic nucleotides, thereby affecting the biogenesis, stability, kinetics and function of the RNA/RNP carrying them.
Figure 1. Distribution of RNA methylation in different organisms. (DOI: 10.1002/wrna.79)
Types of RNA modification
There are many types of RNA modifications, and the content and functions of different types of RNA modifications are also very different. Among them, methylation modification is one of the main forms of RNA modification, accounting for about two-thirds of the total amount of RNA modification. RNA methylation modification mainly occurs at special positions such as the nitrogen atom and carbon atom on the base group, and the oxygen atom of ribose 2'OH. RNA methylation modifications that are ubiquitous in archaea, prokaryotic bacteria and eukaryotes include m6A, m6,6A, m1A, m5C, 3-methylcytosine (m3C), 7-methylguanine (m7G), Cm, Gm, m5U and Um, etc. In addition to methylation modification, there are also some non-methylation modifications on RNA, such as acetylation, methoxy modification, pseudouracil (ψ) and hypoxanthine (inosine, I) (see Figure 2). In addition to these modifications that exist in nature, there are currently many types of chemical modifications introduced into the chemical synthesis of RNA. They can be roughly divided into: base modification, sugar ring modification, main chain modification, architectural approach, end protrusion or modification, modification of the conjugate and so on.
Figure 2. Common types of RNA modifications
- m6A
- m1A
- m5C
- m7G
- ψ
- I
m6A modification is commonly found in the RNA of eukaryotes and is the most common type of modification in RNA modification. The first discovery of m6A modification was in 1974. The 6th nitrogen atom of RNA molecule adenine has undergone m6A methylation modification. The m6A modification has been proven to be reversible, and this process involves the participation of methyltransferase, demethylase, and methylation reading protein.
Unlike m6A, m1A modification occurs on the first N atom of the adenine base group, and it has a positive charge under physiological conditions. m1A was first found in non-coding RNAs such as rRNA and tRNA, and it is widely present in prokaryotic and various eukaryotic mRNAs.
The discovery of m5C modification in mRNA also has a history of more than 40 years. The distribution and function of m5C may be species and tissue cell specific. The role and metabolic pathways of m5C in eukaryotic mRNA are not fully understood. Researchers have found its derivatives hm5C and fm5C in the RNA of Drosophila and mammalian cells and brain tissue, indicating that m5C is a highly dynamic and modified form with potential regulatory function.
Methylation modification in the 5'cap structure of mRNA m7G (5')ppp (5') N is the most common modification in eukaryotic mRNA. The m7G cap structure plays a variety of functions in cells. It can not only bind the translation initiation factor EIF4F to promote mRNA translation, but also prevent mRNA from being degraded by intracellular exonuclease. The m7G modification of the cap structure is not found in prokaryotic mRNA, but it is present in some viral RNAs.
ψ is one of the most abundant forms of post-transcriptional modification in RNA. It is widely present in cellular RNA and is highly conserved among species. Since ψ is formed by the sequence-specific isomerization of uracil (U), this modification is abundant in tRNA and rRNA. ψ can change the characteristics of base complementary pairing so that it not only affects the structure of rRNA, but also affects mRNA coding, indicating that it may be a potential regulatory element involved in post-transcriptional regulation.
Adenine-hypoxanthine (A-I) is the most common form of RNA editing in mammalian mRNA, and it mainly occurs in the double-stranded structure region of RNA. Adenosine deaminase (ADAR) can deaminate the adenosine on double-stranded RNA to produce I. Since I is paired with C during base pairing, the nucleotide A to G is converted, which may lead to a series of functional changes.
The role of RNA modification
In vitro, various types of chemical modification of RNA can improve the thermodynamics and nuclease stability of RNA, increase their half-life in the body, improve biodistribution and pharmacokinetic properties, improve binding affinity to the target, improve specificity, and reduce off-target effects and other effects. In organisms, there are many types of RNA modifications of mRNA, which exist in various regions of the transcript. For example, m6A exists in 5'UTR, CDS (coding region), 3'UTR and introns of precursor mRNA, and some non-coding RNA and miRNA. These modifications can be used to identify the start position of the transcript. In addition, in the normal RNA processing process, the cap structure added to the 5'end of the mRNA through the capping process has a very important regulatory effect on gene expression, transcript stability, etc., and is the translation of eukaryotic mRNA into protein and a key stabilizing factor. In short, the 5'end methylation modification of these mRNAs is very important for mRNA stability, translation initiation and gene expression. In addition to being vital to the cell's own translation process, tRNA methylation modification can also perform a variety of other functions. It serves as a primer for retroviral reverse transcriptase and participates in processes such as DNA damage repair, stress response, and gene transcription. rRNA methylation modification has potential regulatory effects on mRNA translation, mitochondrial transcription, and bacterial drug resistance mechanisms. The methylation modification on other small RNAs increases the stability of the RNA and also participates in the RNA repair process. ψ is one of the most abundant forms of post-transcriptional modification in RNA. It is widely present in cellular RNA and is highly conserved among species. This modification is abundant in tRNA and rRNA. ψ can change the characteristics of base complementary pairing so that it not only affects the structure of rRNA, but also affects mRNA coding, indicating that it may be a potential regulatory element involved in post-transcriptional regulation.