Circular RNA: functions, applications and prospects

Following improvement and integration of novel genome sequencing techniques, a new stage light has been shone upon circular RNAs (circRNA) in regard to their structure and functionality. In comparison to their linear counterparts, circRNA possess a multitude of remarkable functions such as microRNA (miRNA) sponging, RNAbinding protein (RBP) regulation and translational capabilities whose research have recently gained traction. Comprehensive computational modelling and analysis have revealed the relationship between the sequence composition of circRNAs and their biogenesis and structural formation as well as spatial identification. Due to fluctuations in expressional activity in tissue-specific environments and stimulations by tumour cells, circRNAs have sparked considerable interest in being employed as plausible biomarkers in disease control and treatment as consequence of their impressive specificity and biocompatibility. Additionally, an increasing number of studies have proposed them as viable solutions to be just as competent as presently used disease markers and medicine, if not better. In this review, I briefly summarized the characteristics, biogenesis and function of circRNA and introduced the potential applications and prospects of circRNA. With vigorous research being carried out regarding their still unclear diversified roles and precise molecular structure, circRNAs are bound to become the new revolutionary perspective on cellular regulation, protein signalling and disease pathogenesis.


Introduction
Circular RNA (circRNA) was discovered in RNA viruses as viroids in the mid 70s, initially hypothesised to be an endogenous RNA splicing error [1]. Thanks to advancement in computational analysis and RNA sequencing techniques in the same decade, these misunderstood circular structures have finally been recognized correctly and deeply both in structure and functionality [2]. At its core, circRNA is a single stranded RNA, but it differs from the far better known linear RNA in that it continuously closed in on itself by covalently joining its 5′ and 3′ ends [3], thus presenting some fascinating properties which are not fully explored: protein complex scaffolding, parental gene modulation, RNA-protein interactions and microRNA (miRNA) sponge, just to name a few [4,5]. They are now considered to provide essential regulatory function for plants and animals alike [6]. An increasing number of study groups have shown and verified to an extent the level of effectiveness and efficacy displayed in circular RNAs that is typically required in viable medical treatments and other biotechnological applications. For instance, cases of traditional biomarkers being vastly outperformed by proposed cir-cRNA substitutes are being reported often. Backed up by growing support and evidence about the promosing capabilities of circRNAs, more investigation and interest ought to be brought out as such, not merely from a basic comprehensive biological understanding of its structures and mechanisms, but also on a systematic level of their interactions with surrounding molecules and environments. Applicably speaking, circRNAs are on par in terms of its potential and viability to target cancer and other malignant diseases with other novel treatments such as personalised medicine and stem cell therapies.

CircRNA characteristics
CircRNAs generally consist of 1-5 exons and the introns flanking the exons are up to 3 times as long as their linear counterpart. Closer analysis has revealed the presence of many complementary inverter Alu repeats in the intron segments [7], leading some to speculate this particular arrangement actually facilitates the splice sites to easily locate each other and promote circularisation. Since they are closely-looped structures, there are effectively no 5′ and 3′ end structures such as poly-A tails and 5′ caps in circRNA, rendering them immune to exonuclease cleavage [8][9][10]. Empirically, they last 2.5 times longer than their linear counterparts in mammary cells, as illustrated in a study carried out by Enuka et al. [10]. Owing to these physical properties, common laboratory screening techniques such as RNase R degradationan enzyme that exclusively degrades linear RNAas well as poly-A tail testing can accurately select close looped structures over linear forms. Several research groups in recent years have shifted their focus on identifying potential circRNA isoforms, structures which are originally expressed from the same parental DNA but are slightly different from one another in its final mature form due to differentiation in specificity of spliceosomes to recognise exons and introns on the pre-mRNA strand [11]. Notable groups include Salzman, Jeck, Memczak, Guo and Zhang [7,[12][13][14][15]. As such the incredible diversity of circRNAs explained: 20, 000 different types have been identified in eukaryotes so far, a number that remains to this day open-ended [16].

CircRNA biogenesis and classification
The formation of circRNAs arise from the positioning and scrambling of exon and intron groups, which are segments that are reserved and eliminated in the final post-transcriptionally modified product respectively [17]. Normally a mature messenger RNA is formed when a protein-RNA complex named the spliceosome catalyses the cleaving-out of intron segments in a precursor-mRNA molecule, usually by recognition of specific sequences flanking the intron segment at both ends. The exon segments fuse, while the intronic segments are consequentially taken out and degraded. This conventional perception does not take into consideration the deviation and difference in potency across all splice sites [18], some of which, as a result, the spliceosome may ignore and inevitably lead to the synthesis of circRNA. Furthermore, the contribution of the spatial arrangement of the 5′ and 3′ splice site is not to be ignored, for if the former is positioned downstream of the latter, then the spliceosome favourably constructs a covalently closed circular structure over a linear exonic molecule [19]. This mechanism, commonly referred to as "Exon Scrambling" gives rise to different types of circRNAs including exonic, intronic, exon-intronic and intergenic [20]. In cancer specific cases, the internal structure is even more difficult to determine due to the expansive and thus invasive nature of malignant tumors [21]. We briefly depicted the biogenesis and functionality of cir-cRNAs in Fig. 1.

CircRNA function: microRNA sponges
Owing to the uniqueness in structures of circRNA, they do not code for proteins like the linear forms [3]. Studies have shown, with supporting empirical evidence, that certain circRNAs act as microRNA sponges and effectively obstruct their mechanism. MicroRNAs are 21-nt long non-coding RNA sequences that aid in the posttranscriptional regulation of gene expression, typically by latching themselves onto mRNAs and inhibiting its translation into protein via either competitive or noncompetitive fashion. They are classified into families by their seed regions, depending on whether they share the same sequence of nucleotides from positions 2 to 7 [8]. CircRNAs possess the complementarity to counterattach themselves onto microRNAs, recognising the seed regions of the miRNAs, and competitively deactivating them. Two circRNAs in particular, respectively CDR1as and circSRY, are the spotlight for scientific research at present. It is observed that CDR1as contains 70 conserved binding sites for miRNA-7, far significant than any other linear miRNA sponge. Its sponging capability is confirmed by Memczak et al. [22], which utilised the sequestering of CDR1as molecules against elevated expression of miR-7 in zebrafish brains to obtain supporting evidence of CDR1as inhibitory activities against the targeted miRNA by monitoring subsequent zebrafish midbrain developments. CircSRY, on the other hand, is tested in murine testes and its complementary attack on the miR-138 seed region is noticed [23]. As it contains 16 specific binding sites, a number still impressive amongst all sponge molecules, their sponge functionalisation hypothesis is confirmed [24].

CircRNA function: interaction with RBPs and protein translation
Some have found circRNA to regulate gene transcription and expression via other pathways. They may interact with RNA binding proteins (RBPs) such as circ-Foxo3 and together form a complex that impacts cell survival and proliferation by interacting with p21 and CDK2 [25]; some strengthens mRNA stability by forming duplex structures such as in the case of CDR1as. On a more controversial note, groups such as Legnini I. et al. and Pamudurti N.R. et al. discovered that certain cir-cRNAs can translate for proteins, one in murine myoblasts and one in fly heads [26,27]. Such news brings forth new hypothesis' on circRNA capabilities, conventionally thought to be non-coding [27]. Since the first discovery of proteins translated from hepatitis virus-a single-stranded circRNA, some have verified the activation of circRNA translational capability by inserting an IRES (internal ribosome entry site) upstream of the start codon [28]. Much more is needed to be done to fully understand the exact translational mechanism of these circRNAs, and why they work whilst the majority of others do not.

CircRNA application potential
On a more practical note, circRNAs are viable biomarkers for the diagnosis and treatment of diseases as they cannot be easily degraded by exonucleases owing to their closed circular structure. In some cases, circRNAs have been found to trump conventional biomarkers. For example, the upregulation of circ-PVT1 in Gastric Cancer (GC) tissues enhances sponging activity of miR-125 d ecircRNA or ElcircRNA is produced regardless of Circularisation route. In some cases, intronic segments resides in the loop, hence giving rise to ElcircRNAs as opposed to ecircRNA which contains purely exonic segments. e Functionalities of mature circRNAs include miRNA sponging, acting as a double inhibitor for some chemical reactions; Protein Translation is possible, albeit quite rare and research is being carried to understand how it differs from linear RNA translations; RBP-Protein Complex formations help regulate and moderate pathways and indirectly impact the production of other circRNAs; mRNA Interactions, be it facilitative or inhibitory and subsequently encourages GC proliferation [29]; hsa_ circ_0000190 has also attracted attention by operating just the opposite waydownregulation occurs when it comes into contact with GC, and it is tested to be more sensitive and specific than biomarkers like CEA and CA 19-9 [30]. Another example is in Hepatocellular Carcinoma (HCC), where the present biomarker in predominate use is Alpha-fetoprotein (AFP) [31]. AFP shows poor sensitivity, whereby 40% of all patients with HCC tested normal AFP levels. The constructive way to boost this sensitivity is by combination with other markers, which is not an effective solution. Alternatively, Xingchen Shang et al. has suggested the correlation between circ_ 005075 and tumour size [32], listing it as a viable prognostic biomarker that is superior both in effectiveness and potential due to their stability and specificity. This suggests that the development and invasion of HCCs are closely linked to circRNAs, albeit its full mechanism still unclear. Nevertheless, the list of feasible circRNA biomarkers applicable to cancer research is not limited to solely these two diseases. We summarized available studies concerning circRNAs involved in various human diseases which can be found in Table 1.
Additional recent studies have identified and are attempting to decode the enrichment and stability of cir-cRNAs in exosomes, a combination which could further enhance the targeting ability of circRNAs. Exosomes are extracellular vesicles whose main function is to transport various cell contents, chemicals as well as factors, thus allowing cell-to-cell interaction and response [33]. As such, a considerable number of cellular changes and tissue response are a consequence of whether its corresponding vesicle of the same compatibility has successfully reached its destination and illicited responses or transported factors [34]. Gaining an understanding of exosome mechanism can assist in deriving mediations on tumor microenvironments and intercellular networking, therefore sparking great interest in the exosome cir-cRNA recently in light of the possibility of strengthened effectiveness and targeting ability on malignant or malfunctioning cells.
The origin of circRNAs ultimately depend on corresponding miRNA levels in their donor cells, which can be both immune and nonimmune in nature. Exosome RNAs can minimise damage to DNA by accelerating the cell cycle, as is shown in a recent case of the overexpression of miR-217, resulting in the reduction of clclin-D1 and EZH2 expressions. This behaviour is believed to be linked to deregulated proliferation in the formations of neoplasmia [33,35]. Moreoever, many experimental results have concluded the direct relation between exosomes and neoplastic transformation, as well as cir-cRNA's mechanistic effect on tumor microenvironment [36]. Taking pancreatic ductal adenocarcinoma (PDAC) for example, it has been associated the the abnormal high expression of the exosome circ-PED8A [37]; the exosome circNRIP1 promotes metastatis in Gastric Cancer (GC) by sponging miR-149-5p, in another study. Perhaps most significant is the role exosome circPTGR1 has on the development of Hepatocellular Carcinoma (HCC) [38,39], whereby upregulating the exosome cir-cRNA encouraged tumor invasion. Owing to these highly-correlational findings, exosome circRNAs are proposed as diagnostic indicators for their corresponding malignant tumors based on how respondent they are in changing expression levels and their excellent stability, coupled with its innate targeted delivery mechanism [40]. As of present, over 1000 circRNAs has been identified in exosomes throughout the human body, with more research being carried out on discovering extra exosome-circRNA-cancer combinations.

CircRNA challenges and prospects
Despite increasing research have been carried out in parallel with the rise of circRNA popularity, the biological function of most circRNAs still remains a mystery. For instance, it is observed that the majority of circRNAs patrol in the cytoplasm, but they originate from the nucleus of the cell, so the question of how they fit through the tiny nuclear pore is raised. Furthermore, the fact that many of the circularised exons (85%) overlap with protein-coding sequences but the majority of circRNAs do not code for proteins remains to be investigated. On a more clinical note, they require further testing to be able to completely replace traditional diagnostic procedures. Concerns such as trauma-causing patient tissue extraction and expensive circRNA detection in tissue remains to be addressed alongside obtaining a full comprehensive understanding regarding their secondary structures and differing roles amongst themselves. Failure to properly administer suitable circRNA biomarkers in patients can obfuscate clinical results which must be overcome through gaining a better picture of the generation, localisation and degradation of proposed circRNAs.
Nevertheless, circRNAs are still attractive options for the development of a range of biological therapeutic tools. There are already reports of both in vitro and in vivo RNA construction utilising group I permuted intron-exon (PIE) sequences for the complementary targeting of spliceosomal-backsplice-site flanking sequences, and this mechanism can possibly be extrapolated to include any sequence or protein of known structure, if we so desire. As a side note, there is much room for improvement in broadening the diversity of circRNA's diagnostic possibilities. In one example, present molecular analysis of blood is retained at analysing cell-free fragments of genomic-DNA; A great future prospect would be to consider the sampling of disease-specific extra-cellular vesicles to monitor disease onset and progression with more detail. These ideas lay the foundation for further suggestions of selective protein regulation and programmed cell signalling. As demonstrated time and time again in ongoing experiments, circRNAs have confidently displayed their sponging and biomarking potential, which should urge us on into unlocking the secrets of the long-misunderstood circRNAs.

Conclusion
RNAs have always attracted sufficient attention in scientific research, and it is no surprise that much research has been vigorously carried out when a circular form of RNA had been discovered, distinguishing itself from its well-known linear counterpart. Despite both of them identifying as RNAs, the circularity of circRNAs nonintuitively introduces vastly different functionalities that the more famous linear RNAs do not normally possess: miRNA sponging, RBP-Protein Complex formations and mRNA interactions, and it is safe to assume that this is just the tip of the iceberg. Many mysteries regarding circular RNAs, either in their precise formation mechanisms, or transport and maturation processes await answers. Perhaps most fascinatingly, why few, out of numerous types of circRNAs, are able to code for proteins. This recent discovery inevitably brings forth possibilities in potential circRNA modifications and the plausible unlocking of new functions that were never meticulously considered: protein translation by circRNAs on a scale comparable to linear RNA translation, for example, which could alter and invigorate our perception of how protein is translated. Exosomal transportation and enhancement to expand the target list of circRNAs is another option worth putting thought into. Many research groups have utilised the independent properties of circRNAs in empirical experimentations and clinical trials, with many reporting the effective utilisation of circRNAs as miRNA sponges and others in potential biomarkers. Owing to their natural biocompatibility and molecular specificity, circular RNAs are proposed to trump traditional markers used in cancers by their superior effectiveness and targeting abilities, making them a highly promising alternative treatment when compared to chemotherapy, radiotherapy, immunotherapy and surgery.