Development of LncRNA Biomarkers in Extracellular Vesicle of Amniotic Fluid Associated with Antenatal Hydronephrosis


1. Introduction

Antenatal hydronephrosis (ANH) is one of the most common congenital anomalies affecting the kidney and urinary tract (CAKUT), with a series of clinical manifestations characterized by the separation of the renal pelvis collecting system or by dilatation of the calyces or ureters [1]. Pathological hydronephrosis occurs in 0.3–0.6% of newborns, and anomalies and pathological conditions after birth include ureteropelvic junction obstruction (UPJO), posterior urethral valves (PUV), ureteral obstruction, etc. [2,3]. Approximately 15–20% of children with ANH need surgery, especially when both kidneys are blocked, which leads to the rapid deterioration of renal function [4]. Therefore, early detection and intervention are important means to prevent the loss of renal function caused by ANH.
The management of ANH has been controversial for several decades. The distinction between transient hydronephrosis and UPJO remains one of the most controversial challenges [5,6]. Prenatal ultrasonography (US) and magnetic resonance imaging are the most commonly used methods for detecting and diagnosing ANH during pregnancy, and ultrasonic parameters such as amniotic fluid volume, renal cortical thinning, and fetal renal parenchyma structure, especially fetal anteroposterior renal pelvic diameter (APD/APRPD), are considered predictive indicators of postnatal renal function [7]. However, APD, the most common method for determining ANH grade, is a single measurement standard and has poor accuracy in predicting patient prognosis. There is an urgent need to develop accurate and practical biomarkers for the early diagnosis of ANH.
It is well known that urine is a promising source of biomarkers for kidney-related diseases [8,9]. Amniotic fluid (AF), which is mainly derived from fetal urine, is also an unexploited but potentially very valuable resource for assessing fetal renal status and development. In the clinic, the risk of amniocentesis is much lower than that of renal pelvis aspiration (RPA). AF is collected via amniocentesis during the second trimester, and AF cells are routinely used to detect fetal complications, such as fetal genetic abnormalities, fetal infections, and encephalopathies [10]. AF supernatant contains various potential biomarkers, including lipids, DNA, RNA, proteins, and small extracellular vesicles (sEVs) [11,12,13,14]. Moreover, an increasing number of studies have shown that sEV-derived long RNA species, including messenger RNA (mRNA), circular RNA (circRNA), and long noncoding RNA (lncRNA), have functional and clinical implications [15,16]. Xie et al. proposed that HSA-miR-300 and HSA-miR-299-5p in AF sEVs might serve as biomarkers for the diagnosis of congenital obstructive kidney disease and renal fibrosis [11]. LncRNAs in sEVs were reported to serve as diagnostic and prognostic biomarkers for various diseases [17], including kidney disease. LncRNAs, transcripts greater than 200 nucleotides in length, are expressed not only at the cellular or tissue level but also during specific phases of development [18]. They also modulate progression and drug resistance in kidney-related tumors [19]. So far, studies examining the association between sEV lncRNAs and ANH are rare.

In this study, we aimed to perform comprehensive AF-derived sEV profiling of RNA AF-derived sEVs and further elucidate the potential mechanism of ANH. We extracted sEVs from the discarded supernatant of routine amniocenteses from ultimate UPJO fetuses with severe ANH during the second trimester and performed a transcriptomic analysis of AF supernatant sEVs. The expression profiles and biological functions of both the mRNAs and known lncRNAs were investigated, and novel lncRNAs were also predicted and explored. Functional annotations of the cis-/trans-target genes and enrichment analyses of the differentially expressed lncRNAs (DE-lncRNAs) were performed to identify the significant biological functions and signaling pathways associated with embryonic kidney development. Quantitative real-time polymerase chain reaction (qRT–PCR) was used to verify the hub DE-lncRNAs, and the three upregulated lncRNAs were identified as candidate biomarkers for ANH diagnosis. This study may provide new insights into the pathogenesis of ANH and lay the foundation for the use of lncRNAs as diagnostic biomarkers in the future.

2. Materials and Methods

2.1. Patients

In total, 10 pregnant women with Grade III-IV ANH and 10 normal pregnant women in the second trimester of pregnancy were recruited from May 2009 to May 2022 at the Xiangya Hospital Obstetrics Department. To rule out fetal genetic problems, prenatal testing was compulsory for all of them. Routine amniocentesis was also conducted during mid-pregnancy. The basic information, detailed US, and postnatal diagnosis are summarized in Table 1 and Supplementary Tables S1 and S2. Sonographers specializing in obstetrics evaluated the quantitative values of APD, calyceal/ureteral dilation, and renal pelvis splitting using the Society for Fetal Urology (SFU) 5-point numerical grading system [2], and the data were collected from medical records. Postnatal diagnosis was obtained through follow-up. Informed consent was signed by all participants, and the ethics committee of Xiangya Hospital approved this study. All methods were performed in accordance with the relevant guidelines and regulations.

2.2. Sample Handling

The acquired AF samples were centrifuged at 3000× g for 15 min at 4 °C, after which the AF supernatant was retained. To minimize contamination of the platelets, the supernatant was transferred to new tubes and centrifuged at 3000× g for 15 min at 4 °C. The supernatant was filtered through a 0.22 μm filter and stored at −80 °C until isolation and identification.

2.3. Isolation of sEVs by Size-Exclusion Chromatography (SEC)

Ten-milliliter AF supernatant samples were thawed at 4 °C overnight. Then, the thawed samples were concentrated in a 50 mL 100 kDa filtration tube (1:100) and centrifuged at 4000× g for 15 min. One milliliter of condensed AF supernatant was loaded into a Sepharose-based CL-2B column (Echo9103A-10 mL; ECHO BIOTECH, Beijing China), which was prewashed with more than 20 mL of sterile PBS in advance. When no liquid was observed flowing out of the bottom of the columns after the addition of a complete sample into the columns, PBS was used to elute the sEVs and other fractions. Each 500 µL of effluent represented one fraction. A 100 kDa ultrafiltration tube was used to further purify the 4 to 7 fractions collected. We collected the enriched sEVs into a tube and centrifuged them for 15 min at 4000× g to enrich them [20].

2.4. Nanoparticle Tracking Analysis (NTA)

The vesicle-enriched suspension was stabilized at concentrations between 1 × 107/mL and 1 × 109/mL. The particles were examined by a ZetaView PMX 110 instrument (Particle Metrix, Meerbusch, Germany) equipped with a 405 nm laser, which determines the size and quantity of the particles isolated. An analysis of particle movement was conducted using NTA software via a video captured with a frame rate of 30 frames/second over a duration of 60 s (ZetaView 8.02.28) [21,22].

2.5. Transmission Electron Microscopy (TEM)

We incubated 20 µL of enriched sEVs on a copper mesh for 10 min at room temperature. After washing with sterile distilled water, the sEV-enriched fraction was stained with uranyl oxalate solution for 1 min and then allowed to dry under an incandescent lamp for 2 min. Images were observed and taken under a transmission electron microscope (JEOL-JEM1400, Tokyo, Japan) [23].

2.6. Western Blot (WB) Analysis

The proteins of the enriched sEVs were denatured in 5× sodium dodecyl sulfonate (SDS) loading buffer and subjected to Western blot analysis (10% SDS–polyacrylamide gel electrophoresis; 50 μg protein/lane). The following antibodies were used for detection: CD24 (67627-1-IG; proteintech, Wuhan, China), TSG101 (sc-13,611; Santa Cruz, CA, USA), and calnexin (10,427–2-AP; Promega, Madison, WI, USA). The results were visualized on a Tanon 4600 automatic chemiluminescence image analysis system (Tanon, Shanghai, China).

2.7. RNA Isolation and Sequencing

Total RNA was extracted and purified from sEV-enriched fractions using an miRNeasy® Mini Kit (Qiagen, cat. No. 217,004) according to the manufacturer’s instructions. Agarose gels (1.5%) were used to monitor the degradation and contamination of RNA, especially DNA. Using a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, USA), we measured the concentration and purity of the RNA. RNA integrity was analyzed using an Agilent Bioanalyzer 2100 system (Agilent Technologies, Santa Clara, CA, USA) and an RNA Nano 6000 assay kit.

In the experimental group, 6 samples, including severe ANH (n = 3) and normal (n = 3) sEV samples, were subjected to library preparation and sequencing. For mRNAs and lncRNAs, a total of 5 ng of RNA per sample was analyzed with a Ribo-ZeroTM Magnetic Kit (Epicenter, Madison, WI, USA) to remove rRNA. The sequencing libraries were generated on an Ovation RNA-Seq system (NuGEN, Redwood City, CA, USA) following the manufacturer’s instructions, and index codes were used to attribute the sequences to the samples. The RNA samples were ligated with barcodes containing unique adaptor sequences to allow pooling of the samples. The pooled DNA library was then eluted and processed with a NovaSeq 6000 platform for cluster generation and sequencing. The quality of the libraries was assessed using a Qubit fluorometer (Thermo Fisher Scientific, Waltham, MA, USA).

For each sample, no less than 65 M clean reads were generated and the Q30 percentage was no less than 87.25% for sequencing. The raw reads were filtered by fastQC. The transcriptome was assembled using StringTie based on the reads mapped to the GRCh38 human genome by using HISAT2 [24]. The assembled transcripts were annotated using the gffcompare program. StringTie (1.3.1) [25] was used to calculate FPKMs of both lncRNAs and coding genes in each sample. Differential expression analysis of the two groups was performed using the edgeR R package [26].

2.8. Known lncRNA Identification

The analysis of known lncRNAs was based on the high-confidence set of the lncipedia database version 5.2. The lncipedia database integrates lncRNAdb, the Broad Institute (Human Body Map lincRNAs), Ensembl (release 92), GENCODE, RefSeq, NONCODE, and several other databases and lncRNAs in the literature. The transcripts of the lncRNA genes were included in the high-confidence dataset of lncipedia version 5.2.

2.9. Quantitative Real-Time PCR (qRT–PCR) Analysis

In the validation cohort, RNA samples from ANH patients (n = 7) and normal individuals (n = 7) were reverse transcribed into complementary DNA (cDNA) using HiScript Reverse Transcriptase (Vazyme; cat #R101-01) according to the manufacturer’s instructions. The cDNA was PCR-processed using ChamQ Universal SYBR qPCR Master Mix (Vazyme; cat #Q711-02). Real-time quantitative PCR (qPCR) was performed on a Roche LightCycler 480 II instrument. The qPCR was performed with the following procedure: 37 cycles at 94 °C for 15 s and 62 °C for 7 min. The primers used in this study are listed in Supplementary Table S3. Ct values for each sample were determined and normalized to that of β-actin and miR-16-5p, respectively. The 2−ΔΔC(t) equation was used to calculate the relative expression data.

2.10. Prediction of Novel lncRNAs

The discovery of novel lncRNAs was divided into two stages: basic screening and potential coding ability screening. First, transcripts with a fragment per kilobase of exon model per million mapped fragments (FPKM) ≥ 0.1, a transcript length ≥ 200 bp, and a total number of exons ≥ 2 were selected. Following basic screening, we obtained transcript sequence information, and after removing transcripts with potential coding ability, the remaining novel predicted lncRNAs were identified. Since lncRNAs do not encode proteins, the transcripts were excluded because of the coding potential of the candidate lncRNAs. The most widely used coding potential analysis methods were also combined for further screening, mainly including Coding Potential Calculator (CPC), Coding Non-Coding Index (CNCI), Coding Potential Assessment Tool (CPAT), and Pfam protein structural domain analysis. CPC was used primarily to compare transcripts with known protein databases. Coding-noncoding transcripts were distinguished by adjacent nucleotide triplet features to predict noncoding transcripts (score < 0). CPAT analysis determined the coding and noncoding ability of transcripts by building logistic regression models and calculating Fickett and hexamer scores based on ORF length and ORF coverage. The Pfam database provides a comprehensive classification system for protein structural domain annotation; transcripts with a specific protein structural domain that was thought to have coding ability were excluded.

2.11. Prediction of Novel lncRNA Target Genes

Based on the mode of interaction of the lncRNAs and their target genes, two prediction methods were used. Cis-target genes are involved since lncRNAs regulate the expression of neighboring genes, which is mainly predicted based on the spatial relationship between lncRNAs and genes, and neighboring genes within a 100 kb region around the genomic location of these lncRNAs are potential target genes. The trans-target genes and the interactions between lncRNAs and mRNAs were determined based on their complementary pairing. LncTar was used for calculating the free energy of the pairing and normalized site of the complementary sequence between lncRNAs and mRNAs, and those below the normalized free energy threshold were considered lncRNA target genes.

2.12. Differential Expression Analysis of Novel lncRNAs

StringTie software was used to normalize the expression of novel lncRNAs. StringTie software simultaneously assembled and quantified the transcript sequence, yielding more quantitative and accurate results. The FPKM value was also used as the output to measure the expression levels of the novel lncRNAs.

2.13. Statistical Analysis

We used plyr and reshape2 to sort and restructure the sequence data. The plots were generated using corrplot (0.92), ggbiplot (0.55), ggvenn (0.1.9), ggplot2 (3.3.6), pheatmap (1.0.12), circlize (0.4.15), or ComplexHeatmap (1.10.2). Statistical tests were performed using R 3.5.1 (http://www.r-project.org, accessed on 8 February 2023); p < 0.05 and |log2FC| > 1 indicate differentially expressed RNAs. Wilcoxon rank-sum tests were used for group comparisons. The qPCR data were analyzed using GraphPad Prism 8.0. The unpaired t test was used for comparisons of the relative expression levels of candidate lncRNAs via qRT–PCR. The data in all the figures are expressed as the mean ± standard deviation (SD). A significance level of p < 0.05 was used. The q-value is the adjusted p value found using the optimized false discovery rate (FDR) method.

4. Discussion

ANH represents a wide range of urinary diseases, but not all instances of ANH indicate an underlying pathology. Transient hydronephrosis does not correlate with postpartum pathology during pregnancy. However, stable, persistent, or worsening ANH varies and needs further investigation. Exploring biomarkers aims not only to complement the first-line US but also to confirm pathological ANH at an earlier time. In this study, children with a final diagnosis of UPJO were recruited. Two of ten labor induction procedures were used in late pregnancy because of the deteriorated ANH, as assessed by advanced MRI and US. The multidisciplinary team (MDT) evaluated the high risk of UPJO and the need for multiple operations after birth. We considered that transcriptional changes might vary based on timing and location. We also strictly controlled the ANH grades of the recruited samples to minimize bias related to gestational age and gender. In the discovery cohort, three cases of bilateral severe ANH were selected for transcriptome sequencing to reduce within-group differences. The cluster of mRNAs and lncRNAs in PCA indicated high similarity in each group, and the differences among groups were evident. Despite the small sample size, a rigorous sample analysis and transcriptome quality control were sufficient to ensure the reliability of the results. In the validation cohort, considering the unilateral cases would more accurately reflect the amniotic fluid of the functioning kidney, the unilateral (n = 4) and bilateral (n = 3) ANH results were also compared; there was no statistical difference in the expression levels of the three DE-lncRNAs. Although the sample size was limited, this finding suggests that DE-lncRNAs could serve as promising diagnostic markers for UPJO. More samples with prenatal ANH but no obstruction after birth will be collected to assess DE-lncRNAs as a potential predictor of UPJO in the future. We also plan to cooperate with pediatric nephrology for urine collection from UPJO. Urine sEVs-derived lncRNAs may be sequenced, or the DE-lncRNAs would be tested again postnatally to explore the possibility of sEVs-derived lncRNAs as risk factors for CKD.

AF has been used as a promising biomarker exploration for a long time. Klein et al. conducted a prospective multicenter peptidome analysis, and 98 peptides in AF, especially thymosin-β4, were identified to diagnose prenatally detected fetuses with CAKUT [30]. In clinical practice, AF cells are routinely used for fetal genetic testing in prenatal diagnosis, while the leftover supernatant is usually discarded. Therefore, it is beneficial to utilize the overlooked AF supernatant. This method allows for sample collection from the expectant mother without additional invasive procedures and considers the AF supernatant, mainly derived from fetal urine, as an excellent candidate for studying ANH. During the identification of sEVs, we purposely selected CD24 as one of markers. Keller et al. demonstrated that CD24 is secreted from exosomes into the urine and AF. Moreover, CD24 EVs found in the amniotic fluid of pregnant women originate from the fetal kidneys [28]. The sEVs extracted in this study were positive for CD24. Therefore, AF-derived sEVs contain inevitable biomarkers related to renal development that can be used for the diagnosis of fetal kidney disease. In a previous study, we described the proteomic profile of AF and confirmed that upregulated Moesin in sEVs could serve as a diagnostic marker for predicting postnatal renal obstruction in ANH patients [31]. In this study, we focused on sEV-derived noncoding RNAs in ANH.
Among the known lncRNAs, six DE-lncRNAs were identified. Four upregulated lncRNAs were selected for qRT–PCR analysis, three of which exhibited significant differences between the ANH and control samples. The results not only verified the sequencing data but also explored the potential of these genes as biomarkers for ANH. In particular, ENST00000454380 confirmed its official name, LINC02863 from NCBI data; it is characterized by high expression in cells and low expression in exosomes. The biological function and mechanism of LINC02863 have seldom been reported. A high expression level of LINC02863 was also found to be a necroptosis-related lncRNA that predicts the prognosis of bladder cancer [32].
LncRNAs are widely expressed in the human body and play crucial roles in both physiological and pathological processes. The mechanism of lncRNAs regulating gene expressions is complicated. LncRNAs interact with DNA, RNA, and proteins to regulate chromatin structure, functioning, and gene transcriptional activity, while also influencing RNA splicing, stability, and translation. Additionally, lncRNAs also play a role in regulating intranuclear structure [33,34]. Pathologic ANH involves renal dysplasia or urinary tract obstruction. The obstruction leads to a series of changes including renal dilation, impaired renal function, fibrosis, and inflammatory response. Specific lncRNAs may be upregulated during these processes, and they play roles in the expression of genes related to kidney development and in regulating signaling pathways related to kidney function [35]. In the present study, the expression of LINC02863 in AF-derived sEVs was significantly increased in ANH patients. When predicting its target genes, LINC02863 was found to interact with the Paired Box 8 (PAX8) gene. PAX8 is one of nine transcription factors within the PAX gene family that may determine renal fate specification and morphogenesis [36]. PAX8 is also one of the earliest specific markers of the renal lineage and is a crucial player in kidney and urinary tract development in the urogenital system [29]. Although PAX8 is essential in early embryonic development, the study of Pax8 knockout mice showed no renal abnormalities in vivo [37,38]. However, a recent study suggested that PAX8 was essential for mesenchymal–epithelial transition (MET) in de novo nephrons during human kidney development [39]. More importantly, the renal MET downstream was initiated by PAX8 through Wnt/β-catenin signaling; it coincided with the enriched Wnt pathway by GSEA in our study. The sEV-derived LINC02863 may inhibit the target gene PAX8 through underlying mechanisms including transcription factor replacement, miRNA isolation, or mRNA instability. The inhibited PAX8 downregulates Wnt/β-catenin signaling, ultimately specifying the fate of the nephron; further mechanistic studies are needed to determine the specific mechanisms involved.
The tissue and conditional specific expression of lncRNAs suggest their potential as diagnostic biomarkers. In addition, as therapeutic targets, lncRNAs exhibit many advantages such as low potential toxicity, lack of translations, and rapid turnaround and contribute to highly effective and low-dose treatments [33]. Current therapeutic methods targeting lncRNAs include developing antibodies and small interfering RNAs (siRNAs) [40], particularly antisense oligonucleotides (ASOs) [41], aimed at inhibiting their expressions, ultimately reversing the disease process. Knocking out or modifying specific lncRNAs is also conducted to evaluate their roles in the diseases [42]. Increasingly, studies revealed that lncRNAs might be involved in the pathophysiology of kidney disorders, and new treatments that target lncRNAs to reduce kidney damage are being studied extensively [43]. However, due to the difficulty of analyses regarding RNA expression changes in fetal-related diseases, there was rare research on lncRNA treatments for fetal renal abnormalities. Our study is the first time anyone systematically analyzed global RNA expressions in ANH-derived EVs. DE-RNAs in severe ANH were probably related to fetal renal development; meanwhile, DE-lncRNAs were associated with postnatal UPJO outcomes. This study provides a foundation for further research on lncRNAs as therapeutic targets in ANH and helps clinicians as well as molecular biologists chart a clear path towards therapeutics.
Novel lncRNAs were also explored in our study. We assembled 17355 novel lncRNAs and identified four upregulated novel DE-lncRNAs that may play a role in ANH. The identification of their target genes suggested that the DE-lncRNAs were closely correlated with protein-coding genes related to mesenchymal morphogenesis and the STAT3 signaling pathway. The concerted morphogenesis of the epithelium and the development of the mesenchyme play important roles in the development of various organs. Nephrons are formed by a reciprocal transition from the mesenchyme to the epithelium in the metanephric mesenchyme [44]. It was also reported that the STAT3/mTOR pathway might be involved in the process of kidney injury [45]. The potential targets involved in these pathways are frequently linked to renal fibrosis. Among the mesenchyme morphogenesis-related genes, bone morphogenetic protein receptor type 1A (BMPR1A) and its ligand bone morphogenetic protein (BMP) play a role in renal fibrosis in chronic kidney disease [46]. Smad2 protects against TGF-beta/Smad3-mediated renal fibrosis [47]. Neuregulin-1 (NRG1) administration prevents hypertrophy and fibrosis of the cardiac and renal systems caused by angiotensin II (ANG II) infusion and endothelial NO synthase (eNOS) deficiency in a mouse model [48]. In the STAT3 signaling pathway, the inhibition of STAT3 in tubular epithelial cells prevents kidney fibrosis and nephropathy in streptozotocin (STZ)-induced diabetic mice [49]. Therefore, target genes assigned to these pathways can be crucial proteins in renal development, and sEV-derived DE-lncRNAs may be potential targets as well as biomarkers for ANH.



Source link

Ying Fu www.mdpi.com