How Reliable Is microRNA Expression Data? Limitations of Analysis and Considerations in the Age of AI | Revisiting 2019 Insights in 2026

What is the difference between miRNA-Seq vs RNA-Seq?
The most important difference is reproducibility . This difference is most clearly reflected in the consistency of results across datasets.

Technologies for measuring gene expression levels had already reached a mature stage by around 2004. Therefore, data generated by skilled experimentalists can generally be considered reliable. In contrast, measuring microRNA expression remains challenging even today. It is important to recognize that miRNA expression data are not as reliable as gene expression data.

Here, we compare gene expression and miRNA expression data across multiple datasets of hepatocellular carcinoma (HCC).

RNA-Seq shows high reproducibility: consistent results even across different technologies such as microarrays

The heatmap below compares two datasets: TCGA-LIHC and GSE14520. Gene expression data in TCGA were generated by RNA-Seq, while GSE14520 was measured using two types of Affymetrix GeneChip arrays (HG-U133A 2.0 and HT_HG-U133A), resulting in three datasets in total. All datasets were reprocessed from raw data and transformed into log2 ratios (Tumor vs Normal). Red indicates upregulation in tumors, while blue indicates downregulation.

Despite differences in researchers and experimental platforms, the patterns of upregulated and downregulated genes are largely consistent across these datasets. This indicates that RNA-Seq-based gene expression data are highly reliable.

This consistency across datasets is supported by the high reproducibility observed within each dataset.

GX-comparing data sets

RNA-Seq is highly reproducible within datasets

The TCGA-LIHC RNA-Seq dataset consists of 50 normal and 370 tumor samples. The GSE14520 dataset includes 18 paired samples in the HG-U133A 2.0 platform and 214 pairs in the HT-U133A platform. Although some samples may have quality issues, overall these datasets are of high quality. Within each dataset, tumor expression profiles appear largely consistent, supporting the idea that these data reflect true biological states.

GX - heatmaps of each data set

Why does miRNA-Seq show poor reproducibility?

In contrast to RNA-Seq, miRNA-Seq results often fail to match across independent datasets. This fundamental difficulty arises from intrinsic properties of miRNAs.

Three major factors that reduce reproducibility in miRNA-Seq

Short sequences with high similarity

miRNAs are very short (~22 nucleotides), and many family members differ by only a few bases. This leads to cross-hybridization in microarrays and ambiguity in sequence mapping.

Library preparation bias

Bias introduced during library preparation—especially adapter ligation efficiency and extraction methods—has a much stronger impact than in mRNA sequencing.

Limited number of molecules

The number of miRNAs is much smaller (only a few thousand) compared to genes. As a result, normalization assumptions commonly used in RNA-Seq—such as “most genes do not change”—often break down.
Even with computational correction, uncertainty remains. In many cases, external spike-in controls are required for accurate normalization.

miRNA-Seq shows low reproducibility across datasets: inconsistent results

Let us now examine actual miRNA expression datasets. Unlike gene expression data, results across datasets show little agreement.
These observations indicate that miRNA expression data are inherently difficult to interpret.

miRNA - comparing data sets

miRNA-Seq can also show low reproducibility within datasets

The largest dataset examined is the TCGA-LIHC miRNA-Seq dataset. Within this dataset, tumor samples appear relatively consistent, suggesting that the experiment itself was successful.

However, when compared with other datasets, the results do not match at all. The lack of agreement across datasets is striking. Although some discrepancies may arise from platform differences and data integration issues, the degree of inconsistency observed here is far beyond what can be explained by such factors alone.

miRNA - TCGA

GSE110217 on Agilent のHuman miRNA v16 microarray

The signal intensities in the latter half of the replicates (5-8) were markedly lower than the first half (1-4), likely due to batch effect.

Even after excluding low-quality samples, the miRNAs identified as up- or down-regulated in these datasets show almost no overlap with the TCGA results. The discrepancy is too vast to be explained simply by platform differences or gene mapping inconsistencies.

miRNA - Agilent

GSE74618 on Affymetrix miRNA v2 Array,

miRNA - Affymetrix v2

GSE115016 on Affymetrix miRNA v4 Array,

miRNA - Affymetrix v4

GSE10694 on CapitalBio Mammalian miRNA Array,

miRNA - CapitalBio

and GSE28854 on Milteny Biotec miRXplore miRNA Microarray.

miRNA - Milteny

You see that the latter one is more noisier and less concordant among HCC samples. I don't intend to judge which platform is better of worse, because it can be due to technology or experimenters' skill or other factors we don't know. My point is that miRNA expression data is far less reliable than gene expression's. Don't you think that it is very hard to say which miRNA is really up- or down-regulated in HCC?

Comprehensive miRNA measurement is an evolving technology that still faces unresolved hurdles. When using miRNA datasets or relying on "up-regulated miRNA lists" from published papers, extreme caution is required.

If you are planning a miRNA experiment, remember that it demands even higher technical proficiency than gene expression studies.

Conclusion: miRNA-Seq vs RNA-Seq

Aspect	RNA-Seq (mRNA)	miRNA-Seq
Reproducibility	High	Low
Consistency across datasets	High	Low
Measurement stability	Stable	Highly biased
Assumptions for analysis	Valid	Often violated

Download the data for your Subio Platform

If you want to look closer these data sets with Subio Platform by yourself, download the SOA file which works like a bundle of SSA files.

Open "Import Archive..." under Platform menu, and select the SOA file. Subio Platform automatically shuts down when it completes importing. Please restart the software to see the all data sets.

[2026 Update] Never Ask AI, "Which miRNAs are Up-regulated in X?"

While AI-driven analysis is becoming more common in 2026, it is dangerous to blindly accept the "answers" provided by AI when the quality of the underlying data varies so drastically.

For instance, if you ask an AI, "Which miRNAs are up-regulated in HCC (Hepatocellular Carcinoma)?", it will confidently present a list extracted from past publications. However, the technical reality demonstrated in this article is that results vary significantly from paper to paper, often incorporating data of questionable reliability.

Furthermore, AI models can extract patterns even from noisy data, sometimes identifying features that may not reflect true biological signals. Ultimately, the responsibility for determining whether these results are biologically valid lies not with the AI, but with the analyst.

It is also important to note that the same caution applies to databases such as miRmine and miRNAMap, which aggregate expression levels across various tissues. While these databases can be useful, they often integrate data generated under different experimental conditions, making it difficult to distinguish biological differences from technical variation.

For omics data analysis, it is essential to first examine the data directly, rather than relying solely on AI-generated answers.

Support

Help - Theory & Case Study

How Reliable Is microRNA Expression Data? Limitations of Analysis and Considerations in the Age of AI | Revisiting 2019 Insights in 2026