Illumina, PacBio, and Oxford Nanopore Technologies (ONT) are now among the most widely used next-generation sequencing (NGS) platforms in the world. Each one has their own strengths and limitations, and which one to use depends on the needs of your project. Below, I share some general comparisons that may help you decide.
Cost In general, Illumina remains the most cost-effective platform for short-read sequencing. Per Gb cost can often be below $50 now, especially on NovaSeq or NextSeq series. PacBio and ONT used to be more expensive, with older platforms costing around $1000-2000 per Gb. But things are changing. PacBio's newer Revio system and ONT's PromethION flowcells now allow higher throughput, which brings the cost down, especially for whole-genome or large transcriptome projects. Still, total cost also depends on experimental design, read depth, and coverage requirement.
Error Rates Illumina has long been known for low error rate, around 0.1-0.5%. This is enough for most variant calling and gene expression studies. For long-read platforms, error rate is higher - traditionally 10-15% for both PacBio and ONT. But newer PacBio CCS (circular consensus sequencing) protocol produces so-called HiFi reads with accuracy close to Illumina. It works by sequencing the same molecule multiple times and building a consensus, but it does increase cost and reduce total output.
ONT has also made some progress: Q20+ chemistry and duplex reads help reduce error rate - but still not as accurate as HiFi or Illumina. Some tools now can polish ONT reads with help of reference or short reads, but this adds complexity.
Read Lengths This is where PacBio and ONT shine. Illumina normally produces 2x150bp or 2x300bp reads, which is enough for most gene-level analysis but limited for structural variants or assembly. PacBio can routinely generate 15-25kb HiFi reads. ONT can produce even longer reads - 100kb or longer - with ultra-long protocols. In some special projects (e.g. telomere-to-telomere or repeat-rich regions), ONT long reads have clear advantage.
Applications Illumina has been the standard for most classical applications like whole exome sequencing (WES), RNA-seq, ChIP-seq, and Hi-C. Many existing workflows and pipelines were developed based on short reads. If your study fits into these well-established frameworks, Illumina is still very suitable and reliable.
But for projects requiring full-length transcript isoforms, de novo assembly, large indel detection, or epigenetic modifications (e.g. methylation), then long-read platforms like PacBio and ONT become better choices. Also, in microbiome and metagenomics studies, long reads help to resolve species or even strain level structure, especially when reference is incomplete.
Bioinformatics Analysis Illumina data is easier to work with. Many tools are mature and stable. Common mappers like BWA or STAR, and variant callers like GATK, are all well-supported. PacBio and ONT data are more challenging at first, due to higher error and more variability.
But now, many tools like minimap2, Flye, Shasta, and NextDenovo support long-read analysis. PacBio HiFi data can be processed by tools like Hifiasm, and ONT assemblies can be polished by Medaka or Homopolish. ONT basecalling (e.g. Guppy) is still being updated frequently - sometimes too frequent, so it is a bit hard to keep stable pipeline.
One challenge for long-read bioinformatics is that pipeline design depends more on project goal. Whether to do assembly first or align to reference? Should we do error correction before variant calling? These questions don't always have one fixed answer. That's why it is important to work with people experienced with these platforms, otherwise mistakes can occur and waste time.
Other Considerations Access to sequencing core or service provider is still important. Illumina instruments are widely available - nearly every core has it. PacBio and ONT, especially high-throughput models, are less common but growing.
Also, one should consider platform maturity. Illumina is more stable - not changing much anymore. PacBio and ONT are still developing fast. Sometimes their chemistry or basecalling software changes suddenly, and users must adjust. ONT often updates nanopore proteins and models, which is exciting but not always compatible with older tools. PacBio system is also quite closed - basecaller is proprietary - so some flexibility is missing.
We also observe that long-read data can be harder to revisit after a few years if formats or tools change. So, planning ahead is useful.
About the Author: Justin Li earned his Ph.D. in Neurobiology from the University of Wisconsin–Madison and an M.S. in Computer Science from the University of Houston, following a B.S. in Biophysics. He served as an Assistant Professor at the University of Minnesota Medical School (2004–2009) and as Chief Bioinformatics Officer at LC Sciences (2009–2013) before joining AccuraScience as Lead Bioinformatician in 2013. Justin has published around 50 research papers and led the development of 12 bioinformatics databases and tools - including miRecords, siRecords, and PepCyber - while securing over $3.4M in research funding between 2004 and 2009 as PI, co-PI, or co-I. He has worked on NGS data analysis since 2007, with broad expertise in genome assembly, RNA-seq, scRNA-seq, scATAC-seq, Multiome, ChIP-seq and epigenomics, metagenomics, and long-read technologies. His recent work includes machine learning applications in genomics, AlphaFold modeling, structural bioinformatics, spatial transcriptomics, immune repertoire analysis, and multi-omics integration. More at https://www.accurascience.com/our_team.html.
Need assistance in your Illumina, PacBio or Oxford Nanopore data analysis project? Send us an inquiry, chat with us online, or reach us in other ways!