N
The Daily Insight

Why does the rate of errors increase by the end of Illumina reads?

Author

Andrew Mclaughlin

Updated on February 26, 2026

Previous studies have shown that Illumina errors are not random and that biases are likely to be related to sequence context. A general increase of errors towards the end of the reads has been observed as well as a strand bias [5–9].

What is error rate in sequencing?

One of the most widely used sequencing techniques is sequencing-by-synthesis. The average error rate of this approach is reported to be 0.1% per nucleotide, most of which are single nucleotide substitutions2.

Why is long read sequencing less accurate?

There are two key types of accuracy in DNA sequencing technologies: read accuracy and consensus accuracy. However, because of their read length, short reads cannot always map to the long reads unambiguously, limiting their ability to improve accuracy.

What is the error rate of Illumina sequencing?

Distribution of error rates for each platform

..Error rate (%)
PlatformNumber of samplesMedian
MiniSeq400.613
NextSeq 5001600.429
NextSeq 5501710.593

Is PacBio more accurate than Illumina?

PacBio reads typically have a really high error rate (~15% compared with ~0.1% for Illumina.) However, their errors tend to be random, so if the same region is sequenced several times, the errors average out resulting in a “consensus” sequence.

What are error rates?

the frequency with which errors are made. Examples include the proportion of an experimenter’s data recordings that are wrong or the number of Type I errors that occur during significance testing.

Is Long-Read Sequencing better than short read sequencing?

The predominant difference between LRS and the conventional SR-NGS approaches is the significant increase in read length. In contrast to short reads (150–300 bp), LRS has the capacity to sequence on average over 10 kb in one single read, thereby requiring less reads to cover the same gene (illustrated in top panel).

Is Illumina or nanopore more accurate?

What is the Difference Between Nanopore and Illumina Sequencing? Nanopore sequencing is a third-generation sequencing technique that uses a nanopore to detect the sequence of DNA molecules. Moreover, nanopore sequencing has 92-97% accuracy, while illumina sequencing has 99% accuracy.

Why is PacBio better than Illumina?

PacBio provides longer read length than Illumina’s short-length reads. Longer reads offer better opportunity for genome assembly, structural variant calling. It is not worse than short reads for calling SNP/indels, quantifying transcripts. Sounds like PacBio can do whatever Illumina platform can offer.

They may also lead to increased false-positive variant calls, resulting in inaccurate conclusions. As shown below, a quality score of 20 represents an error rate of 1 in 100, with a corresponding call accuracy of 99%. Illumina technology enables massively parallel sequencing with optimized SBS chemistry.

How accurate is Illumina NGS sequencing chemistry?

Illumina sequencing chemistry delivers high accuracy, with a vast majority of bases scoring Q30 and above. This level of accuracy is ideal for a range of sequencing applications, including clinical research. Learn why PhiX can be used as an in-run control for run quality monitoring in Illumina NGS.

How accurate is 100 bp sequencing?

every 100 bp sequencing read will likely contain an error. When se-quencing quality reaches Q30, virtually all of the reads will be perfect, having zero errors and ambiguities. This is why Q30 is considered a benchmark for quality in next-generation sequencing. By comparison, Sanger sequencing systems generally produce base call accuracy of

What is the impact of quality scores on sequence data?

Sequencing data with lower quality scores can result in a significant por- tion of the reads being unusable, resulting in wasted time and expense. PhiX quality scores for the MiSeq®and HiSeq systems show that nearly all bases have scores > Q30 for single and paired-end reads (Figure 2).