• DRAGEN
  • Product updates
  • 07/11/2023

DRAGEN™ 4.2: Enhanced machine learning, new targeted callers, and more

Author:
  • Headshot Varvara
    Victor Varvara
DRAGEN v4.2

We’re celebrating 5 years of DRAGEN at Illumina with DRAGEN version 4.2! Version 4.1 enabled high throughput DRAGEN workflows on NovaSeq X, now v4.2 continues to push the boundaries of accuracy and comprehensiveness for secondary analysis on cloud and on-premises. Access the latest software and learn more about new features on our support page.

­May 2023 marked 5 years of DRAGEN at Illumina and over 25 years of Illumina innovation. We are thrilled to celebrate these anniversaries by releasing our most accurate, most comprehensive version of DRAGEN secondary analysis ever. DRAGEN v4.2 continues to expand its award-winning accuracy1,2 with major Germline accuracy improvements driven by updated references and improved machine learning. Additionally, this version introduces a lineup of exciting new features. In this blog, we will delve into significant small CNV accuracy improvements, expanded caller offerings, greater gVCF Genotyper efficiency, and much more.

As DRAGEN continues to serve and support comprehensive application areas, here is a quick preview of the updated key features in DRAGEN v4.2 before we take a closer look at a few major improvements.

Updated Key Features in DRAGEN v4.2 by Application Area

Germline Accuracy Improvements

We’ve made multiple updates to the reference genomes to drive significant accuracy gains.

  • New, more diverse population haplotypes integrated with the DRAGEN Multigenome (graph) reduce non-European small variant calling errors by 15-17%.
  • With the updated hg38 reference and Multigenome (graph) enabled, DRAGEN v4.2 reduces errors of small variant calling in Challenging, Medically Relevant Genes (CMRGs) by 25% compared to DRAGEN 4.0.
  • DRAGEN v4.2 now supports the latest Telomere-to-Telomere CHM13v2.0, the first complete gapless sequence of a human genome, that enables improved small variant calling.
DRAGEN v4.2 Reference Support

Updated DRAGEN-ML and Multigenome (Graph) enables an average SNV error reduction of 14%.

Starting from DRAGEN v4.0, the default setting includes DRAGEN-ML, which reduces SNV errors. We recommend enabling Multigenome (Graph) in your analysis to further contribute to significant error reduction.

DRAGEN v4.2 Improved Accuracy with Machine Learning and Multigenome (Graph)

Significant Accuracy Improvements for Small CNV

Increase Copy Number Variant (CNV) accuracy in DRAGEN v4.2 with CNV detection that leverages Structural variation (SV) information.

New joint SV and CNV caller enables higher accuracy at all event size. Most notably, gains of over 90% in recall, 25% in precision, and 41.7% in the F1 score are achieved for CNVs in the 1kbp-10Kbp range.

DRAGEN v4.2 Improved CNV Accuracy

Expanded Caller Offerings

Increase accuracy in focused variant detection with 4 new targeted callers.

Improve carrier screening insights with new targeted callers CYP21A2 and HBA. Expand cardiovascular and blood typing insights with the new targeted callers LPA and RH, respectively. Improving upon previous callers, DRAGEN v4.2 also improves CYP2D6 calling accuracy and supports SMN silent carrier calling.

Learn more about our recent callers:

Targeted Callers in DRAGEN v4.2

Improve pharmacogenomic insights into optimal genotype and corresponding metabolism status with 5 new genes in STAR allele caller.

Genes in STAR Allele Caller for DRAGEN v4.2

A More Efficient gVCF Genotyper

Produce more insights, in half the time with the improved VCF Genotyper.

Significant run time improvements enable larger cohort analysis and reduced cost with comprehensive efficiency improvements of more than 2x the speed of the gVCF to msVCF workflow compared to DRAGEN 4.0. With improved msVCF configurability, filter out unnecessary variants while highlighting valuable insights with new outputs like inbreeding coefficient, Hardy-Weinberg equilibrium, and Excess Heterozygosity. Learn more about how gVCF Genotyper improvements are enabling genotyping variants at a population-scale in our recent article.

New DRAGEN Pipelines for Infectious Disease

Beta feature: DRAGEN Secondary Analysis pipelines for Urinary Pathogen ID/AMR Panel (UPIP) and Respiratory ID/AMR Panel (RPIP).

DRAGEN v4.2 enables bioinformaticians to explore Explify UPIP and RPIP pipelines on server with a command line interface. The UPIP/RPIP pipeline implements a precision metagenomics method using a k-mer classification tool. This is offered as DRAGEN's first metagenomics standalone tool.

UPIP panel and pipeline offers comprehensive detection and quantification of >170 pathogens, including fastidious, slow-growing, and anaerobic uropathogens and sexually transmitted microorganisms.

RPIP panel and pipeline offers comprehensive detection and quantification of >280 pathogens, including SARS-CoV-2, Influenza, RSV, and Mycobacterium and Legionella species.

For researchers interested in additional analysis and reporting functionality with an easy-to-use interface, check out the Explify UPIP and RPIP analysis applications on BaseSpace Sequence hub.

Additional Features

  • ORA Compression Runtime improvements: Run ORA compression and DRAGEN map/align in parallel for a 15-30% speed increase from sequential processing.
  • High Sensitivity Mode: 1) Ambiguous SNV calling in seg dup regions and 2) Enable low allele frequency (mosaic) calling with DRAGEN V4.2 high sensitivity mode (non-default).
  • Sex Chromosome Mosaic Variant Support
  • Structural Variance Improvements: Multigenome (graph) haplotypes guides alignments, revealing true breakpoint signatures to improve germline SV recall while maintaining high precision.
  • New Systematic Noise Files + Alu regions Blocklists: significant FP reduction in somatic TO pipelines with updated systematic noise files and optional Alu regions blocklist.
  • Fully Supported Amplicon Analysis: DRAGEN v4.2 fully supports amplicon analysis for SNV, RNA-fusion, CNV and SV.
  • Enhanced Bulk-RNA QC metrics
  • VCF Imputation: Enabled for haploid species, and chromosome X for diploid species.

Learn More

This blog highlights only five of the many key features in DRAGEN v4.2. Explore and learn more in the user guide available on our support page.

Visit the DRAGEN page to learn more about our technology .

Other Resources:

DRAGEN Support

Contact Us

  1. https://precision.fda.gov/challenges/10
  2. https://precision.fda.gov/challenges/22/results
  3. Illumina Data on File, 2023

For Research Use Only. Not for use in diagnostic procedures