Use of the QIAGEN GeneReader NGS system for detection of KRAS mutations, validated by the QIAGEN Therascreen PCR kit and alternative NGS platform

Background The detection of somatic mutations in primary tumors is critical for the understanding of cancer evolution and targeting therapy. Multiple technologies have been developed to enable the detection of such mutations. Next generation sequencing (NGS) is a new platform that is gradually becoming the technology of choice for genotyping cancer samples, owing to its ability to simultaneously interrogate many genomic loci at massively high efficiency and increasingly lower cost. However, multiple barriers still exist for its broader adoption in clinical research practice, such as fragmented workflow and complex bioinformatics analysis and interpretation. Methods We performed validation of the QIAGEN GeneReader NGS System using the QIAact Actionable Insights Tumor Panel, focusing on clinically meaningful mutations by using DNA extracted from formalin-fixed paraffin-embedded (FFPE) colorectal tissue with known KRAS mutations. The performance of the GeneReader was evaluated and compared to data generated from alternative technologies (PCR and pyrosequencing) as well as an alternative NGS platform. The results were further confirmed with Sanger sequencing. Results The data generated from the GeneReader achieved 100% concordance with reference technologies. Furthermore, the GeneReader workflow provides a truly integrated workflow, eliminating artifacts resulting from routine sample preparation; and providing up-to-date interpretation of test results. Conclusion The GeneReader NGS system offers an effective and efficient method to identify somatic (KRAS) cancer mutations. Electronic supplementary material The online version of this article (doi:10.1186/s12885-017-3328-z) contains supplementary material, which is available to authorized users.

Sanger sequencing has been the 'gold standard' for mutation analysis in cancer detection since the 1970s [11]. However, limited by its low sensitivity (10-20% mutant allele frequency (MAF)) and low throughput [10], Sanger sequencing is no longer sufficient for the needs of today's cancer molecular diagnostics.
The therascreen KRAS RGQ PCR kit is a real-time qPCR-based assay used to detect the most common KRAS mutations including those in codons 12 and 13. It has greatly improved sensitivity over Sanger sequencing, and has been approved by the Food and Drug Administration (FDA) [9] for colorectal cancer patient stratification.
Pyrosequencing also offers an attractive alternative to Sanger due to its fast turnaround time (TAT) and lower sensitivity threshold, even in tissues with low tumor cell content [5].
Next-generation sequencing (NGS) differs radically from the above mentioned methods. Coupled with amplicon-based targeting technology, NGS has the capability to simultaneously sequence in a massively parallel way multiple genetic loci with minimal amounts of nucleic acid input and limited time and expense [12][13][14][15]. This technology has revolutionized the speed of genetic and genomic discovery, and advanced our understanding of molecular mechanisms of diseases. In recent years, NGS has played an important role in advancing personalized healthcare and precision medicine by enabling the identification of mutations associated with therapeutic response or resistance. As more clinically significant genetic biomarkers and targeted therapies become available, the profiling of such genetic variations is becoming increasingly more critical. Several NGS platforms are already commercially available for sequencing and identification of genetic alterations associated with diseases, such as point mutations, deletions, insertions and copy number variants [16]. However, QIAGEN's GeneReader System presented here includes all upstream sample processing steps starting from nucleic acid extraction, together with an integrated downstream bioinformatics solution that enables a direct access to real-time updates from the rapidly evolving literature, and clinical knowledge and evidence.
To this end, we recently evaluated the QIAGEN GeneReader System workflow from DNA extraction and purification from FFPE tissue samples, to library preparation, sequencing and data analysis and interpretation. Herein we show that the GeneReader presents a unified workflow that provides accurate results and a simple solution for any laboratory to use in clinical research.

Methods
Sample and DNA isolation FFPE Tumor material from colorectal cancer tumors (Origene Technologies, MD, USA and Asterand Biosciences, MI, USA) was used to prepare 56 DNA samples with known KRAS mutation status, previously determined using therascreen assay (Pyrosequencing and PCR) and Sanger sequencing according to methods further described below. Tissue sections of 10 μm in thickness, ranging from 3 to 20 years of age were used for DNA extraction utilizing either: i) the QIAamp DNA FFPE Tissue Kit (QIAGEN, Hilden, Germany) or ii) the GeneRead FFPE DNA Kit (QIAGEN, Hilden, Germany) according to manufacturer's instructions. DNA concentration was determined using the Nanodrop System (Thermo Fisher Scientific, MA, USA) and Qubit dsDNA HS assay (Life Technologies, Gaithersburg, USA). The DNA was assessed using the GeneRead DNA QuantiMIZE System (QIAGEN, Hilden, Germany) which utilizes a qPCR-based approach to determine the quality of sample DNA prior to NGS. Furthermore, both NA12878 (Coriell Institute for Medical Research) (for which the Genome in the Bottle (GIAB) consortium has published a set of high confident variants [17]) and AcroMetrix (Thermo Fisher Scientific, MA, USA) samples were used as a gold standard set of variant calls.

GeneReader sample preparation and sequencing run
In total, 40 ng of DNA measured by Qubit (Thermo Fisher Scientific, MA, USA) was used as template to generate libraries for sequencing. Libraries were prepared using the QIAGEN Library Kit v2.0 and the GeneRead QIAact Actionable Insight Tumor Panel (QIAGEN, Hilden, Germany), which amplifies 330 amplicons covering 16.7 kb, containing 773 unique variant positions in 12 genes (KRAS, NRAS, KIT, BRAF, PDGFRA, ALK, EGFR, ERBB2, PIK3CA, ERBB3, ESR1 and RAF1). All steps of library preparation were performed according to the manufacturer's protocol. The libraries were then quantified using a Qubit dsDNA HS Assay Kit (Life Technologies, MA, USA) and QIAxcel (QIAGEN, Hilden, Germany). Ten individual libraries were pooled prior to emulsion PCR and bead enrichment steps that were carried out using an automated protocol on the GeneRead QIAcube (QIAGEN, Hilden, Germany) using the GeneRead Clonal Amp Q Kit (QIAGEN, Hilden, Germany), according to the manufacturer's protocol. Following bead enrichment, the pooled libraries were sequenced using the GeneReader platform (QIAGEN, Hilden, Germany).

GeneReader data processing
QIAGEN Clinical Insight (QCI™) Analyze software (QIAGEN, Hilden, Germany) was used to QC, align the read data to the hg19 reference genome sequence, call sequence variants, and generate an interactive report for visualization of the sequencing results, as well as a summary of the data. QCI Analyze software reports a set of high-and low-confidence variants based on the coverage of variant positions. Users have an option to analytically confirm if a variant listed should be valid or invalid before uploading to QCI Interpret software for the clinical interpretation. For each sample the report was used to assess the quality of the overall sequencing run and to identify/call the individual variants. After review, variants confirmed as analytically valid were uploaded to QCI Interpret for creation of a report for each sample based on detected variants and curated content, with a summary of findings and direct links to evidence sources.

Illumina MiSeq
The Actionable Insight Tumor Panel (QIAGEN, Hilden, Germany) was used for a MiSeq (Illumina, CA, USA) sequencing run. The Kapa "with bead" PCR free protocol (KAPABiosystems, MA, USA) was used in further Illumina library preparation steps. Samples were then paired-end sequenced on a MiSeq instrument (Illumina, CA, USA) according to Illumina guidelines. The resulting reads were mapped to the hg19 reference genome sequence using BWA mem software followed by GATK (best practices) to recalibrate base quality scores. Variants were called using MuTect. Variants were then filtered using GATK (best practice) and annotated using SnpEff. Variants at hotspot positions were selected using GATK.

Pyrosequencing and Sanger analyses
The sample DNA obtained with the QIAamp FFPE DNA Kit (QIAGEN, Hilden, Germany) was subjected to Pyrosequencing analysis and Sanger sequencing. For Pyrosequencing the samples were analyzed using the therascreen RAS Extension Pyro Kit (QIAGEN, Hilden, Germany) which covers mutations in KRAS codons 59, 61, 117 and 146 as well as NRAS codons 59, 117 and 146. Samples with mutations in KRAS or NRAS codons 12 and 13 were further analyzed with the therascreen KRAS or NRAS Pyro Kit (QIAGEN, Hilden, Germany) according to manufacturer's instructions. In addition, samples that failed the initial PyroMark KRAS analysis were subjected to a second round of analysis. Samples with an initial "check" status, or with an indicated mutation signal of LOD + 3% ("Potential low level mutation") were subjected to a second round of analysis performed in duplicate. Sanger sequencing was performed using Big Dye Terminator Technology and an ABI 3730xl sequencer (Thermo Fisher Scientific, MA, USA). Mutations were detected by analyzing the sequence trace files and the quantity of a base at a certain position was calculated from the area under the curve (AUC) at the mutation specific position in the electropherogram.

Therascreen qPCR
The therascreen KRAS RGQ PCR Kit (QIAGEN, Hilden, UK) is an allele-specific PCR-based technology with specific primers for the seven most common KRAS codon 12 and 13 mutations. The assay screens for the following mutations: 12 GCT (Ala), 12 GAT (Asp), 12 CGT (Arg), 12 TGT (Cys), 12 AGT (Ser), 12 GTT (Val), and 13 GAC (Asp). Mutation analysis was performed according to manufacturer's instructions, using the RotorGene real-time PCR instrument (QIAGEN, Hilden, UK). Analysis of results was performed following the recommendations in the manual, e.g. samples with a control assay with a cycle threshold (Ct) of 35 or higher were deemed invalid and excluded from the analysis. Samples were called mutation positive based on the delta Ct values reported in the handbook. Values over 40 cycles were scored as negative (wild-type).

Results
Evaluation of DNA quality by QuantiMIZE FFPE samples with ages ranging from 3 to 20 years were used for this study. The quality of the extracted DNA was measured by the GeneRead DNA QuantiMIZE QC assay (QIAGEN, Hilden, UK). Thirteen out of 56 samples failed quality checks and were excluded from further analysis (Additional file 1: Table S1). For the remaining 43 samples, 3 to 9 PCR cycles were added (depending on the QuantiMIZE quality scores) to compensate for differences in DNA quality during enrichment PCR. The additional cycles ensured that poor quality (highly fragmented) DNA samples yielded enough material for downstream library preparation. The quality of DNA purified from formalin fixated tissue decreases over the sample storage period time [18][19][20], but also depends on how tissues were treated, handled and processed before and during sample fixation [19,21,22].

GeneReader sequencing performance
The QIAact Actionable Insights Tumor Panel (QIAGEN, Hilden, UK) contains 773 unique variant positions in 12 genes (Table 1). An analysis of the reads mapped to the reference showed coverage levels that met the industrystandard 5% sensitivity criteria, even with aged FFPE samples. A 200× minimum read coverage cutoff was used for calling a variant at any position in the panel. For the 43 FFPE samples analyzed, an average amplicon coverage of 97.2% was observed, and an average variant insight coverage (hotspot coverage) of 99.8% was observed at read depths ≥200× (Table 1). For NA12878 samples, an average amplicon coverage of 98.5% was observed and an average variant insight coverage of 99.9% was observed at read depths of ≥200× (Table 1).  Table 4).
The use of the NA12878 control (Fig. 1, Additional file 2: Table S2) and AcroMetrix (Fig. 1, Additional file 3: Tables S3) reference standard materials demonstrated the good performance of the GeneReader platform on high frequency and low frequency variants, respectively. NA12878 has been used extensively as a reference standard material for verifying NGS platforms [17] and acts as a useful control in establishing background error. Besides its use as a GeneReader platform performance standard, AcroMetrix has also been used previously as a control for variant calls [26].

Discussion
A major advantage of NGS over traditional mutation detection methods is the ability to sequence multiple genes and variants simultaneously. Other advantages include minimal DNA input, faster turnaround time;  If KRAS is mutant by Therascreen KRAS RGQ PCR assay or Therascreen RAS extension Pyrosequencing assay, the condition is recorded as a mutant (MT) (b) For Actionable Insights Tumor Panel, a 5% allelic frequency cut off was used to call variants for codon 12, 13, 59, 61, 117 and 146, which are addressed by established QIAGENTherascreen PCR assays lower overall cost and higher throughput and sensitivity compared to traditional methods [12,[27][28][29]. NGS has revolutionized the speed of genetic and genomic discovery, and advanced our understanding of the molecular mechanisms of disease and potential treatment options. However, several major hurdles remain and still prevent NGS from being broadly adopted in clinical practice.
This is especially true for laboratories that are new to this technology, and may lack the in-house expertise required for processing complex bioinformatics data and interpretation of results. Such expertise is crucial to construct a bioinformatics pipeline and to evaluate the software and generate quality reports. The QIAGEN GeneReader NGS System allows users to perform experiments from sample to insight, tissue sample to decipherable report based on the interpretation of sequence variants detected. The QIAGEN GeneReader NGS workflow utilizes 'QCI Analyze' and 'QCI Interpret' for bioinformatics analysis and reporting of variants, including read mapping, variant calling and interpretation of results. It provides visualization of the alignment of sequencing results (Fig. 2) as well as a summary of the data. Quality assessment is also supported, both at the overall sequencing run level and for the analytic validity of individual variants to reduce false positive and negative results. Using the data visualization tools within QCI Analyze, it is possible to determine the quality of the results and assess any variants of interest. Further analysis of variants using QCI Interpret provides access to the curated information contained within the QIAGEN Knowledge Base enabling a deeper analysis and interpretation of results for each sample (Fig. 3). With all relevant information, a report can be created with a summary of findings and direct links to evidence sources. At the single variant level the QCI software is able to identify an individual variant as an actionable cancer mutation, and provides links to current clinical research insights, e.g. the KRAS G12D somatic variant it is established to confer resistance to the colorectal cancer drugs cetuximab and panitumumab, based on evidence curated from their FDA drug labels and clinical practice guidelines. Within QCI-Interpret information on active clinical trials recruiting colorectal cancer patients with particular mutations are provided with drug, nearest location, and trial phase information.
The relationship between FFPE DNA quality and sequencing accuracy is a critical point for any sequencing analysis. The GeneReader workflow starts with the Gene-Read FFPE DNA Kit for DNA extraction and is specifically designed to reduce artifacts known to commonly occur in FFPE treated samples. As seen in Table 2, by using FFPE samples aged from 3 to 20 years, the GeneRead FFPE DNA Kit successfully reduced the number of low frequency false positive variants detected. These low frequency false positive variants are likely caused by cytosine deamination and other fixation associated artifacts. Similar phenomena were observed by Bourgon [23], where pretreatment of FFPE samples with uracil DNA glycosylase (UDG) resulted in a dramatic reduction of false positives, with overall reductions of 77% for C > T and 94% for G > A changes, respectively. Biochemical removal of deaminated DNA eliminates deamination-associated false positive results; however, for samples with very low quality DNA such as highly fragmented FFPE treated samples, UDG-treated may constitute an issue, as the treatment introduces possible further strand breaks leading to even higher fragmentation and lower availability of intact template strands. Therefore, using the QuantiMIZE assay to identify those samples suitable for sequencing, based on an assessment of original intact and amplifiable templates, before starting an experiment is a critical point for an amplification based NGS technology. Previous reports observed that samples with lower amounts of amplifiable DNA are more likely to give a markedly increased number of false positive results [30,31].