SARS-CoV-2 Next Generation Sequencing (NGS) data from clinical isolates from the East Texas Region of the United States

Authors
Rob E. Carpenter, Vaibhav K. Tamrakar, Sadia Almas, Aditya Sharma, Rahul Sharma
Publication date
June 21, 2023
Journal
Data in Brief
Volume
49
Issue
109312
Pages
1-6
Publisher
Elsevier Inc.
Description
The SARS-CoV-2 virus has evolved throughout the pandemic and is likely to continue evolving into new variants. Some of these variants may affect functional properties, including infectivity, interactions with host immunity, and disease severity. And compromised vaccine efficacy is an emerging concern with every new viral variant. Next-generation sequencing (NGS) has emerged as the tool of choice for discovering new variants and understanding the transmission dynamics of SARS-CoV-2. Deciphering the SARS-CoV-2 genome has enabled epidemiological survivance and forecast of altered etiologically. Clinical presentations of the infection are influenced by comorbidities such as age, immune status, diabetes, and the infecting variant. Thus, clinical management and vaccine efficacy may differ for new variants. For example, some monoclonal antibody treatments are variant-specific, and some vaccines are less efficacious against the omicron and delta variants of SARS-CoV-2. Consequently, determining the local outbreaks and monitoring SARS-CoV-2 Variants of Concern (VOC) is one of the primary strategies for the pandemic's containment. Although next-generation sequencing (NGS) is a gold standard for genomic surveillance and variant discovery, the assays are not approved for variant diagnosis for clinical decision-making. Advanta Genetics, Texas, USA, optimized Illumina COVID-seq protocol to reduce cost without compromising accuracy and validated the Illumina COVID-Seq assay as a Laboratory Developed Test (LDT) according to the guidelines prescribed by the College of American Pathologists (CAP) and Clinical Laboratory Improvement Amendments (CLIA). The whole genome of the virus was sequenced in ( n = 161) samples from the East Texas region using the Illumina MiniSeq® instrument and analyzed by using Illumina baseSpace ( https://basespace.illumina.com ) bioinformatics pipeline. Briefly, the library was prepared by using Illumina COVIDSeq research use only (RUO) kit, and the individual libraries were normalized using the DNA concentration measured by Qubit Flex Fluorometer, and the pooled libraries were sequenced on Illumina MiniSeq® Instrument. Illumina baseSpace application was used for sequencing QC, FASTQ generation, genome assembly, and identification of SARS-CoV-2 variants. This whole genome shotgun project ( n = 161) has been deposited at GISAID.