Comprehensive analysis of structural variants in breast cancer genomes using single-molecule sequencing.

TitleComprehensive analysis of structural variants in breast cancer genomes using single-molecule sequencing.
Publication TypeJournal Article
Year of Publication2020
AuthorsAganezov, S, Goodwin, S, Sherman, RM, Sedlazeck, FJ, Arun, G, Bhatia, S, Lee, I, Kirsche, M, Wappel, R, Kramer, M, Kostroff, K, Spector, DL, Timp, W, W McCombie, R, Schatz, MC
JournalGenome Res
Volume30
Issue9
Pagination1258-1273
Date Published2020 Sep
ISSN1549-5469
Abstract

Improved identification of structural variants (SVs) in cancer can lead to more targeted and effective treatment options as well as advance our basic understanding of the disease and its progression. We performed whole-genome sequencing of the SKBR3 breast cancer cell line and patient-derived tumor and normal organoids from two breast cancer patients using Illumina/10x Genomics, Pacific Biosciences (PacBio), and Oxford Nanopore Technologies (ONT) sequencing. We then inferred SVs and large-scale allele-specific copy number variants (CNVs) using an ensemble of methods. Our findings show that long-read sequencing allows for substantially more accurate and sensitive SV detection, with between 90% and 95% of variants supported by each long-read technology also supported by the other. We also report high accuracy for long reads even at relatively low coverage (25×-30×). Furthermore, we integrated SV and CNV data into a unifying karyotype-graph structure to present a more accurate representation of the mutated cancer genomes. We find hundreds of variants within known cancer-related genes detectable only through long-read sequencing. These findings highlight the need for long-read sequencing of cancer genomes for the precise analysis of their genetic instability.

DOI10.1101/gr.260497.119
Alternate JournalGenome Res
PubMed ID32887686
PubMed Central IDPMC7545150
Grant ListP30 CA045508 / CA / NCI NIH HHS / United States
R01 HG006677 / HG / NHGRI NIH HHS / United States
R01 HG009190 / HG / NHGRI NIH HHS / United States
R21 CA220411 / CA / NCI NIH HHS / United States