english Icono del idioma   español Icono del idioma  

Por favor, use este identificador para citar o enlazar este ítem: https://hdl.handle.net/20.500.12008/39617 Cómo citar
Título: Nanopore quality score resolution can be reduced with little effect on downstream analysis
Autor: Rivara-Espasandín, Martín
Balestrazzi, Lucía
Dufort y Álvarez, Guillermo
Ochoa, Idoia
Seroussi, Gadiel
Smircich, Pablo
Sotelo Silveira, José Roberto
Martín, Álvaro
Tipo: Artículo
Palabras clave: Nanopore sequencing, Bioinformatic
Fecha de publicación: 2022
Resumen: Motivation: The use of high precision for representing quality scores in nanopore sequencing data makes these scores hard to compress and, thus, responsible for most of the information stored in losslessly compressed FASTQ files. This motivates the investigation of the effect of quality score information loss on downstream analysis from nanopore sequencing FASTQ files. Results: We polished de novo assemblies for a mock microbial community and a human genome, and we called variants on a human genome. We repeated these experiments using various pipelines, under various coverage level scenarios and various quality score quantizers. In all cases, we found that the quantization of quality scores causes little difference (or even sometimes improves) on the results obtained with the original (non-quantized) data. This suggests that the precision that is currently used for nanopore quality scores may be unnecessarily high, and motivates the use of lossy compression algorithms for this kind of data. Moreover, we show that even a non-specialized compressor, such as gzip, yields large storage space savings after the quantization of quality scores.
Editorial: Oxford University Press
EN: Bioinformatics Advances, 2022, 2(1): 1–7.
Financiadores: ANII: FSDA_1_2018_1_154790.
DOI: 10.1093/bioadv/vbac054
ISSN: 2635-0041
Citación: Rivara-Espasandín, M, Balestrazzi, L, Dufort y Álvarez, G, [y otros autores]. "Nanopore quality score resolution can be reduced with little effect on downstream analysis". Bioinformatics Advances. [en línea] 2022, 2(1): 1–7. 7 h. DOI: 10.1093/bioadv/vbac054
Licencia: Licencia Creative Commons Atribución (CC - By 4.0)
Aparece en las colecciones: Publicaciones académicas y científicas - Facultad de Ciencias

Ficheros en este ítem:
Fichero Descripción Tamaño Formato   
101093bioadvvbac054.pdf538,33 kBAdobe PDFVisualizar/Abrir


Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons Creative Commons