Compression of quality factors in next generation sequencing

O. U. Nalbantoğlu, K. Sayood

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

We propose a compression algorithm for the quality scores contained in FASTQ files, which are generated in large volumes during high-throughput sequencing. The proposed algorithm is a context-dependent arithmetic coder based on observations of the structure of quality scores in FASTQ files. Simulation results indicate that the algorithm significantly outperforms the current state of the art.
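The abstract does not describe the context model itself, so the following is only an illustrative sketch of the general idea behind context-dependent arithmetic coding, not the authors' algorithm. It estimates how many bits an ideal adaptive arithmetic coder would spend on a quality string when each symbol's probability is conditioned on the preceding `order` quality symbols. The function name `ideal_code_length_bits`, the choice of context (previous symbols only), the add-one probability estimate, the alphabet size of 94 printable characters, and the toy quality string are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def ideal_code_length_bits(quality, order=2, alphabet_size=94):
    """Bits an ideal adaptive arithmetic coder would need for `quality`.

    Hypothetical model: the context is the previous `order` quality
    symbols, and probabilities use an add-one (Laplace) estimate that a
    decoder could reproduce symbol by symbol.
    """
    counts = defaultdict(lambda: defaultdict(int))  # context -> symbol -> count
    totals = defaultdict(int)                       # context -> symbols seen
    bits = 0.0
    for i, symbol in enumerate(quality):
        context = quality[max(0, i - order):i]
        # Probability of this symbol given the counts seen so far in context.
        p = (counts[context][symbol] + 1) / (totals[context] + alphabet_size)
        bits += -math.log2(p)
        # Update the model only after coding, as a decoder would.
        counts[context][symbol] += 1
        totals[context] += 1
    return bits


# Toy quality string (made up): long runs that decay, as a stand-in for the
# kind of structure a context model can exploit.
quality = "IIIIIIIIIIHHHHHGGGFFF###" * 200

order0 = ideal_code_length_bits(quality, order=0)
order2 = ideal_code_length_bits(quality, order=2)
print(f"order 0: {order0 / len(quality):.3f} bits/symbol")
print(f"order 2: {order2 / len(quality):.3f} bits/symbol")
```

On this toy input the order-2 model needs fewer bits than the context-free one, which is the effect a context-dependent coder relies on. Real quality strings, and the contexts used in the paper, may behave differently.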

Original language: English (US)
Title of host publication: Proceedings - DCC 2014
Subtitle of host publication: 2014 Data Compression Conference
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 419
Number of pages: 1
ISBN (Print): 9781479938827
DOIs
State: Published - 2014
Event: 2014 Data Compression Conference, DCC 2014 - Snowbird, UT, United States
Duration: Mar 26 2014 to Mar 28 2014

Publication series

Name: Data Compression Conference Proceedings
ISSN (Print): 1068-0314

Conference

Conference: 2014 Data Compression Conference, DCC 2014
Country: United States
City: Snowbird, UT
Period: 3/26/14 to 3/28/14

Keywords

  • Biological sequence compression
  • DNA
  • Quality factor

ASJC Scopus subject areas

  • Computer Networks and Communications
