TY - Generic T1 - BlindCall: ultra-fast base-calling of high-throughput sequencing data by blind deconvolution. Y1 - 2014 A1 - Ye, Chengxi A1 - Hsiao, Chiaowen A1 - Corrada Bravo, Hector KW - algorithms KW - High-Throughput Nucleotide Sequencing KW - HUMANS KW - Probability KW - Reproducibility of Results KW - Sequence Analysis, DNA KW - software KW - Time factors AB -

MOTIVATION: Base-calling of sequencing data produced by high-throughput sequencing platforms is a fundamental process in current bioinformatics analysis. However, existing third-party probabilistic or machine-learning methods that significantly improve the accuracy of base-calls on these platforms are impractical for production use due to their computational inefficiency.

RESULTS: We directly formulate base-calling as a blind deconvolution problem and implemented BlindCall as an efficient solver to this inverse problem. BlindCall produced base-calls at accuracy comparable to state-of-the-art probabilistic methods while processing data at rates 10 times faster in most cases. The computational complexity of BlindCall scales linearly with read length making it better suited for new long-read sequencing technologies.

JA - Bioinformatics VL - 30 CP - 9 M3 - 10.1093/bioinformatics/btu010 ER -