Monday, June 13, 2016

Fast kmer counting table algorithm using perfect hash function: C++ pseudo-code integration into R using Rcpp API


Abstract



Counting kmers (substrings of length k in DNA sequence data) is an essential component of many methods in bioinformatics, including data preprocessing for de novo assembly, repeat detection, and sequencing coverage estimation. We propose a simple algorithm that counts kmers using a perfect hash table implemented in C++ and exported to R through the Rcpp API. The PDF version is available at: Fast kmer counting table algorithm using perfect hash function: C++ pseudo-code integration into R using Rcpp API
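The idea in the abstract can be illustrated with a minimal sketch. It is not the code from the PDF; it assumes the perfect hash is the usual 2-bit encoding of A, C, G, T, which maps every kmer to a unique index in a table of 4^k slots. The names `base_code` and `count_kmers`, the cap of k at 12, and the choice to skip kmers containing other characters (such as N) are all assumptions made for illustration. To call it from R, one would typically wrap such a function with Rcpp (for example using the `// [[Rcpp::export]]` attribute); that wrapper is left out so the sketch compiles as plain C++.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Map a nucleotide to a 2-bit code; returns -1 for any other character (e.g. N).
inline int base_code(char c) {
    switch (c) {
        case 'A': case 'a': return 0;
        case 'C': case 'c': return 1;
        case 'G': case 'g': return 2;
        case 'T': case 't': return 3;
        default:            return -1;
    }
}

// Count every kmer of `seq` in a dense table with 4^k slots.
// The 2-bit encoding acts as a perfect hash: distinct kmers never collide.
// Assumption: k is capped at 12 (hypothetical limit) to keep the table small.
std::vector<std::uint32_t> count_kmers(const std::string& seq, unsigned k) {
    if (k == 0 || k > 12) return {};

    const std::size_t table_size = std::size_t(1) << (2 * k);  // 4^k
    const std::size_t mask = table_size - 1;                   // keeps the last k bases
    std::vector<std::uint32_t> counts(table_size, 0);

    std::size_t hash = 0;   // rolling 2-bit encoding of the current window
    std::size_t valid = 0;  // number of consecutive valid bases seen so far

    for (char c : seq) {
        const int code = base_code(c);
        if (code < 0) {     // ambiguous base: restart the window
            hash = 0;
            valid = 0;
            continue;
        }
        hash = ((hash << 2) | static_cast<std::size_t>(code)) & mask;
        if (++valid >= k) ++counts[hash];
    }
    return counts;
}
```

With this encoding the kmer AC for k = 2 lands at index 0*4 + 1 = 1, so `count_kmers("ACGTACGT", 2)[1]` holds the number of times AC occurs. The dense table trades memory for speed: lookups are a single array access, but the table grows as 4^k, which is why the sketch limits k.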







