KEC: unique sequence search by K-mer exclusion

Bioinformatics. 2021 Oct 11;37(19):3349-3350. doi: 10.1093/bioinformatics/btab196.

Abstract

Summary: Searching for amino acid or nucleic acid sequences unique to one organism can be challenging, depending on the size of the available datasets. K-mer elimination by cross-reference (KEC) allows users to find unique sequences quickly and easily by providing target and non-target sequences. Owing to its speed, it can be applied to genome-sized datasets and run on desktop or laptop computers with modest specifications.
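
The abstract does not describe the algorithm in detail, so the following is only a minimal sketch of the general idea of k-mer exclusion, not KEC's actual implementation. It assumes that every k-mer occurring in any non-target sequence is removed from the target, and that the surviving, overlapping k-mers are merged back into contiguous unique regions. The names `kmers` and `unique_regions`, the choice of `k`, and the toy sequences are all hypothetical.

```python
def kmers(seq, k):
    """Return the set of all substrings of length k in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def unique_regions(target, non_targets, k):
    """Return contiguous regions of target built only from k-mers
    that are absent from every non-target sequence (illustrative only)."""
    # Pool the k-mers of all non-target sequences into one exclusion set.
    excluded = set()
    for seq in non_targets:
        excluded |= kmers(seq, k)

    regions = []
    start = None  # start index of the region currently being extended
    end = 0       # exclusive end index of that region
    for i in range(len(target) - k + 1):
        if target[i:i + k] not in excluded:
            if start is None:
                start = i
            end = i + k  # extend the region to cover this k-mer
        elif start is not None:
            # An excluded k-mer closes the current region.
            regions.append(target[start:end])
            start = None
    if start is not None:
        regions.append(target[start:end])
    return regions


if __name__ == "__main__":
    # Toy example: only the junction between the two halves is unique.
    print(unique_regions("AAACCCGGGTTT", ["AAACCC", "GGGTTT"], 4))
```

In this toy case, the only target k-mers missing from the non-target set span the junction between the two halves, so a single merged region is reported. A set-based lookup keeps each membership test close to constant time, which is one plausible reason such an approach could scale to genome-sized inputs on modest hardware.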

Availability and implementation: KEC is freely available for non-commercial purposes. Source code and executable binary files compiled for Linux, Mac and Windows can be downloaded from https://github.com/berybox/KEC.

Supplementary information: Supplementary data are available at Bioinformatics online.