CRISPR Detection from Short Reads Using Partial Overlap Graphs

4 years 10 months ago
CRISPR Detection from Short Reads Using Partial Overlap Graphs
Clustered regularly interspaced short palindromic repeats (CRISPR) are structured regions in bacterial and archaeal genomes, which are part of an adaptive immune system against phages. Most of the automated tools that detect CRISPR loci rely on assembled genomes. However, many assemblers do not successfully handle repetitive regions. The first tool to work directly on raw sequence data is Crass, which requires that reads are long enough to contain two copies of the same repeat. We developed a method to identify CRISPR repeats from a raw sequence data of short reads. The algorithm is based on an observation differentiating CRISPR repeats from other types of repeats, and it involves a series of partial constructions of the overlap graph. A preliminary implementation of the algorithm shows good results and detects CRISPR repeats in cases where other tools fail to do so.
Ilan Ben-Bassat, Benny Chor
Added 17 Apr 2016
Updated 17 Apr 2016
Type Journal
Year 2015
Authors Ilan Ben-Bassat, Benny Chor
Comments (0)