Mining mutation chains in biological sequences

The increasing number of infectious disease outbreaks has led to a need for new research to better understand the origins, epidemiological features and pathogenicity of diseases caused by fast-mutating, fast-spreading viruses. Traditional sequence analysis methods do not take into account the spatio-temporal dynamics of rapidly evolving and spreading viral species, and they focus on identifying single-point mutations. In this paper, we propose a novel approach that incorporates space-time relationships for studying changes in protein sequences from fast-mutating viruses. We aim to detect both single-point mutations and k-mutations in the viral sequences. We define the problem of mutation chain pattern mining and design algorithms to discover valid mutation chains. We devise compact data structures to facilitate the mining process, as well as pruning strategies to increase the scalability of the algorithms. Experiments on both synthetic datasets and real-world influenza A vir...
Chang Sheng, Wynne Hsu, Mong-Li Lee, Joo Chuan Tong, See-Kiong Ng
Added 17 May 2010
Updated 17 May 2010
Type Conference
Year 2010
Where ICDE
Authors Chang Sheng, Wynne Hsu, Mong-Li Lee, Joo Chuan Tong, See-Kiong Ng