Abstract. Data stream mining has become a novel research topic of growing interest in knowledge discovery. Most proposed algorithms for data stream mining assume that each data blo...
Our LAMDAer team has won the PAKDD'06 Data Mining Competition (Open Category) Grand Champion. This report presents our solution to PAKDD'06 Data Mining Competition. Follo...
Yang Yu, De-Chuan Zhan, Xu-Ying Liu, Ming Li, Zhi-...
Many applications depend on efficient management of large sets of distinct strings in memory. For example, during index construction for text databases a record is held for each d...
This paper introduces deep syntactic structures to syntax-based Statistical Machine Translation (SMT). We use a Head-driven Phrase Structure Grammar (HPSG) parser to obtain the de...
Phrase-based decoding is conceptually simple and straightforward to implement, at the cost of drastically oversimplified reordering models. Syntactically aware models make it pos...