We describe a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity...
This paper describes a new approach towards detecting plagiarism and scientific documents that have been read but not cited. In contrast to existing approaches, which analyze docu...
We propose an unsupervised segmentation method based on an assumption about language data: that the increasing point of entropy of successive characters is the location of a word ...
Abstract—Video shot boundary detection is one of the fundamental tasks of video indexing and retrieval applications. Although many methods have been proposed for this task, find...
Duy-Dinh Le, Shin'ichi Satoh, Thanh Duc Ngo, Duc A...
It is relatively common for different people or organizations to share the same name. Given the increasing amount of information available online, this results in the ever growing...