Sciweavers

1149 search results - page 86 / 230
» Classification of Web Documents Using a Graph Model
ICIP
2003
IEEE
16 years 3 months ago
An entropy based segmentation algorithm for computer-generated document images
This paper presents an efficient compression-oriented segmentation algorithm for computer-generated document images. In this algorithm, a document image is represented in a block-...
Lijie Liu, Yan Dong, Xiaomu Song, Guoliang Fan
LREC
2008
15 years 3 months ago
On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems
Language models used in current automatic speech recognition systems are trained on general-purpose corpora and are therefore not relevant to transcribe spoken documents dealing w...
Gwénolé Lecorvé, Guillaume Gr...
SOFTVIS
2006
ACM
15 years 8 months ago
Semantic web data visualization with graph style sheets
Visual paradigms such as node-link diagrams are well suited to the representation of Semantic Web data encoded with the Resource Description Framework (RDF), whose data model can ...
Emmanuel Pietriga
AIRWEB
2007
Springer
15 years 8 months ago
Transductive Link Spam Detection
Web spam can significantly deteriorate the quality of search engines. Early web spamming techniques mainly manipulate page content. Since linkage information is widely used in we...
Dengyong Zhou, Chris Burges, Tao Tao
CIKM
2008
Springer
15 years 4 months ago
Semi-supervised text categorization by active search
In automated text categorization, given a small number of labeled documents, it is very challenging, if not impossible, to build a reliable classifier that is able to achieve high...
Zenglin Xu, Rong Jin, Kaizhu Huang, Michael R. Lyu...