Sciweavers

182 search results - page 21 / 37
» Next Generation Web Search: Setting Our Sites
CIDM
2007
IEEE
15 years 8 months ago
Distributed Document Clustering Using Word-clusters
Document clustering has become an increasingly important task in analyzing huge numbers of documents distributed among various sites. The challenging aspect is to analyze this e...
Debzani Deb, Rafal A. Angryk
WWW
2007
ACM
16 years 2 months ago
EPCI: extracting potentially copyright infringement texts from the web
In this paper, we propose a new system extracting potentially copyright infringement texts from the Web, called EPCI. EPCI extracts them in the following way: (1) generating a set...
Takashi Tashiro, Takanori Ueda, Taisuke Hori, Yu H...
INTERACTION
2009
ACM
15 years 8 months ago
DTD2OWL: automatic transforming XML documents into OWL ontology
DTD and its instance have been considered the standard for data representation and information exchange format on the current web. However, when coming to the next generation of w...
Pham Thi Thu Thuy, Young-Koo Lee, Sungyoung Lee
WWW
2010
ACM
15 years 8 months ago
Entity relation discovery from web tables and links
The World-Wide Web consists not only of a huge number of unstructured texts, but also a vast amount of valuable structured data. Web tables [2] are a typical type of structured in...
Cindy Xide Lin, Bo Zhao, Tim Weninger, Jiawei Han,...
HICSS
2008
IEEE
15 years 8 months ago
Using Visual Features for Fine-Grained Genre Classification of Web Pages
The field of automatic genre classification has primarily focused on extracting textual features from documents. The goal of this research is to investigate whether visual feature...
Ryan Levering, Michal Cutler, Lei Yu