Sciweavers

382 search results - page 60 / 77
» Using symbolic objects to cluster web documents
Sort
View
CIKM
2004
Springer
15 years 3 months ago
Distributional term representations: an experimental comparison
A number of content management tasks, including term categorization, term clustering, and automated thesaurus generation, view natural language terms (e.g. words, noun phrases) as...
Alberto Lavelli, Fabrizio Sebastiani, Roberto Zano...
84
Voted
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
15 years 4 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
EMNLP
2008
14 years 11 months ago
Soft-Supervised Learning for Text Classification
We propose a new graph-based semisupervised learning (SSL) algorithm and demonstrate its application to document categorization. Each document is represented by a vertex within a ...
Amarnag Subramanya, Jeff Bilmes
SIGDOC
2005
ACM
15 years 3 months ago
Information fragments for a pervasive world
Is the second paragraph dead? Technology and users are tending to create and consume information in ever decreasing chunks, forcing content creators to create shorter fragments of...
Russell Beale
IAJIT
2010
96views more  IAJIT 2010»
14 years 8 months ago
A Data Mashup for Dynamic Composition of Adaptive Courses
: This paper presents a novel adaptive course composition system that based on mashing up learning content in a web application. The system includes three major components, static ...
Mohammed Al-Zoube, Baha Khasawneh