Sciweavers

1066 search results - page 136 / 214
» Untangling the World-Wide Web
Sort
View
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
15 years 5 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
JAIR
2008
173views more  JAIR 2008»
14 years 10 months ago
Creating Relational Data from Unstructured and Ungrammatical Data Sources
In order for agents to act on behalf of users, they will have to retrieve and integrate vast amounts of textual data on the World Wide Web. However, much of the useful data on the...
Matthew Michelson, Craig A. Knoblock
ICDE
2008
IEEE
118views Database» more  ICDE 2008»
15 years 4 months ago
OntoNet: Scalable knowledge-based networking
Recent years have seen a proliferation of work on the Semantic Web, an initiative to enable intelligent agents to reason about and utilize World Wide Web content and services. Con...
Joseph B. Kopena, Boon Thau Loo
DOCENG
2006
ACM
15 years 4 months ago
Templates, microformats and structured editing
Microformats and semantic XHTML add semantics to web pages while taking advantage of the existing (X)HTML infrastructure. This approach enables new applications that can be deploy...
Francesc Campoy Flores, Vincent Quint, Irèn...
70
Voted
W4A
2006
ACM
15 years 4 months ago
Dialog generation for voice browsing
In this paper we present our voice browser system, HearSay, which provides efficient access to the World Wide Web to people with visual disabilities. HearSay includes contentbased...
Zan Sun, Amanda Stent, I. V. Ramakrishnan