Search Sciweavers | Sciweavers

2677 search results - page 186 / 536

» Extracting Structured Data from Web Pages

126

click to vote

SIGIR
2003
ACM

116views Information Technology» more SIGIR 2003»

ReCoM: reinforcement clustering of multi-type interrelated data objects

15 years 9 months ago

Download research.microsoft.com

Most existing clustering algorithms cluster highly related data objects such as Web pages and Web users separately. The interrelation among different types of data objects is eith...

Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu,...

claim paper

Read More »

123

click to vote

IJCNLP
2005
Springer

168views Natural Language Processing» more IJCNLP 2005»

Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web

15 years 9 months ago

Download www.aclweb.org

This paper presents a lightweight method for unsupervised extraction of paraphrases from arbitrary textual Web documents. The method diﬀers from previous approaches to paraphrase...

Marius Pasca, Péter Dienes

claim paper

Read More »

134

click to vote

DEXA
2003
Springer

130views Database» more DEXA 2003»

Finding Neighbor Communities in the Web Using Inter-site Graph

15 years 9 months ago

Download www.tkl.iis.u-tokyo.ac.jp

In recent years, link-based information retrieval methods from the Web are developed. A framework of these methods is a Web graph using pages as vertices and Web-links as edges. In...

Yasuhito Asano, Hiroshi Imai, Masashi Toyoda, Masa...

claim paper

Read More »

115

click to vote

CIB
2002

100views more CIB 2002»

Web-log Mining for Quantitative Temporal-Event Prediction

15 years 3 months ago

Download www.comp.hkbu.edu.hk

The web log data embed much of web users' browsing behavior. From the web logs, one can discover patterns that predict the users' future requests based on their current b...

Qiang Yang, Hui Wang, Wei Zhang

claim paper

Read More »

134

click to vote

CIVR
2009
Springer

146views Image Analysis» more CIVR 2009»

Web news categorization using a cross-media document graph

15 years 10 months ago

Download ctp.di.fct.unl.pt

In this paper we propose a multimedia categorization framework that is able to exploit information across different parts of a multimedia document (e.g., a Web page, a PDF, a Micr...

José Iria, Fabio Ciravegna, João Mag...

claim paper

Read More »

« Prev « First page 186 / 536 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers