Search Sciweavers | Sciweavers

WWW
2007
ACM

224 views, Internet Technology

EPCI: extracting potentially copyright infringement texts from the web

16 years 1 month ago

Download www2007.org

In this paper, we propose a new system extracting potentially copyright infringement texts from the Web, called EPCI. EPCI extracts them in the following way: (1) generating a set...

Takashi Tashiro, Takanori Ueda, Taisuke Hori, Yu H...

EACL
2006
ACL Anthology

156 views, Natural Language Processing

Large Linguistically-Processed Web Corpora for Multiple Languages

15 years 2 months ago

Download acl.ldc.upenn.edu

The Web contains vast amounts of linguistic data. One key issue for linguists and language technologists is how to access it. Commercial search engines give highly compromised acc...

Marco Baroni, Adam Kilgarriff

CIKM
2008
Springer

174 views, Information Technology

A language for manipulating clustered web documents results

15 years 2 months ago

Download dblab.cs.nccu.edu.tw

We propose a novel conception language for exploring the results retrieved by several internet search services (like search engines) that cluster retrieved documents. The goal is ...

Gloria Bordogna, Alessandro Campi, Giuseppe Psaila...

WEBI
2005
Springer

123 views, Internet Technology

Metadata Propagation in the Web Using Co-Citations

15 years 6 months ago

Download www.emse.fr

Given the large heterogeneity of the World Wide Web, using metadata on the search engines side seems to be a useful track for information retrieval. Though, because a manual quali...

Camille Prime-Claverie, Michel Beigbeder, Thierry ...

ECAI
2006
Springer

161 views, Artificial Intelligence

Automatic Term Categorization by Extracting Knowledge from the Web

15 years 4 months ago

Download www.dii.unisi.it

This paper addresses the problem of categorizing terms or lexical entities into a predefined set of semantic domains exploiting the knowledge available on-line in the Web. The prop...

Leonardo Rigutini, Ernesto Di Iorio, Marco Ernande...
