Search Sciweavers | Sciweavers

154 search results - page 16 / 31

» Using Wikipedia and Wiktionary in Domain-Specific Informatio...

SIGIR
2010
ACM

Crowdsourcing a Wikipedia vandalism corpus

Download www.uni-weimar.de

We report on the construction of the PAN Wikipedia vandalism corpus, PAN-WVC-10, using Amazon’s Mechanical Turk. The corpus compiles 32 452 edits on 28 468 Wikipedia articles, a...

Martin Potthast

SIGIR
2011
ACM

No free lunch: brute force vs. locality-sensitive hashing for cross-lingual pairwise similarity

Download www.umiacs.umd.edu

This work explores the problem of cross-lingual pairwise similarity, where the task is to extract similar pairs of documents across two different languages. Solutions to this pro...

Ferhan Ture, Tamer Elsayed, Jimmy J. Lin

CORR
2010
Springer

TAGME: on-the-fly annotation of short text fragments (by Wikipedia entities)

Download www.di.unipi.it

We designed and implemented Tagme, a system that is able to efficiently and judiciously augment a plain-text with pertinent hyperlinks to Wikipedia pages. The specialty of Tagme w...

Paolo Ferragina, Ugo Scaiella

ICTIR
2009
Springer

What's in a Link? From Document Importance to Topical Relevance

Download staff.science.uva.nl

Web information retrieval is best known for its use of the Web’s link structure as a source of evidence. Global link evidence is by nature query-independent, and is therefore no ...

Marijn Koolen, Jaap Kamps

APCCM
2009

Extracting and Modeling the Semantic Information Content of Web Documents to Support Semantic Document Retrieval

Download crpit.com

Existing HTML mark-up is used only to indicate the structure and lay-out of documents, but not the document semantics. As a result web documents are difficult to be semantically p...

Shahrul Azman Noah, Lailatulqadri Zakaria, Arifah ...
