Search Sciweavers | Sciweavers

154

WWW
2009
ACM

106views Internet Technology» more WWW 2009»

News article extraction with template-independent wrapper

16 years 26 days ago

We consider the problem of template-independent news extraction. The state-of-the-art news extraction method is based on template-level wrapper induction, which has two serious li...

Junfeng Wang, Xiaofei He, Can Wang, Jian Pei, Jiaj...

claim paper

Read More »

172

click to vote

BMCBI
2008

83views more BMCBI 2008»

Terminologies for text-mining; an experiment in the lipoprotein metabolism domain

15 years 6 months ago

Download www.biomedcentral.com

Background: The engineering of ontologies, especially with a view to a text-mining use, is still a new research field. There does not yet exist a well-defined theory and technolog...

Dimitra Alexopoulou, Thomas Wächter, Laura Pi...

claim paper

Read More »

174

click to vote

SIGIR
2011
ACM

204views Information Technology» more SIGIR 2011»

Social context summarization

14 years 9 months ago

Download keg.cs.tsinghua.edu.cn

We study a novel problem of social context summarization for Web documents. Traditional summarization research has focused on extracting informative sentences from standard docume...

Zi Yang, Keke Cai, Jie Tang, Li Zhang, Zhong Su, J...

claim paper

Read More »

204

Voted

EMNLP
2011

164views Natural Language Processing» more EMNLP 2011»

Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation

14 years 5 months ago

Download cs.jhu.edu

We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...

Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...

claim paper

Read More »

215

click to vote

SIGIR
2012
ACM

234views Information Technology» more SIGIR 2012»

Predicting quality flaws in user-generated content: the case of wikipedia

13 years 8 months ago

Download www.uni-weimar.de

The detection and improvement of low-quality information is a key concern in Web applications that are based on user-generated content; a popular example is the online encyclopedi...

Maik Anderka, Benno Stein, Nedim Lipka

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers