Search Sciweavers | Sciweavers

3180 search results - page 303 / 636

» Knowledge-based Document Analysis

188

click to vote

PLDI
2010
ACM

361views Programming Languages» more PLDI 2010»

A Context-free Markup Language for Semi-structured Text

16 years 3 months ago

Download www.cs.princeton.edu

An ad hoc data format is any non-standard, semi-structured data format for which robust data processing tools are not available. In this paper, we present ANNE, a new kind of mark...

Qian Xi, David Walker

claim paper

Read More »

179

click to vote

EMNLP
2008

158views Natural Language Processing» more EMNLP 2008»

An Analysis of Active Learning Strategies for Sequence Labeling Tasks

15 years 7 months ago

Download www.cs.cmu.edu

Active learning is well-suited to many problems in natural language processing, where unlabeled data may be abundant but annotation is slow and expensive. This paper aims to shed ...

Burr Settles, Mark Craven

claim paper

Read More »

167

click to vote

ICTIR
2009
Springer

159views Information Technology» more ICTIR 2009»

An Analysis of NP-Completeness in Novelty and Diversity Ranking

15 years 3 months ago

Download ir.cis.udel.edu

Abstract. A useful ability for search engines is to be able to rank objects with novelty and diversity: the top k documents retrieved should cover possible interpretations of a que...

Ben Carterette

claim paper

Read More »

146

click to vote

GFKL
2005
Springer

142views Data Mining» more GFKL 2005»

15 years 11 months ago

Near Similarity Search and Plagiarism Analysis

Download www.uni-weimar.de

Abstract. Existing methods to text plagiarism analysis mainly base on “chunking”, a process of grouping a text into meaningful units each of which gets encoded by an integer nu...

Benno Stein, Sven Meyer zu Eissen

claim paper

Read More »

169

click to vote

IPM
2007

145views more IPM 2007»

Text mining techniques for patent analysis

15 years 6 months ago

Download dblab.mgt.ncu.edu.tw

Patent documents contain important research results. However, they are lengthy and rich in technical terminology such that it takes a lot of human eﬀorts for analyses. Automatic...

Yuen-Hsien Tseng, Chi-Jen Lin, Yu-I Lin

claim paper

Read More »

« Prev « First page 303 / 636 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers