Search Sciweavers | Sciweavers

CICLING
2005
Springer

118 views, Natural Language Processing

Instance Pruning by Filtering Uninformative Words: An Information Extraction Case Study

15 years 7 months ago

In this paper we present a novel instance pruning technique for Information Extraction (IE). In particular, our technique filters out uninformative words from texts on the basis o...

Alfio Massimiliano Gliozzo, Claudio Giuliano, Raff...

ACL
2012

181 views, Computational Linguistics

Syntactic Annotations for the Google Books NGram Corpus

13 years 4 months ago

Download www.petrovi.de

We present a new edition of the Google Books Ngram Corpus, which describes how often words and phrases were used over a period of five centuries, in eight languages; it reflects...

Yuri Lin, Jean-Baptiste Michel, Erez Aiden Lieberm...

ACL
1994

95 views, Computational Linguistics

Similarity-Based Estimation of Word Cooccurrence Probabilities

15 years 3 months ago

Download www.aclweb.org

In many applications of natural language processing it is necessary to determine the likelihood of a given word combination. For example, a speech recognizer may need to determine...

Ido Dagan, Fernando C. N. Pereira, Lillian Lee

COLING
2000

87 views, Computational Linguistics

Local context templates for Chinese constituent boundary prediction

15 years 3 months ago

Download www.csai.tsinghua.edu.cn

In this paper, we proposed a shallow syntactic knowledge description: constituent boundary representation and its simple and efficient prediction algorithm, based on different lo...

Qiang Zhou

EMNLP
2010

141 views, Natural Language Processing

An Efficient Algorithm for Unsupervised Word Segmentation with Branching Entropy and MDL

14 years 11 months ago

Download www.aclweb.org

This paper proposes a fast and simple unsupervised word segmentation algorithm that utilizes the local predictability of adjacent character sequences, while searching for a leaste...

Valentin Zhikov, Hiroya Takamura, Manabu Okumura
