Sciweavers

» Corpora and data preparation
NLPRS
2001
Springer
15 years 2 months ago
A Separate-and-Learn Approach to EM Learning of PCFGs
We propose a new approach to EM learning of PCFGs. We completely separate the process of EM learning from that of parsing, and for the former, we introduce a new EM algorithm called the gra...
Taisuke Sato, Shigeru Abe, Yoshitaka Kameya, Kiyoa...
NLPRS
2001
Springer
15 years 2 months ago
A Probabilistic Model for Japanese Zero Pronoun Resolution Integrating Syntactic and Semantic Features
This paper proposes a method to resolve Japanese zero pronouns by identifying their antecedents. Our method uses a probabilistic model, which is decomposed into syntactic and sema...
Kazuhiro Seki, Atsushi Fujii, Tetsuya Ishikawa
CPM
2009
Springer
15 years 1 month ago
Linear Time Suffix Array Construction Using D-Critical Substrings
In this paper we present in detail a new efficient linear time and space suffix array construction algorithm (SACA), called the D-Critical Substring algorithm. The algorithm is built...
Ge Nong, Sen Zhang, Wai Hong Chan
ACL
2008
14 years 11 months ago
An Unsupervised Approach to Biography Production Using Wikipedia
We describe an unsupervised approach to multi-document sentence-extraction based summarization for the task of producing biographies. We utilize Wikipedia to automatically constru...
Fadi Biadsy, Julia Hirschberg, Elena Filatova
AAAI
2010
14 years 11 months ago
Kernelized Sorting for Natural Language Processing
Kernelized sorting is an approach for matching objects from two sources (or domains) that does not require any prior notion of similarity between objects across the two sources. U...
Jagadeesh Jagarlamudi, Seth Juarez, Hal Daumé...