Sciweavers

EMNLP
2009
13 years 2 months ago
On the Role of Lexical Features in Sequence Labeling
We use the technique of SVM anchoring to demonstrate that lexical features extracted from a training corpus are not necessary to obtain state of the art results on tasks such as N...
Yoav Goldberg, Michael Elhadad
TCBB
2010
13 years 2 months ago
Efficient Extraction of Protein-Protein Interactions from Full-Text Articles
Proteins and their interactions govern virtually all cellular processes, such as regulation, signaling, metabolism, and structure. Most experimental findings pertaining to such ...
Jörg Hakenberg, Robert Leaman, Nguyen Ha Vo, ...
CEC
2010
IEEE
13 years 5 months ago
Evolving natural language grammars without supervision
Unsupervised grammar induction is one of the most difficult tasks in language processing. Its goal is to extract a grammar representing the language structure using texts without a...
Lourdes Araujo, Jesus Santamaria
COLING
2000
13 years 6 months ago
Text Genre Detection Using Common Word Frequencies
In this paper we present a method for detecting the text genre quickly and easily following an approach originally proposed in authorship attribution studies which uses as style m...
Efstathios Stamatatos, Nikos Fakotakis, George K. ...
LREC
2010
13 years 6 months ago
Study of Word Sense Disambiguation System that uses Contextual Features - Approach of Combining Associative Concept Dictionary a
We propose a Word Sense Disambiguation (WSD) method that accurately classifies ambiguous words to concepts in the Associative Concept Dictionary (ACD) even when the test corpus an...
Kyota Tsutsumida, Jun Okamoto, Shun Ishizaki, Mako...
ACL
2007
13 years 6 months ago
SVM Model Tampering and Anchored Learning: A Case Study in Hebrew NP Chunking
We study the issue of porting a known NLP method to a language with little existing NLP resources, specifically Hebrew SVM-based chunking. We introduce two SVM-based methods – ...
Yoav Goldberg, Michael Elhadad
ACL
2008
13 years 6 months ago
Robust Extraction of Named Entity Including Unfamiliar Word
This paper proposes a novel method to extract named entities including unfamiliar words which do not occur or occur few times in a training corpus using a large unannotated corpus...
Masatoshi Tsuchiya, Shinya Hida, Seiichi Nakagawa
CIKM
2004
Springer
13 years 8 months ago
InfoAnalyzer: a computer-aided tool for building enterprise taxonomies
In this paper we study the problem of collecting training samples for building enterprise taxonomies. We develop a computer-aided tool named InfoAnalyzer, which can effectively as...
Li Zhang, Shixia Liu, Yue Pan, Liping Yang
SIGIR
2010
ACM
13 years 8 months ago
Cross-language retrieval using link-based language models
We propose a cross-language retrieval model that is solely based on Wikipedia as a training corpus. The main contri
Benjamin Roth, Dietrich Klakow