Sciweavers

» Extracting Text from PostScript
LREC
2008
14 years 11 months ago
Yet another Platform for Extracting Knowledge from Corpora
The research field of "extracting knowledge bases from text collections" seems to be mature: its target and its working hypotheses are clear. In this paper we propose a ...
Francesca Fallucchi, Fabio Massimo Zanzotto
ASWC
2008
Springer
14 years 11 months ago
Catriple: Extracting Triples from Wikipedia Categories
As an important step towards bootstrapping the Semantic Web, many efforts have been made to extract triples from Wikipedia because of its wide coverage, good organization and rich ...
Qiaoling Liu, Kaifeng Xu, Lei Zhang, Haofen Wang, ...
DOCENG
2010
ACM
14 years 11 months ago
Glyph extraction from historic document images
This paper is about the reproduction of ancient texts with vectorised fonts. While for OCR only recognition rates count, a reproduction process does not necessarily require the re...
Lothar Meyer-Lerbs, Arne Schuldt, Björn Gottf...
DL
2000
Springer
15 years 2 months ago
Snowball: extracting relations from large plain-text collections
Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...
Eugene Agichtein, Luis Gravano
EMNLP
2009
14 years 7 months ago
A Rich Feature Vector for Protein-Protein Interaction Extraction from Multiple Corpora
Because of the importance of protein-protein interaction (PPI) extraction from text, many corpora have been proposed with slightly differing definitions of proteins and PPI. Since ...
Makoto Miwa, Rune Sætre, Yusuke Miyao, Jun-i...