Sciweavers

» Extracting Text from PostScript
SIGIR
2000
ACM
15 years 6 months ago
OCELOT: a system for summarizing Web pages
We introduce OCELOT, a prototype system for automatically generating the “gist” of a web page by summarizing it. Although most text summarization research to date has ...
Adam L. Berger, Vibhu O. Mittal
ACL
2008
15 years 3 months ago
A Joint Model of Text and Aspect Ratings for Sentiment Summarization
Online reviews are often accompanied with numerical ratings provided by users for a set of service or product aspects. We propose a statistical model which is able to discover cor...
Ivan Titov, Ryan T. McDonald
ICDAR
2005
IEEE
15 years 7 months ago
A Model for Detecting and Merging Vertically Spanned Table Cells in Plain Text Documents
A spanned cell in a table is a single, complete unit that physically occupies multiple columns and/or multiple rows. Spanned cells are common in tables, and they are a significan...
Vanessa Long, Robert Dale, Steve Cassidy
IAJIT
2008
15 years 2 months ago
Using WordNet for Text Categorization
This paper explores a method that uses WordNet concepts to categorize text documents. The bag of words representation used for text representation is unsatisfactory as it ignores p...
Zakaria Elberrichi, Abdellatif Rahmoun, Mohamed Am...
EDBT
2009
ACM
15 years 9 months ago
High-performance information extraction with AliBaba
A wealth of information is available only in web pages, patents, publications etc. Extracting information from such sources is challenging, both due to the typically complex langu...
Peter Palaga, Long Nguyen, Ulf Leser, Jörg Ha...