Sciweavers

1261 search results - page 110 / 253
» Extracting Text from PostScript
Sort
View
IJCAI
2003
14 years 11 months ago
Domain Event Extraction and Representation with Domain Ontology
With domain ontology, a meaningful index of document indexing, such as the domain events structure in this paper, can be defined. Since the construction of domain ontology is cost...
Shih-Hung Wu, Tzong-Han Tsai, Wen-Lian Hsu
AAAI
2000
14 years 11 months ago
Information Extraction with HMM Structures Learned by Stochastic Optimization
Recent research has demonstrated the strong performance of hidden Markov models applied to information extraction--the task of populating database slots with corresponding phrases...
Dayne Freitag, Andrew McCallum
WWW
2003
ACM
15 years 10 months ago
DOM-based content extraction of HTML documents
Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...
Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...
WEBI
2005
Springer
15 years 3 months ago
A Semi-Supervised Document Clustering Algorithm Based on EM
Document clustering is a very hard task in Automatic Text Processing since it requires to extract regular patterns from a document collection without a priori knowledge on the cat...
Leonardo Rigutini, Marco Maggini
ER
2008
Springer
136views Database» more  ER 2008»
14 years 11 months ago
Automating the Extraction of Rights and Obligations for Regulatory Compliance
Abstract. Government regulations are increasingly affecting the security, privacy and governance of information systems in the United States, Europe and elsewhere. Consequently, co...
Nadzeya Kiyavitskaya, Nicola Zeni, Travis D. Breau...