Sciweavers

322 search results - page 19 / 65
» A Software System for Topic Extraction and Document Classifi...
KDD
2004
ACM
16 years 24 days ago
Probabilistic author-topic models for information discovery
We propose a new unsupervised learning technique for extracting information from large text collections. We model documents as if they were generated by a two-stage stochastic pro...
Mark Steyvers, Padhraic Smyth, Michal Rosen-Zvi, T...
RIAO
2000
15 years 1 months ago
Assisting requirements engineering with semantic document analysis
Requirements engineering is the first stage in the software life-cycle and is concerned with discovering and managing a software system's services, constraints and goals. Req...
Paul Rayson, Roger Garside, Peter Sawyer
DGO
2006
15 years 1 months ago
Automated classification of congressional legislation
For social science researchers, content analysis and classification of United States Congressional legislative activities has been time consuming and costly. The Library of Congre...
Stephen Purpura, Dustin Hillard
ECIR
2004
Springer
15 years 1 months ago
Identification of Relevant and Novel Sentences Using Reference Corpus
In the novelty task on sentence level, the amount of information used in similarity computation is the major challenging issue. A shallow NLP approach extracts noun and verb featu...
Hsin-Hsi Chen, Ming-Feng Tsai, Ming-Hung Hsu
ICDAR
2009
IEEE
14 years 10 months ago
Form Field Frame Boundary Removal for Form Processing System in Gurmukhi Script
Machine recognition of hand-filled forms is a challenging task. Form processing involves many activities including form field location, field frame boundary removal and data image...
Dharam Veer Sharma, Gurpreet Singh Lehal