Sciweavers

180 search results - page 23 / 36
» Iterated Document Content Classification
Sort
View
ICAIL
2007
ACM
15 years 3 months ago
The Legal-RDF Ontology. A Generic Model for Legal Documents
Legal-RDF.org1 publishes a practical ontology that models both the layout and content of a document and metadata about the document; these have been built using data models implici...
John McClure
CHI
2005
ACM
15 years 11 months ago
The role of the author in topical blogs
Web logs, or blogs, challenge the notion of authorship. Seemingly, rather than a model in which the author's writings are themselves a contribution, the blog author weaves a ...
Scott Carter
VLDB
2002
ACM
161views Database» more  VLDB 2002»
14 years 11 months ago
Distributed Search over the Hidden Web: Hierarchical Database Sampling and Selection
Many valuable text databases on the web have non-crawlable contents that are "hidden" behind search interfaces. Metasearchers are helpful tools for searching over many s...
Panagiotis G. Ipeirotis, Luis Gravano
KDD
2006
ACM
179views Data Mining» more  KDD 2006»
15 years 11 months ago
Extracting key-substring-group features for text classification
In many text classification applications, it is appealing to take every document as a string of characters rather than a bag of words. Previous research studies in this area mostl...
Dell Zhang, Wee Sun Lee
DGO
2006
136views Education» more  DGO 2006»
15 years 22 days ago
Automated classification of congressional legislation
For social science researchers, content analysis and classification of United States Congressional legislative activities has been time consuming and costly. The Library of Congre...
Stephen Purpura, Dustin Hillard