Sciweavers

91 search results - page 6 / 19
» Digitizing a Million Books: Challenges for Document Analysis
IJDAR
2008
14 years 9 months ago
Matching word images for content-based retrieval from printed document images
As a large quantity of document images is being archived by digital libraries, there is a need for efficient search strategies to make them available as per users' informatio...
Million Meshesha, C. V. Jawahar
CIKM
2008
Springer
14 years 11 months ago
Identifying table boundaries in digital documents via sparse line detection
Most prior work on information extraction has focused on extracting information from text in digital documents. However, often, the most important information being reported in an...
Ying Liu, Prasenjit Mitra, C. Lee Giles
POLICY
2004
Springer
15 years 2 months ago
Towards a Multi-dimensional Characterization of Dissemination Control
Dissemination control (DCON) is emerging as one of the most important and challenging goals for information security. DCON is concerned with controlling information and digital ob...
Roshan K. Thomas, Ravi S. Sandhu
KDD
2008
ACM
15 years 10 months ago
Entity categorization over large document collections
Extracting entities (such as people, movies) from documents and identifying the categories (such as painter, writer) they belong to enable structured querying and data analysis ov...
Arnd Christian König, Rares Vernica, Venkates...
DIAL
2004
IEEE
15 years 1 months ago
Retrieving Imaged Documents in Digital Libraries Based on Word Image Coding
A great number of documents are scanned and archived in the form of digital images in digital libraries, to make them available and accessible on the Internet. Information retriev...
Yue Lu, Li Zhang, Chew Lim Tan