Sciweavers

602 search results - page 44 / 121
» Integrating Data and Probabilistically Structured Text Docum...
Sort
View
AIIA
2007
Springer
15 years 6 months ago
Harvesting Relational and Structured Knowledge for Ontology Building in the WPro Architecture
We present two algorithms for supporting semi-automatic ontology building, integrated in WPro, a new architecture for ontology learning from Web documents. The first algorithm auto...
Daniele Bagni, Marco Cappella, Maria Teresa Pazien...
KDD
2007
ACM
136views Data Mining» more  KDD 2007»
16 years 8 days ago
Information genealogy: uncovering the flow of ideas in non-hyperlinked document databases
We now have incrementally-grown databases of text documents ranging back for over a decade in areas ranging from personal email, to news-articles and conference proceedings. While...
Benyah Shaparenko, Thorsten Joachims
ICML
2010
IEEE
15 years 28 days ago
Proximal Methods for Sparse Hierarchical Dictionary Learning
We propose to combine two approaches for modeling data admitting sparse representations: on the one hand, dictionary learning has proven effective for various signal processing ta...
Rodolphe Jenatton, Julien Mairal, Guillaume Obozin...
DOCENG
2005
ACM
15 years 1 months ago
Enhancing composite digital documents using XML-based standoff markup
Document representations can rapidly become unwieldy if they try to encapsulate all possible document properties, ranging tract structure to detailed rendering and layout. We pres...
Peter L. Thomas, David F. Brailsford
IPM
2008
141views more  IPM 2008»
14 years 12 months ago
Towards a unified approach to document similarity search using manifold-ranking of blocks
Document similarity search (i.e. query by example) aims to retrieve a ranked list of documents similar to a query document in a text corpus or on the Web. Most existing approaches...
Xiaojun Wan, Jianwu Yang, Jianguo Xiao