Sciweavers

441 search results - page 9 / 89
» Using structured text for large-scale attribute extraction
Sort
View
JAIR
2008
173views more  JAIR 2008»
14 years 9 months ago
Creating Relational Data from Unstructured and Ungrammatical Data Sources
In order for agents to act on behalf of users, they will have to retrieve and integrate vast amounts of textual data on the World Wide Web. However, much of the useful data on the...
Matthew Michelson, Craig A. Knoblock
84
Voted
ACL
2010
14 years 7 months ago
Extraction and Approximation of Numerical Attributes from the Web
We present a novel framework for automated extraction and approximation of numerical object attributes such as height and weight from the Web. Given an object-attribute pair, we d...
Dmitry Davidov, Ari Rappoport
SIGMOD
2009
ACM
137views Database» more  SIGMOD 2009»
15 years 9 months ago
Enabling enterprise mashups over unstructured text feeds with InfoSphere MashupHub and SystemT
Enterprise mashup scenarios often involve feeds derived from data created primarily for eye consumption, such as email, news, calendars, blogs, and web feeds. These data sources c...
David E. Simmen, Frederick Reiss, Yunyao Li, Sures...
119
Voted
WWW
2010
ACM
14 years 9 months ago
Exploiting content redundancy for web information extraction
We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...
Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...
ICNC
2005
Springer
15 years 3 months ago
An Improved Method of Feature Selection Based on Concept Attributes in Text Classification
The feature selection and weighting are two important parts of automatic text classification. In this paper we give a new method based on concept attributes. We use the DEF Terms o...
Shasha Liao, Minghu Jiang