Sciweavers

1042 search results - page 38 / 209
» Logic-based Web Information Extraction
Sort
View
DEBU
2000
95views more  DEBU 2000»
14 years 9 months ago
Accurately and Reliably Extracting Data from the Web: A Machine Learning Approach
A critical problem in developing information agents for the Web is accessing data that is formatted for human use. We have developed a set of tools for extracting data from web si...
Craig A. Knoblock, Kristina Lerman, Steven Minton,...
ITCC
2005
IEEE
15 years 3 months ago
Elimination of Redundant Information for Web Data Mining
These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...
Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang
JCDL
2004
ACM
99views Education» more  JCDL 2004»
15 years 3 months ago
Toward information retrieval web services for digital libraries
Information retrieval (IR) functions serve a critical role in many digital library systems. There are numerous mature IR algorithms that have been implemented and it will be a was...
Yueyu Fu, Javed Mostafa
WEBI
2005
Springer
15 years 3 months ago
Automated Metadata and Instance Extraction from News Web Sites
In this paper, we present automated techniques for extracting metadata instance information by organizing and mining a set of news Web sites. We develop algorithms that detect and...
Srinivas Vadrevu, Saravanakumar Nagarajan, Fatih G...
AAAI
2008
15 years 8 days ago
Turning Web Text and Search Queries into Factual Knowledge: Hierarchical Class Attribute Extraction
A seed-based framework for textual information extraction allows for weakly supervised acquisition of open-domain class attributes over conceptual hierarchies, from a combination ...
Marius Pasca