Sciweavers

253 search results - page 41 / 51
» Learning-Based Approaches for Matching Web Data Entities
WIRI
2005
IEEE
15 years 3 months ago
A Fast Linkage Detection Scheme for Multi-Source Information Integration
Record linkage refers to techniques for identifying records associated with the same real-world entities. Record linkage is not only crucial in integrating multi-source databases ...
Akiko N. Aizawa, Keizo Oyama
BMCBI
2010
14 years 9 months ago
Integration of open access literature into the RCSB Protein Data Bank using BioLit
Background: Biological data have traditionally been stored and made publicly available through a variety of on-line databases, whereas biological knowledge has traditionally been ...
Andreas Prlic, Marco A. Martinez, Dimitris Dimitro...
CIKM
2009
Springer
14 years 10 months ago
Improving search engines using human computation games
Work on evaluating and improving the relevance of web search engines typically uses human relevance judgments or clickthrough data. Both these methods look at the problem of learni...
Hao Ma, Raman Chandrasekar, Chris Quirk, Abhishek ...
ESCIENCE
2007
IEEE
15 years 1 month ago
A Scalable and Efficient Prefix-Based Lookup Mechanism for Large-Scale Grids
Data sources, storage, computing resources and services are entities on Grids that require mechanisms for publication and lookup. A discovery service relies on efficient lookup to...
Philip Chan, David Abramson
WWW
2008
ACM
15 years 10 months ago
Recrawl scheduling based on information longevity
It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...
Christopher Olston, Sandeep Pandey