Sciweavers

KDD
2005
ACM
Making holistic schema matching robust: an ensemble approach
The Web has been rapidly "deepened" by myriad searchable databases online, where data are hidden behind query interfaces. As an essential task toward integrating these m...
Bin He, Kevin Chen-Chuan Chang
VLDB
2007
ACM
EntityRank: Searching Entities Directly and Holistically
As the Web has evolved into a data-rich repository, with the standard “page view,” current search engines are becoming increasingly inadequate for a wide range of query tasks....
Tao Cheng, Xifeng Yan, Kevin Chen-Chuan Chang
PVLDB
2010
Global Detection of Complex Copying Relationships Between Sources
Web technologies have enabled data sharing between sources but also simplified copying (and often publishing without proper attribution). The copying relationships can be complex...
Xin Dong, Laure Berti-Equille, Yifan Hu, Divesh Sr...