Sciweavers

4354 search results - page 538 / 871
» Matching Objects with Patterns
Sort
View
ICDE
2003
IEEE
149views Database» more  ICDE 2003»
16 years 6 months ago
Indexing Weighted-Sequences in Large Databases
We present an index structure for managing weightedsequences in large databases. A weighted-sequence is defined as a two-dimensional structure where each element in the sequence i...
Haixun Wang, Chang-Shing Perng, Wei Fan, Sanghyun ...
WWW
2006
ACM
16 years 5 months ago
GoGetIt!: a tool for generating structure-driven web crawlers
We present GoGetIt!, a tool for generating structure-driven crawlers that requires a minimum effort from the users. The tool takes as input a sample page and an entry point to a W...
Altigran Soares da Silva, Edleno Silva de Moura, J...
WWW
2005
ACM
16 years 5 months ago
The volume and evolution of web page templates
Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...
David Gibson, Kunal Punera, Andrew Tomkins
KDD
2003
ACM
142views Data Mining» more  KDD 2003»
16 years 5 months ago
Extracting information from text and images for location proteomics
There is extensive interest in automating the collection, organization and summarization of biological data. Data in the form of figures and accompanying captions in literature pr...
Zhenzhen Kou, William W. Cohen, Robert F. Murphy
KDD
2002
ACM
126views Data Mining» more  KDD 2002»
16 years 5 months ago
Integrating feature and instance selection for text classification
Instance selection and feature selection are two orthogonal methods for reducing the amount and complexity of data. Feature selection aims at the reduction of redundant features i...
Dimitris Fragoudis, Dimitris Meretakis, Spiros Lik...