Sciweavers

182 search results - page 5 / 37
» Next Generation Web Search: Setting Our Sites
CIKM
2003
Springer
15 years 2 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
IADIS
2003
14 years 11 months ago
Evaluation Resources Gateways Websites
This paper describes/represents an approach to the design of an information retrieval of providing a search of users. The next generation of information systems will rely on coll...
Omar Larouk, Salah Dalhoumi
AAAI
2008
14 years 12 months ago
An Unsupervised Approach for Product Record Normalization across Different Web Sites
An unsupervised probabilistic learning framework for normalizing product records across different retailer Web sites is presented. Our framework decomposes the problem into two ta...
Tak-Lam Wong, Tik-Shun Wong, Wai Lam
IMC
2006
ACM
15 years 3 months ago
Generating a privacy footprint on the internet
As a follow up to characterizing traffic deemed as unwanted by Web clients such as advertisements, we examine how information related to individual users is aggregated as a result...
Balachander Krishnamurthy, Craig E. Wills
ISCIS
2009
Springer
15 years 2 months ago
PopulusLog: People information database
Information about individuals on publicly available web sites stands as a valuable, yet unorganized, data source. Turning such an enormous data source into a “database” is h...
Ali Cakmak, Mustafa Kirac, Gultekin Özsoyoglu