Sciweavers

1243 search results - page 186 / 249
» Search Engines: Information Retrieval in Practice
Sort
View
WWW
2006
ACM
15 years 10 months ago
Optimizing scoring functions and indexes for proximity search in type-annotated corpora
We introduce a new, powerful class of text proximity queries: find an instance of a given "answer type" (person, place, distance) near "selector" tokens matchi...
Soumen Chakrabarti, Kriti Puniyani, Sujatha Das
SOUPS
2006
ACM
15 years 3 months ago
Power strips, prophylactics, and privacy, oh my!
While Internet users claim to be concerned about online privacy, their behavior rarely reflects those concerns. In this paper we investigate whether the availability of compariso...
Julia Gideon, Lorrie Faith Cranor, Serge Egelman, ...
WWW
2007
ACM
15 years 10 months ago
Detecting near-duplicates for web crawling
Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...
Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma
DEXA
2001
Springer
91views Database» more  DEXA 2001»
15 years 2 months ago
Towards the Development of Heuristics for Automatic Query Expansion
Abstract. In this paper we study the performance of linguisticallymotivated conflation techniques for Information Retrieval in Spanish. In particular, we have studied the applicat...
Jesús Vilares, Manuel Vilares Ferro, Miguel...
ACL
2008
14 years 11 months ago
Decompounding query keywords from compounding languages
Splitting compound words has proved to be useful in areas such as Machine Translation, Speech Recognition or Information Retrieval (IR). Furthermore, real-time IR systems (such as...
Enrique Alfonseca, Slaven Bilac, Stefan Pharies