Sciweavers

1319 search results - page 72 / 264
» Using the Structure of HTML Documents to Improve Retrieval
Sort
View
GIR
2008
ACM
15 years 4 months ago
Geographic features in web search retrieval
We conduct large-scale search engine relevance experiments, using the 12% of queries that contain placenames, matching the placenames to places in the documents, and examining the...
Rosie Jones, Ahmed Hassan, Fernando Diaz
COLING
2002
15 years 3 months ago
An Annotation System for Enhancing Quality of Natural Language Processing
Natural languageprocessingNLP programsare confronted with various di culties in processing HTML and XML documents, and have the potential to produce better results if linguistic i...
Hideo Watanabe, Katashi Nagao, Michael C. McCord, ...
ECIR
2009
Springer
16 years 17 days ago
Integrating Proximity to Subjective Sentences for Blog Opinion Retrieval
Opinion finding is a challenging retrieval task, where it has been shown that it is especially difficult to improve over a strongly performing topic-relevance baseline. In this pa...
Rodrygo L. T. Santos, Ben He, Craig Macdonald, Iad...
CIKM
2008
Springer
15 years 5 months ago
Identifying table boundaries in digital documents via sparse line detection
Most prior work on information extraction has focused on extracting information from text in digital documents. However, often, the most important information being reported in an...
Ying Liu, Prasenjit Mitra, C. Lee Giles
165
Voted
NIPS
2007
15 years 4 months ago
Mining Internet-Scale Software Repositories
Large repositories of source code create new challenges and opportunities for statistical machine learning. Here we first develop Sourcerer, an infrastructure for the automated c...
Erik Linstead, Paul Rigor, Sushil Krishna Bajracha...