Sciweavers

311 search results - page 56 / 63
» Cleaning Web Pages for Effective Web Content Mining
Sort
View
JCDL
2004
ACM
114views Education» more  JCDL 2004»
15 years 2 months ago
Translating unknown cross-lingual queries in digital libraries using a web-based approach
Users’ cross-lingual queries to a digital library system might be short and not included in a common translation dictionary (unknown terms). In this paper, we investigate the fe...
Jenq-Haur Wang, Jei-Wen Teng, Pu-Jen Cheng, Wen-Hs...
COMPLEX
2009
Springer
15 years 4 months ago
Exploring and Understanding Scientific Metrics in Citation Networks
This paper explores scientific metrics in citation networks in scientific communities, how they differ in ranking papers and authors, and why. In particular we focus on network eff...
Mikalai Krapivin, Maurizio Marchese, Fabio Casati
KDD
2009
ACM
228views Data Mining» more  KDD 2009»
15 years 10 months ago
A generalized Co-HITS algorithm and its application to bipartite graphs
Recently many data types arising from data mining and Web search applications can be modeled as bipartite graphs. Examples include queries and URLs in query logs, and authors and ...
Hongbo Deng, Michael R. Lyu, Irwin King
CIKM
2005
Springer
15 years 3 months ago
Maximal termsets as a query structuring mechanism
Search engines process queries conjunctively to restrict the size of the answer set. Further, it is not rare to observe a mismatch between the vocabulary used in the text of Web p...
Bruno Pôssas, Nivio Ziviani, Berthier A. Rib...
ACL
2009
14 years 7 months ago
A Non-negative Matrix Tri-factorization Approach to Sentiment Classification with Lexical Prior Knowledge
Sentiment classification refers to the task of automatically identifying whether a given piece of text expresses positive or negative opinion towards a subject at hand. The prolif...
Tao Li, Yi Zhang 0005, Vikas Sindhwani