Sciweavers

171 search results - page 32 / 35
» A parallel learning algorithm for text classification
EMNLP
2009
14 years 7 months ago
The role of named entities in Web People Search
The ambiguity of person names in the Web has become a new area of interest for NLP researchers. This challenging problem has been formulated as the task of clustering Web search r...
Javier Artiles, Enrique Amigó, Julio Gonzal...
WSDM
2010
ACM
215 views, Data Mining
15 years 6 months ago
GeoFolk: Latent spatial semantics in Web 2.0 social media
We describe an approach for multi-modal characterization of social media by combining text features (e.g. tags as a prominent example of short, unstructured text labels) with spat...
Sergej Sizov
KDD
2002
ACM
186 views, Data Mining
15 years 10 months ago
Topic-conditioned novelty detection
Automated detection of the first document reporting each new event in temporally-sequenced streams of documents is an open challenge. In this paper we propose a new approach which...
Yiming Yang, Jian Zhang, Jaime G. Carbonell, Chun ...
SDM
2012
SIAM
216 views, Data Mining
12 years 12 months ago
Feature Selection "Tomography" - Illustrating that Optimal Feature Filtering is Hopelessly Ungeneralizable
George Forman, HP Laboratories, HPL-2010-19R1. Feature selection; ...
George Forman
CIKM
2004
Springer
15 years 1 month ago
InfoAnalyzer: a computer-aided tool for building enterprise taxonomies
In this paper we study the problem of collecting training samples for building enterprise taxonomies. We develop a computer-aided tool named InfoAnalyzer, which can effectively as...
Li Zhang, Shixia Liu, Yue Pan, Liping Yang