Search Sciweavers | Sciweavers

95 search results - page 19 / 19

» A cross-collection mixture model for comparative text mining

WWW
2008
ACM

Detecting image spam using visual features and near duplicate detection

Email spam is a much studied topic, but even though current email spam detecting software has been gaining a competitive edge against text based email spam, new advances in spam g...

Bhaskar Mehta, Saurabh Nangia, Manish Gupta 0002, ...

DOCENG
2003
ACM

Methods for the semantic analysis of document markup

We present an approach on how to investigate what kind of semantic information is regularly associated with the structural markup of scientific articles. This approach addresses ...

Petra Saskia Bayerl, Harald Lüngen, Daniela G...

ICDM
2009
IEEE

Semi-Supervised Sequence Labeling with Self-Learned Features

Typical information extraction (IE) systems can be seen as tasks assigning labels to words in a natural language sequence. The performance is restricted by the availability of l...

Yanjun Qi, Pavel Kuksa, Ronan Collobert, Kunihiko ...

CIKM
2008
Springer

Scalable community discovery on textual data with relations

Every piece of textual data is generated as a method to convey its authors' opinion regarding specific topics. Authors deliberately organize their writings and create links, ...

Huajing Li, Zaiqing Nie, Wang-Chien Lee, C. Lee Gi...

GFKL
2007
Springer

Supporting Web-based Address Extraction with Unsupervised Tagging

The manual acquisition and modeling of tourist information as e.g. addresses of points of interest is time and, therefore, cost intensive. Furthermore, the encoded inform...

Berenike Loos, Chris Biemann
