Search Sciweavers | Sciweavers

1261 search results - page 242 / 253

» Extracting Text from PostScript

169

click to vote

WSDM
2010
ACM

204views Data Mining» more WSDM 2010»

Learning URL patterns for webpage de-duplication

16 years 1 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely aﬀects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

claim paper

Read More »

164

Voted

ICSM
2005
IEEE

112views Software Engineering» more ICSM 2005»

Co-Change Visualization

15 years 11 months ago

Download mtc.epfl.ch

Clustering layouts of software systems combine two important aspects: they reveal groups of related artifacts of the software system, and they produce a visualization of the resul...

Dirk Beyer

claim paper

Read More »

178

click to vote

AGENTS
1997
Springer

110views Security Privacy» more AGENTS 1997»

A Scalable Comparison-Shopping Agent for the World-Wide Web

15 years 10 months ago

Download www.cs.washington.edu

The World-Wide-Web is less agent-friendly than we might hope. Most information on the Web is presented in loosely structured natural language text with no agent-readable semantics...

Robert B. Doorenbos, Oren Etzioni, Daniel S. Weld

claim paper

Read More »

205

Voted

DMIN
2006

146views Data Mining» more DMIN 2006»

A Comparison of Two Document Clustering Approaches for Clustering Medical Documents

15 years 7 months ago

Download ww1.ucmss.com

Medical data is often presented as free text in the form of medical reports. Such documents contain important information about patients, disease progression and management, but ar...

Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Be...

claim paper

Read More »

172

Voted

SEMWEB
2010
Springer

189views Internet Technology» more SEMWEB 2010»

Supporting Natural Language Processing with Background Knowledge: Coreference Resolution Case

15 years 4 months ago

Download iswc2010.semanticweb.org

Systems based on statistical and machine learning methods have been shown to be extremely effective and scalable for the analysis of large amount of textual data. However, in the r...

Volha Bryl, Claudio Giuliano, Luciano Serafini, Ka...

claim paper

Read More »

« Prev « First page 242 / 253 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers