Search Sciweavers | Sciweavers

2190 search results - page 387 / 438

» Unweaving a web of documents

ICAIL
2007
ACM

147 views, Artificial Intelligence

Essential deduplication functions for transactional databases in law firms

15 years 4 months ago

Download www.conradweb.org

As massive document repositories and knowledge management systems continue to expand, in proprietary environments as well as on the Web, the need for duplicate detection becomes i...

Jack G. Conrad, Edward L. Raymond

DASFAA
2004
IEEE

135 views, Database

Semi-supervised Text Classification Using Partitioned EM

15 years 4 months ago

Download www.cs.uic.edu

Text classification using a small labeled set and a large unlabeled data is seen as a promising technique to reduce the labor-intensive and time consuming effort of labeling traini...

Gao Cong, Wee Sun Lee, Haoran Wu, Bing Liu

WWW
2006
ACM

129 views, Internet Technology

FeedEx: collaborative exchange of news feeds

16 years 1 month ago

Download www2006.org

As most blogs and traditional media support RSS or Atom feeds, the news feed technology becomes increasingly prevalent. Taking advantage of ubiquitous news feeds, we design FeedEx...

Seung Jun, Mustaque Ahamad

WWW
2008
ACM

168 views, Internet Technology

Performance of compressed inverted list caching in search engines

16 years 1 month ago

Download www2008.org

Due to the rapid growth in the size of the web, web search engines are facing enormous performance challenges. The larger engines in particular have to be able to process tens of ...

Jiangong Zhang, Xiaohui Long, Torsten Suel

WWW
2005
ACM

150 views, Internet Technology

Extracting context to improve accuracy for HTML content extraction

16 years 1 month ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo
