Search Sciweavers | Sciweavers

» Extracting Useful Information from the Full Text of Fiction

EMNLP
2004

114 views, Natural Language Processing

Trained Named Entity Recognition using Distributional Clusters

14 years 11 months ago

Download www.cs.cmu.edu

This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured documents, to the problem of named entity recogniti...

Dayne Freitag

ICASSP
2008
IEEE

159 views, Signal Processing

An iterative unsupervised learning method for information distillation

15 years 4 months ago

Download www.icsi.berkeley.edu

Information distillation techniques are used to analyze and interpret large volumes of speech and text archives in multiple languages and produce structured information of interes...

Kamand Kamangar, Dilek Hakkani-Tür, Gökh...

KDD
1995
ACM

173 views, Data Mining

Knowledge Discovery in Textual Databases (KDT)

15 years 1 months ago

Download www.aaai.org

The information age is characterized by a rapid growth in the amount of information available in electronic media. Traditional data handling methods are not adequate to cope with this...

Ronen Feldman, Ido Dagan

CIKM
2008
Springer

153 views, Information Technology

Creating tag hierarchies for effective navigation in social media

14 years 12 months ago

Download ir.mathcs.emory.edu

In social media, such as blogs, since the content naturally evolves over time, it is hard or in many cases impossible to organize the content for effective navigation. Thus, one c...

K. Selçuk Candan, Luigi Di Caro, Maria Luis...

WWW
2010
ACM

257 views, Internet Technology

CETR: content extraction via tag ratios

15 years 4 months ago

Download www.cs.illinois.edu

We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...

Tim Weninger, William H. Hsu, Jiawei Han
