Sciweavers

781 search results - page 73 / 157
» Extracting Useful Information from the Full Text of Fiction
EMNLP
2004
14 years 11 months ago
Trained Named Entity Recognition using Distributional Clusters
This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured documents, to the problem of named entity recogniti...
Dayne Freitag
ICASSP
2008
IEEE
15 years 4 months ago
An iterative unsupervised learning method for information distillation
Information distillation techniques are used to analyze and interpret large volumes of speech and text archives in multiple languages and produce structured information of interes...
Kamand Kamangar, Dilek Hakkani-Tür, Gökh...
KDD
1995
ACM
15 years 1 months ago
Knowledge Discovery in Textual Databases (KDT)
The information age is characterized by a rapid growth in the amount of information available in electronic media. Traditional data handling methods are not adequate to cope with this...
Ronen Feldman, Ido Dagan
CIKM
2008
Springer
14 years 12 months ago
Creating tag hierarchies for effective navigation in social media
In social media, such as blogs, since the content naturally evolves over time, it is hard or in many cases impossible to organize the content for effective navigation. Thus, one c...
K. Selçuk Candan, Luigi Di Caro, Maria Luis...
WWW
2010
ACM
15 years 4 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han