Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

19

CIKM
2009
Springer

favoriteEmaildiscussreport

120views Information Technology» more CIKM 2009»

Identifying interesting assertions from the web

14 years 4 months ago

Identifying interesting assertions from the web

Download turing.cs.washington.edu

How can we cull the facts we need from the overwhelming mass of information and misinformation that is the Web? The TextRunner extraction engine represents one approach, in which people pose keyword queries or simple questions and TextRunner returns concise answers based on tuples extracted from Web text. Unfortunately, the results returned by engines such as TextRunner include both informative facts (e.g., “the FDA banned ephedra”) and less useful statements (e.g., “the FDA banned products”). This paper therefore investigates filtering TextRunner results to enable people to better focus on interesting assertions. We first develop three distinct models of what assertions are likely to be interesting in response to a query. We then fully operationalize each of these models as a filter over TextRunner results. Finally, we develop a more sophisticated filter that combines the different models using relevance feedback. In a study of human ratings of the interestingness of TextRunn...

Thomas Lin, Oren Etzioni, James Fogarty

Real-time Traffic

CIKM 2009 | Database | FDA Banned Ephedra | People Pose Keyword | TextRunner Extraction Engine |

claim paper

Related Content

» GeneChaser Identifying all biological and clinical conditions in which genes of interest a...

» Introducing the Webb Spam Corpus Using Email Spam to Identify Web Spam Automatically

» Unsupervised Resolution of Objects and Relations on the Web

» Identifying InterDomain Similarities Through ContentBased Analysis of Hierarchical WebDire...

» Revealing Hidden Community Structures and Identifying Bridges in Complex Networks An Appli...

» Relevance criteria identified by health information users during Web searches

» Tagbased social interest discovery

» GeneWebEx Gene Annotation Web Extraction Aggregation and Updating from WebBased Biomolecul...

» From xrays to silly putty via Uranus serendipity and its role in web search

Post Info
More Details (n/a)

Added	26 May 2010
Updated	26 May 2010
Type	Conference
Year	2009
Where	CIKM
Authors	Thomas Lin, Oren Etzioni, James Fogarty

Comments (0)