Sciweavers

2677 search results - page 248 / 536
» Extracting Structured Data from Web Pages
Sort
View
CIKM
2009
Springer
15 years 10 months ago
Exploiting bidirectional links: making spamming detection easier
Previous anti-spamming algorithms based on link structure suffer from either the weakness of the page value metric or the vagueness of the seed selection. In this paper, we propos...
Yan Zhang, Qiancheng Jiang, Lei Zhang, Yizhen Zhu
ACL
2009
15 years 1 months ago
Employing Topic Models for Pattern-based Semantic Class Discovery
A semantic class is a collection of items (words or phrases) which have semantically peer or sibling relationship. This paper studies the employment of topic models to automatical...
Huibin Zhang, Mingjie Zhu, Shuming Shi, Ji-Rong We...
115
Voted
APWEB
2005
Springer
15 years 9 months ago
A Pattern Restore Method for Restoring Missing Patterns in Server Side Clickstream Data
Abstract. When analyzing patterns in server side data, it becomes quickly apparent that some of the data originating from the client is lost, mainly due to the caching of web pages...
I-Hsien Ting, Chris Kimble, Daniel Kudenko
VLDB
2004
ACM
126views Database» more  VLDB 2004»
15 years 8 months ago
Instance-based Schema Matching for Web Databases by Domain-specific Query Probing
In a Web database that dynamically provides information in response to user queries, two distinct schemas, interface schema (the schema users can query) and result schema (the sch...
Jiying Wang, Ji-Rong Wen, Frederick H. Lochovsky, ...
99
Voted
ACL
2008
15 years 4 months ago
A Novel Feature-based Approach to Chinese Entity Relation Extraction
Relation extraction is the task of finding semantic relations between two entities from text. In this paper, we propose a novel feature-based Chinese relation extraction approach ...
Wenjie Li, Peng Zhang, Furu Wei, Yuexian Hou, Qin ...