HTML documents | Sciweavers

15

IPM
2007

149views more IPM 2007»

Web page title extraction and its application

13 years 4 months ago

This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...

Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...

claim paper

Read More »

22

click to vote

AAAI
1997

162views Intelligent Agents» more AAAI 1997»

Template-Based Information Mining from HTML Documents

13 years 5 months ago

Download research.microsoft.com

Tools for mining information from data can create added value for the Internet. As the majority of electronic documents available over the network are in unstructured textual form...

Jane Yung-jen Hsu, Wen-tau Yih

claim paper

Read More »

11

click to vote

NAACL
2004

123views Computational Linguistics» more NAACL 2004»

Acquiring Hyponymy Relations from Web Documents

13 years 5 months ago

Download www.aclweb.org

This paper describes an automatic method for acquiring hyponymy relations from HTML documents on the WWW. Hyponymy relations can play a crucial role in various natural language pr...

Keiji Shinzato, Kentaro Torisawa

claim paper

Read More »

12

click to vote

FLAIRS
2001

131views Artificial Intelligence» more FLAIRS 2001»

Extracting Partial Structures from HTML Documents

13 years 5 months ago

Download qir.kyushu-u.ac.jp

The new wrapper model for extractiong text data from HTML documents is introduced. The Kushmerick's wrapper class (Kusshmerick 2000) may be unsuccessful in the case that suff...

Hiroshi Sakamoto, Yoshitsugu Murakami, Hiroki Arim...

claim paper

Read More »

9

click to vote

IADIS
2004

99views Internet Technology» more IADIS 2004»

Using the concept of user policies for improving HTML documents accessibility

13 years 5 months ago

Download www.iadis.net

In this paper, we introduce the concept of "user policies" and its applications to the browsing of HTML documents. The objective of policies is to specify user preferenc...

Benoît Encelle, Nadine Baptiste-Jessel

claim paper

Read More »

11

click to vote

ACL
2006

141views Computational Linguistics» more ACL 2006»

Automatic Construction of Polarity-Tagged Corpus from HTML Documents

13 years 5 months ago

Download acl.ldc.upenn.edu

This paper proposes a novel method of building polarity-tagged corpus from HTML documents. The characteristics of this method is that it is fully automatic and can be applied to a...

Nobuhiro Kaji, Masaru Kitsuregawa

claim paper

Read More »

14

click to vote

COOPIS
1998
IEEE

118views Information Technology» more COOPIS 1998»

Wrapper Generation for Web Accessible Data Sources

13 years 8 months ago

Download reference.kfupm.edu.sa

There is an increase in the number of data sources that can be queried across the WWW. Such sources typically support HTML forms-based interfaces and search engines query collecti...

Jean-Robert Gruser, Louiqa Raschid, Maria-Esther V...

claim paper

Read More »

11

click to vote

ICTAI
1999
IEEE

101views Artificial Intelligence» more ICTAI 1999»

A New Study on Using HTML Structures to Improve Retrieval

13 years 8 months ago

Download www.cs.binghamton.edu

Locating useful information effectively from the World Wide Web (WWW) is of wide interest. This paper presents new results on a methodology of using the structures and hyperlinks ...

Michal Cutler, H. Deng, S. Maniccam, Weiyi Meng

claim paper

Read More »

14

click to vote

IDEAS
2002
IEEE

125views Database» more IDEAS 2002»

Integrating HTML Tables Using Semantic Hierarchies And Meta-Data Sets

13 years 9 months ago

Download students.cs.byu.edu

As the Internet is a global network, there is a demand on accessing closely related data without browsing through di erent Web documents. A signi cant amount of these data are pre...

Seung Jin Lim, Yiu-Kai Ng, Xiaochun Yang

claim paper

Read More »

11

click to vote

ICDAR
2003
IEEE

143views Document Analysis» more ICDAR 2003»

Automatic Discovery of Semantic Structures in HTML Documents

13 years 9 months ago

Download www.cs.sunysb.edu

Template-driven HTML documents posses an implicit, ﬁxed schema denoting concepts and their relationships in a hierarchical fashion. Discovering this schema remains a relatively ...

Saikat Mukherjee, Guizhen Yang, Wenfang Tan, I. V....

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers