Extracting and Modeling the Semantic Information Content of Web Documents to Support Semantic Document Retrieval

13 years 5 months ago

Download crpit.com

Existing HTML mark-up is used only to indicate the structure and lay-out of documents, but not the document semantics. As a result web documents are difficult to be semantically processed, retrieved and explored by computer applications. Existing information extraction system mainly concerns with extracting important keywords or key phrases that represent the content of the documents. The semantic aspects of such keywords have not been explored extensively. In this paper we propose an approach meant to assist in extracting and modeling the semantic information content of web documents using natural language analysis technique and a domain specific ontology. Together with the user's participation, the tool gradually extracts and constructs the semantic document model which is represented as XML. The semantic models representing each document are then being integrated to form a global semantic model. Such a model provides users with a global knowledge model of some domains.

Shahrul Azman Noah, Lailatulqadri Zakaria, Arifah

Real-time Traffic

APCCM 2007 | APCCM 2009 | Semantic | Semantic Model | Web Documents |

claim paper

» Indexing Documents by Discourse and Semantic Contents from Automatic Annotations of Texts

» A Natural Language Interface for Information Retrieval on Semantic Web Documents

» Semantic Web Search Model for Information Retrieval of the Semantic Data

» Thresher automating the unwrapping of semantic content from the World Wide Web

» Query and content suggestion based on latent interest and topic class

» Annotating wikipedia articles with semantic tags for structured retrieval

» Discovering informative content blocks from Web documents

» Towards a Media Interpretation Framework for the Semantic Web

Post Info
More Details (n/a)

Added	08 Nov 2010
Updated	08 Nov 2010
Type	Conference
Year	2009
Where	APCCM
Authors	Shahrul Azman Noah, Lailatulqadri Zakaria, Arifah Che Alhadi

Comments (0)

Sciweavers

Extracting and Modeling the Semantic Information Content of Web Documents to Support Semantic Document Retrieval

APCCM 2007 | APCCM 2009 | Semantic | Semantic Model | Web Documents |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers