Sciweavers

912 search results - page 30 / 183
» Web page summarization using dynamic content
Sort
View
CIKM
2008
Springer
15 years 2 months ago
Coreex: content extraction from online news articles
We developed and tested a heuristic technique for extracting the main article from news site Web pages. We construct the DOM tree of the page and score every node based on the amo...
Jyotika Prasad, Andreas Paepcke
106
Voted
IAT
2007
IEEE
15 years 6 months ago
An Intelligent Web Agent to Mine Bilingual Parallel Pages via Automatic Discovery of URL Pairing Patterns
This paper describes an intelligent agent to facilitate bitext mining from the Web via automatic discovery of URL pairing patterns (or keys) for retrieving parallel web pages. The...
Chunyu Kit, Jessica Yee Ha Ng
102
Voted
LREC
2008
169views Education» more  LREC 2008»
15 years 1 months ago
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
In recent years, language resources acquired from the Web are released, and these data improve the performance of applications in several NLP tasks. Although the language resource...
Keiji Shinzato, Daisuke Kawahara, Chikara Hashimot...
103
Voted
VL
2009
IEEE
152views Visual Languages» more  VL 2009»
15 years 7 months ago
FireCrystal: Understanding interactive behaviors in dynamic web pages
For developers debugging their own code, augmenting the code of others, or trying to learn the implementation details of interactive behaviors, understanding how web pages work is...
Stephen Oney, Brad A. Myers
ICADL
2005
Springer
112views Education» more  ICADL 2005»
15 years 6 months ago
A Method for Creating a High Quality Collection of Researchers' Homepages from the Web
This paper proposes a method for creating a high quality collection of researchers’ homepages. The proposed method consists of three phases: rough filtering of the possible web p...
Yuxin Wang, Keizo Oyama