Sciweavers

219 search results - page 29 / 44
» Web page language identification based on URLs
Sort
View
102
Voted
DIS
2007
Springer
15 years 6 months ago
Unsupervised Spam Detection Based on String Alienness Measures
We propose an unsupervised method for detecting spam documents from Web page data, based on equivalence relations on strings. We propose 3 measures for quantifying the alienness (...
Kazuyuki Narisawa, Hideo Bannai, Kohei Hatano, Mas...
JOT
2008
136views more  JOT 2008»
15 years 10 days ago
The Stock Statistics Parser
This paper describes how use the HTMLEditorKit to perform web data mining on stock statistics for listed firms. Our focus is on making use of the web to get information about comp...
Douglas Lyon
VEE
2009
ACM
246views Virtualization» more  VEE 2009»
15 years 7 months ago
Tracing for web 3.0: trace compilation for the next generation web applications
Today’s web applications are pushing the limits of modern web browsers. The emergence of the browser as the platform of choice for rich client-side applications has shifted the ...
Mason Chang, Edwin W. Smith, Rick Reitmaier, Micha...
101
Voted
SEMWEB
2010
Springer
14 years 10 months ago
I18n of Semantic Web Applications
Recently, the use of semantic technologies has gained quite some traction. With increased use of these technologies, their maturation not only in terms of performance, robustness b...
Sören Auer, Matthias Weidl, Jens Lehmann, Amr...
SOCIALCOM
2010
14 years 10 months ago
Using Text Analysis to Understand the Structure and Dynamics of the World Wide Web as a Multi-Relational Graph
A representation of the World Wide Web as a directed graph, with vertices representing web pages and edges representing hypertext links, underpins the algorithms used by web search...
Harish Sethu, Alexander Yates