Sciweavers

308 search results - page 2 / 62
» Syntactic Similarity of Web Documents
Sort
View
WWW
2005
ACM
14 years 6 months ago
An adaptive, fast, and safe XML parser based on byte sequences memorization
XML (Extensible Markup Language) processing can incur significant runtime overhead in XML-based infrastructural middleware such as Web service application servers. This paper prop...
Toshiro Takase, Hisashi Miyashita, Toyotaro Suzumu...
KDD
2009
ACM
239views Data Mining» more  KDD 2009»
14 years 6 months ago
Applying syntactic similarity algorithms for enterprise information management
: ? Applying Syntactic Similarity Algorithms for Enterprise Information Management Ludmila Cherkasova, Kave Eshghi, Charles B. Morrey III, Joseph Tucek, Alistair Veitch HP Laborato...
Ludmila Cherkasova, Kave Eshghi, Charles B. Morrey...
FLAIRS
2001
13 years 6 months ago
Syntactic Folding and its Application to the Information Extraction from Web Pages
Thepaper deals with investigations concerning potential structures of documentsthat will be subject to automated information extraction. The focus is on folding principles and the...
Jörg Herrmann
NDSS
2009
IEEE
14 years 3 days ago
Document Structure Integrity: A Robust Basis for Cross-site Scripting Defense
Cross-site scripting (or XSS) has been the most dominant class of web vulnerabilities in 2007. The main underlying reason for XSS vulnerabilities is that web markup and client-sid...
Yacin Nadji, Prateek Saxena, Dawn Song
COLING
2010
13 years 11 days ago
Enhancing Cross Document Coreference of Web Documents with Context Similarity and Very Large Scale Text Categorization
Cross Document Coreference (CDC) is the task of constructing the coreference chain for mentions of a person across a set of documents. This work offers a holistic view of using do...
Jian Huang 0002, Pucktada Treeratpituk, Sarah M. T...