Sciweavers

385 search results - page 21 / 77
» A language for manipulating clustered web documents results
DEXAW
2008
IEEE
123 views, Database
15 years 4 months ago
Text Extraction from the Web via Text-to-Tag Ratio
We describe a method to extract content text from diverse Web pages by using the HTML document’s Text-to-Tag Ratio rather than specific HTML cues that may not be constant acr...
Tim Weninger, William H. Hsu
ICASSP
2009
IEEE
15 years 4 months ago
Incorporating monolingual corpora into bilingual latent semantic analysis for crosslingual LM adaptation
The major limitation in bilingual latent semantic analysis (bLSA) is the requirement of parallel training corpora. Motivated by semi-supervised learning, we propose a cluster-based...
Yik-Cheung Tam, Tanja Schultz
ICDM
2007
IEEE
109 views, Data Mining
15 years 4 months ago
Language-Independent Set Expansion of Named Entities Using the Web
Set expansion refers to expanding a given partial set of objects into a more complete set. A well-known example system that does set expansion using the web is Google Sets. In thi...
Richard C. Wang, William W. Cohen
FLAIRS
2009
14 years 7 months ago
Organizing Knowledge as an Ontology of the Domain of Resilient Computing by Means of Natural Language Processing - An Experience
Scientists typically need to take a large volume of information into account in order to deal with re-occurring tasks such as inspecting proceedings, finding related work, or revi...
Algirdas Avizienis, Gintare Grigonyte, Johann Hall...
ASE
2002
99 views
14 years 9 months ago
XMILE: An XML Based Approach for Incremental Code Mobility and Update
The eXtensible Markup Language (XML) was originally defined to represent Web content, but it is increasingly used to define languages, such as XPL, that are used for coding execut...
Cecilia Mascolo, Luca Zanolin, Wolfgang Emmerich