The paper presents YAWN, a system to convert the well-known and widely used Wikipedia collection into an XML corpus with semantically rich, self-explaining tags. We introduce alg...
Ralf Schenkel, Fabian M. Suchanek, Gjergji Kasneci
This paper presents an approach for applying inductive logic programming to information extraction from HTML documents structured as unranked ordered trees. We consider information...
The structural features of XML components are an extra source of information that should be used in a content-oriented retrieval task on this type of document. This paper explores...
We describe a user interface for wireless information devices, specifically designed to facilitate learning about users' individual interests in daily news stories. User feedbac...
At Iowa State University, the Office of Academic Information Technologies, the English Department, and the Department of Residence have developed a strategic alliance to plan and ...