Sciweavers

1211 search results - page 196 / 243
» Topics in 0--1 data
Sort
View
LREC
2010
216views Education» more  LREC 2010»
15 years 1 months ago
BlogBuster: A Tool for Extracting Corpora from the Blogosphere
This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, ...
Georgios Petasis, Dimitrios Petasis
LREC
2010
189views Education» more  LREC 2010»
15 years 1 months ago
CASIA-CASSIL: a Chinese Telephone Conversation Corpus in Real Scenarios with Multi-leveled Annotation
CASIA-CASSIL is a large-scale corpus base of Chinese human-human naturally-occurring telephone conversations in restricted domains. The first edition consists of 792 90-second con...
Keyan Zhou, Aijun Li, Zhigang Yin, Chengqing Zong
LREC
2010
203views Education» more  LREC 2010»
15 years 1 months ago
MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse
In this paper, we describe our experience with collecting and creating an annotated corpus of multi-party online conversations in a chat-room environment. This effort is part of a...
Samira Shaikh, Tomek Strzalkowski, Aaron Broadwell...
LREC
2010
162views Education» more  LREC 2010»
15 years 1 months ago
The Sign Linguistics Corpora Network: Towards Standards for Signed Language Resources
The Sign Linguistics Corpora Network is a three-year network initiative that aims to collect existing knowledge and practices on the creation and use of signed language resources....
Onno Crasborn
LREC
2010
346views Education» more  LREC 2010»
15 years 1 months ago
Twitter as a Corpus for Sentiment Analysis and Opinion Mining
Microblogging today has become a very popular communication tool among Internet users. Millions of users share opinions on different aspects of life everyday. Therefore microblogg...
Alexander Pak, Patrick Paroubek