Sciweavers

SIGMOD
2009
ACM
171views Database» more  SIGMOD 2009»
13 years 9 months ago
Peta-scale data warehousing at Yahoo!
Mona Ahuja, Cheng Che Chen, Ravi Gottapu, Jör...
SIGMOD
2009
ACM
165views Database» more  SIGMOD 2009»
13 years 9 months ago
Query by output
It has recently been asserted that the usability of a database is as important as its capability. Understanding the database schema, the hidden relationships among attributes in t...
Quoc Trung Tran, Chee-Yong Chan, Srinivasan Partha...
SIGMOD
2009
ACM
186views Database» more  SIGMOD 2009»
13 years 11 months ago
What's on the grapevine?
User generated content and social media (in the form of blogs, wikis, online video, microblogs, etc) are proliferating online. Grapevine conducts large scale data analysis on the ...
Albert Angel, Nick Koudas, Nikos Sarkas, Divesh Sr...
SIGMOD
2009
ACM
119views Database» more  SIGMOD 2009»
13 years 11 months ago
Search your memory ! - an associative memory based desktop search system
Jidong Chen, Hang Guo, Wentao Wu, Chunxin Xie
SIGMOD
2009
ACM
161views Database» more  SIGMOD 2009»
13 years 11 months ago
Dependency-aware reordering for parallelizing query optimization in multi-core CPUs
The state of the art commercial query optimizers employ cost-based optimization and exploit dynamic programming (DP) to find the optimal query execution plan (QEP) without evalua...
Wook-Shin Han, Jinsoo Lee
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
13 years 11 months ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
SIGMOD
2009
ACM
172views Database» more  SIGMOD 2009»
13 years 11 months ago
Self-organizing tuple reconstruction in column-stores
Column-stores gained popularity as a promising physical design alternative. Each attribute of a relation is physically stored as a separate column allowing queries to load only th...
Stratos Idreos, Martin L. Kersten, Stefan Manegold
SIGMOD
2009
ACM
476views Database» more  SIGMOD 2009»
13 years 11 months ago
MobileMiner: a real world case study of data mining in mobile communication
Mobile communication data analysis has been often used as a background application to motivate many data mining problems. However, very few data mining researchers have a chance t...
Tengjiao Wang, Bishan Yang, Jun Gao, Dongqing Yang...
SIGMOD
2009
ACM
269views Database» more  SIGMOD 2009»
14 years 4 months ago
Efficient approximate entity extraction with edit distance constraints
Named entity recognition aims at extracting named entities from unstructured text. A recent trend of named entity recognition is finding approximate matches in the text with respe...
Wei Wang 0011, Chuan Xiao, Xuemin Lin, Chengqi Zha...
SIGMOD
2009
ACM
190views Database» more  SIGMOD 2009»
14 years 4 months ago
Optimizing complex extraction programs over evolving text data
Most information extraction (IE) approaches have considered only static text corpora, over which we apply IE only once. Many real-world text corpora however are dynamic. They evol...
Fei Chen 0002, Byron J. Gao, AnHai Doan, Jun Yang ...