data sets | Sciweavers

240

VLUDS
2010

184views Visualization» more VLUDS 2010»

Advanced Visualization and Interaction Techniques for Large High-Resolution Displays

15 years 1 months ago

Large high-resolution displays combine the images of multiple smaller display devices to form one large display area. A total resolution that can easily comprise several hundred m...

Sebastian Thelen

claim paper

Read More »

280

click to vote

PVLDB
2010

195views more PVLDB 2010»

Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints

15 years 2 months ago

Download www.comp.nus.edu.sg

A string similarity join finds similar pairs between two collections of strings. It is an essential operation in many applications, such as data integration and cleaning, and has ...

Jiannan Wang, Guoliang Li, Jianhua Feng

claim paper

Read More »

260

click to vote

PROMISE
2010

121views Software Engineering» more PROMISE 2010»

Replication of defect prediction studies: problems, pitfalls and recommendations

15 years 2 months ago

Download promisedata.org

Background: The main goal of the PROMISE repository is to enable reproducible, and thus verifiable or refutable research. Over time, plenty of data sets became available, especial...

Thilo Mende

claim paper

Read More »

235

click to vote

JMLR
2010

161views more JMLR 2010»

Training and Testing Low-degree Polynomial Data Mappings via Linear SVM

15 years 2 months ago

Download www.csie.ntu.edu.tw

Kernel techniques have long been used in SVM to handle linearly inseparable problems by transforming data to a high dimensional space, but training and testing large data sets is ...

Yin-Wen Chang, Cho-Jui Hsieh, Kai-Wei Chang, Micha...

claim paper

Read More »

236

click to vote

JMLR
2010

182views more JMLR 2010»

Quadratic Programming Feature Selection

15 years 2 months ago

Download jmlr.csail.mit.edu

Identifying a subset of features that preserves classification accuracy is a problem of growing importance, because of the increasing size and dimensionality of real-world data se...

Irene Rodriguez-Lujan, Ramón Huerta, Charle...

claim paper

Read More »

264

click to vote

JMLR
2010

156views more JMLR 2010»

Classification with Incomplete Data Using Dirichlet Process Priors

15 years 2 months ago

Download people.ee.duke.edu

A non-parametric hierarchical Bayesian framework is developed for designing a classifier, based on a mixture of simple (linear) classifiers. Each simple classifier is termed a loc...

Chunping Wang, Xuejun Liao, Lawrence Carin, David ...

claim paper

Read More »

237

click to vote

WWW
2011
ACM

290views Internet Technology» more WWW 2011»

Parallel boosted regression trees for web search ranking

15 years 2 months ago

Download www.cse.wustl.edu

Gradient Boosted Regression Trees (GBRT) are the current state-of-the-art learning paradigm for machine learned websearch ranking — a domain notorious for very large data sets. ...

Stephen Tyree, Kilian Q. Weinberger, Kunal Agrawal...

claim paper

Read More »

237

click to vote

NAR
2011

241views Computer Vision» more NAR 2011»

PRIDB: a protein-RNA interface database

15 years 2 months ago

Download www.cs.iastate.edu

The Protein–RNA Interface Database (PRIDB) is a comprehensive database of protein–RNA interfaces extracted from complexes in the Protein Data Bank (PDB). It is designed to fac...

Benjamin A. Lewis, Rasna R. Walia, Michael Terribi...

claim paper

Read More »

232

click to vote

BMCBI
2011

248views Artificial Intelligence» more BMCBI 2011»

Learning genetic epistasis using Bayesian network scoring criteria

15 years 2 months ago

Download www.biomedcentral.com

Background: Gene-gene epistatic interactions likely play an important role in the genetic basis of many common diseases. Recently, machine-learning and data mining methods have be...

Xia Jiang, Richard E. Neapolitan, M. Michael Barma...

claim paper

Read More »

247

click to vote

DEBU
2010

138views more DEBU 2010»

A Rule-Based Citation System for Structured and Evolving Datasets

15 years 4 months ago

Download sites.computer.org

We consider the requirements that a citation system must fulfill in order to cite structured and evolving data sets. Such a system must take into account variable granularity, con...

Peter Buneman, Gianmaria Silvello

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers