Data Wrangling for Big Data: Challenges and Opportunities

Data wrangling is the process by which the data required by an application is identified, extracted, cleaned and integrated, to yield a data set that is suitable for exploration and analysis. Although there are widely used Extract, Transform and Load (ETL) techniques and platforms, they often require manual work from technical and domain experts at different stages of the process. When confronted with the 4 V’s of big data (volume, velocity, variety and veracity), manual intervention may make ETL prohibitively expensive. This paper argues that providing cost-effective, highly-automated approaches to data wrangling involves significant research challenges, requiring fundamental changes to established areas such as data extraction, integration and cleaning, and to the ways in which these areas are brought together. Specifically, the paper discusses the importance of comprehensive support for context awareness within data wrangling, and the need for adaptive, pay-as-you-go solutions...

Tim Furche, Georg Gottlob, Leonid Libkin, Giorgio Orsi, Norman W. Paton

Database | EDBT 2016 |

» Streaming data integration Challenges and opportunities

» Dealing proactively with data corruption Challenges and opportunities

» Integrating Renewable Energy Using Data Analytics Systems Challenges and Opportunities

» Semantic Search in Linked Data Opportunities and Challenges

» Opportunities and challenges to unify workload power and cooling management in data center...

» Temporal Analytics on Big Data for Web Advertising

» The data deluge Challenges and opportunities of unlimited data in statistical signal proce...

» Eventbased systems opportunities and challenges at exascale

» Inside Big Data management ogres onions or parfaits

Post Info

Added	02 Apr 2016
Updated	02 Apr 2016
Type	Journal
Year	2016
Where	EDBT
Authors	Tim Furche, Georg Gottlob, Leonid Libkin, Giorgio Orsi, Norman W. Paton
