Principles of Data Wrangling (O'Reilly)
Monday, 14 August 2017

This practical guide shows how data wrangling, the process of converting raw data into something truly useful, can be achieved. Authors Tye Rattenbury, Joe Hellerstein, Jeffrey Heer, Sean Kandel and Connor Carreras provide business analysts with an overview of various data wrangling techniques and tools, and put the practice of data wrangling into context by asking, "What are you trying to do and why?"

<ASIN:1491938927>

Wrangling data consumes roughly 50-80% of an analyst's time before any kind of analysis is possible. Written by executives at Trifacta (who have a platform for exploring and preparing data for analysis), the book explores several factors--time, granularity, scope, and structure.

Author: Tye Rattenbury, Joe Hellerstein, Jeffrey Heer, Sean Kandel and Connor Carreras
Publisher: O'Reilly
Date: July 2017
Pages: 94
ISBN: 978-1491938928
Print: 1491938927
Kindle: B073HMH8XG
Audience: Data managers
Level: Introductory
Category: Data Science

 

 

  • Understand what kind of data is available
  • Choose which data to use and at what level of detail
  • Meaningfully combine multiple sources of data
  • Decide how to distill the results to a size and shape that can drive downstream analysis

For recommendations of Big Data books see Reading Your Way Into Big Data in our Programmer's Bookshelf section.

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

To have new titles included in Book Watch contact  BookWatch@i-programmer.info

Banner
 


Classic Computer Science Problems in Python

Author: David Kopec
Publisher: Manning
Date: March 2019
Pages: 224
ISBN: 978-1617295980
Print: 1617295981
Kindle: ‎ ‎ B09782BT4Q
Level: Intermediate
Audience: Python developers
Category: Python
Rating: 4
Reviewer: Mike James
Classic algorithms in Python - the world's favourite language.



Domain Storytelling (Pearson)

Author: Stefan Hofer
Publisher: Pearson
Pages: 288
ISBN:978-0137458912
Print:0137458916
Kindle:B099ZNXCJT
Audience: software architects
Rating: 4.5
Reviewer: Kay Ewbank

This book sets out to be a practical guide to database domains, bringing together domain experts, software developers, designers and bus [ ... ]


More Reviews