Principles of Data Wrangling (O'Reilly)
Monday, 14 August 2017

This practical guide shows how data wrangling, the process of converting raw data into something truly useful, can be achieved. Authors Tye Rattenbury, Joe Hellerstein, Jeffrey Heer, Sean Kandel and Connor Carreras provide business analysts with an overview of various data wrangling techniques and tools, and put the practice of data wrangling into context by asking, "What are you trying to do and why?"

Wrangling data consumes roughly 50-80% of an analyst's time before any kind of analysis is possible. Written by executives at Trifacta (who have a platform for exploring and preparing data for analysis), the book explores several factors--time, granularity, scope, and structure.

Author: Tye Rattenbury, Joe Hellerstein, Jeffrey Heer, Sean Kandel and Connor Carreras
Publisher: O'Reilly
Date: July 2017
Pages: 94
ISBN: 978-1491938928
Print: 1491938927
Kindle: B073HMH8XG
Audience: Data managers
Level: Introductory
Category: Data Science

 

 

  • Understand what kind of data is available
  • Choose which data to use and at what level of detail
  • Meaningfully combine multiple sources of data
  • Decide how to distill the results to a size and shape that can drive downstream analysis

For recommendations of Big Data books see Reading Your Way Into Big Data in our Programmer's Bookshelf section.

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

To have new titles included in Book Watch contact  BookWatch@i-programmer.info

Banner
 


SQL Server Query Tuning and Optimization (Packt)

Author: Benjamin Nevarez
Publisher: Packt Publishing Pages: 446
ISBN: 9781803242620
Print: 1803242620
Kindle: B0B42SVBFY
Audience: Intermediate to advanced DBAs and developers
Rating: 4.7
Reviewer: Ian Stirk 

This book aims to give you the tools and knowledge to get peak performance from your que [ ... ]



Data Structures & Algorithms in Python

Author: Dr. John Canning, Alan Broder and Robert Lafore
Publisher: Addison-Wesley
Date: October 2022
Pages: 928
ISBN:978-0134855684
Print: 013485568X
Kindle: B0B1WJF1K9
Audience: Python developers
Rating: 4
Reviewer: Mike James
Data structures in Python - a good idea!


More Reviews