Principles of Data Wrangling (O'Reilly)
Monday, 14 August 2017

This practical guide shows how data wrangling, the process of converting raw data into something truly useful, can be achieved. Authors Tye Rattenbury, Joe Hellerstein, Jeffrey Heer, Sean Kandel and Connor Carreras provide business analysts with an overview of various data wrangling techniques and tools, and put the practice of data wrangling into context by asking, "What are you trying to do and why?"

<ASIN:1491938927>

Wrangling data consumes roughly 50-80% of an analyst's time before any kind of analysis is possible. Written by executives at Trifacta (who have a platform for exploring and preparing data for analysis), the book explores several factors--time, granularity, scope, and structure.

Author: Tye Rattenbury, Joe Hellerstein, Jeffrey Heer, Sean Kandel and Connor Carreras
Publisher: O'Reilly
Date: July 2017
Pages: 94
ISBN: 978-1491938928
Print: 1491938927
Kindle: B073HMH8XG
Audience: Data managers
Level: Introductory
Category: Data Science

 

 

  • Understand what kind of data is available
  • Choose which data to use and at what level of detail
  • Meaningfully combine multiple sources of data
  • Decide how to distill the results to a size and shape that can drive downstream analysis

For recommendations of Big Data books see Reading Your Way Into Big Data in our Programmer's Bookshelf section.

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

To have new titles included in Book Watch contact  BookWatch@i-programmer.info

Banner
 


Object-Oriented Python

Author: Irv Kalb
Publisher: No Starch Press
Date: January 2022
Pages: 416
ISBN: 978-1718502062
Print: 1718502060
Kindle: ‎ B0957SHYQL
Audience: Python developers
Rating: 3
Reviewer: Mike James
Python, Object-Oriented? Not a lot of programmers know that!



Beginning Programming All-in-One For Dummies

Author: Wallace Wang
Publisher: For Dummies
Pages: 800
ISBN: 978-1119884408
Print: 1119884403
Kindle: B0B1BLY87B
Audience: Novice programmers
Rating: 3
Reviewer: Kay Ewbank

This is a collection of seven shorter books introducing key aspects of programming, but it fails through trying to cover too [ ... ]


More Reviews