Principles of Data Wrangling (O'Reilly)
Monday, 14 August 2017

This practical guide shows how data wrangling, the process of converting raw data into something truly useful, can be achieved. Authors Tye Rattenbury, Joe Hellerstein, Jeffrey Heer, Sean Kandel and Connor Carreras provide business analysts with an overview of various data wrangling techniques and tools, and put the practice of data wrangling into context by asking, "What are you trying to do and why?"

<ASIN:1491938927>

Wrangling data consumes roughly 50-80% of an analyst's time before any kind of analysis is possible. Written by executives at Trifacta (who have a platform for exploring and preparing data for analysis), the book explores several factors--time, granularity, scope, and structure.

Author: Tye Rattenbury, Joe Hellerstein, Jeffrey Heer, Sean Kandel and Connor Carreras
Publisher: O'Reilly
Date: July 2017
Pages: 94
ISBN: 978-1491938928
Print: 1491938927
Kindle: B073HMH8XG
Audience: Data managers
Level: Introductory
Category: Data Science

 

 

  • Understand what kind of data is available
  • Choose which data to use and at what level of detail
  • Meaningfully combine multiple sources of data
  • Decide how to distill the results to a size and shape that can drive downstream analysis

For recommendations of Big Data books see Reading Your Way Into Big Data in our Programmer's Bookshelf section.

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

To have new titles included in Book Watch contact  BookWatch@i-programmer.info

Banner
 


Query Store for SQL Server 2019 (Apress)

Author: Tracy Boggiano & Grant Fritchey
Publisher: Apress
Pages: 234
ISBN: 978-1484250037
Print: 1484250036
Kindle: B07YNL3X4X
Audience: SQL Server DBAs and Devs
Rating: 4
Reviewer: Ian Stirk

This book aims to use Query Store to improve your SQL Server queries, how does it fare?



Machine Learning For Dummies, 2e (Wiley)

Author: John Paul Mueller
Publisher: For Dummies
Date: January 2021
Pages: 464
ISBN: 978-1119724018
Print: 1119724015
Kindle: B08SZHJGJW
Audience: General, but not too dumb
Rating: 4
Reviewer: Mike James
Dummies probably need machine learning to cope...


More Reviews