Fast Data Processing with Spark 2nd Ed (Packt)
Thursday, 14 May 2015

This step-by-step tutorial from Krishna Sankar and Holden Karau is for software developers who want to learn how to write distributed programs with Spark. In it you will develop a machine learning system with Spark's MLlib and scalable algorithms, and deploy Spark jobs to a variety of cluster environments, including Mesos, EC2, Chef, YARN, and EMR. No previous experience with distributed programming is necessary, but the book assumes knowledge of Java, Scala, or Python.

 

Authors: Krishna Sankar and Holden Karau
Publisher: Packt Publishing

Date: March 31, 2015
Pages: 184

ISBN: 9781784392574
Print: 178439257X
Kindle: B00VIBPW3U
Category: Data Science
Level: No previous experience with distributed programming is necessary. Assumes knowledge of either Java, Scala, or Python.

See Kay Ewbank's review of Learning Spark co-authored by Holden Karau 

