Fast Data Processing with Spark 2nd Ed (Packt)

Thursday, 14 May 2015

This step-by-step tutorial from Krishna Sankar and Holden Karau is for software developers who want to learn how to write distributed programs with Spark. In it you will develop a machine learning system with Spark's MLlib and scalable algorithms and deploy Spark jobs to various clusters such as Mesos, EC2, Chef, YARN, EMR, and so on. No previous experience with distributed programming is necessary. However it assumes knowledge of either Java, Scala, or Python.

<ASIN:178439257X>

Authors: Krishna Sankar and Holden Karau
Publisher: Packt Publishing

Date: March 31, 2015
Pages: 184

ISBN: 9781784392574
Print: 178439257X
Kindle: B00VIBPW3U
Category: Data Science
Level: No previous experience with distributed programming is necessary. Assumes knowledge of either Java, Scala, or Python.

See Kay Ewbank's review of Learning Spark co-authored by Holden Karau

Visit Book Watch Archive for hundreds more titles.

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

To have new titles included in Book Watch contact BookWatch@i-programmer.info

Continuous Architecture In Practice (Addison-Wesley)

Author: Murat Erder, Pierre Pureur and Eoin Woods
Publisher: Addison-Wesley
Pages: 352
ISBN: 978-0136523567
Print: 0136523560
Kindle: ‎B08ZRTQGLJ
Audience: Software Architects
Rating: 3
Reviewer: Kay Ewbank

This book sets out the case for why software architecture is more important than ever, and in p [ ... ]

+ Full Review

Fundamentals of Database Management Systems

Author: Dr. Mark L. Gillenson
Publisher: Wiley
Pages: 416
ISBN:978-1119907466
Print:1119907462
Audience: Database managers
Rating: 3
Reviewer: Kay Ewbank

This book is aimed at people taking a one-semester course in database management as part of their larger information systems management course. As suc [ ... ]

+ Full Review

More Reviews