Hadoop in 24 Hours (Sams Teach Yourself)
Tuesday, 11 April 2017

Learn all the skills and techniques you'll need to deploy each key component of a Hadoop platform in 24 lessons of an hour or less. Jeffrey Aven shows how to put a platform together in your local environment or in the cloud, building a fully functional Hadoop cluster and using it with real programs and datasets.

<ASIN:0672338521>

The book has step-by-step instructions to walk you through common questions, issues, and tasks; Q-and-As, Quizzes, and Exercises build and test your knowledge; "Did You Know?" tips; and "Watch Out!" alerts help you avoid pitfalls.

Author: Jeffrey Aven
Publisher: Sams
Date: April 2017
Pages: 500
ISBN: 978-0672338526
Print: 0672338521
Kindle: B06XYM3XH4
Audience: Big data developers
Level: intermediate
Category: Data Science

 

  • Understanding Hadoop and the Hadoop Distributed File System (HDFS)
  • Importing data into Hadoop, and process it there
  • Mastering basic MapReduce Java programming, and using advanced MapReduce API concepts
  • Making the most of Apache Pig and Apache Hive
  • Implementing and administering YARN
  • Taking advantage of the full Hadoop ecosystem
  • Managing Hadoop clusters with Apache Ambari
  • Working with the Hadoop User Environment (HUE)
  • Scaling, securing, and troubleshooting Hadoop environments
  • Integrating Hadoop into the enterprise
  • Deploying Hadoop in the cloud
  • Getting started with Apache Spark

Also see Reading Your Way Into Big Data, an article on Programmer's Bookshelf in which Ian Stirk provides a roadmap of the reading required to take you from novice to competent in areas relating to data science.

Follow @bookwatchiprog on Twitter or subscribe to our Books RSS feed for each day's new addition to Book Watch and for new reviews.

To have new titles included in Book Watch contact  BookWatch@i-programmer.info

 

Banner
 


High Performance SQL Server, 2nd Ed (Apress)

Author: Benjamin Nevarez
Publisher: Apress
Pages: 420
ISBN: 978-1484264904
Print: 1484264908
Kindle: B08TQR3NMF
Audience: SQL Server DBAs and Devs
Rating: 4.5
Reviewer: Ian Stirk

This book aims to improve the performance of your SQL Server, how does it fare?



SQL Query Design Patterns and Best Practices

Author: Steve Hughes et al
Publisher: Packt Publishing
Pages: 270
ISBN: 978-1837633289
Print: 1837633282
Kindle: B0BWRD7HQ7
Audience: Query writers
Rating: 2.5
Reviewer: Ian Stirk

This book aims to improve your SQL queries using design patterns, how does it fare? 


More Reviews