Practical Data Science with Hadoop and Spark (Addison Wesley)
Practical Data Science with Hadoop and Spark (Addison Wesley)
Thursday, 12 January 2017

This book shows how to solve real data science problems with Hadoop and Spark. Authors Ofer Mendelevitch, Casey Stella, and Douglas Eadline draw on their experience with Hadoop and big data to bring together everything you need: high-level concepts, deep-dive techniques, real-world use cases, practical applications, and hands-on tutorials.

<ASIN:0134024141>

The essentials of data science and the modern Hadoop ecosystem are introduced, along with guidance on data ingestion, data cleansing, and visualization.

The book then moves on to focus on specific applications, including machine learning, predictive modeling for sentiment analysis, clustering for document analysis, anomaly detection, and natural language processing (NLP).

Author: Ofer Mendelevitch, Casey Stella, and Douglas Eadline
Publisher: Addison Wesley
Date: December 2016
Pages: 256
ISBN: 978-0134024141
Print: 0134024141
Kindle: B01N7G1M8J
Audience: Data mining developers
Level: Intermediate
Category: Data Science

 

Covers: 

  • What data science is, how it has evolved, and how to plan a data science career
  • How data volume, variety, and velocity shape data science use cases
  • Hadoop and its ecosystem, including HDFS, MapReduce, YARN, and Spark
  • Data importation with Hive and Spark
  • Data quality, preprocessing, preparation, and modeling
  • Visualization: surfacing insights from huge data sets
  • Machine learning: classification, regression, clustering, and anomaly detection
  • Algorithms and Hadoop tools for predictive modeling
  • Cluster analysis and similarity functions
  • Large-scale anomaly detection
  • NLP: applying data science to human language

 

Related Articles

Reading Your Way Into Big Data

What is a Data Scientist and How Do I Become One?

 

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

To have new titles included in Book Watch contact  BookWatch@i-programmer.info

Banner
 


Quantum Mechanics: The Theoretical Minimum

Author: Leonard Susskind, Art Friedman
Publisher: Basic Books 
Pages: 384
ISBN: 978-0465062904
Print: 0465062903
Kindle: B00IFTT8GA
Audience: Students of Quantum Mechanics and the curious.
Rating: 5
Reviewer: Mike James

There are lots of people who want to know about  [ ... ]



Speaking JavaScript

Author: Axel Rauschmayer
Publisher: O'Reilly
Pages: 460
ISBN: 978-1449365035
Print: 1449365035
Kindle: B00INES03O
Audience: JavaScript Programmers
Rating: 4.8
Reviewer: Ian Elliot

If you need a "big book of everything JavaScript" this might be the one you are looking for.


More Reviews

 
 

   
RSS feed of all content
I Programmer - full contents
Copyright © 2018 i-programmer.info. All Rights Reserved.
Joomla! is Free Software released under the GNU/GPL License.