Books to start learning big data [closed]

2019-03-08 00:57发布

问题:

I would like to start learning about the big data technologies. I want to work in this area in the future. Does anyone know good books to start learning about it? Hadoop, HBase.

Beginner - intermediate - advanced -

Thanks in advance

回答1:

I think a good start for beginner will be the Big Data course from Coursera

For example I've learnt the basics of MapReduce techonlology.



回答2:

How about Hadoop: The Definitive Guide, from O'Reilly Media. It covers everything to do with Hadoop, MapReduce, HDFS and more.



回答3:

Besides the Cloudera resources I'd highly recommend you the reference books from O'Reilly :

  • Hadoop: The Definitive Guide
  • Programming Pig
  • Programming Hive
  • HBase: The Definitive Guide

You might also check it's data science kit as well.



回答4:

If you are interested in Hive and Pig there are also more specialised books about these technologies:

  • Programming Hive
  • Programming Pig


回答5:

I would suggest to learn machine learning alongside the technology part https://www.coursera.org/course/ml. Learning statistics is also very important.