Best of Books and Resources to Get Started with Hadoop

Hello Guys!

I will be starting a series of Hadoop tutorials that will help you understand the basic concepts of Hadoop, HDFS, and MapReduce.

But before that, I would like to tell you about a few books and web resources where you can start reading about these topics.

If you are reading this blog article and are interested in learning Hadoop, you are probably already familiar with the power of Big Data and why people are going gaga over it. You can refer to this small article, What is Big Data and How Big is this Big Data, to understand the quantity and quality of data we generate in our day-to-day lives and the different sources it comes from.

Now let's get back to business and look at the

Best of Books and Resources to Get Started with Hadoop

The first book I would recommend to you guys out there is Hadoop: The Definitive Guide, 3rd Edition by Tom White. I started my Big Data journey with this book, and believe me, it is the best resource if you are new to the Big Data world. The book is elegantly written and explains the concepts topic by topic. It also uses a weather dataset as a running example, carried through almost the whole book, to help you understand how things work in Hadoop.

The second book I like reading, which is also very helpful, is Hadoop in Practice by Alex Holmes. Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You'll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book's examples create a well-structured and understandable codebase you can tweak to meet your own needs.

The third one, which is written very simply, is Hadoop in Action by Chuck Lam. Hadoop in Action teaches readers how to use Hadoop and write MapReduce programs. The intended readers are programmers, architects, and project managers who have to process large amounts of data offline. Hadoop in Action will lead the reader from obtaining a copy of Hadoop to setting it up in a cluster and writing data analytic programs.
Note: this book uses the old Hadoop API.

And lastly, if you are more into the administrative side, you can go for Hadoop Operations by Eric Sammer. Along with development, this book talks mainly about administering and maintaining huge clusters for large datasets in a production environment. Eric Sammer, Principal Solution Architect at Cloudera, shows you the particulars of running Hadoop in production, from planning, installing, and configuring the system to providing ongoing maintenance.

Well, these are the books you can refer to for a better conceptual understanding of the Hadoop framework and for practical, hands-on experience of working with it.

Apart from these books, if you want to explore the API, you can see the Hadoop API Docs here.

I hope you find these books and resources helpful for understanding Hadoop and its power in depth.

If you have any questions or want a specific tutorial on Hadoop, you can request it HERE. I will try to get back to you as soon as possible :)

Find Comments below or Add one


Hi, thanks for such a beautiful blog. The link given above for Hadoop: The Definitive Guide, 3rd Edition seems to be broken now. I got the same ebook from the following link. Hope it helps someone.

Unknown said...

Thanks aniket for the link :)

Unknown said...

Very useful links and a wonderful blog. Keep it up.
