Best way to learn MapReduce [closed] - hadoop

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
I'm familiar with and have worked with Hive, Pig, and HBase. I have also gone through Hadoop: The Definitive Guide. I am familiar with core Java, the MapReduce architecture, and MapReduce internals. However, I don't have any hands-on experience with MapReduce, and I need to learn it in terms of practical scenarios. Is there any book or link you would recommend? I would appreciate any help or suggestions. Thanks!

You can follow this book. It contains different MapReduce scenarios and code snippets.
MapReduce Patterns
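
To get a feel for the kind of scenario such pattern books start from, here is a minimal sketch of the classic word-count pattern. It is plain Java with no Hadoop dependency, so it only imitates the map, shuffle, and reduce phases in memory. The class name `WordCountSketch` and its methods are hypothetical names chosen for illustration; they are not the Hadoop API and are not taken from the book.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical in-memory imitation of a MapReduce word count (not the Hadoop API).
public class WordCountSketch {

    // Map phase: emit a (word, 1) pair for every token in one input line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (String token : line.toLowerCase().split("\\W+")) {
            if (!token.isEmpty()) {
                out.add(new SimpleEntry<>(token, 1));
            }
        }
        return out;
    }

    // Shuffle phase: group all emitted values by key, with keys kept sorted.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: sum the values collected for one key.
    static int reduce(String key, List<Integer> values) {
        int sum = 0;
        for (int v : values) {
            sum += v;
        }
        return sum;
    }

    // Driver: run map over every line, shuffle the pairs, then reduce each group.
    static Map<String, Integer> wordCount(List<String> lines) {
        List<Map.Entry<String, Integer>> emitted = new ArrayList<>();
        for (String line : lines) {
            emitted.addAll(map(line));
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> e : shuffle(emitted).entrySet()) {
            result.put(e.getKey(), reduce(e.getKey(), e.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(wordCount(List.of("the quick brown fox", "the lazy dog", "The fox")));
    }
}
```

In a real Hadoop job the map and reduce steps would live in `Mapper` and `Reducer` subclasses and the framework would do the shuffle for you, but working through the three phases by hand like this can make the patterns in the book easier to follow.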

Related

Information about big data and hadoop [closed]

Closed 4 years ago.
I am a programmer with a good grasp of database concepts. I want to learn about big data and Hadoop, but I don't have any past experience in these subjects. If anyone has good experience with big data and Hadoop, please share how to start learning. If there are any institutes or good websites, please suggest them. And if I need to take any course, please suggest that as well. Thanks in advance. :-)
I find the tutorials from Hortonworks a rather good starting point:
http://hortonworks.com/tutorials/#tuts-developers
For a deep dive, Tom White's 'Hadoop: The Definitive Guide' is a must-read.
'Hadoop in Practice' offers a lot of cookbook-style examples.

Is there a good ETL framework for data warehouse in Hadoop [closed]

Closed 7 years ago.
I have investigated Oozie and Azkaban, but I think they are only used to schedule jobs.
A data warehouse often needs a large number of jobs to be scheduled. Is there a good framework for that?
You can use the Pentaho Data Integration tool. Check it out: http://www.pentaho.com/product/data-integration
You may also check Talend for data integration in a Hadoop-based warehouse. It offers graphical tools to create data integration flows between the Hadoop components, and it is open source too.
Please check http://www.talend.com/resource/hadoop-tools.html

Cloudera Certification for Hadoop developer [closed]

Closed 6 years ago.
I want to take the Cloudera certification for Hadoop developers, as people have suggested that it is more difficult than the Hortonworks certification.
I was looking for materials for my preparation. All the materials on the Cloudera site are very expensive, at least for me. Can you please let me know which books I can use to pass this certification?
If you know the content well in the following books, then you should be decently prepared to attempt the exam:
Hadoop: The Definitive Guide
Hadoop Operations
I am also planning to take the CCA Spark and Hadoop Developer Exam (CCA175) this year.
Here are my preparation steps:
Hadoop: The Definitive Guide
Pro Apache Hadoop
Hands-on practice following the itversity YouTube channel
Please add any other useful references you find.

Good Tutorials on Hadoop Cascading [closed]

Closed 8 years ago.
Could anybody point me to some simple examples, links, and books for learning Hadoop Cascading? I couldn't find any good resources except the Cascading site: http://www.cascading.org/
http://www.cascading.org/documentation/ is probably the best learning source for Cascading.
My personal experience was: Cascading User Guide + Cascading for the Impatient.
There's an O'Reilly book on Cascading: http://shop.oreilly.com/product/0636920028536.do
I also found an interesting 'test-driven learning' project:
BertrandDechoux/cascading.learn
Other useful resources are available at: http://www.concurrentinc.com/resources/
If anything is not clear, you can always ask questions on the user group and get answers there.

Hadoop source code [closed]

Closed 4 years ago.
I am new to Hadoop and trying to learn more about it. Sometimes I feel that looking at the source code for a particular class would help me understand the underlying functionality better. However, I do not know where to find the source code for each of the Java classes in the Hadoop API.
Can anyone please let me know how to go about it? Is it part of the Hadoop JAR files I download for running the framework? Thanks.
SVN repository for trunk: http://svn.apache.org/viewvc/hadoop/common/trunk/
For specific branches: http://svn.apache.org/viewvc/hadoop/common/branches/
You can browse Hadoop code online at zGrepCode. The good part is that it allows navigation within classes just like any IDE. Here is the link: https://zgrepcode.com/hadoop/apache/
Please see this link, and go through the getting-started-with-Hadoop material in particular, especially module 3: http://developer.yahoo.com/hadoop/tutorial/
Apache Hadoop
http://hadoop.apache.org/docs/stable/mapred_tutorial.html
Book
http://lintool.github.io/MapReduceAlgorithms/
Good luck!
