Hadoop Oozie Like Projects [closed] - hadoop

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 6 years ago.
Improve this question
Oozie is a workflow/coordination engine to orchestrate Hadoop jobs where Oozie Workflow jobs are Directed Acyclical Graphs (DAGs) of actions.
What are other Oozile like workflow engines to manage job chains on a cluster?
Is there a generic Oozile like workflow engine that is capable of orchestrating jobs any cluster through a plug-in or something?

although I personally prefer oozie you could also checkout azkaban.

Hortonworks' NIFI is an awesome workflow engine which does what you are asking.

Related

Is there a go framework that only uses Protocol buffers to communicate with Hadoop? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
I was exploring options to write hadoop map/reduce programs in golang. I want to make use native hadoop apis using Hadoop RPC and not alternative solutions like streaming etc.

Best way to learn MapReduce [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
I'm familiar and have worked with Hive, Pig, HBase. I have also gone through the Hadoop Definitive guide. I am familiar with core java, MapReduce architecture and MapReduce internals. However, I don't have any hands on experience in MapReduce and I need to learn MapReduce in terms of practical scenarios. Is there any book or link which you would recommend? I would appreciate any help or suggestions. Thanks!!
You can follow this book. This contains different mapreduce scenarios and code snippets.
MapReduce Patterns

Is there a good ETL framework for data warehouse in Hadoop [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
I have investigated Oozie and Azkaban, but I think they are only used to schedule some jobs.
DW often need large of jobs to schedule, and is there a good framework for it?
You can Use Pentaho data integration tool . Check this out. http://www.pentaho.com/product/data-integration
You may also check Talend for Data integration in hadoop based warehouse. It offers graphical tools to create data intgeration flow between the hadoop components and it is opensource too.
please check http://www.talend.com/resource/hadoop-tools.html

Cloudera Certification for Hadoop developer [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 6 years ago.
Improve this question
I want to write Cloudera certification for Hadoop developer as people suggested that it is more complex compared to Hortonworks certification.
I was looking for materials for my preparation. All the materials in Cloudera site is very expensive atleast for me. Can you please let me know the books which I can use to clear this certification?
If you know the content well in the following books, then you should be decently prepared to attempt the exam:
Hadoop: The Definitive Guide
Hadoop Operations
I am also planning to write CCA Spark and Hadoop Developer Exam (CCA175) this year.
Kindly find my steps of preparation:
Haddop: The Definitive Guide
Pro Apache Hadoop
Hands on Practice by following youtube channel by itversity
Kindly add if you find more useful references.

Crontab schedule visualization tool for Windows [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
We have a crontab scheduler running on our Unix server. Currently when someone adds a job to the crontab, he has to add the job to an outlook calendar manually.
However, this outlook calendar has lost its functionality, partly due to old jobs not being removed, partly due to number of jobs, partly due to the lack of visualisation options.
I've found this question is-there-a-tool-that-allows-visualization-of-crontab-entries
but I'm looking for a tool that allows me to load in the crontab schedule file and visualize it a Windows environement.
Does anyone have a good tool for this or an alternative solution ?

Resources