Integrating Nutch on Hortownworks OR YARN - hadoop

I am trying to crawl the web. Preferably with Nutch.
Did not find the references if Hortownworks out of the box supports Nutch.
Has any one integrated Nutch on YARN specially with Hortonworks HDP ?
Or someone has tried integrating Nutch on the Hadoop 2.x (YARN) ?
Thanks in advance.

HDP 2.3 doesn't support Nutch out of the box (There is a chart on the HDP website showing supported services: HDP2.3 What's New). However it does support the services that Nutch depends on. A custom Ambari Service could be defined and added to the HDP 2.3 stack definition to enable support for Nutch.

Related

How is HBase packaged in HDP different from Apache HBase

How is HBase packaged in Hortonworks Data Platform (HDP) different from Apache HBase. We use HDP in production but for dev purposes, test with Apache HBase.
What should we do in our code to allow for any differences?
HDP packages all open source components. There should be no difference

does hadoop 2.8 support apache spark cluster 2.1?

Could you please let me know that is Apache Hadoop 2.8 is compatible with Apache spark 2.1.1 or not?
I have already set up a test cluster where Apache Hadoop 2.8 is installed , and now we need apache spark 2.1.1 to be installed on the top of that.
If yes , then please let us know that which package will be good to install? (Please provide the URL here).

Not able to install Spark 2.0 in CDH 5.7.5

I am trying to install SPARK 2.0 in my CDH 5.7.5 cluster. While doing that i am getting below error
CDH (lower than 5.12) parcel required for SPARK2 (2.0.0.cloudera1-1.cdh5.7.0.p0.113931) is not available
P.S: Followed documentation
uncheck Validate Parcel Relations in Parcel configuration
According to Cloudera support, it is a bug in 5.7 and 5.8. This has been fixed in 5.9 and newer.
Adding to the #Ruslan answer, Who ever using CDH 5.7 and 5.8, they need to follow the below workaround
under parcel configurations, uncheck Validate Parcel Relations

Integration of Hortonworks HDP with Mesos

I have to integrate HDP with Mesos. I don't want to do it with cloudbreak, because it's not a mature project. Is there any other ways you can integrate HDP with Mesos ?
See Apache Myriad (incubating) at http://myriad.incubator.apache.org/

Can not find Solr in HDP 2.3

I have installed HDP2.3 but can not find Solr while trying to add service
in admin panel under version I can see the HDP version as HDP-2.3.0.0-2557 but cnt find solr while trying to add service
Solr is no longer a service in the HDP stack. Instead it is an addon that can be downloaded from the HDP-Utils repository and installed separately. Solr is now part of HDP Search, you can read more about it in the HDP documentation.

Resources