Apache Hadoop

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

This plug-in enables you to run a Hadoop application on a NamedNode as part of an UrbanCode Deploy process.

This plug-in is developed and supported by the UrbanCode Deploy Community on the IBM DevOps Services platform. Click the “Visit Project” button at the top of the page to navigate to the project for further information.

Available Steps

Create Directory – Creates Directory in DFS

Copy Inputs – Copy inputs into DFS

Run Jar – Runs Jar file on Hadoop