Hadoop Distributions

Hadoop Distributions

  • A standard open source Hadoop distribution (Apache Hadoop) includes:
    • The Hadoop MapReduce framework for running computations in parallel.
    • The Hadoop Distributed File System (HDFS).
    • Hadoop YARN – a resource management platform responsible for managing resources in clusters and them for scheduling of users’s applications.
    • Hadoop Common, a set of libraries and utilities used by other Hadoop modules.
    • This is only a basic set of Hadoop components; there are other solutions available– such as Apache Hive, Apache Pig and Apache Zookeeper, etc.  That are widely used to solve specific tasks, speed up computations, optimize routine tasks, etc.

Commercial Hadoop Distribuitors

  • Amazon Web services
  • Cloudera
  • HortonWorks
  • IBM InfoSphere
  • MapR Technologies
  • Teradata
  • Intel

Amazon Web services



Cloudera


HortonWorks

IBM InfoSphere


MapR



Difference between top three enterprise edition providers



No hay comentarios:

Publicar un comentario