Hadoop Distributions
Hadoop Distributions
- A standard open source Hadoop distribution (Apache Hadoop) includes:
- The Hadoop MapReduce framework for running computations in parallel.
- The Hadoop Distributed File System (HDFS).
- Hadoop YARN – a resource management platform responsible for managing resources in clusters and them for scheduling of users’s applications.
- Hadoop Common, a set of libraries and utilities used by other Hadoop modules.
- This is only a basic set of Hadoop components; there are other solutions available– such as Apache Hive, Apache Pig and Apache Zookeeper, etc. That are widely used to solve specific tasks, speed up computations, optimize routine tasks, etc.
Commercial Hadoop Distribuitors
- Amazon Web services
- Cloudera
- HortonWorks
- IBM InfoSphere
- MapR Technologies
- Teradata
- Intel
Amazon Web services
Cloudera
HortonWorks
IBM InfoSphere
MapR
Difference between top three enterprise edition providers
No hay comentarios:
Publicar un comentario