Hadoop 2.2.0 and Hive 0.12.0

Hadoop 2.2.0 is the GA release of Apache Hadoop 2.x.

Users are encouraged to immediately move to 2.2.0 since this release is significantly more stable and is guaranteed to remain compatible in terms of both APIs and protocols.

To recap, this release has a number of significant highlights compared to Hadoop 1.x:

    • YARN – A general purpose resource management system for Hadoop to allow MapReduce and other other data processing frameworks and services
    • High Availability for HDFS
    • HDFS Federation
    • HDFS Snapshots
    • NFSv3 access to data in HDFS
    • Support for running Hadoop on Microsoft Windows
    • Binary Compatibility for MapReduce applications built on hadoop-1.x
    • Substantial amount of integration testing with rest of projects in the ecosystem

A couple of important points to note while upgrading to hadoop-2.2.0:

    • HDFS – The HDFS community decided to push the symlinks feature out to a future 2.3.0 release and is currently disabled.
    • YARN/MapReduce – Users need to change ShuffleHandler service name from mapreduce.shuffle to mapreduce_shuffle.
Hive release 0.12.0 available

This release is the latest release of Hive and it works with Hadoop 0.20.x, 0.23.x.y, 1.x.y, 2.x.y

Oglasi
Ovaj unos je objavljen u Nekategorizirano. Bookmarkirajte stalnu vezu.

Komentiraj

Popunite niže tražene podatke ili kliknite na neku od ikona za prijavu:

WordPress.com Logo

Ovaj komentar pišete koristeći vaš WordPress.com račun. Odjava / Izmijeni )

Twitter picture

Ovaj komentar pišete koristeći vaš Twitter račun. Odjava / Izmijeni )

Facebook slika

Ovaj komentar pišete koristeći vaš Facebook račun. Odjava / Izmijeni )

Google+ photo

Ovaj komentar pišete koristeći vaš Google+ račun. Odjava / Izmijeni )

Spajanje na %s