Apache Hive and new Hadoop mapreduce API

https://cwiki.apache.org/confluence/display/Hive/Hadoop-compatible+Input-Output+Format+for+Hive
Overview
This is a proposal for adding API to hive which allows reading and writing using a Hadoop compatible API. Specifically, the interfaces being implemented are:

The classes will be named HiveApiInputFormat and HiveApiOutputFormat.

My comments:
That proposal is dated Dec 20, 2012, and still there is no solution for the compatibility with the new API. I have a few remarks:

  1. New API InputFormat is a class, not an interface.
  2. The latest Hive 0.11.0 API does not cover that proposal
    (there is no HiveApiInputFormat and HiveApiOutputFormat classes).
  3. Using classes with new API ‘org.apache.hadoop.mapreduce.lib.input.TextInputFormat’ results in error: “FAILED: Error in semantic analysis: 1:21 Input format must implement InputFormat. “
  4. New API lacked few core features, and thus old API was ‘undeprecated’ in the latest release. It is advisable to use old API as most likely it will be the one that is here to stay.
Oglasi
Ovaj unos je objavljen u Nekategorizirano. Bookmarkirajte stalnu vezu.

Komentiraj

Popunite niže tražene podatke ili kliknite na neku od ikona za prijavu:

WordPress.com Logo

Ovaj komentar pišete koristeći vaš WordPress.com račun. Odjava / Izmijeni )

Twitter picture

Ovaj komentar pišete koristeći vaš Twitter račun. Odjava / Izmijeni )

Facebook slika

Ovaj komentar pišete koristeći vaš Facebook račun. Odjava / Izmijeni )

Google+ photo

Ovaj komentar pišete koristeći vaš Google+ račun. Odjava / Izmijeni )

Spajanje na %s