Monthly Archive: October 2014.

Apache Hadoop 2.5.0 on Docker

http://blog.sequenceiq.com/blog/2014/08/18/hadoop-2-5-0-docker/ SequenceIQ has dockerized most of the Hadoop ecosystem (MR2, Spark, Storm, Hive, HBase, Pig, Oozie, etc.), on bare metal and in the cloud as well. Besides this Hadoop image, SequenceIQ has released and maintains … Continue reading

Posted in Uncategorized | Leave a comment

All-pairs similarity via DIMSUM

We are often interested in finding users, hashtags and ads that are very similar to one another, so they may be recommended and shown to users and advertisers. To do this, we must consider many pairs of items, and evaluate … Continue reading

Posted in Uncategorized | Leave a comment

Hadoop High Availability

HDFS NameNode High Availability

The HDFS NameNode High Availability feature enables you to run redundant NameNodes in the same cluster in an Active/Passive configuration with a hot standby. This eliminates the NameNode as a potential single point of failure (SPOF) in … Continue reading

Posted in Uncategorized | Leave a comment

Anomaly Detection with Apache Spark

This phenomenal lecture shows recent IT advances that make it practical to apply mathematical analysis, and with only a few lines of code!

Posted in Uncategorized | 2 comments

Introducing the Total Data Warehouse

Posted in Uncategorized | Leave a comment

Spark and Hive Integration

There has been considerable excitement about Spark since it became an Apache top-level project. The reasons are that it can run up to a hundred times faster than Hadoop MapReduce for in-memory workloads, can run on a Hadoop 2 YARN cluster, can read any existing … Continue reading

Posted in Uncategorized | Leave a comment

Scalable Collaborative Filtering with Spark MLlib

http://databricks.com/blog/2014/07/23/scalable-collaborative-filtering-with-spark-mllib.html Recommendation systems are among the most popular applications of machine learning. The idea is to predict whether a customer would like a certain item: a product, a movie, or a song. Scale is a key concern for recommendation systems, … Continue reading

Posted in Uncategorized | Leave a comment