• Posted by Intent Media 26 Nov
  • 0 Comments

Distributed Classification with ADMM

Today we presented our paper on ADMM for Hadoop at the IEEE BigData 2013 conference.

The paper describes sadfasfour implementation of Boyd’s ADMM algorithm in Hadoop Map Reduce. We talk about the statistical details of implementing ADMM as well as the nuances of storing state on Hadoop.

In our presentation we present background on the data pipeline we have built at Intent Media and motivate why a Hadoop Map Reduce job is the appropriate run-time for us to use. We mention the alternatives for building distributed logistic regression models, such as sampling the data, Apache Mahout, Vowpal Wabbit, and Spark.

We also discuss alternatives specifically designed for iterative computation on Hadoop, such as HaLoop and Twister.

Our presentation is here: https://speakerdeck.com/pld/distributed-classification-with-admm

You may also read the full paper Practical Distributed Classification using the Alternating Direction Method of Multipliers Algorithm.

The paper describes our open source Hadoop based implementation of the ADMM algorithm and how to use it to compute a distributed logistic regression model

Peter Lubell-Doughtie
Software Engineer

Post Comments 0