High-Performance Distributed ML at Scale through Parameter Server Consistency Models

8 years 8 months ago

Download www.cs.cmu.edu

As Machine Learning (ML) applications embrace greater data size and model complexity, practitioners turn to distributed clusters to satisfy the increased computational and memory demands. Effective use of clusters for ML programs requires considerable expertise in writing distributed code, but existing highlyabstracted frameworks like Hadoop that pose low barriers to distributed-programming have not, in practice, matched the performance seen in highly specialized and advanced ML implementations. The recent Parameter Server (PS) paradigm is a middle ground between these extremes, allowing easy conversion of single-machine parallel ML programs into distributed ones, while maintaining high throughput through relaxed “consistency models” that allow asynchronous (and, hence, inconsistent) parameter reads. However, due to insufﬁcient theoretical study, it is not clear which of these consistency models can really ensure correct ML algorithm output; at the same time, there remain many t...

Wei Dai, Abhimanu Kumar, Jinliang Wei, Qirong Ho,

Real-time Traffic

AAAI 2015 | Intelligent Agents |

claim paper

Post Info
More Details (n/a)

Added	27 Mar 2016
Updated	27 Mar 2016
Type	Journal
Year	2015
Where	AAAI
Authors	Wei Dai, Abhimanu Kumar, Jinliang Wei, Qirong Ho, Garth A. Gibson, Eric P. Xing

Comments (0)

Sciweavers

High-Performance Distributed ML at Scale through Parameter Server Consistency Models

AAAI 2015 | Intelligent Agents |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers