Subordination: Cluster management without distributed consensus

Ivan Gankevich, Yuri Tipikin, Vladimir Gaiduchok

Nowadays, many cluster management systems rely on distributed consensus algorithms to elect a leader that orchestrates subordinate nodes. Contrary to these studies we propose consensus-free algorithm that arranges cluster nodes into multiple levels of subordination. The algorithm structures IP address range of cluster network so that each node has ranked list of candidates, from which it chooses a leader. The results show that this approach easily scales to a large number of nodes due to its asynchronous nature, and enables fast recovery from node failures as they occur only on one level of hierarchy. Multiple levels of subordination are useful for efficiently collecting monitoring and accounting data from large number of nodes, and for scheduling general-purpose tasks on a cluster.

Bibtex
@inproceedings{gankevich2015subord,
  title={Subordination: Cluster management without distributed consensus},
  author={Ivan Gankevich and Yuri Tipikin and Vladimir Gaiduchok},
  booktitle={Proceedings of HPCS'15},
  year={2015},
  month={01},
  language={english},
  note={Outstanding poster paper award.},
  doi={10.1109/HPCSim.2015.7237106},
  pages={639--642},
  type={inproceedings}
}

Publication: Proceedings of HPCS'15