Running applications on a hybrid cluster

A. Bogdanov, I. Gankevich, V. Gaiduchok, N. Yuzhanin

A hybrid cluster implies the use of computational devices with radically different architectures. Usually, these are conventional CPU architecture (e.g. x86_64) and GPU architecture (e. g. NVIDIA CUDA). Creating and exploiting such a cluster requires some experience: in order to harness all computational power of the described system and get substantial speedup for computational tasks many factors should be taken into account. These factors consist of hardware characteristics (e.g. network infrastructure, a type of data storage, GPU architecture) as well as software stack (e.g. MPI implementation, GPGPU libraries). So, in order to run scientific applications GPU capabilities, software features, task size and other factors should be considered.

This report discusses opportunities and problems of hybrid computations. Some statistics from tests programs and applications runs will be demonstrated. The main focus of interest is open source applications (e. g. OpenFOAM) that support GPGPU (with some parts rewritten to use GPGPU directly or by replacing li-braries).

There are several approaches to organize heterogeneous computations for different GPU architectures out of which CUDA library and OpenCL framework are compared. CUDA library is becoming quite typical for hy-brid systems with NVIDIA cards, but OpenCL offers portability opportunities which can be a determinant factor when choosing framework for development. We also put emphasis on multi-GPU systems that are often used to build hybrid clusters. Calculations were performed on a hybrid cluster of SPbU computing center.

Bibtex
@article{bogdanov2015hybrid,
  title={Running applications on a hybrid cluster},
  author={A. Bogdanov and I. Gankevich and V. Gaiduchok and N. Yuzhanin},
  publisher={Institute of Computer Science},
  journal={Computer Research and Modeling},
  url={http://crm-en.ics.org.ru/journal/article/2295/},
  year={2015},
  month={01},
  language={english},
  pages={475--483},
  number={3},
  volume={7},
  type={article}
}

Publication: Computer Research and Modeling
Publisher: Institute of Computer Science