Home Publications all years 2018 2017 2016 2015 2014 2013 2012 2011 2010 2009 2008 2007 2006 2005 2004 theses techreports presentations edited volumes conferences Awards Research Teaching BLOG Miscellaneous Full CV [pdf]
Events

Past Events
|
Publications of Torsten Hoefler
Copyright Notice:
The documents distributed by this server have been provided by the
contributing authors as a means to ensure timely dissemination of
scholarly and technical work on a noncommercial basis. Copyright and all
rights therein are maintained by the authors or by other copyright
holders, notwithstanding that they have offered their works here
electronically. It is understood that all persons copying this
information will adhere to the terms and constraints invoked by each
author's copyright. These works may not be reposted without the explicit
permission of the copyright holder.
P. Balaji, D. Buntinas, D. Goodell, W. Gropp, T. Hoefler, S. Kumar, E. Lusk, R. Thakur and J. L. Traeff:
| | MPI on Millions of Cores
(Parallel Processing Letters (PPL). Vol 21, Nr. 1, pages 45-60, World Scientific Publishing Company, Mar. 2011)
AbstractPetascale parallel computers with more than a million processing cores are expected to
be available in a couple of years. Although MPI is the dominant programming interface today for large-scale systems that at the highest end already have close to 300,000
processors, a challenging question to both researchers and users is whether MPI will
scale to processor and core counts in the millions. In this paper, we examine the issue
of scalability of MPI to very large systems. We first examine the MPI specification itself
and discuss areas with scalability concerns and how they can be overcome. We then
investigate issues that an MPI implementation must address in order to be scalable. To
illustrate the issues, we ran a number of simple experiments to measure MPI memory
consumption at scale up to 131,072 processes, or 80%, of the IBM Blue Gene/P system
at Argonne National Laboratory. Based on the results, we identified nonscalable aspects
of the MPI implementation and found ways to tune it to reduce its memory footprint.
We also briefly discuss issues in application scalability to large process counts and features of MPI that enable the use of other techniques to alleviate scalability limitations
in applications.
Documentsdownload article:  | | BibTeX | @article{mpi-millions-cores, author={P. Balaji and D. Buntinas and D. Goodell and W. Gropp and T. Hoefler and S. Kumar and E. Lusk and R. Thakur and J. L. Traeff}, title={{MPI on Millions of Cores}}, journal={Parallel Processing Letters (PPL)}, year={2011}, month={Mar.}, pages={45-60}, volume={21}, number={1}, publisher={World Scientific Publishing Company}, source={http://www.unixer.de/~htor/publications/}, } |
|
|