Discamus continentiam augere, luxuriam coercere
Home -> Publications
Home
  Publications
    
all years
    2019
    2018
    2017
    2016
    2015
    2014
    2013
    2012
    2011
    2010
    2009
    2008
    2007
    2006
    2005
    2004
    theses
    techreports
    presentations
    edited volumes
    conferences
  Awards
  Research
  Teaching
  BLOG
  Miscellaneous
  Full CV [pdf]






  Events








  Past Events





Publications of Torsten Hoefler
Copyright Notice:

The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Alexandros Nikolaos Ziogas, Tal Ben-Nun, Guillermo Indalecio Fernández, Timo Schneider, Mathieu Luisier, Torsten Hoefler:

 A Data-Centric Approach to Extreme-Scale Ab initio Dissipative Quantum Transport Simulations

(In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019)
Gordon Bell Prize finalist

Abstract

The computational efficiency of a state of the art ab initio quantum transport (QT) solver, capable of revealing the coupled electro- thermal properties of atomically-resolved nano-transistors, has been improved by up to two orders of magnitude through a data cen- tric reorganization of the application. The approach yields coarse- and fine-grained data-movement characteristics that can be used for performance and communication modeling, communication- avoidance, and dataflow transformations. The resulting code has been tuned for two top-6 hybrid supercomputers, reaching a sus- tained performance of 85.45 Pflop/s on 4,560 nodes of Summit (42.55% of the peak) in double precision, and 90.89 Pflop/s in mixed precision. These computational achievements enable the restruc- tured QT simulator to treat realistic nanoelectronic devices made of more than 10,000 atoms within a 14× shorter duration than the original code needs to handle a system with 1,000 atoms, on the same number of CPUs/GPUs and with the same physical accuracy.

Documents

 

BibTeX

@inproceedings{,
  author={Alexandros Nikolaos Ziogas and Tal Ben-Nun and Guillermo Indalecio Fernández and Timo Schneider and Mathieu Luisier and Torsten Hoefler},
  title={{A Data-Centric Approach to Extreme-Scale Ab initio Dissipative Quantum Transport Simulations}},
  year={2019},
  month={Nov.},
  booktitle={Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19)},
  source={http://www.unixer.de/~htor/publications/},
}

serving: 34.231.21.123:38546© Torsten Hoefler