Publications of Torsten Hoefler
Copyright Notice:

The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.



Peer-Reviewed Conference or Journal Articles

PVLDB'20
[1] Claude Barthels, Ingo Müller, Konstantin Taranov, Torsten Hoefler, Gustavo Alonso:
 Strong consistency is not hard to get: Two-Phase Locking and Two-Phase Commit on Thousands of Cores In Proceedings of the VLDB Endowment, Vol. 12, No. 13, VLDB Endowment, Sep. 2020,
ML4PS'19
[2] Peter Grönquist, Tal Ben-Nun, Nikoli Dryden, Peter Dueben, Luca Lavarini, Shigang Li, Torsten Hoefler:
 Predicting Weather Uncertainty with Deep Convnets In Machine Learning and the Physical Sciences Workshop at the 33rd Conference on Neural Information Processing Systems (NeurIPS), presented in Vancouver, BC, Canada, Dec. 2019,
SC19
[3] Cedric Renggli, Dan Alistarh, Mehdi Aghagolzadeh, Torsten Hoefler:
 SparCML: High-Performance Sparse Communication for Machine Learning In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[4] Tiziano De Matteis, Johannes de Fine Licht, Jakub Beránek, Torsten Hoefler:
 Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[5] Alexandros Nikolaos Ziogas, Tal Ben-Nun, Guillermo Indalecio Fernández, Timo Schneider, Mathieu Luisier, Torsten Hoefler:
 Optimizing the Data Movement in Quantum Transport Simulations via Data-Centric Parallel Programming In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[6] Alexandros Nikolaos Ziogas, Tal Ben-Nun, Guillermo Indalecio Fernández, Timo Schneider, Mathieu Luisier, Torsten Hoefler:
 A Data-Centric Approach to Extreme-Scale Ab initio Dissipative Quantum Transport Simulations In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, Gordon Bell Prize finalist
SC19
[7] Salvatore Di Girolamo, Konstantin Taranov, Andreas Kurth, Michael Schaffner, Timo Schneider, Jakub Beránek, Maciej Besta, Luca Benini, Duncan Roweth, Torsten Hoefler:
 Network-Accelerated Non-Contiguous Memory Transfers In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[8] Daniele De Sensi, Salvatore Di Girolamo, Torsten Hoefler:
 Mitigating Network Noise on Dragonfly Networks through Application-Aware Routing In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[9] Tal Ben-Nun, Johannes de Fine Licht, Alexandros Nikolaos Ziogas, Timo Schneider, Torsten Hoefler:
 Stateful Dataflow Multigraphs: A Data-Centric Model for Performance Portability on Heterogeneous Architectures In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[10] Maciej Besta, Simon Weber, Lukas Gianinazzi, Robert Gerstenberger, Andrey Ivanov, Yishai Oltchik, Torsten Hoefler:
 Slim Graph: Practical Lossy Graph Compression for Approximate Graph Processing, Storage, and Analytics In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344)) Best Paper Finalist, Best Student Paper Finalist
SC19
[11] Grzegorz Kwasniewski, Marko Kabić, Maciej Besta, Joost VandeVondele, Raffaele Solcà, Torsten Hoefler:
 Red-Blue Pebbling Revisited: Near Optimal Parallel Matrix-Matrix Multiplication In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344)) Best Paper Finalist, Best Student Paper Finalist
PACT'19
[12] Tobias Gysi, Tobias Grosser, Torsten Hoefler:
 Absinthe: Learning an Analytical Performance Model to Fuse and Tile Stencil Codes in One Shot In Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques (PACT), presented in Seattle, WA, USA, IEEE, Sep. 2019,
ACM CSUR
[13] Tal Ben-Nun, Torsten Hoefler:
 Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis ACM Comput. Surv. Vol 52, Nr. 4, pages 65:1--65:43, ACM, ISSN: 0360-0300, Aug. 2019,
IEEE TPDS
[14] Sergei Shudler, Yannick Berens, Alexandru Calotoiu, Torsten Hoefler, Alexandre Strube, Felix Wolf:
 Engineering Algorithms for Scalability through Continuous Validation of Performance Expectations IEEE Transactions on Parallel and Distributed Systems (TPDS). Vol 30, Nr. 8, IEEE, Jul. 2019,
arXiv
[15] Tiziano De Matteis, Johannes de Fine Licht, Torsten Hoefler:
 FBLAS: Streaming Linear Algebra on FPGA CoRR. Vol abs/1907.07929, Jul. 2019,
PASC'19
[16] Felix Thaler, Stefan Moosbrugger, Carlos Osuna, Mauro Bianco, Hannes Vogt, Anton Afanasyev, Lukas Mosimann, Oliver Fuhrer, Thomas Schulthess, Torsten Hoefler:
 Porting the COSMO Weather Model to Intel KNL presented in Zurich, Switzerland, ACM, Jun. 2019, Accepted at the ACM Platform for Advanced Scientific Computing Conference (PASC19)
DAC'19
[17] Niels Gleinig, Frances Ann Hubis, Torsten Hoefler:
 Embedding Functions Into Reversible Circuits: A Probabilistic Approach to the Number of Lines In Proceedings of the 56th Annual Design Automation Conference, presented in Las Vegas, NV, USA, ACM, ISBN: 978-1-4503-6725-7/19/06, Jun. 2019,
PLDI'19
[18] Tobias Gysi, Tobias Grosser, L. Brandner, Torsten Hoefler:
 A Fast Analytical Model of Fully Associative Caches In Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, presented in Phoenix, AZ, USA, pages 816--829, ACM, ISBN: 978-1-4503-6712-7, Jun. 2019,
ICS'19
[19] Paul R. Eller, Torsten Hoefler, William Gropp:
 Using Performance Models to Understand Scalable Krylov Solver Performance at Scale for Structured Grid Problems In Proceedings of the 2019 ACM International Conference on Supercomputing (ICS'19), presented in Phoenix, AZ, ACM, Jun. 2019,
IPDPS'19
[20] Salvatore Di Girolamo, P. Schmid, Thomas Schulthess, Torsten Hoefler:
 SimFS: A Simulation Data Virtualizing File System Interface In Proceedings of the 33rd IEEE International Parallel & Distributed Processing Symposium (IPDPS'19), presented in Rio de Janeiro, Brazil, IEEE, May 2019,
IPDPS'19
[21] Tal Ben-Nun, Maciej Besta, S. Huber, Alexandros Nikolaos Ziogas, D. Peter, Torsten Hoefler:
 A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning IEEE, May 2019, Accepted at the 33rd IEEE International Parallel & Distributed Processing Symposium (IPDPS'19)
PPoPP'19
[22] Martin Kuettler, Maksym Planeta, Jan Bierbaum, Carsten Weinhold, Hermann Haertig, Amnon Barak, Torsten Hoefler:
 Corrected Trees for Reliable Group Communication Feb. 2019, Accepted at The ACM Conference Principles and Practice of Parallel Programming 2019 (PPoPP'19) (acceptance rate: 19% (29/152))
FPGA'19
[23] Maciej Besta, Marc Fischer, Tal Ben-Nun, Johannes de Fine Licht, Torsten Hoefler:
 Substream-Centric Maximum Matchings on FPGA Feb. 2019, In Proceedings of the 27th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (acceptance rate: 23%) Best Paper Finalist (4/30)
arXiv
[24] Maciej Besta, Dimitri Stanojevic, Johannes de Fine Licht, Tal Ben-Nun, Torsten Hoefler:
 Graph Processing on FPGAs: Taxonomy, Survey, Challenges CoRR. Vol abs/1903.06697, Feb. 2019,
CiSE
[25] Thomas Schulthess, P. Bauer, Oliver Fuhrer, Torsten Hoefler, C. Schaer, N. Wedi:
 Reflecting on the goal and baseline for exascale computing: a roadmap based on weather and climate simulations Computing in Science and Engineering (CiSE). Vol 21, Nr. 1, IEEE Computer Society, ISSN: 1521-9615, Jan. 2019,
NIPS'18
[26] Tal Ben-Nun, Alice Shoshana Jakobovits, Torsten Hoefler:
 Neural Code Comprehension: A Learnable Representation of Code Semantics In Advances in Neural Information Processing Systems 31, presented in Montreal, Canada, pages 3589--3601, Curran Associates, Inc., Dec. 2018,
NIPS'18
[27] Dan Alistarh, Torsten Hoefler, Mikael Johansson, Sarit Khirirat, Nikola Konstantinov, Cedric Renggli:
 The Convergence of Sparsified Gradient Methods In Advances in Neural Information Processing Systems 31, presented in Montreal, Canada, Curran Associates, Inc., Dec. 2018,
PACT'18
[28] Maciej Besta, Dimitri Stanojevic, T. Zivic, J. Singh, M. Hoerold, Torsten Hoefler:
 Log(Graph): A Near-Optimal High-Performance Graph Representation presented in Limassol, Cyprus, ACM, Nov. 2018, Accepted at the 27th International Conference on Parallel Architectures and Compilation (PACT'18)
SC18
[29] Heng Lin, Xiaowei Zhu, Bowen Yu, Xiongchao Tang, Wei Xue, Wenguang Chen, Lufei Zhang, Torsten Hoefler, Xiaosong Ma, Xin Liu, Weimin Zheng, Jingfang Xu:
 ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC18), presented in Denver, CO, USA, ACM, Nov. 2018, Gordon Bell Award Finalist
CACM
[30] Robert Gerstenberger, Maciej Besta, Torsten Hoefler:
 Enabling Highly-Scalable Remote Memory Access Programming with MPI-3 One Sided In Communications of the ACM, ACM, Oct. 2018, Research Highlights
Cluster'18
[31] Y. Oyama, Tal Ben-Nun, Torsten Hoefler, Satoshi Matsuoka:
 Accelerating Deep Learning Frameworks with Micro-batches In IEEE International Conference on Cluster Computing, CLUSTER 2018, Belfast, UK, September 10-13, 2018, presented in Belfast, UK, IEEE, ISBN: 978-1-5386-8319-4, Sep. 2018, (28% (44/154))
Cluster'18
[32] Alexandru Calotoiu, Alexander Graf, Torsten Hoefler, Daniel Lorenz, Sebastian Rinke, Felix Wolf:
 Lightweight Requirements Engineering for Exascale Co-design In IEEE International Conference on Cluster Computing, CLUSTER 2018, Belfast, UK, September 10-13, 2018, presented in Belfast, UK, IEEE, ISBN: 978-1-5386-8319-4, Sep. 2018, (28% (44/154))
arXiv
[33] Maciej Besta, Torsten Hoefler:
 Survey and Taxonomy of Lossless Graph Compression and Space-Efficient Graph Representations CoRR. Vol abs/1806.01799, Jun. 2018,
GMD
[34] Oliver Fuhrer, T. Chadha, Torsten Hoefler, Grzegorz Kwasniewski, X. Lapillonne, D. Leutwyler, D. Luethi, Carlos Osuna, C. Schaer, Thomas Schulthess, Hannes Vogt:
 Near-global climate simulation at 1 km resolution: establishing a performance baseline on 4888 GPUs with COSMO 5.0 Geoscientific Model Development. Vol 11, Nr. 4, Copernicus Publications, May 2018,
arXiv
[35] Johannes de Fine Licht, Maciej Besta, S. Meierhans, Torsten Hoefler:
 Transformations of High-Level Synthesis Codes for High-Performance Computing CoRR. Vol abs/1805.08288, May 2018,
EuroSys'18
[36] Konstantin Taranov, Gustavo Alonso, Torsten Hoefler:
 Fast and strongly-consistent per-item resilience in key-value stores ISBN: 978-1-4503-5584-1/18/04, Apr. 2018, EuroSys '18: Thirteenth EuroSys Conference 2018, April 23--26, 2018, Porto, Portugal (acceptance rate: 16% (43/262))
IEEE TPDS
[37] Shigang Li, Yunquan Zhang, Torsten Hoefler:
 Cache-Oblivious MPI All-to-All Communications Based on Morton Order IEEE Transactions on Parallel and Distributed Systems (TPDS). Vol 29, Nr. 3, IEEE, Mar. 2018,
ASPLOS'18
[38] Maciej Besta, S. M. Hassan, S. Yalamanchili, R. Ausavarungnirun, Onur Mutlu, Torsten Hoefler:
 Slim NoC: A Low-Diameter On-Chip Network Topology for High Energy Efficiency and Scalability Mar. 2018, Accepted at the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'18)
PPoPP'18
[39] Lukas Gianinazzi, Pavel Kalvoda, Alessandro De Palma, Maciej Besta, Torsten Hoefler:
 Communication-Avoiding Parallel Minimum Cuts and Connected Components Feb. 2018, Accepted at The ACM Conference Principles and Practice of Parallel Programming 2018 (PPoPP'18) (acceptance rate: 20% (28/138))
PPoPP'18
[40] Johannes de Fine Licht, M. Blott, Torsten Hoefler:
 Designing scalable FPGA architectures using high-level synthesis In Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, presented in Vienna, Austria, pages 403--404, ACM, ISBN: 978-1-4503-4982-6, Feb. 2018,
VMCAI
[41] Cedric Baumann, Andrei Marian Dan, Yuri Meshman, Torsten Hoefler, Martin Vechev:
 Automatic Verification of RMA Programs via Abstraction Extrapolation Springer International Publishing, Feb. 2018,
ICDE'18
[42] Ingo Mueller, Andrea Arteaga, Torsten Hoefler, Gustavo Alonso:
 Reproducible Floating-Point Aggregation in RDBMSs Feb. 2018, In Proceedings of the 2018 IEEE 34th International Conference on Data Engineering
SC17
[43] Edgar Solomonik, Maciej Besta, F. Vella, Torsten Hoefler:
 Scaling Betweenness Centrality using Communication-Efficient Sparse Matrix Multiplication Nov. 2017, Accepted at The International Conference for High Performance Computing, Networking, Storage and Analysis (SC'17) (acceptance rate: 18% (61/327))
SC17
[44] Torsten Hoefler, Salvatore Di Girolamo, Konstantin Taranov, R. E. Grant, Ron Brightwell:
 sPIN: High-performance streaming Processing in the Network In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC17), Nov. 2017, (acceptance rate: 18% (61/327)) Best Paper Finalist at SC17 (5/61)
IEEE TPDS
[45] Didem Unat, Anshu Dubey, Torsten Hoefler, John Shalf, Mark Abraham, Mauro Bianco, Bradford L. Chamberlain, Romain Cledat, H. Carter Edwards, Hal Finkel, Karl Fuerlinger, Frank Hannig, Emmanuel Jeannot, Amir Kamil, Jeff Keasler, Paul H J Kelly, Vitus Leung, Hatem Ltaief, Naoya Maruyama, Chris J. Newburn, Miquel Pericas:
 Trends in Data Locality Abstractions for HPC Systems IEEE Transactions on Parallel and Distributed Systems (TPDS). Vol 28, Nr. 10, IEEE, Oct. 2017,
VLDB'17
[46] C. Barthels, Timo Schneider, Ingo Mueller, Gustavo Alonso, Torsten Hoefler:
 Distributed Join Algorithms on Thousands of Cores Vol 10, Nr. 5, In Proc. VLDB Endow., presented in Munich, Germany, pages 517--528, VLDB Endowment, ISSN: 2150-8097, Aug. 2017,
HOTI'17
[47] P. Yebenes, J. Escudero-Sahuquillo, P. J. Garcia, F. J. Quiles, Torsten Hoefler:
 Improving Non-Minimal and Adaptive Routing Algorithms in Slim Fly Networks In Proceedings of the 25th Annual Symposium on High-Performance Interconnects (HOTI'17), Aug. 2017, Best Student Paper at HOTI'17
HOTI'17
[48] Timo Schneider, J. Dinan, M. Flajslik, K. D. Underwood, Torsten Hoefler:
 Fast Networks and Slow Memories: A Mechanism for Mitigating Bandwidth Mismatches In Proceedings of the 25th Annual Symposium on High-Performance Interconnects (HOTI'17), Aug. 2017,
HPDC'17
[49] Marius Poke, Torsten Hoefler, C. W. Glass:
 AllConcur: Leaderless Concurrent Atomic Broadcast presented in Washington, DC, USA, ACM, Jun. 2017, (acceptance rate: 19%)
HPDC'17
[50] Maciej Besta, M. Podstawski, L. Groner, Edgar Solomonik, Torsten Hoefler:
 To Push or To Pull: On Reducing Communication and Synchronization in Graph Computations In Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'17), presented in Washington, DC, USA, ACM, Jun. 2017, (acceptance rate: 19%)
ICCS'17
[51] Andrea Arteaga, Oliver Fuhrer, Torsten Hoefler, Thomas Schulthess:
 Model-Driven Choice of Numerical Methods for the Solution of the Linear Advection Equation In Proceedings of the International Conference on Computational Science (ICCS'17), presented in Zurich, Switzerland, Elsevier, Jun. 2017,
SPAA'17
[52] Edgar Solomonik, Grey Ballard, James Demmel, Torsten Hoefler:
 A Communication-Avoiding Parallel Algorithm for the Symmetric Eigenvalue Problem Nr. 11, In Proceedings of the 29th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA'17), presented in Washington, DC, USA, pages 111--121, ACM, ISBN: 978-1-4503-4593-4, Jun. 2017,
IPDPS'17
[53] Maciej Besta, F. Marending, Edgar Solomonik, Torsten Hoefler:
 SlimSell: A Vectorized Graph Representation for Breadth-First Search In Proceedings of the 31st IEEE International Parallel & Distributed Processing Symposium (IPDPS'17), presented in Orlando, FL, USA, IEEE, May 2017, (acceptance rate: 22%, 116/516)
IPDPS'17
[54] Salvatore Di Girolamo, F. Vella, Torsten Hoefler:
 Transparent Caching for RMA Systems In Proceedings of the 31st IEEE International Parallel & Distributed Processing Symposium (IPDPS'17), presented in Orlando, FL, USA, IEEE, May 2017, (acceptance rate: 22%, 116/516)
IPDPS'17
[55] Torsten Hoefler, Amnon Barak, A. Shiloh, Z. Drezner:
 Corrected Gossip Algorithms for Fast Reliable Broadcast on Unreliable Systems In Proceedings of the 31st IEEE International Parallel & Distributed Processing Symposium (IPDPS'17), presented in Orlando, FL, USA, IEEE, May 2017, (acceptance rate: 22%, 116/516)
IPDPS'17
[56] T. Wicky, Edgar Solomonik, Torsten Hoefler:
 Communication-Avoiding Parallel Algorithms for Solving Triangular Systems of Linear Equations In Proceedings of the 31st IEEE International Parallel & Distributed Processing Symposium (IPDPS'17), presented in Orlando, FL, USA, IEEE, May 2017, (acceptance rate: 22%, 116/516)
IPDPS'17
[57] Sabela Ramos, Torsten Hoefler:
 Capability Models for Manycore Memory Systems: A Case-Study with Xeon Phi KNL In Proceedings of the 31st IEEE International Parallel & Distributed Processing Symposium (IPDPS'17), presented in Orlando, FL, USA, IEEE, May 2017, (acceptance rate: 22%, 116/516)
CIAC'17
[58] K. T. Foerster, L. Groner, Torsten Hoefler, M. Koenig, S. Schmid, R. Wattenhofer:
 Multi-agent Pathfinding with n Agents on Graphs with n Vertices: Combinatorial Classification and Tight Algorithmic Bounds In Algorithms and Complexity - 10th International Conference, CIAC 2017, Athens, Greece, May 24-26, 2017, Proceedings, presented in Athens, Greece, May 2017,
TCDE
[59] C. Barthels, Gustavo Alonso, Torsten Hoefler:
 Designing Databases for Future High-Performance Networks IEEE Technical Committee on Data Engineering. Vol 40, Nr. 1, IEEE, Mar. 2017,
PPoPP'17
[60] Sergei Shudler, Alexandru Calotoiu, Torsten Hoefler, Felix Wolf:
 Isoefficiency in Practice: Configuring and Understanding the Performance of Task-based Applications In Proceedings of the 22nd ACM SIGPLAN symposium on Principles and practice of parallel programming, presented in College Station, TX, ACM, Feb. 2017, (acceptance rate: 21%, 29/139)
SC16
[61] M. Martinasso, Grzegorz Kwasniewski, S. R. Alam, Thomas Schulthess, Torsten Hoefler:
 A PCIe Congestion-Aware Performance Model for Densely Populated Accelerator Servers In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC16), presented in Salt Lake City, Utah, pages 63:1--63:11, IEEE Press, ISBN: 978-1-4673-8815-3, Nov. 2016, (acceptance rate: 18% (82/446))
SC16
[62] W. Tang, B. Wang, S. Ethier, Grzegorz Kwasniewski, Torsten Hoefler, K. Z. Ibrahim, K. Madduri, S. Williams, Leonid Oliker, C. Rosales-Fernandez, T. Williams:
 Extreme Scale Plasma Turbulence Simulations on Top Supercomputers Worldwide In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC16), presented in Salt Lake City, Utah, pages 43:1--43:12, IEEE Press, ISBN: 978-1-4673-8815-3, Nov. 2016, (acceptance rate: 18% (82/446))
SC16
[63] Jens Domke, Torsten Hoefler:
 Scheduling-Aware Routing for Supercomputers Nov. 2016, Accepted at The International Conference for High Performance Computing, Networking, Storage and Analysis (SC'16) (acceptance rate: 18% (82/446))
SC16
[64] Tobias Gysi, J. Baer, Torsten Hoefler:
 dCUDA: Hardware Supported Overlap of Computation and Communication In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC16), presented in Salt Lake City, Utah, pages 52:1--52:12, IEEE Press, ISBN: 978-1-4673-8815-3, Nov. 2016, (acceptance rate: 18% (82/446))
OOPSLA'16
[65] Andrei Marian Dan, Patrick Lam, Torsten Hoefler, Martin Vechev:
 Modeling and Analysis of Remote Memory Access Programming In Proceedings of the 2016 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, presented in Amsterdam, Netherlands, pages 129--144, ACM, ISBN: 978-1-4503-4444-9, Nov. 2016, Outstanding Paper Award at OOPSLA'16 (4/52)
Cluster'16
[66] Alexandru Calotoiu, D. Beckingsale, C. W. Earl, Torsten Hoefler, I. Karlin, M. Schulz, Felix Wolf:
 Fast Multi-Parameter Performance Modeling Oct. 2016, Accepted at IEEE International Conference on Cluster Computing (Cluster'16) (acceptance rate: 24% (39/162))
HOTI'16
[67] Timo Schneider, O. Bibartiu, Torsten Hoefler:
 Ensuring Deadlock-Freedom in Low-Diameter InfiniBand Networks In Proceedings of the 24th Annual Symposium on High-Performance Interconnects (HOTI'16), Aug. 2016, Best Student Paper at HOTI'16
IEEE MICRO
[68] Salvatore Di Girolamo, P. Jolivet, K. D. Underwood, Torsten Hoefler:
 Exploiting Offload Enabled Network Interfaces IEEE MICRO. Vol 36, Nr. 4, IEEE, Jul. 2016,
HPDC'16
[69] Jens Domke, Torsten Hoefler, Satoshi Matsuoka:
 Routing on the Dependency Graph: A New Approach to Deadlock-Free High-Performance Routing In Proceedings of the 25th Symposium on High-Performance Parallel and Distributed Computing (HPDC'16), Jun. 2016, (acceptance rate: 16% (20/129))
HPDC'16
[70] P. Schmid, Maciej Besta, Torsten Hoefler:
 High-Performance Distributed RMA Locks In Proceedings of the 25th Symposium on High-Performance Parallel and Distributed Computing (HPDC'16), Jun. 2016, (acceptance rate: 16% (20/129)) Karsten Schwan Best Paper Award at HPDC'16 (1/20)
ICS'16
[71] Tobias Grosser, Torsten Hoefler:
 Polly-ACC: Transparent compilation to heterogeneous hardware In Proceedings of the 30th International Conference on Supercomputing (ICS'16), Jun. 2016, (acceptance rate: 24% (43/178))
PASC'16
[72] Torsten Hoefler:
 Selecting Technical Papers for an Interdisciplinary Conference: The PASC Review Process In Proceedings of the 3rd Platform of Advanced Scientific Computing Conference (PASC'16), Jun. 2016,
IJHPCA
[73] P. M. Widener, S. Levy, K. B. Ferreira, Torsten Hoefler:
 On noise and the performance benefit of nonblocking collectives The International Journal of High Performance Computing Applications. Vol 30, Nr. 1, pages 121-133, Sage, ISSN: 1094-3420, Jan. 2016, accepted for publication on Nov. 2nd 2015
IEEE TPDS
[74] Sabela Ramos, Torsten Hoefler:
 Cache Line Aware Algorithm Design for Cache-Coherent Architectures IEEE Transactions on Parallel and Distributed Systems (TPDS). Vol PP, Nr. 99, IEEE, Jan. 2016,
SC15
[75] Torsten Hoefler, Roberto Belli:
 Scientific Benchmarking of Parallel Computing Systems presented in Austin, TX, USA, pages 73:1--73:12, ACM, ISBN: 978-1-4503-3723-6, Nov. 2015, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC15) (acceptance rate: 22%, 79/358)
SC15
[76] G. Kathareios, C. Minkenberg, B. Prisacari, G. Rodriguez, Torsten Hoefler:
 Cost-Effective Diameter-Two Topologies: Analysis and Evaluation presented in Austin, TX, USA, ACM, ISBN: 978-1-4503-3723-6, Nov. 2015, In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC15) (acceptance rate: 22%, 79/358)
PACT'15
[77] A. Bhattacharyya, Grzegorz Kwasniewski, Torsten Hoefler:
 Using Compiler Techniques to Improve Automatic Performance Modeling presented in San Francisco, CA, USA, ACM, Oct. 2015, Accepted at the 24th International Conference on Parallel Architectures and Compilation (PACT'15) (acceptance rate: 21%, 38/179)
PACT'15
[78] H. Schweizer, Maciej Besta, Torsten Hoefler:
 Evaluating the Cost of Atomic Operations on Modern Architectures presented in San Francisco, CA, USA, ACM, Oct. 2015, Accepted at the 24th International Conference on Parallel Architectures and Compilation (PACT'15) (acceptance rate: 21%, 38/179)
HOTI'15
[79] Salvatore Di Girolamo, P. Jolivet, K. D. Underwood, Torsten Hoefler:
 Exploiting Offload Enabled Network Interfaces In Proceedings of the 23rd Annual Symposium on High-Performance Interconnects (HOTI'15), presented in Oracle Santa Clara Campus, CA, USA, IEEE, Aug. 2015, Best Student Paper at HOTI'15
ICS'15
[80] Sergei Shudler, Alexandru Calotoiu, Torsten Hoefler, Alexandre Strube, Felix Wolf:
 Exascaling Your Library: Will Your Implementation Meet Your Expectations? In Proceedings of the 29th International Conference on Supercomputing (ICS'15), presented in Newport Beach, CA, USA, pages 161--175, ACM, ISBN: 978-1-4503-3559-1, Jun. 2015, (acceptance rate: 25% (40/160))
HPDC'15
[81] Marius Poke, Torsten Hoefler:
 DARE: High-Performance State Machine Replication on RDMA Networks In Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'15), presented in Portland, OR, USA, pages 107--118, ACM, ISBN: 978-1-4503-3550-8, Jun. 2015, (acceptance rate: 16% (19/116))
ICS'15
[82] Tobias Gysi, Tobias Grosser, Torsten Hoefler:
 MODESTO: Data-centric Analytic Optimization of Complex Stencil Programs on Heterogeneous Architectures In Proceedings of the 29th International Conference on Supercomputing (ICS'15), presented in Newport Beach, CA, USA, pages 177--186, ACM, ISBN: 978-1-4503-3559-1, Jun. 2015, (acceptance rate: 25% (40/160))
ICS'15
[83] Maciej Besta, Torsten Hoefler:
 Active Access: A Mechanism for High-Performance Distributed Data-Centric Computations In Proceedings of the 29th International Conference on Supercomputing (ICS'15), presented in Newport Beach, CA, USA, pages 155--164, ACM, ISBN: 978-1-4503-3559-1, Jun. 2015, (acceptance rate: 25% (40/160))
HPDC'15
[84] Maciej Besta, Torsten Hoefler:
 Accelerating Irregular Computations with Hardware Transactional Memory and Active Messages In Proceedings of the 24th Symposium on High-Performance Parallel and Distributed Computing (HPDC'15), presented in Portland, OR, USA, pages 161--172, ACM, ISBN: 978-1-4503-3550-8, Jun. 2015, (acceptance rate: 16% (19/116)) Best Paper at HPDC'15 (1/19)
HPDC'15
[85] Sabela Ramos, Torsten Hoefler:
 Cache Line Aware Optimizations for ccNUMA Systems In Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'15) (short paper), presented in Portland, OR, USA, pages 85--88, ACM, ISBN: 978-1-4503-3550-8, Jun. 2015,
IPDPS'15
[86] Roberto Belli, Torsten Hoefler:
 Notified Access: Extending Remote Memory Access Programming Models for Producer-Consumer Synchronization In Proceedings of the 29th IEEE International Parallel & Distributed Processing Symposium (IPDPS'15), presented in Hyderabad, India, IEEE, May 2015, (acceptance rate: 21.8%, 108/496) Best Paper at IPDPS'15 (4/108)
HotOS XV
[87] Torsten Hoefler, R. Ross, T. Roscoe:
 Distributing the Data Plane for Remote Storage Access presented in Kartause Ittingen, Switzerland, USENIX, May 2015, Proceedings of the 15th Workshop on Hot Topics in Operating Systems (acceptance rate: 32% (29/90))
CFI'15
[88] T. Lee, C. Pappas, C. Basescu, J. Han, Torsten Hoefler, A. Perrig:
 Source-Based Path Selection: The Data Plane Perspective In Proceedings of the 10th International Conference on Future Internet, presented in Seoul, Republic of Korea, pages 41--45, ACM, ISBN: 978-1-4503-3564-5, May 2015,
ACM TOPC
[89] Torsten Hoefler, J. Dinan, Rajeev Thakur, Brian Barrett, P. Balaji, William Gropp, K. Underwood:
 Remote Memory Access Programming in MPI-3 ACM Transactions on Parallel Computing (TOPC). ACM, Jan. 2015, accepted for publication on Dec. 4th
SC14
[90] Jens Domke, Torsten Hoefler, Satoshi Matsuoka:
 Fail-in-Place Network Design: Interaction between Topology, Routing Algorithm and Failures presented in New Orleans, LA, USA, Nov. 2014, Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC14) (acceptance rate: 21%, 82/394)
SC14
[91] K. B. Ferreira, P. Widener, S. Levy, D. Arnold, Torsten Hoefler:
 Understanding the Effects of Communication and Coordination on Checkpointing at Scale presented in New Orleans, LA, USA, Nov. 2014, Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC14) (acceptance rate: 21%, 82/394)
SC14
[92] Maciej Besta, Torsten Hoefler:
 Slim Fly: A Cost Effective Low-Diameter Network Topology presented in New Orleans, LA, USA, Nov. 2014, Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC14) (acceptance rate: 21%, 82/394) SC14 Best Student Paper (1/82)
Adv MPI
[93] William Gropp, Torsten Hoefler, Rajeev Thakur, E. Lusk:
 Using Advanced MPI: Modern Features of the Message-Passing Interface presented in Cambridge, MA, MIT Press, ISBN: 978-0262527637, Nov. 2014,
JSFI
[94] Torsten Hoefler, D. Moor:
 Energy, Memory, and Runtime Tradeoffs for Implementing Collective Communication Operations Journal of Supercomputing Frontiers and Innovations. Vol 1, Nr. 2, pages 58--75, SuperFri Open Journal, Oct. 2014,
EuroMPI'14
[95] P. Widener, K. Ferreira, S. Levy, Torsten Hoefler:
 Exploring the effect of noise on the performance benefit of nonblocking allreduce In Proceedings of the 21st European MPI Users' Group Meeting, presented in Kyoto, Japan, pages 77:77--77:82, ACM, ISBN: 978-1-4503-2875-3, Sep. 2014, Invited to a journal special issue on top picks from EuroMPI'14.
PACT'14
[96] A. Bhattacharyya, Torsten Hoefler:
 PEMOGEN: Automatic Adaptive Performance Modeling During Program Runtime In Proceedings of the 23rd International Conference on Parallel Architectures and Compilation (PACT'14), presented in Edmonton, Alberta, Canada, pages 393-404, ACM, ISBN: 978-1-4503-2809-8, Aug. 2014,
HPDC'14
[97] B. Prisacari, G. Rodriguez, P. Heidelberger, D. Chen, C. Minkenberg, Torsten Hoefler:
 Efficient Task Placement and Routing in Dragonfly Networks In Proceedings of the 23rd ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC'14), presented in Vancouver, Canada, ACM, Jun. 2014, (acceptance rate: 16%, 21/130)
HPDC'14
[98] Maciej Besta, Torsten Hoefler:
 Fault Tolerance for Remote Memory Access Programming Models In Proceedings of the 23rd ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC'14), presented in Vancouver, Canada, ACM, Jun. 2014, (acceptance rate: 16%, 21/130) Best Paper Nominee at HPDC'14 (3/21)
SPAA'14
[99] Torsten Hoefler, Grzegorz Kwasniewski:
 Automatic Complexity Analysis of Explicitly Parallel Programs In Proceedings of the 26th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA'14), presented in Prague, Czech Republic, ACM, Jun. 2014, (acceptance rate: 25%, 30/122)
Computing
[100] Timo Schneider, Robert Gerstenberger, Torsten Hoefler:
 Application-oriented ping-pong benchmarking: how to assess the real communication overheads Journal of Computing. Vol 96, Nr. 4, pages 279-292, Springer Vienna, ISSN: 0010-485X, Apr. 2014, Special issue on top picks from EuroMPI'12.
IPDPS'14
[101] Andrea Arteaga, Oliver Fuhrer, Torsten Hoefler:
 Designing Bit-Reproducible Portable High-Performance Applications In Proceedings of the 28th IEEE International Parallel and Distributed Processing Symposium (IPDPS), presented in Phoenix, AZ, USA, IEEE Computer Society, Apr. 2014, (acceptance rate: 21.1%, 114/541)
Cluster Computing
[102] Shigang Li, Torsten Hoefler, C. Hu, Marc Snir:
 Improved MPI collectives for MPI processes in shared address spaces Journal of Cluster Computing. pages 1-17, Springer US, ISSN: 1386-7857, Mar. 2014,
ACM TACO
[103] B. Prisacari, G. Rodriguez, C. Minkenberg, Torsten Hoefler:
 Fast Pattern-Specific Routing for Fat Tree Networks ACM Transactions on Architecture and Code Optimization. Vol 10, Nr. 4, presented in New York, NY, USA, pages 36:1--36:25, ACM, ISSN: 1544-3566, Dec. 2013, (acceptance rate: 24% (2011))
SC13
[104] Alexandru Calotoiu, Torsten Hoefler, Marius Poke, Felix Wolf:
 Using Automated Performance Modeling to Find Scalability Bugs in Complex Codes In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC13), presented in Denver, Colorado, USA, pages 45:1--45:12, ACM, ISBN: 978-1-4503-2378-9, Nov. 2013, (acceptance rate: 20%, 92/457)
SC13
[105] Robert Gerstenberger, Maciej Besta, Torsten Hoefler:
 Enabling Highly-Scalable Remote Memory Access Programming with MPI-3 One Sided In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, presented in Denver, Colorado, USA, pages 53:1--53:12, ACM, ISBN: 978-1-4503-2378-9, Nov. 2013, (acceptance rate: 20%, 92/457) Best Student Paper Finalist (8/92) and SC13 Best Paper (1/92)
SC13
[106] A. Friedley, G. Bronevetsky, Andrew Lumsdaine, Torsten Hoefler:
 Hybrid MPI: Efficient Message Passing for Multi-core Systems In IEEE/ACM International Conference on High Performance Computing, Networking, Storage and Analysis (SC13), presented in Denver, Colorado, USA, pages 18:1--18:11, ISBN: 978-1-4503-2378-9, Nov. 2013, (acceptance rate: 20%, 92/457)
PMBS'13
[107] S. Levy, B. Topp, K. Ferreira, D. Arnold, Torsten Hoefler, P. Widener:
 Using Simulation to Evaluate the Performance of Resilience Strategies at Scale presented in Denver, CO, USA, Nov. 2013, Proceedings of the 4th International Workshop in Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS13)
ICPP'13
[108] Timo Schneider, Torsten Hoefler, R. Grant, Brian Barrett, Ron Brightwell:
 Protocols for Fully Offloaded Collective Operations on Accelerated Network Adapters In Parallel Processing (ICPP), 2013 42nd International Conference on, presented in Lyon, France, pages 593-602, ISSN: 0190-3918, Oct. 2013,
EuroMPI'13
[109] Timo Schneider, F. Kjolstad and Torsten Hoefler:
 MPI Datatype Processing using Runtime Compilation In Proceedings of the 20th European MPI Users' Group Meeting, presented in Madrid, Spain, pages 19--24, ACM, ISBN: 978-1-4503-1903-4, Sep. 2013, Best Paper Award at EuroMPI'13 (1/25)
LCPC'13
[110] Timo Schneider, Robert Gerstenberger, Torsten Hoefler:
 Compiler Optimizations for Non-Contiguous Remote Data Movement presented in Santa Clara, CA, USA, Sep. 2013, Proceedings of the 26th International Workshop on Languages and Compilers for Parallel Computing
HPDC'13
[111] Sabela Ramos and Torsten Hoefler:
 Modeling Communication in Cache-Coherent SMP Systems - A Case-Study with Xeon Phi In Proceedings of the 22nd international symposium on High-performance parallel and distributed computing, presented in New York City, NY, USA, pages 97--108, ACM, ISBN: 978-1-4503-1910-2, Jun. 2013, (acceptance rate: 15%, 20/131)
HPDC'13
[112] Shigang Li, Torsten Hoefler and Marc Snir:
 NUMA-Aware Shared Memory Collective Communication for MPI In Proceedings of the 22nd international symposium on High-performance parallel and distributed computing, presented in New York City, NY, USA, pages 85--96, ACM, ISBN: 978-1-4503-1910-2, Jun. 2013, (acceptance rate: 15%, 20/131) Nominated for Best Paper Award at HPDC'13 (3/20)
ICS'13
[113] B. Prisacari, G. Rodriguez, C. Minkenberg and Torsten Hoefler:
 Bandwidth-optimal All-to-all Exchanges in Fat Tree Networks In Proceedings of the 27th ACM International Conference on Supercomputing, presented in Eugene, OR, USA, pages 139--148, ACM, ISBN: 978-1-4503-2130-3, Jun. 2013, (acceptance rate: 21%, 41/198)
Computing
[114] Torsten Hoefler, J. Dinan, D. Buntinas, P. Balaji, Brian Barrett, Ron Brightwell, William Gropp, V. Kale and Rajeev Thakur:
 MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory Journal of Computing. Springer, May 2013, doi: 10.1007/s00607-013-0324-2
PPoPP'13
[115] A. Friedley, Torsten Hoefler, G. Bronevetsky, Andrew Lumsdaine:
 Ownership Passing: Efficient Distributed Memory Programming on Multi-core Systems In Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming, presented in Shenzhen, China, pages 177--186, ACM, ISBN: 978-1-4503-1922-5, Feb. 2013, (acceptance rate: 18%, 26/146)
SC12
[116] Torsten Hoefler, Timo Schneider:
 Optimization Principles for Collective Neighborhood Communications In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, presented in Salt Lake City, Utah, USA, pages 98:1--98:10, IEEE Computer Society Press, ISBN: 978-1-4673-0804-5, Nov. 2012, (acceptance rate: 21%, 100/472)
EuroMPI'12
[117] Timo Schneider, Robert Gerstenberger, Torsten Hoefler:
 Micro-Applications for Communication Data Access Patterns and MPI Datatypes Vol 7490, In Recent Advances in the Message Passing Interface - 19th European MPI Users' Group Meeting, EuroMPI 2012, Vienna, Austria, September 23-26, 2012. Proceedings, presented in Vienna, Austria, pages 121-131, Springer, ISBN: 978-3-642-33517-4, Sep. 2012, Invited to a journal special issue on top picks from EuroMPI'12.
EuroMPI'12
[118] Simone Pellegrini, Torsten Hoefler, T. Fahringer:
 Exact Dependence Analysis for Increased Communication Overlap In Recent Advances in the Message Passing Interface - 19th European MPI Users' Group Meeting, EuroMPI 2012, Vienna, Austria, September 23-26, 2012. Proceedings, presented in Vienna, Austria, Springer, ISBN: 978-3-642-33517-4, Sep. 2012,
EuroMPI'12
[119] Torsten Hoefler, J. Dinan, D. Buntinas, P. Balaji, Brian Barrett, Ron Brightwell, William Gropp, V. Kale, Rajeev Thakur:
 Leveraging MPI's One-Sided Communication Interface for Shared-Memory Programming Vol 7490, In Recent Advances in the Message Passing Interface - 19th European MPI Users' Group Meeting, EuroMPI 2012, Vienna, Austria, September 23-26, 2012. Proceedings, presented in Vienna, Austria, Springer, ISBN: 978-3-642-33517-4, Sep. 2012, Invited to journal special issue on top picks from EuroMPI'12.
PACT'12
[120] Torsten Hoefler, Timo Schneider:
 Runtime Detection and Optimization of Collective Communication Patterns In Proceedings of the 21st international conference on Parallel Architectures and Compilation Techniques (PACT), presented in Minneapolis, MN, USA, pages 263--272, ACM, ISBN: 978-1-4503-1182-3, Sep. 2012, (acceptance rate: 18.9%, 39/207)
Cluster'12
[121] Simone Pellegrini, Torsten Hoefler, T. Fahringer:
 On the Effects of CPU Caches on MPI Point-to-Point Communications In Proceedings of the 2012 IEEE International Conference on Cluster Computing, presented in Beijing, China, pages 495--503, IEEE Computer Society, ISBN: 978-0-7695-4807-4, Sep. 2012, (acceptance rate: 28.9%, 58/200)
CCGrid'12
[122] P. Gottschling and Torsten Hoefler:
 Productive Parallel Linear Algebra Programming with Unstructured Topology Adaption In Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), presented in Ottawa, Canada, pages 9--16, IEEE Computer Society, ISBN: 978-0-7695-4691-9, May 2012, (acceptance rate: 27%, 83/302)
CCGrid'12
[123] G. Bauer, S. Gottlieb and Torsten Hoefler:
 Performance Modeling and Comparative Analysis of the MILC Lattice QCD Application su3_rmd In Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), presented in Ottawa, Canada, pages 652--659, IEEE Computer Society, ISBN: 978-0-7695-4691-9, May 2012, (acceptance rate: 27%, 83/302)
PDP'12
[124] K. Kharbas, D. Kim, Torsten Hoefler and F. Mueller:
 Assessing HPC Failure Detectors for MPI Jobs In Proceedings of the 2012 20th Euromicro International Conference on Parallel, Distributed and Network-based Processing, presented in Munich, Germany, pages 81--88, IEEE Computer Society, ISBN: 978-0-7695-4633-9, Feb. 2012,
PPoPP'12
[125] Torsten Hoefler and Timo Schneider:
 Communication-Centric Optimizations by Dynamically Detecting Collective Operations In Proceedings of the 17th ACM symposium on Principles and practice of parallel programming, Feb. 2012, (poster paper) (acceptance rate (posters): 17%, 32/185)
PPoPP'12
[126] F. Kjolstad, Torsten Hoefler and Marc Snir:
 Automatic Datatype Generation and Optimization In Proceedings of the 17th ACM symposium on Principles and practice of parallel programming, Feb. 2012, (poster paper) (acceptance rate (posters): 17%, 32/185)
SC11
[127] Torsten Hoefler, William Gropp, Marc Snir and W. Kramer:
 Performance Modeling for Systematic Performance Tuning In International Conference for High Performance Computing, Networking, Storage and Analysis (SC'11), SotP Session, Nov. 2011,
EuroMPI'11
[128] William Gropp, Torsten Hoefler, Rajeev Thakur and Jesper Larsson Träff:
 Performance Expectations and Guidelines for MPI Derived Datatypes Vol 6960, In Recent Advances in the Message Passing Interface (EuroMPI'11), presented in Santorini, Greece, pages 150-159, Springer, ISBN: 978-3-642-24448-3, Sep. 2011,
EuroMPI'11
[129] V. Venkatesan, M. Chaarawi, E. Gabriel and Torsten Hoefler:
 Design and Evaluation of Nonblocking Collective I/O Operations Vol 6960, In Recent Advances in the Message Passing Interface (EuroMPI'11), presented in Santorini, Greece, pages 90-98, Springer, ISBN: 978-3-642-24448-3, Sep. 2011,
EuroMPI'11
[130] Torsten Hoefler and Marc Snir:
 Writing Parallel Libraries with MPI - Common Practice, Issues, and Extensions Vol 6960, In Recent Advances in the Message Passing Interface - 18th European MPI Users' Group Meeting, EuroMPI 2011, Santorini, Greece, September 18-21, 2011. Proceedings, presented in Santorini, Greece, pages 345--355, Springer, ISBN: 978-3-642-24448-3, Sep. 2011, Keynote paper at IMUDI/EuroMPI 2011.
EuroPar'11
[131] Timo Schneider, Sven Eckelmann, Torsten Hoefler, and Wolfgang Rehm:
 Kernel-Based Offload of Collective Operations - Implementation, Evaluation and Lessons Learned In Proceedings of the 17th international conference on Parallel processing - Volume Part II, presented in Bordeaux, France, pages 264--275, Springer-Verlag, ISBN: 978-3-642-23396-8, Aug. 2011, (acceptance rate 29.9%, 81/271)
TG'11
[132] S. Harrell, P. Smith, D. Smith, Torsten Hoefler, A. Labutina and T. Overmeyer:
 Methods of Creating Student Cluster Competition Teams In Proceedings of the 2011 TeraGrid Conference: Extreme Digital Discovery, presented in Salt Lake City, Utah, pages 50:1--50:6, ACM, Jul. 2011,
ICS'11
[133] Torsten Hoefler and Marc Snir:
 Generic Topology Mapping Strategies for Large-scale Parallel Architectures In Proceedings of the 2011 ACM International Conference on Supercomputing (ICS'11), presented in Tucson, AZ, pages 75--85, ACM, ISBN: 978-1-4503-0102-2, Jun. 2011, (acceptance rate 21.7%, 35/161)
ICS'11
[134] Jeremiah Willcock, Torsten Hoefler, Nicholas Edmonds and Andrew Lumsdaine:
 Active Pebbles: Parallel Programming for Data-Driven Applications In Proceedings of the 2011 ACM International Conference on Supercomputing (ICS'11), presented in Tucson, AZ, pages 235--245, ACM, ISBN: 978-1-4503-0102-2, Jun. 2011, (acceptance rate 21.7%, 35/161)
LSAP'11
[135] Torsten Hoefler and Marc Snir:
 Performance Engineering: A Must for Petaflops and Beyond Jun. 2011, Extended Abstract for Keynote at Large-scale System and Application Performance Workshop 2011 Keynote Paper at LSAP'11
IPDPS'11
[136] Jens Domke, Torsten Hoefler and W. Nagel:
 Deadlock-Free Oblivious Routing for Arbitrary Topologies In Proceedings of the 25th IEEE International Parallel & Distributed Processing Symposium (IPDPS), presented in Anchorage, AK, USA, pages 613--624, IEEE Computer Society, ISBN: 0-7695-4385-7, May 2011, (acceptance rate: 19.6%, 112/571)
PPL
[137] P. Balaji, D. Buntinas, D. Goodell, William Gropp, Torsten Hoefler, S. Kumar, E. Lusk, Rajeev Thakur and Jesper Larsson Träff:
 MPI on Millions of Cores Parallel Processing Letters (PPL). Vol 21, Nr. 1, pages 45-60, World Scientific Publishing Company, Mar. 2011,
PPoPP'11
[138] Jeremiah Willcock, Torsten Hoefler, Nicholas Edmonds and Andrew Lumsdaine:
 Active Pebbles: A Programming Model For Highly Parallel Fine-Grained Data-Driven Computations In Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, pages 305--306, ISBN: 978-1-4503-0119-0, Feb. 2011, (poster paper) (acceptance rate: 25%, 26/165 papers + 16/165 poster) PPoPP'11 Best Poster Award
PADL'11
[139] E. Holk, W. E. Byrd, Jeremiah Willcock, Torsten Hoefler, A. Chauhan and Andrew Lumsdaine:
 Kanor -- A Declarative Language for Explicit Communication In Proceedings of the 13th international conference on Practical aspects of declarative languages, presented in Austin, TX, USA, pages 190--204, Springer-Verlag, ISBN: 978-3-642-18377-5, Jan. 2011,
CiSE
[140] Torsten Hoefler:
 Software and Hardware Techniques for Power-Efficient HPC Networking Computing in Science and Engineering (CiSE). Vol 12, Nr. 6, pages 30-37, IEEE Computer Society, ISSN: 0740-7475, Dec. 2010,
HiPC'10
[141] Nicholas Edmonds, Torsten Hoefler and Andrew Lumsdaine:
 A Space-Efficient Parallel Algorithm for Computing Betweenness Centrality in Distributed Memory In International Conference on High Performance Computing, presented in Goa, India, pages 1-10, ISBN: 978-1-4244-8518-5, Dec. 2010, (acceptance rate: 19.2%)
HiPC'10
[142] Nicholas Edmonds, J. Willcock, Torsten Hoefler and Andrew Lumsdaine:
 Design of a Large-Scale Hybrid-Parallel Graph Library In International Conference on High Performance Computing, Student Research Symposium, presented in Goa, India, IEEE, Dec. 2010,
PROPER'10
[143] Torsten Hoefler:
 Bridging Performance Analysis Tools and Analytic Performance Modeling for HPC In Proceedings of Workshop on Productivity and Performance (PROPER 2010), presented in Ischia, Italy, Springer, Dec. 2010, Keynote extended abstract for PROPER'10.
SC10
[144] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
 Characterizing the Influence of System Noise on Large-Scale Applications by Simulation In International Conference for High Performance Computing, Networking, Storage and Analysis (SC'10), Nov. 2010, (acceptance rate 19.8%, 50/253) SC10 Best Paper Award
EuroMPI'10
[145] Torsten Hoefler, G. Bronevetsky, Brian Barrett, Bronis R. de Supinski and Andrew Lumsdaine:
 Efficient MPI Support for Advanced Hybrid Programming Models Vol LNCS 6305, In Recent Advances in the Message Passing Interface (EuroMPI'10), presented in Stuttgart, Germany, pages 50--61, Springer, ISSN: 0302-9743, ISBN: 978-3-642-15645-8, Sep. 2010,
EuroMPI'10
[146] Torsten Hoefler, William Gropp, Rajeev Thakur and Jesper Larsson Träff:
 Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues Vol LNCS 6305, In Recent Advances in the Message Passing Interface (EuroMPI'10), presented in Stuttgart, Germany, pages 21--30, Springer, ISSN: 0302-9743, ISBN: 978-3-642-15645-8, Sep. 2010,
EuroMPI'10
[147] Torsten Hoefler and S. Gottlieb:
 Parallel Zero-Copy Algorithms for Fast Fourier Transform and Conjugate Gradient using MPI Datatypes Vol LNCS 6305, In Recent Advances in the Message Passing Interface (EuroMPI'10), presented in Stuttgart, Germany, pages 132--141, Springer, ISSN: 0302-9743, ISBN: 978-3-642-15645-8, Sep. 2010,
PACT'10
[148] Jeremiah Willcock, Torsten Hoefler, Nicholas Edmonds and Andrew Lumsdaine:
 AM++: A Generalized Active Message Framework In Proceedings of the 19th international conference on Parallel architectures and compilation techniques, presented in Vienna, Austria, pages 401--410, ACM, ISBN: 978-1-4503-0178-7, Sep. 2010, (acceptance rate: 17%, 46/266)
CCPE
[149] Torsten Hoefler, Rolf Rabenseifner, H. Ritzdorf, Bronis R. de Supinski, Rajeev Thakur and Jesper Larsson Träff:
 The Scalable Process Topology Interface of MPI 2.2 Concurrency and Computation: Practice and Experience. Vol 23, Nr. 4, pages 293-310, John Wiley & Sons, Ltd., ISSN: 1532-0634, Aug. 2010,
HotI'10
[150] B. Arimilli, R. Arimilli, V. Chung, S. Clark, W. Denzel, B. Drerup, Torsten Hoefler, J. Joyner, J. Lewis, J. Li, N. Ni and R. Rajamony:
 The PERCS High-Performance Interconnect IBM. In Proceedings of 18th Symposium on High-Performance Interconnects (Hot Interconnects 2010), IEEE, Aug. 2010,
IJPEDS
[151] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
 Accurately Measuring Overhead, Communication Time and Progression of Blocking and Nonblocking Collective Operations at Massive Scale International Journal of Parallel, Emergent and Distributed Systems. Vol 25, Nr. 4, pages 241-258, Taylor & Francis Group, ISSN: 1744-5779, Jul. 2010,
LSAP'10
[152] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
 LogGOPSim - Simulating Large-Scale Applications in the LogGOPS Model In Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, presented in Chicago, Illinois, pages 597--604, ACM, ISBN: 978-1-60558-942-8, Jun. 2010, LSAP'10 Best Paper Award
AMP'10
[153] Torsten Hoefler, Jeremiah Willcock, A. Chauhan and Andrew Lumsdaine:
 The Case for Collective Pattern Specification Jun. 2010, Accepted at the 1st ACM Workshop on Advances in Message Passing (AMP'10)
SciDAC'10
[154] Rajeev Thakur, P. Balaji, D. Buntinas, D. Goodell, William Gropp, Torsten Hoefler, S. Kumar, E. Lusk and Jesper Larsson Träff:
 MPI at Exascale In Proceedings of SciDAC 2010, presented in Chattanooga, Tennessee, Jun. 2010,
PPoPP'10
[155] Torsten Hoefler, Christian Siebert and Andrew Lumsdaine:
 Scalable Communication Protocols for Dynamic Sparse Data Exchange In Proceedings of the 2010 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP'10), presented in Bangalore, India, pages 159--168, ACM, ISBN: 978-1-60558-708-0, Jan. 2010, (acceptance rate 16.8%, 29/173)
HiPC'09
[156] P. Kambadur, A. Gupta, Torsten Hoefler and Andrew Lumsdaine:
 Demand-driven Execution of Static Directed Acyclic Graphs Using Task Parallelism presented in Kochi, India, pages 284-293, ISBN: 978-1-4244-4922-4, Dec. 2009, (acceptance rate 11%, 35/320)
SIMPAT
[157] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
 LogGP in Theory and Practice - An In-depth Analysis of Modern Interconnection Networks and Benchmarking Methods for Collective Operations. Elsevier Journal of Simulation Modelling Practice and Theory (SIMPAT). Vol 17, Nr. 9, pages 1511-1521, Elsevier, ISSN: 1569-190X, Oct. 2009,
EuroMPI'09
[158] Torsten Hoefler, Andrew Lumsdaine and Jack Dongarra:
 Towards Efficient MapReduce Using MPI In Recent Advances in Parallel Virtual Machine and Message Passing Interface, 16th European PVM/MPI Users' Group Meeting, presented in Helsinki, Finland, Springer, Sep. 2009,
ICPP'09
[159] Torsten Hoefler, Christian Siebert and Andrew Lumsdaine:
 Group Operation Assembly Language - A Flexible Way to Express Collective Communication In ICPP-2009 - The 38th International Conference on Parallel Processing, presented in Vienna, Austria, IEEE, ISBN: 978-0-7695-3802-0, Sep. 2009, (acceptance rate 32%, 71/220)
HotI'09
[160] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
 Optimized Routing for Large-Scale InfiniBand Networks In 17th Annual IEEE Symposium on High Performance Interconnects (HOTI 2009), presented in New York, NY, Aug. 2009,
PPL
[161] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
 The Effect of Network Noise on Large-Scale Collective Communications Parallel Processing Letters (PPL). Vol 19, Nr. 4, pages 573-593, World Scientific Publishing Company, Aug. 2009,
HIPS'09
[162] Torsten Hoefler and Jesper Larsson Träff:
 Sparse Collective Operations for MPI In Proceedings of the 23rd IEEE International Parallel & Distributed Processing Symposium, HIPS'09 Workshop, presented in Rome, Italy, ISSN: 1530-2075, ISBN: 978-1-4244-3750-4, May 2009,
CAC'09
[163] C. Kaiser, Torsten Hoefler, B. Bierbaum and T. Bemmerl:
 Implementation and Analysis of Nonblocking Collective Operations on SCI Networks In Proceedings of the 23rd IEEE International Parallel & Distributed Processing Symposium, CAC'09 Workshop, presented in Rome, Italy, ISSN: 1530-2075, ISBN: 978-1-4244-3750-4, May 2009,
CAC'09
[164] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
 A Power-Aware, Application-Based, Performance Study Of Modern Commodity Cluster Interconnection Networks In Proceedings of the 23rd IEEE International Parallel & Distributed Processing Symposium, CAC'09 Workshop, presented in Rome, Italy, ISSN: 1530-2075, ISBN: 978-1-4244-3750-4, May 2009,
LSPP'09
[165] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
 The Impact of Network Noise at Large-Scale Communication Performance In Proceedings of the 23rd IEEE International Parallel & Distributed Processing Symposium, LSPP'09 Workshop, presented in Rome, Italy, ISSN: 1530-2075, ISBN: 978-1-4244-3750-4, May 2009, Invited to a journal special issue on top picks from LSPP'09.
LCI'09
[166] J. Mueller, Timo Schneider, Jens Domke, R. Geyer, M. Haesing, Torsten Hoefler, S. Hoehlig, G. Juckeland, Andrew Lumsdaine, M. Mueller and W. Nagel:
 Cluster Challenge 2008: Optimizing Cluster Configuration and Applications to Maximize Power Efficiency In Proceedings of the 10th LCI International Conference on High-Performance Clustered Computing, presented in Boulder, CO, Mar. 2009, LCI'09 Best Paper Award
Cluster'08
[167] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
 Multistage Switches are not Crossbars: Effects of Static Routing in High-Performance Networks In Proceedings of the 2008 IEEE International Conference on Cluster Computing, presented in Tsukuba, Japan, IEEE Computer Society, ISSN: 1552-5244, ISBN: 978-1-4244-2640, Oct. 2008, (acceptance rate 30%, 28/92)
Cluster'08
[168] Torsten Hoefler and Andrew Lumsdaine:
 Message Progression in Parallel Computing - To Thread or not to Thread? In Proceedings of the 2008 IEEE International Conference on Cluster Computing, presented in Tsukuba, Japan, IEEE Computer Society, ISSN: 1552-5244, ISBN: 978-1-4244-2640, Oct. 2008, (acceptance rate 30%, 28/92)
EuroMPI'08
[169] Torsten Hoefler, M. Schellmann, S. Gorlatch and Andrew Lumsdaine:
 Communication Optimization for Medical Image Reconstruction Algorithms Vol LNCS 5205, In Recent Advances in Parallel Virtual Machine and Message Passing Interface, 15th European PVM/MPI Users' Group Meeting, presented in Dublin, Ireland, pages 75-83, Springer, ISSN: 0302-9743, ISBN: 978-3-540-87474-4, Sep. 2008,
EuroMPI'08
[170] Torsten Hoefler, F. Lorenzen and Andrew Lumsdaine:
 Sparse Non-Blocking Collectives in Quantum Mechanical Calculations Vol LNCS 5205, In Recent Advances in Parallel Virtual Machine and Message Passing Interface, 15th European PVM/MPI Users' Group Meeting, presented in Dublin, Ireland, pages 55-63, Springer, ISSN: 0302-9743, ISBN: 978-3-540-87474-4, Sep. 2008,
HotI'08
[171] P. Geoffray and Torsten Hoefler:
 Adaptive Routing Strategies for Modern High Performance Networks In 16th Annual IEEE Symposium on High Performance Interconnects, HOTI'08, presented in Stanford, CA, USA, pages 165-172, IEEE Computer Society, ISBN: 978-0-7695-3380-3, Aug. 2008, (acceptance rate 30%, 14/47)
SPAA'08
[172] Torsten Hoefler, P. Gottschling and Andrew Lumsdaine:
 Brief Announcement: Leveraging Non-blocking Collective Communication in High-performance Applications In Proceedings of the Twentieth Annual Symposium on Parallelism in Algorithms and Architectures, SPAA'08, presented in Munich, Germany, pages 113-115, Association for Computing Machinery (ACM), ISBN: 978-1-59593-973-9, Jun. 2008, (short paper) (acceptance rate: 28%, 36/128)
CCGrid'08
[173] Torsten Hoefler and Andrew Lumsdaine:
 Overlapping Communication and Computation with High Level Communication Routines In Proceedings of the 8th IEEE Symposium on Cluster Computing and the Grid (CCGrid 2008), presented in Lyon, France, May 2008, (acceptance rate: 32%)
PMEO'08
[174] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
 Accurately Measuring Collective Operations at Massive Scale In Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium, PMEO'08 Workshop, presented in Miami, FL, ISSN: 1530-2075, ISBN: 978-1-4244-1694-3, Apr. 2008, Invited to a journal special issue on top picks from PMEO'08.
CAC'08
[175] Torsten Hoefler and Andrew Lumsdaine:
 Optimizing non-blocking Collective Operations for InfiniBand In Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium, CAC'08 Workshop, presented in Miami, FL, ISSN: 1530-2075, ISBN: 978-1-4244-1694-3, Apr. 2008,
PASA'08
[176] Timo Schneider, Torsten Hoefler, Simon Wunderlich, Torsten Mehlan and Wolfgang Rehm:
 An optimized ZGEMM implementation for the Cell BE In Proceedings of the 9th Workshop on Parallel Systems and Algorithms (PASA), presented in Dresden, Germany, ISSN: 1617-5468, ISBN: 978-3-88579-218-5, Feb. 2008,
KiCC'07
[177] A. Friedley, Torsten Hoefler, M. Leininger, Andrew Lumsdaine:
 Scalable High Performance Message Passing over InfiniBand for Open MPI In Proceedings of 3rd KiCC Workshop 2007, presented in Aachen, Germany, RWTH Aachen, Dec. 2007,
SC07
[178] Torsten Hoefler, Andrew Lumsdaine and Wolfgang Rehm:
 Implementation and Performance Analysis of Non-Blocking Collective Operations for MPI In Proceedings of the 2007 International Conference on High Performance Computing, Networking, Storage and Analysis, SC07, presented in Reno, USA, IEEE Computer Society/ACM, Nov. 2007, (acceptance rate 20%, 54/268)
EuroMPI'07
[179] Torsten Hoefler, P. Kambadur, R. L. Graham, G. Shipman and Andrew Lumsdaine:
 A Case for Standard Non-Blocking Collective Operations Vol 4757, In Recent Advances in Parallel Virtual Machine and Message Passing Interface, EuroPVM/MPI 2007, presented in Paris, France, pages 125-134, Springer, ISSN: 0302-9743, ISBN: 978-3-540-75415-2, Oct. 2007,
PARCO
[180] Torsten Hoefler, P. Gottschling, Andrew Lumsdaine and Wolfgang Rehm:
 Optimizing a Conjugate Gradient Solver with Non-Blocking Collective Operations Elsevier Journal of Parallel Computing (PARCO). Vol 33, Nr. 9, pages 624-633, Elsevier, ISSN: 0167-8191, Sep. 2007,
HPCC'07
[181] Torsten Hoefler, Torsten Mehlan, Andrew Lumsdaine and Wolfgang Rehm:
 Netgauge: A Network Performance Measurement Framework Vol 4782, In Proceedings of High Performance Computing and Communications, HPCC'07, presented in Houston, USA, pages 659-671, Springer, ISBN: 978-3-540-75443-5, Sep. 2007,
PMEO'07
[182] Torsten Hoefler, Andre Lichei and Wolfgang Rehm:
 Low-Overhead LogGP Parameter Assessment for Modern Interconnection Networks TU Chemnitz. In Proceedings of the 21st IEEE International Parallel & Distributed Processing Symposium, PMEO'07 Workshop, presented in Long Beach, CA, USA, IEEE Computer Society, ISBN: 1-4244-0909-8, Mar. 2007,
CAC'07
[183] Torsten Hoefler, Christian Siebert and Wolfgang Rehm:
 A practically constant-time MPI Broadcast Algorithm for large-scale InfiniBand Clusters with Multicast TU Chemnitz. In Proceedings of the 21st IEEE International Parallel & Distributed Processing Symposium (CAC'07 Workshop), presented in Long Beach, CA, USA, pages 232, IEEE Computer Society, ISBN: 1-4244-0909-8, Mar. 2007,
KiCC'07
[184] Frank Mietke, D. Dunger, Torsten Mehlan, Torsten Hoefler and Wolfgang Rehm:
 A native InfiniBand Transporter for MySQL Cluster TU Chemnitz. In Proceedings of the 2nd Workshop 'Kommunikation in Clusterrechnern und Clusterverbundsystemen' (KiCC'07), presented in Chemnitz, Germany, Feb. 2007,
FHPCN'06
[185] Torsten Hoefler, J. Squyres, Wolfgang Rehm and Andrew Lumsdaine:
 A Case for Non-Blocking Collective Operations Vol 4331/2006, In Frontiers of High Performance Computing and Networking - ISPA'06 Workshops, presented in Sorrento, Italy, pages 155-164, Springer Berlin / Heidelberg, ISBN: 978-3-540-49860-5, Dec. 2006,
HPCNano'06
[186] Torsten Hoefler, R. Janisch and Wolfgang Rehm:
 Parallel scaling of Teter's minimization for Ab Initio calculations presented in Tampa, FL, USA, Nov. 2006, Presented at the workshop HPC Nano in conjunction with the IEEE international conference on Supercomputing (SC'06)
EuroMPI'06
[187] Torsten Hoefler, P. Gottschling, Wolfgang Rehm and Andrew Lumsdaine:
 Optimizing a Conjugate Gradient Solver with Non-Blocking Collective Operations In Recent Advances in Parallel Virtual Machine and Message Passing Interface. 13th European PVM/MPI Users' Group Meeting, Proceedings, LNCS 4192, presented in Bonn, Germany, pages 374-382, Springer, ISSN: 0302-9743, ISBN: 3-540-39110-X, Sep. 2006, Invited to a journal special issue on top picks from EuroMPI'06.
PARELEC'06
[188] Torsten Hoefler, C. Viertel, Torsten Mehlan, Frank Mietke, Wolfgang Rehm:
 Assessing Single-Message and Multi-Node Communication Performance of InfiniBand In Proceedings of IEEE International Conference on Parallel Computing in Electrical Engineering (PARELEC'06), presented in Bialystok, Poland, pages 227-232, IEEE Computer Society, ISBN: 0-7695-2554-7, Sep. 2006,
PARELEC'06
[189] Torsten Mehlan, J. Strunk, Torsten Hoefler, Frank Mietke and Wolfgang Rehm:
 IRS - A portable Interface for Reconfigurable Systems In Proceedings of IEEE International Conference on Parallel Computing in Electrical Engineering (PARELEC'06), presented in Bialystok, Poland, pages 187-191, IEEE Computer Society, ISBN: 0-7695-2554-7, Sep. 2006,
DAPSYS'06
[190] Torsten Hoefler, J. Squyres, G. Fagg, G. Bosilca, Wolfgang Rehm and Andrew Lumsdaine:
 A New Approach to MPI Collective Communication Implementations In Distributed and Parallel Systems - From Cluster to Grid Computing (DAPSYS'06), presented in Innsbruck, Austria, pages 45-54, Springer, ISBN: 978-0-387-69857-1, Sep. 2006,
EuroPar'06
[191] Frank Mietke, R. Baumgartl, R. Rex, Torsten Mehlan, Torsten Hoefler and Wolfgang Rehm:
 Analysis of the Memory Registration Process in the Mellanox InfiniBand Software Stack In Proceedings of Euro-Par 2006 Parallel Processing, presented in Dresden, Germany, pages 124-133, Springer-Verlag Berlin, ISBN: 3-540-37783-2, Aug. 2006, (acceptance rate 37.9%, 110/290)
CAC'06
[192] Torsten Hoefler, Torsten Mehlan, Frank Mietke and Wolfgang Rehm:
 Fast Barrier Synchronization for InfiniBand In Proceedings of the 20th IEEE International Parallel & Distributed Processing Symposium (IPDPS), CAC'06 Workshop, presented in Rhodes, Greece, ISBN: 1-4244-0054-6, Apr. 2006,
PMEO'06
[193] Torsten Hoefler, Torsten Mehlan, Frank Mietke and Wolfgang Rehm:
 LogfP - A Model for small Messages in InfiniBand In Proceedings of the 20th IEEE International Parallel & Distributed Processing Symposium (IPDPS), PMEO-PDS'06 Workshop, presented in Rhodes, Greece, ISBN: 1-4244-0054-6, Apr. 2006,
ARCS'06
[194] Torsten Hoefler, Torsten Mehlan, Frank Mietke and Wolfgang Rehm:
 Adding Low-Cost Hardware Barrier Support to Small Commodity Clusters In Proceedings of 19th International Conference on Architecture and Computing Systems - ARCS'06, presented in Frankfurt, Germany, pages 343-350, ISBN: 3-88579-175-7, Mar. 2006,
HPCE'05
[195] Torsten Hoefler, R. Janisch and Wolfgang Rehm:
 Improving the parallel scaling of ABINIT CINECA Consorzio Interuniversitario. In Science and Supercomputing in Europe - Report 2005, presented in Casalecchio di Reno, Italy, pages 551-559, CINECA Consorzio Interuniversitario, ISBN: 88-86037-17-1, Dec. 2005,
Book Chapter
[196] Torsten Hoefler, R. Janisch and Wolfgang Rehm:
 A Performance Analysis of ABINIT on a Cluster System TU Chemnitz. In Parallel Algorithms and Cluster Computing, presented in Chemnitz, Germany, pages 37-51, Springer, Lecture Notes in Computational Science and Engineering, ISBN: 3-540-33539-0, Dec. 2005,
ICPP-W'05
[197] Torsten Hoefler, L. Cerquetti, Torsten Mehlan, Frank Mietke and Wolfgang Rehm:
 A practical approach to the rating of barrier algorithms using the LogP model and Open-MPI In Proceedings of the 2005 International Conference on Parallel Processing Workshops, presented in Oslo, Norway, pages 562--569, ISBN: 0-7695-2381-1, Jun. 2005,
PARS'05
[198] Torsten Hoefler and Wolfgang Rehm:
 A Communication Model for Small Messages with InfiniBand PARS. In PARS Mitteilungen, presented in Luebeck, Germany, pages 32-41, PARS, ISSN: 0177-0454, Jun. 2005, PARS Junior Researcher Prize
PARS'05
[199] Frank Mietke, M. Steiger, Torsten Mehlan, Torsten Hoefler and Wolfgang Rehm:
 SHIBA Shared Memory Support for InfiniBand MPICH2 Device In PARS Mitteilungen 2005, presented in Luebeck, Germany, pages 14-23, ISSN: 0177-0454, Jun. 2005,

© Torsten Hoefler