|
|
|
Carlos Osuna,Tobias Wicky,Fabian Thuering,Torsten Hoefler,Oliver Fuhrer
Pág. 79 - 97
High-level programming languages that allow to express numerical methods and generate efficient parallel implementations are of key importance for the productivity of domain-scientists. The diversity and complexity of hardware architectures is imposing a...
ver más
|
|
|
|
|
|
|
Torsten Hoefler,Dmitry Moor
Pág. 58 - 75
Collective operations are among the most important communication operations in shared- and distributed-memory parallel applications. In this paper, we analyze the tradeoffs between energy, memory, and runtime of different algorithms to implement suc...
ver más
|
|
|
|