In order for collective communication routines to achieve high performance on different platforms, they must be able to adapt to the system architecture and use different algori...
Traditional collective communication algorithms are designed with the assumption that a node can communicate with only one other node at a time. On new parallel architectures such...
Ernie Chan, Robert A. van de Geijn, William Gropp,...