Sciweavers

901 search results - page 102 / 181
» Hiding Communication Latency in Data Parallel Applications
Sort
View
CLUSTER
2009
IEEE
14 years 2 months ago
Message passing for GPGPU clusters: CudaMPI
—We present and analyze two new communication libraries, cudaMPI and glMPI, that provide an MPI-like message passing interface to communicate data stored on the graphics cards of...
Orion S. Lawlor
HPCA
2000
IEEE
14 years 10 days ago
A Prefetching Technique for Irregular Accesses to Linked Data Structures
Prefetching offers the potential to improve the performance of linked data structure (LDS) traversals. However, previously proposed prefetching methods only work well when there i...
Magnus Karlsson, Fredrik Dahlgren, Per Stenstr&oum...
FGCN
2007
IEEE
109views Communications» more  FGCN 2007»
14 years 2 months ago
Flow Balancing Hardware for Parallel TCP Streams on Long Fat Pipe Network
Parallel TCP streams are used for data transfer between clusters in today's high performance applications. When parallel TCP streams are used on LFN, part of streams fail to ...
Yutaka Sugawara, Mary Inaba, Kei Hiraki
PACT
2009
Springer
14 years 2 months ago
Parallel FFT with Eden Skeletons
The notion of Fast Fourier Transformation (FFT) describes a range of efficient algorithms to compute the discrete Fourier transformation, frequency distribution in a signal. FFT pl...
Jost Berthold, Mischa Dieterle, Oleg Lobachev, Rit...
LCN
2006
IEEE
14 years 1 months ago
Considerations of SCTP Retransmission Delays for Thin Streams
The popularity of distributed interactive applications has exploded in the last few years. For example, massive multi-player online games have become a fast growing, multi-million...
Jon Pedersen, Carsten Griwodz, Pål Halvorsen