Search Sciweavers | Sciweavers

» Scalable Parallel Programming with CUDA

MICRO
2010
IEEE

Explicit Communication and Synchronization in SARC

15 years 21 days ago

Download archvlsi.ics.forth.gr

SARC merges cache controller and network interface functions by relying on a single hardware primitive: each access checks the tag and the state of the addressed line for possible...

Manolis Katevenis, Vassilis Papaefstathiou, Stamat...

ISCA
2009
IEEE

Multi-execution: multicore caching for data-similar executions

15 years 9 months ago

Download www.cs.ucsb.edu

While microprocessor designers turn to multicore architectures to sustain performance expectations, the dramatic increase in parallelism of such architectures will put substantial...

Susmit Biswas, Diana Franklin, Alan Savage, Ryan D...

MICRO
2008
IEEE

A distributed processor state management architecture for large-window processors

15 years 8 months ago

Download www.ics.uci.edu

Processor architectures with large instruction windows have been proposed to expose more instruction-level parallelism (ILP) and increase performance. Some of the proposed arch...

Isidro Gonzalez, Marco Galluzzi, Alexander V. Veid...

CCGRID
2006
IEEE

Proposal of MPI Operation Level Checkpoint/Rollback and One Implementation

15 years 8 months ago

Download icl.cs.utk.edu

With the increasing number of processors in modern HPC (High Performance Computing) systems, there are two emergent problems to solve. One is scalability, the other is fault tolera...

Yuan Tang, Graham E. Fagg, Jack Dongarra

IEEEPACT
2005
IEEE

A Distributed Control Path Architecture for VLIW Processors

15 years 7 months ago

Download www.ptlsim.org

VLIW architectures are popular in embedded systems because they offer high-performance processing at low cost and energy. The major problem with traditional VLIW designs is that t...

Hongtao Zhong, Kevin Fan, Scott A. Mahlke, Michael...
