Sciweavers

RT
2001
Springer
14 years 2 days ago
Interactive Distributed Ray Tracing of Highly Complex Models
Many disciplines must handle the creation, visualization, and manipulation of huge and complex 3D environments. Examples include large structural and mechanical engineeri...
Ingo Wald, Philipp Slusallek, Carsten Benthin, Mar...
MASCOTS
2004
13 years 9 months ago
Design and Implementation of a High Speed Microprocessor Simulator BurstScalar
This paper describes the design and implementation of our high-speed simulator for out-of-order microprocessors, named BurstScalar. The simulator is based on the well-known SimpleSc...
Takashi Nakada, Hiroshi Nakashima
ISSAC
2007
Springer
14 years 1 months ago
Productivity and performance using partitioned global address space languages
Partitioned Global Address Space (PGAS) languages combine the programming convenience of shared memory with the locality and performance control of message passing. One such langu...
Katherine A. Yelick, Dan Bonachea, Wei-Yu Chen, Ph...
IPPS
2003
IEEE
14 years 27 days ago
A Compilation Framework for Distributed Memory Parallelization of Data Mining Algorithms
With the availability of large datasets in a variety of scientific and commercial domains, data mining has emerged as an important area within the last decade. Data mining techni...
Xiaogang Li, Ruoming Jin, Gagan Agrawal
FPL
2009
Springer
14 years 8 days ago
A highly scalable Restricted Boltzmann Machine FPGA implementation
Restricted Boltzmann Machines (RBMs) — the building block for newly popular Deep Belief Networks (DBNs) — are a promising new tool for machine learning practitioners. However,...
Sang Kyun Kim, Lawrence C. McAfee, Peter L. McMaho...