Sciweavers

» Parallel Program Performance Metrics: A Comparison and Valid...
LREC
2010
15 years 1 month ago
U-Compare: An Integrated Language Resource Evaluation Platform Including a Comprehensive UIMA Resource Library
Language resources, including corpus and tools, are normally required to be combined in order to achieve a user's specific task. However, resources tend to be developed indep...
Yoshinobu Kano, Ruben Dorado, Luke McCrohon, Sophi...
PLDI
1995
ACM
15 years 3 months ago
Unifying Data and Control Transformations for Distributed Shared Memory Machines
We present a unified approach to locality optimization that employs both data and control transformations. Data transformations include changing the array layout in memory. Contr...
Michal Cierniak, Wei Li
ICS
2003
Tsinghua U.
15 years 4 months ago
Estimating cache misses and locality using stack distances
Cache behavior modeling is an important part of modern optimizing compilers. In this paper we present a method to estimate the number of cache misses, at compile time, using a mac...
Calin Cascaval, David A. Padua
ICPP
2008
IEEE
15 years 6 months ago
Optimizing JPEG2000 Still Image Encoding on the Cell Broadband Engine
JPEG2000 is the latest still image coding standard from the JPEG committee, which adopts new algorithms such as Embedded Block Coding with Optimized Truncation (EBCOT) and Discret...
Seunghwa Kang, David A. Bader
ISCA
2011
IEEE
14 years 3 months ago
FabScalar: composing synthesizable RTL designs of arbitrary cores within a canonical superscalar template
A growing body of work has compiled a strong case for the single-ISA heterogeneous multi-core paradigm. A single-ISA heterogeneous multi-core provides multiple, differently-design...
Niket Kumar Choudhary, Salil V. Wadhavkar, Tanmay ...