Cellular Multiprocessors

Supported by:


(#GR/S79572/01)

(#IP 27648 (FP6))

(#443570)

Principal Investigator

Marcelo Cintra - School of Informatics - University of Edinburgh

Research Assistants

Christian Fensch (Ph.D. 2008) - School of Informatics - University of Edinburgh
Pedro Díaz (Ph.D. candidate) - School of Informatics - University of Edinburgh
Vasileios Porpodas (Ph.D. candidate) - School of Informatics - University of Edinburgh

Tom Ashby (Post-doc - 2005-2007) - School of Informatics - University of Edinburgh

Research Collaborators

Michael O'Boyle - School of Informatics - University of Edinburgh
Wolfgang Karl - Faculty of Informatics - University of Karlsruhe
Manuel E. Acacio - Department of Computer Engineering and Technology - University of Murcia

Project Objectives

To investigate the use of single chip multiprocessor architectures based on the replication of very simple processor-memory "cells". This investigation follows two main lines of research. The first is the development of novel compilation techniques to extract larger degrees of TLP from traditionally hard to parallelise applications. The second is the development of improved architectures through the systematic study of architectural design alternatives under various performance, complexity, and power dissipation constraints.

Project Contributions

In [4] we proposed a hybrid software-hardware mechanism for cache coherence in tiled CMPs with scalable on-chip interconnects. The mechanism is relies on hardware to perform remote cache accesses and moves the responsibility for data mapping and coherence to the OS.
In [3] we presented a tool that can be used to identify the breakdown of cache miss types and, thus, assist programmers in improving the data locality behavior of their programs.
In [2] we proposed a novel class of hardware prefetchers that allow the global miss address stream to be first localized according to different correlation criteria and later chained following their original temporal behavior. This mechanism allows for the simultaneous exploitation of different types of correlation while maintaining timeliness.

Publications (sorted by date)

  • [1] Distance-Aware Round-Robin Mapping for Large NUCA Caches
    Alberto Ros, Marcelo Cintra, Manuel E. Acacio, and Jose M. Garcia
    Intl. Conf. on High Performance Computing (HiPC), December 2009
  • [2] Stream Chaining: Exploiting Multiple Levels of Correlation in Data Prefetching
    Pedro Diaz and Marcelo Cintra
    Intl. Symp. on Computer Architecture (ISCA), p 81-92, June 2009
  • [3] A Generic Tool Supporting Cache Design and Optimisation on Shared Memory Systems
    Martin Schindewolf, Jie Tao, Wolfgang Karl, and Marcelo Cintra
    Wksp. on Parallel Systems and Algorithms (PASA), February 2008
  • [4] An OS-Based Alternative to Full Hardware Coherence on Tiled CMPs
    Christian Fensch and Marcelo Cintra
    Intl. Symp. on High-Performance Computer Architecture (HPCA), p 355-366, February 2008

    Thesis

  • An OS-Based Alternative to Full Hardware Coherence on Tiled Chip-Multiprocessors.
    Christian Fensch
    Ph.D., School of Informatics, University of Edinburgh, 2008.

    Talks

  • Stream Chaining: Exploiting Multiple Levels of Correlation in Data Prefetching
    Department of Informatics, University of Munich, Munich, Germany, November 2009
  • Stream Chaining: Exploiting Multiple Levels of Correlation in Data Prefetching (presentation)
    Intl. Symp. on Computer Architecture, Austin, USA, June 2009
  • Stream Chaining: Exploiting Multiple Levels of Correlation in Data Prefetching
    Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, USA, May 2009
  • An OS-Based Alternative to Full Hardware Coherence on Tiled CMPs
    Department of Electrical and Computer Engineering, University of Rochester, Rochester, USA, March 2009
  • An OS-Based Alternative to Full Hardware Coherence on Tiled CMPs
    Intel Germany Research Laboratory, Braunschweig, Germany, July 2008
  • An OS-Based Alternative to Full Hardware Coherence on Tiled CMPs
    Department of Computer Engineering, Universidad de Murcia, Murcia, Spain, May 2008
  • An OS-Based Alternative to Full Hardware Coherence on Tiled CMPs (presentation)
    Intl. Symp. on High-Performance Computer Architecture, Salt Lake City, USA, February 2008
  • Alternatives to Eager Hardware Cache Coherence on Large-Scale CMPs (presentation)
    IBM T. J. Watson Research Center, Yorktown Heights, USA, July 2007
  • Alternatives to Eager Hardware Cache Coherence on Large-Scale CMPs
    Department of Informatics, University of Erlangen-Nuremberg, Erlangen, Germany, June 2007
  • Alternatives to Eager Hardware Cache Coherence on Large-Scale CMPs
    Oak Ridge National Laboratory, Oak Ridge, USA, March 2007
  • Alternatives to Eager Hardware Cache Coherence on Large-Scale CMPs
    ARM Ltd., Cambridge, UK, February 2007

    Related Projects (in alphabetical order)

  • Blue Gene at IBM Research (USA)
  • IACOMA at University of Illinois (USA)
  • RATS at University of Southern California (USA)
  • RAW at MIT (USA)
  • SCALE at MIT (USA)
  • Smart Memories at Stanford University (USA)
  • TRIPS at University of Texas (USA)
  • Wavescalar at University of Washington (USA)
    Last modified: November 02 2007