Interprocessor Communication : Towards Cache Integrated Network Interfaces Vassilis Papaefstathiou and Michael Papamichael Foundation for Research and Technology-Hellas (FORTH) Institute of Computer Science (ICS) – member of HiPEAC Computer Architecture and VLSI Systems Laboratory (CARV) Work funded by SARC

Motivation and Context ♦ Many on-chip cores available today (CMPs) – – – –

P

P

P

L2

L2

L1

Key scalability issue : efficient interprocessor communication Network-on-Chip (NoC) for the interconnection Memory resources in every tile used as L1 or L2 cache NoC interface in every tile Æ low cost to afford



L1 NI

L2 L1

NI

NI

Network-on-Chip NI

NI

L1

NI



L1 L2

L2

P

P

L1 L2 P

Cache Integrated Network Interfaces ♦ Tightly-coupled processor - network architecture

P1

– Processor and NI share local memory Æ lightweight NI – Allocation of memory blocks for NI use ƒ coarse or fine-grain block allocation (cache-line) – Fast and low latency mechanism for explicit messaging ƒ stores (send) and loads (receive) – Integrate NI mechanisms into the cache controller ƒ transmission similar to cache write-back ƒ reception similar to cache miss

Local Memory & Cache Controller

NI

NÆ1

Network-on-Chip

♦ Desired communication primitives

NI

– RDMA for bulk transfers ƒ post descriptors in cache-lines – Queues for small explicit transfers ƒ specify destination, size and payload ƒ send queues ( one-to-many ) ƒ receive queues ( many-to-one )

P2

NI

Local Memory & Cache Controller

1ÆN Local Memory & Cache Controller

Cache Line

P3

NI Queue

Runtime Configurable Memory Resources NI

♦ Applications configure the memory type ♦ Computation intensive applications

– Configure the degree of cache associativity

P

♦ Communication intensive applications – Fine grain allocation of blocks for NI

Configurable HW

– Diverse applications Æ different memory requirements

♦ Real-time embedded applications – Configure as addressable local store - scratchpad ƒ predictable performance

Local Memory Resources

Cache Line

NI Queue

Scratchpad

Interprocessor Communication : Towards Cache Integrated Network ...

Institute of Computer Science (ICS) – member of HiPEAC. Foundation for Research ... Configure the degree of cache associativity. ♢ Communication intensive ...

96KB Sizes 2 Downloads 253 Views

Recommend Documents

Interprocessor Communication : Towards Cache Integrated Network ...
RDMA for bulk transfers. ▫ post descriptors in cache-lines. – Queues for small explicit transfers. ▫ specify destination, size and payload. ▫ send queues ...

Interprocessor Communication: Towards Cache ...
and reception. This allocation will ... might have some virtual part to be translated by the network switches and/or the network interfaces – this ... Computer Architecture (ISCA 1996), pages 247–258, Philadelphia, PA USA, May. 1996. [MK96].

In-Network Cache Coherence
valid) directory-based protocol [7] as a first illustration of how implementing the ..... Effect on average memory latency. ... round-trips, a 50% savings. Storage ...

In-Network Cache Coherence
Abstract— We propose implementing cache coherence pro- tocols within the network, demonstrating how an in-network implementation of the MSI directory-based protocol allows for in-transit optimizations of read and write delay. Our results show 15% a

In-Network Cache Coherence
protocol [5] as an illustration of how implementing the pro- .... Here, as an illustration, we will discuss the in-network ...... els: A tutorial,” IEEE Computer, vol.

Security and Communication Network
Soft computing techniques, such as Fuzzy Logic, Neural Networks, Evolutionary. Computing, Rough Sets and other similar techniques, have been proved ...

Security and Communication Network
pattern detection, data segmentation, data mining, adaptive control, information assurance, etc. Recently, soft computing is widely used in information system for assurance. For example, neural networks are used for intrusion detection or prevention,

On fair network cache allocation to content providers - Semantic Scholar
Apr 12, 2016 - theoretic rules outperform in terms of content access latency the naive cache ... With the advent of broadband and social networks, the In-.

Towards Reproducible Performance Studies Of Datacenter Network ...
Data Storage Institute ... codes for our simulation set- ups are publicly available at http://code.google.com/p/ntu-dsi- dcn/. ... fully functional datacenter network of 50,000 servers [5], with .... such as as higher network capacity and graceful pe

Towards Implicit Communication and Program ...
Key-Words: - Separability, Algorithm, Program, Implicit, Communication, Distribution. 1 Introduction ... source code. 1. This will simplify the programming stage, abstracting the concept of data location and access mode. 2. This will eliminate the co

On fair network cache allocation to content providers - Semantic Scholar
Available online 12 April 2016. Keywords: In-network caching ..... (i.e., possibly offering different prices to different classes) would be easily considered .... x ∈ Rn. + containing the cache space portion to allocate to each CP. (i.e., the value

Towards a novel integrated neuro-cognitive ...
sub-systems via a learning memory modeling approach. The strength of INCA ... derived from a top-down, analytical AI approach operating on abstract symbols ...... P. Wang, B. Goertzel, and S. Franklin, Eds. Amsterdam: IOS Press,. 2008, pp.

Towards An Integrated Environment for Networked Music ... - zagorac
include standard Mac and Window computers, portable iOS (iPad) and Android ..... hierarchical layer structures were created (Figures 10 and Figure 12).

Read Online Integrated Marketing Communication ...
... across the country weren’t too excited about the idea of hosting pro Trump anti Muslim rallies ... from Idea to Implementation Online , Read Best Book Online Integrated Marketing ... Publisher : Rowman & Littlefield Publishers 2014-07-10.