TY - JOUR
T1 - The performance of NoCs for very large manycore systems under locality-based traffic
AU - Al Khanjari, Sharifa
AU - Vanderbauwhede, Wim
N1 - Funding Information:
The authors gratefully acknowledge the support provided by Sultanate of Oman, Ministry of Higher Education and the German University of Technology in Oman (GUTech).
Publisher Copyright:
© 2016 University of Bahrain. All rights reserved.
PY - 2016/3/1
Y1 - 2016/3/1
N2 - The scaling of semiconductor technologies is leading to processors with increasing numbers of cores. A key enabler in manycore systems is the use of Networks-on-Chip (NoC) as a global communication mechanism. The adoption of NoCs in manycore systems requires a shift in focus from computation to communication, as communication is fast becoming the dominant factor in processor performance. In large manycore systems, performance is predicated on the locality of communication. In this work, we investigate the performance of three NoC topologies for systems with thousands of processor cores under two types of localised traffic models. We present latency and throughput results comparing fat quadtree, concentrated mesh and mesh topologies under different degrees of localisation. Our results, obtained using a modified version of the HNOCS NoC simulator and based on the ITRS physical data for 2023, show that the type of locality traffic and the degree of localisation significantly affects the NoC performance, and that scale-invariant topologies perform worse than flat topologies.
AB - The scaling of semiconductor technologies is leading to processors with increasing numbers of cores. A key enabler in manycore systems is the use of Networks-on-Chip (NoC) as a global communication mechanism. The adoption of NoCs in manycore systems requires a shift in focus from computation to communication, as communication is fast becoming the dominant factor in processor performance. In large manycore systems, performance is predicated on the locality of communication. In this work, we investigate the performance of three NoC topologies for systems with thousands of processor cores under two types of localised traffic models. We present latency and throughput results comparing fat quadtree, concentrated mesh and mesh topologies under different degrees of localisation. Our results, obtained using a modified version of the HNOCS NoC simulator and based on the ITRS physical data for 2023, show that the type of locality traffic and the degree of localisation significantly affects the NoC performance, and that scale-invariant topologies perform worse than flat topologies.
KW - Locality
KW - Manycore
KW - NUMA
KW - Network on Chip
UR - http://www.scopus.com/inward/record.url?scp=85104916930&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85104916930&partnerID=8YFLogxK
U2 - 10.12785/IJCDS/050202
DO - 10.12785/IJCDS/050202
M3 - Article
AN - SCOPUS:85104916930
SN - 2210-142X
VL - 5
SP - 115
EP - 124
JO - International Journal of Computing and Digital Systems
JF - International Journal of Computing and Digital Systems
IS - 2
ER -