Copyright Notice:

The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Publications of SPCL

T. Hoefler, D. Roweth, K. Underwood, R. Alverson, M. Griswold, V. Tabatabaee, M. Kalkunte, S. Anubolu, S. Shen, M. McLaren, A. Kabbani, S. Scott:

 Data Center Ethernet and Remote Direct Memory Access: Issues at Hyperscale

(IEEE Computer. Vol 56, Nr. 7, pages 67-77, IEEE Computer Society, ISSN: 1521-9615, Jul. 2023)
Cover Feature Technology Predictions

Publisher Reference

Abstract

We observe that emerging artificial intelligence, high-performance computing, and storage workloads pose new challenges for large-scale datacenter networking. RDMA over Converged Ethernet (RoCE) was an attempt to adopt modern Remote Direct Memory Access (RDMA) features into existing Ethernet installations. Now, a decade later, we revisit RoCE’s design points and conclude that several of its shortcomings must be addressed to fulfill the demands of hyperscale datacenters. We predict that both the datacenter and high-performance computing markets will converge and adopt modernized Ethernet-based high-performance networking solutions that will replace TCP and RoCE within a decade.

Documents

download article:
access preprint on arxiv:
 

BibTeX

@article{hoefler-datacenter,
  author={Torsten Hoefler and Duncan Roweth and Keith Underwood and Robert Alverson and Mark Griswold and Vahid Tabatabaee and Mohan Kalkunte and Surendra Anubolu and Siyuan Shen and Moray McLaren and Abdul Kabbani and Steve Scott},
  title={{Data Center Ethernet and Remote Direct Memory Access: Issues at Hyperscale}},
  journal={IEEE Computer},
  year={2023},
  month={07},
  pages={67-77},
  volume={56},
  number={7},
  publisher={IEEE Computer Society},
  issn={1521-9615},
  doi={10.1109/MC.2023.3261184},
}