Cedar: Difference between revisions

no edit summary
No edit summary
No edit summary
Line 107: Line 107:


= Performance = <!--T:17-->
= Performance = <!--T:17-->
Theoretical peak double precision performance of Cedar is 936 teraflops for CPUs, plus 2,744 for GPUs, yielding over 3.6 petaflops of theoretical peak double precision performance. 22 fully connected "islands" of 32 base or large nodes each have 1024 cores in a fully non-blocking topology (Omni-Path fabric), with each island designed to yield over 30 teraflops of double-precision performance (measured with high performance LINPACK). There is a 2:1 blocking factor between the 1024 core islands. The Skylake nodes also span 20 non-blocking islands of 32 nodes each, forming islands of 1536 cores.
Theoretical peak double precision performance of Cedar is 6547 teraflops for CPUs, plus 7434 for GPUs, yielding almost 14 petaflops of theoretical peak double precision performance. 22 fully connected "islands" of 32 base or large nodes each have 1024 cores in a fully non-blocking topology (Omni-Path fabric), with each island designed to yield over 30 teraflops of double-precision performance (measured with high performance LINPACK). There is a 2:1 blocking factor between the 1024 core islands. Similarly the Skylake and Cascade Lake nodes span 44 non-blocking islands of 32 nodes each, forming islands of 1536 cores.


<!--T:16-->
<!--T:16-->
cc_staff
28

edits