• 615781
  • Public Content
Contents

3rd Generation Intel® Xeon® Scalable Processors

Performance varies by use, configuration and other factors.

Performance results are based on testing as of dates shown in configurations and may not reflect all publicly available updates. See configuration disclosure for details. No product or component can be absolutely secure.

Intel optimizations, for Intel compilers or other products, may not optimize to the same degree for non-Intel products.

Estimates of SPECrate®2017_​int_​base and SPECrate®2017_​fp_​base based on Intel internal measurements. SPEC®, SPECrate® and SPEC CPU® are registered trademarks of the Standard Performance Evaluation Corporation. See www.spec.org for more information.

Claim Processor Family System Configuration Measurement Measurement Period
[125] 1.46x average performance gains with 3rd Gen Intel Xeon Platinum 8380 processor vs. prior generation 3rd Generation Intel® Xeon® Platinum processor 1.46x average performance gain - Ice Lake vs. Cascade Lake: Geomean of 1.5x SPECrate2017_​int_​base (est), 1.52x SPECrate2017_​fp_​base (est), 1.47x STREAM Triad, 1.38x Intel distribution of LINPACK. New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on (SPECcpu2017), off (others), Turbo on, Ubuntu 20.04, 5.4.0-66-generic, 1x S4610 SSD 960G, SPECcpu2017 (est) v1.1.0, STREAM Triad, LINPACK, ic19.1u2, MPI: Version 2019u9; MKL:2020.4.17, test by Intel on 3/15/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on (SPECcpu2017), off (others), Turbo on, Ubuntu 20.04, 5.4.0-62-generic, 1x S3520 SSD 480G, SPECcpu2017 (est) v1.1.0, STREAM Triad, Intel distribution of LINPACK, ic19.1u2, MPI: Version 2019u9; MKL:2020.4.17, test by Intel on 2/4/2021. Geomean of

Integer throughput/Floating Point throughput/STREAM/LINPACK

New: March 15, 2021

Baseline: Feb 04,2021

[123] 1.45x higher INT8 real-time inference throughput with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. prior generation

1.74x higher INT8 batch inference throughput on BERT-Large SQuAD with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. prior generation

3rd Generation Intel® Xeon® Platinum processor BERT-Large SQuAD : 1.45x higher INT8 real-time inference throughput & 1.74x higher INT8 batch inference throughput on Ice Lake vs. prior generation Cascade Lake Platinum 8380: New:1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X261, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, BERT - Large SQuAD, gcc-9.3.0, oneDNN 1.6.4, BS=1,128 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on 3/12/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-48-generic, 1x Samsung_​SSD_​860, Intel SSDPE2KX040T8, BERT - Large SQuAD, gcc-9.3.0, oneDNN 1.6.4, BS=1,128 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on 2/17/2021. BERT- Large SQuAD New: March 12, 2021

Baseline: Feb 17,2021

[122] 1.59x higher INT8 real-time inference throughput with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. prior generation.

1.66x higher INT8 batch inference throughput on MobileNet-v1 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. prior generation

3rd Generation Intel® Xeon® Platinum processor MobileNet-v1: 1.59x higher INT8 real-time inference throughput & 1.66x higher INT8 batch inference throughput on Ice Lake vs. prior generation Cascade Lake Platinum. New: 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X261, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, MobileNet-v1, gcc-9.3.0, oneDNN 1.6.4, BS=1,56 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on 3/12/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-48-generic, 1x Samsung_​SSD_​860, Intel SSDPE2KX040T8,, MobileNet-v1, gcc-9.3.0, oneDNN 1.6.4, BS=1,56 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on 2/17/2021. MobileNet-v1 New: March 12, 2021

Baseline: Feb 17,2021

[121] 1.52x higher INT8 real-time inference throughput with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. prior generation

1.56x higher INT8 batch inference throughput on ResNet50 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. prior generation

3rd Generation Intel® Xeon® Platinum processor ResNet-50 v1.5 : 1.52x higher INT8 real-time inference throughput & 1.56x higher INT8 batch inference throughput on Ice Lake vs. prior generation Cascade Lake. New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X261, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, ResNet-50 v1.5, gcc-9.3.0, oneDNN 1.6.4, BS=1,128 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on 3/12/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-48-generic, 1x Samsung_​SSD_​860, Intel SSDPE2KX040T8, ResNet-50 v1.5, gcc-9.3.0, oneDNN 1.6.4, BS=1,128 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on 2/17/2021. ResNet50 v1.5 New: March 12, 2021

Baseline: Feb 17,2021

[120] 1.39x higher INT8 real time inference throughput on SSD-ResNet34 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. prior generation 3rd Generation Intel® Xeon® Platinum processor SSD-ResNet34: 1.39x higher INT8 batch inference throughput on Ice Lake vs. prior generation Cascade Lake. New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode, X261 HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, SSD-ResNet34, gcc-9.3.0, oneDNN 1.6.4, BS=1 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on 3/12/2021.Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-48-generic, 1x Samsung_​SSD_​860, Intel SSDPE2KX040T8, SSD-ResNet34, gcc-9.3.0, oneDNN 1.6.4, BS=1 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on 2/17/2021. SSD-ResNet34 New: March 12, 2021

Baseline: Feb 17,2021

[119] 1.35x higher INT8 real-time inference throughput & with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. prior generation

1.42x higher INT8 batch inference throughput on SSD-MobileNet-v1 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. prior generation

3rd Generation Intel® Xeon® Platinum processor SSD-MobileNet-v1: 1.35x higher INT8 real-time inference throughput & 1.42x higher INT8 batch inference throughput on Ice Lake vs. prior generation Cascade Lake. New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X261, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, SSD-MobileNet-v1, gcc-9.3.0, oneDNN 1.6.4, BS=1,448 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on 3/12/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-48-generic, 1x Samsung_​SSD_​860, Intel SSDPE2KX040T8,, SSD-MobileNet-v1, gcc-9.3.0, oneDNN 1.6.4, BS=1,448 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on 2/17/2021. SSD-MobileNet-v1 New: March 12, 2021

Baseline: Feb 17,2021

[118] Ice Lake customers who utilize Intel-optimization for Tensor Flow and Intel DL Boost (VNNI ) will gain over 11X higher batch AI inference performance on ResNet50 compared with stock Cascade Lake FP32 configuration 3rd Generation Intel® Xeon® Platinum processor 11X higher batch AI inference performance with Intel-optimized Tensor Flow vs. stock Cascade Lake FP32 configuration New: 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X261, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, ResNet-50 v1.5, gcc-9.3.0, oneDNN 1.6.4, BS=128 FP32,INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, Unoptimized model : TensorFlow- 2.4.1, Modelzoo:https://github.com/IntelAI/models -b master, test by Intel on 3/12/2021. Baseline: 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-48-generic, 1x Samsung_​SSD_​860, Intel SSDPE2KX040T8, ResNet-50 v1.5, gcc-9.3.0, oneDNN 1.6.4, BS=128 FP32,INT8, Optimized model : TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, Unoptimized model : TensorFlow- 2.4.1, Modelzoo:https://github.com/IntelAI/models -b master, test by Intel on 2/17/2021. ResNet50 v1.5 - opt/unopt New: March 12, 2021

Baseline: Feb 17,2021

[117] Up to 100x gains due to software improvement on SciKit learn workloads : linear regression fit, SVC inference, kdtree_​knn inference and elastic-net fit on Ice Lake with Daal4py optimizations compared with stock Scikit-learn 3rd Generation Intel® Xeon® Platinum processor Up to 100x gains due to software improvement on SciKit learn workloads : linear regression fit, SVC inference, kdtree_​knn inference and elastic-net fit on Ice Lake with Daal4py optimizations compared with stock Scikit-learn New: 8380: 1-node, 2x Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X55260, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-64-generic, 2x Intel_​SSDSC2KG96, Unoptimized : Python : Python 3.7.9, SciKit-Learn : Sklearn 0.24.1, Optimized : oneDAL : Daal4py 2021.2, Benchmarks: https://github.com/IntelPython/scikit-learn_bench, tested by Intel, and results as of March 2021 Scikit-learn Software optimizations New: March 12, 202
[115] Complete graph analytics computations used in search, social networks, recommender systems, bioinformatics, and fraud detection 2X faster on average when using 3rd Gen Intel Xeon Scalable processors with Intel Optane persistent memory 200 series. 3rd Generation Intel® Xeon® Platinum processor and Intel® Optane™ persistent memory 200 series . Katana Graph: New: Platinum 8368: 1-node, 2x Intel Xeon Platinum 8368 processor on Coyote Pass with 1024 GB (16 slots/ 64GB/ 3200) total DDR4 memory, 8192 GB (16 slots/ 512 GB/ 3200) total Pmem, ucode 0x261, HT off, Turbo on, Ubuntu 20.04.1 LTS, 5.4.0-65-generic, 1x Intel 480GB SSD, 2x Intel 2TB SSD, 1x Intel XC710, Galoishttps://github.com/IntelligentSoftwareSystems/Galois, GCC 9.3.0, Algorithms: Betweenness Centrality, Breadth First Search, Connected Components, test by Intel on 3/15/2021. Baseline: Platinum 8260: 1-node, 2x Intel Xeon Platinum 8260 processor on Wolf Pass with 768 GB (12 slots/ 64GB/ 2666) total DDR4 memory, 6144 GB (12 slots/ 512 GB/ 2666) total Pmem, ucode 0x5003003, HT off, Turbo on, Ubuntu 20.04.1 LTS, 5.4.0-65-generic, 1x Intel 480GB SSD, 2x Intel 2TB SSD, 1x Intel XC710, Galois https://github.com/IntelligentSoftwareSystems/Galois, GCC 9.3.0, Algorithms: Betweenness Centrality, Breadth First Search, Connected Components, test by Intel on 3/15/2021. Katana Graph New: March 15, 2021

Baseline: March 15,2021

[114]

OpenVINO FP32 model running on Intel® Xeon® Platinum 8380 CPU @ 2.30GHz gives 2.2X latency improvement over baseline on Intel® Xeon® Platinum 8280 CPU @ 2.70GHz.

OpenVINO FP32 model running on Intel® Xeon® Platinum 8380 CPU @ 2.30GHz gives 1.6X throughput improvement over baseline on Intel® Xeon® Platinum 8280 CPU @ 2.70GHz.

3rd Generation Intel® Xeon® Platinum processor

Custom Deep Learning based Encoder Decoder model: Optimized:New: Tested by Intel as of 03/25/2021. 2 socket Intel® Xeon® Platinum 8380 Processor, 40 cores per socket, Ucode 0xd000270, HT On, Turbo On, OS Ubuntu 18.04.5 LTS, Kernel 5.4.0-65-generic, Total Memory 256GB, BIOS SE5C6200.86B.0022.D08.2103221623, Framework: Intel OpenVINO toolkit 2021.2.185, Python 3.6.13, Intel-openmp 2021.1.2, Numpy 1.19.5, GCC 7.5.0, model – custom Autoencoder

Baseline: Tested by Intel as of 03/25/2021. 2 socket Intel® Xeon® Platinum 8280 Processor, 28 cores per socket, Ucode 0x5003003, HT On, Turbo On, OS Ubuntu 18.04.5 LTS, Kernel 5.4.0-65-generic, Total Memory 384GB, BIOS SE5C620.86B.02.01.0011.032620200659, Framework: Intel OpenVINO toolkit 2021.2.185, Python 3.6.13, Intel-openmp 2021.1.2, Numpy 1.19.5, GCC 7.5.0, model – custom Autoencoder

custom Deep Learning based Encoder Decoder model  -Fujitsu New: March 25, 2021

Baseline: March 25,2021

[113] 1.4X improvement in hyperparameter tuning during training with Intel® Xeon® Platinum 8380 CPU vs. Intel® Xeon® Platinum 8280 CPU. 3rd Generation Intel® Xeon® Platinum processor

Predictive Analytics using XGBoost

Optimized:New: Tested by Intel as of 02/24/2021. 2 socket Intel® Xeon® Platinum 8380 Processor, 40 cores per socket, Ucode 0x8d05a260, HT On, Turbo On, OS Ubuntu 18.04.5 LTS, Kernel 5.4.0-65-generic, Total Memory 256GB, BIOS SE5C6200.86B.3021.D40.2103160200, Framework: XGBoost 1.3.3, Intel-openmp 2020.2, Intel MKL 2020.2, Numpy 1.19.2 (Intel), Pandas 1.2.1 (Intel), scikit-learn 0.23.2 (Intel), Anaconda Python 3.7.9, GCC 7.5.0, model trained – GBT Classifier, custom train data

Baseline: Tested by Intel as of 02/24/2021. 2 socket Intel® Xeon® Platinum 8280 Processor, 28 cores per socket, Ucode 0x5003003, HT On, Turbo On, OS Ubuntu 18.04.5 LTS, Kernel 5.4.0-65-generic, Total Memory 384GB, BIOS SE5C620.86B.02.01.0011.032620200659, Framework: XGBoost 1.3.3, Intel-openmp 2020.2, Intel MKL 2020.2, Numpy 1.19.2 (Intel), Pandas 1.2.1 (Intel), scikit-learn 0.23.2 (Intel), Anaconda Python 3.7.9, GCC 7.5.0, model trained – GBT Classifier, custom train data

Nordigen Predictive Analytics using XGBoost New: Feb 25, 2021

Baseline: Feb 25,2021

[108] Up to 1.53x higher HPC performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.53x higher FSI Kernel performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.60x higher Life and Material Science performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.41x higher HPCG performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.38x higher HPL performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.47x higher STREAM Triad Performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.58x higher WRF performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.28x higher Binomial Options performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.67x higher Black Scholes performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.70x higher Monte Carlo performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.51x higher OpenFOAM performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.64x higher GROMACS performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.60x higher LAMMPS performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.57x higher NAMD performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

Up to 1.61x higher RELION Plasmodium Ribosome performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen

3rd Generation Intel® Xeon® Platinum processor

New: 8380: 1-node, 2x Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x055261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96 . Tested by Intel between March 12, 2021 and March 29, 2021.

Baseline: 8280: 1-node, 2x Intel Xeon Platinum 8280 (28C/2.7GHz, 205W TDP) processor on Intel Software Development Platform with 192GB (12 slots/ 16GB/ 2933) total DDR4 memory, ucode 0x4002f01, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG48 . Tested by Intel between February 1, 2021 to February 20, 2021.

1.53x higher HPC performance (geomean HPL, HPCG, STREAM Triad, WRF, Binomial Options, Black Scholes, Monte Carlo, OpenFOAM, GROMACS, LAMMPS, NAMD, RELION)

1.53x higher FSI Kernel performance (geomean Binomial Options, Black Scholes, Monte Carlo)

1.60x higher Life and Material Science performance (geomean GROMACS, LAMMPS, NAMD, RELION)

1.41x higher HPCG performance App Version: 2019u5 MKL; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel MPI 2019u8; threads/core: 1; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512

1.38x higher HPL performance App Version: The Intel Distribution for LINPACK Benchmark 2019u5; Build notes: threads/core: 1; Turbo: used; Build: build script from Intel Distribution for LINPACK package; 1 rank per NUMA node: 1 rank per socket

1.47x higher STREAM Triad Performance App Version: McCalpin_​STREAM_​OMP-version; Build notes: Tools: Intel C Compiler 2019u5; threads/core: 1; Turbo: used; BIOS settings: HT=off Turbo=On SNC=On

1.58x higher WRF performance (geomean Conus-12km, Conus-2.5km, NWSC-3-NA-3km) App Version: 4.2.2; Build notes: Intel Fortran Compiler 2020u4, Intel MPI 2020u4; threads/core: 1; Turbo: used; Build knobs:-ip -w -O3 -xCORE-AVX2 -vec-threshold0 -ftz -align array64byte -qno-opt-dynamic-align -fno-alias $(FORMAT_​FREE) $(BYTESWAPIO) -fp-model fast=2 -fimf-use-svml=true -inline-max-size=12000 -inline-max-total-size=30000

1.28x higher Binomial Options performance App Version: v1.0; Build notes: Tools: Intel C Compiler 2020u4, Intel Threading Building Blocks ; threads/core: 2; Turbo: used; Build knobs: -O3 -xCORE-AVX512 -qopt-zmm-usage=high -fimf-domain-exclusion=31 -fimf-accuracy-bits=11 -no-prec-div -no-prec-sqrt

1.67x higher Black Scholes performance App Version: v1.3; Build notes: Tools: Intel MKL, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4; threads/core: 1; Turbo: used; Build knobs: -O3 -xCORE-AVX512 -qopt-zmm-usage=high -fimf-precision=low -fimf-domain-exclusion=31 -no-prec-div -no-prec-sqrt -fimf-domain-exclusion=31

1.70x higher Monte Carlo performance App Version: v1.1; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4; threads/core: 1; Turbo: used; Build knobs: -O3 -xCORE-AVX512 -qopt-zmm-usage=high -fimf-precision=low -fimf-domain-exclusion=31 -no-prec-div -no-prec-sqrt

1.51x higher OpenFOAM performance (geomean 20M_​cell_​motorbike, 42M_​cell_​motorbike) App Version: v8; Build notes: Tools: Intel FORTRAN Compiler 2020u4, Intel C Compiler 2020u4, Intel MPI 2019u8; threads/core: 1; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512

OpenFOAM Disclaimer: This offering is not approved or endorsed by OpenCFD Limited, producer and distributor of the OpenFOAM software via www.openfoam.com, and owner of the OPENFOAM® and OpenCFD® trademark

1.64x higher GROMACS performance (geomean ion_​channel_​pme, lignocellulose_​rf, water_​pme, water_​rf) App Version: v2020.5_​SP; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel MPI 2019u8; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512

1.60x higher LAMMPS performance (geomean Polyethylene, Stillinger-Weber, Tersoff, Water) App Version: v2020-10-29; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4, Intel MPI 2019u8; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512 -qopt-zmm-usage=high

1.57x higher NAMD performance (geomean Apoa1, f1atpase, STMV) App Version: 2.15-Alpha1 (includes AVX tiles algorithm); Build notes: Tools: Intel MKL, Intel C Compiler 2020u4, Intel MPI 2019u8, Intel Threading Building Blocks 2020u4; threads/core: 2; Turbo: used; Build knobs: -ip -fp-model fast=2 -no-prec-div -qoverride-limits -qopenmp-simd -O3 -xCORE-AVX512 -qopt-zmm-usage=high

1.61x higher RELION Plasmodium Ribosome performance App Version: 3_​1_​1; Build notes: Tools: Intel C Compiler 2020u4, Intel MPI 2019u9; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -g -debug inline-debug-info -xCOMMON-AVX512 -qopt-report=5 –restrict

HPC, HPCG, HPL, STREAM, WRF, FSI, Binomial Options, Black Scholes, Monte Carlo, OpenFOAM, Life and Material Science, GROMACS, LAMMPS, NAMD, RELION New: March 2021

Baseline: February 2021

[107]

1.54x higher NAMD STMV performance using the AVX Tile Algorithm on Platinum 8380 vs. without AVX Tiles

2.43x higher NAMD STMV performance using the AVX Tiles Alogrithm on Platinum 8380 vs. prior gen without AVX Tiles

1.57x higher NAMD STMV performance on Platinum 8380 vs. prior gen without AVX Tiles

3rd Generation Intel® Xeon® Platinum processor

New: 8380: 1-node, 2x Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, Tested by Intel between March 12, 2021 and March 29, 2021.

Baseline: 8280: 1-node, 2x Intel Xeon Platinum 8280 (28C/2.7GHz, 205W TDP) processor on Intel Software Development Platform with 192GB (12 slots/ 16GB/ 2933) total DDR4 memory, ucode 0x4002f01, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG48 .Tested by Intel between February 1, 2021 to February 20, 2021

1.54x higher performance on NAMD STMV from using the AVX Tiles Algorithm vs. without AVX Tiles

2.43x higher performance on NAMD STMV with AVX Tiles Algorithm vs. prior gen without AVX Tile

1.57x higher performance on NAMD STMV without AVX Tiles Algorithm vs. prior gen

NAMD with Tiles: App Version: 2.15-Alpha1 (includes AVX tiles algorithm); Build notes: Tools: Intel MKL, Intel C Compiler 2020u4, Intel MPI 2019u8, Intel Threading Building Blocks 2020u4; threads/core: 2; Turbo: used; Build knobs: -ip -fp-model fast=2 -no-prec-div -qoverride-limits -qopenmp-simd -O3 -xCORE-AVX512 -qopt-zmm-usage=high

NAMD without Tiles: App Version: 2.15-Alpha1 (built without AVX tiles algorithm); Build notes: Tools: Intel MKL, Intel C Compiler 2020u4, Intel MPI 2019u8, Intel Threading Building Blocks 2020u4; threads/core: 2; Turbo: used; Build knobs: -ip -fp-model fast=2 -no-prec-div -qoverride-limits -qopenmp-simd -O3 -xCORE-AVX512 -qopt-zmm-usage=high -DNAMD_​KNL

tested by Intel and results as of March 2021

NAMD 3rd Gen Intel Xeon and 2nd Gen Intel Xeon claims for demo New: March 2021

Baseline: February 2021

[106] 1.42x higher OpenFOAM performance on Gold 6354 vs. Gold 6154 3rd Generation Intel® Xeon® Platinum processor

1.42x higher performance on OpenFOAM Motorbike 42M

New: 6354: 1-node, 2x Intel Xeon Gold 6354 (18C/3.0GHz, 205W TDP) processor on Intel Software Development Platform with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3, 4.18.0-240.10.1.el8_​3.x86_​64, 1x Intel_​SSDSC2KG96 .Tested by Intel between March 12, 2021 and March 29, 2021.

Baseline: 6154: 1-node, 2x Intel Xeon Gold 6154 (18C/3.0GHz, 200W TDP) processor on Intel Software Development Platform with 192GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x2006a0a, HT on, Turbo on, CentOS Linux 8.3, 4.18.0-240.10.1.el8_​3.x86_​64, 1x Intel_​SSDSC2KG96 . Tested by Intel between February 1, 2021 to February 20, 2021

App Version: v8; Build notes: Tools: Intel FORTRAN Compiler 2020u4, Intel C Compiler 2020u4, Intel MPI 2019u8; threads/core: 1; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512

tested by Intel and results as of March 2021

OpenFOAM Disclaimer: This offering is not approved or endorsed by OpenCFD Limited, producer and distributor of the OpenFOAM

OpenFOAM for Oracle New: March 2021

Baseline: February 2021

[105] up to 1.52x higher manufacturing performance on 3rd Gen Intel Xeon Scalable platform vs. prior gen 3rd Generation Intel® Xeon® Platinum processor

1.52x higher manufacturing performance (geomean Altair RADIOSS, Ansys Fluent, Ansys LS-DYNA, Converge, Numeca, OpenFOAM)

New: 8380: 1-node, 2x Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96 .Tested by Intel between March 12, 2021 and March 29, 2021.

Baseline: 8280: 1-node, 2x Intel Xeon Platinum 8280 (28C/2.7GHz, 205W TDP) processor on Intel Software Development Platform with 192GB (12 slots/ 16GB/ 2933) total DDR4 memory, ucode 0x4002f01, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG48. Tested by Intel between February 1, 2021 to February 20, 2021.

1.47x higher Altair RADIOSS performance (geomean Neon1M/80ms, T10M/8ms) App Version: 2020; Build notes: Tools: Intel FORTRAN Compiler 2021u1, Intel C Compiler 2021u1, Intel MPI 2021u1; threads/core: 1; Turbo: used;

1.54x higher Ansys Fluent performance (geomean aircraft_​wing_​14m. aircraft_​wing_​2m, combustor_​12m, combustor_​16m, combustor_​71m, exhaust_​system_​33m, fluidized_​bed_​2m, ice_​2m, landing_​gear_​15m, oil_​rig_​7m, pump_​2m, rotor_​3m, sedan_​4m) App Version: 2021 R1; Build notes: One thread per core; Multi-threading Enabled; Turbo Boost Enabled; Intel FORTRAN Compiler 19.5.0; Intel C/C++ Compiler 19.5.0; Intel Math Kernel Library 2020.0.0; Intel MPI Library 2019 Update 8

1.48x higher Ansys LS-DYNA performance (geomean 3cars-150ms, car2car-120ms, ODB_​10M-30ms) App Version: R11; Build notes: Tools: Intel Compiler 2019u5 (AVX512), Intel MPI 2019u9; threads/core: 1; Turbo: used

1.52x higher Converge SI8_​engine_​PFI_​SAGE_​transient_​RAN performance App Version: 3.0.17; Build notes: Tools: Intel MPI 2019u9; threads/core: 1; Turbo: used; 3.0.17 Converge official converge-intelmp binary

1.61x higher Numeca performance (geomean FO_​hpcc_​single_​passage, FT_​hpcc_​single_​passage) FineOpen App Version: v10.1; Build notes: Tools: Customer pre-built binaries (Intel Fortran Compiler 2019, Intel C Compiler 2015), Intel MPI 2019u9; threads/core:1 ; Turbo: used; Build knobs: Fortran = -O2 -fp-model precise, C = -O2 -fPIC -pipe -Wno-deprecated -Wreturn-type -fp-model precise -std=c++11 FineTurbo App Version: v15.1; Build notes: Tools: Customer pre-built binaries (Intel Fortran Compiler 2015, Intel C Compiler 2015), Intel MPI 2019u4; threads/core:1 ; Turbo: used; Build knobs: Fortran = -O2 -fp-model precise, C = -O2 -fPIC -pipe -Wno-deprecated -Wreturn-type -fp-model precise -std=c++11

1.51x higher OpenFOAM performance (geomean 20M_​cell_​motorbike, 42M_​cell_​motorbike) App Version: v8; Build notes: Tools: Intel FORTRAN Compiler 2020u4, Intel C Compiler 2020u4, Intel MPI 2019u8; threads/core: 1; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512

OpenFOAM Disclaimer: This offering is not approved or endorsed by OpenCFD Limited, producer and distributor of the OpenFOAM

tested by Intel and results as of March 2021

Manufacturing New: March 2021

Baseline: February 2021

[99] 3rd Gen Intel Xeon Platinum 8380 processor delivers up to 1.65x higher performance on cloud data analytics usage vs. prior generation platform enabling faster business decisions. 3rd Generation Intel® Xeon® Platinum processor 1.65x higher responses with CloudXPRT - Data Analytics: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic​, 1x S4610 SSD 960G, CloudXPRT v1.0, Data Analytics (Analytics per minute @ p.95 <= 90s), test by Intel on 3/12/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic​, 1x S3520 SSD 480G, CloudXPRT v1.0, test by Intel on 2/4/2021. Intel contributes to the development of benchmarks by participating in, sponsoring, and/or contributing technical support to various benchmarking groups, including the BenchmarkXPRT Development Community administered by Principled Technologies. CloudXPRT Data Analytics New: March 12, 2021

Baseline: Feb 04, 2021

[98,97,81] over 50% higher performance on latency sensitive workloads such as database, e-commerce, and web server applications with 3rd Gen Intel Xeon Scalable platform 3rd Generation Intel® Xeon® Platinum processor Geomean of (HammerDB MySQL, Server Side Java, Wordpress with HTTPS)

1.64x HammerDB MySQL: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Redhat 8.3, 4.18.0-240.el8.x86_​64 x86_​64, 1x Intel SSD 960GB OS Drive, 1x Intel P5800 1.6T, x Onboard 1G/s, HammerDB 4.0, MySQL 8.0.22, test by Intel on 3/11/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Redhat 8.3, 4.18.0-240.el8.x86_​64 x86_​64, 1x Intel 240GB SSD OS Drive, 1x Intel 6.4T P4610, x Onboard 1G/s, HammerDB 4.0, MySQL 8.0.22, test by Intel on 2/5/2021.

1.6x higher throughput under SLA and 1.4x higher throughput for Server Side Java: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Ubuntu 20.04.1 LTS, 5.4.0-64-generic, 1x SSDSC2BA40, Java workload, JDK 1.15.0.1, test by Intel on 3/15/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04.1 LTS, 5.4.0-64-generic, 1x INTEL_​SSDSC2KG01, Java workload, JDK 1.15.0.1, test by Intel on 2/18/2021.

1.48x higher responses on Wordpress with HTTPS: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic, 1x Intel 895GB SSDSC2KG96, 1x XL710-Q2, Wordpress 4.2 with HTTPS, gcc 9.3.0, GLIBC 2.31-0ubuntu9.1, mysqld Ver 10.3.25-MariaDB-0ubuntu0.20.04.1, PHP 7.4.9-dev (fpm-fcgi), Zend Engine v3.4.0, test by Intel on 3/15/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic, 1x Intel 1.8T SSDSC2KG01, 1x Intel X722, test by Intel on 2/5/2021.

geomean of MySQL DB, Server Side Java, Wordpress New: March 15, 2021

Baseline: Feb 05,2021

[98] 3rd Gen Intel Xeon Platinum 8380 processor delivers 1.58x higher performance on cloud microservices usage vs. prior generation platform enabling faster business decisions 3rd Generation Intel® Xeon® Platinum processor 1.58x higher responses with CloudXPRT Web Microservices: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic​, 1x S4610 SSD 960G, CloudXPRT v1.0, Web Microservices (Requests per minute @ p.95 latency <= 3s), test by Intel on 3/12/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04, 5.4.0-54-generic, 1x S3520 SSD 480G, CloudXPRT v1.0, test by Intel on 2/4/2021. Intel contributes to the development of benchmarks by participating in, sponsoring, and/or contributing technical support to various benchmarking groups, including the BenchmarkXPRT Development Community administered by Principled Technologies. CloudXPRT Web Microservices New: March 12, 2021

Baseline: Feb 04, 2021

[97] 3rd Gen Intel Xeon Platinum 8380 processor can process up to 1.48x higher secure requests to content management system vs. prior generation platform 3rd Generation Intel® Xeon® Platinum processor 1.48x higher responses on Wordpress with HTTPS: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic, 1x Intel 895GB SSDSC2KG96, 1x XL710-Q2, Wordpress 4.2 with HTTPS, gcc 9.3.0, GLIBC 2.31-0ubuntu9.1, mysqld Ver 10.3.25-MariaDB-0ubuntu0.20.04.1, PHP 7.4.9-dev (fpm-fcgi), Zend Engine v3.4.0, test by Intel on 3/15/2021. Baseline:Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic, 1x Intel 1.8T SSDSC2KG01, 1x Intel X722, test by Intel on 2/5/2021. Wordpress New: March 15, 2021

Baseline: Feb 05, 2021

[96] Up to 1.6x higher Server Side JAVA throughput performance within a given SLA with 3rd Gen Intel® Xeon® Platinum 8380 processor vs. prior generation platform.

Up to 1.4x higher Server Side JAVA throughput performance with 3rd Gen Intel® Xeon® Platinum 8380 processor vs. prior generation platform

3rd Generation Intel® Xeon® Platinum processor 1.6x higher throughput under SLA and 1.4x higher throughput for Server Side Java: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Ubuntu 20.04.1 LTS, 5.4.0-64-generic, 1x SSDSC2BA40, Java workload, JDK 1.15.0.1, test by Intel on 3/15/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04.1 LTS, 5.4.0-64-generic, 1x INTEL_​SSDSC2KG01, Java workload, JDK 1.15.0.1, test by Intel on 2/18/2021. Server Side JAVA New: March 15, 2021

Baseline: Feb 18, 2021

[91,92] 1.62x average performance gains across network and communications workloads on 3rd Gen Intel Xeon Scalable "N" processors and Intel Ethernet 800 series compared to prior generation platform

With Intel® 3rd Gen Xeon® Scalable processors, CoSP's can achieve up to 21% boost in vBNG performance, while enabling increased flexibility for fixed and mobile convergence, manageability and scalability to expand use cases to address both mobile and broadband workloads.

With Intel® 3rd Gen Xeon® Scalable processors, CoSP's can increase 5G UPF performance by 42%. Combined with Intel Ethernet 800 series adapters, they can deliver the performance, efficiency and trust for use cases that require low latency, including augmented reality, cloud-based gaming, discrete automation and even robotic-aided surgery.

With Intel® 3rd Gen Xeon® Scalable processors and the latest Intel® Optane™ Persistent Memory you can get up to 63% higher throughput and 33% more memory capacity, enabling you to serve the same number of subscribers at higher resolution or a greater number of subscribers at the same resolution.

With Intel® 3rd Gen Xeon® Scalable processors, you can support up to 94% more secure networking connections and achieve significantly faster speeds to support cloud, edge and work-from-home use cases.

With the higher core performance and new crypto acceleration of Intel® 3rd Gen Xeon® Scalable processors, CoSP's can achieve 72% better CMTS platform performance. Additional QAT offload can add up to another 10% boost.

With Intel® 3rd Gen Xeon® Scalable processors, enhance Vector Packet Processing - Forward Information Base performance by 66% vs. the prior generation.

With Intel® 3rd Gen Xeon® Scalable processors, enhance DPDK L3 Forwarding performance by 88% vs. the prior generation.

With Intel® 3rd Gen Intel® Xeon® Scalable processors, Ethernet 800 series and vRAN dedicated accelerators, CoSP's can get to 2x Massive MIMO throughput in a similar power envelope for a best-in-class 3x100mhz 64T64R configuration.

3rd Generation Intel® Xeon® Platinum processor

& Intel® Ethernet 800 Series Network Adapters

1.62x average network performance gains: geomean of Virtual Broadband Network Gateway, 5G User Plane Function, Virtual Cable Modem Termination System, Vector Packet Processing - Forward Information Base 512B, DPDK L3 Forward 512B, CDN-Live, Vector Packet Processing - IP Security 1420B.

1.2x  Virtual Broadband Network Gateway: New: Gold 6338N: 1-node, 2(1 socket used)x Intel Xeon Gold 6338N on Intel* Whitley with 256 GB (16 slots/ 16GB/ 2666) total DDR4 memory, ucode 0x261, HT on, Turbo off, Ubuntu 20.04 LTS (Focal Fossa)​, 5.4.0-40-generic, 1x INTEL* 240G SSD , 3x E810-CQDA2 (Tacoma Rapids), vBNG 20.07, Gcc 9.3.0​, test by Intel on 3/11/2021. Baseline: Gold 6252N: 1-node, 2(1 socket used)x Intel Xeon Gold 6252N on SuperMicro* X11DPG-QT with 192 GB (12 slots/ 16GB/ 2933)  total DDR4 memory, ucode 0x5002f01, HT on, Turbo off, Ubuntu 20.04 LTS (Focal Fossa)​, 5.4.0-40-generic, 1x INTEL* 240G SSD , 3x E810-CQDA2 (Tacoma Rapids), vBNG 20.07, Gcc 9.3.0​,  test by Intel on 2/2/2021.

1.42x 5G User Plane Function: New:1-node, 2(1 socket used)x Intel Xeon Gold 6338N on Whitley Coyote Pass 2U  with 128 GB (8 slots/ 16GB/ 2666)  total DDR4 memory, ucode 0x261, HT on, Turbo off, Ubuntu 18.04.5 LTS, 4.15.0-134-generic, 1x Intel 810 (Columbiaville), FlexCore 5G UPF, Jan’ 2021​ MD5 checksum: c4ad7f8422298ceb69d01e67419ff1c1, GCC 7.5.0, 5G UPF228 Gbps / 294 Gbps,  test by Intel on 3/16/2021. Baseline:1-node, 2(1 socket used)x Intel Xeon Gold 6252N on SuperMicro* X11DPG-QT with 96 GB (6 slots/ 16GB/ 2934)  total DDR4 memory, ucode 0x5003003, HT on, Turbo off, Ubuntu 18.04.5 LTS, 4.15.0-132-generic, 1x Intel 810 (Columbiaville), FlexCore 5G UPF, Jan’ 2021  MD5 checksum: c4ad7f8422298ceb69d01e67419ff1c1, GCC 7.5.0, 5G UPF161 Gbps / 213 Gbps,  test by Intel on 2/12/2021.

1.63x CDN Live: New:1 node, 2x Intel® Xeon® Gold 6338N Processor, 32 core HT ON Turbo ON, Total DRAM 256GB (16 slots/16GB/2666MT/s), Total Optane Persistent Memory 200 Series 2048GB (16 slots/128GB/2666MT/s), BIOS SE5C6200.86B.2021.D40.2103100308 (ucode: 0x261), 4x Intel® E810, Ubuntu 20.04, kernel 5.4.0-65-generic, gcc 9.3.0 compiler, openssl 1.1.1h, varnish-plus 6.0.7r2. 2 clients, Test by Intel as of 3/11/2021. Baseline: Gold 6252N: 2x Intel® Xeon® Gold 6252N Processor, 24 core HT ON Turbo ON, Total DRAM 192GB (12 slots/16GB/2666MT/s), Total Optane Persistent Memory 100 Series 1536GB(12 slots/128GB/2666MT/s), 1x Mellanox MCX516A-CCAT, BIOS: SE5C620.86B.02.01.0013.121520200651 (ucode: 0x5003003), Ubuntu 20.04, kernel 5.4.0-65-generic, wrk master 4/17/2019. Test by Intel as of 2/15/2021. Throughput measured with 100% Transport Layer Security (TLS) traffic with 93.3% target cache hit ratio and keep alive on, 512 total connections.

1.94x Vector Packet Processing - IP Security 1420B: New: 1-node, 2(1 socket used)x Intel Xeon Gold 6338N on Intel* Whitley with 128 GB (8 slots/ 16GB/ 2666)  total DDR4 memory, ucode 0x261, HT on, Turbo off, Ubuntu 20.04 LTS (Focal Fossa)​, 5.4.0-40-generic, 1x INTEL* 240G SSD , 1x E810-2CQDA2 (Chapman Beach), v21.01-release, Gcc 9.3.0​, VPPIPSEC(24c24t) test by Intel on 3/17/2021 .Baseline: 1-node, 2(1 socket used)x Intel Xeon Gold 6252N on SuperMicro* X11DPG-QT with 96 GB (6 slots/ 16GB/ 2933)  total DDR4 memory, ucode 0x5002f01, HT off, Turbo off, Ubuntu 20.04 LTS (Focal Fossa)​, 5.4.0-40-generic, 1x INTEL* 240G SSD , 1x E810-CQDA2 (Tacoma Rapids), v21.01-release, Gcc 9.3.0​,  VPPIPSEC(18c18t) test by Intel on 2/2/2021.

1.72x Virtual Cable Modem Termination System: New: Gold 6338N: 1-node, 2(1 socket used)x Intel Xeon Gold 6338N on Coyote Pass with 256 GB (16 slots/ 16GB/ 2666) total DDR4 memory, ucode 0x261, HT on, Turbo off(no SST-BF)/on(SST-BF), Ubuntu 20.04 LTS (Focal Fossa)​, 5.4.0-40-generic, 1x INTEL* 240G SSD , 3x E810-CQDA2 (Tacoma Rapids), vCMTS 20.10​, Gcc 9.3.0​, SST-BF (2.4 Ghz,1.9 Ghz frequencies for the priority cores and the other cores respectively ), test by Intel on 3/11/2021. Baseline: Gold 6252N: 1-node, 2(1 socket used)x Intel Xeon Gold 6252N on SuperMicro* X11DPG-QT with 192 GB (12 slots/ 16GB/ 2933)  total DDR4 memory, ucode 0x5002f01, HT on, Turbo off, Ubuntu 20.04 LTS (Focal Fossa)​, 5.4.0-40-generic, 1x INTEL* 240G SSD , 2x E810-CQDA2 (Tacoma Rapids), vCMTS 20.10​, Gcc 9.3.0​, vCMTS90 (14 instances),  test by Intel on 2/2/2021.

1.66 Vector Packet Processing - Forward Information Base 512B: New: 1-node, 2(1 socket used)x Intel Xeon Gold 6338N on Intel* Whitley with 128 GB (8 slots/ 16GB/ 2666)  total DDR4 memory, ucode 0x261, HT on, Turbo off, Ubuntu 20.04 LTS (Focal Fossa)​, 5.4.0-40-generic, 1x INTEL* 240G SSD , 1x E810-2CQDA2 (Chapman Beach), v20.05.1-release, Gcc 9.3.0​, VPPFIB(24c24t)​,  test by Intel on 3/17/2021. Baseline: 1-node, 2(1 socket used)x Intel Xeon Gold 6252N on SuperMicro* X11DPG-QT with 96 GB (6 slots/ 16GB/ 2933)  total DDR4 memory, ucode 0x5002f01, HT off, Turbo off, Ubuntu 20.04 LTS (Focal Fossa)​, 5.4.0-40-generic, 1x INTEL* 240G SSD , 1x E810-CQDA2 (Tacoma Rapids), v20.05.1-release, Gcc 9.3.0​, VPPFIB (18c18t)​,  test by Intel on 2/2/2021.

1.88x DPDK L3 Forward 512B: New: 1-node, 2(1 socket used)x Intel Xeon Gold 6338N on Intel* Whitley with 128 GB (8 slots/ 16GB/ 2666)  total DDR4 memory, ucode 0x261, HT on, Turbo off, Ubuntu 20.04 LTS (Focal Fossa)​, 5.4.0-40-generic, 1x INTEL* 240G SSD , 1x E810-2CQDA2 (Chapman Beach), v20.08.0, Gcc 9.3.0​, DPDKL3FWD (24c24t), test by Intel on 3/17/2021, Baseline: 2(1 socket used)x Intel Xeon Gold 6252N on SuperMicro* X11DPG-QT with 96 GB (6 slots/ 16GB/ 2933)  total DDR4 memory, ucode 0x5002f01, HT off, Turbo off, Ubuntu 20.04 LTS (Focal Fossa)​, 5.4.0-40-generic, 1x INTEL* 240G SSD , 1x E810-CQDA2 (Tacoma Rapids), v20.08.0, Gcc 9.3.0​​, DPDKL3FWD (12c12t),  test by Intel on 2/2/2021.

FlexRAN: 2x MIMO Throughput: Results have been estimated or simulated. Based on 2x estimated throughput from 32Tx32R (5Gbps) on 2nd Gen Intel® Xeon® Gold 6212U processor to 64Tx64R (10Gbps) on 3rd Gen Intel Xeon Gold 6338N processor at similar power ~185W.

geomean of Virtual Broadband Network Gateway, 5G User Plane Function, CDN Video-on-Demand, Virtual Cable Modem Termination System, Vector Packet Processing - Forward Information Base 512B, DPDK L3 Forward 512B, CDN-Live, Vector Packet Processing - IP Security 1420B.

FlexRAN

New: March 17, 2021

Baseline: Feb 02, 2021

[90] Up to 4.2x more TSL encrypted web server connections per second with NGINX on 3rd Gen Intel Xeon Scalable processor with built in enhanced crypto acceleration and E810 compared to prior generation platform. 3rd Generation Intel® Xeon® Platinum processor 4.2x NGINX (TLS 1.2 Handshake) web server connections/sec with ECDHE-X25519-RSA2K Multi-buffer: New: 1-node, 2x Intel® Xeon® Gold 6338N processor on Coyote Pass with 256 GB (16 slots/ 16GB/ 2666)  total DDR4 memory, ucode x261, HT on, Turbo off, Ubuntu 20.04.1 LTS, 5.4.0-65-generic, x 3 x Quad Ethernet Controller E810-C for SFP 25 GBE, Async NGINX v0.4.3, OpenSSL 1.1.1h, QAT Engine v0.6.4, Crypto MB-ippcp_​2020u3, GCC 9.3.0, GLIBC 2.31,  test by Intel on 3/22/2021. Baseline: 1-node, 2x Intel® Xeon® Gold 6252N processor on Supermicro X11DPG-QT with 192 GB (12 slots/ 16GB/ 2933)  total DDR4 memory, ucode 0x5003003, HT on, Turbo off, Ubuntu 20.04.1 LTS, 5.4.0-65-generic, x 2 x Quad Ethernet Controller XXV710 for 25GbE SFP28, 1 x Dual Ethernet Controller XXV710 for 25GbE SFP28, Async NGINX v0.4.3, OpenSSL 1.1.1h, GCC 9.3.0, GLIBC 2.31,  test by Intel on 1/17/2021. NGINX v0.4.3, OpenSSL 1.1.1h New: March 22, 2021

Baseline: January 17,2021

[84] Up to 1.72x higher virtualization performance with 3rd Gen Intel® Xeon® Scalable processor with Intel® SSD D5-P5510 Series and Intel® Ethernet Network Adapter E810 vs. prior generation platform 3rd Generation Intel® Xeon® Platinum processor 1.72x higher virtualization performance vs. prior generation: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 2048 GB (32 slots/ 64GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, RedHat 8.3, 4.18.0-240.el8.x86_​64, 1x S4610 SSD 960G, 4x P5510 3.84TB NVME, 2x Intel E810, Virtualization workload, Qemu-kvm 4.2.0-34 (inbox), WebSphere 8.5.5, DB2 v9.7, Nginx 1.14.1, test by Intel on 3/14/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 1536 GB (24 slots/ 64GB/ 2933[2666]) total DDR4 memory, ucode 0x5003005, HT on, Turbo on, RedHat 8.1 (Note: selected higher of RedHat 8.1 and 8.3 scores for baseline), 4.18.0-147.el8.x86_​64, 1x S4510 SSD 240G, 4x P4610 3.2TB NVME, 2x Intel XL710, Virtualization workload, Qemu-kvm 4.2.0-34 (inbox), WebSphere 8.5.5, DB2 v9.7, Nginx 1.14.1, test by Intel on 12/22/2020.

Virtualization workload

New: March 14, 2021

Baseline: Dec 22,2020

[83] Process up to 1.55x higher transactions per minute with the 3rd Gen Intel Xeon Platinum 8380 processor and Intel® Optane™ SSD P5800X series vs. prior generation platform 3rd Generation Intel® Xeon® Platinum processor 1.55x higher Transactions on OLTP Database: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Redhat 8.3, 4.18.0-240.el8.x86_​64 x86_​64, 1x Intel SSD 960GB OS Drive, 4x Intel® Optane™ SSD P5800X Series 1.6T (2xDATA, 2XREDO), x Onboard 1G/s, HammerDB 4.0, Oracle 19c, test by Intel on 3/16/2021. Baseline: Platinum 8280 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Redhat 8.3, 4.18.0-240.el8.x86_​64 x86_​64, 1x Intel 240GB SSD OS Drive, 4x Intel 3.2T P4610 (2xDATA, 2xREDO), x Onboard 1G/s, HammerDB 4.0, Oracle 19c, test by Intel on 11/30/2020. HammerDB OLTP w/Oracle New: March 16, 2021

Baseline: Nov 30,2020

[82] Support your growing business needs with the new 3rd Gen Intel® Xeon® Scalable Platform and realize up to 1.53x higher OLTP database transactions on Microsoft SQL Server compared to prior generation 3rd Generation Intel® Xeon® Platinum processor 1.53x higher OLTP brokerage performance: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Wilson City with 1536 GB (24 slots/ 64GB/ 2933) total DDR4 memory, ucode 0x261, HT on, Turbo on, Windows Server 2019, 10.0.17763 Build 17763.1339, 1x Intel 1.6TB SSD OS Drive, db & logx (69x Intel SSD D3-S4510 (960GB), 5x Intel SSD D3 S4510 (960GB), 28x Intel SSD DC S4600 (1.92TB), 8x Intel SSD D3-S4510 (960GB) ), 2x Intel X520-2 10GBASE-T, OLTP brokerage, Microsoft SQL Server 2019 RTM Cumulative Update 8, test by Intel on 3/10/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8380 processor on S2600WFT with 1536 GB (24 slots/ 64GB/ 2933[2666]) total DDR4 memory, ucode 0x00B001008D, HT on, Turbo on, Windows Server 2019, 10.0.17763 Build 17763.1339, 1x Intel 1.6TB SSD OS Drive, db & logx (69x Intel SSD D3-S4510 (960GB), 5x Intel SSD D3 S4510 (960GB), 28x Intel SSD DC S4600 (1.92TB), 8x Intel SSD D3-S4510 (960GB) ), 2x Intel X520-2 10GBASE-T, OLTP brokerage, Microsoft SQL Server 2019 RTM Cumulative Update 8, test by Intel on 2/6/21. OLTP brokerage w/Microsoft New: March 10, 2021

Baseline: Feb 06,2021

[81] Process up to 1.64x higher transactions per minute with the 3rd Gen Intel Xeon Platinum 8380 processor and Intel® Optane™ SSD P5800X Series vs. prior generation platform 3rd Generation Intel® Xeon® Platinum processor 1.64x HammerDB MySQL: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Redhat 8.3, 4.18.0-240.el8.x86_​64 x86_​64, 1x Intel SSD 960GB OS Drive, 1x Intel® Optane™ SSD P5800X Series 1.6T, x Onboard 1G/s, HammerDB 4.0, MySQL 8.0.22, test by Intel on 3/11/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Redhat 8.3, 4.18.0-240.el8.x86_​64 x86_​64, 1x Intel 240GB SSD OS Drive, 1x Intel 6.4T P4610, x Onboard 1G/s, HammerDB 4.0, MySQL 8.0.22, test by Intel on 2/5/2021. HammerdDB w/ MySQL New: March 11, 2021

Baseline: Feb 05,2021

[80] Up to 2.5x higher transactions on the new 3rd Gen Intel Xeon Scalable processor with Intel Optane Pmem 200 and Intel Ethernet E810 Network Adaptor running Aerospike with index and data in PMEM vs. prior generation platform

Up to 1.43x higher transactions on the new 3rd Gen Intel Xeon Scalable processor with Intel Optane Pmem 200, Intel P5510 SSD and Intel Ethernet E810 Network Adaptor running Aerospike with index in PMem and data in SSD vs. prior generation platform

3rd Generation Intel® Xeon® Platinum processor and Intel® Optane™ persistent memory 200 series . 2.5x higher transactions with Index+Data in PMem and 1.43x with Index(PMem)+Data(SSD) for Aerospike Database: New: Platinum 8368: 1-node, 2x Intel Xeon Platinum 8368 processor on Coyote Pass with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, 8192 GB (16 slots/ 512 GB/ 3200) total Pmem, ucode x261, HT on, Turbo on, CentOS 8.3.2011, 4.18.0-193.el8.x86_​64, 1x Intel 960GB SSD, 7x P5510 3.84TB, 2x Intel E810-C 100Gb/s, Aerospike Enterprise Edition 5.5.0.2; Aerospike C Client 5.1.0 Benchmark Tool; 70R/30W. Dataset size: 1.1TB, 9.3 billion 64B records, PMDK libpmem, Index (Pmem)+data (SSD) and Index+data (Pmem), test by Intel on 3/16/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280L processor on Wolf Pass with 768 GB (12 slots/ 64GB/ 2666) total DDR4 memory, 3072 GB (12 slots/ 256 GB/ 2666) total Pmem, ucode 0x5003003, HT on, Turbo on, CentOS 8.3.2011, 4.18.0-193.el8.x86_​64, 7x P4510 1.8TB PCIe 3. 1, 2x Intel XL710 40Gb/s, Aerospike Enterprise Edition 5.5.0.2; Aerospike C Client 5.1.0 Benchmark Tool; 70R/30W. Dataset size: 1.1TB, 9.3 billion 64B records, PMDK libpmem, Index (Pmem)+data (SSD), test by Intel on 3/16/2021. Aerospike New: March 16, 2021

Baseline: March 16,2021

[79] Up to 1.41x faster performance for Online Analytical Processing workloads running with Microsoft SQL Server 2019 on the new 3rd Gen Intel® Xeon® Scalable Platform compared to prior generation 3rd Generation Intel® Xeon® Platinum processor 1.41x higher OLAP Decision Support: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Wilson City with 2048 GB (32 slots/ 64GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Windows Server 2019, 17763.rs5_​release.180914-1434, 1x Intel 200GB SSD OS Drive, 2x P4608 6.4TB PCIe NVME, 1x Intel X520-2, OLAP workload (3TB dataset), Microsoft SQL Server 2019 RTM Cumulative Update 8, test by Intel on 3/10/2021. Baseline: Platinum 8280: 1-node, 2x Intel Xeon Platinum 8280L processor on Wolf Pass with 1536 GB (24 slots/ 64GB/ 2933[2666]) total DDR4 memory, ucode 0x003300005, HT on, Turbo on, Windows Server 2019, 17763.rs5_​release.180914-1434, 1x Intel 200GB SSD OS Drive, 2x P4608 6.4TB PCIe NVME, 1x Intel X520-2, OLAP workload (3TB dataset), Microsoft SQL Server 2019 RTM Cumulative Update 8, test by Intel on 1/31/21. Decision Support New: March 10,2021

Baseline: Jan 31, 2021

[71] 3.34x higher IPSec AES-GCM performance,3.78x higher IPSec AES-CMAC performance,3.84x higher IPSec AES-CTR performance,1.5x higher IPSec ZUC performance on 3rd Gen Intel® Xeon® Platinum 8380 processor 3rd Generation Intel® Xeon® Platinum processor 3.34x higher IPSec AES-GCM performance,3.78x higher IPSec AES-CMAC performance,3.84x higher IPSec AES-CTR performance,1.5x higher IPSec ZUC performance: New: 8380: 1-node, 2x Intel® Xeon® Platinum 8380 CPU on M50CYP2SB2U with 512 GB GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x8d055260, HT On, Turbo Off, Ubuntu 20.04.2 LTS, 5.4.0-66-generic, 1x Intel 1.8TB SSD OS Drive, intel-ipsec-mb v0.55, gcc 9.3.0, Glibc 2.31, test by Intel on 3/17/2021. Baseline: 8280M: 1-node, 2x Intel® Xeon® Platinum 8280M CPU on S2600WFT with 384 GB GB (12 slots/ 32GB/ 2934) total DDR4 memory, ucode 0x4003003, HT On, Turbo Off, Ubuntu 20.04.2 LTS, 5.4.0-66-generic, 1x Intel 1.8TB SSD OS Drive, intel-ipsec-mb v0.55, gcc 9.3.0, Glibc 2.31, test by Intel on 3/8/2021. Crypto New: March 17, 2021

Baseline: March 08, 2021

[70]5.63x higher OpenSSL RSA Sign 2048 performance,1.90x higher OpenSSL ECDSA Sign p256 performance,4.12x higher OpenSSL ECDHE x25519 performance,2.73x higher OpenSSL ECDHE p256 performance 3rd Gen Intel® Xeon® Platinum 8380 processor 3rd Generation Intel® Xeon® Platinum processor 5.63x higher OpenSSL RSA Sign 2048 performance,1.90x higher OpenSSL ECDSA Sign p256 performance,4.12x higher OpenSSL ECDHE x25519 performance,2.73x higher OpenSSL ECDHE p256 performance, New: 8380: 1-node, 2x Intel® Xeon® Platinum 8380 CPU on M50CYP2SB2U with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xd000270, HT On, Turbo Off, Ubuntu 20.04.1 LTS, 5.4.0-65-generic, 1x INTEL_​SSDSC2KG01, OpenSSL 1.1.1j, GCC 9.3.0, QAT Engine v0.6.4, test by Intel on 3/24/2021. 8380: 1-node, 2x Intel® Xeon® Platinum 8380 CPU on M50CYP2SB2U with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xd000270, HT On, Turbo Off, Ubuntu 20.04.1 LTS, 5.4.0-65-generic, 1x INTEL_​SSDSC2KG01, OpenSSL 1.1.1j, GCC 9.3.0, QAT Engine v0.6.5, test by Intel on 3/24/2021. Baseline: 8280M:1-node, 2x Intel® Xeon® Platinum 8280M CPU on S2600WFT with 384 GB (12 slots/ 32GB/ 2934) total DDR4 memory, ucode 0x5003003, HT On, Turbo Off, Ubuntu 20.04.1 LTS, 5.4.0-65-generic, 1x INTEL_​SSDSC2KG01, OpenSSL 1.1.1j, GCC 9.3.0, test by Intel on 3/5/2021. Crypto New: March 24, 2021

Baseline: March 05, 2021

[69] Up to 1.15x higher Compression performance, up to 1.09x Hashing, 2.3x Data Integrity, up to 3.9x Encryption performance on 3rd Gen Intel Xeon Platinum 8380 vs. prior generation 3rd Generation Intel® Xeon® Platinum processor ISA-L: New: 1-node, 2x Intel® Xeon® Platinum 8380 Processor, 40 cores HT On Turbo OFF Total Memory 512 GB (16 slots/ 32GB/ 3200 MHz), Data protection (Reed Solomon EC (10+4)), Data integrity (CRC64), Hashing (Multibuffer MD5),Data encryption (AES-XTS 128 Expanded Key), Data Compression (Level 3 Compression (Calgary Corpus)), BIOS: SE5C6200.86B.3021.D40.2103160200 (ucode: 0x8d05a260), Ubuntu 20.04.2, 5.4.0-67-generic, gcc 9.3.0 compiler, yasm 1.3.0, nasm 2.14.02, isal 2.30, isal_​crypto 2.23, OpenSSL 1.1.1.i, zlib 1.2.11, Test by Intel as of 03/19/2021. Baseline: 1-node, 2x Intel® Xeon® Platinum 8280 Processor, 28 cores HT On Turbo OFF Total Memory 384 GB (12 slots/ 32GB/ 2933 MHz), BIOS: SE5C620.86B.02.01.0013.121520200651 (ucode:0x4003003), Ubuntu 20.04.2, 5.4.0-67-generic,, gcc 9.3.0 compiler, yasm 1.3.0, nasm 2.14.02, isal 2.30, isal_​crypto 2.23, OpenSSL 1.1.1.i, zlib 1.2.11 Test by Intel as of 2/9/2021. Gen on gen comparison based on cycle/Byte performance measured on single core. ISA-L New: March 19,2021

Baseline: February 09, 2021

[63]

Up to 3x higher 4KB Rand Read/Write 70/30 IOPS performance with 3rd Gen Intel Xeon® scalable platform supporting PCIe Gen4 Intel Optane™ SSDs vs. prior generation Intel Xeon® Scalable platform supporting Intel DC P4610 SSDs

Up to 1.37x higher 4KB Rand Read/Write 70/30 IOPS performance with 3rd Gen Intel Xeon Scalable platform with Intel® SSD DC 5510 series vs. prior generation Intel Xeon® Scalable platform supporting Intel DC P4610 SSDs

Up to 2.6x higher 4KB Random Read IOPS performance with 3rd Gen Intel Xeon® scalable platform supporting PCIe Gen4 Intel Optane™ SSDs vs. prior generation Intel Xeon® Scalable platform supporting Intel DC P4610 SSDs

Up to 1.5x higher 4KB Random Read IOPS performance with 3rd Gen Intel Xeon® scalable platform and Intel DC P5510 SSDs vs. prior generation Intel Xeon® Scalable platform supporting Intel DC P4610 SSDs

Upgrade to the latest 3rd Gen Intel Xeon Scalable family and Latest Intel® SSDs and benefit from significantly lower latency and enhanced performance Up to 15% Lower latency with Intel DC P5510 SSDs and up to 94% lower latency with Intel Optane™ SSDs

3rd Generation Intel® Xeon® Platinum processor and Intel® Optane™ persistent memory 200 series . Local IOPS: New:1-node, 2x Intel® Xeon® Platinum 8380 Processor, 40 cores HT On Turbo ON Total Memory 1024 GB (16 slots/ 64GB/ 3200 MHz), BIOS:SE5C6200.86B.2021.D40.2103100308 (ucode:0x261), Fedora 30, Linux Kernel 5.7.12, gcc 9.3.1 compiler, fio 3.20, SPDK 21.01, Storage: 16x Intel® SSD D7-P5510 7.68 TB (QD = 256) or 16x Intel® Optane™ SSD 800GB P5800X (QD = 128), Network: 2x 100GbE Intel E810-C, Test by Intel as of 3/17/2021. Baseline:1-node, 2x Intel® Xeon® Platinum 8280 Processor, 28 cores HT On Turbo ON Total Memory 768 GB (24 slots/ 32GB/ 2666 MHz), BIOS: SE5C620.86B.02.01.0013.121520200651 (ucode:0x4003003), Fedora 30, Linux Kernel 5.7.12, gcc 9.3.1 compiler, fio 3.20, SPDK 21.01, Storage: 16x Intel® SSD DC P4610 1.6TB, Network: 1x 100GbE Intel E810-C, Test by Intel as of 2/10/2021.  FIO 3.20 New: March 17,2021

Baseline: February 10, 2021

[62]

Up to 1.5x more 4KB Random Read IOPS/VM performance with 3rd Gen Intel Xeon® scalable platform supporting PCIe Gen4 Intel Optane™ SSDs vs. prior generation Intel Xeon® Scalable platform supporting Intel DC P4610 SSDs

Up to 1.3x more 4KB Random Read IOPS/VM performance with 3rd Gen Intel Xeon® scalable platform and Intel DC P5510 SSDs vs. prior generation Intel Xeon® Scalable platform supporting Intel DC P4610 SSDs

Up to 286K IOPS/VM on 3rd Gen Intel Xeon® scalable platform and PCIe Gen4 Intel Optane™ SSDs for 4KB Random Read vs. prior generation Intel Xeon® Scalable platform supporting Intel DC P4610 SSDs

Up to 192K IOPS/VM on 3rd Gen Intel Xeon® scalable platform and Intel DC P5510 SSDs for 4KB Random Read vs. prior generation Intel Xeon® Scalable platform supporting Intel DC P4610 SSDs

3rd Generation Intel® Xeon® Platinum processor and Intel® Optane™ persistent memory 200 series . Storage Virtualization: New:1-node, 2x Intel® Xeon® Platinum 8380 Processor, 40 cores HT On Turbo ON Total Memory 1024 GB (16 slots/ 64GB/ 3200 MHz BIOS:SE5C6200.86B.2021.D40.2103100308 (ucode:0x261), Fedora 30, Linux Kernel 5.7.12, gcc 9.3.1 compiler, fio 3.20, SPDK 21.01, Storage: 16x Intel® SSD D7-P5510 7.68 TB (QD = 256) or 16x Intel® Optane™ SSD 800GB P5800X (QD = 128), Network: 2x 100GbE Intel E810-C, Test by Intel as of 3/17/2021. Baseline: 1-node, 2x Intel® Xeon® Platinum 8280 Processor, 28 cores HT On Turbo ON Total Memory 768 GB (24 slots/ 32GB/ 2666 MHz), BIOS: SE5C620.86B.02.01.0013.121520200651 (ucode:0x4003003), Fedora 30, Linux Kernel 5.7.12, gcc 9.3.1 compiler, fio 3.20, SPDK 21.01, Storage: 16x Intel® SSD DC P4610 1.6TB, Network: 1x 100GbE Intel E810-C, Test by Intel as of 2/10/2021.  FIO 3.20 New: March 17,2021

Baseline: February 10, 2021

[61]

Up to 2.7x higher IOPS throughput (4K random 70R/30W) for NVMe-over-TCP with the 3rd Gen Intel Xeon Scalable platform with Intel® Optane™ SSD P5800X Series vs. prior generation Intel Xeon® Scalable platform supporting Intel DC P4610 SSDs

Up to 2.7x higher IOPS throughput (4K random 70R/30W) for NVMe-over-TCP with the 3rd Gen Intel Xeon Scalable platform with Intel® SSD DC 5510 series vs. prior generation Intel Xeon® Scalable platform supporting Intel DC P4610 SSDs

3rd Generation Intel® Xeon® Platinum processor & Intel® Optane™ persistent memory 200 series NVMe-over-TCP IOPS Throughput : New: 1-node, 2x Intel® Xeon® Platinum 8380 Processor, 40 cores HT On Turbo ON Total Memory 1024 GB (16 slots/ 64GB/ 3200 MHz), BIOS:SE5C6200.86B.2021.D40.2103100308 (ucode:0x261), Fedora 30, Linux Kernel 5.7.12, gcc 9.3.1 compiler, fio 3.20, SPDK 21.01, Storage: 16x Intel® SSD D7-P5510 7.68 TB (QD = 256) or 16x Intel® Optane™ SSD 800GB P5800X (QD = 128), Network: 2x 100GbE Intel E810-C, Test by Intel as of 3/17/2021. Baseline: 1-node, 2x Intel® Xeon® Platinum 8280 Processor, 28 cores HT On Turbo ON Total Memory 768 GB (24 slots/ 32GB/ 2666 MHz), BIOS: SE5C620.86B.02.01.0013.121520200651 (ucode:0x4003003), Fedora 30, Linux Kernel 5.7.12, gcc 9.3.1 compiler, fio 3.20, SPDK 21.01, Storage: 16x Intel® SSD DC P4610 1.6TB, Network: 1x 100GbE Intel E810-C, Test by Intel as of 2/10/2021. FIO 3.20 New: March 17,2021

Baseline: February 10, 2021

[60] Up to 1.54x higher IOPS throughput (4K random 70R/30W) for CEPH with the 3rd Gen Intel Xeon Scalable platform with Intel® Optane™ SSD DC 5800X series along with Intel® SSD DC 5510 serie vs. generation Intel Xeon® Scalable platform supporting Intel DC P4510 SSDs along with Intel SSD DC P4800X series 3rd Generation Intel® Xeon® Platinum processor 1.54x Ceph: New: 8368: 5-node, 2x Intel Xeon Platinum 8368 cpu on Coyote Pass with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x8d055260, HT on, Turbo on, RHEL 8.3, 4.18.0-240.10.1.el8_​3.x86_​64, 1x Intel SSD 535 256GB M.2, 6x Intel SSD DC P5510 3.84TB, 2x Intel SSD DC P5800X 400GB, 1x Intel E810-C 100GbE, FIO 3.19, 8.3.1 20191121 (Red Hat 8.3.1-5), Podman 2.0.5, Ceph Octopus 15.2.8, test by Intel on 3/16/2021. Baseline: 8280: 5-node, 2x Intel Xeon Platinum 8280 cpu on Wolf Pass with 192 GB (12 slots/ 16GB/ 2666) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, RHEL 8.3, 4.18.0-240.10.1.el8_​3.x86_​64, 1x Intel SSD DC S3700 200GB, 6x Intel SSD DC P4510 4TB, 2x Intel SSD DC P4800X 375GB, 2x Intel XXV710 2x25GbE (100GbE bond)​FIO 3.19, 8.3.1 20191121 (Red Hat 8.3.1-5), Podman 2.0.5, Ceph Octopus 15.2.8, test by Intel on 3/26/2021. CEPH New: March 16,2021

Baseline: March 26,2021

[59]Up to 1.91x higher performance with the new 3rd Gen Intel Xeon Scalable Platform featuring gen 4 Intel DC P5510 SSD for Video-On-Demand CDN use case vs. prior generation Intel Xeon® Scalable platform supporting Intel® P4510 SSDs 3rd Generation Intel® Xeon® Platinum processor 1.91x CDN-Video-on-Demand with Intel SSD: New: 1 node, 2x Intel® Xeon® Platinum 8380 Processor, 40 core HT ON Turbo ON, Total Memory 256GB (16 slots/16GB/2666MT/s), BIOS SE5C6200.86B.2021.D40.2103100308  (ucode: 0x261), 8x Intel® P5510, 4x Intel® E810, Ubuntu 20.04, kernel 5.4.0-65-generic, gcc 9.3.0 compiler, openssl 1.1.1h, varnish-plus-6.0.7r2 revision eab14f54182a8cfe32e7db037050f246740452d8 wrk master 4/17/2019, (keep alive on, 512 total connections)  Test by Intel as of 3/17/2021. Baseline : 1 node, 2x Intel® Xeon® Gold 6258R Processor, 28 core HT ON Turbo ON, Total Memory 192GB (12 slots/16GB/2666MT/s), BIOS Dell 2.10.0 (ucode: 0x5003003), 10x Intel® P4510, 2x Intel® E810, Ubuntu 20.04, kernel 5.4.0-65-generic, gcc 9.3.0 compiler, openssl 1.1.1h, varnish-plus-6.0.7r2 revision eab14f54182a8cfe32e7db037050f246740452d8., wrk master 4/17/2019, (keep alive on, 512 total connections) Test by Intel as of 2/15/2021. Throughput measured with 100% Transport Layer Security (TLS) traffic with 100% target cache hit ratio. CDN-Video-on-Demand with Varnish plus New : March 17, 2021

Baseline: February 15, 2021

[58]Up to 1.72x higher performance with the new 3rd Gen Intel Xeon Scalable Platform supporting Intel® Optane™ PMEM 200 Series for CDN Live use case vs. prior generation Intel Xeon® Scalable platform supporting Intel® Total Optane Persistent Memory 100 Series 3rd Generation Intel® Xeon® Platinum processor 1.72x CDN-Live with Intel PMEM: New: 1 node 2x Intel® Xeon® Platinum 8380 Processor, 40 core HT ON Turbo ON, Total DRAM 256GB (16 slots/16GB/2666MT/s), Total Optane Persistent Memory 200 Series 2048GB (16 slots/128GB/2666MT/s), BIOS SE5C6200.86B.2021.D40.2103100308 (ucode: 0x261), 4x Intel® E810, Ubuntu 20.04, kernel 5.4.0-65-generic, gcc 9.3.0 compiler, openssl 1.1.1h, varnish-plus 6.0.7r2. Test by Intel as of 3/17/2021. (keep alive off, 512 total connections), Baseline:1 node, 2x Intel® Xeon® Gold 6258R Processor, 28 core HT ON Turbo ON, Total DRAM 192GB (12 slots/16GB/2666MT/s), Total Optane Persistent Memory 100 Series 1536GB (12 slots/128GB/2666MT/s), BIOS Dell 2.10.0 (ucode: 0x5003003), 2x Intel® E810, Ubuntu 20.04, kernel 5.4.0-65-generic, gcc 9.3.0 compiler, openssl 1.1.1h, varnish-plus-6.0.7r2 revision eab14f54182a8cfe32e7db037050f246740452d8., (keep alive off, 512 total connections), wrk master 4/17/2019,Test by Intel as of 2/15/2021. Throughput measured with 100% Transport Layer Security (TLS) traffic with 93.3% target cache hit ratio. CDN-Live Linear with Varnish plus New : March 17, 2021

Baseline: February 15, 2021

[55] Federated Learning Training: Penn's 3DResUnet tumor segmentation model - 11% accuracy improvement detecting tumor boundaries using a model trained with data from 23 hospitals over a single hospitals data. 3rd Generation Intel® Xeon® Platinum processor Federated Learning Training: Penn's 3DResUnet tumor segmentation model :This demo uses data from the public BraTS data set. Led by Perelman School of Medicine, University of Pennsylvania Federated Tumor Segmentation (FeTS) project deployed to a total of 23 locations representing 29 institutions' data (of 64 committed), with 1653/9000 patient data samples. UPenn successfully deploy batch Graphene-SGX protected OpenFL workloads to the 3rd Gen Intel® Xeon® Scalable servers based HPC nodes using their existing job management infrastructure, enabling access to the medical datasets needed for their contributions to the FeTS project. Training of Penn's 3DResUnet tumor segmentation model yields results demonstrating following: i) The validation score of the model pretrained on a small dataset dropped when validated against the larger federation validation dataset (from 0.759 to 0.724). ii) Once trained on the federation training dataset, the validation score increases and surpasses the original model (from 0.724 to 0.805, an improvement of 11.18%). Demo: Privacy Preserving Analytics New : March 19, 2021

Baseline: March 19, 2021

[54] Up to 4.23X increase in image per second - Tencent PRNet Model on Intel-Tensorflow 2.4.0 Throughput Performance on 3nd Generation Intel® Xeon® Processor Scalable Family .

Up to 5.13x increase in connections per second - Tencent TGW NGINX TLS1.2 Webserver Connection-Per-Second Performance on 3nd Generation Intel® Xeon® Processor Scalable Family

3rd Generation Intel® Xeon® Platinum processor Tencent PRNet Model :

New: Test by Intel as of 03/19/2021. 2-node, 2x 3rd Gen Intel® Xeon® Scalable Processor, 36 cores HT On Turbo ON Total Memory 256 GB (16 slots/ 16GB/ 3200 MHz), BIOS: SE5C6200.86B.3020.P19.2103170131 (ucode: 0x8d05a260), CentOS 8.3, 4.18.0-240.1.1.el8_​3.x86_​64, gcc 8.3.1 compiler, PRNet Model, Deep Learning Framework: Intel-Tensorflow 2.4.0, https://github.com/Intel-tensorflow/tensorflow/releases/tag/v2.4.0, BS=1, Dummy Data, 18 instances/2 sockets, Datatype: FP32/INT8

Baseline: Test by Intel as of 03/19/2021. 2-node, 2x 2nd Gen Intel® Xeon® Scalable Processor, 24 cores HT On Turbo ON Total Memory 192 GB (12 slots/ 16GB/ 2933 MHz), BIOS: SE5C620.86B.0D.01.0438.032620191658(ucode:0x5003003), CentOS 8.3, 4.18.0-240.10.1.el8_​3.x86_​64, gcc 8.3.1 compiler, PRNet Model, Deep Learning Framework: Intel-Tensorflow 2.4.0, https://github.com/Intel-tensorflow/tensorflow/releases/tag/v2.4.0, BS=1, Dummy Data, 12 instances/2 sockets, Datatype: FP32/INT8

Tencent TGW:

New: Test by Intel as of 3/19/2021. 1-node, 2x 3rd Gen Intel® Xeon® Scalable Processor, 36 cores HT On Turbo ON Total Memory 256 GB (16 slots/ 16GB/ 3200 MHz), BIOS: SE5C6200.86B.3020.P19.2103170131 (ucode: 0x8d05a260), CentOS 8.3, 4.18.0-240.1.1.el8_​3.x86_​64, gcc 8.3.1 compiler, NGINX 1.18, OpenSSL 1.1.1f, QAT Engine 0.6.4, Ipp Crypto MB 2020 update3

Baseline: Test by Intel as of 3/13/2021. 1-node, 2x 2nd Gen Intel® Xeon® Scalable Processor, 24 cores HT On Turbo ON Total Memory 192 GB (12 slots/ 16GB/ 2933 MHz), BIOS: SE5C620.86B.02.01.0013.121520200651 (ucode:0x5003003), CentOS 8.3, 4.18.0-240.10.1.el8_​3.x86_​64, gcc 8.3.1 compiler, NGINX 1.18, OpenSSL 1.1.1f

Demo: Performance Made Flexible - Tensent New : March 19, 2021

Baseline: March 19, 2021

[53] Up to 34% overall gen to gen improvement in Images Per Second processed with Intel® 3rd Gen Xeon® Scalable processors 3rd Generation Intel® Xeon® Platinum processor Claro360 social_​distance_​V1.0, Person-detection-retail-0013 (INT8): New: Test by Intel as of 03/19/2021. 1-node, 2x Intel® Xeon® Platinum 8380 Processor, 40 cores HT On Turbo ON Total Memory 512 GB (16 slots/ 32GB/ 3200 MT/s), BIOS: BIOS: SE5C6200.86B.3021.D40.2103160200 (ucode:0x261), Ubuntu 18.04.5 LTS, 5.4.0-66-generic, claro360 workload not public, score=3659ips Baseline: Test by Intel as of 3/19/2021. 1-node, 2x Intel® Xeon® Platinum 8280 Processor, 28 cores HT On Turbo ON Total Memory 384 GB (12 slots/ 32GB/ 2933 MT/s), BIOS: SE5C620.86B.02.01.0013.121520200651 (ucode:0x5003003), Ubuntu 18.04.5 LTS, 5.4.0-66-generic, claro360 workload not public, score=2716ips Demo: Pandemic Safety Solution New : March 19, 2021

Baseline: March 19, 2021

[51] Delivers up to 50% performance increase and 31% total cost reduction with Intel Xeon Platinum 8360Y vs. the prev gen Intel Xeon Gold 5218 3rd Generation Intel® Xeon® Platinum processor Lightbits FIO: New: (3rd Gen Intel Xeon): Test by Lightbits as of 3/23/2021. 5-node, Intel® Xeon® Platinum 8360Y Processor, 36 cores, Utilized: 24 cores, HT On Turbo ON Total Memory 2560 GB (16 slots/ 32GB/ 3200 MHz, 16 slots/ DCPMM 128GB/ 2666 MHz), BIOS: SE5C6200.86B.3021.D40.2103160200, CentOS 7.8, 4.14.216-41421769bde239058b6e-rel-lb, fio-3.1 Baseline(CLX): Test by Lightbits as of 3/23/2021. 8-node, Intel® Xeon® Gold 5218 Processor, 16 cores, Utilized: 16 cores, HT On Turbo ON Total Memory 704 GB (20 slots/ 32GB/ 2666 MHz, 4 slots/ NVDIMM 16GB/ 2666 MHz), BIOS: 3.3a, CentOS 7.8, 4.14.216-41421769bde239058b6e-rel-lb, fio-3.1

Intel® Optane™ persistent memory pricing & DRAM pricing referenced in TCO calculations is provided for guidance and planning purposes only and does not constitute a final offer. Pricing guidance is subject to change and may revise up or down based on market dynamics. Please contact your OEM/distributor for actual pricing. Pricing guidance as of March, 2021.

Demo: Built for the Cloud : Lightbits FIO New : March 23, 2021

Baseline: March 23, 2021

[45] 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost INT8 delivers up to 25x better inference throughput vs. AMD Milan FP32 across a diverse set of AI workloads that include Image Classification, Object Detection, Natural Language Processing and Image Recognition 3rd Generation Intel® Xeon® Platinum processor Up to 25x higher AI performance with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC 7763 (64C Milan):

New: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X55260, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, MobileNet-v1, gcc-9.3.0, oneDNN 1.6.4, BS=1,56, INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, test by Intel on March 2021.

AMD:1-node, 2x AMD Epyc 7763 on GigaByte with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xa001114, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Samsung_​MZ7LH3T8, MobileNet-v1, gcc-9.3.0, oneDNN 1.6.4, BS=1,56, FP32, TensorFlow- 2.4.1, Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/benchmarks/image_recognition/tensorflow/mobilenet_v1, tested by Intel and results as of March 2021.

MobileNet-v1 (Up to 25x better inference performance) New: March 2021

Baseline: March 2021

[44] 1.3x higher AI performance with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. NVIDIA A100 (geomean of 20 workloads including logistic regression inference, logistic regression fit, ridge regression inference, ridge regression fit, linear regression inference, linear regression fit, elastic net inference, XGBoost Fit, XGBoost predict, SSD-ResNet34 inference, Resnet50-v1.5 inference, Resnet50-v1.5 training, BERT Large SQuaD inference, kmeans inference, kmeans fit, brute_​knn inference, SVC inference, SVC fit, dbscan fit, traintestsplit) 3rd Generation Intel® Xeon® Platinum processor 1.3x higher AI performance with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. NVIDIA A100 GPU: (geomean of 20 workloads including logistic regression inference, logistic regression fit, ridge regression inference, ridge regression fit, linear regression inference, linear regression fit, elastic net inference, XGBoost Fit, XGBoost predict, SSD-ResNet34 inference, Resnet50-v1.5 inference, Resnet50-v1.5 training, BERT Large SQuaD inference, kmeans inference, kmeans fit, brute_​knn inference, SVC inference, SVC fit, dbscan fit, traintestsplit)

3rd Gen Intel Xeon: 8380: 1-node, 2x Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X55260, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, tested by Intel, and results as of March 2021.

DL Measurements on A100: 1-node, 2-socket AMD EPYC 7742 (64C) with 256GB (8 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x8301038, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-42-generic, INTEL SSDSC2KB01, NVIDIA A100-PCIe-40GB, HBM2-40GB, Accelerator per node =1, tested by Intel, and results as of March 2021.

ML Measurements on A100 : 1-node, 2-socket AMD EPYC 7742 (64C) with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x8301034, HT on, Turbo on, Ubuntu 18.04.5 LTS, 5.4.0-42-generic,NVIDIA A100 (DGX-1), 1.92TB M.2 NVMe, 1.92TB M.2 NVMe RAID tested by Intel, and results as of March 2021.

ResNet50-v1.5 Intel : gcc-9.3.0, oneDNN 1.6.4, BS=1, INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart

ResNet50-v1.5 NVIDIA :A100 (7 instance/GPU), BS=1,TensorFlow - 1.5.5 (NGC: tensorflow:21.02-tf1-py3), https://github.com/NVIDIA/DeepLearningExamples/tree/master/TensorFlow/Classification/ConvNets/resnet50v1.5, TF AMP (FP16+TF32);

ResNet50-v1.5 Training Intel : gcc-9.3.0, oneDNN 1.6.4, BS=256, FP32, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart.

ResNet50-v1.5 Training NVIDIA :A100, BS=256,TensorFlow - 1.5.5 (NGC: tensorflow:21.02-tf1-py3),https://github.com/NVIDIA/DeepLearningExamples/tree/master/TensorFlow/Classification/ConvNets/resnet50v1.5, TF32; BERT-Large SQuAD Intel : gcc-9.3.0, oneDNN 1.6.4, BS=1, INT8,

TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/ A100 : BERT-Large SQuAD, BS=1, A100 (7 instance/GPU), TensorFlow - 1.5.5 (NGC: tensorflow:20.11-tf1-py3), https://github.com/NVIDIA/DeepLearningExamples/tree/master/TensorFlow/LanguageModeling/BERT,TF AMP (FP16+TF32) ; SSD-ResNet34 Intel : gcc-9.3.0, oneDNN 1.6.4, BS=1, INT8,

TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, SSD-ResNet34 NVIDIA :A100 (7 instance/GPU), BS=1,Pytorch - 1.8.0a0 (NGC Container, latest supported): A100 : SSD-ResNet34 (NGC: pytorch:20.11-py3), https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/Detection/SSD, AMP (FP16 +TF32) ; Python : Intel: Python 3.7.9, Scikit-Learn : Sklearn 0.24.1, OneDAL : Daal4py 2021.2, XGBoost: XGBoost 1.3.3 Python : NVIDIA A100 : Python 3.7.9, Scikit-Learn : Sklearn 0.24.1, CuML 0.17, XGBoost 1.3.0dev.rapidsai0.17, Nvidia RAPIDS : RAPIDS 0.17, CUDA Toolkit : CUDA 11.0.221 Benchmarks: https://github.com/IntelPython/scikit-learn_bench

geomean of 20 workloads including logistic regression inference, logistic regression fit, ridge regression inference, ridge regression fit, linear regression inference, linear regression fit, elastic net inference, XGBoost Fit, XGBoost predict, SSD-ResNet34 inference, Resnet50-v1.5 inference, Resnet50-v1.5 training, BERT Large SQuaD inference, kmeans inference, kmeans fit, brute_​knn inference, SVC inference, SVC fit, dbscan fit, traintestsplit New: March 2021

Baseline: March 2021

[43] 1.5x higher AI performance with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan (geomean of 20 workloads including logistic regression inference, logistic regression fit, ridge regression inference, ridge regression fit, linear regression inference, linear regression fit, elastic net inference, XGBoost Fit, XGBoost predict, SSD-ResNet34 inference, Resnet50-v1.5 inference, Resnet50-v1.5 training, BERT Large SQuaD inference, kmeans inference, kmeans fit, brute_​knn inference, SVC inference, SVC fit, dbscan fit, traintestsplit) 3rd Generation Intel® Xeon® Platinum processor 1.5x higher AI performance with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC 7763 (64C Milan): (geomean of 20 workloads including logistic regression inference, logistic regression fit, ridge regression inference, ridge regression fit, linear regression inference, linear regression fit, elastic net inference, XGBoost Fit, XGBoost predict, SSD-ResNet34 inference, Resnet50-v1.5 inference, Resnet50-v1.5 training, BERT Large SQuaD inference, kmeans inference, kmeans fit, brute_​knn inference, SVC inference, SVC fit, dbscan fit, traintestsplit)

3rd Gen Intel Xeon: 8380: 1-node, 2x Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X55260, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic/5.4.0-64-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, tested by Intel, and results as of March 2021.

AMD: 7763: 1-node, 2-socket AMD EPYC 7763 (64C/2.45GHz, 280W cTDP) on GIGABYTE R282-Z92 server with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xa001114, SMT on, Boost on, Power deterministic mode, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Samsung_​MZ7LH3T8/INTEL SSDSC2KG019T8, tested by Intel, and results as of March 2021.

ResNet50-v1.5 Intel : gcc-9.3.0, oneDNN 1.6.4, BS=128, INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart, ResNet50-v1.5 AMD : gcc-9.3.0, oneDNN 1.6.4, BS=128, FP32, TensorFlow- 2.4.1, Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/benchmarks/image_recognition/tensorflow/resnet50v1_5

ResNet50-v1.5 Training Intel : gcc-9.3.0, oneDNN 1.6.4, BS=256, FP32, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart, ResNet50-v1.5 Training AMD : gcc-9.3.0, oneDNN 1.6.4, BS=256, FP32, TensorFlow- 2.4.1, Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/benchmarks/image_recognition/tensorflow/resnet50v1_5 SSD-ResNet34 Intel : gcc-9.3.0, oneDNN 1.6.4, BS=1, INT8,

TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, AMD : SSD-ResNet34, gcc-9.3.0, oneDNN 1.6.4, BS=1, FP32, TensorFlow- 2.4, Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/benchmarks/object_detection/tensorflow/ssd-resnet34 BERT-Large SQuAD Intel : gcc-9.3.0, oneDNN 1.6.4, BS=1, INT8,

TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, AMD : BERT-Large SQuAD, gcc-9.3.0, oneDNN 1.6.4, BS=1, FP32, TensorFlow- 2.4.1, Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/benchmarks/language_modeling/tensorflow/bert_large Python : Python 3.7.9, SciKit-Learn : Sklearn 0.24.1, oneDAL : Daal4py 2021.2, XGBoost : XGBoost 1.3.3 : Benchmarks: https://github.com/IntelPython/scikit-learn_bench

geomean of 20 workloads including logistic regression inference, logistic regression fit, ridge regression inference, ridge regression fit, linear regression inference, linear regression fit, elastic net inference, XGBoost Fit, XGBoost predict, Mobilenet-v1 inference, Resnet50-v1.5 inference, Resnet50-v1.5 training, BERT Large SQuaD inference, kmeans inference, kmeans fit, brute_​knn inference, SVC inference, SVC fit, dbscan fit, traintestsplit New: March 2021

Baseline: March 2021

[42] With Intel® 3rd Gen Xeon® Scalable processors and the latest Intel® Optane™ Persistent Memory you can get up to 63% higher throughput and 33% more memory capacity, enabling you to serve the same number of subscribers at higher resolution or a greater number of subscribers at the same resolution.

With Intel® 3rd Gen Xeon® Scalable processors, CoSP's can increase 5G UPF performance by 42%. Combined with Intel Ethernet 800 series adapters, they can deliver the performance, efficiency and trust for use cases that require low latency, including augmented reality, cloud-based gaming, discrete automation and even robotic-aided surgery.

With Intel® 3rd Gen Intel® Xeon® Scalable processors, Ethernet 800 series and vRAN dedicated accelerators, CoSP's can get up to 1.81x MIMO Midhaul Throughput in a similar power envelope for a best-in-class 3x100mhz 64T64R configuration.

3rd Generation Intel® Xeon® Platinum processor 1.63x CDN-Live Linear: New: 1 node, 2x Intel® Xeon® Gold 6338N Processor, 32 core HT ON Turbo ON, Total DRAM 256GB (16 slots/16GB/2666MT/s), Total Optane Persistent Memory 200 Series 2048GB (16 slots/128GB/2666MT/s), BIOS SE5C6200.86B.2021.D40.2103100308 (ucode: 0x261), 4x Intel® E810, Ubuntu 20.04, kernel 5.4.0-65-generic, gcc 9.3.0 compiler, openssl 1.1.1h, varnish-plus 6.0.7r2. 2 clients, Test by Intel as of 3/11/2021. Baseline: Gold 6252N: 2x Intel® Xeon® Gold 6252N Processor, 24 core HT ON Turbo ON, Total DRAM 192GB (12 slots/16GB/2666MT/s), Total Optane Persistent Memory 100 Series 1536GB(12 slots/128GB/2666MT/s), 1x Mellanox MCX516A-CCAT, BIOS: SE5C620.86B.02.01.0013.121520200651 (ucode: 0x5003003), Ubuntu 20.04, kernel 5.4.0-65-generic, wrk master 4/17/2019. Test by Intel as of 2/15/2021. Throughput measured with 100% Transport Layer Security (TLS) traffic with 93.3% target cache hit ratio and keep alive on, 512 total connections.

1.42x 5G UPF : New: 1-node, 2(1 socket used)x 3rd Gen Intel Xeon Gold 6338N on Whitley Coyote Pass 2U with 128 GB (8 slots/ 16GB/ 2666) total DDR4 memory, ucode 0x261, HT on, Turbo off, Ubuntu 18.04.5 LTS, 4.15.0-134-generic, 1x Intel 810 (Columbiaville), FlexCore 5G UPF, Jan' 2021 MD5 checksum: c4ad7f8422298ceb69d01e67419ff1c1, GCC 7.5.0, 5G UPF228 Gbps / 294 Gbps, test by Intel on 3/16/2021. Baseline: 1-node, 2(1 socket used)x Intel Xeon Gold 6252N on SuperMicro* X11DPG-QT with 96 GB (6 slots/ 16GB/ 2934) total DDR4 memory, ucode 0x5003003, HT on, Turbo off, Ubuntu 18.04.5 LTS, 4.15.0-132-generic, 1x Intel 810 (Columbiaville), FlexCore 5G UPF, Jan' 2021 MD5 checksum: c4ad7f8422298ceb69d01e67419ff1c1, GCC 7.5.0, 5G UPF161 Gbps / 213 Gbps, test by Intel on 2/12/2021.

FleXRAN : New: 1 node, 1 socket Intel Xeon Gold 6338N Processor, 32 core HT ON Turbo ON, Total DRAM 128 GB (8 slots/16GB/2666), BIOS WLYDCRB.SYS.WR.64.2021.09.4.04.0636_​0020.P86_​P80260_​LBG_​SPS_​8d055260_​EARLYG (ucode 0x261), Intel Mount Bryce (ACC100), CentOS 7.8.2003, 3.10.0-1127.19.1.rt56.1116.el7.x86_​64, FlexRAN L1 Massive MIMO, tested by Intel on 3/18/2021. Baseline: 1 node, 1 socket Intel Xeon Gold 6212U Processor, 24 core HT ON Turbo ON, Total Dram 96 GB (6 slots/16GB/2993), BIOS SE5C620.86B.02.01.0012.070720200218, Intel Mount Bryce (ACC100), CentOS 7.8.2003, 3.10.0-1127.19.1.rt56.1116.el7.x86_​64, FlexRAN L1 Massive MIMO, tested by Intel on 2/23/2021.

Demo: VRAN, 5G UPF, CDN-Live Linear New : March 11, 2021

Baseline: February 15, 2021

[40] 3.20x higher OpenSSL RSA Sign 2048 performance 3rd Gen Intel® Xeon® Scalable vs. AMD Milan

2.03x higher OpenSSL ECDHE x25519 performance 3rd Gen Intel® Xeon® Scalable vs. AMD Milan

3rd Generation Intel® Xeon® Platinum processor 3.20x higher OpenSSL RSA Sign 2048 performance

2.03x higher OpenSSL ECDHE x25519 performance

3rd Gen Intel Xeon: 8380 : 1-node, 2x Intel® Xeon® Platinum 8380 CPU on M50CYP2SB2U with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xd000270, HT On, Turbo Off, Ubuntu 20.04.1 LTS, 5.4.0-65-generic, 1x INTEL_​SSDSC2KG01, OpenSSL 1.1.1j, GCC 9.3.0, QAT Engine v0.6.4, Tested by Intel and results as of March 2021.

AMD: 7763 : 1-node, 2x AMD EPYC 7763 64-Core Processor on R282-Z92-00 with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xa001114, HT On, Turbo Off, Ubuntu 20.04.1 LTS, 5.4.0-65-generic, 1x SAMSUNG_​MZ7LH3T8, OpenSSL 1.1.1j, GCC 9.3.0, Tested by Intel and results as of March 2021.

OpenSSL New: March 2021

Baseline: March 2021

[39] 1.18x higher LINPACK performance with 3rd Gen Intel® Xeon® Scalable vs. AMD Milan 3rd Generation Intel® Xeon® Platinum processor 1.18x higher performance on LINPACK

3rd Gen Intel Xeon: 8380: 1-node, 2x Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: The Intel Distribution for LINPACK Benchmark; Build notes: Tools: Intel MPI 2019u7; threads/core: 1; Turbo: used; Build: build script from Intel Distribution for LINPACK package; 1 rank per NUMA node: 1 rank per socket

AMD: 7763: 1-node, 2-socket AMD EPYC 7763 (64C/2.45GHz, 280W cTDP) on GIGABYTE R282-Z92 server with 512 GB (16 slots/ 32GB/3200) total DDR4 memory, ucode 0xa001114, SMT on, Boost on, Power deterministic mode, NPS=4, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Samsung_​MZ7LH3T8, App Version: AMD official HPL 2.3 MT version with BLIS 2.1; Build notes: Tools: hpc-x 2.7.0; threads/core: 1; Turbo: used; Build: pre-built binary (gcc built) from https://developer.amd.com/amd-aocl/blas-library/; 1 rank per L3 cache, 4 threads per rank Tested by Intel and results as of March 2021

LINPACK New: March 2021

Baseline: March 2021

[38] 1.32x higher RELION performance with 3rd Gen Intel® Xeon® Scalable vs. AMD Milan 3rd Generation Intel® Xeon® Platinum processor 1.32x higher performance on RELION Plasmodium Ribosome

3rd Gen Intel Xeon: 8380: 1-node, 2x 3rd Gen Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: 3_​1_​1; Build notes: Tools: Intel C Compiler 2020u4, Intel MPI 2019u9; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -g -debug inline-debug-info -xCOMMON-AVX512 -qopt-report=5 -restrict

AMD: 7763: 1-node, 2-socket AMD EPYC 7763 (64C/2.45GHz, 280W cTDP) on GIGABYTE R282-Z92 server with 512 GB (16 slots/ 32GB/3200) total DDR4 memory, ucode 0xa001114, SMT on, Boost on, Power deterministic mode, NPS=4, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Samsung_​MZ7LH3T8, App Version: 3_​1_​1; Build notes: Tools: Intel C Compiler 2020u4, Intel MPI 2019u9; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -g -debug inline-debug-info -march=core-avx2 -qopt-report=5 -restrict Tested by Intel and results as of March 2021

RELION Plasmodium Ribosome New: March 2021

Baseline: March 2021

[37] 1.50x higher Monte Carlo FSI performance with 3rd Gen Intel® Xeon® Scalable vs. AMD Milan 3rd Generation Intel® Xeon® Platinum processor 1.50x higher performance on Monte Carlo FSI Kernel

3rd Gen Intel Xeon:8380: 1-node, 2x 3rd Gen Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: v1.1; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4; threads/core: 1; Turbo: used; Build knobs: -O3 -xCORE-AVX512 -qopt-zmm-usage=high -fimf-precision=low -fimf-domain-exclusion=31 -no-prec-div -no-prec-sqrt

AMD: 7763: 1-node, 2-socket AMD EPYC 7763 (64C/2.45GHz, 280W cTDP) on GIGABYTE R282-Z92 server with 512 GB (16 slots/ 32GB/3200) total DDR4 memory, ucode 0xa001114, SMT on, Boost on, Power deterministic mode, NPS=4, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Samsung_​MZ7LH3T8, App Version: v1.1; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4; threads/core: 2; Turbo: used; Build knobs: -O3 -march=core-avx2 -fimf-precision=low -fimf-domain-exclusion=31 -no-prec-div -no-prec-sqrt Tested by Intel and results as of March 2021

Monte Carlo FSI Kernel New: March 2021

Baseline: March 2021

[36] 1.27x higher NAMD performance on 3rd Gen Intel® Xeon® Scalable vs. AMD Milan 3rd Generation Intel® Xeon® Platinum processor 1.27x higher performance on NAMD STMV 1.27x higher performance on NAMD (geomean of Apoa1, STMV)

3rd Gen Intel Xeon: 8380: 1-node, 2x 3rd Gen Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: 2.15-Alpha1 (includes AVX tiles algorithm); Build notes: Tools: Intel MKL, Intel C Compiler 2020u4, Intel MPI 2019u8, Intel Threading Building Blocks 2020u4; threads/core: 2; Turbo: used; Build knobs: -ip -fp-model fast=2 -no-prec-div -qoverride-limits -qopenmp-simd -O3 -xCORE-AVX512 -qopt-zmm-usage=high

AMD: 7763: 1-node, 2x AMD EPYC 7763 (64C/2.45GHz, 280W cTDP) on GIGABYTE R282-Z92 server with 512 GB (16 slots/ 32GB/3200) total DDR4 memory, ucode 0xa001114, SMT on, Boost on, Power deterministic mode, NPS=4, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Samsung_​MZ7LH3T8, App Version: 2.15-Alpha1 (includes AVX tiles algorithm); Build notes: Tools: Intel MKL, AOCC 2.2.0, gcc 9.3.0, Intel MPI 2019u8; threads/core: 2; Turbo: used; Build knobs: -O3 -fomit-frame-pointer -march=znver1 -ffast-math Tested by Intel and results as of March 2021

NAMD New: March 2021

Baseline: March 2021

[35] 4.5x higher INT8 real-time inference throughput on SSD-ResNet34 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan 3rd Generation Intel® Xeon® Platinum processor 4.5x higher INT8 real-time inference throughput on SSD-ResNet34 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan

3rd Gen Intel Xeon:8380: 1-node, 2x 3rd Gen Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X55260, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, SSD-ResNet34, gcc-9.3.0, oneDNN 1.6.4, BS=1 INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, tested by Intel, and results as of March 2021.

AMD: 7763: 1-node, 2-socket AMD EPYC 7763 (64C/2.45GHz, 280W cTDP) on GIGABYTE R282-Z92 server with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xa001114, SMT on, Boost on, Power deterministic mode, NPS=1, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Samsung_​MZ7LH3T8, SSD-ResNet34, gcc-9.3.0, oneDNN 1.6.4, BS=1 FP32, TensorFlow- 2.4.1, Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/benchmarks/object_detection/tensorflow/ssd-resnet34, tested by Intel, and results as of March 2021.

SSD-ResNet34 New: March 2021

Baseline: March 2021

[34] 3.18x higher INT8 real-time inference throughput & 2.17x higher INT8 batch inference throughput on BERT Large SQuAD with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan 3rd Generation Intel® Xeon® Platinum processor 3.18x higher INT8 real-time inference throughput & 2.17x higher INT8 batch inference throughput on BERT Large SQuAD with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan

3rd Gen Intel Xeon: 8380: 1-node, 2x Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X55260, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, BERT Large SQuAD, gcc-9.3.0, oneDNN 1.6.4, BS=1,128, INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, tested by Intel, and results as of March 2021.

AMD: 7763: 1-node, 2-socket AMD EPYC 7763 (64C/2.45GHz, 280W cTDP) on GIGABYTE R282-Z92 server with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xa001114, SMT on, Boost on, Power deterministic mode, NPS=1, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Samsung_​MZ7LH3T8, BERT Large SQuAD, gcc-9.3.0, oneDNN 1.6.4, BS=1,128, FP32, TensorFlow- 2.4.1, Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/benchmarks/language_modeling/tensorflow/bert_large, tested by Intel, and results as of March 2021.

BERT-Large SQuAD New: March 2021

Baseline: March 2021

[33] 4.01x higher INT8 real-time inference throughput & 25.05x higher INT8 batch inference throughput on MobileNet-v1 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan 3rd Generation Intel® Xeon® Platinum processor 4.01x higher INT8 real-time inference throughput & 25.05x higher INT8 batch inference throughput on MobileNet-v1 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan

3rd Gen Intel Xeon: 8380: 1-node, 2x Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X55260, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, MobileNet-v1, gcc-9.3.0, oneDNN 1.6.4, BS=1,56, INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, tested by Intel, and results as of March 2021.

AMD: 7763: 1-node, 2-socket AMD EPYC 7763 (64C/2.45GHz, 280W cTDP) on GIGABYTE R282-Z92 server with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xa001114, SMT on, Boost on, Power deterministic mode, NPS=1, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Samsung_​MZ7LH3T8, MobileNet-v1, gcc-9.3.0, oneDNN 1.6.4, BS=1, 56, FP32, TensorFlow- 2.4.1, Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/benchmarks/image_recognition/tensorflow/mobilenet_v1, tested by Intel, and results as of March 2021.

MobileNet-v1 New: March 2021

Baseline: March 2021

[32] 2.79x higher INT8 real-time inference throughput & 12x higher INT8 batch inference throughput on SSD-MobileNet-v1 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan 3rd Generation Intel® Xeon® Platinum processor 2.79x higher INT8 real-time inference throughput & 12x higher INT8 batch inference throughput on SSD-MobileNet-v1 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan

3rd Gen Intel Xeon: 8380: 1-node, 2x 3rd Gen Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X55260, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, SSD-MobileNet-v1, gcc-9.3.0, oneDNN 1.6.4, BS=1,448, INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, tested by Intel, and results as of March 2021.

AMD: 7763: 1-node, 2-socket AMD EPYC 7763 (64C/2.45GHz, 280W cTDP) on GIGABYTE R282-Z92 server with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xa001114, SMT on, Boost on, Power deterministic mode, NPS=1, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Samsung_​MZ7LH3T8, SSD-MobileNet-v1, gcc-9.3.0, oneDNN 1.6.4, BS=1,448, FP32, TensorFlow- 2.4.1, Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/benchmarks/object_detection/tensorflow/ssd-mobilenet, tested by Intel, and results as of March 2021.

SSD-MobileNet-v1 New: March 2021

Baseline: March 2021

[31] 3.88x higher INT8 real-time inference throughput & 22.09x higher INT8 batch inference throughput on ResNet-50 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan 3rd Generation Intel® Xeon® Platinum processor 3.88x higher INT8 real-time inference throughput & 22.09x higher INT8 batch inference throughput on ResNet-50 with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost vs. FP32 AMD EPYC Milan .

3rd Gen Intel Xeon: 8380: 1-node, 2x 3rd Gen Intel Xeon Platinum 8380 (40C/2.3GHz, 270W TDP) processor on Intel Software Development Platform with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X55260, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, ResNet50-v1.5, gcc-9.3.0, oneDNN 1.6.4, BS=1,128, INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, tested by Intel, and results as of March 2021.

AMD:7763: 1-node, 2-socket AMD EPYC 7763 (64C/2.45GHz, 280W cTDP) on GIGABYTE R282-Z92 server with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0xa001114, SMT on, Boost on, Power deterministic mode, NPS=1, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Samsung_​MZ7LH3T8, ResNet50-v1.5, gcc-9.3.0, oneDNN 1.6.4, BS=1,128, FP32, TensorFlow- 2.4.1, Model : https://github.com/IntelAI/models/tree/icx-launch-public/benchmarks/image_recognition/tensorflow/resnet50v1_5, tested by Intel, and results as of March 2021.

ResNet-50 New: March 2021

Baseline: March 2021

[25] 1.54x average performance gains with 3rd Gen Intel Xeon Platinum 8380 processor vs legacy Xeon Platinum 8180 server

2.65x average performance gains with 3rd Gen Intel Xeon Platinum 8380 processor vs legacy E5-v4 server

3.1x average performance gains with 3rd Gen Intel Xeon Platinum 8380 processor vs legacy E5-v3 server

3rd Generation Intel® Xeon® Platinum processor 1.54x average performance gain - Ice Lake vs Skylake: Geomean of 1.6x SPECrate2017_​int_​base (est), 1.62x SPECrate2017_​fp_​base (est), 1.52x Stream Triad, 1.44x Intel distribution of LINPACK.

2.65x average performance gain - Ice Lake vs Broadwell: Geomean of 2.34x SPECrate2017_​int_​base (est), 2.6x SPECrate2017_​fp_​base (est), 2.55x Stream Triad, 3.18x Intel distribution of LINPACK.

3.1x average performance gain - Ice Lake vs Haswell: Geomean of 2.85x SPECrate2017_​int_​base (est), 3.08x SPECrate2017_​fp_​base (est), 2.8x Stream Triad, 3.97x Intel distribution of LINPACK.

New: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on (SPECcpu2017), off (others), Turbo on, Ubuntu 20.04, 5.4.0-66-generic, 1x S4610 SSD 960G, SPECcpu2017 (est) v1.1.0, Stream Triad, Linpack, ic19.1u2, MPI: Version 2019u9; MKL:2020.4.17, test by Intel on 3/15/2021.

Skylake Baseline: 1-node, 2x Intel Xeon Platinum 8180 processor on Wolf Pass with 192 GB (12 slots/ 16GB/ 2933[2666]) total DDR4 memory, ucode 0x2006a08, HT on (SPECcpu2017), off (others), Turbo on, Ubuntu 20.04, 5.4.0-62-generic, SPECcpu2017 (est) v1.1.0, Stream Triad, Intel distribution of LINPACK, ic19.1u2, MPI: Version 2019 Update 9 Build 20200923; MKL: psxe_​runtime_​2020.4.17, test by Intel on 1/27/21.

Broadwell Baseline: 1-node, 2x Intel Xeon processor E5-2699v4 on Wildcat Pass with 256 GB (8 slots/ 32GB/ 2400) total DDR4 memory, ucode 0x038, HT on (SPECcpu2017), off (others), Turbo on, Ubuntu 20.04, 5.4.0-62-generic, 1x S3700 400GB SSD, SPECcpu2017 (est) v1.1.0, Stream Triad, Intel distribution of LINPACK, ic19.1u2, MPI: Version 2019 Update 9 Build 20200923; MKL: psxe_​runtime_​2020.4.17, test by Intel on 1/17/21.

Haswell Baseline: 1-node, 2x Intel Xeon processor E5-2699v3 on Wildcat Pass with 256 GB (8 slots/ 32GB/ 2666[2133]) total DDR4 memory, ucode 0x44, HT on (SPECcpu2017), off (others), Turbo on, Ubuntu 20.04, 5.4.0-62-generic, 1x S3700 400GB SSD, SPECcpu2017 (est) v1.1.0, Stream Triad, Intel distribution of LINPACK, ic19.1u2, MPI: Version 2019 Update 9 Build 20200923; MKL: psxe_​runtime_​2020.4.17, test by Intel on 2/3/21.

Geomean of

Integer throughput/Floating Point throughput/STREAM/LINPACK

New: March 15, 2021

Baseline: Jan 17, 2021

[24] 3rd Gen Intel Xeon Platinum 8380 processor delivers 4.5x performance on cloud data microservices usage vs. legacy Intel Xeon E5-v4 platform enabling faster business decisions. 3rd Generation Intel® Xeon® Platinum processor 4.5x higher responses with CloudXPRT Web Microservices vs. legacy Intel Xeon E5-v4 server: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic​, 1x S4610 SSD 960G, CloudXPRT v1.0, Web Microservices (Requests per minute @ p.95 latency <= 3s), test by Intel on 3/12/2021. Baseline: Intel Xeon E5-v4: 1-node, 2x Intel Xeon processor E5-2699v4 on Wildcat Pass with 256 GB (8 slots/ 32GB/ 2400) total DDR4 memory, ucode 0x038, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic​, 1x S3700 400GB SSD, CloudXPRT v1.0, test by Intel on 1/17/21. Intel contributes to the development of benchmarks by participating in, sponsoring, and/or contributing technical support to various benchmarking groups, including the BenchmarkXPRT Development Community administered by Principled Technologies. CloudXPRT Web Microservices New: March 12, 2021

Baseline: Jan 17, 2021

[23] 3rd Gen Intel Xeon Platinum 8380 processor delivers 2.3x performance on cloud data analytics usage vs. legacy Intel Xeon E5-v4 platform enabling faster business decisions 3rd Generation Intel® Xeon® Platinum processor 2.3x higher responses with CloudXPRT - Data Analytics vs. legacy Intel Xeon E5-v4 server: New: Platinum 8380:1-node, 2x 3rd Gen Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic​, 1x S4610 SSD 960G, CloudXPRT v1.0, Data Analytics (Analytics per minute @ p.95 <= 90s), test by Intel on 3/12/2021. Baseline: Intel Xeon E5-v4: 1-node, 2x Intel Xeon processor E5-2699v4 on Wildcat Pass with 256 GB (8 slots/ 32GB/ 2400) total DDR4 memory, ucode 0x038, HT on, Turbo on, Ubuntu 20.04, 5.4.0-65-generic​, 1x S3700 400GB SSD, CloudXPRT v1.0, test by Intel on 1/17/21. Intel contributes to the development of benchmarks by participating in, sponsoring, and/or contributing technical support to various benchmarking groups, including the BenchmarkXPRT Development Community administered by Principled Technologies. CloudXPRT Data Analytics New: March 12, 2021

Baseline: Jan 17, 2021

[22]Up to 2.95x virtualization performance with 3rd Gen Intel® Xeon® Scalable processor with Intel® SSD D5-P5510 Series and Intel® Ethernet Network Adapter E810 vs. legacy Intel Xeon E5 v4 platform 3rd Generation Intel® Xeon® Platinum processor 2.95x higher virtualization performance vs. legacy Intel Xeon E5-v4 server: New: Platinum 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 2048 GB (32 slots/ 64GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, RedHat 8.3, 4.18.0-240.el8.x86_​64, 1x S4610 SSD 960G, 4x P5510 3.84TB NVME, 2x Intel E810, Virtualization workload, Qemu-kvm 4.2.0-34 (inbox), WebSphere 8.5.5, DB2 v9.7, Nginx 1.14.1, test by Intel on 3/14/2021. Baseline: Intel Xeon E5-v4: 1-node, 2x Intel Xeon processor E5-2699v4 on Wildcat Pass with 768 GB (24 slots/ 32GB/ 2666[1600]) total DDR4 memory, ucode 0xb000038, HT on, Turbo on, RedHat 8.3, 4.18.0-240.el8.x86_​64, 1x S3700 400GB SSD, 2x P3700 2TB NVME, 2x Intel XL710, Virtualization workload, Qemu-kvm 4.2.0-34 (inbox), WebSphere 8.5.5, DB2 v9.7, Nginx 1.14.1, test by Intel on 1/14/2021.

Virtualization workload

New: March 14, 2021

Baseline: Jan 14, 2021

[21] Process up to 2.4x transactions with the new 3rd Gen Intel Xeon Platinum 8380 platform vs. legacy Intel Xeon E5-v4 platform 3rd Generation Intel® Xeon® Platinum processor 2.4x higher Transactions on OLTP Database vs. legacy Intel Xeon E5-v4 server: New: Platinum 8380: 1-node, 2x 3rd Gen Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, Redhat 8.3, 4.18.0-240.el8.x86_​64 x86_​64, 1x Intel SSD 960GB OS Drive, 4x Intel P5800 1.6T (2xDATA, 2XREDO), x Onboard 1G/s, HammerDB 4.0, Oracle 19c, test by Intel on 3/16/2021. Baseline: Intel Xeon E5-2699v4: 1-node, 2x Intel Xeon processor E5-2699v4 on Wildcat Pass with 384 GB (24 slots/ 16GB/ 2133[1600]) total DDR4 memory, ucode 0x038, HT on, Turbo on, Redhat 8.3, 4.18.0-240.el8.x86_​64 x86_​64, 1x Intel 200GB SSD OS Drive, 4x 2.0T P3700 (2xDATA, 2xREDO), x Onboard 1G/s, HammerDB 4.0, Oracle 19c, test by Intel on 1/27/21. HammerDB OLTP w/Oracle New: March 16, 2021

Baseline: Jan 27, 2021

[14] Up to 39% More Bandwidth with Intel® Optane™ PMem 200 series 512GB module vs. Intel® Optane™ PMem 100 series 512GB module 3rd Generation Intel® Xeon® Platinum processor & Intel® Optane™ persistent memory 200 series New: 1-node, 1x pre-production CPX6 Processor @ 2.9GHz on Intel - Cedar Island Customer Reference Board (CRB) with DRAM: (per socket) 6 slots / 32GB / 2666 MT/s, PMem: (per socket) 1x 512GB Intel® Optane™ PMem 200 series module at 15W (192GB DRAM, 512GB PMem) total memory, ucode WW12'20 (pre-production), running Fedora 30 kernel 5.1.18-200.fc29.x86_​65,using MLC v3.8 with App-Direct. Source: 2020ww18_​CPX_​BPS_​BG, test by Intel on 31 Mar 2020. Baseline: 1-node, 1x Intel® Xeon® Platinum 8280L processor @ 2.7GHz on Intel - Purley Customer Reference Board (CRB) with DRAM: (per socket) 6 slots / 32GB / 2666 MT/s, PMem: (per socket) 1x 512GB Intel® Optane™ PMem 100 series module at 15W (192GB DRAM, 512GB PMem) total memory, ucode 0x04002F00, running Fedora 29 kernel 5.1.18-200.fc29.x86_​64,using MLC v3.8 with App-Direct workload. Source: 2020ww22_​CPX_​BPS_​BG, test by Intel on 27 Apr 2020. MLC ver3.8 with App Direct (from Intel Optane PMem 200 series demo) New: March 31, 2020

Baseline: April 27, 2020

[13] Multi-generation ResNet-50 Training Throughput Performance Improvement with Intel DL Boost supporting INT8 and BF16 3rd Generation Intel® Xeon® Platinum processor

New- 3rd Gen Intel Xeon Scalable Processor: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 384 GB (24 slots / 16GB / 3200) total memory, ucode 0x700001b, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-29-generic, Intel SSD 800GB OS Drive, ResNet-50 v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#6ef2116e6a09, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, FP32, BF16, global BS=1024, 4 instances, 28-cores/instance, test by Intel on 06/01//2020.

2nd Gen Intel Xeon Scalable Processor: 1-node, 4x Intel® Xeon® Platinum 8280 processor (28C, 205W) on Intel Reference Platform (Lightning Ridge) with 768 GB (24 slots / 32 GB / 2933 ) total memory, ucode 0x4002f00, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-29-generic, Intel SSD 800GB OS Drive, ResNet-50 v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit# 6ef2116e6a09, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, FP32, global BS=1024, 4 instances, 28-cores/instance, test by Intel on 06/01/2020.

Intel Xeon Scalable Processor: 1-node, 4x Intel® Xeon® Platinum 8180 processor (28C, 205W) on Intel Reference Platform (Lightning Ridge) with 768 GB (24 slots / 32 GB / 2666 ) total memory, ucode 0x2000069, HT on, Turbo on, with Ubuntu 20.04 LTS, 5.4.0-26-generic, Intel SSD 800GB OS Drive, Training: ResNet-50-v1.5,Inference: ResNet-50-v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#6ef2116e6a09, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, FP32, global BS=1024, 4 instances, 28-cores/instance, test by Intel on 6/02/2020.

Baseline- Intel Xeon processor E7 v4: 1-node, 4x Intel® Xeon® processor E7-8890 v4 (24C, 165W) on Intel Reference Platform (Brickland) with 512 GB (32 slots /16GB/ 1600) total memory, ucode 0xb000038, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-29-generic, Intel SSD 800GB OS Drive, Training: ResNet-50-v1.5,Inference: ResNet-50-v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#6ef2116e6a09, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, FP32, global BS=1024, 4 instances, 24-cores/instance, test by Intel on 6/08/2020

Training: ResNet-50 v1.5 Throughput New: June 01, 2020

2nd Gen: June 01, 2020

1st Gen: June 02, 2020

Baseline: June 08, 2020

[12] Multi-generation ResNet-50 Inference Throughput Performance Improvement with Intel DL Boost supporting INT8 and BF16 3rd Generation Intel® Xeon® Platinum processor

New 3rd Gen Intel Xeon Scalable Processor (Cooper Lake): 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 384 GB (24 slots / 16GB / 3200) total memory, ucode 0x700001b, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-29-generic, Intel SSD 800GB OS Drive, Inference: ResNet-50 v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#6ef2116e6a09, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, FP32, INT8-VNNI, BF16, BS=128, 4 instances, 28-cores/instance, test by Intel on 06/01//2020.

2nd Gen Intel Xeon Scalable Processor (Cascade Lake): 1-node, 4x Intel® Xeon® Platinum 8280 processor (28C, 205W) on Intel Reference Platform (Lightning Ridge) with 768 GB (24 slots / 32 GB / 2933 ) total memory, ucode 0x4002f00, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-29-generic, Intel SSD 800GB OS Drive, Inference: ResNet-50 v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit# 6ef2116e6a09, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, FP32, INT8-VNNI, BS=128, 4 instances, 28-cores/instance, test by Intel on 06/01/2020.

Intel Xeon Scalable Processor (Skylake): 1-node, 4x Intel® Xeon® Platinum 8180 processor (28C, 205W) on Intel Reference Platform (Lightning Ridge) with 768 GB (24 slots / 32 GB / 2666 ) total memory, ucode 0x2000069, HT on, Turbo on, with Ubuntu 20.04 LTS, 5.4.0-26-generic, Intel SSD 800GB OS Drive, Inference: ResNet-50-v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#6ef2116e6a09, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, FP32, INT8, BS=128, 4 instances, 28-cores/instance, test by Intel on 6/02/2020.

Baseline: Intel Xeon processor E7 v4 (Broadwell): 1-node, 4x Intel® Xeon® processor E7-8890 v4 (24C, 165W) on Intel Reference Platform (Brickland) with 512 GB (32 slots /16GB/ 1600) total memory, ucode 0xb000038, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-29-generic, Intel SSD 800GB OS Drive, Inference: ResNet-50-v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#6ef2116e6a09, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, FP32, BS=128, 4 instances, 24-cores/instance, test by Intel on 6/08/2020.

Inference: ResNet-50 v1.5 Throughput New: June 01, 2020

2nd Gen: June 01, 2020

1st Gen: June 02, 2020

Baseline: June 08, 2020

[11] 1.9x average performance gain on popular workloads with the new 3rd Gen Intel® Xeon® Platinum 8380H processor vs. 5-year old platform 3rd Generation Intel® Xeon® Platinum processor

Average performance based on Geomean of est SPECrate®2017_​int_​base 1-copy, est SPECrate®2017_​fp_​base 1-copy, est SPECrate®2017_​int_​base, est SPECrate®2017_​fp_​base, STREAM Triad, Intel distribution of LINPACK, Virtualization and OLTP Database workloads. Results have been estimated or simulated.

New: SPECcpu_​2017, STREAM, LINPACK Performance: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 768 GB (24 slots / 32 GB / 3200) total memory, microcode 0x87000016, HT on for SPECcpu, off for STREAM, LINPACK), Turbo on, with Ubuntu 19.10, 5.3.0-48-generic, 1x Intel 240GB SSD OS Drive, est SPECcpu_​2017, STREAM Triad, Intel distribution of LINPACK, test by Intel on 5/15/2020. HammerDB OLTP Database Performance: New: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 768 GB (24 slots / 32 GB / 3200) total memory, microcode 0x700001b, HT on, Turbo on, with Redhat 8.1, 4.18.0-147.3.1.el8_​1.x86_​64, 1x Intel 240GB SSD OS Drive, 2x6.4T P4610 for DATA, 2x3.2T P4610 for REDO, 1Gbps NIC, HammerDB 3.2, Popular Commercial Database, test by Intel on 5/13/2020. Virtualization Performance: New: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 1536 GB (48 slots / 32 GB / 3200 (@2933)) total memory, microcode 0x700001b, HT on, Turbo on, with RHEL-8.1 GA, 4.18.0-147.3.1.el8_​1.x86_​64, 1x Intel 240GB SSD OS Drive, 4x P4610 3.2TB PCIe NVMe, 4 x 40 GbE x710 dual port, Virtualization workload, test by Intel on 5/20/2020.

Baseline: SPECcpu_​2017, STREAM, LINPACK Performance: 1-node, 4x Intel® Xeon® processor E7-8890 v3 on Intel Reference Platform (Brickland) with 512 GB (32 slots / 16 GB / 2133 (@1600)) total memory, microcode 0x16, HT on for SPECcpu, off for STREAM, LINPACK), Turbo on, with Ubuntu 20.04 LTS, 5.4.0-29-generic, 1x Intel 480GB SSD OS Drive, est SPECcpu_​2017, STREAM Triad, Intel distribution of LINPACK, test by Intel on 5/15/2020. HammerDB OLTP Database Performance: 1-node, 4x Intel® Xeon® processor E7-8890 v3 on Intel Reference Platform (Brickland) with 1024 GB (64 slots / 16GB / 1600) total memory, microcode 0x16, HT on, Turbo on, with Redhat 8.1, 4.18.0-147.3.1.el8_​1.x86_​64, 1x Intel 800GB SSD OS Drive, 1x1.6T P3700 for DATA, 1x1.6T P3700 for REDO, 1Gbps NIC, HammerDB 3.2, Popular Commercial Database, test by Intel on 4/20/2020. Virtualization Performance: 1-node, 4x Intel® Xeon® processor E7-8890 v3 on Intel Reference Platform (Brickland) with 1024 GB (64 slots / 16GB / 1600) total memory, microcode 0x0000016, HT on, Turbo on, with RHEL-8.1 GA, 4.18.0-147.3.1.el8_​1.x86_​64, 1x Intel 240GB SSD OS Drive, 4x P3700 2TB PCIe NVMe, 4 x 40 GbE x710 dual port, Virtualization workload, test by Intel on 5/20/2020.

Geomean of est SPECrate®2017_​int_​base(1-copy), est SPECrate®2017_​fp_​base(1-copy), est SPECrate®2017_​int_​base, est SPECrate®2017_​fp_​base, STREAM Triad, Intel distribution of LINPACK, Virtualization and OLTP Database workloads New: May 20, 2020

Baseline: May 20, 2020

[10] Process up to 1.98x more OLTP database transactions per minute with the new 3rd Gen Intel® Xeon® Scalable platform vs. 5-year old 4-socket platform 3rd Generation Intel® Xeon® Platinum processor New: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 768 GB (24 slots / 32 GB / 3200) total memory, microcode 0x700001b, HT on, Turbo on, with Redhat 8.1, 4.18.0-147.3.1.el8_​1.x86_​64, 1x Intel 240GB SSD OS Drive, 2x6.4T P4610 for DATA, 2x3.2T P4610 for REDO, 1Gbps NIC, HammerDB 3.2, Popular Commercial Database, test by Intel on 5/13/2020.

Baseline: 1-node, 4x Intel® Xeon® processor E7-8890 v3 on Intel Reference Platform (Brickland) with 1024 GB (64 slots / 16GB / 1600) total memory, microcode 0x16, HT on, Turbo on, with Redhat 8.1, 4.18.0-147.3.1.el8_​1.x86_​64, 1x Intel 800GB SSD OS Drive, 1x1.6T P3700 for DATA, 1x1.6T P3700 for REDO, 1Gbps NIC, HammerDB 3.2, Popular Commercial Database, test by Intel on 4/20/2020.

HammerDB OLTP Database New: April 20, 2020

Baseline: April 20, 2020

[9] Up to 1.93x higher AI training performance with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost with BF16 vs. prior generation on ResNet50 throughput for image classification 3rd Generation Intel® Xeon® Platinum processor New: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 384 GB (24 slots / 16GB / 3200) total memory, ucode 0x700001b, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-26,28,29-generic, Intel 800GB SSD OS Drive, ResNet-50 v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#828738642760358b388d8f615ded0c213f10c99a, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, BF16, BS=512, test by Intel on 5/18/2020.

Baseline: 1-node, 4x Intel® Xeon® Platinum 8280 processor on Intel Reference Platform (Lightning Ridge) with 768 GB (24 slots / 32 GB / 2933 ) total memory, ucode 0x4002f00, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-26,28,29-generic, Intel 800GB SSD OS Drive, ResNet-50 v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#828738642760358b388d8f615ded0c213f10c99a, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, FP32, BS=512, test by Intel on 5/18/2020.

ResNet-50 v1.5 Image Classification Training Throughput New: May 18, 2020

Baseline: May 18, 2020

[8] Up to 1.9x higher AI inference performance with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost with BF16 vs. prior generation with FP32 on BERT throughput for natural language processing 3rd Generation Intel® Xeon® Platinum processor New: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 384 GB (24 slots / 16GB / 3200) total memory, ucode 0x700001b, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-26,28,29-generic, Intel 800GB SSD OS Drive, BERT-Large (QA) Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#828738642760358b388d8f615ded0c213f10c99a, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Squad 1.1 dataset, oneDNN 1.4, BF16, BS=32, 4 instances, 28-cores/instance, test by Intel on 5/18/2020. Baseline: 1-node, 4x Intel® Xeon® Platinum 8280 processor on Intel Reference Platform (Lightning Ridge) with 768 GB (24 slots / 32 GB / 2933 ) total memory, ucode 0x4002f00, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-26,28,29-generic, Intel 800GB SSD OS Drive, BERT-Large (QA) Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#828738642760358b388d8f615ded0c213f10c99a, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Squad 1.1 dataset, oneDNN 1.4, FP32, BS=32, 4 instances, 28-cores/instance, test by Intel on 5/18/2020. BERT-Large (QA) Squad Inference Throughput New: May 18, 2020

Baseline: May 18, 2020

[7] 225X faster access to data with Intel® Optane™ persistent memory 200 series vs. NVMe SSD Intel® Optane™ persistent memory 200 series New: Intel® Optane™ persistent memory idle read latency compared to Baseline: Intel® SSD DC P4610 Series TLC NAND solid state drive idle read latency. Memory idle read latency NA
[6] Average of 25% higher memory bandwidth vs. prior gen 3rd Generation Intel® Xeon® Platinum processor & Intel® Optane™ persistent memory 200 series New: 1-node, 1x Intel® Xeon® pre-production CPX6 28C @ 2.9GHz processor on Cooper City with Single PMem module config (6x32GB DRAM; 1x{128GB,256GB,512GB} Intel® Optane™ PMem 200 series module at 15W), ucode pre-production running Fedora 29 kernel 5.1.18-200.fc29.x86_​64, and MLC ver 3.8 with App-Direct. Source: 2020ww18_​CPX_​BPS_​BG. Tested by Intel, on 31 Mar 2020.

Baseline: 1-node, 1x Intel® Xeon® 8280L 28C @ 2.7GHz processor on Neon City with Single PMem module config (6x32GB DRAM; 1x{128GB,256GB,512GB} Intel® Optane™ PMem 100 series module at 15W) ucode Rev: 04002F00 running Fedora 29 kernel 5.1.18-200.fc29.x86_​64, and MLC ver 3.8 with App-Direct. Source: 2020ww18_​CPX_​BPS_​DI. Tested by Intel, on 27 Apr 2020

MLC ver 3.8 with App-Direct New: March 31, 2020

Baseline: April 27, 2020

[5] Up to 1.92x higher performance on cloud data analytics usage models with the new 3rd Gen Intel® Xeon® Scalable processor vs. 5-year old 4-socket platform 3rd Generation Intel® Xeon® Platinum processor

New: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 1536 GB (48 slots / 32 GB / 3200 (@2933)) total memory, microcode 0x700001b, HT on, Turbo on, with Ubuntu 18.04.4 LTS, 5.3.0-53-generic, 1x Intel 240GB SSD OS Drive, 4x P4610 3.2TB PCIe NVMe, 4 x 40 GbE x710 dual port, CloudXPRT vCP - Data Analytics, Kubernetes, Docker, Kafka, MinIO, Prometheus, XGBoost workload, Higgs dataset, test by Intel on 5/27/2020.

Baseline: 1-node, 4x Intel® Xeon® processor E7-8890 v3 on Intel Reference Platform (Brickland) with 1024 GB (64 slots / 16GB / 1600) total memory, microcode 0x0000016, HT on, Turbo on, with Ubuntu 18.04.4 LTS, 5.3.0-53-generic, 1x Intel 400GB SSD OS Drive, 4x P3700 2TB PCIe NVMe, 4 x 40 GbE x710 dual port, CloudXPRT vCP - Data Analytics, Kubernetes, Docker, Kafka, MinIO, Prometheus, XGBoost workload, Higgs dataset, test by Intel on 5/27/2020.

Intel contributes to the development of benchmarks by participating in, sponsoring, and/or contributing technical support to various benchmarking groups, including the BenchmarkXPRT Development Community administered by Principled Technologies.

CloudXPRT vCP- Data Analytics, Kubernetes, Docker, Kafka, MinIO, Prometheus, XGBoost workload, Higgs dataset New: May 27, 2020

Baseline: May 27, 2020

[4] Up to 2.2x more Virtual Machines with the new 3rd Gen Intel® Xeon® Scalable platform and Intel® SSD Data Center Family vs. 5-year old 4-socket platform 3rd Generation Intel® Xeon® Platinum processor New: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 1536 GB (48 slots / 32 GB / 3200 (@2933)) total memory, microcode 0x700001b, HT on, Turbo on, with RHEL-8.1 GA, 4.18.0-147.3.1.el8_​1.x86_​64, 1x Intel 240GB SSD OS Drive, 4x P4610 3.2TB PCIe NVMe, 4 x 40 GbE x710 dual port, Virtualization workload, test by Intel on 5/20/2020.

Baseline:1-node, 4x Intel® Xeon® processor E7-8890 v3 on Intel Reference Platform (Brickland) with 1024 GB (64 slots / 16GB / 1600) total memory, microcode 0x0000016, HT on, Turbo on, with RHEL-8.1 GA, 4.18.0-147.3.1.el8_​1.x86_​64, 1x Intel 240GB SSD OS Drive, 4x P3700 2TB PCIe NVMe, 4 x 40 GbE x710 dual port, Virtualization workload, test by Intel on 5/20/2020.

Virtualization workload New: May 20, 2020

Baseline: May 20, 2020

[3] The new Intel® 3D NAND SSDs deliver an improved balance of performance and capacity for your storage requirements-including up to 33% better performance and 40% lower latency. Intel® SSD D7-P5500 series

33% better performance when using Intel ® SSD D7-P5500 series: Source - Intel. Comparing datasheet figures for 4KB Random Read QD256 performance between the Intel ® SSD D7-P5500 Series 7.68TB and Intel ® SSD DC P4510 Series 8TB with both drives running on PCIe 3.1. Measured performance was 854K IOPS and 641.8K IOPS for the D7-P5500 and DC P4510, respectively. Performance for both drives measured using FIO Linux CentOS 7.2 kernel 4.8.6 with 4KB (4,096 bytes) of transfer size with Queue Depth 64 (4 workers). Measurements are performed on a full Logical Block Address (LBA) span of the drive once the workload has reached steady state but including all background activities required for normal operation and data reliability. Power mode set at PM0. Any differences in your system hardware, software or configuration may affect your actual performance. Intel expects to see certain level of variation in data measurement across multiple drives.

40% lower latency when using Intel ® SSD D7-P5500 series: Source - Intel. Comparing datasheet figures for 4KB Random Write QD1 latency between the Intel ® SSD D7-P5500 Series 7.68TB and Intel® SSD DC P4510 Series 8TB with both drives running on PCIe 3.1. Measured latency was 15μs and 25μs for the D7-P5500 and DC P4510, respectively. Performance for both drives measured using FIO Linux CentOS 7.2 kernel 4.8.6 with 4KB (4096 bytes) of transfer size with Queue Depth 1 (1 worker). Measurements are performed on a full Logical Block Address (LBA) span of the drive once the workload has reached steady state but including all background activities required for normal operation and data reliability. Power mode set at PM0. Any differences in your system hardware, software or configuration may affect your actual performance. Intel expects to see certain level of variation in data measurement across multiple drives.

Read/Write IOPS, Latency N/A
[2] Up to 1.87x higher AI Inference performance with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost with BF16 vs. prior generation using FP32 on ResNet50 throughput for image classification 3rd Generation Intel® Xeon® Platinum processor New: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 384 GB (24 slots / 16GB / 3200) total memory, ucode 0x700001b, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-26,28,29-generic, Intel 800GB SSD OS Drive, ResNet-50 v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#828738642760358b388d8f615ded0c213f10c99a, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, BF16, BS=56, 4 instances, 28-cores/instance, test by Intel on 5/18/2020.

Baseline: 1-node, 4x Intel® Xeon® Platinum 8280 processor on Intel Reference Platform (Lightning Ridge) with 768 GB (24 slots / 32 GB / 2933 ) total memory, ucode 0x4002f00, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-26,28,29-generic, Intel 800GB SSD OS Drive, ResNet-50 v1.5 Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#828738642760358b388d8f615ded0c213f10c99a, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Imagenet dataset, oneDNN 1.4, FP32, BS=56, 4 instances, 28-cores/instance, test by Intel on 5/18/2020.

ResNet-50 Inference Throughput New: May 18, 2020

Baseline: May 18, 2020

[1] Up to 1.7x more AI training performance with 3rd Gen Intel® Xeon® Scalable processor supporting Intel® DL Boost with BF16 vs. prior generation on BERT throughput for natural language processing 3rd Generation Intel® Xeon® Platinum processor New: 1-node, 4x 3rd Gen Intel® Xeon® Platinum 8380H processor (pre-production 28C, 250W) on Intel Reference Platform (Cooper City) with 384 GB (24 slots / 16GB / 3200) total memory, ucode 0x700001b, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-26,28,29-generic, Intel 800GB SSD OS Drive, BERT-Large (QA) Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#828738642760358b388d8f615ded0c213f10c99a, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Squad 1.1 dataset, oneDNN 1.4, BF16, BS=12, test by Intel on 5/18/2020.

Baseline: 1-node, 4x Intel® Xeon® Platinum 8280 processor on Intel Reference Platform (Lightning Ridge) with 768 GB (24 slots / 32 GB / 2933 ) total memory, ucode 0x4002f00, HT on, Turbo on, with Ubuntu 20.04 LTS, Linux 5.4.0-26,28,29-generic, Intel 800GB SSD OS Drive, BERT-Large (QA) Throughput, https://github.com/Intel-tensorflow/tensorflow -b bf16/base, commit#828738642760358b388d8f615ded0c213f10c99a, Modelzoo: https://github.com/IntelAI/models/ -b v1.6.1, Squad 1.1 dataset, oneDNN 1.4, FP32, BS=12, test by Intel on 5/18/2020.

BERT-Large (QA) Squad Training Throughput New: May 18, 2020

Baseline: May 18, 2020