Performance Index

ID 615781
Date 03/08/2024
Document Table of Contents

Investor Day 2022

Investor Meeting

Speaker Statement Details
Jim Johnson World's Best Desktop Gaming Processor 12th Gen Intel® Core™ i9-12900K is the world's best gaming processor, as measured by unique features and superior in-game benchmark mode performance (score or frames per second) on majority of the 31 game titles tested (as of Oct 14, 2021), including in comparison to AMD Ryzen 5950X. See desktop processor section on www.intel.com/PerformanceIndex for additional details.  Results may vary.
Jim Johnson ADL-S - upto30% faster desktop gaming v.comp As measured by superior performance of 12th Gen Intel® Core™ i9-12900K vs. AMD Ryzen 9 5950X Troy: A Total War Saga onon10/14/2021. See desktop processor section on www.intel.com/PerformanceIndex for additional details.  Results may vary.
Jim Johnson World's Fastest Mobile Processor Based on superior performance of 12th Gen Intel Core i9 12900HK against Intel Core i9 11980HK, AMD Ryzen 9 5900HX, and Apple M1 Max. Intel processor performance is estimated based on measurements with Intel Reference Validation Platforms. AMD processor performance is estimated based on measurements on a Lenovo Legion R9000K with RTX 3080. Apple M1 Max performance is estimated based on public statement made by Apple on 10/18/2021 and measurements on Apple M1 Max 16" 64GB RAM Model A2485. Best available compilers selected for all processors. Binaries compiled with ICC for Intel/ AMD, binaries compiled withXcode13.1 for Apple. The metric used is the geometric mean of an n-copy SPECraterun of the C/C++ integer benchmarks in SPEC CPU 2017. See mobile processor section on www.intel.com/PerformanceIndex for additional workload and configuration details. Results may vary.
Jim Johnson ADL-H Mobile Performance -up to 44% faster Video editing Performance with a 12th Gen Intel® Core™ i9 12900HK processor As measured by Puge tBench Premiere Pro v0.95.4 - (Adobe v.22.0.0) benchmark- Overall score on 12th Gen Intel® Core™ i9-12900HK vs. 11th Gen Intel® Core™ i9- 11980HK. See www.intel.com/PerformanceIndex (12th Gen Intel Core mobile processors) for additional workload and configuration details. Results may vary.
Jim Johnson 12 th GenH2H demo on multiple games Based on superior in-game benchmark mode performance of 12th Gen Intel Core i9-12900HK with NVIDIA RTX 3080 GPU vs 11th Gen Intel Core i9-11980HK with same GPU. Performance results are based on testing as of 12/10/2021. Full Configurations available at intel.com/PerformanceIndex(12 th Gen Intel Core mobile processors). Results may vary. Game titles :Far Cry 5, Middle Earth: Shadow OfWar, Rise of the Tomb Raider, Total War: Warhammer 2 - Campaign, Hitman 2. Full Configuration: Processor: 12th Gen Intel® Core™ i9-12900HK processor, 14C24T; tested on a pre- production OEM design; Memory: 2x16GB DDR5-4800Mhz; Storage: Samsung 980 Pro 1TB ; Display Resolution: 1920x1080; OS: Microsoft Windows 11 21H2 (OS Build 22000.282);Edge Version:v95.0.1020.53; Graphics card: NVIDIA RTX 3080, Graphics driver: 30.0.15.1073; Max GFX power:165W, BIOS version:E17K31MS.T2A; GPU Mode: Discrete; Power Plan set to Balanced, Power Mode(Win 11 feature) set to "Best performance"; OEM power application (MSI Center)set to MSI Center: "Extreme performance" ; VBS enabled, Defender enabled, and Tamper Protection enabled. Processor: AMD Ryzen 9 5900HX processor, 8C16T; tested on Lenovo Legion R9 5900HX laptop, memory: 2X 16 GB DDR4- 3200MHz; Storage: Samsung SSD980 Pro 1TB; Display Resolution: 1920x1080; OS: Microsoft Windows 11 21H2 (OS Build 22000.282 );Edge Version:v95.0.1020. 53;Graphicscard: NVIDIA RTX 3080, Graphics driver: 30.0.14.9709; Max GFX power = 165W, BIOS version:GKCN34WW. GPU Mode: Discrete; Power Plan set to Legion High Performance (the Lenovo App will overwrite Windows Balance); Power Mode(Win 11 Feature) set to "Best Performance"; OEM power Application Setting set to Lenovo Vantage set to Performance Mode; VBS enabled, Defender enabled, and Tamper Protection enabled.
Jim Johnson 12 th Gen Intel i9-12900KS v comp Hitman 3demo Processor AMD Ryzen 9 5950(16 core)system vsIntel Corei9-12900ks intel i9 system. Both systems feature32 gigs of ram and a 3090 graphics card and are running the built in benchmark inside of the game Hitman 3 and 1080P. Testing as of February 17, 2022.
Jim Johnson Raptor Lake - up to double digit performance boost Based on performance projections from Intel on 02/17/2022 on SPECint_​rate_​2017 n Copy ICC 19u4comparing Raptor Lake 125W (8P+ 16 E )versus Core i9 12900K.Results may vary.
Jim Johnson The Intel® Evo™ platform - verified real-world experiences Visit www.intel.com/evo for additional details.
Jim Johnson The Intel vPro® Platform - unrivaled business PC platform Visit www.intel.com/vprofor additional details.
Jim Johnson 12 th Gen Intel Core processors - performance hybrid architecture Performance hybrid architecture combines two new core microarchitectures, Performance-cores (P-cores) and Efficient-cores (E-cores), on a single processor die. Select 12th Gen Intel® Core™ processors (certain 12th Gen Intel Core i5 processors and lower) do not support performance hybrid architecture, only P-cores.
Jim Johnson 12 th Gen Intel Core processors - Intel® Thread Director Built into the hardware, Intel® Thread Director is provided only in performance hybrid architecture configurations of 12th Gen Intel® Core™ processors; OS enablement is required. Available features and functionality vary by OS.
Randhir Thakur New die-to-die interconnect (D2D) has 4X I/O Performance at 1/10th Power, compared to previous generation PCIeG5x16 Die to Die interconnects offer far greater performance density, power density, and performance/power compared to board level interconnects such as PCIe. This example is based on Intel internal estimates based on the industry's current best in class Die to Die interconnects as of 2/17/2022.
Nick McKeown A recent study with Rakuten measured how quickly users could download data on different 5G networks in Japan. They found that Rakuten's vRAN network had the fastest download speeds - nearly 40% faster than other measured download speeds. Rakuten's virtualized network is built on Intel Xeon processors running our FlexRAN software.   Data available here:

https://www.opensignal.com/reports/2021/09/japan/mobile-network-experience-5g

Raja Koduri Next Gen Intel® Xeon® Scalable processors, codenamed Sapphire Rapids processor with high bandwidth memory (HBM) outperforms AMD Milan by 2.8x and AMD Milan-X by 2.1x

Also

Next Gen Intel® Xeon® Scalable processors, codenamed Sapphire Rapids processor with high bandwidth memory (HBM) outperforms 3 rd Gen Xeon Scalable by 2.8x

Test by Intel as of 01/26/2022. 1-node, 2x Intel(R) Xeon(R) Platinum 8380 CPU @ 2.30GHz (Family 6 Model 106 Stepping 6), 80 cores, HT On, Turbo On, Total Memory 256 GB (16x16GB 3200MT/s, Dual-Rank), BIOS Version SE5C6200.86B.0020.P23.2103261309, ucode revision=0xd000270, Rocky Linux 8.5 , Linux version 4.18.0-240.22.1.el8_​3.crt6.x86_​64, OpenFOAM® v1912, Motorbike 28M @ 250 iterations; Build notes: Tools: Intel Parallel Studio 2020u4, Build knobs: -O3 -ip -xCORE-AVX512

Test by Intel as of 01/26/2022. 1-node, 2x AMD EPYC 7763 64-Core Processor @ 2.45GHz (Family 25 Model 1 Stepping 1), 128 cores, HT On, Turbo On, Total Memory 256 GB (16x16GB 3200MT/s, Dual-Rank), BIOS Version 2.1, ucode revision=0xa00111d, Rocky Linux 8.5 , Linux version 4.18.0-240.22.1.el8_​3.crt6.x86_​64, OpenFOAM® v1912, Motorbike 28M @ 250 iterations; Build notes: Tools: Intel Parallel Studio 2020u4, Build knobs: -O3 -ip -xCORE-AVX2

Test by Microsoft® Azure as of 11/08/21. 1-node, 2x AMD EPYC 7V73X on Azure HBv3, 128 cores (120 available), HT Off, Total Memory 448 GB, CentOS 8.1 HPC Image, GNU compiler 9.2.0, OpenFOAM® v1912, Motorbike 28M @ 250 iterations Test by Intel as of 01/26/2022.  1-node, 2x Next Gen Intel Xeon Scalable processor (codenamed Sapphire Rapids, > 40 cores), HT On, Turbo On, Total Memory 512 GB (16x32GB 4800MT/s, Dual-Rank), preproduction platform and BIOS, Red Hat Enterprise Linux 8.4 , Linux version 4.18.0-305.el8.x86_​64, OpenFOAM® v1912, Motorbike 28M @ 250 iterations; Build notes: Tools: Intel Parallel Studio 2020u4, Build knobs: -O3 -ip -xCORE-AVX512 Test by Intel as of 01/26/2022. 1-node, 2x Next Gen Intel® Xeon® (code Sapphire Rapids > 40) Plus HBM, HT Off, Turbo Off, Total Memory 128 GB (HBM2e at 3200 MHz), preproduction platform and BIOS, CentOS 8, Linux version 5.12.0-0507.intel_​next.06_​02_​po.5.x86_​64+server, OpenFOAM® v1912, Motorbike 28M @ 250 iterations; Build notes: Tools: Intel Parallel Studio 2020u4, Build knobs: -O3 -ip -xCORE-AVX512

OPENFOAM® is a registered trade mark of OpenCFD Limited, producer and distributor of the OpenFOAM software via www.openfoam.com.

Raja Koduri Ponte Vecchio outperforms the competition in Financial Services by 2.6x on Binomial Options, 1.9x on Black Scholes, and 1.7x on Monte Carlo

Testing as of 2/14/2022

Intel Platform: 1-node 1x Intel® Xeon® 6336Y, HT On, Turbo Off, total memory 128GB DDR, BIOS Version WLYDCRB1.SYS.0021.P16.2105280638, Ubuntu 20.04, Linux Version 5.10.54+pvc-xtb-po67perf, Ucode 0x8d0002c1, 1x Intel pre-production Ponte Vecchio GPU ;

Competing Platform: 1-node 2x Intel® Xeon® 8360Y, HT On, Turbo On, total memory 256GB DDR, BIOS Version SE5C6200.86B.0022.D08.2103221623, Ubuntu 21.10, Linux Version 5.13.0-27-generic, Ucode 0xd0002a0, 1x NVIDIA A100 80GB PCIe ;

Intel Binomial Options Build notes: Tools: Intel oneAPI 2022.1, Build knobs: -g -fdebug-info-for-profiling -gline-tables-only -fsycl-targets=spir64_​gen -Xsycl-target-backend "-device 0x0bd5 -revision_​id 3" -O3 -fp-model precise -std=c++17 -flto -o binomial.sycl.gpu.precise -I -lpthread

Intel Black- Scholes Build notes: Tools: Intel oneAPI 2022.1, Build knobs: -g -O2 -I/opt/intel/opencl/include/ -L/opt/intel/opencl/lib64/ -ltbb -ltbbmalloc -lOpenCL

Intel Monte Carlo Build notes: Tools: Intel oneAPI 2022.1, Build knobs: -DUSE_​VML=0 -DUSE_​MCG59 -DVEC_​SIZE=8 -DMKL_​ILP64 -Iinclude -I"${MKLROOT}/include" -L"${MKLROOT}/lib/intel64" -lpthread -lmkl_​core -lmkl_​intel_​ilp64 -lmkl_​sequential -lm -ldl -fsycl -fsycl-unnamed-lambda -O /opt/intel/oneapi/mkl/latest/lib/intel64/libmkl_​sycl.a

Competing Platform Binomial Options Build notes: Tools: CUDA SDK 11.4, Build knobs: -I../../common/inc -m64 --threads 0 -gencode arch=compute_​35,code=sm_​35 -gencode arch=compute_​37,code=sm_​37 -gencode arch=compute_​50,code=sm_​50 -gencode arch=compute_​52,code=sm_​52 -gencode arch=compute_​60,code=sm_​60 -gencode arch=compute_​61,code=sm_​61 -gencode arch=compute_​70,code=sm_​70 -gencode arch=compute_​75,code=sm_​75 -gencode arch=compute_​80,code=sm_​80 -gencode arch=compute_​86,code=sm_​86 -gencode arch=compute_​86,code=compute_​86

Competing Platform Black-Scholes Build notes: Tools: CUDA SDK 11.4, Build knobs -ccbin g++ -I/usr/local/cuda-11.4/samples/common/inc -m64 -maxrregcount=16 -gencode arch=compute_​35,code=sm_​35 -gencode arch=compute_​37,code=sm_​37 -gencode arch=compute_​50,code=sm_​50 -gencode arch=compute_​52,code=sm_​52 -gencode arch=compute_​60,code=sm_​60 -gencode arch=compute_​61,code=sm_​61 -gencode arch=compute_​70,code=sm_​70 -gencode arch=compute_​75,code=sm_​75 -gencode arch=compute_​80,code=sm_​80 -gencode arch=compute_​86,code=sm_​86 -gencode arch=compute_​86,code=compute_​86

Competing Platform Monte Carlo Build notes: Tools: CUDA SDK 11.4, Build knobs: -ccbin g++ -I/usr/local/cuda-11.4/samples/common/inc -m64 -gencode arch=compute_​35,code=sm_​35 -gencode arch=compute_​37,code=sm_​37 -gencode arch=compute_​50,code=sm_​50 -gencode arch=compute_​52,code=sm_​52 -gencode arch=compute_​60,code=sm_​60 -gencode arch=compute_​61,code=sm_​61 -gencode arch=compute_​70,code=sm_​70 -gencode arch=compute_​75,code=sm_​75 -gencode arch=compute_​80,code=sm_​80 -gencode arch=compute_​86,code=sm_​86 -gencode arch=compute_​86,code=compute_​86

Raja Koduri Intel GPU Data Center AV1 live streaming "Quality" preset in ultra-low delay configuration is over 30% higher quality than CPU AVC x264 "medium presets (ultra-low delay)

GPU Quality data is measured by Intel 02/09/2022 on pre-production Intel Data Center GPU (150) with IFWI ATS_​M1_​512_​B0_​EU_​008_​IFWI_​21WW50_​01_​GS1463_​PC9446A_​HEADLESS_​OP1040_​P2SB_​DS_​14GT_​LRG_​TR_​DS, on a 1-node dual socket Intel ® 3rd generation Xeon Gold 6336Y microcode 0x11 on an Intel customer reference server board (Coyote pass) 128GB 32-ch 3200 MT/sec, HT enabled, Turbo On, Ubuntu 20.04 LT2 kernel 5.10.54, gfx-driver-master-10017, pre-production FFMPEG. CPU Quality data is measured by Intel 02/09/2022 on an Intel SawTooth Pass Server 1-node, 2-socket Intel(R) Xeon(R) Platinum 8180 CPU @ 2.50GHz with HT on, Turbo on, CPU Microcode 0x4D, Windows 10 Enterprise LTSC 64-bit, 240GB 15-ch, DDR4-2666 DRAM. See command lines at pre-release version of https://github.com/intel/media-delivery. For "2 second delay" data collected with TU1 (Quality mode) using 27 broadcast /gaming sequences, details at https://dgpu-docs.intel.com/devices/iris-xe-max-graphics/guides/media.html, for "ultra low delay" data collected using VBENCH sequences, details at http://arcade.cs.columbia.edu/vbench. PSNRY-BDRATE quality metrics compared to x264-medium (tune PSNR, 1-pass) measured on Intel Xeon 6336Y.

Artic Sound Demo Arctic Sound-M AV1 AI-Enhanced Live Streaming delivers over 20% bitrate savings. BASELINE: Testing 1/29/2022. 1-node, Intel(R) Xeon(R) Gold 6336Y CPU @ 2.40GHz , 24 cores, GPU Discrete card ATS-M M1 512 DVT2A B0 ES2 (150W 512EU), Total Memory 125.6 GB (3200 MHz), Bios: Intel Corporation SE5C6200.86B.0020.P41.2109300305 09/30/2021, Windows 2022 Server, Media driver gfx-master-242549 with DLSR2.0 Disabled (via Windows registry keys), embargo ffmpeg-cartwheel N-103362-g9a31352913 + libVPL v2.5 for ATSM GPU HW AVC Encoder

NEW: Testing as of 1/29/2022. 1-node, Intel(R) Xeon(R) Gold 6336Y CPU @ 2.40GHz, 24 cores, GPU Discrete card ATS-M M1 512 DVT2A B0 ES2 (150W 512EU), Total Memory 125.6 GB (3200 MHz), Bios: Intel Corporation SE5C6200.86B.0020.P41.2109300305 09/30/2021, Windows 2022 Server, Media driver gfx-master-242549 with DLSR2.0 Enabled (via Windows registry keys), embargo ffmpeg-cartwheel N-103362-g9a31352913 + libVPL v2.5 for ATSM GPU HW AV1 Encoder

Sandra Rivera AWS’ EC2 M6i instances offer Ice Lake-based services that deliver up to 15% better price performance and twice the networking speed of the previous generation. Data provided by AWS: AWS’ EC2 M6i instances
Sandra Rivera Google’s Compute Engine N2 VMs tap Ice Lake's built-in AI and crypto acceleration engines to offer over 30% better price performance compared with its previous generation. Data provided by Google: Google’s Compute Engine N2 VMs
Sandra Rivera Customers will experience an almost 6X increase in the number of images they can process per second, reducing the cost for customers to create, deploy and manage their image recognition systems.

Testing as of February 14, 2022.

6X increase in the number of images processed per second: Results have been estimated and based on 1-socket performance expectation comparing pre-production Next Gen Intel Xeon Scalable processor (codenamed Sapphire Rapids, >40 cores) to 3rd Gen Intel Xeon Platinum 8380H processor on SSD-Resnet34, BF16, batch size=1 .

Sandra Rivera Single socket server with 3rd gen Xeon Scalable general purpose  cpu can finishes the End to End - Single Cell -Genomics sequencing in 489 seconds compared with 686 seconds an Nvidia A100 GPU. This means 3rd gen Xeon Scalable is 1.4x faster than Nvidia A100, that equates to over 1.6x better TCO. Single socket server with the next gen general purpose  cpu  can finish the End to End - Single Cell -Genomics sequencing workload in 370 seconds compared with 686 seconds an Nvidia A100 GPU. This means we can deliver nearly 2x the performance of Nvidia's mainstream Training GPU for 2022.

Baseline

Testing as of Dec16th 2020.

Google Cloud instance a2-highgpu-1g, 1x Tesla A100 GPU, 40GB HBM2 Memory, 12 vCPUs, $3.78 cost per hour, dedicated access, Single-cell RNA-seq of 1.3 Million Mouse Brain Cells using SCANPY 1.8.1 Toolkit, score= 686 seconds to compete, total cost to complete $0.70. source: https://github.com/clara-parabricks/rapids-single-cell-examples#example-2-single-cell-rna-seq-of-13-million-mouse-brain-cells

New-1:

Testing as of Feb 5th 2022. Google Cloud instance n2-standard-64, 3rd Gen Intel Xeon Scalable 64vCPUs, 256GB Memory, 257GB Persistant Disk, NIC bandwidth 32Gbps, $3.10 cost per hour dedicated access, Rocky Linux 8.5, Linux version 4.18.0-240.22.1.el8_​3.crt6.x86_​64,Single-cell RNA-seq of 1.3 Million Mouse Brain Cells using SCANPY 1.8.1 Toolkit, score= 489.1 seconds to compete, total cost to complete $0.42

New -2:

Testing as of Jan 20th 2022. 1-node, 1x Next Gen Intel Xeon Scalable processor (codenamed Sapphire Rapids, > 40 cores) on Intel pre-production platform with 512 GB DDR memory (8(1DPC)/64GB/4800 MT/s), HT on, Turbo on, CentOS Linux 8.3, internal pre-production bios, Single-cell RNA-seq of 1.3 Million Mouse Brain Cells using SCANPY 1.8.1 Toolkit, score= 370.2 seconds to compete.