Intel® Ethernet 800 Series
Linux Performance Tuning Guide
Application Settings
Often a single thread (which corresponds to a single network queue) is not sufficient to achieve maximum bandwidth. Experiment with increasing the number of threads used by your application if possible.
Consider using tools like taskset or numactl to pin applications to the NUMA node or CPU cores local to the network device. For some workloads such as storage I/O, moving the application to a non-local node provides benefit.