So I do a lot of BOINC processing, and with a particular application on the GPUGRID project, I'm seeing wildly different CPU utilizations that seems to be related to the Intel vs AMD platforms. The same behavior has been confirmed with lower end AMD (3000/5000-series) consumer parts also.
The application is their Python GPU application, diverges from conventional BOINC GPU tasks in that it's a multithreaded CPU app with CUDA support, instead of a pure GPU application. each task will spawn 32 threads (simulations) plus the main application process. see info about what the application is doing here if interested: https://gpugrid.net/forum_thread.php?id=5233
now when running this application on my Xeon E5-2697Av4 (16c/32t) system (Linux, Ubuntu 22.04, 5,15 kernel), the total amount of CPU used is rather low. about 1-2 threads used per process (100-200% process CPU %). but i seem to start hitting OS scheduler bottlenecks as attempting to scale this up hits diminishing returns with too many concurrent processes to be serviced by only 32 threads, even with low total utilization. I saw the same low CPU use behavior on an even older Xeon E5-1680v2.
So I moved the same GPU and software over to an AMD EPYC platform, with a 7443P to get more threads, and hopefully better IPC. but what I found was puzzling. the AMD system used WAY more CPU to do the same work as the old Xeon. using roughly 5x the CPU support per process (500-700%), negating any improvement by having more threads, and the tasks did not really process any faster despite the higher CPU use.
so my question, is there some kind of lesser known AI acceleration present on the intel Xeon CPUs? I know the 3rd gen Scalable Xeons like to taught their built in AI acceleration, but I can't find any materials that talks about anything like this on the older v2/v3/v4 Xeons.
or are there any other kinds of hardware accelerators at play that are present in even these older Xeons but not the more modern AMD EPYC and Ryzen CPUs? or maybe even some kind of kernel/software thing?
I did try running the 6.0 linux kernel when it first came out, as there was all that buzz about some 20 year old bug limiting performance for AMD, but that seemed to have no effect here and i still saw high CPU use on AMD.
CPU use on Intel Xeon E5-2697Av4:
CPU use on AMD EPYC 7443P:
The application is their Python GPU application, diverges from conventional BOINC GPU tasks in that it's a multithreaded CPU app with CUDA support, instead of a pure GPU application. each task will spawn 32 threads (simulations) plus the main application process. see info about what the application is doing here if interested: https://gpugrid.net/forum_thread.php?id=5233
now when running this application on my Xeon E5-2697Av4 (16c/32t) system (Linux, Ubuntu 22.04, 5,15 kernel), the total amount of CPU used is rather low. about 1-2 threads used per process (100-200% process CPU %). but i seem to start hitting OS scheduler bottlenecks as attempting to scale this up hits diminishing returns with too many concurrent processes to be serviced by only 32 threads, even with low total utilization. I saw the same low CPU use behavior on an even older Xeon E5-1680v2.
So I moved the same GPU and software over to an AMD EPYC platform, with a 7443P to get more threads, and hopefully better IPC. but what I found was puzzling. the AMD system used WAY more CPU to do the same work as the old Xeon. using roughly 5x the CPU support per process (500-700%), negating any improvement by having more threads, and the tasks did not really process any faster despite the higher CPU use.
so my question, is there some kind of lesser known AI acceleration present on the intel Xeon CPUs? I know the 3rd gen Scalable Xeons like to taught their built in AI acceleration, but I can't find any materials that talks about anything like this on the older v2/v3/v4 Xeons.
or are there any other kinds of hardware accelerators at play that are present in even these older Xeons but not the more modern AMD EPYC and Ryzen CPUs? or maybe even some kind of kernel/software thing?
I did try running the 6.0 linux kernel when it first came out, as there was all that buzz about some 20 year old bug limiting performance for AMD, but that seemed to have no effect here and i still saw high CPU use on AMD.
CPU use on Intel Xeon E5-2697Av4:

CPU use on AMD EPYC 7443P:
