@karolherbst @bashbaug you can unfuse fma with a macro: https://github.com/ProjectPhysX/OpenCL-Benchmark/blob/master/src/opencl.hpp#L287
ARM GPUs need that, and Nvidia CMP 170HX mining GPU too as it has fma disabled through hardware or firmware.
Dr. Moritz Lehmann
@projectphysx@mast.hpc.social
Posts
-
I was looking at the phoronix benchmarks and why the ProjectPhysX "fp32" result is bad.