Cursed project end of day 1: clinfo runs
-
I guess that's uhm... well... I guess?
oh shoo.. I'm like 10% slower than Nvidia's implementation 🙃
-
oh shoo.. I'm like 10% slower than Nvidia's implementation 🙃
you all don't want to know the most cursed part about this 🙃
-
you all don't want to know the most cursed part about this 🙃
@karolherbst we do actually, and you know we do 8-D
-
yeah soo.. is it day 5? I think it's day 5. Which is a bit weird because it feels like day 4. But maybe also because I started like almost at the end of the day? Maybe that doesn't count? Does it even matter? No, but anyway....
Status at the end of day 5:
Pass 3577 Fails 179 Crashes 129
What's missing?
- Image support
- Atomics
- Scratch
- Some math is failing validation.
- buffer synchronization issues, still.. I honestly don't know.
- optional gallium/nir stuff
Pass 3769 Fails 112 Crashes 4 Timeouts 0
Something something atomics... and a few other random things, should be like 3 or 4 bugs in total...
-
Pass 3769 Fails 112 Crashes 4 Timeouts 0
Something something atomics... and a few other random things, should be like 3 or 4 bugs in total...
okay... I found one of the atomic bugs.. it's when two kernels are launched back to back and apparently they can interact weirdly with each other. When I force a flush+wait between them those fails go away... curious
-
okay... I found one of the atomic bugs.. it's when two kernels are launched back to back and apparently they can interact weirdly with each other. When I force a flush+wait between them those fails go away... curious
or maybe it's an ordering issue? mhh
-
or maybe it's an ordering issue? mhh
@karolherbst how would two kernels affect each other, unless they stomp on each other's memory? Did the hardware support concurrent kernel execution?
-
@karolherbst how would two kernels affect each other, unless they stomp on each other's memory? Did the hardware support concurrent kernel execution?
@oblomov either that or just executed in a different order
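(A toy sketch of the "executed in a different order" guess, not anything taken from the actual driver: the names `WAIT`, `execute`, `kernel_a` and `kernel_b` are made up for illustration, and reversing each batch is just an assumed worst case. It only shows why a forced flush+wait between two back-to-back kernels could hide an ordering bug.)

```python
# Toy model only: WAIT stands in for a forced flush + wait between launches.
WAIT = object()

def execute(queue, state):
    """Run queued kernels; inside one batch the order is NOT guaranteed."""
    batch = []
    for cmd in queue + [WAIT]:
        if cmd is WAIT:
            # Assumed worst case: everything queued since the last wait
            # runs in reverse submission order.
            for kernel in reversed(batch):
                kernel(state)
            batch = []
        else:
            batch.append(cmd)
    return state

def kernel_a(state):
    # Stands in for 64 work-items each doing one atomic increment.
    for _ in range(64):
        state["counter"] += 1

def kernel_b(state):
    # Reads what kernel_a was supposed to have produced already.
    state["result"] = state["counter"]

def fresh():
    return {"counter": 0, "result": None}

unsynced = execute([kernel_a, kernel_b], fresh())
synced = execute([kernel_a, WAIT, kernel_b], fresh())
print(unsynced["result"], synced["result"])
```

In this model the unsynced run lets `kernel_b` read the counter before `kernel_a` has touched it, while the run with `WAIT` in between sees all 64 increments.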
-
or maybe it's an ordering issue? mhh
Pass 3871 Fails 10 Crashes 4 Timeouts 0
I think that's good enough for an initial MR 🙃
-
Pass 3871 Fails 10 Crashes 4 Timeouts 0
I think that's good enough for an initial MR 🙃
Here it is: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37831
Hopefully that's not too much of a shock.
I don't have a blog, so my MR is the blog post I guess?
-
@oblomov either that or just executed in a different order
@karolherbst there's hardware whose queues are OOO?
-
Here it is: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37831
Hopefully that's not too much of a shock.
I don't have a blog, so my MR is the blog post I guess?
@karolherbst interesting! Pocl does something similar too, to the same effect, BTW.
And if you can fix typos, there's a cl_gl_shring that needs an a ;-)