fixed point was a mistake

eniko@mastodon.gamedev.place

@natty @lina my CPU is a Ryzen 5 5600g?

slyecho@mdon.ee

@eniko mutex/locking/semaphores?

snowfox@tech.lgbt

@eniko (Highly speculative, since FPUs are pretty good these days:) If the CPU has hyperthreading: Maybe two fixed-point threads share ALUs better than two floating-point threads share FPUs?

You might even find that combined fixed and float makes better overall use of a modern CPU, provided you can do both on a single thread or jump through the hoops to get both threads scheduled on the same core.

lina@vt.social

@eniko That doesn't exist... Ryzen 5 or different model number?

If it's the 5 5600G then that's 6 cores, so with 4 threads you shouldn't have HT effects as long as the OS scheduler isn't dumb about it...

carbonacat@mastodon.social

@eniko these two messages are my constant bistable state about fixed points

eniko@mastodon.gamedev.place

@lina er yeah 5 sorry

eniko@mastodon.gamedev.place

@slyecho wouldn't that make it slower, not faster?

slyecho@mdon.ee

@eniko one would assume 4 times as fast with four threads, not 30% faster. But I don’t know exactly what the code is doing without seeing it

timotimo@peoplemaking.games

@eniko do you already have experience with the kind of profiler that lets you get performance counter values?

on linux my go-to first step is perf stat -d ./myprogram (-d for details gives a couple more numbers. gotta have numbers!) then you'll see a few numbers that may point at a drastic difference.

I'm thinking a higher instruction per cycle number probably means fewer instructions that take many cycles (though I hear integer division is much better nowadays?), or your cache hit rate for data or instruction cache may be a lot better, or maybe your code ends up with fewer total instructions for some reason?

oblomov@sociale.network

@eniko @lina so when running on N threads the old code did X triangles per second and the new code does 2X? How do the respective codes scale with number of threads?

eniko@mastodon.gamedev.place

@timotimo I'm incredibly new at running benchmarks at this level so I don't really know what that is

eniko@mastodon.gamedev.place

@slyecho the 30% improvement was over the same implementation with floating point

eniko@mastodon.gamedev.place

@slyecho as in threaded flat color random triangles with fixed point is 2x as fast as threaded flat color random triangles with floating point

eniko@mastodon.gamedev.place

To be clear, everything improved 0-35% from the previous implementation that used floating point after switching to fixed point

So the current threaded random triangles with flat color metric (fixed point) is 2x as fast as the previous threaded random triangles with flat color metric (floating point)

gabrielesvelto@mas.to

@eniko from my experience the biggest upside of using fixed-point in rasterization is that you get exact sub-pixel precision with as many bits as you like (or need), and it doesn't depend on how far away from the origin you are. That alone would be worth it even without performance improvements

eniko@mastodon.gamedev.place

@gabrielesvelto also helps if you wanna run it on really old CPUs >_>

eniko@mastodon.gamedev.place

@gabrielesvelto also to be clear getting a +100% performance boost is *good* I'm just having a hard time it's not a benchmarking bug. But if it is a bug I sure can't find it, and the threading code is only 300 lines so it's not like there's a lot of places it could be hiding

gabrielesvelto@mas.to

@eniko BTW are you using only scalar math or are you leveraging SIMD extensions? IIUC one of the advantages of fixed-point math is that you could implement some stuff on x86 even with the oldest, crustiest SIMD stuff (hello MMX!) and get at least some benefits

etc@toot.wales

@eniko @midnaw if you stretch the definition a little, things like DateTime and TimeStamp in c# are fixed point… they have an underlying int representation counting timer ticks, and a ratio value that converts ticks to human-friendly units.

midnaw@idtech.space

@etc @eniko oh i hadn't thought about that

Piero Bosio Social Web Site Personale

fixed point was a mistake

Feed RSS

Gli ultimi otto messaggi ricevuti dalla Federazione

Post suggeriti

Tomorrow will be my predicted birthday.

I wrote yery late instead of very late and it didn't sound wrong tbh

@andrea_ferrero per cortesia una curiosità.

📩 Leggi e diffondi la lettera aperta contro il programma di verifica degli sviluppatori Android