- Mar 3, 2017
- 1,747
- 6,598
- 136
You are asking a code monkey who dropped out of the 42 main program coz I couldn't figure out in one month how to write my own secure malloc() function that passed strict Valgrind checksOkay. Elaborate. What kind of optimizations do you propose? Be specific.
You are asking a code monkey who dropped out of the 42 main program coz I couldn't figure out in one month how to write my own secure malloc() function that passed strict Valgrind checks
I have no clue TBH. But my brain says it's possible.
Here's one idea:What sorts of optimizations are you proposing?
With how starved the uncore / memory of Granite Ridge is i'm gonna go out on a limb and say 19% IPC for Turin (at 5600MT/s).What will be the rough average IPC uplift for Turin?
I predict ~17%
Here's one idea:
View attachment 103997
Suppose a developer has done extensive profiling for Zen 4 and made changes to his application so that when Zen 4 is detected, he uses specific hot functions that matter a lot to his application's core performance.
Now suppose he uses the same tool to profile Zen 5 and sees some big differences. Some of his assumptions about Zen 4 no longer hold true with Zen 5. So he creates specific functions for Zen 5 again to make sure that his application can get the most out of the new architecture.
This isn't something unheard of in the software world. Yes, most monkey programmers won't go to all this trouble. But Linux gurus, benchmark writers, game engine developers and authors of widely used software like 7-zip and WinRAR may do that. Can't say anything about the latter since it's closed source but maybe someone can look at 7-zip source and see if there are architecture specific optimizations in that.
Hackers love to hack. If someone like that thinks there is more performance to be squeezed out of a new architecture, you bet they will love to tackle that challenge. Because that's the real fun of hacking. The feeling of satisfaction when you crack a hard problem.
Don't forget John Carmack (his personal pet projects that he doesn't release to the world), Tim Sweeney, all the engine developers of AAA studios, Adobe and other workstation software developers.But with its weird front end and big FPU I do expect hand-tuned assembly could be much better. But no one outside of Oak Ridge and Lawrence Livermore will even consider that.
Good point. Yeah, twelve channels of DDR5-8800 could really make that baby rock!With how starved the uncore / memory of Granite Ridge is i'm gonna go out on a limb and say 19% IPC for Turin (at 5600MT/s).
It's already starved on Raphael and I can only see the gap widening between Desktop and Epyc.
I don't know but it is possible. Epic/Rad did write extremely optimized Zen 2 specific code for Unreal. They even had an article about it but I can't find it now.Don't forget John Carmack (his personal pet projects that he doesn't release to the world), Tim Sweeney, all the engine developers of AAA studios, Adobe and other workstation software developers.
Don't forget about Frostbite engine's former lead dev Johan Andersson who spearheaded the Mantle work with AMD all those years ago.Don't forget John Carmack (his personal pet projects that he doesn't release to the world), Tim Sweeney, all the engine developers of AAA studios, Adobe and other workstation software developers.
Twisting the knife, are you?And remember, you weren't just talking about optimized assembly - which is a thing but not necessarily a highly profitable one most of the time - but improvements just from recompiling with Zen5 specific compiler support.
Twisting the knife, are you?
Clearly he’ll need to add some nested if statements and nested for loops to take advantage of the 2-ahead branch predictorOkay. Elaborate. What kind of optimizations do you propose? Be specific.
Thank you!Clearly he’ll need to add some nested if statements and nested for loops to take advantage of the 2-ahead branch predictor
AI seems to be the new black in that score.What can I say? "It'll work great with our magic future compiler" is a pet peeve of mine.
For some reason.
AI seems to be the new black in that score.
"Oh ye eventually AI will just recode C into highly optimised assembly"
Or "AI will just translate OpenGL backends into optimised Vulkan"
And the reality....."And then a sorcerer is going to come and wake a princess from her eternal sleep and lead us all to a wonderful paradise and..."
"And then a sorcerer is going to come and wake a princess from her eternal sleep and lead us all to a wonderful paradise and..."
AI is only good for one thingAnd the reality.....
For contributing to the discussion. What else?For... what?
For contributing to the discussion. What else?
Zenbook S16? I really wonder why there were no reviews today.Got my laptop today, so far very happy with it. It's the 365/24G model from Bestbuy.