I have to assume here that avx code runs faster than sse2 on piledriver. But that may not be the case in all avx<> sse2 situations and then it is understandable.
The original Bulldozer (8150) had bugs with AVX that made it not worth pursuing as far as I know. But, I haven't seen the same thing said about Piledriver (not with any data to back it up). However, it may be that AVX is still worse than other options in Piledriver such as FMA3,
FMA4, and
XOP.
The key here may be to utilize the best-performing bits in Piledriver instead of just using whatever Intel's CPUs support as if that's all there is. I assume that the reason The Stilt's SIMD build runs so well on Piledriver is because it takes advantage of things other than AVX, like FMA4 and/or XOP. A smart compiler will get the instructions that offer the best performance to run rather than inferior ones.
Let say we have a piledriver based opteron. It supports AVX, i do not know how fast and well it runs, but it works.
PD also supports FMA3 (which Intel does as well), FMA4 (which only Piledriver/Steamroller/Excavator support as far as I know), and XOP (another AMD-only thing).
All this nonsense is because Intel apparently pulled a fast one on AMD and made a moving target out what was supposed to be SSE5. Instead of getting an industry standard SSE5 we got a mess. Read Agner Fog's blog about it. He knows more about it than I do.
Because we cannot ask from Intel to do validation of machine code generation of their ICC compiler for a cpu from another company
Wrong. If a company sells a piece of software that produces binaries that run on others' products that piece of software needs to get rid of support altogether or provide it. Think about all the other examples. Apple sold printers at one time. Did it write versions of Mac OS that caused 3rd party printers to run much much slower than they should? Apple also sold monitors. Did they write the Mac OS so that only their monitors would display a good-quality image?
And then we come to general software, like word processors. If a software company produces a piece of software and also sells printers, scanners, and other pieces of hardware should that piece of software not run correctly on anything but that company's hardware? Mass market products have to be compatible. If a company is going to be brazen enough to create a walled garden then it needs to explicitly drop support. In the case of a compiler it would fail to run without GenuineIntel.
Or we can have word processors that only print on one company's printers and nonsense like that. x86 is a standard that Intel licensed to AMD. It is not Intel's little baby. It is a licensed cross-company standard.
A product is either supported or it isn't. If your compiler produces binaries that run on AMD then you have the responsibility to make sure they leverage the best-performing instructions. People are trying to argue having one's cake and eating it, too. Intel gets to sell an expensive "industry standard" compiler that produces "AMD-compatible" binaries but which cripple performance on AMD CPUs just because AMD is a competitor (even though it has a license for x86)? No. If you sell a product to a customer you owe them explicit support for anything that seems supported. You don't pull a fast one on them. At the very least Intel could have asked AMD which instruction set the compiler should tell the CPU to run.
this choice will most likely present reduced performance for AMD or VIA processors in comparison to Intel cpus.
If a compiler instructs a CPU to run instructions it's up to the CPU vendor to make those instructions run well. AMD wouldn't have had things like FMA4 and XOP if not for Intel's nonsense regarding SSE5. We would have a single SSE5 standard.
It's also up to the compiler vendor to make sure the best-performing instructions are used.
And then there is also SSE4.2. But again, i do not know how well it runs on piledriver cpus. But it is available. Even on the AMD jaguar cpu. Might come in handy for games
Take a look at how much better the SIMD build for Blender runs the Ryzen demo — on Intel and AMD CPUs. Games are definitely not the only thing that can benefit from improved compilation efficiency.