Yes, bring on the dual 128-bit FPUs, again (so their big 4-module CPU can compete with an i3). And, AFAIK, we're still very much in the dark as to how they are implementing non-FP AVX2. Intel's IGP is also already almost at par.this isnt necessarily about amd v. intel, rather apu v cpu, also if we are talking about the future, there is always kaveri that has steamroller cores with avx2 support and gcn core with massive ocl perf boost over vliw4...so lets keep the flame baiting to a minimum, please.
Intel's IGP is also already almost at par.
It's been a while since I've used MediaEspresso, but isn't "OpenCL" just their very poorly named option for using AMD's hardware encoder? I'm almost positive turning on OpenCL is actually turning on VCE, and turning on CUDA turns on NVENC for NVIDIA cards.found theses benchmarks
you can see how the a10-6800k performs in a straight up cpu benchmark, not bad. The Intel parts do out perform it.
now if we look at the benchmark with the a10 6.8k having enabled avx and opencl acceleration, it outperforms the i7-3960x -in cpu transcoding.
This has me thinking that maybe we [by that I mean I] dont fully comprehend what a fully hsa compliant apu is capable of. Although it wont matter if devs dont take advantage of this.
I havent found any other benchmarks like this so its pretty much a cherrypick. In any case what do you think are the implications of HSA?
It's been a while since I've used MediaEspresso, but isn't "OpenCL" just their very poorly named option for using AMD's hardware encoder? I'm almost positive turning on OpenCL is actually turning on VCE, and turning on CUDA turns on NVENC for NVIDIA cards.
In which case that's not an example of HSA nor GPU compute; rather it's an example of using a dedicated DSP.
makes no mention of the fixed function encoders(also note that the fusion e&c-series dont have vce) It also specifically mentions amd app and nvidia cuda.MediaEspresso 6.5's support for AMD® Accelerated Parallel Processing (APP) and Fusion E-series & C-series Accelerated Processors technologies lets users leverage advanced hardware and software technologies that enable AMD graphics processors (GPU), working in concert with the system's central processor (CPU), to accelerate the video conversion process. This ensures more balanced system performance for faster handling of HD video transcoding.
In conjunction with AMD APP technology, MediaEspresso 6.5 accelerates the conversion of standard and HD video into multiple formats for use on various consumer electronics devices. Support is also available for UVD (Unified Video Decoder), a video decoding unit of AMD APP that supports the hardware decode of H.264 and VC-1 video codec, enabling MediaEspresso 6.5 to provide quick output of video content for playback on PSP, iPad, iPod, and iPhone platforms.
Super Performance
Intel Quick Sync Video enables MediaEspresso 6.5 to improve overall performance including encoding, previewing and simultaneous conversion of multiple video file formats.
The graph on the right shows the performance of MediaEspresso 6.5 in converting HD video content for output to a Sony PS3 game console using a 2nd generation Intel Core i7 processor with Intel Quick Sync Video technology compared to the same conversion using the previous generation Intel hardware. Optimizations for the new hardware within MediaEspresso 6 result in significantly shorter video rendering times.
Dunno why you have to bash it so badly. You'll get OCL in virtually any device from now on and HSA in almost any non Intel/Nvidia stuff. As far as I got it, HSA is only a way to speed up even further OpenCL and the likes.
The only reason why HSA enabled products will be faster than OpenCL is the unified memory feature. The rest of HSA is just AMD's latest promise to provide a decent programming framework.
As for unified memory, I've already seen a few rumors that it's something Broadwell also supports. So sure AMD might beat into to the punch with Kaveri, but that doesn't mean it's going to be the superior solution.
Without going too far into the gory details here, Cyberlink doesn't actually implement any of this stuff themselves (i.e. they haven't written any OpenCL/CUDA code for Espresso). What they do is call the appropriate video encode API the video drivers provide, and pass off the bulk of the work in that manner.not so sure about that
makes no mention of the fixed function encoders(also note that the fusion e&c-series dont have vce) It also specifically mentions amd app and nvidia cuda.
also note that the only hardware encoder it support is intel quicksync.
http://www.cyberlink.com/products/mediaespresso/overview_en_US.html?&r=1
There are compute tasks where GPUs are faster than CPUs. That's not something we don't understand the implications of. If that weren't well established by now there wouldn't have ever been a compute hardware market.
Putting aside that encoding is a bad use case to highlight the benefits of GPU compute, the real question is if memory unification and hardware with tight integration for faster communication (IGPs) is enabling any real big compute benefits. This benchmark doesn't do anything to highlight that.
stop focusing on intel v. amd, we are disqusing the current, possible and unseen advantages of hsa in an apu v. cpu.
do you have any idea how exactly unified address space will be advantageous?
It's a bit more complex than that. Although shared memory is one of the big features necessary to enable HSA, the other "huge" feature for HSA is that HSA compliant GPUs will have to offer fast context switching. Currently GPUs are very slow at context switching (with all the registers and cache there's several MB of context to save). Full HSA will bring with it a new iteration of AMD's GCN design that will support fast context switching, which in turn will hopefully allow workloads that previously weren't possible to efficiently process due to the slow context switching.The only reason why HSA enabled products will be faster than OpenCL is the unified memory feature. The rest of HSA is just AMD's latest promise to provide a decent programming framework.
Come on, with your known support of a particular company, you had to know it was going to turn into AMD vs Intel debate, especially since you were touting how a cheap AMD processor was faster than a 1000.00 intel processor.
now if we look at the benchmark with the a10 6.8k having enabled avx and opencl acceleration, it outperforms the i7-3960x -in cpu transcoding.
This has me thinking that maybe we [by that I mean I] dont fully comprehend what a fully hsa compliant apu is capable of. Although it wont matter if devs dont take advantage of this.
I havent found any other benchmarks like this so its pretty much a cherrypick. In any case what do you think are the implications of HSA?
It's a bit more complex than that. Although shared memory is one of the big features necessary to enable HSA, the other "huge" feature for HSA is that HSA compliant GPUs will have to offer fast context switching. Currently GPUs are very slow at context switching (with all the registers and cache there's several MB of context to save). Full HSA will bring with it a new iteration of AMD's GCN design that will support fast context switching, which in turn will hopefully allow workloads that previously weren't possible to efficiently process due to the slow context switching.
An AMD CPU with integrated IGP is an APU.stop focusing on intel v. amd, we are disqusing the current, possible and unseen advantages of hsa in an apu v. cpu.
To work with the same memory in any practical capacity, the CPU or GPU must be able to get exclusive access to a cache line. To do so with dissimilar address spaces requires translation between the address spaces, which may have very different semantics for cache coherency (often mostly-software on the GPU side), different cache line sizes, if any concept of that at all, and different virtual memory management, if any at all, is going to be complicated, to such a point that right-minded people will just avoid it. More generally, just treating everything as non-sharable, if it is writable, such that buffers get copied back and forth a lot, which can result in being slower than just doing it all on the CPU.do you have any idea how exactly unified address space will be advantageous?
You would need an HSA compliant CPU and an HSA compliant dGPU. If AMD doesn't roll out a stand-alone Steamroller product, then I'm not sure if we'll ever see the former. In which case you'd in theory be able to use a dGPU + AMD APU as part of an HSA setup, but the big question about the potential performance hit (due to their vast distance apart) remains unanswered.So I dont completely understand this. Will HSA work with a discrete card? If so will it work with an intel chip or FX, or will it have to be paired with an APU? And if a powerful discrete card is paired with an APU and the IGP is not being used, will HSA still work?
stop focusing on intel v. amd, we are disqusing the current, possible and unseen advantages of hsa in an apu v. cpu.
do you have any idea how exactly unified address space will be advantageous?
found theses benchmarks
you can see how the a10-6800k performs in a straight up cpu benchmark, not bad. The Intel parts do out perform it.
now if we look at the benchmark with the a10 6.8k having enabled avx and opencl acceleration, it outperforms the i7-3960x -in cpu transcoding.
This has me thinking that maybe we [by that I mean I] dont fully comprehend what a fully hsa compliant apu is capable of. Although it wont matter if devs dont take advantage of this.
I havent found any other benchmarks like this so its pretty much a cherrypick. In any case what do you think are the implications of HSA?
OpenCL performance is stellar with the new Haswell chips.
http://www.tomshardware.com/reviews/core-i7-4770k-haswell-review,3521-3.html
Dunno why you have to bash it so badly. You'll get OCL in virtually any device from now on and HSA in almost any non Intel/Nvidia stuff. As far as I got it, HSA is only a way to speed up even further OpenCL and the likes.
Adobe is a major developer collaborating with AMD and supporting OpenCL acceleration. With a great number of new Kaveri mobiles and desktops about to ship, they signal what will be a flood of broadbased HSA/OpenCL app support.
http://www.techpowerup.com/185977/a...th-adobe-photoshop-cc-and-premier-pro-cc.html