It doesn't seem like anyone here has any kind of inside information on the details of the negotiations or any contracts involved in bringing these consoles to market. So why then then do some complain about a lack of evidence disparaging Intel's ability to provide a console class GPU - taken by...
Yea I understand the ISAs in question fairly well, as I've had to work with SSE2 assembly and intrinsics (and I've glanced at the instructions in AVX, nothing in depth though). What I don't get is where those large multipliers come from, but it's OT anyway so forgive the distraction.
I don't know why you bring up x87 when you mentioned SSE2 vs AVX2. I'm still waiting to hear the cases where AVX2 can give you a 20x speedup over General purpose x86 and 8x over SSE2.
I was under the impression that AMD wasn't planning on launching any "next generation" parts for now and that the 7000 series will be "stable" throughout 2013. According to techreport the 7790 which was recently released is still based on the same architecture as the rest of the 7000 series...
Reminds me a lot of this:
http://www.xbitlabs.com/news/multimedia/display/20130319235900_Sales_of_PlayStation_4_Xbox_Next_to_Hit_Around_5_Million_in_Total_in_2013.html
The link from above article...
As with any new instruction set, adoption is what is important. Even if processors with AVX2 sell well, they will be dwarfed by the number of pre-existing processors which do not support it and likewise software support will lag. So it will take time for AVX2 to show returns. The advantage to...
Microarchitectures have fixed (probably) internal 'instruction sets' which are essentially architecture specific languages that expose all the functionality of the hardware as they were designed. So if you designed an ALU that can add two registers you would have some internal instruction to...
Well it seems you aren't grasping the fundamental concepts of my argument. Let me say it one last time:
There are numerous flaws in what your argument:-
1. You are speaking in absolutes; you use a set of benchmarks to represent all program existing and yet written but IPC and "CMT tax" are...
What does cinebench (or itunes for that matter) have to do with any game? In fact what does any benchmark other than the actual games themselves have to do with the games?
Just out of curiosity, how do you know the kinds of instructions used in games?
Anyway, the bolded parts are where we...
Once again you are assuming things you shouldn't; IPC is absolutely worse in all workloads wrt K10 and the "CMT penalty" is significant for all workloads.
Turbo and better scheduling could be one explanation why the 4100 performs the way it does (I don't think its faster btw). During the...
You're the one demonstrating a lack of understanding of IPC. It's mostly a function of the platform architecture as well as the code being executed. Key point here being that the other programs you are using to talk about the supposed performance of games are exactly that. Other programs.
It...
I'm pretty surprised at your response here. You responded to what concisely equates to 'each case must be judged based on the circumstances' with this:
The interpretation of the law has always had room for subjectivity and there is rarely any "black-and-white". It is a natural result of the...
One thing has become clear to me; they really need to thoroughly explain how and what it is they are measuring. Whatever statistical method they are using is unfamiliar to me.
After looking at the values from the chart and the resulting computations on this page http://www.tomshardware.com/reviews/gaming-processor-frame-rate-performance,3427-2.html I'm not sure how they compute whatever 'average difference between consecutive frames' means. It seems at odds with what...
The thing is you are now using 6 ms in a way that is different from what you stated. 6 ms would be the actual frame time equivalent to ~167 fps, but a 6 ms difference between successive frames which is what you implied when you said
would mean that the next frame takes 6 ms more or 6 ms less...
That still makes no sense, using the differences between successive frames does not give you a meaningful and consistent picture of variability. You could satisfy the first few of the above differences with these absolute frame times for example:
10 ms/100 fps, 16 ms/62.5 fps, 22 ms/~45 fps...
Unless I've misinterpreted what "the average time difference between consecutive frames" means, I think their method serves as a poor proxy for the variability in frame times and I'll try and show you why:
Consider a series of frames with the first frame taking 13 ms to render and each...
The latency graphs depict the average, 75th and 95th percentile differences in consecutive frame times. This is more a measure of consistency than a measure of the absolute values of the frame times which is what techreport does. I think it is a poorer metric of experience.
The same article shows the task energy for that benchmark, even though it doesn't account for idle a rough estimate from the graph shows that the 8350 uses a little over 50% more power while finishing the benchmark ~17% faster.
Firstly, IPC is typically measured for a single thread. It isn't a...
They aren't making the decoder larger as far as I know, they are putting two independent decoders in each module. So the threads wont have to share decode bandwidth.
Show me where the 8350 outperforms the i7-3820 enough to negate its higher power.
The premise of CMT was to duplicate the resources that would have caused excessive contention and share the remaining resources as much as possible to reduce area and power consumption. More importantly, those...
I've already said that the additional decoder should only help in multi-threaded scenarios, unless they improved each decoder.
We mostly agree. CMT inherently reduces IPC because less resources are available to each thread at any time. However, I don't think the concept of CMT is...
Comparing TDPs is disingenuous, where are the real world power numbers? What about single threaded performance/watt or area? What about games and other less multi-threaded applications?
Also, why not compare it to an i7-3770k? Why does it matter, for comparison purposes or to a consumer...
Yet they've now decided to provide two decoders. Does that seem like a validation of their previous strategy or a reneging?
Except module sharing has a demonstrably non-trivial effect on per core throughput (some 18% if we are to go by IDCs numbers). Dedicated execution resources that become...
Would the marginal increase in utilization of execution resources outweigh the marginal increase in power and area considerations for adding an additional decoder for each module? Is it more efficient, as you are suggesting, from a perf/power perspective to have execution resources available...
Good points.
I imagine HSA is not just limited to end developers but also platform devs. I'm not sure of the ratio of native to interpreted software but general trends indicate (to me) the desire for improved performance. This should create pressure for native code (how much and the efficacy...
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.