http://news.softpedia.com/news/AMD-...each-Retail-Until-Q3-2011-Report-202051.shtml
Anybody know if the delay is true or not?! I'm crying here
Anybody know if the delay is true or not?! I'm crying here
I believe we will not see more than 3.2GHz (base frequency) for an 8 Core BD (16MB caches) at 95W TDP, at least in the beginning.
8core means 8 cores.I didn't read this entire thread, but do the leaked prices of say the 8core BD refer to BD modules or actual cores (ex 2 cores per module). If the BD was 8-module.... that would be interesting.
Well maybe Intel and AMD do have such great tools that allow them to throw the design and constraints into it and get the perfect result back, but in my experience (well not my personal expertise - sure I had the courses but I just say what I've seen and heard from people doing these things for a living) getting optimal clock trees and co takes enough tweaking. Sure the tools are essential but you can't just throw the design at them and get the perfect result back.I'm nuts for other reasons sure, but not that one. Especially when it comes to high performance blocks that are not just straight synthesized logic, any duplication does amount to a copy/paste. Yes, after that's done the automated tools for signal wire routing and timings come into play, but that doesn't negate the starting point. It's far easier than having to design a high performance block that's twice the size.
http://cpu.zol.com.cn/231/2310866.html
translate: http://translate.google.com/translate?js=n&prev=_t&hl=en&ie=UTF-8&layout=2&eotf=1&sl=zh-CN&tl=en&u=http%3A%2F%2Fcpu.zol.com.cn%2F231%2F2310866.html
http://detail.zol.com.cn/picture_index_647/index6469206.shtml
http://detail.zol.com.cn/picture_index_647/index6469207.shtml
ehm, Bulldozer Launch in September ??
ehm, Bulldozer Launch in September ??
Looks to me like launch dates for specific models 8150 (no "P" at 125W TDP?), 8100, 6100, 4100, while models 8130P, 8110, 6110, 4110 might already be launched then. Could also be a fake again.
In the same sense we might cry about Intel delaying i7-2700K.
Looks to me like launch dates for specific models 8150 (no "P" at 125W TDP?), 8100, 6100, 4100, while models 8130P, 8110, 6110, 4110 might already be launched then. Could also be a fake again.
When did AMD say Bulldozer would launch in June?
Why would this be necessarily so? If you were halve the total size/complexity of the logic, and just try to add enough I/O to handle the rest, sure. The reduced/consolidated set would not have linear savings, because what would be halved would be the total performance capability, which is clearly being wasted.I'm guessing that my statement was not properly understood, since that's the only explanation for that response. Let me attempt to rephrase in similar terms. If a fictional design in the intended workloads averaged 90% utilization of its integer resources and 45% utilization on everything else... Then it doesn't make sense to halve everything else in order to have a more balanced core design - sure it would increase the utilization of everything else, simply because there's less of it, heh. It'd also decrease integer resource utilization due to dependencies and drastically decrease performance (aka kill.)
Nor was I. So far, there's only what AMD has officially said, for BD, and what people working on it have said at various places.Eh, okay. I know I sure wasn't there in the high level architecture design meetings 5+ years ago when those decisions were made.
More like both, than neither, hence qualifying every CMT v. CMP+SMT bit as only being about integer.I'll pretend to be an academic for a moment and proclaim that AMD has invented SCMT! Or maybe even P-SCMT if they did the separate issue queues like hyper threading. After all, according to that paper, bulldozer is neither SMT nor CMT because of the shared FPU, quoting from page 2, "The primary difference between the P-SMT and CMT approaches is that the former assigns threads to execution units at issue time, while in the more highly partitioned CMT processor, this assignment is done at dispatch time by steering each instruction to a particular cluster." As for the actual performance and energy conclusions of that paper, it's unfortunate that they only compared various 16-thread designs, none of which are comparable to the processor's we're interested in.
What do know about my pixie dust?!Did Apple steal it again? Sorry, couldn't resist... Continuing with the rest.
Could AMD have made a CPU that ran anything any faster, without the shared front-end and FP (likely, the FP would be half width for each core, too)? It is quite probable that they couldn't, IMO, (or at least couldn't be highly confident that they could), and that this method gets them just as good single-threaded performance, at least as good multithreaded performance, and let's one thread hog enough FPU resources for two cores. Better single-threaded performance makes the assumption that their development budgets are not limited. At a certain level of difficulty (how many manhours? What are the chances it make our schedule slip again?), a feature likely has to get canned, and one easier to do within some time frame will get priority. I'm working on the assumption that AMD has very limited resources, and that potential risk and ROI affect every decision to a fair degree.Pretty sure I didn't say anything about such a design being more power hungry or having less threads... But you are correct in that I should have stated it as far higher potential single-threaded performance, which likely wouldn't be realized often at all. The point being that the same resources in an equivalent SMT configuration could hit the same multi-threaded performance while offering no constraints to single-threaded potential. It's just markedly more difficult to design an adequate scheduler of that width.
So, because there are double the execution units, the rest of the module must be double the space as if they had made it as a single core (with half the FP width)?Okay, it's a copy, reflect, and paste. At least that's what the die shot implies, and is the only sensible way to implement the design. (I did the same thing on a ALU layout for one of my VLSI courses back in college.) Doing it any other way vastly increases the amount of back-end work necessary for no purpose. Oh, and guess I should have been more specific that I'm talking in terms of design implementation.
That's like saying the sky isn't blue, unless you look between the clouds.My claim of nonsense and reference back to point number 1 was that there's no merit of 'CMT' that increases its potential multi-threaded performance in comparison to SMT unless you give it more execution units to work with.
SMT, used with efficient code, guarantees that each thread will run slower, even if the total is faster (which is not always the case, though it's not much of an issue on Nehalem and newer). If response time matters, SMT will get to be worse, as your code gets better. If you consider service time per thread always to be secondary, then you're probably not going to like anything AMD comes up with, because they have made a point of this being a way to differentiate their products. If idle execution units, due to low IPC code, are your main problem, SMT is by far the best way to fix that problem, short of ditching deep pipelines (which, for x86, is not likely to net as much gain as higher speeds).Now the statement of SMT only providing an increase in performance with inefficient code would be correct, but it's still quite 'effective' at keeping execution units busy when running efficient code. On the flip-side, depending upon its implementation, a 'CMT' design could easily find inefficient code resulting in idle execution units - everything available thus far implies that this could well be the case with bulldozer.
That not having unneeded redundancy should be part of the design, where this redundancy is defined by what is well or poorly used when multiple threads are active. Then, that it would be used, where possible, to facilitate those other improvements, including some that could take up more die space or xtors that might be allowed, if they'd gone normal CMP. If it is just done in a vacuum, relative to other features they could improve or add, just to save space/xtors/power, and not using any of those gains to improve overall performance, it will be a failure, because with only that, Intel's process advantage will still be too great. CMT v. just a CMP of the same, or just v. a SMT of the same execution resources, etc., is all to show what happens in a controlled environment, which can show merit, lack thereof, and what may be good directions to take, if there is merit. AMD still needs to replace the guts of the Athlon on steroids (STARS) with something far superior, that can be incrementally improved over the next decade, because they aren't dealing with such a controlled and contrived situation.Smaller and simpler does indeed run faster. But again, how does that turn into an advantage of 'CMT' vs SMT? Sure compared to a huge CMP there's an advantage. But all the rest is more a function of other design decisions rather than some superiority of 'CMT'.
They never said June.. but they said Q2 2011 for Desktop and Q3 for server parts.
Well maybe Intel and AMD do have such great tools that allow them to throw the design and constraints into it and get the perfect result back, but in my experience (well not my personal expertise - sure I had the courses but I just say what I've seen and heard from people doing these things for a living) getting optimal clock trees and co takes enough tweaking. Sure the tools are essential but you can't just throw the design at them and get the perfect result back.
But if you meant that you copy+paste the blocks into the design (well more increase some loop counts) and then still have to do the rest of the work, that we can agree on. And surely that makes it easier - just as always in software modular design has its advantages. It just sounded to me as if you advocated that you could just add some blocks, throw the new design into automated tools and get back with a production ready product.
I don't recall, did they say retail availability?
http://forums.anandtech.com/showpost.php?p=31759086&postcount=2440
Intel, OTOH, just needs to keep their management's head out of the clouds, and the rest more of less takes care of itself, for them .
When did AMD say Bulldozer would launch in June? I can only find one article from March which is a speculation (it says that AMD didn't comment the news story). Has this one FUD being taken for granted here since people are disappointed with the "delay"?
Do they ever? Neither Intel nor AMD speak to availability in the end-customer sense.
They will speak to terms such as "shipping for revenue", "shipping samples", and "launch" but I don't recall either one of them actually communicating the actionable information we all are seeking. (when can I order it from Newegg and have it not be on back-order when I place the order?)
That's my POVAhh yes, those are different models, thx
I'm sorry, but saying "we will launch in Q2" should only mean one thing.