Tom's Hardware > Forum > CPU & Components > CPUs > Collection of AMD K10 data

Collection of AMD K10 data - Page 3

Forum CPU & Components : CPUs - Collection of AMD K10 data

Tom's Hardware: Over 1.4 million members in 6 different countries available to answer all your high-tech questions. Sign up now! Its free!
Word :    Username :           
 

Last message on previous page:

Quote :

pope, can you stop trolling on this thread!
if you have something to say about K8L, go ahead. if not, please don't destroy this thread. Also, you don't have to answer me.




so stating a fact is considered trolling? since when? i agreed with FUD that for his desires...AMD is the platform for him

again how is that trolling?

Reply to lordpope
Sponsored Links
Register or log in to remove.

Quote :

pope, can you stop trolling on this thread!
if you have something to say about K8L, go ahead. if not, please don't destroy this thread. Also, you don't have to answer me.


so stating a fact is considered trolling? since when? i agreed with FUD that for his desires...AMD is the platform for him

For the record, I was already aware that C2Q could handle 4x2GB and was checking to see if I had options on the AMD side.

I'll repeat it once again. I do not care one iota if my 2xQuad is made by AMD, Intel or Hasbro.

Sincerely,

FUD. :lol:

Quote :

can I just kidnap the 200+ good posters here and transplant them to the xbit forums?Its virgin turf there and the mods do a great job of keeping up on stuff. :wink:



If I promise promise promise to send you lots of pictures of naked chicks in compromising positions will you please please please pretty please take me too??? 8)

Reply to CaptRobertApril

Quote :

pope, can you stop trolling on this thread!
if you have something to say about K8L, go ahead. if not, please don't destroy this thread. Also, you don't have to answer me.




so stating a fact is considered trolling? since when? i agreed with FUD that for his desires...AMD is the platform for him

again how is that trolling?
gOJDO (and most everyone else) would prefer that we stay on topic -- especially in this "sticky" topic.

Motherboards, chipsets, and the RAM limits imposed by them do not apply to the K10.

Reply to Ranman68k

Quote :

pope, can you stop trolling on this thread!
if you have something to say about K8L, go ahead. if not, please don't destroy this thread. Also, you don't have to answer me.



so stating a fact is considered trolling? since when? i agreed with FUD that for his desires...AMD is the platform for him

again how is that trolling?
gOJDO (and most everyone else) would prefer that we stay on topic -- especially in this "sticky" topic.

Motherboards, chipsets, and the RAM limits imposed by them do not apply to the K10.

Reasonable. Therefore, allow me to ask does the K10's aggressive fine grained clock gating and improvements in coarse graining offer any performance advantages or is it just a minor electricity saving?

Reply to CaptRobertApril

Quote :

This slide:
http://img528.imageshack.us/img528/2699/amdbarcelona13cn9.jpg
confuses me. I found a lot of articles stating that the L1 would be 128kB, same as on K8. So:
1. the 64kB L1 on the AMD slide is a typo or
2. the L1 is 4-way


In the context of this slide, it is possible the slide is describing the L1 cache for data and not the L1 cache for instruction.

It's clear from the slide that it's talking about data cache. BTW, Opteron's L1 caches are 2-way, but it should be orthogonal to cache size, anyway.

Reply to abinstein

Quote :


Barecelona pipeline: http://img162.imageshack.us/img162/6479/k8lexecutionpipelineye3.jpg



This image is totally insignificant in terms of showing changes in the "Barcelona pipeline." The exact same graph was shown in AMD's 2003 presentation on Opteron.

It's interesting to see the turn of opinion tide right after AMD publicly throw out a list of nouns that they added to Barcelona. Barcelona is a beast, I've said that in the beginning. But it's being so not because this long list of tweaking improvements, but because there's a few bigger, more fundamental superiority over its previous generation and competitions:

1. Native quad-core with shared L3 as inclusive/exclusive victim cache.
2. Fine-grain power control, with little need of software support
3. More than double SSE/FP capability.
4. 32B instruction fetch (just say it's not easy to do it efficiently)

The tweaks are nice, but they are what would've made a P-II out of P-pro, or a Conroe out of Yonah. They make a core better, but not a monster.

BTW, I'm curious - how would a chip being a "monster" but not a "beast"?

Reply to abinstein

Quote :

pope, can you stop trolling on this thread!
if you have something to say about K8L, go ahead. if not, please don't destroy this thread. Also, you don't have to answer me.




so stating a fact is considered trolling? since when? i agreed with FUD that for his desires...AMD is the platform for him

again how is that trolling?
gOJDO (and most everyone else) would prefer that we stay on topic -- especially in this "sticky" topic.

Motherboards, chipsets, and the RAM limits imposed by them do not apply to the K10.

After re-thinking ... you ae correct... wont take much to turn this thread into a AMD vs INTEL flame war...

my apologies..

Reply to lordpope

Quote :

Reasonable. Therefore, allow me to ask does the K10's aggressive fine grained clock gating and improvements in coarse graining offer any performance advantages or is it just a minor electricity saving?


If properly implemented, it should help performance under power saving. More thoughts here.

Reply to abinstein

Quote :

Reasonable. Therefore, allow me to ask does the K10's aggressive fine grained clock gating and improvements in coarse graining offer any performance advantages or is it just a minor electricity saving?


If properly implemented, it should help performance under power saving. More thoughts here.

There was some excellent info there. Thanks!

Reply to CaptRobertApril
- 0 +

Quote :


Barecelona pipeline: http://img162.imageshack.us/img162/6479/k8lexecutionpipelineye3.jpg



This image is totally insignificant in terms of showing changes in the "Barcelona pipeline." The exact same graph was shown in AMD's 2003 presentation on Opteron.I realy appretiate your input about K8L(K1) in this thread. Thank you for your corrections and suggestions. It would be nice if you can find or draw a graph with "Barcelona pipeline" or any useful data and share it with us. I'll update this thread with info and data. Any ideas or suggestions would be nice.

Thanks.

Reply to gOJDO

actually, just because it supports 8Gb of memory has nothing to do with the capacity of the modules it is compatible with. well, not literally nothing, but quite little

Reply to rammedstein

Quote :

someoe needs to explain the map to me.I got as far as understanding the path is ram to decoder to alu to ram.from there i am confused as to what data goes where.


you're right, it's the basic pipeline: instruction fetch (ram), decode (decoder), execute (alu), commit (ram). everything else is just detail. :wink:

Reply to abinstein

Quote :

From the specifications, I am quite sure K8 (at least rev. F) and later CPUs can handle 8GB per socket. :wink:



This is my happy dance:

http://www.ilovehou.com/twonicknames/gifsite/july02/happydance.gif

That's great! So I can go off and get my 2xQuad and 4x2GB sticks and live happily ever after! :D

yeah AMD is the only way u can go with that....965\975 boards dont even support 2 gb modules

explain how i am wrong again? i said it doesnt support 2 GB MODULES... i didnt say it couldnt support 8 gb's

you are wrong here P965 supports up to 8GB of memory
thats 4x2GB

http://www.intel.com/products/chipsets/p965/index.htm


so if there are 4 slots for modules and the board supports 8GB
what is 8GB divided by 4 slots? its 2GB so yes you are wrong

lets pretend there are only 2 slots

whats 8GB divided by 2 slots?

stfu your posts are annoying. know what you are talking about before you talk about it

Rammestein wrote """actually, just because it supports 8Gb of memory has nothing to do with the capacity of the modules it is compatible with. well, not literally nothing, but quite little""""

now tell him he is annoying too .... and post me a link that shows a intel 965/975 running with 2 gig modules.

Reply to lordpope
- 0 +

thread updated. :D

Reply to gOJDO

Actually, the diagram for K10 wouldn't be much different from K8. The FP execution units are wider which wouldn't show. And, the new stack hardware wouldn't show nor would the doubled cache buses or doubled prefetch.

The speed that AMD gives for SSE is 3.6X. Since there are four cores instead of two you could divide by 2 and get 1.8 or 80% faster. However, I suppose it would be more accurate to say that widening the SSE units gives a 100% increase in speed while the second set of cores gives the 80%. So, the per core change in SSE should be 100% and this should be the same as C2D. However, because of the additional operations, you can get C2D to run faster if the data requires the right kind of operations and the SSE instructions are tuned correctly. In other words, in general operation K10 and C2D should be about the same at the same clock but specialized SSE operations could be faster on C2D. It is possible that some operations could be a little faster on K10 because of less cache latency but this would only be a small difference.

I was initially thinking about 10% improvement for integer on K10 which would still have been slower than C2D but would have closed most of the gap. This would be due to additional FastPath instructions, elimination of instruction breaking which would require a second fetch, side band stack optimizer, and elimination of fetch delay. However, we also have improvements in branch prediction and out of order loads so I'm now thinking it will be possible to match C2D's integer IPC. I would guess that the great majority of Integer tests would fall within +-5% between C2D and K10. At 3.0Ghz a 3% difference is only 100Mhz. Differences smaller than this are beyond the accuracy of the tests. For example, you can get 2% difference just by changing motherboards.

I have to say though that a bit of editing in the quotes would be nice. It is a bit annoying when someone quotes ten or twenty lines of text and then adds a one line comment. This is doubly annoying when the poster also has a large sig. The jokes about newbie posters who quote large blocks of text and then say, "I agree!" or newbie posters who try to impress people with a large sig have been around for more than 15 years. These jokes actually predate web forums and go back to the old email lists and proprietary dial up bulletin boards. Show everyone that you are actually smart enough to post something worth reading and edit down to the last comment instead of including seven layers of nested quotes. Chain letters are banned for a good reason; the same thing applies to quotes.

Reply to Scientia_From_AMDZ
- 0 +

Well, if AMD said K10 will perform ~40% better than Core2 quads, it HAS GOT to be something around, because if not, there'll be a lot of people willing to set their farm on fire :lol:

Reply to m25

Here's the latest die shots from Hans de Vries at Chip Architect. Looks like they're scaling cache compared to Brisbane too.

http://www.chip-architect.com/news [...] mages.html

Reply to piesquared

Quote :

Here's the latest die shots from Hans de Vries at Chip Architect. Looks like they're scaling cache compared to Brisbane too.

http://www.chip-architect.com/news [...] mages.html


Is that table correct? If so, AMD scaled the cache on Barcelona better than Intel did with Merom. I thought AMD was having issues at scaling with the transition from 90nm to 65nm.

Reply to Ranman68k

we dropped thios issue as to not taint this thread...there are no intel systems runnin 2gb modules for 8 gbs ok...

lets keep this thread about the k109

Reply to lordpope

Quote :

what about simple math do you people not get? the system supports up to 8GB of memory
there are 4 memory slots
8/4=2
it supports 2GB per slot
i dont know how i can make this any more clear


Quit hijacking this thread. :?

You would think that someone with well over 1000 posts would know better. :lol: But, I could be wrong.

Reply to Ranman68k
- 0 +

Quote :

i wasnt hijaking just correcting inaccurate data from someone posting from before.
if you guys want to talk about this stuff at least be accurate.



May I be the one to shoot this dead horse? YES YOU CAN RUN 2GB MEMORY MODULES WITH P965 CHIPSETS. (I'm with beerandcandy on this one)

Now back to Barcelona: I recall a concern Jack once brought up about the separate power planes of the Barcelona chip. It had something to do with the frequency regulation circuits consuming a significant amount of power, and the idea of having multiples of these circuits to be able to slow cores independently may be counter-productive. Is this the case?

Reply to HotFoot
- 0 +

If we hat such pictures we'd also have benchmarks :roll:

Reply to m25

Anyone hear if K10 solves the problem of having to run 2T when 4 DIMMS are used?

Reply to GlacierFreeze

Quote :

Anyone hear if K10 solves the problem of having to run 2T when 4 DIMMS are used?

Both the independence of the two memory controllers and a more flexible configuration of them might help here. We'll wait and see.

Reply to Dresdenboy

does anyone know if they will release a dual core processor with the same improvements of the Barcelona? aka, better power efficiency, better stepping, and less heat dissipation ect?

Reply to rammedstein
- 0 +

Yes, there will be a desktop dualcore CPU(codename Kuma) with all the K10 improvements and features. It will be clocked higher than the quadcore and will have more resources per core: more shared L2 per core, ODMC shared between 2 cores, etc.

Reply to gOJDO
- 0 +

thanks Jack

Reply to gOJDO
- 0 +

That is a GREAT article..

Finally got a chance to give it a read.

Looks like the rumor of power plane per core is false, likely started by the fact the the Northbridge and the cores (all 4) will have separate planes.

This is shaping up to be a great volley back at Intel...

If the implementation and price are right I think we are looking at a new performance champion. Just a little worried about pricing given the 11 layers and the inclusion of the L3 cache. Is sounding again like binning will be interesting.

Reply to Ches111

Quote :


Looks like the rumor of power plane per core is false, likely started by the fact the the Northbridge and the cores (all 4) will have separate planes.



Can't remember if anything actually said power plane per core (maybe it did somewhere) but at least each core not used will have less power draw. That will help some anyways.

Does anyone have any "not so leaked" info??? I'm sure someone does haha.

Reply to GlacierFreeze

Three Words: Can't Friggin Wait!

Reply to Insane_Maniac

Quote :

http://www.anandtech.com/cpuchipse [...] spx?i=2939

Aside from the relatively large size (albiet a quad core), Anand is quoting 11 metalization layers....

This is a record. Making much more difficult to yield over the 9 layer K8.

I think, the most negative effect would be the additional processing steps required. But the designers wouldn't have chosen the additional layers if there wouldn't be any benefit.

But if you say "much more difficult to yield" you imply, that they added more of the finest metalization layers, which are still more prone to defects than the upper layers with increasing size starting from M5 or so.

Already the smallest metal layer structures (vias) are a multiple of the size of e.g. the transistor gate. And the bigger the structures, the smaller is the effect of variations in processing them. A somewhat too thin SiO layer in the transistors can instead render more chips unusable than a vias, which are thinner by the same number of atom layers.

Reply to Dresdenboy

Quote :

The speed that AMD gives for SSE is 3.6X. Since there are four cores instead of two you could divide by 2 and get 1.8 or 80% faster

Am I wrong or when you go from 4p to 2p wont this number be more like 50%. Saying that thru software utilization, crosstalk etc that that doesnt exist in a duo core and therefore the said increases will actually be over 40% ? Or simply put, instead of 80% it going to be 100% ?

Reply to jaydeejohn

That is to say only on a duo core

Reply to jaydeejohn

Quote :

Typically, designers will increase the number of metalization layers to mitigate the RC delay that is slowly becoming the limiter. This, frankly makes some sense, as the 65 nm process showed slightly larger L2 latency... hinting that the 65 nm process design did not eliminate all the speed paths and RC delay usually goes up for a shrink.


But there are other possible reasons for additional layers. Do you remember the Stanford presentations held by Kevin McGrath and Jerry Moench? I think, it was the latter, who mentioned metal layers dedicated to automated routing of the wires, which sounded like a novum for AMD's CPU design. Intel started to use automated routing heavily for Prescott.

Quote :

The problem is not adding the metallization layers, this is routine as the nodes go up as stated above. What is problematic is they increased a whole 2 layers within the same node for a new architecture, this is not uncommon but not common either. What this means is that it requires at least 2 more masking steps (more expensive, more masks, more litho tools), but it also means the die are also exposed to 2 more cycles of dielectric and metallization defects, increasing the probability of a killer defect in the backend by roughly 2 out of 9, or 22% over the current probability (which is unknown to the world as AMD nor Intel publish such numbers).

Yield goes down with increasing the number of layers, this is a given.


Additional processing steps surely cost, but as I earlier wrote it depends on the feature size, how much it costs and also how prone to defects these steps are. The most critical dimensions exist in the processing of the transistors and the wiring at this level, while the metallization layers - the higher they are, the larger and simpler are their structures. This also allows to use cheaper mask sets and tools. You can see nice examples of these layers in 2 presentations linked on this page:
http://www.malab.com/

Reply to Dresdenboy
- 0 +

Please keep this discussion going. I learn more reading both of yours' posts than anywhere else.

Reply to HotFoot
- 0 +

Props for the list. Very comprehensive and helpful!

Reply to xChaoSx

I am going to have to say good post. glad there a sticky for it.

Reply to AtolSammeek
- 0 +

Somehow i don't see Intel being afraid of the big bad AMD offering.
Too bad, competition would speed up the speed upgrades in the Intel lineup.

Reply to Artmic

Might as well post this here for anyone interested in all the technical details of Barcelona.

Barcelona optimization guide

Reply to piesquared

Quote :

Somehow i don't see Intel being afraid of the big bad AMD offering.
Too bad, competition would speed up the speed upgrades in the Intel lineup.


I like your rig, man, but wonder why not run a 6800 at 3.5Ghz?
not stable?

Reply to hzjeffrey
- 0 +

gOJDO wrote :



Your opinion if you please, do you think it will be the monster they claim ?[/quote]
Yes, it will be a monster, for sure. It is a quadcore after all. How it will compete against current Core2 Quad depends on its the frequency. IMO, clock for clock both will perform similar.



I have to agree, same conclusion I came to as well.


Considering people are already getting 4ghz out of their core2s and quads, I cant see how it will be worth switching over to AMD for those who already have purchased intel chips, unless you got a crappy overclock which doesnt seem to happen very often.. from what Ive seen, 3ghz is very easy to get, alot of people getting 3.2-3.5 on air. Will barcelona be able to reach those frequencies at first? I dont know. But overall it seems like this is merely playing catch up with Intel, not surpassing them. I used to be an AMD fan btw.

Reply to phial
------------------------------ http://www.speedtest.net/result/510440996.png

 

Reply to cherie22984

At least, one post without fight. gOJDO, and everybody else - this is verry, verry nice. I love to read that.

gOJDO - veliki pozdrav iz Kutine iz Hrvatske, ovo sto si napravio je vrhunski. Uzivao sam citati sve to, a nisam mogao vjerovati da si sa baronom pricao bez svadje. Prosli put kad sam ga branio, napisao si mi da je nenormalni majmun koji nema zivot, pa me iznenadilo kako svi lijepo razovarate. Hvala jos jednom na ovom postu.

------------------------------ "You can call me Lood, Robin Lood" (Lood means crazy on my language).
Reply to torcida_kutina
- 0 +

Thank you for sharing that cherie. There's a lot of good insight in that article, and I'm getting a nice lesson in chip architecture.

Yes, it's nice to find a thread that is full of unbiased information.

Reply to clintox

Your welcome. I'm always neutral and open-minded...My first cpu was an AMD 3700, then a x2 4800. I currently have a Q6600, so its really hard for me or any serious enthusiast to be biased.

------------------------------ http://www.speedtest.net/result/510440996.png

 

Reply to cherie22984

heh it seems AMD tried to take on too many new things all at once making its new generation, where as even intel isnt game enough to do something like that - the core 2 duo design is a few tweaks off a core duo, and the pentium m (dothan), and pentium 3/2/pro.

I hope for AMD's sake they pull this one off and have a chance to compete with Intels new cpu (performance crown isnt the only thing to win, efficency and pricing is something else), i think the first gen AMD design here will be ok but the second gen when they realise the potential of there design is where we will see some life here.

Reply to apache_lives

dasickninja wrote :

I don't pretend to understand anything in here, but hopefully this thing overclocks like a dog.



Me, too!This thread is awesome for me because I usually read/post in threads/stickys to try to do three things only:
1) Learn something
2) Help someone/answer OPs questions
3) Ask a question

Thank you gOJDO for the thread very informative even though I don't understand most of this stuff.

Reply to decripple
Tom's Hardware > Forum > CPU & Components > CPUs > Collection of AMD K10 data
Go to:

There are 1351 identified and unidentified users. To see the list of identified users, Click here.

Please mind

You are about to answer a thread that has been inactive for more than 6 months.
If you still wish to proceed, please ensure that your posting is original and does not duplicate or overlap any prior responses to this thread.

Add a reply Cancel
Sponsored links
  • Ask the community now
  • Publish
Ad
They won a badge
Join us in greeting them