Tom's Hardware > Forum > CPU & Components > CPUs > Collection of AMD K10 data

Collection of AMD K10 data - Page 4

Forum CPU & Components : CPUs - Collection of AMD K10 data

Tom's Hardware: Over 1.4 million members in 6 different countries available to answer all your high-tech questions. Sign up now! Its free!
Word :    Username :           
 

Last message on previous page:

dasickninja wrote :

I don't pretend to understand anything in here, but hopefully this thing overclocks like a dog.



Me, too!This thread is awesome for me because I usually read/post in threads/stickys to try to do three things only:
1) Learn something
2) Help someone/answer OPs questions
3) Ask a question

Thank you gOJDO for the thread very informative even though I don't understand most of this stuff.

Reply to decripple
Sponsored Links
Register or log in to remove.
------------------------------ Q6600@3.4+ TT V1 Cooler,SAPPHIRE HD 4870X2,ASUS MAXIMUS FORMULA,4GB OCZ DDR2 800,LG W2452V 1920x1200
Reply to Maziar

piesquared wrote :

Here's something on Barcelona's new Cache structure.



Interesting read!

I only hope that the lack of benchmark data from AMD prior to launch is due to them shocking us all with a major improvement over Intel. I'd hate barcelona to suffer the same fate as the 2900XT.

Reply to monkeymanuk

Me too. What's wrong with the 2900XT? I thought the only issue was that people were comparing it to the 8800GTX, when the 2900XTX was meant for that comparison.

Reply to toasty2

Now if you can add some Doom 3 benchies, we'd be good to go!

------------------------------ djcoolmasterx - "Ofcourse there is nothing that you are doing that will use that kind of power, beacuse you don't have that kind of powr to do things with."
Reply to Falken699

nice job gOJDO, thanks for the info

Reply to Winly

Everything you need to know about the new K10 architecture that will be used by forthcoming AMD processors, like Phenom and Opteron "Barcelona", including roadmaps

http://www.hardwaresecrets.com/article/480


Message edited by sirkillalot on 09-06-2007 at 10:16:17 AM
------------------------------ AMD P2955X4 ATI 5870 DX11 [:boudy:2]
24" dell 1900x1200 2x74 wraptor hd's1 X-FI pro gamer sounds
2G ram log.Z5500 speakers 650w toughpower G15kb dvd....ITS A MEAN GREEN GAMING MACHINE
Reply to sirkillalot

Good contribution, great article. Anyone know how much the cheapest Phenom X4 is likely to cost?

Reply to toasty2

... and a new slideset with more details on caches and imc:
http://nccs.gov/news/workshops/fy0 [...] 073007.pdf

Reply to Dresdenboy

Randys Law...
bleeding edge technology will always bleed your bank account

new information releases almost always preceed a manufacturing setback

Reply to onestar

Is this designed to compete (and beat) the C2D/C2Q or Penryn?

------------------------------ And on the third day, God created the Remington bolt-action rifle, so that Man could fight the dinosaurs. And the homosexuals.
Reply to spaztic7

It is designed to compete with (and possibly beat, I'm not sure on that one...) the C2D/C2Q. I think Penryn is likely to beat K10.

Reply to toasty2

Anyone know what time zone is the reference for 9/10?

Reply to d4NjvRzf

toasty2 wrote :

It is designed to compete with (and possibly beat, I'm not sure on that one...) the C2D/C2Q. I think Penryn is likely to beat K10.



Then for the sack of argument... what’s the point! Is AMD saying by not going to the high end that they give up? Are they admitting that they are second best? This is a sad, sad day if this is true.

Then again, it is AMD's own fault for waiting untill Intel caught up/ surpased them.

------------------------------ And on the third day, God created the Remington bolt-action rifle, so that Man could fight the dinosaurs. And the homosexuals.
Reply to spaztic7

the only data that is relevant is that a barcy is only at most 15% faster than an X2 when processing a single thread or even a pair of threads. That means that a barcy 2350 (for $400) is no better than a $102 X2-4800 for all single and dual thread applications. That is unbelievable.... it means that AMD is essentially abandoning the desktop market.


Message edited by shadowmaster625 on 09-11-2007 at 11:10:09 PM
Reply to shadowmaster625

spaztic,

yeah, it's is pretty logical to assume that AMD can keep up with Intel when it comes to R & D with a very small amount of capital in comparison to what Intel is working with.

Shadowmaster625,

stop being a fanboy.


Message edited by weskurtz81 on 09-20-2007 at 09:51:51 PM
Reply to weskurtz81



:sleep: So after all that crap :bounce: :bounce: DOES IT DO WHAT IT SAY ON THE BOX!!!! :lol:

Reply to PFZ1

This is all great. But the K10 has been out for a while now. Where can I find a reliable performance per Mhz benchmark ?? Kentfield Vs. Phenom ?

Reply to enewmen

Hmmmm, It appears to me that AMD is focusing on the server market.

That said, it might be time for me to upgrade my servers.

Reply to onestar

gOJDO wrote :

http://origin.arstechnica.com/staf [...] celona.gif

Barcelona ES:http://img259.imageshack.us/img259 [...] jdocf8.jpg

Barcelona die (11 metal layers, 283mm^2, 463M transistors):http://pc.watch.impress.co.jp/docs [...] ai_01l.gif

Barcelona waffer:http://img403.imageshack.us/img403 [...] ferns2.jpg


Macro/micro-architectural improvements over K8:

Quad-core
- Native quad-core design
- Redesigned and improved crossbar(northbridge)
- Improved power management
- New level of cache added, L3 VICTIM
Power management - DICE(Dynamic Independent Core Engagement)
- PLLs for each core, clocked independently and varies clock speed depending on usage.
- ODMC power management: ability to shut down read channels if memory is only using writes and vice versa:
* Reduces the power consumption of the memory controller by up to 80% on "many" workloads.
- Aggressive grained clock gating
- Power management state invariant time stamp counter (TSC)
- Enhanced AMD's PowerNow - works independently without OS driver support
Virtualization improvements
- Nested Paging(NP):
* Guest and Host page tables both exist in memory.(The CPU walks both page tables)
* Nested walk can have up to 24 memory acesses! (Hardware caching accelerates the walk)
* "Wire-to-wire" translations are cached in TLBs
* NP eliminates Hypervisor cycles spent managing shadow pages(As much as 75% Hypervisor time)
- Reduced world-switch time by 25%:
* World-switch time: round-trup to Hypervisor and back
Dedicated L1 cache
- 256bit 128kB (64kB instruction/64kB data), 2-way associative
- 2 x 128bit loads/cycle
- lowest latency
Dedicated L2 cache
- 128bit 512kB, 16-way associative
- 128bit bus to northbridge
- reduced latency
- eliminates conflicts common in shared caches - better for virtualization
Shared L3 cache
- 128bit 2MB, 32-way associative
- Victim-cache architecture maximizes efficiency of cache hierarchy
- Fills from L3 leave likely shared lines in the L3
- Sharing-aware replacement policy
- Expandable
Independent DRAM controllers
- Concurrency
- More DRAM banks reduces page conflicts
- Longer burst length improves command efficiency
- Dual channel unbuffered 1066 support(applies to socket AM2+ and s1207+ QFX only)
- Channel Interleaving
Optimized DRAM paging
- Increase page hits
- Decrease page conflicts
Re-architect northbridge for higher bandwidth
- Increase buffer sizes
- Optimize schedulers
- Ready to support future DRAM technologies
Write bursting
- Minimize Rd/Wr Turnaround
DRAM prefetcher
- Track positive and negative, unit and non-unit strides
- Dedicated buffer for prefetched data
- Aggressively fill idle DRAM cycles
Core prefetchers
- DC Prefetcher fills directly to L1 Cache
- IC Prefetcher more flexible
* 2 outstanding requests to any address
HyperTransport 3
- Up to three 16bit cHT links
- Up to 5200MT/s per link
- Un-ganging mode: each 16bit HT link can be divided in two 8bit virutal links
- Can dynamically adjust frequency and bit width to save power
- AC mode (higher latency mode) to allow longer communications distances
- Hot pluggable

Barecelona pipeline (12/18 ALU/FPU stages): http://img162.imageshack.us/img162 [...] ineye3.jpg

CPU Core IPC Enhancements:
Advanced branch prediction
- Dedicated 512-entry Indirect Predictor
- Double return stacksize
- More branch history bits and improved branch hashing
History-based pattern predictor
32B instruction fetch
- Benefits integer code too
- Reduced split-fetch instruction cases
Sideband Stack Optimizer
- Perform stack adjustments for PUSH/POP operations “on the side”
- Stack adjustments don’t occupy functional unit bandwidth
- Breaks serial dependence chains for consecutive PUSH/POPs
Out-of-order load execution
- New technology allows load instructions to bypass:
* Other loads
* Other stores which are known not to alias with the load
- Significantly mitigates L2 cache latency
TLB Optimisations
- Support for 1G pages
- 48bit physical address (256TB)
- Larger TLBs key for:
* Virtualized workloads
* Large-footprint databases and
* transaction processing
- DTLB:
* Fully-associative 48-way TLB (4K, 2M, 1G)
* Backed by L2 TLBs: 512 x 4K, 128 x 2M
- ITLB:
* 16 x 2M entries
Data-dependent divide latency
Additional fastpath instructions
– CALL and RET-Imm instructions
– Data movement between FP & INT
Bit Manipulation extensions
- LZCNT/POPCNT
SSE extensions
- EXTRQ/INSERTQ (SSE4A)
- MOVNTSD/MOVNTSS (SSE4A)
- MWAIT/MONITOR (SSE3)
Comprehensive Upgrades for SSE
- Dual 128-bit SSE dataflow
- Up to 4 dual precision FP OPS/cycle
- Dual 128-bit loads per cycle
- New vector code, SSE128
- Can perform SSE MOVs in the FP “store” pipe
- Execute two generic SSE ops + SSE MOV each cycle (+ two 128-bit SSE loads)
- FP Scheduler can hold 36 Dedicated x 128-bit ops
- SSE Unaligned Load-Execute mode:
* Remove alignment requirements for SSE ld-op instructions
* Eliminate awkward pairs of separate load and compute instructions
* To improve instruction packing and decoding efficiency

Most of the informations are from Ben Sander's presentation at AMD FPF 2006, but also there are other informations included from various internet sites.

AMD Software Optimization Guide for K10

Educative articles about K8L(K10):
http://www.anandtech.com/cpuchipse [...] i=2939&p=1
http://www.xbitlabs.com/articles/c [...] d-k8l.html
http://www.extremetech.com/article [...] 644,00.asp
http://www.eetimes.com/news/semi/s [...] =193200399
http://www.channelinsider.com/arti [...] 008_1.aspx
http://www.realworldtech.com/page. [...] 0206035626
http://www.theregister.co.uk/2007/ [...] _powernow/
http://www.tgdaily.com/2007/02/11/amd_barcelona/

AMD official statements and public presentations:
Syndrome-oc interview with Giuseppe Amato & Philip G. Eisler
HEXUS interview with Patrick Patla, Director of AMD Server Workstation Division
Interview with Randy Allen, AMD's corporate vice president for servers and workstations
AMD Developer Day, London Dec/06/2006 presentation
Game Developers Conference 2007, Justin Boggs AMD

Roadmap(speculative):

Server:
Opteron 8272SE 2.6GHz 120W TDP, socket F, 3 cHT links, 3600MT/s, DDR2-667, Q2 2008
Opteron 8270SE 2.5GHz 95W TDP, socket F, 3 cHT links, 3400MT/s, DDR2-667, Q3 2007
Opteron 8268SE 2.4GHz 89W TDP, socket F, 3 cHT links, 3400MT/s, DDR2-667, Q3 2007
Opteron 8266 2.3GHz 89W TDP, socket F, 3 cHT links, 3200MT/s, DDR2-667, Q3 2007
Opteron 8264 2.2GHz 89W TDP, socket F, 3 cHT links, 3200MT/s, DDR2-667, Q3 2007
Opteron 8262 2.1GHz 89W TDP, socket F, 3 cHT links, 3000MT/s, DDR2-667, Q3 2007
Opteron 8260HE 2.0GHz 68W TDP, socket F, 3 cHT links, 3000MT/s, DDR2-667, Q4 2007
Opteron 8258HE 1.9GHz 68W TDP, socket F, 3 cHT links, 2800MT/s, DDR2-667, Q4 2007
Opteron 2272SE 2.6GHz 120W TDP, socket F, 2 cHT links, 3600MT/s, DDR2-667, Q2 2008
Opteron 2270SE 2.5GHz 95W TDP, socket F, 2 cHT links, 3400MT/s, DDR2-667, Q3 2007
Opteron 2268SE 2.4GHz 89W TDP, socket F, 2 cHT links, 3400MT/s, DDR2-667, Q3 2007
Opteron 2266 2.3GHz 89W TDP, socket F, 2 cHT links, 3200MT/s, DDR2-667, Q3 2007
Opteron 2264 2.2GHz 89W TDP, socket F, 2 cHT links, 3200MT/s, DDR2-667, Q3 2007
Opteron 2262 2.1GHz 89W TDP, socket F, 2 cHT links, 3000MT/s, DDR2-667, Q3 2007
Opteron 2260HE 2.0GHz 68W TDP, socket F, 2 cHT links, 3000MT/s, DDR2-667, Q4 2007
Opteron 2258HE 1.9GHz 68W TDP, socket F, 2 cHT links, 2800MT/s, DDR2-667, Q4 2007
Opteron 1370SE 2.5GHz 95W TDP, socket AM2+, 1 cHT link, 3400MT/s, DDR2-1067, 2008
Opteron 1368SE 2.4GHz 89W TDP, socket AM2+, 1 cHT link, 3400MT/s, DDR2-1067, 2008
Opteron 1366 2.3GHz 89W TDP, socket AM2+, 1 cHT link, 3200MT/s, DDR2-1067, 2008
Opteron 1364 2.2GHz 89W TDP, socket AM2+, 1 cHT link, 3200MT/s, DDR2-1067, 2008
Opteron 1362 2.1GHz 89W TDP, socket AM2+, 1 cHT link, 3000MT/s, DDR2-1067, 2008

Desktop:
AgenaFX 2.6GHz, unknown TDP, socket 1207+(Quad FX), 2 cHT links, 3600MT/s. DDR2-1067, Q3 2007
AgenaFX 2.4GHz, unknown TDP, socket 1207+(Quad FX), 2 cHT links, 3600MT/s. DDR2-1067, Q3 2007
AgenaFX 2.4GHz, unknown TDP, socket 1207+(Quad FX), 2 cHT links, 3200MT/s. DDR2-1067, Q3 2007
AgenaFX 2.2GHz, unknown TDP, socket 1207+(Quad FX), 2 cHT links, 3200MT/s. DDR2-1067, Q3 2007
AgenaFX 2.4GHz, unknown TDP, socket AM2+, 2 cHT links, 3200MT/s. DDR2-1067, Q3 2007
AgenaFX 2.2GHz, unknown TDP, socket AM2+, 2 cHT links, 3200MT/s. DDR2-1067, Q3 2007
Agena 2.4GHz, 89W TDP, socket AM2+, 1 cHT link, 3600MT/s, DDR2-1067, Q4 2007
Agena 2.2GHz, 89W TDP, socket AM2+, 1 cHT link, 3200MT/s, DDR2-1067, Q4 2007
Kuma(dualcore) 2.8GHz, 89W TDP, socket AM2+, 1 cHT link, 4200MT/s, DDR2-1067, Q4 2007
Kuma(dualcore) 2.6GHz, 65W TDP, socket AM2+, 1 cHT link, 3800MT/s, DDR2-1067, Q4 2007
Kuma(dualcore) 2.4GHz, 65W TDP, socket AM2+, 1 cHT link, 3600MT/s, DDR2-1067, Q4 2007
Kuma(dualcore) LP 2.3GHz, 45W TDP, socket AM2+, 1 cHT link, 3400MT/s, DDR2-1067, Q1 2008
Kuma(dualcore) LP 2.1GHz, 45W TDP, socket AM2+, 1 cHT link, 3000MT/s, DDR2-1067, Q1 2008
Kuma(dualcore) LP 1.9GHz, 45W TDP, socket AM2+, 1 cHT link, 2800MT/s, DDR2-1067, Q1 2008


Roadmap sources:
http://www.dailytech.com/Final+AMD [...] ware.co.uk
http://www.dailytech.com/More+Deta [...] le7147.htm (2007-05-03)
[url]http://www.cpilive.net/v3/inside.aspx?scr=n&NID=1320[/quote] (2007-04-15)
http://www.hkepc.com/bbs/itnews.ph [...] &endtime=0 (2007-02-23)
http://trackingamd.blogspot.com/20 [...] ealed.html
http://www.hkepc.com/bbs/itnews.php?tid=709944
http://www.dailytech.com/AMD+Quadc [...] le5992.htm


P.S. Any additional data or informations will be highly appreciated


So the socket 1207+ fasn8 systems are still a go according to this chart, is that right?

------------------------------ Q6600 (overclocked to 3.2ghz) GAp35-DS-3L mobo, 8BG G-SKILL ddr2-1066, gigabyte gts 250(1GB), 2x dvd burner,320gb hard drive,640gb hard drive, black antec p182 case with corsair 750 watt psu.
Reply to reconviperone1
1 2 3 4
Next
Tom's Hardware > Forum > CPU & Components > CPUs > Collection of AMD K10 data
Go to:

There are 523 identified and unidentified users. To see the list of identified users, Click here.

Please mind

You are about to answer a thread that has been inactive for more than 6 months.
If you still wish to proceed, please ensure that your posting is original and does not duplicate or overlap any prior responses to this thread.

Add a reply Cancel
Sponsored links
  • Ask the community now
  • Publish
Ad
They won a badge
Join us in greeting them