Local and Global Data Share
With the RV770, the AMD engineers didn’t stop at optimizing their architecture to only slightly increase the die real-estate— they also borrowed a few good ideas from the competition. The G80 had introduced a small, 16-KB memory area per multiprocessor that’s entirely under the programmer’s control, unlike a cache. This memory area, accessible in CUDA applications, can share data among threads. AMD has introduced its version of this with the RV770. It’s called Local Data Share and is exactly the same size as its competitor’s Shared Memory. It also plays a similar role by enabling GPGPU applications to share data among several threads. The RV770 goes even further, with another memory area (also 16 KB) called Global Data Share to enable communication among SIMD arrays.
While the ALUs haven’t undergone a major modification, the texture units have been completely redesigned. The goal was obvious – as with the rest of the GPU, it was to increase performance significantly while maintaining as small a die area as possible. The engineers set fairly ambitious goals, aiming for an increase of 70% in performance for an equivalent die area. To do that, they focused their efforts largely on the texture cache. The bandwidth of the L1 texture cache was increased to 480 GB/s.
But that’s not all; the L1 cache that was shared by all the SIMD arrays has been broken down into 10 cache memories, one per SIMD array, and each contains only data exclusive to the corresponding SIMD array. Shared data are now stored in an L2 cache, which has also been completely redesigned, now having a bandwidth 384 GB/s to the L1 cache. In order to reduce latency, this L2 cache has been positioned near the memory controllers.Let’s see what the results of these improvements are in practice:
Compared to its direct competitor, the 9800 GTX, the Radeon HD 4850 showed first-rate performance with single and dual texturing, while not giving up any performance in terms of raw fill rate – which is to be expected considering the 40 texture units for 16 ROPs (to simplify, “2.5 texture units per pixel,” as they used to say in another era). On the other hand, with triple and quad texturing, the RV770, logically enough, can’t compete with the G92’s 64 texture units (the equivalent of “4 texture units per pixel”); but in all cases the RV770 proved to be closer to its theoretical performance than its competitor.
Current page: Local and Global Data SharePrev Page The Architecture in Detail Next Page ROPs, Memory Controller
Stay on the Cutting Edge
Join the experts who read Tom's Hardware for the inside track on enthusiast PC tech news — and have for over 25 years. We'll send breaking news and in-depth reviews of CPUs, GPUs, AI, maker hardware and more straight to your inbox.
Wow $200 in Best Buy for a HD 4850,Reply
$450 in Best Buy for a GTX 260.
And the 4850 is pretty close to the 280.
Ouu the 4870 is going to give Nvidia a run for there money
for the first time in a while.
Good stuff now we just need some 4870 benchies!Reply
Oced Asus and 4850 instead of 4870 + too many games based on engines favoring nVidia...Reply
P.S. +1000 -> 2222
Florian you put a 1920×1077 image on the Crysis page! :lol:Reply
For Mass Effect the Engine limits the Maximum framerate to 62FPS. You can change this in the BIOENGINE.INI file (in the Documents\BioWare\Mass Effect\Config\ folder on Vista) by changing the value:Reply
MaxSmoothedFrameRate=62 in the Engine.GameEngine section
I can't believe it took nVidia coming out with a new card again to have tom's make a review finally of the 4850.Reply
"it was unavailable due to the sloppy handling of this launch"
Seriously? AMD can't control if their retail partners screwed the pooch on the release date, because they were so anxious to get people this great product. They made sure the product was readily available well before the launch date.
They should be praised for not having a paper launch, not told that it was a sloppy launch, very poor form saying that.
Hell i went to best buy and bought 2 4850's on sunday, when the cards weren't even supposed to be available yet, the guy told me "they have been in stock for over a month in the back, they aren't supposed to be available yet but i can get two for you." Were the AMD police supposed to come and smack best buy on it's hand and keep me from giving them profits?
Sorry if i'm ranting, just put the blame where it belongs.
No offense, Fedy Abi-Chahla and Florian Charpentier, and thanks for the hard work, but I think the article should be revised a bit. First off, this should be a review of graphics cards.. not a burned out overclocked Asus motherboard. If you attribute your 4850 test crashing due to your motherboard.. why throw in results of 0 across the board for the 4850? You just corrupted your data and made the final fps averages meaningless, which is the thing people were generally interested in. Secondly, why in the world are you including tests that don't fit the definition of "playable" on any card in your test lineup (Crysis 2560x1600). It just throws off averages, as people aren't going to run this game at 7fps! If there's no card in the lineup that gets close to 30fps in a certain test, just move on! Save it for the quad crossfire or triple sli tests or something. You're giving high weights to resolutions that only a fraction of a percentage point of dedicated gamers can utilize (and those wouldn't bother with a single GPU). Lastly, please get those annoying gigantonormous screenies out of the review. It makes the review look like it was done by kindergarteners.Reply
I didn't finish reading the whole article yet but was the driver hotfix and the current 8.6 driver applied to the 4850?? It improved performance and stability greatly as i saw, it make the actual clock speed the card is set it run nicely and gives it great overhead to overclock through the CCCReply
Who wants the Radeon 4800 full revew? (including the 4870, that do better than the GTX 280 in some games!)Reply
In french, but the graphs talk by themselves. Ho, and if you want a short translation = impressive and incredibly more efficient than Nvidia (if you compare the size of the GPU, yes it's A LOT more efficient)
These reviews are getting better! Although I have seen many benchmarks and tests of the 4850 before this, I still love seeing how the 48x0 line is doing against the green machine! Anandtech.com has a kill 4870 review!Reply