AMD (via @momomo_us) has trademarked the term "AMD Infinity Cache." The filing, which is on the Justia Trademarks website, applies to both the chipmaker's processor and graphics cards. In fact, the description of the trademark is so broad that it encompasses just about every type of silicon that AMD manufactures.
But the common consensus is that the trademark correlates with AMD's pending Big Navi launch. Memory bandwidth, among other aspects, is one of the major talking points about Nvidia's Ampere. The GeForce RTX 3090 flaunts an impressive memory bandwidth up to 936.2 GBps. The GeForce RTX 3080 and GeForce RTX 3070 aren't too shabby either, with theoretical values that peak to 760.3 GBps and 448 GBps, respectively.
In contrast, early leaked specifications (which should be taken with a bit of salt) on the Radeon RX 6000 series suggest that the Radeon RX 6900 might be limited to a 256-bit memory interface. The news caused a bit of distress within hardware circles as Big Navi might land with disappointing memory bandwidth. However, the rumors also mentioned the existence of a special cache that could be a game-changer.
Other than the folks at AMD, we doubt anyone has any idea of what the Infinity Cache is truly all about. It might be a new feature, or it could just be a fancy term for an existing concept. For example, AMD branded the L3 cache on its Zen 2 processors as GameCache. It sounds great for marketing, but at the end of the day, it's still just the L3 cache that we've all come to know from most modern CPUs.
When it comes to processors, the cache serves as temporary data storage that allows data to be retrieved quickly. However, the cache is very small, so you can't expect the processor to find all the data it wants inside the cache. A 'cache hit' refers to what happens when the requested data is present in the cache, and a 'cache miss' happens when the data is not readily available.
The same concept applies to graphics cards. For comparison, the Radeon RX 5700 XT is equipped with 4MB of L2 cache. A bigger cache would imply fewer cache misses. If Big Navi were to have a 128MB cache, the graphics card could fetch what it needs from the cache and make fewer trips to the main memory (RAM).
It remains a mystery whether the Infinity Cache actually refers to the L2 cache or a new L3 cache, or something else entirely. Graphics cards commonly come with L1 and L2 caches because the bigger caches are slower and induce higher latency.
There's a possibility that the Infinity Cache may be related to a patent that AMD filed last year on Adaptive Cache Reconfiguration Via Clustering. Subsequently, the authors published a paper on the topic. It talks about the possibility of sharing the L1 caches between GPU cores.
Traditionally, GPU cores have their own individual L1 cache, while the L2 cache is shared among all the cores. The suggested model proposes that each GPU is allowed to access the other's L1 cache. The objective is to optimize the caches' use by eliminating the replicated data in each slice of the cache. The results are pretty amazing. Across a suite of 28 GPGPU applications, the new model improved performance by 22% (up to 52%) and energy efficiency by 49%.
AMD is likely finalizing the preparations to announce the much-awaited Radeon RX 6000 series of graphics cards on October 28. Nvidia's Ampere is a tough nut to crack, so AMD needs to bring its A-game. Perhaps that A-game comes in the form of the Infinity Cache, but we won't know for sure until the company's announcement.