Hardware and software setup

AMD Radeon video card families: reference information. AMD Radeon HD 7800 video card comparison


"Southern Islands"

First, a little about how AMD labels its latest products. The manufacturer has divided them into three tiers by performance. The code name "Cape Verde" refers to the Radeon HD 7700. "Pitcairn" refers to the cards tested today, the Radeon HD 7870 and HD 7850. The high-end products are called "Tahiti", or Radeon HD 7900. The list below shows this more clearly.

  • Entry level = Cape Verde = Radeon HD 7700 series;
  • Mainstream = Pitcairn = Radeon HD 7800 series;
  • High performance products = Tahiti = Radeon HD 7900 series.

In other words, AMD has now covered all market segments with its 28 nm graphics chips. Only a dual-GPU video card based on Tahiti chips is still expected, tentatively named Radeon HD 7990.

AMD Radeon HD 7800 Series Features

The Radeon HD 7800 graphics processor (Pitcairn) has about 2.8 billion transistors and uses the Graphics Core Next microarchitecture. As mentioned above, the Radeon HD 7850 chip (Pitcairn Pro) has 16 compute units, and its maximum TDP is 130 watts. For the Radeon HD 7870 (Pitcairn XT), these figures are 20 and 175, respectively.

The slide below shows the main specifications of the Radeon HD 7850 and HD 7870 video cards.

2 GB of GDDR5 memory is already becoming standard on most models in the mid-range and high-end price ranges. Thanks to the 256-bit bus and a high clock frequency of 1200 MHz (4800 MHz effective), the bandwidth is 154 GB/s. This will have a positive effect on performance in games at high resolution and picture quality.
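The bandwidth figure follows directly from the bus width and the effective memory rate. A minimal sketch of that arithmetic is given here; the function name is only illustrative, and the inputs are the values quoted in this article.

```python
def memory_bandwidth_gb_s(bus_width_bits: int, effective_rate_mhz: float) -> float:
    """Peak memory bandwidth in GB/s (1 GB = 10^9 bytes).

    Bytes moved per transfer (bus width / 8) times transfers per second.
    """
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * effective_rate_mhz * 1e6 / 1e9


# Radeon HD 7800 series: 256-bit bus, 4800 MHz effective
hd7800 = memory_bandwidth_gb_s(256, 4800)
print(f"Radeon HD 7800 series: {hd7800:.1f} GB/s")
```

The same function applied to a 384-bit bus at 5500 MHz effective gives the 264 GB/s quoted for the Radeon HD 7970.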

PCI Express 3 interface

In the second half of 2011, almost all motherboard manufacturers presented models with the third-generation PCI Express interface. With the release of the Radeon HD 7000 series, video cards with this interface appeared as well. PCI Express 3 has twice the bandwidth of the previous generation (about 32 GB/s for an x16 slot, counting both directions). Compared to PCIe 2, the throughput per lane has roughly doubled, from 500 MB/s to about 1 GB/s.

Naturally, to take advantage of the new PCIe 3, you need to have not only a video card and motherboard with this interface, but also support from the processor (not all models from the Ivy Bridge family will support PCIe 3).

Eyefinity 2.0

AMD has gone further with its Eyefinity technology, which is designed to display images across multiple monitors. Thanks to the high processing power of the HD 7000 series and support for Eyefinity 2.0, it is now possible to display an image on multiple monitors with a total resolution of up to 16000×16000. This allows you to display the image on 5 displays with a resolution of 2560×1600 each, set in landscape orientation. To work with such resolutions, a record 3 GB of GDDR5 is installed on the top models of the family (HD 7970 and HD 7950).

AMD Catalyst drivers will support custom resolution starting in February. That is, you can set the required resolution depending on the configuration of the displays in Eyefinity. As of Catalyst 12.2, there is an option to set the Start menu to a display that is convenient for you, instead of the far left as it used to be. In addition, Eyefinity 2 supports stereo HD3D output. It supports the combination of three monitors that work in 3D mode.

Improved tessellation

AMD's Radeon HD 7000 family graphics cards feature a ninth-generation tessellator and show significantly better geometry processing performance in today's games. The GCN core still includes two Graphics Engines, but where they previously contained fixed blocks for tessellation and rasterization, they now consist of an arbitrary number of pipelines designed to process geometry and pixels.

AMD Radeon HD 7800 graphics cards support the HDMI 1.4a interface, which can output a 120 Hz picture (60 Hz for each eye) and therefore display a 3D image. This was impossible with earlier HDMI versions. Starting in December, AMD enabled HD3D and Eyefinity to work together in its drivers.

DirectX 11.1

Video cards of the Radeon 7000 family will support the upcoming DirectX 11.1. What this will give in practice is too early to say, since DX 11.1 will be released along with Windows 8. The main advantages of the new API are indicated as follows:

  • Independent rasterization;
  • Flexible combination of graphics computing and video processing;
  • Native Stereo 3D support.

AMD Unified Video Decoder

The Unified Video Decoder (UVD) is the hardware part of AMD GPUs responsible for decoding video streams. In the Radeon HD 7000 series, UVD received some improvements. In general, it retained all the functions of its predecessors, namely support for H.264/AVCHD, MPEG-2, MPEG-4/DivX, VC-1/WMV profile D and Multi-View Codec (MVC), and the chips also include the Video Codec Engine (VCE) and AMD Steady Video 2.0. Support for the Dual Stream HD+HD format has been added.

  • Chip codename: "Tahiti"
  • 4.3 billion transistors (more than 60% more than Cayman and exactly twice as much as Cypress)
  • 384-bit memory bus: six 64-bit wide controllers with GDDR5 memory support
  • Core clock: up to 925 MHz (for Radeon HD 7970)
  • 32 GCN Compute Units, including 128 SIMD cores, consisting of a total of 2048 floating point ALUs (integer and float formats, supports FP32 and FP64 precision within the IEEE 754 standard)
  • 128 texture units, with support for trilinear and anisotropic filtering for all texture formats
  • 32 ROPs with support for anti-aliasing modes with the possibility of programmable sampling of more than 16 samples per pixel, including with FP16 or FP32 framebuffer format. Peak performance up to 32 samples per clock, and in colorless mode (Z only) - 128 samples per clock
  • Integrated support for six monitors including HDMI 1.4a and DisplayPort 1.2

Radeon HD 7970 Graphics Specifications

  • Core clock: 925 MHz
  • Number of universal processors: 2048
  • Number of texture units: 128, blending units: 32
  • Effective memory frequency: 5500 MHz (4×1375 MHz)
  • Memory type: GDDR5
  • Memory capacity: 3 gigabytes
  • Memory bandwidth: 264 gigabytes per second
  • Theoretical maximum fill rate: 29.6 gigapixels per second.
  • Theoretical texture sampling rate: 118.4 gigatexels per second
  • Two CrossFire connectors
  • PCI Express 3.0 bus
  • Power consumption: 3 to 250 W
  • One 8-pin and one 6-pin power connector
  • Dual slot design
  • US MSRP: $549

Radeon HD 7950 Graphics Specifications

  • Core frequency: 800 MHz
  • Number of universal processors: 1792
  • Number of texture units: 112, blending units: 32
  • Effective memory frequency: 5000 MHz (4×1250 MHz)
  • Memory type: GDDR5
  • Memory capacity: 3 gigabytes
  • Memory bandwidth: 240 gigabytes per second
  • Theoretical maximum fill rate: 25.6 gigapixels per second.
  • Theoretical texture sampling rate: 89.6 gigatexels per second
  • Two CrossFire connectors
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, two Mini-DisplayPort 1.2
  • Power consumption: 3 to 200 W
  • Dual slot design
  • US MSRP: $449
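The theoretical fill and texturing rates in the two lists above are simply the unit counts multiplied by the core clock. The sketch below reproduces them from the listed specifications; the `Card` class and its method names are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Card:
    name: str
    core_mhz: float  # core clock, MHz
    rops: int        # blending units: one pixel per clock each
    tmus: int        # texture units: one texel per clock each

    def pixel_fill_gpix_s(self) -> float:
        """Theoretical fill rate in gigapixels per second."""
        return self.rops * self.core_mhz / 1000

    def texel_rate_gtex_s(self) -> float:
        """Theoretical texture sampling rate in gigatexels per second."""
        return self.tmus * self.core_mhz / 1000


CARDS = [
    Card("Radeon HD 7970", core_mhz=925, rops=32, tmus=128),
    Card("Radeon HD 7950", core_mhz=800, rops=32, tmus=112),
]

for card in CARDS:
    print(f"{card.name}: {card.pixel_fill_gpix_s():.1f} Gpix/s, "
          f"{card.texel_rate_gtex_s():.1f} Gtex/s")
```

These are peak values only; the rate reached in practice also depends on memory bandwidth, as discussed later in the article.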

The high complexity of the new chip stands out: 4.3 billion transistors, more than one and a half times as many as in the previous top-end graphics processor. Such a complex die was made possible by a modern 28-nanometer process technology, and the new chip even turned out slightly smaller in area than Cayman. Its practical characteristics that affect performance have improved noticeably: the number of ALUs and TMUs and the width of the memory bus. Only the number of ROPs did not increase, and the frequency of the GDDR5 video memory remained at the same level.

The company's naming scheme for video cards remains the same. The Radeon HD 7970 is the company's fastest single-chip solution; the junior model, the HD 7950, was announced and released a little later. Initially, the HD 7970 had no competitors on the market and did not replace any specific video card in the AMD line, but rather pushed the existing line down. As for the competition, NVIDIA released its 28 nm solution much later.

The new AMD video card uses the same GDDR5 memory, but its capacity has grown from 2 gigabytes in the previous generation to 3 gigabytes. This follows from widening the memory bus from 256 to 384 bits: the new board can carry either 1.5 GB or 3 GB. Naturally, from a marketing point of view, a smaller capacity than before would be a clear disadvantage, so the decision was made to install 3 GB, although today this is a bit of an overkill. Only at ultra-high resolutions with 16x MSAA will 1.5-2 GB not be enough. However, AMD also has Eyefinity, and in games on three or more monitors the frame buffer takes up a very large amount of memory.

So, let's look at the Radeon HD 7970. The new high-end video card has a two-slot cooling system covered by a plastic shroud running the full length of the card, as is usual for all modern AMD boards. The design of the shroud has changed only slightly, and its rear still extends beyond the PCB. The bracket with the output connectors has changed, however: to improve cooling, one of the two slots (half of the bracket) is now occupied entirely by a ventilation opening for exhausting hot air.

Users should not suffer from the reduced number of DVI connectors soldered on the board, though. For their convenience, a special HDMI-to-DVI adapter will be included in the package, making it possible to connect two monitors with DVI inputs. Incidentally, the power consumption of the new card is no lower than that of the Radeon HD 6970, so it needed one 8-pin and one 6-pin power connector.

The cooling system of the new Radeon HD 7970, however, has changed for the better. It uses a new-generation vapor chamber and a new, larger fan with a modified blade shape that delivers greater airflow. The result is higher cooling efficiency with less noise.

The Dual BIOS switch, which we wrote about in the description of the Radeon HD 6900, has not disappeared from the board either. In short: the video card has two BIOS versions, one that the user can reflash and one with firmware hard-wired at the factory. Both users and AMD liked this convenient solution so much that AMD decided to keep equipping its top-end products with it.

We can only welcome this decision. It really helps on various occasions: it protects against unexpected problems during flashing (a power cut in the middle of the process, for example) and lets you experiment fearlessly with BIOS images. Unsurprisingly, AMD hints again and again at the excellent overclocking capabilities of the new graphics card:

As you can see, overclocking to 1 GHz and higher is practically promised, if you ignore the small print (not included in the screenshot) stating that the warranty ceases to apply if the video card fails as a result of experiments with raising the frequency, even from the video driver settings.

Architectural features of the Radeon HD 7970

To assess the relevance of the architectural modifications in Southern Islands, first consider how GPUs have developed over the past few years as AMD sees it. Until 2002, graphics chips were specialized hardware capable only of graphics computations. The video chips of that time had limited functionality: they could only apply and filter textures, process geometry, and perform simple rasterization, and were therefore not at all suitable for general-purpose computing.

Over the next few years, basic programmability was added to the GPU, but it was still focused exclusively on graphics tasks. This was the era of DirectX 8 and 9, with shader programs of limited functionality and floating-point support. Video chips of that time had specialized ALUs for vertex and pixel processing, as well as dedicated caches for pixels, textures, and other data. General-purpose use was still nowhere near.

Only in 2007 did AMD get a unified DirectX 10 shader architecture, as well as the ability to program the GPU using special tools: CAL, Brook, ATI Stream. GPUs of that time already had advanced caching and support for local and global shared data. Architecturally, the chips were based on VLIW5 and VLIW4 blocks, flexible enough for some basic non-graphics computing but still focused on graphics algorithms.

And now it is time for a new architecture that is even better suited for general-purpose computing: Graphics Core Next (GCN). For AMD, this is a new architectural era, which is why the name was chosen. The new GPUs offer excellent graphics capabilities and performance, but the architectural changes are intended primarily to strengthen their position in non-graphics computing by increasing performance and efficiency in complex general-purpose tasks. The new GPU design targets so-called heterogeneous computing, a mixture of graphics and general-purpose computing in a multitasking environment. The GCN architecture has become more flexible and should be even better suited for energy-efficient execution of various tasks.

The basic building block of the new architecture is the GCN block (compute unit). All new GPUs of the Southern Islands series are built from these "bricks". For the first time in AMD graphics chips, the architecture uses a non-VLIW design with vector and scalar blocks, and one of the most important changes is that each GCN compute unit has its own scheduler and can execute instructions from different programs (kernels).

The new computing architecture is designed to keep the compute units efficiently loaded in a multitasking environment. The GCN compute unit is divided into four subsections, each of which works on its own instruction stream every clock cycle. Threads can also use the scalar block in the GCN unit for flow control or pointer operations. The combination of vector and scalar blocks offers a very simple programming model. For example, function pointers and stack pointers are much easier to program, and the compiler's task is greatly simplified, since the execution units are scalar.

Each GCN block has a dedicated 64 KB local data store for data exchange or for extending the local register stack. The block also includes a first-level cache that supports both reads and writes, and a full-fledged texture pipeline (sampling and filtering units). The new compute unit is therefore able to work independently, without the central scheduler that in previous architectures was responsible for distributing work among blocks. Each GCN block now schedules and distributes commands on its own. One compute unit can execute up to 32 different command streams, which can come from different virtual address spaces in memory and are completely protected and independent of each other.

Previous AMD GPUs used the VLIW4 and VLIW5 architecture models. Although these are good enough for graphics tasks, they are not efficient enough for general-purpose computing, because it is very difficult to keep all the execution units busy under such conditions. The new GCN architecture offers a similarly large number of execution units, but with scalar execution that removes the limitations and dependencies between registers and instructions. The transition from VLIW to scalar execution noticeably simplifies code optimization.

When targeting the previous VLIW4 architecture, the compiler has to deal with register conflicts, perform a complex distribution of instructions across execution units at compile time, and so on. High performance often requires non-trivial optimization, which works for most graphics tasks but is much less flexible for other calculations. The new architecture offers significantly simpler development and support, easier creation, analysis and debugging of low-level code, and stable, predictable performance.
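To make the VLIW packing problem concrete, here is a toy model, not a description of AMD's actual compiler or hardware. It assumes that a VLIW4 bundle can hold up to four operations of one work-item, and only operations whose inputs were produced by earlier bundles. The function names and the greedy scheduling strategy are assumptions made purely for illustration.

```python
from typing import Sequence, Set


def vliw_bundle_count(deps: Sequence[Sequence[int]], width: int = 4) -> int:
    """Greedily pack operations into VLIW bundles.

    deps[i] lists the indices of the operations that operation i depends on.
    An operation may enter a bundle only if all of its dependencies were
    completed in earlier bundles.
    """
    done: Set[int] = set()
    remaining = list(range(len(deps)))
    bundles = 0
    while remaining:
        ready = [i for i in remaining if all(d in done for d in deps[i])][:width]
        if not ready:
            raise ValueError("dependency cycle or reference to a missing op")
        done.update(ready)
        remaining = [i for i in remaining if i not in done]
        bundles += 1
    return bundles


def vliw_utilization(deps: Sequence[Sequence[int]], width: int = 4) -> float:
    """Fraction of VLIW slots that carry useful work."""
    if not deps:
        return 0.0
    return len(deps) / (vliw_bundle_count(deps, width) * width)


independent = [[], [], [], []]   # four unrelated operations
chain = [[], [0], [1], [2]]      # each operation needs the previous result

print(f"independent ops: {vliw_utilization(independent):.2f}")
print(f"dependent chain: {vliw_utilization(chain):.2f}")
```

In this model a dependent chain leaves three of the four VLIW slots empty. With scalar execution per lane, as the text describes for GCN, each lane issues one operation of its own work-item per step, so keeping the hardware busy depends on having enough work-items rather than on finding independent operations inside one of them.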

Memory caching subsystem

There is never enough memory and cache bandwidth, and there is always both a need and a way to increase it. AMD's new GPUs use a full two-level read/write cache. Each compute unit has 16 kilobytes of first-level cache, and the second-level cache totals 768 kilobytes (the whole chip has 512 KB of L1 and 768 KB of L2). That is 50% more than in the previous chip, whose L2 cache could not be written to at all.

In terms of performance, each GCN compute unit can read or write 64 bytes per clock from or to the L1 cache or global memory, which serves to exchange data between command streams. Each section of the L2 cache can transfer the same amount of data. As a result, the company's top GPU reaches about 2 terabytes/s for L1 and 700 GB/s for L2, which is 50% more than AMD's previous top solution.
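These aggregate figures can be checked with a short calculation. The sketch below uses the 64 bytes per clock and the 925 MHz core clock given in the article; the number of L2 sections is not stated in the text, so the value of 12 used here is an assumption (768 KB split into 64 KB sections), and the function name is illustrative.

```python
CORE_CLOCK_HZ = 925e6     # Radeon HD 7970 core clock
BYTES_PER_CLOCK = 64      # per compute unit (L1) or per L2 section
COMPUTE_UNITS = 32
L2_SECTIONS = 12          # ASSUMPTION: not stated in the article


def aggregate_bandwidth_gb_s(units: int) -> float:
    """Total bandwidth in GB/s when every unit moves 64 bytes each clock."""
    return units * BYTES_PER_CLOCK * CORE_CLOCK_HZ / 1e9


l1_total_kb = COMPUTE_UNITS * 16   # 16 KB of L1 per compute unit
l1_bw = aggregate_bandwidth_gb_s(COMPUTE_UNITS)
l2_bw = aggregate_bandwidth_gb_s(L2_SECTIONS)

print(f"L1: {l1_total_kb} KB total, about {l1_bw:.0f} GB/s")
print(f"L2: about {l2_bw:.0f} GB/s (with the assumed {L2_SECTIONS} sections)")
```

Under these assumptions the results land close to the rounded "2 terabytes/s" and "700 GB/s" quoted above.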

Tahiti GPU

Now that we've looked at the low-level architectural changes in the new Southern Islands series, it's time to move on to the details of the line's most powerful solution, the Radeon HD 7900, which includes two models. First of all, note the huge complexity of the new GPU: it includes more than 4.3 billion transistors, twice as many as the chip on which the Radeon HD 5870 is based! Naturally, such a powerful chip became possible only thanks to the new 28 nm process technology. So what is inside it?

The number of geometry blocks has not changed compared to Cayman: there are still two of them, but they work significantly more efficiently, and we will dwell on this in more detail a little later. On the GPU diagram we see the 32 GCN compute units available on the Radeon HD 7970; in the junior solution, some of them are disabled. The peak computing performance of the solution is almost 3.8 teraflops (trillions of floating point operations per second), which is an absolute record for a GPU today.

Each GCN block has 4 texture units, which gives a total of 128 TMUs per chip, or more than 118 gigatexels/sec. This is another record at the time of release, and it will not be the last. But the number of ROP blocks has not changed: there are still 32 of them in 8 enlarged RBE blocks. Another interesting architectural change is that ROP blocks are now "attached" not to memory channels, as before, but to GCN blocks.

Although the theoretical framebuffer write speed has not changed much, and the maximum is still 32 color values and 128 depth values per clock, the practical fill rate in real applications has increased significantly thanks to the increased memory bandwidth. AMD measured Cayman at only 23 pixels per clock, while the new Tahiti approaches the theoretical 32 pixels per clock.

This is understandable, because the new AMD video chip has a 384-bit memory bus made up of six 64-bit channels, just like the competitor's current top-end solution. It is this 1.5-fold increase in memory bandwidth that makes it possible to raise the actual speed of texture fetches and framebuffer writes. The bandwidth of 264 GB/s should help the card get close to the theoretical figures of 118 gigatexels/sec and 30 gigapixels/sec, and we will check this in the practical part.

In the "stripped down" version used in the Radeon HD 7950, the Tahiti graphics processor has 28 active GCN compute units out of the 32 physically present on the chip: for the junior solution of the Radeon HD 7900 series, it was decided to disable four of them. Since each GCN unit has 4 texture units, the new model has 112 TMUs in total, which gives a throughput of almost 90 gigatexels/sec.

But the number of ROPs and memory controllers in the HD 7950 has not changed: AMD decided not to cut them down and kept the same 32 and 6, respectively. The Tahiti Pro video chip therefore has the same 384-bit memory bus, assembled from six 64-bit channels, as the top AMD solution. Apparently, it is the compute units that suffer most from manufacturing defects, so everything else was left intact.

Tessellation and Geometry Processing

From an architectural point of view, nothing special has changed in Tahiti's geometry blocks since Cayman. The chip still uses two blocks for processing geometric data (vertex setup and tessellation) and for rasterization, and the scheme is very similar to the one we saw earlier, except that the tessellators are now called 9th generation:

Despite the schematic similarities, the latest generation of these blocks is capable of significantly better tessellation and geometry processing performance, since the blocks have undergone significant modifications. Although the peak performance only increased to almost two billion vertices and primitives per second (925 MHz and two vertices per clock), the actual performance increased more. This was achieved by increasing the size of caches, improving buffering of geometry data, and reusing vertex data.

As a result, tessellation performance is improved at all triangle split ratios by up to four times over the previous generation Radeon HD 6970. But four times is not achieved in all cases, even on the diagram from AMD itself:

The chart compares the tessellation performance of the Radeon HD 7970 with that of the HD 6970 at split ratios from 1 to 32. As you can see, the performance difference ranges from 1.7 to 4 times. But this is a purely synthetic test. To get closer to reality, here is more data on tessellation speed in gaming applications:

As you can see, AMD's synthetic numbers are well supported by gaming ones - performance in real applications with "heavy" tessellation has grown significantly. This is a very good result, which we will definitely check in the practical part, using the example of synthetics and gaming applications.

Non-graphic calculations

For heterogeneous and non-graphics computing tasks, two Asynchronous Compute Engines (ACE) have been added. They schedule and distribute work between execution units for efficient multitasking and work in conjunction with the graphics Command Processor.

The Radeon HD 7900 has two independent compute engines and one graphics engine. In total, this gives three programmable blocks and three instruction streams, completely separated from each other. And in addition to asynchronously issuing commands for fast context switching, the new GPU also features two bi-directional direct memory access (DMA) controllers introduced in the Cayman. These two controllers are required to take full advantage of the new PCI Express 3.0 bus.

As we know, for serious calculations not only the speed of single-precision floating point operations matters, but also that of double-precision ones. The new AMD architecture does a pretty good job here. At the moment, two versions of GCN compute units are expected, with different execution rates for FP64 instructions. For the top GPU, the rate is 1/4 of the FP32 speed, and for the junior chips it is 1/16, which is quite enough to maintain compatibility without complicating inexpensive solutions too much. As a result, the Radeon HD 7970 is capable of 947 billion double-precision operations per second (alas, just short of a teraflop!), which is another record for the new AMD chip.
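The peak figures quoted for the Radeon HD 7970 can be reproduced from the ALU count and clock. The sketch below assumes that each ALU performs one fused multiply-add per clock, counted as two floating point operations; this counting convention is an assumption that the article does not spell out, and the function name is illustrative.

```python
def peak_gflops(alus: int, core_mhz: float, rate: float = 1.0,
                flops_per_alu_per_clock: int = 2) -> float:
    """Peak throughput in GFLOPS.

    rate is the instruction rate relative to FP32 (1/4 for FP64 on the
    top GPU according to the article). Two FLOPs per ALU per clock assumes
    a fused multiply-add every cycle (ASSUMPTION).
    """
    return alus * flops_per_alu_per_clock * core_mhz * rate / 1000


fp32 = peak_gflops(2048, 925)
fp64 = peak_gflops(2048, 925, rate=1 / 4)

print(f"FP32: {fp32:.1f} GFLOPS")
print(f"FP64: {fp64:.1f} GFLOPS")
```

With these inputs the single-precision result matches the "almost 3.8 teraflops" mentioned earlier, and the double-precision result matches the 947 billion operations per second given here.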

Moreover, these are not the same gigaflops as in the case of previous architectures, but more “fat” ones. After all, the efficiency of the new GPU in complex computing tasks should seriously increase. First, the memory and caching subsystem has been improved. Secondly, each GCN compute unit has its own scheduler, which should improve branching code execution and overall efficiency. And thirdly, we note the scalar execution, which does not require complex optimizations from the compiler, as a result of which the computing units will be idle much less often. And as a result, in any tasks, it will be easier for the new chip to show high performance and ALU loading.

Among other innovations related to computing capabilities, we note full ECC support for DRAM and SRAM. On the software side, it is important that Tahiti is the first GPU with full support for the new API versions and their features: OpenCL 1.2, DirectCompute 11.1 and C++ AMP. For example, OpenCL 1.2 allows you to combine the capabilities of several computing devices into one, and AMD has already released support for this in the form of AMD APP SDK 2.6 and the Catalyst 11.12 driver.

Architectural Performance and Efficiency

After reviewing all the architectural innovations using the top chip of the Southern Islands series as an example, it's time to talk about how effective these changes are. It is clear that the new chips are much faster than the previous ones; the opposite would be quite surprising. The question is how much faster. In various tasks the figures range from 40-50% (minimum!) to a fivefold difference. Improvements in the architecture make it possible to exceed the theoretical 1.4-fold difference in raw gigaflops. Let's look at this with examples:

The diagram compares the new top solution with the previous single-chip solution, the Radeon HD 7970 and HD 6970, which is quite fair. Various performance tests were selected: SmallptGPU and LuxMark perform ray tracing in OpenCL, SHA256 is a secure hashing algorithm, and AES256 is a symmetric encryption algorithm. Finally, Mandelbrot is a well-known problem computed here in double precision.

The vertical broken line in the graph indicates the theoretical difference in performance, but the speed data shows that in three out of five tasks the speed of the new GPU was significantly higher. This is due to all the changes aimed at increasing efficiency: moving away from VLIW, the presence of a scheduler in each computing unit, improved caching, etc.

Changes in rendering quality

Actually, this part could well have been skipped, since there have been no particular complaints about image quality lately, nor could there be, for various reasons. For example, the quality of full-screen anti-aliasing on video cards from different manufacturers is very close, especially considering the widespread use of software anti-aliasing methods based on post-processing filters, which run on all GPUs in exactly the same way.

The same applies to texture filtering: its quality is now such that it is very difficult to distinguish between AMD and NVIDIA solutions even in a pixel-by-pixel comparison. The Radeon HD 6900, the company's previous generation, improved anisotropic filtering a little more, and now even a "microscope" will not help to find any significant flaws there. The only remark is that in motion Radeon video cards are slightly inferior to GeForce because of more noticeable specific artifacts, such as "noise" or "sand".

With the release of the new generation of video chips, the texel weights in the texture filter were revised once again, modified in such a way as to reduce such artifacts, sometimes visible on the Radeon HD 6900 in the presence of textures of a certain type (“high-frequency”, with sharp transitions from dark to light, for example). The change in quality is so hard to show with examples that AMD doesn't provide HD 7900 versus HD 6900 comparison pictures, but simply compares the quality of the "hardware" algorithm with a purely software algorithm running on GPU stream processors, and therefore ideal:

On such a small screenshot, the difference in quality is not visible, but AMD assures that all the changes made did not bring any performance drop and did not worsen the image quality in any of the aspects - it still does not depend on the angle and the filtering quality is close to ideal. In one of the future practical materials, we will definitely check this.

Partially Resident Textures

The idea of Partially Resident Textures (PRT) is to use a hardware capability of the new GPU: virtual memory. Many users have surely already seen id Software's RAGE, which uses virtual texturing technology, so-called megatexturing ("MegaTexture"). It makes it possible to use huge amounts of texture data and stream them into video memory.

With virtual video memory, it is very easy to provide efficient hardware support for such algorithms. They allow an application to use up to 32 terabytes of textures, which makes it possible to build unique locations in games, without repeating pieces of textures and without problems loading texture data. Admittedly, AMD's illustrative example is rather strange and does not make anything particularly clear:

PRT allows you to achieve high picture quality and helps to increase the efficiency of video memory usage. Similar algorithms are already used in the id Software engine, and are expected to appear in many next generation engines. Games of the future need to work with huge amounts of data, and the advantage of the new GPU is that local graphics memory in PRT algorithms works like hardware cache memory, and textures are loaded into it when necessary. GPUs from the Southern Islands family support "mega-textures" up to 32 terabytes (resolution up to 16384×16384) and, most importantly, hardware texture filtering for them, which is not available on earlier video chips.

Virtual textures are divided into pieces of 64 kilobytes (kilobytes, not texels), and this piece size is fixed. Only the pieces needed to render the current frame are loaded into the local memory of the video card. The technology works regardless of the texture format; only the size of a piece in texels differs. For example, for a regular uncompressed texture with 32 bits per texel, the piece will be 128x128 texels, and for a DXT3-compressed texture it will be 256x256 texels.
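The piece sizes above follow from dividing the fixed 64 KB by the storage cost of one texel. A minimal sketch is shown here, assuming power-of-two piece dimensions that are as close to square as possible; the function name and this shape rule are assumptions for illustration. DXT3 stores a 4x4 block in 16 bytes, which is 8 bits per texel.

```python
TILE_BYTES = 64 * 1024  # fixed piece size from the article


def tile_shape(bits_per_texel: int) -> tuple:
    """Return (width, height) in texels of one 64 KB piece.

    ASSUMPTION: dimensions are powers of two and as square as possible;
    when the texel count is an odd power of two, the width is doubled.
    """
    texels = TILE_BYTES * 8 // bits_per_texel
    exponent = texels.bit_length() - 1
    if 1 << exponent != texels:
        raise ValueError("texel count per piece is not a power of two")
    width = 1 << ((exponent + 1) // 2)
    height = 1 << (exponent // 2)
    return width, height


print("32-bit uncompressed:", tile_shape(32))
print("DXT3 (8 bits per texel):", tile_shape(8))
```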

The technology also involves the use of mip-levels of textures (smaller copies used in texture filtering). When rendering and filtering, they need to be accessed multiple times. Consider the operation of the algorithm on an example.

This figure highlights four different pieces from different mip levels that are required for rendering. When the shader program requests data from them, some of the pieces are already in local memory, and this data is immediately sent to the shader for further calculations. But some pieces are missing from the table, and on such a miss the application must decide what to do next. For example, it can request data from a lower-resolution mip level: the image will be blurry, but it will at least look plausible and will be drawn without delay. By the time the next frame is rendered, the missing piece may already have been loaded into the cache, that is, into local video memory. Those who played RAGE will understand us.
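The miss handling described above can be sketched in software. This is a toy model of the idea, not AMD's hardware interface or any real graphics API: the class, its method names and the pending queue are all hypothetical. It assumes that each coarser mip level halves the resolution, so the piece covering the same area has its coordinates halved.

```python
from typing import Dict, List, Optional, Tuple

TileKey = Tuple[int, int, int]  # (mip level, piece x, piece y)


class PartiallyResidentTexture:
    """Toy model: resident pieces live in a dict, misses fall back to coarser mips."""

    def __init__(self, mip_levels: int) -> None:
        self.mip_levels = mip_levels
        self.resident: Dict[TileKey, str] = {}
        self.pending: List[TileKey] = []  # pieces to stream in for later frames

    def make_resident(self, mip: int, tx: int, ty: int, data: str) -> None:
        key = (mip, tx, ty)
        self.resident[key] = data
        if key in self.pending:
            self.pending.remove(key)

    def sample(self, mip: int, tx: int, ty: int) -> Optional[Tuple[str, int]]:
        """Return (data, mip level actually used), or None if nothing is resident."""
        wanted = (mip, tx, ty)
        if wanted in self.resident:
            return self.resident[wanted], mip
        if wanted not in self.pending:
            self.pending.append(wanted)  # request the missing piece
        level, x, y = mip + 1, tx // 2, ty // 2
        while level < self.mip_levels:
            if (level, x, y) in self.resident:
                return self.resident[(level, x, y)], level
            level, x, y = level + 1, x // 2, y // 2
        return None


texture = PartiallyResidentTexture(mip_levels=3)
texture.make_resident(2, 0, 0, "coarse piece")
print(texture.sample(0, 3, 2))   # miss: falls back to the coarse piece
texture.make_resident(0, 3, 2, "fine piece")
print(texture.sample(0, 3, 2))   # hit on the next frame
```

The first lookup returns blurry but usable data immediately and records the request; once the fine piece has been streamed in, the same lookup returns full-resolution data.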

This is a powerful algorithm that allows you to use huge textures that are unique to each object. Similar algorithms have long been used in offline rendering, only without the need for real-time calculation. AMD even made a demo using the Per-Face Texture Mapping technique developed by Walt Disney Animation Studios for their animated films. Unfortunately, the demo isn't ready yet, and we've only seen low-res screenshots.

The essence of this texture mapping technique is to assign a certain piece of texture to each polygon, without the need for UV mapping (finding a correspondence between coordinates on the surface of a three-dimensional object and coordinates on a 2D texture). This approach solves some of the problems with creating tessellated content by making the displacement mapping algorithm very simple. PRT is used in this method for efficient storage of and access to texture data.

Media Handling Instructions

An interesting innovation in Southern Islands is support for specialized instructions used in processing both still and moving images. For example, the widely used "sum of absolute differences" instruction, better known as SAD, has been improved. Its execution speed is a performance-critical bottleneck in many image and video processing algorithms, such as motion detection, gesture recognition, image search, computer vision, and many others.

However, we already wrote about SAD support in our review of the old Radeon HD 5870. Now, in addition to the usual SAD (4×1), Southern Islands has a new instruction - QSAD (quadruple SAD), which combines SAD with shift operations to increase performance and energy efficiency, as well as a "masked" MQSAD instruction that ignores background pixels and is used to isolate objects moving in the frame from the background.
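
For readers unfamiliar with the operation, here is what a 4×1 SAD computes, together with a masked variant in the spirit of MQSAD. This is a plain software sketch: the explicit mask argument is our assumption for illustration and does not describe the exact semantics of the hardware instruction.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized pixel blocks."""
    if len(block_a) != len(block_b):
        raise ValueError("blocks must have the same length")
    return sum(abs(a - b) for a, b in zip(block_a, block_b))


def masked_sad(block_a, block_b, mask):
    """SAD that skips pixels whose mask entry is 0 (treated as background)."""
    if not (len(block_a) == len(block_b) == len(mask)):
        raise ValueError("blocks and mask must have the same length")
    return sum(abs(a - b) for a, b, m in zip(block_a, block_b, mask) if m)


current = [10, 200, 35, 90]    # four 8-bit pixels from the current frame
previous = [12, 190, 35, 100]  # the same positions in the previous frame

print(sad(current, previous))                       # 2 + 10 + 0 + 10
print(masked_sad(current, previous, [1, 0, 1, 1]))  # second pixel ignored
```

The lower the SAD, the better the two blocks match, which is why motion search boils down to evaluating it for many candidate positions.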

The new GPUs can process up to 256 pixels per GCN compute unit per clock, which for the AMD Radeon HD 7970 means up to 7.6 trillion pixels per second with 8-bit integer color values. Although this is a theoretical figure, the visual processing capabilities of the new GPUs are quite impressive - many video processing tasks can be performed in real time.
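
The 7.6 trillion figure can be reproduced with simple arithmetic. The per-unit rate comes from the text; the compute unit count and core clock are the reference Radeon HD 7970 values, which are not stated in this section and are included here as assumptions.

```python
PIXELS_PER_CU_PER_CLOCK = 256  # from the text
COMPUTE_UNITS = 32             # assumption: reference Radeon HD 7970
CORE_CLOCK_HZ = 925_000_000    # assumption: reference Radeon HD 7970 clock


def peak_pixels_per_second(per_cu_per_clock, compute_units, clock_hz):
    """Theoretical peak pixel throughput of the whole GPU."""
    return per_cu_per_clock * compute_units * clock_hz


rate = peak_pixels_per_second(PIXELS_PER_CU_PER_CLOCK, COMPUTE_UNITS, CORE_CLOCK_HZ)
print(f"{rate / 1e12:.2f} trillion pixels per second")
```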

PCI Express 3.0

We couldn't pass by the support for the third version of PCI Express across the whole line of new Southern Islands graphics solutions. This support was quite expected: the PCI Express 3.0 specifications were finally approved in the fall of 2010, but for a long time there was no hardware that supported them. Now motherboards are already appearing, video cards were released at the end of 2011, and the corresponding central processors exist as well.

The updated interface has a transfer rate of 8 gigatransfers per second instead of 5 GT/s for version 2.0, and its bandwidth has once again doubled (up to 32 GB/s) compared to the PCI Express 2.0 standard. The new bus uses a different encoding scheme for data sent over the bus, but compatibility with previous versions of PCI Express has been preserved.
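
The link between the transfer rate, the encoding scheme and the resulting bandwidth can be checked with a short calculation. The text does not name the encodings; the sketch below assumes the commonly documented ones, 8b/10b for PCI Express 2.0 and 128b/130b for 3.0, and an x16 slot.

```python
def lane_bandwidth_mb_s(gigatransfers_per_s, payload_bits, encoded_bits):
    """Usable one-direction bandwidth of a single lane, in megabytes per second."""
    bits_per_s = gigatransfers_per_s * 1e9 * payload_bits / encoded_bits
    return bits_per_s / 8 / 1e6


gen2_lane = lane_bandwidth_mb_s(5, 8, 10)     # assumed 8b/10b encoding
gen3_lane = lane_bandwidth_mb_s(8, 128, 130)  # assumed 128b/130b encoding

LANES = 16  # assumption: a full-width graphics slot
gen3_x16_both_directions_gb_s = gen3_lane * LANES * 2 / 1000

print(f"PCIe 2.0 lane: {gen2_lane:.0f} MB/s")
print(f"PCIe 3.0 lane: {gen3_lane:.0f} MB/s")
print(f"PCIe 3.0 x16, both directions: {gen3_x16_both_directions_gb_s:.1f} GB/s")
```

The raw transfer rate grows only 1.6 times; it is the leaner encoding that brings the usable bandwidth close to double.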

The first motherboards with PCI Express 3.0 support were introduced in the summer of 2011, mainly based on the Intel Z68 chipset, and they reached the general market only in the autumn of the same year. So the video cards arrived in time, and AMD was once again ahead of the rest in the speed of releasing new graphics processors with support for the most advanced technologies. But it's too early to judge whether PCI-E 3.0 will be of any practical use.

AMD PowerTune technology

One of the most interesting innovations in Cayman was the PowerTune advanced power management technology. Flexible GPU power management has been around for a long time, but before the Radeon HD 6900 all these technologies were rather primitive, mostly software-based methods that changed the frequency and voltage in steps and could not turn off large parts of the video chip.

Even in the Radeon HD 5000 family, a performance limiter appeared that kicked in when a certain consumption level was exceeded, and in the Radeon HD 6900 the system moved to a qualitatively different level. To do this, special sensors that monitor load parameters were built into every block of the chip. The GPU constantly measures the load and power consumption and does not allow the latter to go beyond a certain threshold, automatically adjusting the frequency and voltage so that the parameters remain within the specified thermal envelope.

Unlike earlier power management technologies, PowerTune provides direct control over GPU power consumption rather than indirect control by changing frequencies and voltages. This technology makes it possible to set high GPU frequencies and get high performance in games without fear that consumption may go beyond safe limits. After all, most games and common applications that use GPU computing have significantly lower power requirements and do not approach dangerous power consumption limits, unlike stability tests like FurMark and OCCT.

Even the heaviest games do not require maximum power consumption, and if the frequency is capped on the basis of extreme tests, then in 3D games a good deal of performance and power headroom goes unused. As long as the video card has not reached the safe consumption limit, the GPU operates at the factory-set frequency, while in the FurMark and OCCT tests the GPU frequency decreases to stay within the consumption limits.

Thus, PowerTune helps to set higher factory frequencies and tune the system for the most efficient use of GPU resources at the maximum power level set. In the example shown above, the HD 5870 does not use PowerTune, and because its GPU frequency is limited to cope with the high consumption in stability tests, it does not use its full potential. For the Radeon HD 7970, by contrast, a maximum TDP is set, and the video chip lowers its frequencies only when that limit is exceeded, delivering the highest possible performance in any application.

This is clearly shown in the following diagram. In gaming applications, the GPU frequency can be raised until the TDP is reached, while under the peak loads of stability tests the frequency is reduced to a safe level of power consumption. Without PowerTune, you would have to choose: either risk a video card failure when FurMark and OCCT run for a long time, or give up potential performance in games. The new technology addresses these issues in the most efficient way.

AMD PowerTune features fast responsiveness to changing conditions (microseconds) as it is a hardware technology. It is also distinguished by flexible frequency tuning, and not stepped, as it was in previous chips. All measurements are independent of the driver, but can be adjusted by the user using the video card settings.

The difference between PowerTune and the previously generally accepted approach is that in other cases, thermal throttling is used, which puts the GPU in a significantly reduced consumption mode, and PowerTune simply smoothly reduces its frequency, bringing the GPU consumption to the set limiter. This achieves higher clock frequencies and performance.
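
The contrast between the two approaches can be illustrated with a toy model. Everything in it is hypothetical: the wattages and clocks are made up, and power is assumed to scale linearly with frequency, which ignores voltage and is far simpler than what the hardware actually does.

```python
def powertune_clock(base_mhz, watts_at_base, limit_watts):
    """Smooth approach: lower the clock just enough to reach the power limit."""
    if watts_at_base <= limit_watts:
        return base_mhz
    return base_mhz * limit_watts / watts_at_base  # toy linear power model


def throttled_clock(base_mhz, watts_at_base, limit_watts, low_state_mhz):
    """Stepped approach: drop to a fixed low-power state once the limit is hit."""
    return base_mhz if watts_at_base <= limit_watts else low_state_mhz


BASE_MHZ, LIMIT_W = 925, 250  # hypothetical card

for name, watts in [("game", 200), ("stability test", 300)]:
    smooth = powertune_clock(BASE_MHZ, watts, LIMIT_W)
    stepped = throttled_clock(BASE_MHZ, watts, LIMIT_W, low_state_mhz=500)
    print(f"{name}: smooth {smooth:.0f} MHz, stepped {stepped} MHz")
```

In the game both approaches keep the full clock; in the stability test the smooth limiter gives up far less frequency than the fixed low-power state.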

AMD ZeroCore Technology

AMD was not limited to the use of power management technology already known from previous solutions. In the first chips of the Southern Islands family, it introduces AMD ZeroCore technology, which helps to achieve even greater energy efficiency in the "deep idle" (or "sleep") mode with a disabled display device, which is supported by all operating systems.

After all, almost any system, even a gaming one, spends most of its time with a low load on the graphics processor, and the video card should not consume much power in this mode. This is all the more true when the monitor is turned off - in that case, it is advisable to turn off the GPU altogether. That's what AMD did. Thanks to ZeroCore, the new GPU in deep idle consumes less than 5% of its full-mode power, disabling most of its functional blocks.

AMD provides a schematic comparison with its own Radeon HD 5870, which did not support this technology. Among desktop solutions, ZeroCore is an innovation exclusive to Southern Islands; it came from mobile GPUs designed for laptops. By the way, the advantages of this technology are not limited to reduced consumption: in long idle mode, when the display is turned off, the video card also completely turns off the fan of its cooler!

This is exactly what many users have been waiting for a long time. The most interesting thing is that according to our data, laboratory tests of solutions like PowerTune and ZeroCore took place several generations of video cards ago. Some of the engineering samples of video cards from AMD's series that have long since left the market worked exactly like this, completely turning off the cooler in idle time.

But it's not just single-GPU users who benefit from the lower noise and power consumption of AMD's new ZeroCore-enabled graphics cards. Similar improvements await happy owners of CrossFire systems based on two, three, and even four GPUs. It is only logical that while the operating system's two-dimensional interface is being drawn, all video cards except the main one should not work at all. Yet at present they keep working!

In the case of CrossFire systems on video cards with ZeroCore support in 2D mode, all secondary video cards are immersed in a deep sleep with minimal power consumption and a disabled cooler. This mode works both for several single-chip video cards and for two-chip solutions. In addition, the primary CrossFire graphics card will also enter this mode in the event of a long idle time configured in Windows. Visually, the difference in work looks like this:

By the way, the technology is not as simple as it might seem. AMD engineers had to solve a lot of issues related to the operation of the operating system in idle mode. For example, they found that Windows tries to update information on the screen even when the monitor is turned off, which, of course, does not allow the GPU to be disabled. Therefore, the company's programmers had to resort to a workaround, ignoring all screen drawing commands when the monitor is turned off in sleep mode.

AMD Eyefinity 2.0 Technology

Naturally, in the new architecture there was a place for improvements to the proven technology for displaying images on multiple monitors - AMD Eyefinity, now in version 2.0. It has received new features, higher resolutions, support for more displays and increased flexibility.

This technology is quite interesting, although an extremely small number of users will find space in the room and muster up the courage in front of the family to install more than two monitors. But it is better to have the opportunity to always be able to use it than not to have it at all. Moreover, the prices for monitors of large diagonals almost do not decrease, but mid-range solutions are constantly getting cheaper.

Indeed, now it is more profitable to buy three monitors with a screen diagonal of 24″ than one 30-inch one. AMD gives just such an example, where a 30″ 2560x1600 monitor costs over $1,000, while three 24″ FullHD monitors can be bought for half that price:

But how to spend your money and space in the room is a personal matter for each user. The main thing is that there is such an opportunity. Plus, Eyefinity 2.0 now supports image output in HD3D stereo mode - something that was lacking in previous solutions, which were inferior to competing ones in this respect. With AMD's Eyefinity and HD3D technologies combined, the Radeon HD 7970 is the first single-chip solution to support three monitors in stereo mode.

High resolution stereo rendering requires a very fast data transfer interface. And with previous versions of the HDMI outputs, the possibilities were limited to 24Hz per eye, which is just enough for watching Blu-ray 3D movies, but clearly not enough for gamers.

For such tasks, they began to use the frame packing format, when the frames for the left and right eyes are combined into one, and AMD Radeon HD 7970 supports the HDMI 1.4a frame packing format for stereo output. This is the first video card to support 3 GHz HDMI with frame packing, where each eye has a FullHD picture at 60 Hz (120 Hz in total):
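
A rough calculation shows why frame packing at 60 Hz per eye needs the faster link. The timing totals below are the standard 1080p60 values including blanking, and the reading of "3 GHz" as roughly 3 Gbit/s per TMDS channel is our interpretation; neither is stated in the text.

```python
H_TOTAL = 2200   # assumption: total pixels per line for 1080p, including blanking
V_TOTAL = 1125   # assumption: total lines per frame for 1080p, including blanking
REFRESH_HZ = 60
TMDS_BITS_PER_PIXEL_PER_CHANNEL = 10  # assumption: 8 data bits sent as 10 link bits


def pixel_clock_hz(h_total, v_total, refresh_hz, eyes=1):
    """Pixel clock when the frames for `eyes` views are stacked into one tall frame."""
    return h_total * v_total * eyes * refresh_hz


mono = pixel_clock_hz(H_TOTAL, V_TOTAL, REFRESH_HZ)
stereo = pixel_clock_hz(H_TOTAL, V_TOTAL, REFRESH_HZ, eyes=2)

print(f"1080p60 mono:   {mono / 1e6:.1f} MHz")
print(f"1080p60 stereo: {stereo / 1e6:.1f} MHz")
print(f"per-channel rate: {stereo * TMDS_BITS_PER_PIXEL_PER_CHANNEL / 1e9:.2f} Gbit/s")
```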

Another interesting novelty, in our view, is the Discrete Digital Multi-Point Audio (DDMA) multi-channel audio output technology, which works together with Eyefinity. All previous GPUs were capable of outputting only one audio stream via HDMI and DisplayPort. That is, even if three monitors located in different rooms are connected to the PC via HDMI, only one audio channel is transmitted. The AMD Radeon HD 7900, however, supports the simultaneous output of several independent audio channels, which may well come in handy in some multi-monitor configurations.

The same feature will be very useful for video conferencing applications with the output of several interlocutors on separate screens, as well as multitasking applications such as playing on three monitors with game audio and watching news on a separate screen with an independent audio stream. Previously, for all this it was necessary to use several separate audio systems, but now everything works as conveniently as possible.

Eyefinity's software support has not been forgotten either: the technology is updated almost every month, and new features appear. So, back in October, support for resolutions up to 16384 × 16384 and new multi-monitor configurations appeared: horizontal and vertical 5 × 1, as well as six monitors in 3 × 2 mode.

The AMD Catalyst video driver update in December brings Eyefinity and HD3D to work together, and in February, support for custom resolutions, taskbar placement tweaks, and improvements to preset management are announced.

Output to six monitors can be achieved using two DisplayPort 1.2 ports and two MST hubs (which we wrote about earlier), while three or even four monitors require only one port and the corresponding hub. These hubs offer flexible display configurations, support up to four FullHD devices per DisplayPort 1.2 connector, and should be available by summer 2012.
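
The port counts follow from the limit of four FullHD displays per connector, and that limit itself is consistent with the link budget. The sketch below assumes the published DisplayPort 1.2 HBR2 figures (four lanes at 5.4 Gbit/s with 8b/10b encoding) and 24-bit 1080p60 streams with a 148.5 MHz pixel clock; none of these numbers appear in the text.

```python
import math

MAX_FULLHD_PER_PORT = 4  # from the text

# Assumptions about the link and the video streams:
HBR2_PAYLOAD_BITS_PER_S = 4 * 5_400_000_000 * 8 // 10  # 4 lanes, 8b/10b encoding
FULLHD_STREAM_BITS_PER_S = 148_500_000 * 24            # 1080p60, 24 bits per pixel


def ports_needed(monitors):
    """DisplayPort 1.2 connectors (each with an MST hub) needed for the monitors."""
    return math.ceil(monitors / MAX_FULLHD_PER_PORT)


def streams_fit(count):
    """Whether `count` FullHD streams fit into one HBR2 link under the assumptions."""
    return count * FULLHD_STREAM_BITS_PER_S <= HBR2_PAYLOAD_BITS_PER_S


print(ports_needed(3), ports_needed(4), ports_needed(6))
print(streams_fit(4), streams_fit(5))
```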

Speaking of resolution: high, or even ultra-high (Ultra High Resolution). Current devices with a resolution of 4000 pixels on the longer side require connection with several cables at once: two DP 1.1 or four DVI. Next-generation monitors of this resolution will be connected with a single cable: DP 1.2 HBR2 or HDMI 1.4a 3 GHz. AMD's new video card is already ready for such monitors, once again the first in the world.

Video encoding and decoding

It is quite natural that the AMD Radeon HD 7970 includes the same UVD block for decoding video data, which appeared in the previous generation of the company's video chips. It simply doesn't need any modifications, supporting the MVC multi-stream codec, MPEG-2/MPEG-4 (DivX), VC-1 and H.264 decoding, as well as decoding two FullHD streams in all supported formats.

AMD solutions provide the highest quality of video stream decoding, use several dozens of special quality enhancement algorithms and provide maximum results in quality tests like HQV. Among the supported features, we note: color and tone adjustment, noise reduction, sharpening, high-quality scaling, dynamic contrast, advanced deinterlacing, and inverse telecine. Here is an example of on-the-fly contrast enhancement:

But with decoding, all video chips have been more or less in order for a long time. All new GPUs provide decent quality and performance when viewing video data. But video encoding on the GPU is still in its infancy, and the main complaints from users are directed to the low quality of the resulting compressed image.

Perhaps the new Radeon HD 7000 series can help with this, because all the graphics processors in the series have a Video Codec Engine (VCE) video encoding unit. The Radeon HD 7970 was the first video card to support hardware-accelerated encoding and video compression using a specialized block (earlier, stream processors took part in encoding).

Quality and performance should be clearly better than before, with support for 1080p at 60fps encoding, even faster than real-time. It's hard to say anything about quality without tests, but we are promised different levels of encoder optimization for video data and games, as well as variable compression quality (the ability to choose between improving quality or performance).

So far, there is nowhere to try VCE - there are simply no applications that support it, but AMD is working with partners such as ArcSoft to bring VCE support to the relevant software products. In the future, AMD plans to release a software library for accelerating video encoding, which will make it easier for developers to support next-generation AMD products.

Encoding can be done in two modes: full and hybrid (using the capabilities of GPU stream processors). Full mode is designed for applications that require maximum energy efficiency and a consistent level of performance. Full mode encoding on VCE is faster than real time and provides low latency. But there is also a hybrid mode:

In this mode, GPU math blocks work together with VCE. All highly parallelizable stages, which are circled in yellow in the diagram, can use the power of the GCN computational units, and the dedicated VCE unit is engaged in efficient hardware entropy coding. This mode is well suited for video cards with great mathematical power, such as the Radeon HD 7970. There are still questions about the quality of these two modes, but this requires a thorough analysis in a separate article.

AMD Steady Video

In addition to encoding and decoding video data, there is another area where the power of AMD's new graphics can be used - improving poor-quality handheld videos without the use of a tripod or other similar image stabilization tools. The video stabilization technology is called AMD Steady Video, and its second version has already been released.

The algorithm of the software stabilizer is quite simple: based on the video stream, statistics are collected about the movement of the camera (shift, rotation, zoom) and this movement is compensated in the current frame, relative to the previous ones - the image is shifted, rotated and scaled so that the picture does not jump much and remains stable.
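
The principle can be sketched for the simplest case, a pure shift. This is an illustration only, not AMD's algorithm: it ignores rotation and zoom, searches every candidate shift exhaustively, and uses the mean absolute difference over the overlapping area as the match score.

```python
def match_cost(prev, cur, dx, dy):
    """Mean absolute difference between prev and cur displaced by (dx, dy)."""
    h, w = len(prev), len(prev[0])
    total = count = 0
    for y in range(h):
        for x in range(w):
            sx, sy = x + dx, y + dy
            if 0 <= sx < w and 0 <= sy < h:
                total += abs(prev[y][x] - cur[sy][sx])
                count += 1
    return total / count


def estimate_shift(prev, cur, radius):
    """Camera shift (dx, dy) within +/-radius that best maps prev onto cur."""
    candidates = [(dx, dy)
                  for dy in range(-radius, radius + 1)
                  for dx in range(-radius, radius + 1)]
    return min(candidates, key=lambda s: match_cost(prev, cur, *s))


def compensate(cur, dx, dy, fill=0):
    """Move the current frame back by the estimated shift."""
    h, w = len(cur), len(cur[0])
    return [[cur[y + dy][x + dx] if 0 <= x + dx < w and 0 <= y + dy < h else fill
             for x in range(w)]
            for y in range(h)]


# Tiny synthetic test: the second frame is the first one moved by (1, 2).
prev = [[7 * x + 13 * y for x in range(8)] for y in range(8)]
cur = [[0] * 8 for _ in range(8)]
for y in range(6):
    for x in range(7):
        cur[y + 2][x + 1] = prev[y][x]

shift = estimate_shift(prev, cur, radius=2)
stabilized = compensate(cur, *shift)
print(shift)
```

Even on this 8×8 frame the search evaluates 25 candidate shifts over every pixel, which hints at the cost on a FullHD stream.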

As simple as it is in words, it is just as difficult to implement. Simply because there are two million pixels on the screen, and up to 30 or even 60 frames per second. Imagine how many calculations you need to do to track all possible frame shifts. We have already written above about the QSAD function used in video processing, and it is also used in Steady Video 2.0 to speed up the motion detection algorithm. So the GPU must process random shifts with an amplitude of up to 32 pixels in any direction, and this requires a performance corresponding to more than 500 billion SAD operations per second (for 1920x1080 at 60 FPS).
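
The "more than 500 billion" figure can be reconstructed under one assumption of ours: every pixel of every frame is compared once for each candidate shift within ±32 pixels horizontally and vertically.

```python
WIDTH, HEIGHT, FPS = 1920, 1080, 60  # from the text
MAX_SHIFT = 32                       # from the text

# Assumption: an exhaustive search over a square window of candidate shifts.
candidate_shifts = (2 * MAX_SHIFT + 1) ** 2
sad_ops_per_second = WIDTH * HEIGHT * FPS * candidate_shifts

print(candidate_shifts)
print(f"{sad_ops_per_second / 1e9:.0f} billion per-pixel SAD operations per second")
```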

Thanks to support for the new QSAD instructions, the Radeon HD 7970's advantage over powerful CPUs in the motion detection algorithm exceeds 10x! That is, we will now get high-quality video not only when processing home videos in video editors, but also when watching other people's online videos, shot on who knows what and who knows how.

Details: Radeon HD 7800 Series

  • Chip codename: "Pitcairn"
  • Production technology: 28 nm
  • 2.8 billion transistors (slightly more than the Cayman, which is the basis of the Radeon HD 6900 series)
  • Unified architecture with an array of common processors for streaming processing of multiple types of data: vertices, pixels, and more.
  • Hardware support for DirectX 11.1, including the shader model Shader Model 5.0
  • 256-bit memory bus: four 64-bit wide controllers with GDDR5 memory support
  • Core clock: up to 1000 MHz (for Radeon HD 7870)
  • 20 GCN Compute Units with 80 SIMD cores for a total of 1280 floating point ALUs (Integer and Float formats, IEEE 754 FP32 and FP64 precision support)
  • 80 texture units, with support for trilinear and anisotropic filtering for all texture formats
  • 32 ROPs with support for anti-aliasing modes with the possibility of programmable sampling of more than 16 samples per pixel, including with FP16 or FP32 framebuffer format. Peak performance up to 32 samples per clock, and in colorless mode (Z only) - 128 samples per clock

Radeon HD 7870 Graphics Specifications

  • Core frequency: 1000 MHz
  • Number of universal processors: 1280
  • Number of texture units: 80, blending units: 32
  • Memory type: GDDR5
  • Memory capacity: 2 gigabytes
  • Theoretical maximum fill rate: 32.0 gigapixels per second.
  • Theoretical texture sampling rate: 80.0 gigatexels per second.
  • One CrossFire connector
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, two Mini-DisplayPort 1.2
  • Power consumption: 3 to 175 W
  • Two 6-pin power connectors
  • Dual slot design
  • US MSRP: $349

Radeon HD 7850 Graphics Specifications

  • Core frequency: 860 MHz
  • Number of universal processors: 1024
  • Number of texture units: 64, blending units: 32
  • Effective memory frequency: 4800 MHz (4×1200 MHz)
  • Memory type: GDDR5
  • Memory capacity: 2 gigabytes
  • Memory bandwidth: 153.6 gigabytes per second
  • Theoretical maximum fill rate: 27.5 gigapixels per second.
  • Theoretical texture sampling rate: 55.0 gigatexels per second.
  • One CrossFire connector
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, two Mini-DisplayPort 1.2
  • Power consumption: 3 to 130 W
  • Dual slot design
  • US MSRP: $249
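
The theoretical rates in both lists are simply the number of units multiplied by the core clock. A minimal check using the figures from the specifications above (the helper name is ours):

```python
def rate_giga_per_s(units, core_mhz):
    """Units times clock, expressed in billions of operations per second."""
    return units * core_mhz / 1000


# (ROPs, texture units, core clock in MHz), taken from the lists above
cards = {
    "Radeon HD 7870": (32, 80, 1000),
    "Radeon HD 7850": (32, 64, 860),
}

for name, (rops, tmus, mhz) in cards.items():
    fill = rate_giga_per_s(rops, mhz)
    texel = rate_giga_per_s(tmus, mhz)
    print(f"{name}: {fill:.1f} gigapixels/s, {texel:.1f} gigatexels/s")
```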

And this time, the principle of naming the company's products was not changed and the trends of the previous series were continued. The mid-budget series of video cards based on the GCN architecture differs from the top and budget lines in the second digit in the index: instead of 7 and 9, the number 8 is set, which is quite logical. Since AMD has taken the psychological threshold of 1000 MHz for the frequency of the GPU, the Radeon HD 7870 received the addition of "GHz Edition" to the name, indicating the adoption of this frequency.

From the name it is clear that the Radeon HD 7800 is more productive than the HD 7700 but slower than the older HD 7900 models. As for comparison with NVIDIA solutions, at the time of release the older HD 7870 competes with the GeForce GTX 570, and the younger one is aimed at fighting the GTX 560 Ti; NVIDIA has not yet released new 28 nm mid-range chips.

Both AMD models have the same 2 gigabytes of GDDR5 memory. They both use a 256-bit memory bus, so they could be fitted with 1, 2, or 4 GB. 1 gigabyte is too little, and 4 GB is too expensive for this price segment. Therefore, we can say that the ideal amount of 2 GB of video memory has been chosen: it is quite sufficient for the vast majority of games even at high resolutions, and not too expensive.

In other respects, from the consumer's point of view, the HD 7850 and HD 7870 models still differ. The older Radeon HD 7870 has a higher power consumption, so it needs two additional 6-pin power connectors, while the HD 7850 is content with only one. Both boards have a two-slot cooling system design, but most manufacturers produce boards with their own cooler design at the least, and often their own PCB as well.

Architectural features of the Radeon HD 7800 family

Above, we have carefully described all the features of the new Graphics Core Next (GCN) architecture, so we will repeat only the most important. All of the company's new GPUs offer excellent features and performance not only in graphics processing, but also in non-graphical computing, including a mixture of different types of calculations. Also, the new GCN architecture offers a serious simplification of code optimization tasks, simplification of development and support, as well as stable and predictable performance and, in general, quite high efficiency.

The base block of the new architecture is the GCN block, and all GPUs of the Southern Islands series are assembled from them. Consider the block diagram of the Pitcairn chip:

The diagram shows the Radeon HD 7870 GPU (the "simplified" HD 7850 differs from it by several disabled blocks), in which we see 20 compute units of the GCN architecture. In the junior solution of the Radeon HD 7800 series, four of them were disabled, leaving 16 active blocks. This corresponds to 1280 and 1024 stream processors, respectively (just like in the HD 7700 family, only with exactly twice as many blocks). Since each GCN unit has four texture units, the older model has 80 TMUs in total, and the younger one has 64.
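
The unit counts for every chip in the series follow from the number of active GCN blocks. A small sketch, using the per-block figures implied by the specifications above (4 SIMD cores of 16 ALUs each and 4 texture units per block); the function name is ours:

```python
SIMD_PER_CU = 4     # 80 SIMD cores across 20 compute units
ALUS_PER_SIMD = 16  # 1280 ALUs across 80 SIMD cores
TMUS_PER_CU = 4     # from the text


def gcn_totals(compute_units):
    """Return (stream processors, texture units) for a number of active GCN blocks."""
    stream_processors = compute_units * SIMD_PER_CU * ALUS_PER_SIMD
    texture_units = compute_units * TMUS_PER_CU
    return stream_processors, texture_units


for name, cus in [("Radeon HD 7870", 20), ("Radeon HD 7850", 16)]:
    print(name, gcn_totals(cus))
```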

The HD 7870 and HD 7850 also share the same number of ROPs and memory controllers, as do the two models of the youngest line. The number of ROP blocks was left quite high - 32 for both models. The memory bus on Pitcairn-based boards has been cut down to 256 bits, assembled from four 64-bit channels. This is not bad for a solution of this level, although it is one and a half times narrower than in the top line, because the memory bus is traditionally the first thing to be cut. Fortunately, the use of fast GDDR5 memory gives a relatively high bandwidth of 153.6 GB/s.
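
The bandwidth figure follows from the bus width and the effective memory frequency. A minimal check; the 384-bit width of the top line is implied by the "one and a half times" remark rather than stated outright:

```python
def memory_bandwidth_gb_s(bus_width_bits, effective_mhz):
    """Peak memory bandwidth: bytes per transfer times transfers per second."""
    return bus_width_bits / 8 * effective_mhz / 1000


pitcairn = memory_bandwidth_gb_s(256, 4800)  # 4x64-bit bus, 4800 MHz effective
print(f"{pitcairn:.1f} GB/s")
print(384 / 256)  # bus width ratio to the top line (implied 384-bit bus)
```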

Like the rest of the GCN architecture chips, Pitcairn incorporates a 9th generation tessellator block featuring numerous buffering and caching optimizations, which can significantly improve geometry processing performance. Here is a comparison of the new AMD board with the previous-generation solution in a synthetic test, which suggests an increase in tessellation speed of up to four times:

A lot of AMD technologies that have been introduced and improved in the new Radeon HD 7000 video chips are also supported in the same way. Here is an incomplete list of them: PowerTune, ZeroCore, Eyefinity 2.0, HD3D, Steady Video, texture filtering quality improvements, etc. All this is described in more detail above. To add to the list, the Radeon HD 7800 fully supports both the improved MLAA 2.0 anti-aliasing algorithm and supersampling anti-aliasing (SSAA).

As far as gaming performance is concerned, the Radeon HD 7870 is significantly faster than its direct competitor, the GeForce GTX 570, especially in modern games at high rendering resolutions, where the latter's 1.25 GB of VRAM (compared to 2 GB for the solutions in question) becomes a limitation. The younger Radeon HD 7850 can be compared with the GeForce GTX 560 Ti, and here it can no longer boast of the amount of memory. However, according to AMD's measurements, their new solution is still faster than the competition in most games.

Details: Radeon HD 7700 Series

  • Chip codename: "Cape Verde"
  • Production technology: 28 nm
  • 1.5 billion transistors (less than Barts, which is the basis of the Radeon HD 6800 series)
  • Unified architecture with an array of common processors for streaming processing of multiple types of data: vertices, pixels, and more.
  • Hardware support for DirectX 11.1, including shader model Shader Model 5.0
  • Core clock: up to 1000 MHz (for Radeon HD 7770)
  • 10 GCN Compute Units with 40 SIMD cores for a total of 640 floating point ALUs (Integer and Float formats, IEEE 754 FP32 and FP64 precision support)
  • 40 texture units, with support for trilinear and anisotropic filtering for all texture formats
  • Integrated support for up to six monitors including HDMI 1.4a and DisplayPort 1.2

Radeon HD 7770 Graphics Specifications

  • Core frequency: 1000 MHz
  • Number of universal processors: 640
  • Number of texture units: 40, blending units: 16
  • Memory type: GDDR5
  • Memory capacity: 1 gigabyte
  • Theoretical texture sampling rate: 40.0 gigatexels per second.
  • One CrossFire connector
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, two Mini-DisplayPort 1.2
  • Power consumption: 3 to 80 W
  • One 6-pin power connector
  • Dual slot design
  • US MSRP: $159

Radeon HD 7750 Graphics Specifications

  • Core frequency: 800 MHz
  • Number of universal processors: 512
  • Number of texture units: 32, blending units: 16
  • Effective memory frequency: 4500 MHz (4×1125 MHz)
  • Memory type: GDDR5
  • Memory capacity: 1 gigabyte
  • Memory bandwidth: 72 gigabytes per second
  • Theoretical maximum fill rate: 12.8 gigapixels per second.
  • Theoretical texture sampling rate: 25.6 gigatexels per second
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, one DisplayPort 1.2
  • Power consumption: 3 to 55 W
  • Does not require additional power
  • Single slot design
  • US MSRP: $109

The low-cost series of video cards based on the GCN architecture differs from the top and middle lines in the second digit of the index: the 9 gives way to a 7, as before. The Radeon HD 7770 is the more productive solution, but there is also a younger model - the HD 7750. The older board had no direct competitors on the market at the time of release, sitting somewhere between the GeForce GTX 560 and GTX 550 Ti, and the younger one is aimed at fighting the GTX 550 Ti. For the HD 7770, a competitor was later announced in the form of the GeForce GTX 560 SE (all of these NVIDIA solutions are based on older GPUs).

Both AMD models under consideration have the same 1 gigabyte of GDDR5 memory. With a 128-bit memory bus, 2 GB could also be installed, but that much GDDR5 memory would cost too much for their price segment. Therefore, only models with 1 GB have been released so far, although options with 2 GB of video memory may appear in the future. In the meantime, AMD has decided to reserve that capacity for the HD 7800.

In terms of other consumer characteristics, the HD 7750 and HD 7770 models are quite different. If the older Radeon HD 7770 has a two-slot design of the cooling system and its cooler is covered with a plastic casing like in older solutions, then the younger HD 7750 looks noticeably simpler, occupying one slot and having a simple cooler. However, most manufacturers still produce boards with their own designs. The power consumption of the new models in this price range is also different, the older one has one 6-pin auxiliary power connector, and the younger one is powered by PCI Express.

Architectural features Radeon HD 7700

The base block of the new architecture is the GCN block, and all GPUs of the series are assembled from them. Each of the available GCN blocks is capable of scheduling and distributing instructions on its own, and one computing unit can execute up to 32 independent instruction streams. Let's look at the block diagram of the Cape Verde chip:

The diagram shows the Radeon HD 7770 GPU (the “stripped down” HD 7750 features a few disconnected units), we see 10 compute units of the GCN architecture. In the case of the junior solution of the Radeon HD 7700 series, it was decided to disable two of them, and the number of blocks became 8. This corresponds to 640 and 512 stream processors. And since each GCN unit has 4 texture units in its composition, the final figure for the number of TMUs for the older model is 40 TMUs, and for the younger one - 32 TMUs.

The number of ROPs and memory controllers in the HD 7770 and HD 7750 is the same, and AMD decided not to cut the ROPs too much, leaving 16 in each. But the memory bus in Cape Verde is cut down to 128 bits, assembled from two 64-bit channels. This is three times narrower than in the top series, and we see yet another confirmation that in inexpensive chips the memory bus is traditionally the first thing to be cut. Still, the use of fast GDDR5 memory made it possible to keep a relatively high (for such inexpensive solutions) bandwidth of 72 GB/s.

It remains for us to note a rather large L2 cache - as much as 512 kilobytes (compared to 768 KB for the top-end chip - apparently, the L2 cache does not take up too much space on the chip), as well as improvements in geometry performance. Like the top-of-the-line chip, Cape Verde features a 9th generation tessellator with multiple buffering and caching optimizations that deliver a notable improvement in geometry processing performance over the Radeon HD 6000 series.

In general, we will not repeat all the information about the AMD technologies that have been implemented and improved in the new Radeon HD 7000 video chips (here is a partial list: PowerTune, ZeroCore, Eyefinity 2.0, HD3D, Steady Video, texture filtering quality improvements, etc.); all this is described in more detail above. The HD 7700 series supports all the features listed there, including AMD Eyefinity 2.0 with six monitors and stereo rendering, as well as an improved video decoding and encoding unit.

But what about the most important thing, performance in games? The first estimates of rendering speed can always be taken from the manufacturer's presentations. AMD believes that the Radeon HD 7770 sits somewhere between the GeForce GTX 560 and the GeForce GTX 550 Ti, and in its materials compares it with the latter.

AMD does not compare the Radeon HD 7750 with anything, simply noting that most modern games are playable on this model at maximum settings in Full HD resolution. This is not surprising, since in recent years there have been practically no PC exclusives, and multi-platform games are much less demanding. So the Radeon HD 7700 series boards suit undemanding users well.

Details: Model Radeon HD 7790

  • Chip codename: "Bonaire"
  • Production technology: 28 nm
  • 2.08 billion transistors (more than Cape Verde in the Radeon HD 7700, but less than Pitcairn in the Radeon HD 7800)
  • Unified architecture with an array of common processors for streaming processing of multiple types of data: vertices, pixels, and more.
  • Hardware support for DirectX 11.1, including Shader Model 5.0
  • 128-bit memory bus: two 64-bit wide controllers with GDDR5 memory support
  • Core frequency: 1000 MHz
  • 14 GCN Compute Units comprising 56 SIMD cores with a total of 896 floating-point ALUs (integer and floating-point formats, with IEEE 754 FP32 and FP64 precision)
  • 56 texture units, with support for trilinear and anisotropic filtering for all texture formats
  • 16 ROPs with support for anti-aliasing modes with the possibility of programmable sampling of more than 16 samples per pixel, including with FP16 or FP32 framebuffer format. Peak performance up to 16 samples per clock, and in colorless mode (Z only) - 64 samples per clock

Radeon HD 7790 Graphics Specifications

  • Core frequency: 1000 MHz
  • Number of universal processors: 896
  • Number of texture units: 56, blending units: 16
  • Memory type: GDDR5
  • Memory capacity: 1 gigabyte
  • Memory bandwidth: 96 gigabytes per second
  • Theoretical maximum fill rate: 16.0 gigapixels per second.
  • Theoretical texture sampling rate: 56.0 gigatexels per second.
  • One CrossFire connector
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, HDMI 1.4, two Mini-DisplayPort 1.2
  • Power consumption: 3 to 85 W
  • One 6-pin power connector
  • Dual slot design
  • US MSRP: $149

An inexpensive video card based on the new mid-budget chip differs from the previous top model of the HD 7700 subfamily by the third digit in the index: the 7 has been replaced with a 9, which indicates higher performance. At the same time, the Radeon HD 7790 index clearly indicates that it is a less powerful graphics card than the line one step higher, the HD 7800.

However, things are not so simple here either: the card can certainly compete with the junior HD 7850. The recommended price for the Radeon HD 7790 is set at $149, that is, approximately in the middle between the prices of the HD 7770 and HD 7850. As for competing solutions in the same price segment, the HD 7790 was clearly released to have something to set against NVIDIA's GeForce GTX 650 Ti, based on the GK106 chip, which sits right between the HD 7770 and HD 7850 in price and speed. NVIDIA, however, immediately responded to AMD's new board by launching the GeForce GTX 650 Ti Boost, an accelerated version with higher performance.

This AMD graphics card has only 1 GB of GDDR5 memory. The GPU has a 128-bit memory bus, so 2 GB could in theory be fitted, but that amount of fast GDDR5 memory is still too expensive for this price segment, so AMD released a model with less memory, although it may not be enough for some modern games even at low settings and resolutions. Partner cards with 2 GB of video memory are also possible.

Like its neighbors in the lineup, the Radeon HD 7790 has a dual-slot cooling system covered with a plastic shroud. Most manufacturers still release boards with their own cooler designs, so the reference one is not that important. Interestingly, the new model's power consumption has not increased much compared to the HD 7770, although an improvement in power efficiency was to be expected. That is also why the new card has only one 6-pin auxiliary power connector.

Architectural features

The new Bonaire GPU, on which the Radeon HD 7790 is based, belongs to the same Graphics Core Next (GCN) architecture that we have known for a year and a half, but AMD calls it GCN 1.1, hinting at minor changes. In fact, the chip is architecturally almost the same as the previous ones, although there are indeed some small differences. For example, the new architecture introduces instructions that are useful for the Heterogeneous System Architecture (HSA), support for more simultaneously executing threads, and a new version of AMD PowerTune technology, which we will discuss later. None of these changes can be called significant, because nothing is new in the basic blocks or in their efficiency.

Therefore, we can safely refer to the earlier material that carefully describes all the features of the Graphics Core Next (GCN) architecture, and here we will only repeat the most important characteristics and features of this particular product. All of AMD's latest GPUs offer excellent features and performance in both graphics and non-graphics processing, including mixtures of the two. The GCN architecture also greatly simplifies optimization and software development while maintaining high efficiency.

As you know, the basic block of the architecture is the GCN block, from which all the GPUs of the Southern Islands series are assembled. The GCN computing unit is divided into subsections, each of which works on its own instruction stream. GCN blocks have a dedicated 64 KB local data storage for data exchange or local register stack expansion. Also, the block has a first-level cache memory with the ability to read and write and a full-fledged texture pipeline with sampling and filtering blocks. Each of the existing GCN units is capable of scheduling and distributing commands on its own, and one computing unit can execute several independent instruction streams. Let's look at the block diagram of the new chip:

The Bonaire diagram confirms the goal of the new solution: to offer performance between Cape Verde, which has 10 GCN compute units, and Pitcairn, which has 20. These two GPUs, released in 2012, differ from each other by exactly a factor of two, which left a rather large performance gap between them. Bonaire has now filled it.

The diagram shows the GPU as used in the Radeon HD 7790, a full configuration with no blocks disabled. The chip includes 14 compute units of the GCN architecture, which corresponds to 896 stream processors. Since each GCN unit has 4 texture units, the new model has 56 TMUs in total. That is, at equal frequency Bonaire is exactly 1.4 times faster than the Cape Verde chip in mathematical calculations and texture fetches.
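The unit counts above follow directly from the number of compute units. A minimal sketch of that arithmetic, assuming 64 stream processors per compute unit (inferred from 896 / 14; the text itself only states the 4 texture units per unit) and using illustrative names that do not come from AMD:

```python
# Per-CU figures: 4 TMUs per CU is stated in the text,
# 64 stream processors per CU is inferred from 896 / 14.
SP_PER_CU = 64
TMU_PER_CU = 4

# Compute-unit counts quoted in the article.
CHIPS = {"Cape Verde": 10, "Bonaire": 14, "Pitcairn": 20}


def unit_counts(compute_units):
    """Return (stream processors, texture units) for a given CU count."""
    return compute_units * SP_PER_CU, compute_units * TMU_PER_CU


for name, cus in CHIPS.items():
    sps, tmus = unit_counts(cus)
    print(f"{name}: {cus} CUs, {sps} stream processors, {tmus} TMUs")

# At equal clocks, peak math and texture rates scale with the CU count.
ratio = CHIPS["Bonaire"] / CHIPS["Cape Verde"]
print(f"Bonaire vs Cape Verde at equal clocks: {ratio:.1f}x")
```

This only reproduces peak theoretical ratios; real game performance also depends on ROPs, memory bandwidth and drivers.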

But the number of ROP units and memory controllers in Bonaire and the Radeon HD 7790 is the same as in Cape Verde and the Radeon HD 7770: AMD decided to leave 16 ROPs, and the memory bus of the new chip is 128 bits wide, assembled from two 64-bit channels. The small number of ROPs may be the "Achilles heel" of the solution: fast GDDR5 memory made it possible to provide a relatively high bandwidth of 96 GB/s, but nothing can be done about ROP performance.

The new GPU does bring improvements in geometry performance and tessellation speed. Cape Verde also has a 9th-generation tessellator, but Bonaire doubles the number of geometry blocks, rasterizers and command processors (shown as ACE in the diagram): there are now two of each. This gives Bonaire the ability to process up to two geometric primitives per clock, just like the more powerful Pitcairn and Tahiti.

As you remember, it was with the Radeon HD 7770 that AMD first crossed the psychologically important GPU clock threshold of 1 GHz. The HD 7790 has exactly the same reference frequency of 1 GHz, so on the GPU side the performance gain over the HD 7770 comes solely from architectural changes and the larger number of execution units.

But the video memory frequency of the new card is much higher. While the HD 7770 had a relatively low effective memory frequency of 4.5 GHz, the HD 7790 is equipped with fast GDDR5 memory operating at 6 GHz, which provides a third more bandwidth. The 33% increase in video memory bandwidth compared to the Radeon HD 7700 sub-family resulted in a clear increase in gaming performance. AMD provides this chart, comparing HD 7790 frame rates with memory running at 4.5 and 6.0 GHz:

The largest speedup from the higher memory bandwidth was achieved in games such as StarCraft II and Crysis 2. On average, a 33% increase in memory bandwidth gives about a 10% increase in average frame rate across a set of modern games. That is a good result, showing that memory bandwidth still matters a great deal, although it is not the only factor limiting performance. It is quite possible that with more ROPs Bonaire would be faster still.
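The bandwidth figures quoted here can be checked with the usual peak-bandwidth formula: bus width in bits times effective data rate, divided by 8 bits per byte. A small sketch (the function name is illustrative):

```python
def memory_bandwidth_gb_s(bus_width_bits, effective_rate_ghz):
    """Peak memory bandwidth in GB/s for a given bus width and effective data rate."""
    return bus_width_bits * effective_rate_ghz / 8


hd7770 = memory_bandwidth_gb_s(128, 4.5)  # 128-bit bus, 4.5 GHz effective
hd7790 = memory_bandwidth_gb_s(128, 6.0)  # same bus, 6.0 GHz effective
gain_percent = (hd7790 / hd7770 - 1) * 100

print(f"HD 7770: {hd7770:.0f} GB/s")
print(f"HD 7790: {hd7790:.0f} GB/s")
print(f"Bandwidth gain: {gain_percent:.0f}%")
```

The result matches the 72 GB/s and 96 GB/s figures in the text, a gain of one third on an unchanged 128-bit bus.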

Understandably, the average power consumption has increased slightly compared to the HD 7770. For the old model this value is 80 W, and for the HD 7790 it is 85 W, a very small price to pay for a theoretical performance increase of 33-40%. Architectural improvements (PowerTune), a GPU design that builds on the experience of previous chips, and the continuous improvement of the manufacturing process at TSMC all led to a small increase in consumption alongside a significant improvement in speed.

As for die area and transistor count, the new chip is clearly larger than Cape Verde, since the added compute, texture and geometry units could not come for free. By these parameters, Bonaire also sits approximately in the middle between Cape Verde and Pitcairn. Bonaire contains 2.08 billion transistors on a 160 mm² die; for Cape Verde these figures are 1.5 billion and 123 mm², and for Pitcairn 2.8 billion transistors and 212 mm².

Naturally, the new chip supports all the AMD technologies that were introduced and improved in the Radeon HD 7000 family (a partial list: PowerTune, ZeroCore, Eyefinity, HD3D, Steady Video, texture filtering quality improvements, etc.); all of this is described in detail in the article AMD Radeon HD 7970: New single-processor leader. The HD 7790 supports all the features listed there, including AMD Eyefinity 2.0 with six monitors and stereo rendering, and also has an improved video decoding and encoding unit.
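As a quick check of the "approximately in the middle" claim, the figures above can be compared directly. This is only a back-of-the-envelope sketch using the transistor counts and die areas quoted in the text; the helper names are illustrative:

```python
# Figures from the text: (transistors in millions, die area in mm^2).
CHIPS = {
    "Cape Verde": (1500, 123),
    "Bonaire": (2080, 160),
    "Pitcairn": (2800, 212),
}


def density_m_per_mm2(chip):
    """Transistor density in millions of transistors per mm^2."""
    transistors_m, area_mm2 = CHIPS[chip]
    return transistors_m / area_mm2


def position_between(value, low, high):
    """0.0 means equal to low, 1.0 equal to high, 0.5 exactly midway."""
    return (value - low) / (high - low)


transistor_pos = position_between(2080, 1500, 2800)
area_pos = position_between(160, 123, 212)

for chip in CHIPS:
    print(f"{chip}: {density_m_per_mm2(chip):.1f} M transistors/mm^2")
print(f"Bonaire position by transistors: {transistor_pos:.2f}, by area: {area_pos:.2f}")
```

Both positions come out a little below 0.5, so Bonaire sits slightly closer to Cape Verde than to Pitcairn, while transistor density is similar across all three 28 nm chips.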

Enhanced PowerTune Technology

Back in 2010, AMD introduced PowerTune technology in its Cayman chip (AMD Radeon HD 6900 series). This GPU was the first to feature dynamic power management. It made it possible to raise the maximum clock speeds for typical applications while avoiding excessive power consumption in specialized stability tests like FurMark. The technology was then applied to the dual-chip AMD Radeon HD 6990, which needed it even more for obvious reasons.

The technology received a major update in mid-2012, when an automatic frequency increase, Boost, was added to AMD PowerTune. In the AMD Radeon HD 7970 GHz Edition, this algorithm delivered even more performance than the regular version of the video card. In video cards without automatic overclocking, the PowerTune algorithm uses three states: idle, light load (low-3D) and full speed. The HD 7970 GHz Edition added a Boost overclocking state to them. PowerTune keeps the card within the required power limit by switching to a lower state when necessary, which sharply reduces the clock frequency. In practice, such jumps are rare because of the large gap between the two active states.

Reducing the GPU clock speed reduces power consumption, but for finer control the voltage must be lowered as well. This is exactly what the Radeon HD 7790 does. The new Bonaire graphics chip has eight states with different frequency and voltage settings, allowing for higher clock speeds than before while ensuring that the GPU always runs at an optimal voltage and frequency. Switching between states is based on GPU load as well as current GPU power consumption.

With the new algorithm, PowerTune does not have to drop the frequency abruptly when the power limit is exceeded, and the voltage decreases along with the frequency. State transitions must be as fast as possible so that the limit is not exceeded even briefly, so Bonaire evaluates PowerTune states every 10 ms, that is, the state of the chip can change 100 times per second.

With frequencies changing this constantly, third-party applications like MSI Afterburner and GPU-Z will not show instantaneous clock rates but averages over a period of time, the so-called "effective" frequency. Another interesting innovation is that AMD is opening up the new PowerTune settings to third-party applications. Partners can also set their own PowerTune parameters, which helps them create factory-overclocked graphics cards and offer more than AMD's reference values. However, different PowerTune settings mean that video cards of the same model from different manufacturers may differ not only in clock frequencies but also in the algorithm for changing them over time, which makes it difficult to compare them under the same conditions.
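To make the idea of fine-grained state switching more concrete, here is a toy simulation. It is only a sketch under stated assumptions, not AMD's actual algorithm: the eight frequency/voltage pairs, the power model and the constant `k` are all hypothetical, and only the number of states, the 10 ms interval and the 85 W figure come from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DpmState:
    freq_mhz: int
    voltage_v: float


# Hypothetical table of eight states; the values are illustrative only.
STATES = [
    DpmState(300, 0.85), DpmState(500, 0.90), DpmState(650, 0.95),
    DpmState(775, 1.00), DpmState(850, 1.05), DpmState(925, 1.10),
    DpmState(1000, 1.15), DpmState(1075, 1.20),
]

POWER_LIMIT_W = 85.0   # board power figure from the text, used here as the budget
INTERVAL_MS = 10       # evaluation period from the text
EVALUATIONS_PER_SECOND = 1000 // INTERVAL_MS


def estimated_power_w(state, load, k=0.06):
    """Toy dynamic-power model: P = k * load * f * V^2 (k is an arbitrary constant)."""
    return k * load * state.freq_mhz * state.voltage_v ** 2


def next_state(index, load):
    """Step down one state if over budget, otherwise step up if the next state fits."""
    if estimated_power_w(STATES[index], load) > POWER_LIMIT_W:
        return max(index - 1, 0)
    if index + 1 < len(STATES) and estimated_power_w(STATES[index + 1], load) <= POWER_LIMIT_W:
        return index + 1
    return index


def simulate(loads, start=0):
    """Return the state index chosen in each 10 ms interval for a sequence of loads (0..1)."""
    index, trace = start, []
    for load in loads:
        index = next_state(index, load)
        trace.append(index)
    return trace


print(simulate([1.0] * 10))  # full load: climbs, then settles below the top state
print(simulate([0.9] * 8))   # slightly lighter load: the top state fits the budget
```

Because frequency and voltage move together in small steps, the simulated chip settles one state below the maximum under full load instead of dropping sharply to a much lower mode, which is the behaviour the text describes.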

Sales of the Radeon HD 7790 began at the very beginning of April 2013. AMD, together with its partners, arranged the release of both boards with reference frequencies and factory-overclocked solutions. Both GPU manufacturers now launch new graphics cards in much the same way, with various options quickly available from their partners. In fact, partners have released almost more overclocked versions of the HD 7790 than conventional ones, with graphics chips running at about 1075 MHz.

Details: Radeon HD 7990 Model

  • Codename "Malta"
  • Production technology: 28 nm
  • 2 chips with 4.3 billion transistors each
  • Unified architecture with an array of common processors for streaming processing of multiple types of data: vertices, pixels, and more.
  • Hardware support for DirectX 11.1, including Shader Model 5.0
  • Dual 384-bit memory bus: twice six 64-bit wide controllers with GDDR5 memory support
  • GPU frequency: 1000 MHz
  • Twice 32 GCN Computing Units, including 128 SIMD cores, consisting of a total of 4096 floating point ALUs (integer and floating formats, support for FP32 and FP64 precision within the IEEE 754 standard)
  • 2x128 texture units, with support for trilinear and anisotropic filtering for all texture formats
  • 2x32 ROPs with support for anti-aliasing modes with the possibility of programmable sampling of more than 16 samples per pixel, including with FP16 or FP32 framebuffer format. Peak performance up to 64 samples per clock, and in colorless mode (Z only) - 256 samples per clock
  • Integrated support for up to six monitors via HDMI 1.4a and DisplayPort 1.2

Radeon HD 7990 Graphics Specifications

  • Core frequency: 1000 MHz
  • Number of universal processors: 4096
  • Number of texture units: 2x128, blending units: 2x32
  • Effective memory frequency: 6000 MHz (4×1500 MHz)
  • Memory type: GDDR5
  • Memory capacity: 2x3 gigabytes
  • Memory bandwidth: 2x288 gigabytes per second
  • Theoretical maximum fill rate: 64 gigapixels per second.
  • Theoretical texture sampling rate: 256 gigatexels per second.
  • One CrossFire connector
  • PCI Express 3.0 bus
  • Connectors: DVI Dual Link, four Mini-DisplayPort 1.2
  • Power consumption up to 375 W
  • Two 8-pin auxiliary power connectors
  • Dual slot design
  • The recommended price for Russia is 32,999 rubles. (for the USA - $999).

For the second generation in a row, AMD's naming principle for dual-chip models remains unchanged. The top solution based on the two most powerful video chips differs from the corresponding model of the previous generation by the first digit in the index: instead of 6 it received a 7, indicating the new series. The announced video card differs from the single-chip solution by the third digit, which indicates the maximum performance within the generation.

As for the comparison with competitors, for the Radeon HD 7990 model announced today, the main rival is the GeForce GTX 690 video card, released almost a year ago, and it is these two-chip solutions that will have to fight each other. True, NVIDIA also has one more powerful solution, but already based on a single GPU - GeForce GTX Titan, which can also be considered a competitor to the AMD board in question.

The new dual-chip Radeon video card is equipped with GDDR5 memory with a capacity of 3 gigabytes per GPU, which is due to the 384-bit memory bus of the Tahiti chips. Such a volume is quite reasonable for a product of such high level, since in some modern gaming applications with maximum settings, anti-aliasing enabled and high resolutions, a smaller amount of memory (2 gigabytes per chip or less) may no longer be enough. And even more so this applies to rendering in stereo mode or on multiple monitors in Eyefinity mode.

It is clear that such a powerful dual-chip graphics card has a massive dual-slot cooling system that is different from traditional AMD card coolers. It features a massive heatsink hidden under a shroud with three large fans running at relatively low speeds. The power consumption of the dual-GPU card is quite high for obvious reasons, and it has two 8-pin power connectors, but at least it's not three, as was the case with non-reference samples based on two Tahiti chips.

Architecture

Since the video card codenamed "Malta" is based on two "Tahiti" GPUs from the Southern Islands family, we can simply refer to the earlier material that thoroughly describes all the features of the current Graphics Core Next (GCN) architecture. Here we repeat only the most important characteristics and features of the specific product.

The basic block of the architecture is the GCN block, from which all the GPUs of the series are assembled. The computational unit is divided into subsections, each of which works on its own stream of commands, it has a dedicated local storage for data, a first-level cache memory with the ability to read and write, and a full-fledged texture pipeline with sampling and filtering units. Each of the GCN blocks is capable of scheduling and distributing commands on its own, and one computing block can execute several independent command streams. The Radeon HD 7990 uses two Tahiti chips already known to us:

The graphics processor diagram (there are two such GPUs in the Radeon HD 7990) shows 32 compute units of the GCN architecture, and all of them are active. Previously, it was assumed that for a dual-chip solution some of them would have to be disabled, and the frequency lowered as well, in order to fit within the 375 W power budget, but AMD engineers managed to solve this difficult task. Perhaps a special new revision of Tahiti with lower power consumption has been released, or the chips simply pass a very strict selection.

Since each GCN unit has 4 texture units, there are 128 TMUs per chip, which gives a total texturing rate of 256 gigatexels per second, a very good figure for a GeForce GTX 690 competitor. The number of ROP units and memory controllers in the HD 7990 is also unchanged from its single-chip counterpart: 32 and 6 per GPU, respectively. The Radeon HD 7990 has a dual 384-bit memory bus made up of twelve 64-bit channels, for a total memory bandwidth of 576 GB/s, another record.

Otherwise, the new board supports all the modern AMD technologies that were introduced and improved in the video chips of the Radeon HD 7000 line: PowerTune, ZeroCore, Eyefinity 2.0, HD3D, Steady Video, improved texture filtering quality, etc. All of this is described in detail above in the description of the Radeon HD 7970, and there is no point in repeating it.
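The headline throughput figures for the Radeon HD 7990 follow from the per-chip unit counts, the 1000 MHz core clock and the 6000 MHz effective memory frequency listed in the specifications. A short sketch of that arithmetic, assuming one pixel per ROP and one texel per TMU per clock (the function names are illustrative):

```python
def fill_rate_gpix_s(rops, core_ghz, gpus=1):
    """Peak pixel fill rate in gigapixels/s, assuming one pixel per ROP per clock."""
    return rops * core_ghz * gpus


def texel_rate_gtex_s(tmus, core_ghz, gpus=1):
    """Peak texture sampling rate in gigatexels/s, assuming one texel per TMU per clock."""
    return tmus * core_ghz * gpus


def bandwidth_gb_s(bus_bits, effective_ghz, gpus=1):
    """Peak memory bandwidth in GB/s summed over all GPUs on the board."""
    return bus_bits * effective_ghz / 8 * gpus


# Radeon HD 7990: two Tahiti chips, each with 32 ROPs, 128 TMUs and a 384-bit bus.
pixels = fill_rate_gpix_s(32, 1.0, gpus=2)
texels = texel_rate_gtex_s(128, 1.0, gpus=2)
bandwidth = bandwidth_gb_s(384, 6.0, gpus=2)

print(f"Fill rate: {pixels:.0f} Gpixels/s")
print(f"Texel rate: {texels:.0f} Gtexels/s")
print(f"Memory bandwidth: {bandwidth:.0f} GB/s")
```

These are simple sums over both GPUs; in practice each chip can only use its own 3 GB of memory and its own 288 GB/s of bandwidth.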

Cooling system and power consumption

With such serious dual-chip boards, a highly efficient cooling system becomes especially important. Partner solutions based on two Tahiti chips used three-slot coolers, and the ASUS ARES II even used water cooling; here AMD had to make do with less, so it designed a cooler with a very massive heatsink and three fans with improved acoustic characteristics.

The noise of the cooling system and the GPU temperature it maintains are among the most important consumer characteristics of any video card, including a top solution designed for enthusiasts. Buyers will regard a card with a cooling system that is too loud or inefficient as a worse purchase, other things being (roughly) equal. So AMD took this issue very seriously with the Radeon HD 7990, comparing it with other top solutions on the market. Consider the acoustic characteristics of the new cooling system:

The diagram shows the noise level of three different video cards: the Radeon HD 7990 and two competitors from NVIDIA, the dual-chip GeForce GTX 690 and the single-chip GTX Titan. The noise was measured under different conditions: in idle mode (System Idle) and at maximum load using FurMark. If AMD's numbers are to be believed, even the single-chip Titan is louder than the new card, not to mention the dual-chip GTX 690, which is the loudest in this comparison.

But was such impressive acoustic performance achieved at the expense of GPU temperature? The following chart shows GPU temperatures measured on AMD's Radeon HD 7990 and the same two competitors. This time, AMD used only the high-load mode in FurMark.

And again, a "cunning" coordinate axis with a non-zero origin is used. The real difference between 80 and 82 degrees for the Radeon HD 7990 and GTX Titan will be virtually imperceptible, although the 87 degrees for the GTX 690 clearly stand out for the worse. Again, we note that all of these tests were carried out by an interested party and need independent verification.

In terms of power consumption, the dual-chip solution is nothing new, but support for the previously announced ZeroCore Power technology is also here. This technology helps to achieve significantly lower power consumption in "deep idle" (or "sleep") mode with the display device turned off. In this mode, the idle GPU is almost completely disabled, and consumes less than 5% of the power of the full mode, turning off most of the functional blocks. And in the case of a two-chip board, it is even more important that in the CrossFire system, when drawing a two-dimensional interface of the operating system, all GPUs, except for the main one, do not work at all. That is, in the case of the Radeon HD 7990, one of the chips in 2D mode will be immersed in a deep sleep with minimal power consumption, and the second one can “fall asleep” in the PC deep idle mode.

AMD has officially unveiled a new series of graphics cards based on the 28 nm GCN microarchitecture, the AMD Radeon HD 7800. At the moment it includes two models: the AMD Radeon HD 7850 and the AMD Radeon HD 7870. The new cards are based on the AMD "Pitcairn Pro" and AMD "Pitcairn XT" GPUs, which operate at clock speeds of 860 MHz and 1000 MHz, respectively. Thus, the AMD Radeon HD 7870 becomes the second video card in the company's lineup (after the AMD Radeon HD 7770) to carry the "GHz Edition" name.

The AMD Radeon HD 7850 is equipped with 16 compute units, 1024 stream processors, 64 texture units and 32 ROP units. Its video memory subsystem consists of GDDR5 chips with a total capacity of 2 GB on a 256-bit bus. Their nominal clock frequency is 1200 MHz, and the effective one is 4800 MHz.

The model's external interfaces comprise four ports: DVI, HDMI and two miniDisplayPort.

The AMD Radeon HD 7870 has a fuller configuration, which includes 20 compute units, 1280 stream processors, 80 texture units and 32 ROP units. Its video memory subsystem and its set of external interfaces are identical to those of the AMD Radeon HD 7850.

Mass sales of the AMD Radeon HD 7850 and AMD Radeon HD 7870 start on March 19th. The recommended retail prices of these new products will be $249 and $349, respectively. The comparison table of technical specifications of the new AMD Radeon HD 7800 series video cards is as follows:

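Specification | AMD Radeon HD 7850 | AMD Radeon HD 7870
Production process, nm | 28 | 28
Microarchitecture | GCN | GCN
GPU | AMD "Pitcairn Pro" | AMD "Pitcairn XT"
Clock frequency, MHz | 860 | 1000
Number of compute units | 16 | 20
Number of stream processors | 1024 | 1280
Number of texture units | 64 | 80
Number of ROP units | 32 | 32
Video memory type | GDDR5 | GDDR5
Video memory size, GB | 2 | 2
Rated memory clock frequency, MHz | 1200 | 1200
Effective memory clock frequency, MHz | 4800 | 4800
Memory bus width, bit | 256 | 256
Internal interface | PCI Express 3.0 x16 | PCI Express 3.0 x16
External interfaces | 1 x DVI, 1 x HDMI, 2 x miniDisplayPort | 1 x DVI, 1 x HDMI, 2 x miniDisplayPort
Estimated price, $ | 249 | 349

Supported instructions and technologies (both models): DirectX 11.1, DirectCompute 11, OpenGL 4.2, OpenCL 1.2, AMD Eyefinity, AMD CrossFireX, AMD HD3D, AMD PowerPlay, AMD PowerTune, AMD Eyespeed, AMD ZeroCore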

Every year, modern games require more and more powerful video cards for graphics processing. One of the budget solutions for gamers is the AMD Radeon HD 7800 Series. Let us consider the technical characteristics of this series, as well as its features and performance in games.

Consider the characteristics of the AMD Radeon HD 7800 Series in the form of a table:

Parameter | Value
Process technology | 28 nm
GPU | Pitcairn
GPU frequency (min. to max. across models) | 800-1000 MHz
Memory type | GDDR5
Memory size | 2 GB
Memory frequency | 800-1200 MHz
Memory bandwidth | 153.6 GB/s
Memory bus width | 256-bit
Bus type | PCI Express 3.0
Architecture | GCN
Stream processors per block | 64
Texture units | from 64 to 80
Number of blocks | from 16 to 20
Total stream processors | 1024-1280
Cores for geometry processing | 2
Cores for asynchronous computing | 2
Connectors | HDMI, DisplayPort 1.2

Supported technologies and software | Description
DirectX 11 | Yes
OpenGL 4.2 | Yes
Eyefinity (monitor linking) | up to 6 monitors
ZeroCore Power | Sleep mode
Catalyst | Proprietary drivers and settings
App Acceleration | Improved video playback quality
AMD HD3D | 3D graphics processing
PowerTune | Dynamic power management

The line was launched in March 2012. Based on it, the following models were released:

  • HD7850;
  • HD7870;
  • HD7890.

The range is currently no longer in production. At the start of sales, the average price in stores was $249 and $349.

Overview

At AMD, after the move to a new process technology, it was decided to divide the general product series into subgroups. As a result, 4 lines were formed on the basis of the 28 nm process technology, which are presented in the table:


The HD 7800 with the "Pitcairn" GPU uses the Graphics Core Next microarchitecture. The series was launched in March 2012 and is currently out of production.

At one time, Pitcairn-based video cards were quite popular and offered an excellent price/quality ratio. As of 2018, the series is no longer popular, and it is extremely difficult to find a card in new condition. Although the graphics core is outdated, a PC that pairs it with a powerful processor can still handle various games at medium and high settings.

What games will run on AMD Radeon HD 7800 Series

These video cards were released back in 2012, but they can still be used for modern games. The AMD Radeon HD 7800 Series game tests were run on the following hardware:

  • Processor: Core i5-6500 3.2 GHz.
  • RAM: 16 GB DDR4 2133 Dual.
  • Hard drive: Hitachi 1TB.
  • Motherboard: Asus H170M-Plus.
  • Resolution: 1920x1080px.

The results are as follows:

Game | Graphics quality | FPS
Assassin's Creed Syndicate | High | 31
War Thunder | Cinema (Ultra) | 55-65
Quantum Break | Medium | 30-42
Assassin's Creed Unity | Medium | 30
Shadow Warrior 2 | High | 35-45
Dying Light | High | 40-50
Fallout 4 | Ultra | 38-43
GTA 5 | Above average | 45-50
DOOM | High | 40
Rise of the Tomb Raider | High | 30-40
Warface | High | 90-100
The Witcher 3: Blood and Wine | High | 25-35
World of Tanks | High | 60-80

Overall performance largely depends on the right combination of processor and graphics card. A powerful current-generation processor such as a Ryzen or Core i5 can deliver high FPS in most modern games even with an old video card.

After analyzing the technical characteristics and the game tests, we come to the following conclusion: buying this card for demanding games in 2018 is not recommended; it is better to choose newer models.

Performance is sufficient for comfortable everyday home use and for multiplayer games like CS:GO and World of Tanks.

How to overclock a video card

For maximum performance, you can overclock the AMD Radeon HD 7800. To do this, you need to install the driver and configure it.

The main changes should be made in the "Games" section. If you use an up-to-date driver, profiles can be configured individually for each video game.

In the desired profile, go to the Frame Rate Control setting. By default, the video card pushes for the maximum FPS and spends all its resources on it.

For comfortable play in shooters, 60 frames per second is enough. For CS:GO, Warface and War Thunder it is enough to set the limit to 70 FPS.

The OverDrive setting allows you to adjust the operating parameters: the frequency of the GPU and memory, the efficiency of the fans and the level of power consumption. These settings must be configured individually for each PC build.


AMD Radeon HD 7870 video card review. Inexpensive alternative to flagships

Most recently, we told you about the top video cards of AMD's new 7000 series, the Radeon HD 7970 and Radeon HD 7950. But, as you know, top models alone are not enough. So it is time to talk about the AMD Radeon HD 7800 Series, a less expensive but still capable line.

Introduction. Estimated cost

The AMD Radeon HD 7870 video card should sell in stores at prices starting from 350 US dollars (for us, converted into rubles at the Central Bank rate). The recommended price for the "younger brother", the AMD Radeon HD 7850, is $250. It must be understood that in Russia the price will always be slightly higher, as the price.ru listings for both cards show. So anyone buying one of these graphics cards should expect to spend about $50-$80 more. But even without the markups of the Russian market, each of these video cards is expensive, and for that kind of money buyers will expect very high performance. Let's find out what we get by buying one of these cards.

Architecture 7800 Series

The 7800 subline is based on a new GPU codenamed Pitcairn.

This GPU offers a very attractive price/performance ratio. For most buyers, the performance of a 7900-series card will be excessive, and not everyone wants to overpay if they cannot see the difference. In turn, HD 7700 video cards may be too weak for comfortable gaming at maximum settings in the popular 1920x1080 resolution.

This is a bit like the history of the AMD Radeon HD 5850 graphics card.

Remember this one? Even though it could not be turned into an HD 5870 by simply flashing the BIOS, it won buyers over with its well-balanced parameters, which let it run modern games at maximum settings while keeping a reasonable price. The current generation of HD 7800 video cards may well fill this niche.

The new GPU carries 20 GCN (Graphics Core Next) compute units. Recall that the Tahiti GPU, which powers the top-end Radeon HD 7970, has 32 of them. Each compute unit carries four vector units, and each vector unit, in turn, has 16 scalar units.

Thus, the total number of computing modules (stream processors) in the new GPU is 1280. The number of rasterization units (ROPs) has not changed and remains 32. The GPU clock frequency is 1000 MHz. The card has 2048 MB of GDDR5 video memory clocked at 1200 MHz (4800 MHz effective) on a 256-bit memory bus. All these characteristics apply to the older video card of the subseries, the HD 7870.
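The memory figures above determine the peak memory bandwidth. The sketch below is only an illustration: the function name is made up for this example, and it assumes the usual GDDR5 relationship in which the effective rate is four times the base clock (which matches the 1200 and 4800 MHz quoted above).

```python
def gddr5_bandwidth_gb_s(bus_width_bits: int, effective_mhz: float) -> float:
    """Peak memory bandwidth in GB/s, with 1 GB taken as 10**9 bytes."""
    bytes_per_transfer = bus_width_bits / 8          # bus width in bytes
    transfers_per_second = effective_mhz * 1_000_000
    return bytes_per_transfer * transfers_per_second / 1e9


base_clock_mhz = 1200
# Assumption: GDDR5 transfers four data words per base clock cycle.
effective_mhz = base_clock_mhz * 4
bandwidth = gddr5_bandwidth_gb_s(256, effective_mhz)

print(f"{effective_mhz} MHz effective, {bandwidth:.1f} GB/s peak")
```

For the 256-bit bus at 4800 MHz effective this works out to roughly 154 GB/s, a theoretical peak rather than a measured value.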

The younger brother, the Radeon HD 7850, is equipped with the same GPU, only slightly trimmed. Some compute units were cut, leaving 16. Thus, the junior video card carries only 1024 computing modules on board, i.e. exactly half of their number in the Tahiti GPU. Accordingly, there are fewer texture units as well: 64 remain. The GPU clock frequency is 860 MHz. The other characteristics are unchanged relative to the Radeon HD 7870.
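The unit counts quoted for both cards follow directly from the number of compute units. Here is a minimal sketch of that arithmetic; the class and constant names are invented for the example, and the figure of four texture units per compute unit is an assumption inferred from the 64 texture units on 16 compute units mentioned above.

```python
from dataclasses import dataclass

VECTOR_UNITS_PER_CU = 4       # from the article
SCALAR_UNITS_PER_VECTOR = 16  # from the article
TEXTURE_UNITS_PER_CU = 4      # assumption: inferred from 64 units / 16 CUs


@dataclass(frozen=True)
class GcnGpu:
    name: str
    compute_units: int

    @property
    def stream_processors(self) -> int:
        return self.compute_units * VECTOR_UNITS_PER_CU * SCALAR_UNITS_PER_VECTOR

    @property
    def texture_units(self) -> int:
        return self.compute_units * TEXTURE_UNITS_PER_CU


gpus = [
    GcnGpu("Radeon HD 7970 (Tahiti)", 32),
    GcnGpu("Radeon HD 7870 (Pitcairn)", 20),
    GcnGpu("Radeon HD 7850 (Pitcairn)", 16),
]

for gpu in gpus:
    print(f"{gpu.name}: {gpu.compute_units} CUs, "
          f"{gpu.stream_processors} stream processors, "
          f"{gpu.texture_units} texture units")
```

With 32 compute units Tahiti comes out at 2048 stream processors, which is why the 1024 units of the HD 7850 are exactly half.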

So, as mentioned above, the Pitcairn GPU is based on the Graphics Core Next (GCN) architecture. This means that it inherited all the solutions from Tahiti (7900 series). The new GPU is built on a 28nm process. The video card has a PCI-E 3.0 (Gen3) interface. The energy-saving technologies AMD PowerTune and AMD ZeroCore Power are still present. AMD Eyefinity 2.0 technology allows multiple displays to be connected with a maximum overall resolution of 16384x16384. 3D support is present and is implemented by AMD HD3D Technology. AMD also continues to develop technologies that use the GPU to raise the performance of the whole system. In this case, we are offered the Video Codec Engine (VCE), which accelerates video encoding in the video card's hardware, offloading the CPU and reducing processing time.

The slide above confirms that the Pitcairn GPU has inherited all the goodies from its older brothers. The 7700 line has not been stripped of features either, but more on that in the next article.

Judging by AMD's slides, the new video cards have modest appetites. The older HD 7870 consumes no more than 175 watts under load. The TDP of the younger HD 7850 is 130 watts. With the display turned off at idle, the graphics cards consume less than three watts.

The graphs on the slide look convincing. AMD promises almost a twofold increase in performance. However, this is not very surprising: the CrossFireX mode has always scaled well. Drivers play a huge role here, and I'm glad that at least in this respect everything is in order. With that, we turn to the design of the video cards.

AMD Radeon HD 7870

As usual, let's start by examining the exterior, then move on to dismantling the cooling system and examining the printed circuit board and the elements on it.

The appearance of the video card is absolutely traditional and recognizable. As you may have guessed, our laboratory received a reference engineering sample. The length of the board is 245 mm. So far, only one thing can be said about the cooling system: a blower-type fan pushes hot air out of the case. You can also see that the switch between BIOS chips has disappeared. In fact, looking ahead, I will say that the second BIOS chip has disappeared as well. From now on, flashing a video card carries a certain risk that it will not start again. Most will think that this is a minus. However, how often do you flash video cards, especially when doing so does not unlock any cores? I don't think it's very often. And for most users this is a plus, since dropping this feature lowers the cost of the product.

You can also see only one CrossFireX connector. This means that only two video cards can be combined in tandem. The situation is similar to the one discussed above with the second BIOS chip and the switch. How many users combine three or more video cards? I think such people are an absolute minority, especially among those who do not use the most top-end video adapters. At the same time, its absence again reduces the final cost of the product.

On the reverse side there is nothing remarkable, and we move on.

Under the grille for blowing hot air out of the case, in the usual place, there are four video outputs - one DVI, one HDMI and two miniDP.

The video card is equipped with two additional power connectors. This helps when overclocking with increased GPU voltage, since power consumption rises sharply in that case.

For better heat transfer, the place of contact between the heatsink and the GPU and the adjacent area are made of copper. Three copper heat pipes aid in heat dissipation.

Anyone who remembers what the board of the reference HD 6870 looks like will understand my feeling of having seen this somewhere before. Indeed, a board with a very similar arrangement of elements was used in the reference HD 6970.

In the center of the PCB is the Pitcairn XT GPU itself, rotated 45 degrees relative to the substrate.


Around the GPU are eight Hynix video memory chips marked T2C, which means that they are rated for frequencies of up to 5000 MHz.


The heart of the power subsystem is the CHIL CHL 8225G chip. The GPU power subsystem has five phases. Pay attention to the location of the power phases. One more phase powers the video memory. The printed circuit board looks thoughtful and neat; there is no feeling that it was thrown together at the last moment.

This is where we finish our acquaintance with the Radeon HD 7870 board and move on to the study of the Radeon HD 7850.

Radeon HD 7850

Let me remind you that this is the younger brother of the Radeon HD 7870 video card.

Externally, the video card is no different from its older brother.

There are no differences here either, except that there is only one additional power connector. Since the HD 7850 consumes only 130 watts, versus 175 watts for the HD 7870, a single six-pin connector will suffice. The main thing is that this does not limit the overclocking potential of the card.

The video outputs do not differ from those of the HD 7870 either. We have one DVI, one HDMI and two miniDP at our disposal.
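To see how much margin the connectors leave, the TDP figures can be compared with what the slot and the auxiliary connectors are specified to deliver. This is only a rough sketch under stated assumptions: 75 W from the PCI Express x16 slot and 75 W per six-pin connector are the standard specification limits, and both connectors on the HD 7870 are assumed to be six-pin (the article only says "two additional power connectors").

```python
PCIE_SLOT_W = 75  # PCI Express x16 slot limit (specification value)
SIX_PIN_W = 75    # limit of one six-pin auxiliary connector


def board_power_budget(six_pin_connectors: int) -> int:
    """Maximum power the card may draw within specification, in watts."""
    return PCIE_SLOT_W + six_pin_connectors * SIX_PIN_W


cards = {
    # Assumption: both HD 7870 connectors are six-pin.
    "Radeon HD 7870": {"tdp_w": 175, "six_pin": 2},
    "Radeon HD 7850": {"tdp_w": 130, "six_pin": 1},
}

headroom = {
    name: board_power_budget(card["six_pin"]) - card["tdp_w"]
    for name, card in cards.items()
}

for name, watts in headroom.items():
    print(f"{name}: {watts} W of headroom above TDP")
```

Under these assumptions the HD 7850 has noticeably less in-spec headroom than the HD 7870, which is one reason to watch its behavior when overclocking.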

The cooling system does not differ from that of the Radeon HD 7870. We are offered a copper base at the point of contact between the GPU and the cooling system, and three copper heat pipes for efficient heat dissipation.

As expected, the printed circuit boards of the two video cards are almost identical. The only difference in the soldered elements is one phase "cut" from the GPU power subsystem. For stable operation at stock settings this is definitely enough, but how the GPU will behave in overclocking and how stable it will be is another question.


And here is the very heart of the HD 7850, the AMD Pitcairn Pro GPU. It is made in China and carries 1024 computing modules. Recall that the Radeon HD 6850 carried 960 computing modules, i.e. the gap is not so great if you ignore the new architecture and look only at the numbers.


The printed circuit board of this card carries 8 video memory chips with a total capacity of 2 GB, so each chip holds 256 MB. The marking has not changed and reads "T2C", which corresponds to a rated frequency of 5000 MHz. I would very much like this bar to be overcome in overclocking.
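The per-chip numbers are easy to check. The short sketch below assumes that capacity and bus width are split evenly across the eight chips; the variable names are chosen just for this example.

```python
total_memory_mb = 2048
chip_count = 8
bus_width_bits = 256
stock_effective_mhz = 4800
rated_effective_mhz = 5000  # "T2C" rating quoted in the article

# Assumption: capacity and bus width are divided evenly between chips.
chip_capacity_mb = total_memory_mb // chip_count
bits_per_chip = bus_width_bits // chip_count

# Margin between the stock memory frequency and the chips' rating.
rated_headroom_pct = (
    (rated_effective_mhz - stock_effective_mhz) / stock_effective_mhz * 100
)

print(f"{chip_capacity_mb} MB and {bits_per_chip} bits per chip")
print(f"Rated headroom over stock: {rated_headroom_pct:.1f}%")
```

The chips' rating sits only about 4% above the stock memory frequency, so any overclock beyond 5000 MHz runs the memory outside its rated range.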

Alternative versions of Radeon HD 7800 Series

As always, most manufacturers will want to move away from reference cooling systems and offer something of their own. Let's see what exactly.

ASUS HD 7870 and HD 7850

Video cards manufactured by ASUS with the well-known DirectCU II cooling system differ very little in appearance, but this is not the main thing. The main thing is that the manufacturer promises up to 20% increased performance compared to the reference version.

Club3D HD 7870 and HD 7850

Club3D decided not to stray far from stock. The Radeon HD 7870 looks no different from the reference. The younger brother, apparently, got a dedicated GPU heatsink with one heat pipe and a fan. If the cooling of the remaining elements did not suffer as a result, then everything is fine.

Gigabyte HD 7870 and HD 7850

Gigabyte decided to equip the 7800 series with its proprietary cooling systems and to raise the clock speeds. The HD 7850 makes do with a two-fan cooler, while the Radeon HD 7870 got a three-fan one. Outwardly there is nothing to complain about, except for the known issue with exhausting hot air out of the case.

HIS HD 7870 and HD 7850

HIS has equipped the new products with its proprietary IceQ X cooling systems. These coolers have always been known for their good efficiency. The younger video card got only two heat pipes, while the older one received all four.

MSI HD 7870 and HD 7850

The TwinFrozr III cooling system is considered highly efficient, but it always raises the price tag. It often turns out to be easier to buy the next graphics card up in performance. Still, the advantages of high efficiency and low noise remain.

PowerColor HD 7870 and HD 7850

PowerColor introduced two versions of the Radeon HD 7870 and one Radeon HD 7850, the latter with a cooler that is more modest in appearance. This is not a concern, since the HD 7850 has very low power consumption and, accordingly, low heat dissipation. Two versions of the HD 7870 are prepared: one reference, and one with a PCS+ cooling system.

Sapphire HD 7870 and HD 7850

Video cards manufactured by Sapphire will be equipped with cooling systems very similar to the proprietary FleX. There is no reason to doubt the effectiveness of these coolers, since we have already tested similar video cards, and even with much hotter chips the temperatures remained within reasonable limits.

XFX HD 7870

XFX has presented only the older video card Radeon HD 7870 at the moment. It will be equipped with two fans. We have already seen a similar design in the 7900 series video cards from this manufacturer.

Specifications table

                                   HD 7870   HD 7850   HD 6970   HD 7950
Process technology, nm             28        28        -         28
Stream processors, pcs             1280      1024      1536      1792
Rasterization units (ROPs), pcs    32        32        -         -
GPU frequency, MHz                 1000      860       880       -
Video memory frequency, MHz        1200      1200      -         -
Video memory size, MB              2048      2048      2048      3072
Video memory bus, bit              256       256       -         -
Supported version of DirectX       -         -         -         -
Cost, rub*                         -         -         -         -

*Prices according to www.price.ru for Moscow

Overclocking and temperatures

Let's start with the younger video card, the Radeon HD 7850.


The stock clock speeds are 860 MHz for the GPU and 1200 (4800 effective) MHz for the video memory. The reference coolers inspire confidence, so you can safely count on successful overclocking.


Without increasing the voltage, the video card was overclocked to 1025 MHz for the GPU and 1375 (5500) MHz for the video memory. These are good results. The fan speed was set to 40%. Higher values made it impossible to work next to the test stand. Frankly, the 7000-series video cards cannot even be compared with vacuum cleaners: vacuum cleaners are quieter. Of course, at stock settings everything is fine, and the video cards are inaudible at idle. But once you overclock a card and raise the fan speed manually, that's it, lights out.


The nominal frequencies for the older HD 7870 video card are 1000 MHz for the video processor and 1200 (4800) MHz for the video memory.


As with the younger video card, we raised the fan speed to 40%, which helped to lower the temperature. The video processor was able to run stably at 1120 MHz. The video memory, however, was a disappointment. Unfortunately, it could not run stably at any higher frequency at all. Most likely this is a special case and we were just unlucky. Probably one of the video memory chips was not in the best shape.
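For convenience, the overclocking results can be expressed as percentage gains over stock. The snippet below is a simple illustration using the frequencies reported above; the function name and the data layout are arbitrary choices for this example.

```python
def overclock_gain_pct(stock_mhz: float, overclocked_mhz: float) -> float:
    """Relative frequency increase over stock, in percent."""
    return (overclocked_mhz - stock_mhz) / stock_mhz * 100


# (card, component) -> (stock MHz, overclocked MHz), as reported in the text.
# The HD 7870 memory is omitted because it did not overclock at all.
results = {
    ("Radeon HD 7850", "GPU"): (860, 1025),
    ("Radeon HD 7850", "memory"): (1200, 1375),
    ("Radeon HD 7870", "GPU"): (1000, 1120),
}

gains = {key: overclock_gain_pct(*clocks) for key, clocks in results.items()}

for (card, component), gain in gains.items():
    print(f"{card} {component}: +{gain:.1f}%")
```

The HD 7850 GPU gains about 19% and ends up above the stock 1000 MHz of the HD 7870, while the HD 7870 GPU gains 12%.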


At stock settings, the operating temperatures of the two new products barely differ. The older HD 7870, as expected, runs a few degrees hotter under load, which is normal. When overclocked, temperatures are lower because the fan speed was raised to 40%. The HD 7950 turns out to be a much hotter video card. But nothing can compete with the HD 6970 in heat dissipation. As befits the previous generation, it is the hottest video adapter among all test participants.

Competitors

To compare performance, we took two video cards: AMD Radeon HD 7950 and AMD Radeon HD 6970.

AMD Radeon HD 7950

This is AMD's second most powerful single-chip video card today. We have already told you about it. It will be interesting to see how the new products behave against it. The HD 7950 has three gigabytes of memory and 1792 stream processors on its side.

AMD Radeon HD 6970

This video card is the flagship of the previous generation. Its performance is high enough for modern games. Of course, at a resolution of 2560 x 1600 it has little to offer, but more powerful video cards also struggle in such modes. The amount of video memory is 2048 MB. There are 1536 stream processors, operating at a frequency of 880 MHz.

Test stand

  • Processor - Intel Core i7 3960X
  • Motherboard - ASUS P9X79 Deluxe
  • RAM – Corsair XMS3 1600MHz 9CL 4x4GB
  • Hard drive - Intel SSD 160 GB
  • Power supply - Corsair HX850W

Test results

AMD Radeon HD 78xx test results in Dirt 3





The new line shows good performance. In most cases, the overclocked Radeon HD 7850 comes very close to the stock Radeon HD 7870. The latter, when overclocked, in most cases competes successfully with the stock Radeon HD 7950. This is all as expected. The performance of the HD 6970 varies a lot from application to application. In some places it outperforms the HD 7870, and in others it loses to the HD 7850. Overall, the performance of the new line is very pleasing.

Conclusion

The AMD Radeon HD 7800 series turned out to be very successful. At a lower cost, the Radeon HD 7870 often overtakes a video card from the older top line, the HD 7950. This will save money for those who want more performance for less.

I really liked the temperatures of the new video cards. They run really cool. Under load at stock frequencies, the HD 7870 heated up to 68 degrees, and the HD 7850 up to 66 degrees. This is a very good result for this level of performance. It becomes especially noticeable when you compare the results with the HD 6970.

What I really did not like was the noise of the video cards once the fan speed is set to 50% or more. I can't even imagine who could work near the computer with the fan set above 50%.

As soon as prices finally settle, the new products will be an excellent investment. This process usually takes a month or two. Most importantly, the price/performance ratio has not shifted towards higher prices.
