The NVIDIA GeForce GTX 1080 Preview: A Look at What's to Come

Name: The NVIDIA GeForce GTX 1080 Preview: A Look at What's to Come
Item: The NVIDIA GeForce GTX 1080 Preview: A Look at What's to Come
Author: Ryan Smith

by Ryan Smith on May 17, 2016 9:00 AM EST

262 Comments | Add A Comment

262 Comments

Earlier this month NVIDIA announced their latest generation flagship GeForce card, the GeForce GTX 1080. Based on their new Pascal architecture and built on TSMC’s 16nm FinFET process, the GTX 1080 is being launched as the first 16nm/14nm-based video card, and in time-honored fashion NVIDIA is starting at the high-end. The end result is that the GTX 1080 will be setting the new high mark for single-GPU performance.

Unlike past launches, NVIDIA is stretching out the launch of the GTX 1080 a bit more. After previously announcing it back on May 6^th, the company is lifting their performance and architecture embargo today. Gamers however won’t be able to get their hands on the card until the 27^th – next Friday – with pre-order sales starting this Friday. It is virtually guaranteed that the first batch of cards will sell out, but potential buyers will have a few days to mull over the data and decide if they want to throw down $699 for one of the first Founders Edition cards.

As for the AnandTech review, as I’ve only had a few days to work on the article, I’m going to hold it back rather than rush it out as a less thorough article. In the meantime however, as I know everyone is eager to see our take on performance, I wanted to take a quick look at the card and the numbers as a preview of what’s to come. Furthermore the entire performance dataset has been made available in the new GPU 2016 section of AnandTech Bench, for anyone who wants to see results at additional resolutions and settings.

Architecture

NVIDIA GPU Specification Comparison
	GTX 1080	GTX 980 Ti	GTX 980	GTX 780
CUDA Cores	2560	2816	2048	2304
Texture Units	160	176	128	192
ROPs	64	96	64	48
Core Clock	1607MHz	1000MHz	1126MHz	863MHz
Boost Clock	1733MHz	1075MHz	1216MHz	900Mhz
TFLOPs (FMA)	9 TFLOPs	6 TFLOPs	5 TFLOPs	4.1 TFLOPs
Memory Clock	10Gbps GDDR5X	7Gbps GDDR5	7Gbps GDDR5	6Gbps GDDR5
Memory Bus Width	256-bit	384-bit	256-bit	384-bit
VRAM	8GB	6GB	4GB	3GB
FP64	1/32	1/32	1/32 FP32	1/24 FP32
TDP	180W	250W	165W	250W
GPU	GP104	GM200	GM204	GK110
Transistor Count	7.2B	8B	5.2B	7.1B
Manufacturing Process	TSMC 16nm	TSMC 28nm	TSMC 28nm	TSMC 28nm
Launch Date	05/27/2016	06/01/2015	09/18/2014	05/23/2013
Launch Price	MSRP: $599 Founders $699	$649	$549	$649

While I’ll get into architecture in much greater detail in the full article, at a high level the Pascal architecture (as implemented in GP104) is a mix of old and new; it’s not a revolution, but it’s an important refinement. Maxwell as an architecture was very successful for NVIDIA both at the consumer level and the professional level, and for the consumer iterations of Pascal, NVIDIA has not made any radical changes. The basic throughput of the architecture has not changed – the ALUs, texture units, ROPs, and caches all perform similar to how they did in GM2xx.

Consequently the performance aspects of consumer Pascal – we’ll ignore GP100 for the moment – are pretty easy to understand. NVIDIA’s focus on this generation has been on pouring on the clockspeed to push total compute throughput to 9 TFLOPs, and updating their memory subsystem to feed the beast that is GP104.

On the clockspeed front, a great deal of the gains come from the move to 16nm FinFET. The smaller process allows NVIDIA to design a 7.2B transistor chip at just 314mm2, while the use of FinFET transistors, though ultimately outright necessary for a process this small to avoid debilitating leakage, has a significant benefit to power consumption and the clockspeeds NVIDIA can get away with at practical levels of power consumption. To that end NVIDIA has sort of run with the idea of boosting clockspeeds, and relative to Maxwell they have done additional work at the chip design level to allow for higher clockspeeds at the necessary critical paths. All of this is coupled with energy efficiency optimizations at both the process and architectural level, in order to allow NVIDIA to hit these clockspeeds without blowing GTX 1080’s power budget.

Meanwhile to feed GTX 1080, NVIDIA has made a pair of important changes to improve their effective memory bandwidth. The first of these is the inclusion of faster GDDR5X memory, which as implemented on GTX 1080 is capable of reaching 10Gb/sec/pin, a significant 43% jump in theoretical bandwidth over the 7Gb/sec/pin speeds offered by traditional GDDR5 on last-generation Maxwell products. Coupled with this is the latest iteration of NVIDIA’s delta color compression technology – now on its fourth generation – which sees NVIDIA once again expanding their pattern library to better compress frame buffers and render targets. NVIDIA’s figures put the effective memory bandwidth gain at 20%, or a roughly 17% reduction in memory bandwidth used thanks to the newer compression methods.

As for features included, we’ll touch upon that in a lot more detail in the full review. But while Pascal is not a massive overhaul of NVIDIA’s architecture, it’s not without its own feature additions. Pascal gains the ability to pre-empt graphics operations at the pixel (thread) level and compute operations at the instruction level, allowing for much faster context switching. And on the graphics side of matters, the architecture introduces a new geometry projection ability – Simultaneous Multi-Projection – and as a more minor update, gets bumped up to Conservative Rasterization Tier 2.

Looking at the raw specifications then, GTX 1080 does not disappoint. Though we’re looking at fewer CUDA cores than the GM200 based GTX 980 Ti or Titan, NVIDIA’s significant focus on clockspeed means that GP104’s 2560 CUDA cores are far more performant than a simple core count would suggest. The base clockspeed of 1607MHz is some 42% higher than GTX 980 (and 60% higher than GTX 980 Ti), and the 1733MHz boost clockspeed is a similar gain. On paper, GTX 1080 is set to offer 78% better performance than GTX 980, and 47% better performance than GTX 980 Ti. The real world gains are, of course, not quite this great, but they’re also relatively close to these numbers at times.

Gaming Performance, Power, Temperature, & Noise

PRINT THIS ARTICLE

Post Your Comment
Please log in or sign up to comment.

Comments Locked

262 Comments

View All Comments

Ryan Smith - Tuesday, May 17, 2016 - link
Ashes is a game that I only intend to run in DX12. For all intents and purposes it's the marquee DX12 title, and I expect hardware vendors to be able to handle it well. Especially as its engine was more or less designed for low level APIs from the start.

Hitman, on the other hand, had its DX12 implementation essentially bolted on after the fact.
Achaios - Tuesday, May 17, 2016 - link
No reason for anyone playing at 1920X1080 to buy this card and still not quite good at 4K either, meaning that it falls short of the 60 FPS mark @ 4K.

Will wait for 1080TI.
Dritman - Tuesday, May 17, 2016 - link
More half assed content from Anandtech. I'm not even surprised anymore. Can't wait to hear more excuses from Ian, thats what the audience really want right Ian? Keep coming back hoping you guys will get your shit together, but I think I'm ready to say good bye.

Every single other outlet on the net with a 1080 review has achieved more than Anandtech, how do you think that reflects on you?
silverblue - Tuesday, May 17, 2016 - link
Judge them once the review is out.
Ryan Smith - Tuesday, May 17, 2016 - link
I'm always sorry to lose a reader.

But I also don't make any apologies for how I've chosen to publish this. I had 4 days to work on this, and that's not sufficient time for a full AnandTech quality review.
vladx - Tuesday, May 17, 2016 - link
Don't sweat it Ryan, I want an in-depth look into Pascal architecture and I really want to see how Pascal IPC compares to Maxwell's, my bet is it's about 10-15% lower overall.
vladx - Tuesday, May 17, 2016 - link
Bye Anandtech is better without the likes of you with comments like that.
brucek2 - Tuesday, May 17, 2016 - link
If AnandTech was not included among certain sites hand picked to receive early review samples, that may actually reflect quite well on their editorial integrity.

Also, really not feeling the time urgency you seem too. It's not yet even possible to order the card, and a lot of related information that some would consider important -- ie 3rd party cards and their performance -- isn't anywhere close to being on the table either.
Ryan Smith - Wednesday, May 18, 2016 - link
"If AnandTech was not included among certain sites hand picked to receive early review samples, that may actually reflect quite well on their editorial integrity."

To be clear, we received our sample at the same time as everyone else. The issue was that I had another (previously scheduled) function to attend when those samples were distributed. No malice or anyone's part, just bad timing all around.
Michael Bay - Wednesday, May 18, 2016 - link
You`re literally attentionwhoring.
Nobody will miss you.

The NVIDIA GeForce GTX 1080 Preview: A Look at What's to Come

Architecture

Post Your Comment

262 Comments

View All Comments

Ryan Smith - Tuesday, May 17, 2016 - link

Achaios - Tuesday, May 17, 2016 - link

Dritman - Tuesday, May 17, 2016 - link

silverblue - Tuesday, May 17, 2016 - link

Ryan Smith - Tuesday, May 17, 2016 - link

vladx - Tuesday, May 17, 2016 - link

vladx - Tuesday, May 17, 2016 - link

brucek2 - Tuesday, May 17, 2016 - link

Ryan Smith - Wednesday, May 18, 2016 - link

Michael Bay - Wednesday, May 18, 2016 - link

Log in

Don't have an account? Sign up now