Out of nowhere, NVIDIA has revealed the NVIDIA Titan V today at the 2017 Neural Information Processing Systems conference, with CEO Jen-Hsun Huang flashing out the card on stage. A mere 7 months after Volta was announced with the Tesla V100 accelerator and the GV100 GPU inside it, NVIDIA continues its breakneck pace by releasing the GV100-powered Titan V, available for sale today. Aimed at a decidedly more compute-oriented market than ever before, the 815 mm2 behemoth die that is GV100 is now available to the broader public.

NVIDIA Compute Accelerator Specification Comparison
  Titan V Tesla V100
(PCIe)
Tesla P100
(PCIe)
Titan Xp
CUDA Cores 5120 5120 3584 3840
Tensor Cores 640 640 N/A N/A
Core Clock 1200MHz ? ? 1485MHz
Boost Clock 1455MHz 1370MHz 1300MHz 1582MHz
Memory Clock 1.7Gbps HBM2 1.75Gbps HBM2 1.4Gbps HBM2 11.4Gbps GDDR5X
Memory Bus Width 3072-bit 4096-bit 4096-bit 384-bit
Memory Bandwidth 653GB/sec 900GB/sec 720GB/sec 547GB/sec
VRAM 12GB 16GB 16GB 12GB
L2 Cache 4.5MB 6MB 4MB 3MB
Single Precision 13.8 TFLOPS 14 TFLOPS 9.3 TFLOPS 12.1 TFLOPS
Double Precision 6.9 TFLOPS
(1/2 rate)
7 TFLOPS
(1/2 rate)
4.7 TFLOPS
(1/2 rate)
0.38 TFLOPS
(1/32 rate)
Tensor Performance
(Deep Learning)
110 TFLOPS 112 TFLOPS N/A N/A
GPU GV100
(815mm2)
GV100
(815mm2)
GP100
(610mm2)
GP102
(471mm2)
Transistor Count 21.1B 21.1B 15.3B 12B
TDP 250W 250W 250W 250W
Form Factor PCIe PCIe PCIe PCIe
Cooling Active Passive Passive Active
Manufacturing Process TSMC 12nm FFN TSMC 12nm FFN TSMC 16nm FinFET TSMC 16nm FinFET
Architecture Volta Volta Pascal Pascal
Launch Date 12/07/2017 Q3'17 Q4'16 04/07/2017
Price $2999 ~$10000 ~$6000 $1299

For the spec sheet we've gone ahead and lined it up against NVIDA's other Pascal cards, and for good reason. While the Titan series of cards may have started life as a prosumer card in 2013, since then NVIDIA's GPU designs have become increasingly divergent between compute and graphics. And even though the previous Titan Xp was based on the more graphics-focused GP102 GPU, the card itself was primarily (but not solely) pitched as an entry-level compute card, for customers who needed a (relatively) cheap way to do FP32 compute and neural network inferencing in workstations and small clusters.

The Titan V, by extension, sees the Titan lineup finally switch loyalties and start using NVIDIA’s high-end compute-focused GPUs, in this case the Volta architecture based V100. The end result is that rather than being NVIDIA’s top prosumer card, the Titan V is decidedly more focused on compute, particularly due to the combination of the price tag and the unique feature set that comes from using the GV100 GPU. Which isn’t to say that you can’t do graphics on the card – this is still very much a video card, outputs and all – but NVIDIA is first and foremost promoting it as a workstation-level AI compute card, and by extension focusing on the GV100 GPU’s unique tensor cores and the massive neural networking performance advantages they offer over earlier NVIDIA cards.

In this sense the Titan V is a return to form of sorts to the professional side of prosumer for the Titan family. One of the original claims to fame for the original Titan was its high performance in specialized FP64 compute workloads, something that was lost on the later Titan X and Titan Xp.  By switching to NVIDIA’s specialized high-end compute GPUs, the Titan V regains its formerly lost compute capabilities, all the while also gaining all of the compute capabilities NVIDIA has introduced since then. It’s no mistake that Jen-Hsun introduced the card at a neural networking conference, as this is a big chunk of the professional computing audience that NVIDIA is targeting with the card.

Interestingly, comparing it to the PCIe Tesla V100, I’m surprised by just how close the cards are in features and performance. NVIDIA has confirmed that the Titan V gets the GV100 GPU’s full, unrestricted FP64 compute and tensor core performance. To the best of our knowledge (and from what NVIDIA will comment on) it doesn’t appear that they’ve artificially disabled any of the GPU’s core features. What does separate the Titan from the Tesla then from a performance standpoint is quite simple: memory capacity, memory bandwidth, and the lack of NVLink functionality. There are also a number of smaller differences between the cards that help to differentiate them between server and workstation – such as passive versus active cooling, NVLink, and the support policies – but otherwise for customers who are running a small number of cards, the Titan V’s feature set is remarkably close to the much more expensive Tesla V100’s, which is a very interesting development since it goes to show just how confident NVIDIA is that this won’t undermine Tesla sales.

Moving on and diving into the numbers, Titan V features 80 streaming multiprocessors (SMs) and 5120 CUDA cores, the same amount as its Tesla V100 siblings. The differences come with the memory and ROPs. In what's clearly a salvage part for NVIDIA, one of the card's 4 memory partitions has been cut, leaving Titan V with 12GB of HBM2 attached via a 3072-bit memory bus. As each memory controller is associated with a ROP partition and 768 KB of L2 cache, this in turn brings L2 down to 4.5 MB, as well as cutting down the ROP count.

In terms of clockspeeds, the HBM2 has been downclocked slightly to 1.7GHz, while the 1455MHz boost clock actually matches the 300W SXM2 variant of the Tesla V100, though that accelerator is passively cooled. Notably, the number of tensor cores have not been touched, though the official 110 DL TFLOPS rating is lower than the 1370MHz PCIe Tesla V100, as it would appear that NVIDIA is using a clockspeed lower than their boost clock in these calculations.

For the card itself, it features a vapor chamber cooler with copper heatsink and 16 power phases, all for the 250W TDP that has become standard with the single GPU Titan models. Output-wise, the Titan V brings 3 DisplayPorts and 1 HDMI connector. And as for card-to-card communication, PCB itself appears to have NVLink connections on the top, but these look to have been intentionally blocked by the shroud to prevent their use and are presumably disabled.

As mentioned earlier, NVIDIA is unsurprisingly pushing this as a compute accelerator card, especially considering that Titan V features tensor cores and keeps the TITAN branding as opposed to GeForce TITAN. But there are those of us who know better than to assume people won’t drop $3000 to use the latest Titan card for gaming, and while gaming is not the primary (or even secondary) focus of the card, you also won't see NVIDIA denying it. In that sense the Titan V is going to be treated as a jack-of-all-trades card by the company.

To that end, no gaming performance information has been disclosed, but NVIDIA has confirmed that the card uses the standard GeForce driver stack. And on that note, yesterday NVIDIA released 388.59 bringing official Titan V support. Now, how much those drivers have actually been optimized for the GV100 is another matter entirely; Volta is a new architecture, markedly so at times. Speaklng solely off the cuff here, for graphics workloads the card has more resources than the Titan Xp in almost every meaningful metric, but it's also a smaller difference on paper than you might think.

As for NVIDIA's intended market of compute and AI users, the Titan V will be supported by NVIDIA GPU Cloud, which includes TensorRT, a number of deep learning frameworks, and HPC-related tools.

If the golden shroud didn’t already suggest so, the Titan V is also carving out a new eye-watering price point, dropping in at $2999 and on sale now at the NVIDIA store. NVIDIA has, to date, been selling Tesla V100 products as fast as they can produce them, so I'm not going to be surprised if the Titan V sees a similar fate. The $3000 price tag is quite high, even by Titan standards, but with the rare Tesla V100 PCIe card going for around $10,000, the Titan V is markedly cheaper. In fact in some respects I'm surprised NVIDIA is selling a GV100 card for so little; these are GV100 salvage parts that don't make the cut for Tesla - so the alternative would be throwing them away - but it just goes to show how confident NVIDIA is that it won't undermine the Tesla family.

At any rate, for NVIDIA professional users who have been looking to dip their toes into Volta but didn't want a full-fledged Tesla card, the Titan V is clearly going to be a popular card. Over the last two years NVIDIA's AI efforts have been firing on all cylinders, and by bringing a GV100 card down to just $3000, expect to see them crack open the market that much further. I dare say the idea of the "prosumer" Titan has died with this card, but for the rapidly growing professional compute market, this looks to be exactly the kind of card that a lot of developers have been waiting for.

Update (12/8/17): Yesterday, NVIDIA also released driver version 388.59 WHQL, bringing product support to the Titan V, along with Fallout 4 VR support. NVIDIA has noted that the Titan V currently suffers from TDR errors and display blanking during Blu-ray disc playback on high resolutions, as well as from G-Sync display blanking when repeatedly switching between different memory overclocks. Lastly, this minor update features one bug fix, resolving flickering on GTX 1080 equipped G-Sync notebooks.

Comments Locked

159 Comments

View All Comments

  • StrangerGuy - Monday, December 11, 2017 - link

    The vocal miser crowd (aka mostly red fanboys) doesn't seem to get the simple logic that the smart companies like NV and Apple knows they can't make money off them and thus can care less about their inane opinions.
  • mapesdhs - Monday, December 11, 2017 - link

    In a way what NVIDIA is doing reminds me of the headier days of SGI. There was demand for a gfx product that could make possible levels of performance which would not be available via COTS products for at least 5 years, so SGI built what companies wanted. They were expensive as heck (the Onyx in my garage was original $1M), but the performance was lightyears ahead of its time, ditto the features. I saw an Onyx in 1993 doing 12bits/channel and fully subsampled AA, mipmapped textured 3D at HD at a time when proper 3D simply didn't exist on PCs, and wouldn't for several more years (something SGI itself helped get going). Point being, the commercial demand was there, so they made stuff to satisfy that demand, and they made freakin' fortune in the process (some reseller sales reps became millionaires). SGI died due to numerous mistakes, but it's not as if the core demand for something advanced has ever gone away, especially these days with such heavy burdens placed on intelligence gathering, image analysis, data management, etc. Even today it's hard to put together a PC that can do what the Onyx2 Group Station for Defense Imaging could do almost 20 years ago.

    If NVIDIA reckons there's a demand, I'm sure they'll make something that costs 10x as much, and still sell plenty of them, though perhaps these days such products are done on the quiet, unlike SGI which made its tech available openly (more or less; DoD and others received customised versions).
  • smilingcrow - Friday, December 8, 2017 - link

    How dare they release a card for business users where the price is not an issue as the return on investment makes it good value.
  • mapesdhs - Monday, December 11, 2017 - link

    (ok so that's a far briefer and better way of putting it. :D)
  • dagnamit - Friday, December 8, 2017 - link

    At $3k this might be worth it, if it's a true pro-sumer card like the original Titan. Resale value could be very good. I knew people that sold their original Titans for about $100 less than the original price because it was so good as a workstation card. $100 to rent a bleeding edge card only to replace it with the, equal or better, GTX version will be worth it to some. If it gets even within 20% of the V100 for the same tasks, it'll FLY off the shelves.
  • Drumsticks - Friday, December 8, 2017 - link

    Pretty much everything about this card screams compute, not gaming. 1/2 FP64, and 110TF of DL make this a pretty damn nice compute card. Plus, it's based on GV100. As a discounted V100, its price is pretty good.
  • ddriver - Friday, December 8, 2017 - link

    Actually, if the claimed FP64 rates are true, I would personally consider it a decent buy, something nvidia hasn't had for a loooong time.

    It is basically a tesla V100 with one defective and disabled memory channel. Asking 10k for the full version of the chip - that's truly criminal. But getting almost the same performance and 3/4 of the memory at less than 1/3 of the price is pretty decent.

    Now if nvidia were just to brush up their OpenCL support.

    Although it is highly likely that the FP64 rates are limited in software. It would be very odd if they offered that much FP64 performance at that price point. Their DPFLOP price us very consistent across their product range. It is very close for say something gaming grade like the 1080 ti and the quadro gp100, the 1080ti being that "affordable" only because its FP64 rate is capped at 1/32 of the SP rate.
  • nivedita - Friday, December 8, 2017 - link

    The pro equivalent of the 1080ti is the P6000. Those are based on different silicon than the GP100 which is the one that has 1/2-speed DP compute. The P6000 also offers only 1/32 FP64.
  • mapesdhs - Monday, December 11, 2017 - link

    No price is "criminal", that's just stupid. The market decides what makes for a viable price. If it's too high, the product won't sell. I know a company that bought a machine for $1.5M; for their particular task, the system paid for itself in less than an hour. Kinda weird saying some product is too expensive when the price in question doesn't even register on the radar of what some types of tech cost in other fields.
  • Sttm - Friday, December 8, 2017 - link

    This is what happens when AMD is total crap and Nvidia just gets to run wild. They know they got at least a year before they have to sell Volta at reasonable prices. So why not give some whales a $3000 version for now!

    INTEL, YOU ARE ONLY HOPE!

Log in

Don't have an account? Sign up now