Who is the Optane SSD 900P for?

With a price per GB a little over twice that of the the fastest flash-based consumer SSDs, the Optane SSD 900P is an exclusive high-end product. For most desktop usage, drives like the 960 PRO are already fast enough to make storage no longer a severe bottleneck. The most noticeable delays due to storage performance on a 960 PRO are when moving around large files, and the Optane SSD doesn't offer any significant improvement to sequential transfer speeds. Random writes can be a challenge for flash-based SSDs, but volatile write caches and SLC caches allow them to handle short bursts with very high performance.

The unprecedented random read performance of the Optane SSD 900P is its biggest strength on paper, but not one that will often lead to a proportional speedup in overall application performance. Too many programs and filesystems are still designed with mechanical hard drive performance in mind as the baseline, and further increases to SSD performance serve mainly to shift the bottlenecks further onto the CPU, RAM, network, and even the user's own reaction time.

The scenarios where a drive like the Optane SSD 900P can offer meaningful and worthwhile performance improvements can be broadly categorized as as situations where the Optane SSD can help with one of two problems:

1. Storage is too slow

About the only time a desktop could challenge the sequential access performance of a high-end PCIe SSD (based on flash or 3D XPoint) is when dealing with high resolution uncompressed video. The Optane SSD doesn't help much here because of its limited capacity, and the PCIe 3 x4 link itself is a bottleneck at the highest refresh rates and bit depths. For video work, flash-based SSDs are definitely a better choice, and RAID arrays of cheaper SATA SSDs may be a better option than PCIe SSDs. Desktop workloads that require extremely high sustained random write performance are very rare, and SLC caching on a flash-based SSD nicely takes care of most realistic quantities of random writes.

That said, there are some situations where higher random read performance can be quite noticeable. Searching through a large volume of data is a common case, such as searching through a video, but it usually presents enough opportunities for parallelization that the drive's queue depth will climb up to the range where flash-based SSDs come close to the Optane SSD. Game level load times can in theory benefit greatly from faster read speeds, but in practice decompressing the assets after loading them into RAM quickly becomes the bottleneck. Most of the other situations where the performance advantage of the Optane SSD will really help are better described as a different kind of problem:

2. RAM is too small

In the workstation market, there are abundant examples of compute tasks with a memory working set that doesn't fit in RAM. Almost any simulation or rendering task will have a parameter for mesh density or particle count that can very quickly scale the memory requirements from a few GB to tens or hundreds of GB. An Optane SSD is far slower than four to eight channels of DDR4, but 16GB DIMMs are least 6-7 times more expensive per GB than the Optane SSD 900P, and putting more than 128GB of DRAM in an ATX motherboard is even more expensive.

Intel PR provided an example of using SideFX Houdini to render a high-resolution animation that included a 1.1 billion particle water simulation. Their test used a machine with a 10-core CPU and 64GB of RAM, and compared the 512GB Samsung 960 PRO against the 480GB Optane SSD 900P. The total memory requirements (DRAM+swap) of the rendering job were not disclosed, but the resulting 2.7x speedup is very plausible for a task that absolutely hammers the swap device. With a sufficiently high thread count to keep the queue depth high, that margin could be narrower (especially with the fastest 2TB 960 PRO), but then context switch overhead would become problematic. With the Optane SSD 900P, the random read latency is low enough that it would be hard to host more than two swap-limited threads per core without context switch overhead wasting more time than waiting on the SSD.

Star Citizen Bundle

Even though gaming isn't the ideal workload for the Optane SSD 900P to show off its performance, Intel is marketing the 900P to gaming enthusiasts. They're bundling a code for the game Star Citizen with the 900P, and including a new in-game spaceship variant as an exclusive item for Optane SSD customers. Intel has partnered with Star Citizen developer Roberts Space Industries (RSI) to hold a launch event for the 900P at CitizenCon 2017 today, which they are streaming live on Twitch and YouTube. Attendees will have the chance to playtest the Intel-exclusive Sabre Raven ship, but it is still undergoing final QA and will not be immediately available to Optane SSD 900P customers. The web page for redeeming the Star Citizen game code had not gone live as of the time of writing, so I was unable to attempt any testing with the game. (ed: I remember when AMD was offering a Star Citizen bundle in 2014 as well. The game still hasn't shipped.)

At the media briefing for the 900P, an RSI representative said they are exploring ways to optimize the Star Citizen experience on Optane SSDs, but not many specifics were provided. One approach under consideration is using less compression for some game assets, freeing up CPU time but relying on high storage performance. It didn't sound like this work was close to release. In the game's current state, RSI claims they've seen load times improve by 20-25%, but they didn't specify what other storage device they were comparing against.

Introduction Drive Features
POST A COMMENT

205 Comments

View All Comments

  • investlite - Monday, November 6, 2017 - link

    I bet you're still pissed we don't have flying cars. OMG, do you go to every new car unveiling and talk about how crappy each new car is because we were supposed to have flying cars by now? Reply
  • Gothmoth - Friday, October 27, 2017 - link

    when you do huge particle simulations you will want the fastest SSD you can get. Reply
  • ddriver - Friday, October 27, 2017 - link

    I've been doing that, for VFX as well as multiphysics simulations for over a decade. It has always been an in-memory thing. It doesn't seem they simulated it, it seems they read baked simulation data, and stored in some insanely inefficient manner at that.

    As I implied, this has got to be a new record in rigged benchmarks. Shame!
    Reply
  • extide - Friday, October 27, 2017 - link

    Sigh, the Optane drive didn't improve the performance of reading the data into the simulation. The simulation required (significantly?) more RAM than the system had. They put a big swap file on either a 960 PRO or on the Optane drive. It probably doesn't even matter where the simulation data was stored. Reply
  • ddriver - Friday, October 27, 2017 - link

    Well, they should have bought moar ram then. Then maybe they could have kept that CPU busy at 100% and get much better time.

    I mean it is not like hypetane offers terabytes of capacity. Topping at 480 GB - that's entirely doable in RAM. More expressive - sure, but nonetheless a perfectly sensible investment if you are doing such simulations. It will pay for itself, as RAM is tremendously faster, and also doesn't wear, at least nowhere nearly as much as xpoint does.
    Reply
  • extide - Friday, October 27, 2017 - link

    For that particular case, sure, maybe more ram is the way to go, but there are plenty of cases where the drive is better, like several of the ones I mentioned above. Most of those rely on the non-volatile aspect, which obviously RAM doesn't have. Reply
  • ddriver - Friday, October 27, 2017 - link

    As I said - it has its advantages and uses. I also said I might even buy it.

    And the only reason I call it hypetane is because intel shamelessly lied about it, and continues to cheat in order to make it look good even after it became evident that it is not anywhere nearly as good as they initially claimed, and call me old-fashioned, but I have a problem with that.

    It boggles the mind that people around here have such a problem with me just because I don't have my tongue up intel's rectum...
    Reply
  • ddriver - Friday, October 27, 2017 - link

    BTW, 1.1 billion particles, presuming the simulation is ran in FP64 mode, with x y and z coordinates for each particle would only require about 24 gigs of ram.

    Which raises the question, did they allocate each of those points on the heap or something?
    Reply
  • ddriver - Friday, October 27, 2017 - link

    Opsie, silly me, it would take another 24gb for the vector of force for each particle. Now it is a little more plausible that 64gb might not be enough. Reply
  • CaedenV - Sunday, October 29, 2017 - link

    Sure, in a perfect world you buy more RAM. But if you are in a situation where you don't have infinate cash and you can buy more RAM at $7/GB or an Optane SSD for ~$1-2/GB then Optane begins to look a bit more appealing. A 480GB drive running as cache for $600 vs 480GB of DDR4 at $3350.... that would make almost anyone thing twice.

    Or in the case of my work, we have a bunch of clustered servers, and we are maxxed out on ram but not yet ready to do a server upgrade (hoping to get 2 more years out of them), but we need more fast cache for a bunch of different applicaitons. The idea of running those caches on this kind of SSD sounds a lot more appealing than running on traditional SSDs.

    But yes, when we upgrade servers, we will simply have more RAM on board. That is the obvious solution. But when a motherboard can only hold 256GB of RAM and you need more... life is often about compromises, and Optane tech sounds like a good compromise. But what you would use this for in daily life or in a normal computer? Man, that totally beats me! This product is almost too cheap for what it is good for (business class SSDs typically cost more than $1.25/GB still and are far slower than consumer SSDs), and completely useless and overpriced for that they are advertising it for.
    Reply

Log in

Don't have an account? Sign up now