Who is the Optane SSD 900P for?

With a price per GB a little over twice that of the the fastest flash-based consumer SSDs, the Optane SSD 900P is an exclusive high-end product. For most desktop usage, drives like the 960 PRO are already fast enough to make storage no longer a severe bottleneck. The most noticeable delays due to storage performance on a 960 PRO are when moving around large files, and the Optane SSD doesn't offer any significant improvement to sequential transfer speeds. Random writes can be a challenge for flash-based SSDs, but volatile write caches and SLC caches allow them to handle short bursts with very high performance.

The unprecedented random read performance of the Optane SSD 900P is its biggest strength on paper, but not one that will often lead to a proportional speedup in overall application performance. Too many programs and filesystems are still designed with mechanical hard drive performance in mind as the baseline, and further increases to SSD performance serve mainly to shift the bottlenecks further onto the CPU, RAM, network, and even the user's own reaction time.

The scenarios where a drive like the Optane SSD 900P can offer meaningful and worthwhile performance improvements can be broadly categorized as as situations where the Optane SSD can help with one of two problems:

1. Storage is too slow

About the only time a desktop could challenge the sequential access performance of a high-end PCIe SSD (based on flash or 3D XPoint) is when dealing with high resolution uncompressed video. The Optane SSD doesn't help much here because of its limited capacity, and the PCIe 3 x4 link itself is a bottleneck at the highest refresh rates and bit depths. For video work, flash-based SSDs are definitely a better choice, and RAID arrays of cheaper SATA SSDs may be a better option than PCIe SSDs. Desktop workloads that require extremely high sustained random write performance are very rare, and SLC caching on a flash-based SSD nicely takes care of most realistic quantities of random writes.

That said, there are some situations where higher random read performance can be quite noticeable. Searching through a large volume of data is a common case, such as searching through a video, but it usually presents enough opportunities for parallelization that the drive's queue depth will climb up to the range where flash-based SSDs come close to the Optane SSD. Game level load times can in theory benefit greatly from faster read speeds, but in practice decompressing the assets after loading them into RAM quickly becomes the bottleneck. Most of the other situations where the performance advantage of the Optane SSD will really help are better described as a different kind of problem:

2. RAM is too small

In the workstation market, there are abundant examples of compute tasks with a memory working set that doesn't fit in RAM. Almost any simulation or rendering task will have a parameter for mesh density or particle count that can very quickly scale the memory requirements from a few GB to tens or hundreds of GB. An Optane SSD is far slower than four to eight channels of DDR4, but 16GB DIMMs are least 6-7 times more expensive per GB than the Optane SSD 900P, and putting more than 128GB of DRAM in an ATX motherboard is even more expensive.

Intel PR provided an example of using SideFX Houdini to render a high-resolution animation that included a 1.1 billion particle water simulation. Their test used a machine with a 10-core CPU and 64GB of RAM, and compared the 512GB Samsung 960 PRO against the 480GB Optane SSD 900P. The total memory requirements (DRAM+swap) of the rendering job were not disclosed, but the resulting 2.7x speedup is very plausible for a task that absolutely hammers the swap device. With a sufficiently high thread count to keep the queue depth high, that margin could be narrower (especially with the fastest 2TB 960 PRO), but then context switch overhead would become problematic. With the Optane SSD 900P, the random read latency is low enough that it would be hard to host more than two swap-limited threads per core without context switch overhead wasting more time than waiting on the SSD.

Star Citizen Bundle

Even though gaming isn't the ideal workload for the Optane SSD 900P to show off its performance, Intel is marketing the 900P to gaming enthusiasts. They're bundling a code for the game Star Citizen with the 900P, and including a new in-game spaceship variant as an exclusive item for Optane SSD customers. Intel has partnered with Star Citizen developer Roberts Space Industries (RSI) to hold a launch event for the 900P at CitizenCon 2017 today, which they are streaming live on Twitch and YouTube. Attendees will have the chance to playtest the Intel-exclusive Sabre Raven ship, but it is still undergoing final QA and will not be immediately available to Optane SSD 900P customers. The web page for redeeming the Star Citizen game code had not gone live as of the time of writing, so I was unable to attempt any testing with the game. (ed: I remember when AMD was offering a Star Citizen bundle in 2014 as well. The game still hasn't shipped.)

At the media briefing for the 900P, an RSI representative said they are exploring ways to optimize the Star Citizen experience on Optane SSDs, but not many specifics were provided. One approach under consideration is using less compression for some game assets, freeing up CPU time but relying on high storage performance. It didn't sound like this work was close to release. In the game's current state, RSI claims they've seen load times improve by 20-25%, but they didn't specify what other storage device they were comparing against.

Introduction Drive Features
POST A COMMENT

205 Comments

View All Comments

  • Hurr Durr - Friday, October 27, 2017 - link

    He desperately needs some CNC time to calm down, be merciful. Reply
  • Reflex - Friday, October 27, 2017 - link

    We need to compile a list of all the things you supposedly have been doing. I mean seriously, anything anyone brings up as a use case supposedly you've been doing forever at a professional level. And you use that imagined position to then attempt to clobber their own assertions about its applicability in the field they are discussing. Reply
  • ddriver - Friday, October 27, 2017 - link

    What can I say, I am a renaissance man. Also known as a polymath. I have scores of interests in all sorts of fields or science, arts and crafts, and I have employed many of those to make a living over the years. Where you collect pokemons, or postal stamps, or baseball hats or whatever, I collect skill and knowledge. Even the act of commenting here is part of my studies, social and psychological. Reply
  • Reflex - Friday, October 27, 2017 - link

    Not only have you pursued your 'scores' of interests, you are the expert in all of them! We are truly gifted by your presence, I cannot wait to read your memoirs.... Reply
  • MarceloC - Friday, October 27, 2017 - link

    "What can I say, I am a renaissance man. Also known as a polymath. I have scores of interests in all sorts of fields or science, arts and crafts, and I have employed many of those to make a living over the years. Where you collect pokemons, or postal stamps, or baseball hats or whatever, I collect skill and knowledge. Even the act of commenting here is part of my studies, social and psychological."
    LOL LOL LOL!
    Reply
  • sor - Friday, October 27, 2017 - link

    You clearly don’t care about immediate persistence, consistent performance, or random reads.

    The numbers make a clear argument that Optane won’t be all that helpful for light workloads, but we already expected that; there’s only so much super fast IO can do for you if you’re not using IO.

    I’d say those insane random reads and improved latencies live up to 3D Xpoint. There may be some elements to improve on but it is clear we are looking at a different, superior tech at a cost that is surprisingly affordable. Much of the initial talk about Xpoint was around whether or not it was real, if it was a hyped twist on regular flash, it seems clear it is a different animal.
    Reply
  • ddriver - Friday, October 27, 2017 - link

    You clearly didn't pay attention to what I've written. I did indeed acknowledge the random reads as an obviously strong point.

    Judging by the "destroyer", it won't be that helpful in heavy workloads either. In fact, contrary to what you say, where it shines the most is exactly light workloads, which is what low QDs are. Push higher queue depths and NVME catches up.
    Reply
  • Lolimaster - Saturday, October 28, 2017 - link

    With most work using big files, 4k random performance is meaningless, everything is linear so huge sustained write/reads are prefered. 960EVO/Pro wins. Reply
  • ionstorm66 - Sunday, October 29, 2017 - link

    I wonder how much more performance that same test system would get if it had 128GB of ram. The 900P 280GB is 389 bucks, 64GB kit of ram is 500 bucks. I'd take 64GB of system memory over 280gb of ssd storage anyday, especially with a system with a 960 pro already. Reply
  • mapesdhs - Tuesday, October 31, 2017 - link

    Unless of course you're doing something that needs ECC, in which case that cost comparison is rather wide of the mark. Reply

Log in

Don't have an account? Sign up now