
 AMD FirePro™ Accelerators for High Performance Compute

The World’s First GPU Accelerator with 1TB/s Memory Bandwidth

Accelerate your most complex HPC workloads in data analytics or seismic processing on the world’s fastest single-precision compute GPU accelerator, the AMD FirePro™ S9300 x2 Server GPU.6,7

AMD FirePro™ accelerators are a compelling solution for HPC, offering impressive compute density and exceptional performance per watt. Every year, more supercomputers equipped with GPU accelerators appear on the Green500™ list as research institutes, government bodies, and science and engineering firms realize the benefits of GPU acceleration for crunching complex datasets.

AMD continues to make waves in this industry as it introduces cutting edge technologies to the HPC community. In 2014, the L-CSC supercomputer achieved the #1 ranked position on the Green500, a list of the world’s most energy efficient supercomputers, with the AMD FirePro™ S9150. In 2015, AMD introduced the FirePro™ S9170, a GPU equipped with 32GB of GDDR5 memory, the most on-board GPU memory available today, and new for 2016 is the AMD FirePro™ S9300 x2, the industry’s first accelerator to offer 1TB/s memory bandwidth using cutting-edge HBM memory technology. All of these new products and features further enhance the experience of having accelerators in the data center, and more organizations are coming on board as they realize the performance potential and benefits of these solutions.

GPUOpen and Radeon Open Compute Platform (ROCm)


The ROCm Platform opens new opportunities for GPU computing research. ROCm comprises three core elements:

  • A new Linux® driver and runtime stack optimized for HPC and ultra-scale class computing
  • Heterogeneous C and C++ compilers that address the whole system, not just a single device
  • The HIP tool, which converts CUDA code, acknowledging the need for platform choice when using GPU computing APIs

Using our knowledge of the HSA standard and, more importantly, the HSA 1.0 Runtime, AMD has been able to successfully extend support to accelerators with critical features for NUMA class acceleration. As a result, the Radeon Open Compute Kernel (ROCK) driver is composed of several components based on our efforts to develop the Heterogeneous System Architecture for APUs, including the new AMD GPU driver, the Kernel Fusion Driver (KFD), the HSA+ Runtime and an LLVM-based compilation stack for the building of key language support. This support starts with the AMD FirePro™ S9300 x2 accelerator, and support is planned to expand out to include future products.


OpenCL™, OpenMP and OpenACC Support


AMD continues to support these standards on our latest product offerings1. We believe that most people in the HPC community want open standards as the de facto way of running their projects and simulations, and AMD is committed to supporting this goal and is working extensively with the community to drive open standards forward.

 AMD FirePro™ S-Series Accelerators

AMD FirePro™ S9300 x2 Accelerator

The World’s First GPU Accelerator with 1TB/s Memory Bandwidth

Accelerate your most complex HPC workloads in data analytics or seismic processing on the world’s fastest single-precision compute GPU accelerator, the AMD FirePro™ S9300 x2 Server GPU.2,3 Take advantage of the numerous tools and libraries available, including ROCm tools, on our developer page at www.gpuopen.com.

One of our customers, CGG, a leader in cutting-edge geoscience, recently ran proprietary wave equation modelling benchmarks on several GPU accelerators, including the new AMD FirePro™ S9300 x2 GPU. As the complexity of the wave equation increased, the performance advantage of the AMD FirePro™ S9300 x2 GPU grew, to the point where it was 2x faster than any other card tested.4

[Figure: Wave Table. Chart provided by CGG.]

AMD FirePro™ S9100, S9150 and S9170 Accelerators

Those who are looking for great double precision performance can turn to the AMD FirePro™ S9100 series of accelerators. The AMD FirePro™ S9150, powering the #1 ranked supercomputer on the 2014 Green500 list, easily surpasses the competition by offering over 50% more double precision performance than the comparable Tesla K40.5

Watch the video interview with Dr. David Rohr and Professor Lindenstruth about the L-CSC cluster, the #1 ranked supercomputer on the 2014 Green500.

DGEMM, or Double-precision General Matrix-Matrix multiplication, measures floating point execution rate for double precision, real matrix-matrix multiplication. There are many real-world applications that take advantage of double-precision matrix operations. These include computational fluid dynamics, finite element analysis and structural modelling, and molecular dynamics.
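As a rough illustration of what a DGEMM benchmark measures, the sketch below implements the operation C = alpha·A·B + beta·C in plain Python and uses the conventional count of 2·m·n·k floating-point operations per call. The function names and the timing helper are illustrative assumptions, not AMD's benchmark code, and a pure-Python loop is orders of magnitude slower than a tuned BLAS library.

```python
import time


def dgemm(alpha, a, b, beta, c):
    """Return alpha * (a x b) + beta * c for matrices stored as lists of rows."""
    m, k, n = len(a), len(b), len(b[0])
    result = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            acc = 0.0
            for p in range(k):
                acc += a[i][p] * b[p][j]  # one multiply and one add
            result[i][j] = alpha * acc + beta * c[i][j]
    return result


def dgemm_flops(m, n, k):
    """Conventional DGEMM operation count: 2 operations per inner-loop step."""
    return 2 * m * n * k


def measure_gflops(size):
    """Time one square DGEMM call and convert it to an execution rate in GFLOPS."""
    a = [[1.0] * size for _ in range(size)]
    b = [[2.0] * size for _ in range(size)]
    c = [[0.0] * size for _ in range(size)]
    start = time.perf_counter()
    dgemm(1.0, a, b, 0.0, c)
    elapsed = max(time.perf_counter() - start, 1e-9)  # guard against a zero reading
    return dgemm_flops(size, size, size) / elapsed / 1e9


if __name__ == "__main__":
    print(f"Pure-Python DGEMM rate at N=64: {measure_gflops(64):.4f} GFLOPS")
```

A sustained DGEMM figure such as those quoted for GPU accelerators is the same ratio (operations divided by elapsed time) measured with an optimized library on large matrices.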

With our AMD OpenCL BLAS implementation, the AMD FirePro™ S9150 GPU achieves 2 TFLOPS of sustained DGEMM performance, while the Tesla K40 achieves 1.3 TFLOPS.

The AMD FirePro™ S9170 GPU is well suited to large matrix-matrix multiplication, where the card's 32GB of GDDR5 memory can be put to use. The Nvidia K80 and K40, with 24GB and 12GB of memory respectively, cannot hold matrices that exceed their smaller on-board memory.
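To make the memory argument concrete, the sketch below estimates the largest square DGEMM problem whose three N×N double-precision operands (A, B and C) fit entirely in on-board memory. It takes the capacities quoted above at face value, assumes 8 bytes per element, treats 1GB as 2^30 bytes, and ignores driver and workspace overhead, so real limits will be somewhat lower. The helper name is hypothetical.

```python
import math

BYTES_PER_DOUBLE = 8
GIB = 2 ** 30  # assumption: 1GB of on-board memory is counted as 2^30 bytes


def max_square_dgemm_dim(memory_gb, matrices=3):
    """Largest N such that `matrices` N x N double-precision arrays fit in memory."""
    budget = memory_gb * GIB
    return math.isqrt(budget // (matrices * BYTES_PER_DOUBLE))


if __name__ == "__main__":
    for name, gb in [("FirePro S9170", 32), ("K80", 24), ("K40", 12)]:
        print(f"{name} ({gb}GB): N <= {max_square_dgemm_dim(gb)}")
```

Because capacity grows with N squared, the gain in maximum matrix dimension is the square root of the gain in memory.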

 AMD FirePro™ S-Series Specifications

AMD FirePro™ Server GPU Solutions for High Performance Compute

From academic research in computational fluid dynamics to oil and gas companies working on seismic processing and reservoir simulation, AMD FirePro™ S-Series server GPUs provide a complete product stack that can cater to practically any of your needs. With cutting-edge single-precision and double-precision compute performance, AMD FirePro server GPUs are the solution for any computationally complex project requiring the massive parallel processing capabilities of a GPU.2

                            AMD FirePro™ S9100     AMD FirePro™ S9150     AMD FirePro™ S9170     AMD FirePro™ S9300 x2
GCN Stream Processors       2560                   2816                   2816                   8192
Single-Precision (GFLOPS)   4220                   5070                   5240                   13900
Double-Precision (GFLOPS)   2110                   2530                   2620                   870
On-board Memory             12GB GDDR5             16GB GDDR5             32GB GDDR5             8GB HBM
ECC                         Yes (external)         Yes (external)         Yes (external)         No
Memory Bandwidth (GB/s)     320                    320                    320                    1024
Interface                   PCIe 3.0, Dual slot    PCIe 3.0, Dual slot    PCIe 3.0, Dual slot    PCIe 3.0, Dual slot
Max Power                   225W                   235W                   275W                   300W
Cooling                     Passive Heatsink       Passive Heatsink       Passive Heatsink       Passive Heatsink

Recommended for

  • S9100, S9150 and S9170: double precision workflows such as academic and government clusters, and oil & gas reservoir simulation
  • S9300 x2: single-precision workloads such as molecular dynamics, geoscience, and deep neural networks/machine learning
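The specification figures can also be read in terms of efficiency. The sketch below derives peak GFLOPS per watt from the single-precision, double-precision and maximum power values listed above. It uses peak rates and maximum board power only, so it is a theoretical comparison under those assumptions rather than a measured result. The dictionary layout and function names are illustrative.

```python
# Peak figures copied from the specification table (GFLOPS and watts).
SPECS = {
    "S9100": {"sp": 4220, "dp": 2110, "watts": 225},
    "S9150": {"sp": 5070, "dp": 2530, "watts": 235},
    "S9170": {"sp": 5240, "dp": 2620, "watts": 275},
    "S9300 x2": {"sp": 13900, "dp": 870, "watts": 300},
}


def gflops_per_watt(card, precision):
    """Peak GFLOPS divided by maximum board power; precision is 'sp' or 'dp'."""
    spec = SPECS[card]
    return spec[precision] / spec["watts"]


def best_card(precision):
    """Card with the highest peak GFLOPS per watt for the given precision."""
    return max(SPECS, key=lambda card: gflops_per_watt(card, precision))


if __name__ == "__main__":
    for card in SPECS:
        sp = gflops_per_watt(card, "sp")
        dp = gflops_per_watt(card, "dp")
        print(f"{card}: SP {sp:.1f} GFLOPS/W, DP {dp:.1f} GFLOPS/W")
    print("Best SP per watt:", best_card("sp"))
    print("Best DP per watt:", best_card("dp"))
```

Under these assumptions the S9300 x2 leads on single precision per watt and the S9150 on double precision per watt, which is consistent with the workloads recommended for each card.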

 

 Where to Buy

AMD FirePro™ accelerators are available from a number of OEMs and system integrators (SIs), including Dell, HPE and SuperMicro.


For more information on AMD FirePro™ GPU-equipped Dell servers, visit www.amd.com/dell-server

For more information on AMD FirePro™ GPU-equipped HPE servers, visit www.amd.com/hpe

Footnotes