Radeon Open Compute Platform (ROCm)
Comprised of an open-source Linux® driver optimized for compute, support for GPU acceleration using a new compiler to process code written in the C++ programming language, and other developer tools such as the Heterogeneous-compute Interface for Portability (HIP) Tool to port code written for CUDA to C++.
ROCm is built for scale; it supports multi-GPU peer-to-peer computing including communication through RDMA.
ROCm has a rich system run time with the critical features that large-scale application, compiler and language-run-time development requires.
HSA Compliant Runtime and Driver for AMD Radeon™ and FirePro™ GPU’s
Heterogeneous-compute Interface for Portability (HIP) Tool
Easily convert your code to C++ with this free, open source tool, while maintaining compatibility with CUDA compilers. The HIP tool allows developers to port the majority of their CUDA code over to C++ in a snap. Get started on the AMD FirePro S9300 x2 GPU, an open-source friendly accelerator from AMD, today.
OpenCL™ 1.2 Support
Helps professionals tap into the parallel computing power of modern GPUs and multicore CPUs to accelerate compute-intensive tasks in leading CAD/CAM/CAE and Media & Entertainment applications that support OpenCL. The AMD FirePro S9300 x2 Server GPU supports OpenCL™ 1.2, allowing developers to take advantage of new features that give GPUs more freedom to do the work they are designed to do.
13.9 TFLOPS of Peak Single Precision
Helps speed up time required to complete single precision floating point operations used within Simulations, Video Enhancement, Signal Processing, Video Transcoding and Digital Rendering applications where high performance takes precedence over accuracy. With the AMD FirePro™ S9300 x2 delivering 13.9 TFLOPS of peak single precision compute performance, one can configure a 2P server with 8 GPUs to achieve over 111 TFLOPS of peak single precision compute performance. In a standard 42U rack with 10x 4U servers, that’s potentially over 1 PFLOP of single precision compute performance!
870 GFLOPS of Peak Double Precision
Helps speed up time required to complete double precision floating point operations used within Computational Fluid Dynamics, Structural Mechanics, Reservoir Simulation and Aerodynamics applications, where numerical precision is mission critical.
Half Precision (FP16) Support
Developers who do not need the accuracy of 32-bit mathematical operations can now use 16-bit operations to help achieve high performance through a more efficient use of memory bandwidth and reduced memory footprint.
8GB HBM Memory
HBM is a new type of memory design with low power consumption and ultra-wide communication lanes. It uses vertically stacked memory chips interconnected by microscopic wires called “through-silicon vias,” or TSVs, placed directly onto the interposer, shortening the distance information has to travel between memory and processor.
AMD PowerTune Technology is an intelligent power management system that monitors both GPU activity and power draw. AMD PowerTune optimizes the GPU to deliver low power draw when GPU workloads do not demand full activity and delivers the optimal clock speed to ensure the highest possible performance within the GPU’s power budget for high intensity workloads.5