RAD – Publikationen
AMD Research and Development (RAD) legt großen Wert auf die Veröffentlichung von wichtigen und von Experten geprüften Forschungsergebnissen auf Konferenzen und in Fachzeitschriften.
Die Links auf dieser Seite verweisen auf die zahlreichen Publikationen von RAD in den letzten Jahren.
2024
- AI-Based Approaches in Network Security – AI4Good 2024
- T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute and Collectives – ASPLOS 2024
- Integrating FPGA and GPU Acceleration to OpenMP Distributed Computing – FPL 2024
- Turn-based Spatiotemporal Coherence for GPUs – HiPEAC 2024
- Networking Technologies for Handling AI Workloads – ISC 2024
- Sustainable Computing at Scale – MODSIM 2024
2023
- Spectrum Usage and Occupancy Monitoring: Challenges and Software-Defined Radio Solutions – IIIE WCNC 2023
- Improving DNN Throughput Via Intelligent Concurrent GEMM Executions – arXiv 2023
- The Next Era for Chiplet Innovation – DATE 2023
- Leveraging MLIR to Design for AI Engines – FCCM 2023
- Reducing Internode Communication Using FPGA-Accelerated Neural Network Surrogate Models – FIRE 2023
- Navigating the Future Landscape of System-On-Chip Technology – IEEE SOCC 2023
- Tale of Two Cs: Computation vs. Communication Scaling for Future Transformers on Future Hardware – IISWC 2023
- SPARTA: Spatial Acceleration for Efficient and Scalable Horizontal Diffusion Weather Stencil Computation – ICS 2023
- Introduction to the AMD Versal ACAP Adaptable Intelligent Engine and to its Programming Model – SC 2023
- Innovative Approaches to AI with Adaptive Computing – SPL 2023
2022
- Demystifying BERT: System Design Implications – IISWC 2022
- A Case for Fine-grain Coherence Specialization in Heterogeneous Systems – TACO
- Virtual Coset Coding for Encrypted Non-Volatile Memories with Multi-Level Cells – HPCA 2022
- Data Convection: A GPU-Driven Case Study for Thermal-Aware Data Placement in 3D DRAMs – SIGMETRICS 2022
- Cloak: Tolerating Non-Volatile Cache Read Latency – ICS 2022
- Uncertainty Quantification Methods for ML-based Surrogate Models of Scientific Applications – NeurIPS 2022
- Eager Memory Cryptography in Caches – MICRO 2022
- Athena: An Early-Fetch Architecture To Reduce On-Chip Page Walk Latencies – PACT 2022
- Improving Energy Efficiency of Permissioned Blockchains Using FPGAs – ICPADS 2022
2021
- Analyzing and Leveraging Decoupled L1 Caches in GPUs – HPCA 2021
- Deadline-Aware Offloading for High-Throughput Accelerators – HPCA 2021
- Understanding Chiplets Today to Anticipate Future Integration Opportunities and Limits – DATE 2021
- Systems-on-Chip with Strong Ordering – TACO
- Pioneering Chiplet Technology and Design for AMD EPYC™ and Ryzen™ Processor Families – ISCA 2021 (Industry Track)
- Quantifying Server Memory Frequency Margin and Using it to Improve Performance in HPC Systems – ISCA 2021
- Interconnect Modeling for Homogeneous and Heterogeneous Multiprocessors – Springer (Book Chapter)
- Increasing GPU Translation Reach by Leveraging Under-Utilized On-Chip Resources – MICRO 2021
- DUB: Dynamic Underclocking and Bypassing in Network-on-Chip for Heterogeneous GPU Workloads – NOCS 2021
- A New Era of Tailored Computing (short paper) – VLSI-Symposium 2021
- Efficient Cache Utilization via Model-aware Data Placement for Recommendation Models – MEMSYS 2021
- Virtual Coset Coding for Encrypted Non-Volatile Memories with Multi-Level Cells – HPCA 2022
- Using neural networks to reduce communication in numerical solution of partial differential equations – NEURIPS 2021
- Using physics-informed regularization to improve extrapolation capabilities of neural networks – NEURIPS 2021
2020
- Kite: A Family of Heterogeneous Interposer Topologies Enabled via Accurate Interconnect Modeling – DAC 2020
- SeqPoint: Identifying Representative Iterations of Sequence-based Neural Networks – ISPASS 2020
- Improving the Utilization of Micro-operation Caches in x86 Processors – MICRO 2020
- Centaur: A Novel Architecture for Reliable, Low-Wear,High-Density 3D NAND Storage – SIGMETRICS 2020
- Analyzing and Leveraging Shared L1 Caches in GPUs – PACT 2020
- PreFAM: Understanding the Impact of Prefetching in Fabric-Attached Memory Architectures – MEMSYS 2020
- CFDNet: a deep learning-based accelerator for fluid simulations – ICS 2020
- Optimizing of Intercache Traffic Entanglement in Tagless Caches With Tiling Opportunities – TCAD 2020
- Optimizing of Intercache Traffic Entanglement in Tagless Caches With Tiling Opportunities – CASES 2020
- Independent Forward Progress of Work-groups – ISCA 2020
- Experiences with ML-Driven Design: A NoC Case Study – HPCA 2020
- GPU Initiated OpenSHMEM: Correct and Efficient Intra-Kernel Networking for dGPUs – PPoPP 2020
- Centaur: A Novel Architecture for Reliable, Low-Wear, High-Density 3D NAND Storage – SIGMETRICS 2020
- DSM: A Case for Hardware-Assisted Merging of DRAM Rows with Same Content – SIGMETRICS 2020