A new technical paper titled “A Hardware-Aware Failure-Detection Method for GPU Control-Logic” was published by researchers at Hitachi, Ltd., Osaka University, and Kyoto University.
Excerpt
“Various failure detection methods have been proposed for SDCs caused by faults in data units such as registers. However, effective methods for detecting SDCs resulting from faults in control logic, such as scheduling units, have not yet been established. This paper assumes three types of control-logic failures for a general GPU architecture and proposes efficient failure detection methods for each type.”
Find the technical paper here. July 2025.
H. Itsuji, T. Uezono, T. Toba, K. Ito and M. Hashimoto, “A Hardware-Aware Failure-Detection Method for GPU Control-Logic,” in IEEE Access, vol. 13, pp. 113890-113904, 2025, doi: 10.1109/ACCESS.2025.3584759. Creative commons license.

Leave a Reply