Study Of HW Acceleration for Neural Networks (Arizona State Univ.)


A new technical paper titled "Hardware Acceleration for Neural Networks: A Comprehensive Survey" was published by researchers at Arizona State University. Abstract "Neural networks have become a dominant computational workload across cloud and edge platforms, but their rapid growth in model size and deployment diversity has exposed hardware bottlenecks that are increasingly dominated by mem... » read more

Faster Mask Synthesis With GPUs


Design teams face rising pressure to deliver larger chips with higher transistor densities on tighter schedules using advanced node processing. The computing demands of modern applications, especially those making heavy use of AI, are extending pressure beyond design to every step of the development flow, including manufacturing, where photolithography and mask synthesis must keep pace. This po... » read more

Efficient Synchronous Dataflow Execution For GPUs (NVIDIA, UW-Madison)


A new technical paper titled "Kitsune: Enabling Dataflow Execution on GPUs with Spatial Pipelines" was published by researchers at NVIDIA and the University of Wisconsin-Madison. Abstract "State-of-the-art DL models are growing in size and complexity, with many modern models also increasing in heterogeneity of behavior. GPUs are still the dominant platform for DL applications, relying on ... » read more

Building Custom Graphics Cards For Cloud Gaming


The global cloud gaming market will reach over $20B by 2030, with Asia Pacific representing 45% of the opportunity according to Grandview Research. However, incumbent GPU solutions were designed for data center compute, not the unique economics of cloud gaming, where profitability depends on maximizing concurrent users per GPU while maintaining a premium user experience. For companies devel... » read more

GPU Enables Surround View In Automotive Domain Controller


In recent years, the capabilities of Advanced Driver Assistance Systems (ADAS) have flourished. Nearly half of all car sales in the USA offer Level 2 capabilities (such as lane keeping and adaptive cruise control) or higher, and China is pushing the market further towards Level 3 (conditional automation with driver oversight) and beyond. The advanced functionality offered by these ADAS and a... » read more

Co-Optimizing GPU Architecture And SW To Enhance Edge Inference Performance (NVIDIA)


A new technical paper titled "EdgeReasoning: Characterizing Reasoning LLM Deployment on Edge GPUs" was published by researchers at NVIDIA. Abstract "Edge intelligence paradigm is increasingly demanded by the emerging autonomous systems, such as robotics. Beyond ensuring privacy-preserving operation and resilience in connectivity-limited environments, edge deployment offers significant energ... » read more

MIT’s Survey On Accelerators and Processors for Inference, With Peak Performance And Power Comparisons


A new technical paper titled "Lincoln AI Computing Survey (LAICS) and Trends" was published by researchers at MIT Lincoln Laboratory Supercomputing Center. Abstract "In the past year, generative AI (GenAI) models have received a tremendous amount of attention, which in turn has increased attention to computing systems for training and inference for GenAI. Hence, an update to this survey is ... » read more

Critical Factors For Storing Data In DRAM


DRAM is becoming more complicated to develop, and more difficult to manage inside AI data centers. In the past, latency, bandwidth, and capacity were the primary considerations. But as the amount of data that needs to be processed, moved, and stored continues to rise, a whole new set of factors is emerging. Steven Woo, fellow and distinguished inventor at Rambus, talks about latency under load,... » read more

Comprehensive Performance Study of Zero-Knowledge Proofs on GPUs (Univ. of Michigan)


A new technical paper titled "ZKProphet: Understanding Performance of Zero-Knowledge Proofs on GPUs" was published by researchers at University of Michigan. Abstract "Zero-Knowledge Proofs (ZKP) are protocols which construct cryptographic proofs to demonstrate knowledge of a secret input in a computation without revealing any information about the secret. ZKPs enable novel applications in p... » read more

GPU Driver Update Adds Support For Additional Vulkan And OpenCL Extensions


Here are some of the highlights of what has been updated in the latest Imagination GPU Linux and Android Driver Development Kits: Leveraging cooperative matrix in Vulkan To help accelerate graphics post-processing, neural shaders, physics simulations, and machine learning inference on the GPU, DDK 25.2 implements support for VK_KHR_cooperative_matrix. This extension provides Vulkan developers... » read more

← Older posts Newer posts →