Revolutionizing Semiconductor Development With GPU-Enhanced Atomistic Modeling


There are many challenges in the development of a modern semiconductor chip, from front-end architecture simulation to final signoff. Volume manufacturing has its own set of challenges, while silicon lifecycle management (SLM) extends into field deployment and aging concerns. Underlying this entire development flow, however, lie the materials used to build the actual chips. Guiding the explorat... » read more

Embedded GPU: An Open-Source And Configurable RISC-V GPU Platform for TinyAI Devices (EPFL)


A new technical paper titled "e-GPU: An Open-Source and Configurable RISC-V Graphic Processing Unit for TinyAI Applications" was published by researchers at EPFL. Abstract "Graphics processing units (GPUs) excel at parallel processing, but remain largely unexplored in ultra-low-power edge devices (TinyAI) due to their power and area limitations, as well as the lack of suitable programming... » read more

Comparisons of HW Versus SW Implementation of Warp Level Features in Vortex RISC-V GPU (Georgia Tech, IIT)


A new technical paper titled "Hardware vs. Software Implementation of Warp-Level Features in Vortex RISC-V GPU" was published by researchers at Georgia Tech and Indian Institute of Technology Bombay. Abstract "RISC-V GPUs present a promising path for supporting GPU applications. Traditionally, GPUs achieve high efficiency through the SPMD (Single Program Multiple Data) programming model. Ho... » read more

Inference Framework For Deployment Challenges of Large Generative Models On GPUs (Google)


A new technical paper titled "Scaling On-Device GPU Inference for Large Generative Models" was published by researchers at Google and Meta Platforms. Abstract "Driven by the advancements in generative AI, large machine learning models have revolutionized domains such as image processing, audio synthesis, and speech recognition. While server-based deployments remain the locus of peak perform... » read more

Getting Real About AI Processors


There’s a lot of confusion and hype around AI. Nearly every service, product or subject area in the technology industry now has an AI label. A lot of this is valid and there’s no doubt that AI is opening up new capabilities and higher productivity across all industries. This white paper categorises AI and related hardware options, with a particular focus on on-device (i.e. edge) AI, givi... » read more

Redefining The Role Of The GPU In Next-Generation Vehicles


Automotive architectures are experiencing a seismic shift, with the traditional distributed architecture being steadily replaced with a more cost-effective centralized model. Even taken in isolation, this would be a significant driver on the computing needs of next-generation automotive SoCs, but the complexity of the problem is compounded when considered alongside the simultaneous rise of adva... » read more

Reverse Engineering NVIDIA GPU Cores (Universitat Politècnica de Catalunya)


A new technical paper titled "Analyzing Modern NVIDIA GPU cores" was published by Universitat Politècnica de Catalunya. Abstract "GPUs are the most popular platform for accelerating HPC workloads, such as artificial intelligence and science simulations. However, most microarchitectural research in academia relies on GPU core pipeline designs based on architectures that are more than 15 yea... » read more

GPU Analysis Identifying Performance Bottlenecks That Cause Throughput Plateaus In Large-Batch Inference


A new technical paper titled "Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference" was published by researchers at Barcelona Supercomputing Center, Universitat Politecnica de Catalunya, and IBM Research. Abstract "Large language models have been widely adopted across different tasks, but their auto-regressive generation nature often leads to inefficient resource util... » read more

GPU Or ASIC For LLM Scale-Up?


The CEOs of OpenAI, Anthropic, and xAI share a strikingly similar vision — AI's progress is exponential, it will change humanity, and its impact will be greater than most people expect. This is more than just speculation. The market for AI, and its value, are real today: A human developer with GitHub CoPilot codes 55% faster with AI. GPT-4 scores 88th percentile on the LSAT vs. 50t... » read more

RISC-V High Performance Multicore and GPU SoC Platform For Safety Critical System


A new technical paper titled "A RISC-V Multicore and GPU SoC Platform with a Qualifiable Software Stack for Safety Critical Systems" published by researchers at Universitat Politecnica de Catalunya and Barcelona Supercomputing Center. Abstract "In the context of the Horizon Europe project, METASAT, a hardware platform was developed as a prototype of future space systems. The platform is bas... » read more

← Older posts