Redefining The Role Of The GPU In Next-Generation Vehicles


Automotive architectures are experiencing a seismic shift, with the traditional distributed architecture being steadily replaced by a more cost-effective centralized model. Even taken in isolation, this would be a significant driver of the computing needs of next-generation automotive SoCs, but the complexity of the problem is compounded when considered alongside the simultaneous rise of adva...

Reverse Engineering NVIDIA GPU Cores (Universitat Politècnica de Catalunya)


A new technical paper titled "Analyzing Modern NVIDIA GPU cores" was published by Universitat Politècnica de Catalunya. Abstract "GPUs are the most popular platform for accelerating HPC workloads, such as artificial intelligence and science simulations. However, most microarchitectural research in academia relies on GPU core pipeline designs based on architectures that are more than 15 yea...

GPU Analysis Identifying Performance Bottlenecks That Cause Throughput Plateaus In Large-Batch Inference


A new technical paper titled "Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference" was published by researchers at Barcelona Supercomputing Center, Universitat Politecnica de Catalunya, and IBM Research. Abstract "Large language models have been widely adopted across different tasks, but their auto-regressive generation nature often leads to inefficient resource util...

GPU Or ASIC For LLM Scale-Up?


The CEOs of OpenAI, Anthropic, and xAI share a strikingly similar vision: AI's progress is exponential, it will change humanity, and its impact will be greater than most people expect. This is more than just speculation. The market for AI, and its value, are real today: A human developer with GitHub Copilot codes 55% faster with AI. GPT-4 scores 88th percentile on the LSAT vs. 50t...

RISC-V High Performance Multicore and GPU SoC Platform For Safety Critical System


A new technical paper titled "A RISC-V Multicore and GPU SoC Platform with a Qualifiable Software Stack for Safety Critical Systems" was published by researchers at Universitat Politecnica de Catalunya and Barcelona Supercomputing Center. Abstract "In the context of the Horizon Europe project, METASAT, a hardware platform was developed as a prototype of future space systems. The platform is bas...

Speeding Up Computational Lithography With The Power And Parallelism Of GPUs


There are so many challenges in producing modern semiconductor devices that it’s amazing for the industry to pull it off at all. From the underlying physics to fabrication processes to the development flow, there is no shortage of tough issues to address. Some of the biggest arise in lithography for deep submicron chips. A recent post outlined the major trends in lithography and summarized a ...

HW-Aligned Sparse Attention Architecture For Efficient Long-Context Modeling (DeepSeek et al.)


A new technical paper titled "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention" was published by DeepSeek, Peking University and University of Washington. Abstract "Long-context modeling is crucial for next-generation language models, yet the high computational cost of standard attention mechanisms poses significant computational challenges. Sparse attention...

Uncore Frequency Scaling For Energy Optimization In Heterogeneous Systems (UIC, Argonne)


A new technical paper titled "Exploring Uncore Frequency Scaling for Heterogeneous Computing" was published by researchers at University of Illinois Chicago and Argonne National Laboratory. Abstract "High-performance computing (HPC) systems are essential for scientific discovery and engineering innovation. However, their growing power demands pose significant challenges, particularly as sys...

Transforming Industrial IoT With Edge AI And AR


The Internet of Things (IoT) has evolved significantly from its early days of centralized cloud processing. Initially, IoT applications relied heavily on cloud-based data processing, where data from various devices was collected, processed, and analyzed in the cloud before insights were sent back to the devices. While effective, this approach has limitations, particularly in environments requir...

The Use Of GPU Compute In Automotive


The pace of innovation in automotive is accelerating. Electrification, advanced driver assistance systems (ADAS) and vehicle connectivity are revolutionizing the in-car experience, which is now largely determined by the capabilities of the car’s software and electronic hardware. When a vehicle can receive software upgrades while it is on the road, the electronic control units (ECUs) that a...
