Developing Energy-Efficient AI Accelerators For Intelligent Edge Computing And Data Centers


Artificial intelligence (AI) accelerators are deployed in data centers and at the edge to overcome conventional von Neumann bottlenecks by rapidly processing petabytes of information. Even as Moore’s law slows, AI accelerators continue to efficiently enable key applications that many of us increasingly rely on, from ChatGPT and advanced driver assistance systems (ADAS) to smart edge device... » read more

What Is An Integrated Circuit?


In our modern world, just about everything is woven together by electronics. From microwaves to satellites, electronics-powered devices are infused into our every waking moment. Today, even our sleep includes digital acoustics, haptics, and analytics. But while the systems that light, connect, and move our lives can vary greatly, nearly every electronic device has one or more of the same fundam... » read more

Compiler-Driven Performance Boosts For GPNPUs


The GNU C Compiler – GCC – was first released in 1987. 36 years ago. Several version streams are still actively being developed and enhanced, with GCC13 being the most advanced, and a GCC v10.5 released in early July this year. You might think that with 36 years of refinement by thousands of contributors that penultimate performance has been achieved. All that could be discovered has bee... » read more

Fast, Accurate, Automated Via Insertion During Design Implementation Requires Foundry Rule Compliance


As the scaling of silicon technology proceeds, via resistance is becoming a dominant factor in integrated circuit (IC) yield, performance, and reliability. At advanced nodes, interconnects and via dimensions decrease, while the number of metallization layers increases. To moderate the impact of via resistance on yield and reliability and reduce electromigration (EM) and voltage drop (IR) effect... » read more

Neon Intrinsics In Rust


At the end of 2021, the Neon intrinsics in Rust were completed and the community proposed stabilizing them (not requiring a nightly compiler). The implementation of the Neon intrinsics was a large effort mostly undertaken by the Rust community so Arm would like to thank everyone involved in that. At the time of writing, all the Neon intrinsics that are Armv8.0-A are implemented and are stabi... » read more

Cleaning Marine Geometries Has Never Been Easier


Ship designers and naval architects increasingly use computational fluid dynamics (CFD) tools for more accurate solutions, detailed physics, and quicker results. Marine ship design studies in the past relied mainly on scaled-down models in towing tanks for insights into ship resistance, seakeeping, propulsion, and maneuvering. However, these models had discrepancies in their Reynolds and Fro... » read more

A Packet-Based Architecture For Edge AI Inference


Despite significant improvements in throughput, edge AI accelerators (Neural Processing Units, or NPUs) are still often underutilized. Inefficient management of weights and activations leads to fewer available cores utilized for multiply-accumulate (MAC) operations. Edge AI applications frequently need to run on small, low-power devices, limiting the area and power allocated for memory and comp... » read more

Evolution Of Equalization Techniques In High-Speed SerDes For Extended Reaches


The relentless demand for massive amounts of data is accelerating the pace of high-performance computing (HPC) within the high-speed Ethernet realm. This escalation, in turn, intensified the complexity associated with designing networking SoCs, including switches, NICs, retimers, and pluggable modules. Such growth is accelerating the demand for bandwidth hungry applications to transition from 4... » read more

Analog IP Reuse


Analog integrated circuit IP is essential to how microelectronic circuits and systems interact with the environment. It enables things like signal conversion, stable power supply, and communication in state-of-the-art devices. However, designing these critical components – even though they are often a small part of complex chips – is very costly and risk-prone. And in today’s analog field... » read more

Generative AI Training With HBM3 Memory


One of the biggest, most talked about application drivers of hardware requirements today is the rise of Large Language Models (LLMs) and the generative AI which they make possible.  The most well-known example of generative AI right now is, of course, ChatGPT. ChatGPT’s large language model for GPT-3 utilizes 175 billion parameters. Fourth generation GPT-4 will reportedly boost the number of... » read more

← Older posts Newer posts →