Can You Rely Upon Your NPU Vendor To Be Your Customers’ Data Science Team?


The biggest mistake a chip design team can make in evaluating AI acceleration options for a new SoC is to rely entirely upon spreadsheets of performance numbers from the NPU vendor without going through the exercise of porting one or more new machine learning networks themselves using the vendor toolsets. Why is this a huge red flag? Most NPU vendors tell prospective customers that (1) the v... » read more

HBM4 Feeds Generative AI’s Hunger For More Memory Bandwidth


Generative AI (Gen AI), built on the exponential growth of Large Language Models (LLMs) and their kin, is one of today’s biggest drivers of computing technology. Leading-edge LLMs now exceed a trillion parameters and offer multimodal capabilities so they can take a broad range of inputs, whether they’re in the form of text, speech, images, video, code, and more, and generate an equally broa... » read more

Understanding The Total Cost Of Ownership In HPC And AI Systems


Cost is often the deciding factor when it comes to purchasing decisions at an organization, particularly those dealing with high-tech investments. When organizations evaluate proposals for new procurements, the initial capital cost of the system often receives significant attention. A great deal of preparation and planning goes into the decision to make a large purchase. While this is a critica... » read more

From Mobile Phones To Robotics: How The Industry Continues To Drive Innovation


I recently had the opportunity to host Pierre Cambou, Principal Analyst for Global Semiconductors at Yole Group, on the Advantest podcast. What struck me about our conversation was while we focused on what was going on in the mobile market, the entire talk was reflective of the cyclical nature of the semiconductor industry and how technology can drive intense cycles of innovation. As Pierre ... » read more

Liquid Cooling, Meeting The Demands Of AI Data Centers


Many Porsche “purists” reflect forlornly upon the 1997, 5th generation, 996 version of the iconic 911 sports car. It was the first year of the water-cooled engine versions of the 911, which had previously been based on air-cooled engines since their entry into the market in 1964. The 911 was also the successor to the popular air-cooled 356. For over three decades, Porsche’s flagship 911 w... » read more

Chip Industry Week in Review


The Biden-Harris Administration announced preliminary terms with HP for $50 million in direct funding under the CHIPs and Science Act to support the expansion and modernization of HP’s existing microfluidics and microelectromechanical systems (“MEMS”) facility in Corvallis, Oregon. CHIPS for America launched the CHIPS Metrology Community, a collaborative initiative designed to advance ... » read more

PCIe Over Optical: Transforming High-Speed Data Transmission


With the rise in AI requiring new computing models and enhanced data transmission methods to cope, the necessity for innovative, high-performance, and low-latency connectivity solutions has never been more apparent. PCIe over Optical is set to play a key role in enabling the growth of AI, and here we examine some of the intricacies of PCIe over Optical to explore its implementation, challenges,... » read more

AI’s Role In Chip Design Widens, Drawing In New Startups


Using AI in EDA is reinvigorating the whole tools industry, prompting established players to upgrade their tool offerings with AI/ML features, while drawing in startups trying to carve out differentiated approaches to fill unaddressed gaps with new tools and methodologies. Today’s new generation of entrepreneurs is comprised of both young post-grads with innovative ideas and industry veter... » read more

GDDR7: The Ideal Memory Solution In AI Inference


The generative AI market is experiencing rapid growth, driven by the increasing parameter size of Large Language Models (LLMs). This growth is pushing the boundaries of performance requirements for training hardware within data centers. For an in-depth look at this, consider the insights provided in "HBM3E: All About Bandwidth". Once trained, these models are deployed across a diverse range of... » read more

Accelerating The Pace And Precision Of AI Chip Innovation


The Hot Chips 2024 conference, which took place this week in Silicon Valley, was a showcase for AI chip innovation. The three-day program illustrated the race among both established chipmakers and new entrants to explore advanced architectures and embrace novel design solutions to deliver the next breakthrough AI processor. In this article, I share a few “hot takes” from the conference that... » read more

← Older posts Newer posts →