Wafer-Scale Computing for LLMs (U. of Edinburgh, Microsoft)


A new technical paper titled "WaferLLM: A Wafer-Scale LLM Inference System" was published by researchers at University of Edinburgh and Microsoft Research. Abstract "Emerging AI accelerators increasingly adopt wafer-scale manufacturing technologies, integrating hundreds of thousands of AI cores in a mesh-based architecture with large distributed on-chip memory (tens of GB in total) and ultr... » read more

System Bits: Oct. 24


Optical communication on silicon chips With the huge increase in computing performance in recent decades achieved by squeezing ever more transistors into a tighter space on microchips, at the same time this downsizing has also meant packing the wiring within microprocessors ever more tightly together. This has led to effects such as signal leakage between components, which can slow down commun... » read more