AI Adoption Slow For Design Tools

While ML adoption is robust, full AI is slow to catch fire. But that could change in the future.


A lot of excitement, and a fair amount of hype, surrounds what artificial intelligence (AI) can do for the EDA industry. But many challenges must be overcome before AI can start designing, verifying, and implementing chips for us. Should AI replace the algorithms in use today, or does it have a different role to play?

At the end of the day, AI is a technique that has strengths and weaknesses, and it has to be compared against those already in use. Existing algorithms and tools have decades of data and experience that have been encapsulated in them, and little of that knowledge is available in a form that could train new AI models. That puts these data-hungry techniques at a serious disadvantage — one that will take time to overcome. But AI does not need that type of data for all tasks. Some tasks produce copious amounts of relevant data, and those are the ones in which early results are showing a lot of promise.

The question always has been how much of the available data is useful. “There are millions of ways to apply machine learning,” says Dan Yu, product manager for ML Solutions in IC Design Verification at Siemens EDA. “Machine learning is a generic technology, and it really depends on how people perceive the problem, how people want to use data to solve a problem. Once we have more data, and once we pay attention to the quality of data, then later we will get smarter and smarter AI models. It takes time.”

The role of EDA within the chip industry is fairly clear, and EDA has embraced machine learning, which is a subset of AI. But how the broader AI fits into EDA is far less obvious. “We turn to EDA when it can help you get your job done, or get the task done faster or better,” says Dean Drako, president and CEO of IC Manage. “The two things that we want from the EDA industry are to design faster and make it better. We’re all human, and we make mistakes. People have issues, they get sick, and they only want to work 8 or 10 hours a day. If you can have AI do something either better, or make the person more productive, then that’s a big win. It’s about productivity. If AI makes me productive, that’s where I win.”

The EDA industry has to find the right tasks, ones that provide those benefits and that are possible with the data available today. “AI has a good chance of success when trying to replicate a task that a human is good at,” says Thomas Andersen, vice president for AI and machine learning at Synopsys. “Place-and-route is not a task a human would be good at. Any simple algorithm would beat a human because the human is not very good at placing things at this quantity, in the same way a simple calculator would beat most humans. A human would beat it in much more creative tasks and much more complex tasks — cognitive types of things.”

AI is just another algorithm in the arsenal, and machine learning is a subset of that. “Linear regression is now considered machine learning, but people have been using regression for a long time,” says Michal Siwinski, CMO for Arteris IP. “It’s a statistical concept that goes back a long time. Fundamentally, these are different types of algorithms, and it is a matter of finding which algorithm is more efficient for a given task. Before we had convolutional neural networks, it was really hard to effectively address aspects of vision and aspects of large language models.”
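Siwinski’s point is easy to demonstrate. Below is a minimal sketch, with entirely invented numbers, of ordinary least-squares line fitting, a decades-old statistical method, doing the train-then-predict loop that now gets filed under machine learning:

```python
# A sketch of regression-as-ML (all numbers invented).
import numpy as np

rng = np.random.default_rng(0)

# "Historical data": gate count vs. resulting die area, in made-up units.
gate_count = rng.uniform(1e5, 1e6, size=200)
die_area = 3e-6 * gate_count + rng.normal(0.0, 0.1, size=200)

# "Training" is just fitting a line by minimizing squared error.
A = np.column_stack([gate_count, np.ones_like(gate_count)])
slope, intercept = np.linalg.lstsq(A, die_area, rcond=None)[0]

# "Inference" is evaluating the fitted line on a new design.
new_design = 4.2e5
print(f"predicted area: {slope * new_design + intercept:.2f}")
```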

To make any of this work requires the right data. “AI and machine learning are basically learning from historical data, from the data we have accumulated,” says Siemens’ Yu. “It condenses knowledge from the data. What is important is the data you feed into your AI model to train it, to extract the knowledge effectively. You use that knowledge to predict what would happen if I’m given a new case. AI is basically saving us the effort of replicating work.”

But a lack of data can create issues. “Care has to be taken with AI because it has a problem with outliers and glitches,” says Marc Swinnen, director of product marketing at Ansys. “You cannot rely on it to always give you a good answer. That may be acceptable for design tasks, because you need fast turnaround and iterative calculations during placement or routing. You may leave outliers until later verification stages, where it is easier to fix them using an ECO than to try and consider all corner cases every time a decision is made — especially when they are rarely applicable. At sign-off, the whole point is to catch the outliers.”
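Swinnen’s caution can be shown with a toy example. In the sketch below (all numbers invented), a model fit to a population of mostly typical paths smooths right over a rare corner case, which is exactly the behavior sign-off exists to catch:

```python
# Illustrative only: a learned fit is dominated by typical behavior.
import numpy as np

rng = np.random.default_rng(1)

# 990 "typical" paths: delay grows roughly linearly with length (made-up units).
length = rng.uniform(1, 10, 990)
delay = 2.0 * length + rng.normal(0, 0.2, 990)

# 10 rare "glitch" paths: a corner case doubles the delay.
glitch_len = rng.uniform(1, 10, 10)
glitch_delay = 4.0 * glitch_len

x = np.concatenate([length, glitch_len])
y = np.concatenate([delay, glitch_delay])

# A least-squares fit through the origin is dominated by the 99% typical population...
slope = (x @ y) / (x @ x)

# ...so it badly underestimates the outliers.
print(f"fit slope: {slope:.2f} (true glitch slope: 4.0)")
print(f"worst glitch underestimate: {np.max(4.0 * glitch_len - slope * glitch_len):.2f}")
```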

Optimization functions rely on a cost function. “Deep learning, or any other machine learning technique, is basically minimizing a given cost function,” says Arteris’ Siwinski. “That’s how the math behind it works. The cost function is constrained by how you define what success is, and what the parameters look like.”
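A minimal sketch of what that means in practice, using a toy two-variable cost function as a stand-in for a real objective: the optimizer does nothing but walk downhill on the cost surface it is given, so a badly chosen definition of success yields a confidently wrong answer.

```python
# Gradient descent on a toy cost function (entirely made up).
import numpy as np

def cost(x: np.ndarray) -> float:
    # Stand-in for a real objective such as a weighted PPA metric.
    return float((x[0] - 3.0) ** 2 + 10.0 * (x[1] + 1.0) ** 2)

def grad(x: np.ndarray) -> np.ndarray:
    # Analytic gradient of the toy cost above.
    return np.array([2.0 * (x[0] - 3.0), 20.0 * (x[1] + 1.0)])

x = np.zeros(2)          # starting point
lr = 0.04                # step size
for _ in range(500):
    x -= lr * grad(x)    # step downhill on the cost surface

print(f"minimum found at {x.round(3)}, cost {cost(x):.6f}")
```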

The data problem
The number one problem is not computing power or the model. It’s the data. “If you only train your model on the current design, then AI’s knowledge is very limited,” says Yu. “Your success rate depends on how much data you have accumulated. If you have a series of designs, incremental or derivative designs, that will help. Maybe you are a design house, and you design for many customers. If you have trained a model for design A and you know design B will only have slight modifications, then you can re-use the model. Now your success rate would be much higher. Some verification engineers are more experienced and can transfer what they learned from previous projects to a new project. The same is true here. We need the data to train the right model.”
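The reuse Yu describes is essentially warm-starting. The sketch below (data and model invented for illustration) fine-tunes a model trained on design A with only a handful of samples from derivative design B, and lands closer to B’s true behavior than training from scratch on the same few samples:

```python
# Warm-starting a model from a related design (all data invented).
import numpy as np

rng = np.random.default_rng(2)

def fit(x, y, w0=0.0, steps=300, lr=0.5):
    """Fit y ~ w*x by gradient descent on squared error, starting from w0."""
    w = w0
    for _ in range(steps):
        w -= lr * np.mean((w * x - y) * x)  # step downhill on the error
    return w

# Design A: plenty of data (feature -> measured power, made-up units).
xa = rng.uniform(0, 1, 5000)
ya = 1.50 * xa + rng.normal(0, 0.05, 5000)
w_a = fit(xa, ya)

# Design B: a derivative with slightly different behavior, few samples,
# and only a short fine-tuning budget.
xb = rng.uniform(0, 1, 30)
yb = 1.65 * xb + rng.normal(0, 0.05, 30)

w_scratch = fit(xb, yb, w0=0.0, steps=5)  # cold start
w_reused = fit(xb, yb, w0=w_a, steps=5)   # warm start from design A

print(f"A: {w_a:.2f};  B cold: {w_scratch:.2f} vs B warm: {w_reused:.2f} (true 1.65)")
```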

With AI, an algorithm typically is trained using a broad set of data to create a model, which then can be highly optimized for performance and power on the inferencing side. “We’re taking completely trained models, some of which are proprietary to the customer, that are designed for the specific use cases they want,” said Paul Karazuba, vice president of marketing at Expedera. “So rather than just using a general-purpose device, the customers have specific things they want to do, and they’re looking to us to process them as optimally as it can be done. Our architecture was designed with the intention of being scalable, but also to be optimized.”

The bigger challenge for EDA is how to get data that spans the entire industry in order to automate some of those steps. “The semiconductor design industry is going to be one of the industries that finds that the hardest,” says IC Manage’s Drako. “Designs are severely protected and coveted. Even the design rules at TSMC are closely guarded secrets, and they try to encrypt them. No one under the sun wants any of their design data to go anywhere outside of their company. We’re going to have a hard challenge as an industry. I do believe that eventually we will overcome it. We did it for synthesis, and for place-and-route. The tools have seen many designs because every time there is a bug, we give the data to the EDA company so they can fix the bug.”

Some are more positive, particularly when it comes to machine learning. “The semiconductor industry is very rich in data,” says Siwinski. “You have so many designs, so many instances of SoCs being designed every year and new generations, derivatives, similar things with different architectures, millions and millions of test vectors running through hundreds of corner cases to be checked. That means it’s a great place for machine learning, because machine learning is not really about the algorithms. That’s the easy part. It is about being able to frame the problem statement you’re trying to solve with the right data to support it. If you can frame that properly, you can absolutely be using machine learning.”

That also can be done with less data, or by sharing data, although data sharing is more of a problem for AI than for its machine learning subset. “I cannot imagine that, because then there would be no competitive advantage for anybody anymore,” says Arvind Narayanan, senior director for product line management at Synopsys. “Across all industries, there are sometimes a number of players that create a consortium to share technology. Will the whole industry come together to essentially combine all the information they have? I just don’t see it. I don’t see that coming because everybody is extremely protective of their IP, and I understand why they are.”

Data sharing makes everyone jumpy. “There is a lot of cooperation that goes on,” says Drako. “It’s not talked about a lot, because it makes all of the chip companies very nervous. In the case of training of models, it gets harder because the vendor is asking for the data to be kept for a longer time period. There’s going to be a lot of problems with it.”

There also are data validity and consistency issues to consider. “If you apply the same process used to create ChatGPT to the chip design world, I cannot just use any RTL that somebody has written before,” says Synopsys’ Andersen. “There needs to be a quality component to it. I need to know not only that this RTL is good, but also which RTL is good for what purpose. There might be different requirements in terms of QoR (quality of results) or functionality.”

Risk aversion
The semiconductor industry always has been risk-averse. “AI is going to make lots of mistakes because the training data or the model and the solution is new and unproven,” says Drako. “However, AI will give the same answer, right or wrong, consistently. Humans don’t do that. Once I get my model and I prove that it’s accurate enough for my task, I can be assured that it’ll be accurate enough from then on. The problem with the human is that, if I train somebody and they’re accurate enough for the first month of the first year, I’m still going to get mistakes in the second and third and fourth year. Maybe talking about mistakes isn’t the right thing. Maybe consistency is the right way to think about it.”

One way to avoid the problem initially is to concentrate on optimization. “By design, an optimization system cannot do worse than your reference design,” says Andersen. “You may send it in the wrong direction with the wrong inputs, and then it searches the wrong space and will never find you a better result. All results that are worse will automatically be discarded.”
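That property falls out of the structure of the search loop itself. A minimal sketch, with a toy cost function and proposal step standing in for a real optimizer:

```python
# Keep-best search: seeded with a reference, it can never return worse.
import random

def optimize(reference, cost, propose, iterations=1000):
    """Return the best candidate found; never worse than `reference`."""
    best, best_cost = reference, cost(reference)
    for _ in range(iterations):
        candidate = propose(best)
        c = cost(candidate)
        if c < best_cost:               # lower cost is better
            best, best_cost = candidate, c
        # candidates with worse cost are automatically discarded
    return best, best_cost

# Toy example: minimize a 1-D cost starting from a reference value.
cost = lambda x: (x - 7.0) ** 2
propose = lambda x: x + random.gauss(0.0, 0.5)

best, c = optimize(reference=0.0, cost=cost, propose=propose)
print(f"best: {best:.3f}, cost: {c:.5f}")  # never worse than cost(0.0) = 49
```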

Mistakes do happen. “The results you get are related to how much effort you put into building it,” says Siwinski. “It’s very effort-intensive to do it properly. If you’re asking the wrong questions, or if you give the wrong data, then the results are not going to be very good. You need to understand it just to ask the right questions. How do you look at the data sets? How do you partition how you do it? It’s an art and a science to do this properly.”

You have to understand when errors cannot be tolerated. “There is a limit to how much we can rely on AI,” says Ansys’ Swinnen. “It does play a significant role in things like optimization, but also in things like thermal analysis. For example, when working with variable-size meshing, we need an algorithm that quickly determines where the likely hot spots are, and then we can build meshes that are much tighter where we know we need them and looser where not required. That allows us to speed up the whole process significantly by using AI to identify which areas need to be concentrated upon, but at the end the calculations have to be exact.”
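A rough sketch of that flow, with an invented region list and a stub in place of a trained hot-spot predictor: the prediction only decides where the mesh is tightened, while the exact solver still does the real work on whatever mesh it receives.

```python
# ML-guided mesh refinement (regions and scores invented for illustration).
def predicted_hotspot_probability(region: str) -> float:
    # Stand-in for a fast trained model scoring each layout region.
    fake_scores = {"cpu": 0.9, "sram": 0.2, "io": 0.1, "noc": 0.6}
    return fake_scores[region]

FINE, COARSE = 64, 8  # mesh cells per region edge

mesh = {}
for region in ["cpu", "sram", "io", "noc"]:
    p = predicted_hotspot_probability(region)
    mesh[region] = FINE if p > 0.5 else COARSE  # tighten near likely hot spots

total_cells = sum(n * n for n in mesh.values())
print(mesh)
print(f"cells: {total_cells} vs uniform fine mesh: {4 * FINE * FINE}")
```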

Generative EDA
While the subject of AI is popular, it is generative AI that is the hot technology today. People are asking when AI will be able to generate Verilog or replace constrained random test pattern generation. “People are using AI to write software programs,” says Drako. “It is improving their productivity because it’s taking some of the drudgery out of it. I need to write a program to do X, and it’s not quite what I wanted, but if I change this, fix this, move this over here, then boom, it’s pretty good. So it increases productivity, and we’ll see it used very effectively in that method or in that manner in what I’ll call a design or creative industry.”

But the semiconductor industry is not driven by productivity to the same degree as software. “As exciting, as sexy, and as eye-catching as some of this stuff is right now — and the hype over the last two months has been pretty high — the reality is that a lot of these models have a long way to go,” says Siwinski. “Can we get some image manipulation and creation? Absolutely. Music and other things related to language models, yeah, those are getting pretty sophisticated. Is that the same as being able to create advanced code that is going to be secure, that is going to be safe, and that is not going to have some of the same challenges as IP reuse? There are places where people can get libraries of things, which is great, but they’re not necessarily what I would deploy in high-performance programs, where you need to have high reliability, high security.”

Again, it comes back to training. “Where does that power come from?” asks Yu. “The power comes from a lot of data being fed into training. OpenAI didn’t disclose the amount of data used to train the most recent GPT-4, but I know that for GPT-3 they used several billion tokens, and that means they have collected all the data from Wikipedia, from openly accessible web pages, and from many books and publications. That is where the intelligence comes from. It trained on GitHub, as well, so it has a lot of GitHub fed into the language model. When you look at the EDA problem, do we have access to that much data to properly train a powerful model?”

Yu recently published a paper that provides the figures shown below. As a comparison, as of August 2022 the ImageNet training set contained 14,197,122 images in 21,841 categories.

Fig. 1: Data from “A Survey of Machine Learning Applications in Functional Verification” by Dan Yu, Harry Foster, and Tom Fitzpatrick of Siemens EDA. Source: DVCon 2023

There are some early attempts. “You can tell AI to create RTL, but is it going to be the most optimized RTL that will satisfy the PPA requirements?” asks Narayanan. “We are not there yet. It will spit out the logic function you’re looking for, but the second step is how you optimize it. How do you take it to the next level? That’s a work in progress.”

As an industry, we do have some experience in this already. “The danger with things like language models is you may spend more time debugging poorly written RTL than you would have taken to write it,” notes Andersen.

This is similar to the early days of IP reuse, when huge amounts of poor RTL flooded the market. “Even if AI gives us the RTL, we still have to do the quality check,” says Yu. “Perhaps that could also be automated in the future. We also have to integrate that design with other pieces and make sure the new design works as a whole. There are many steps before some model could produce a complete design.”
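One plausible shape for such a quality check is a gate that generated RTL must pass before it enters the design, as sketched below. Every checker here is a hypothetical placeholder, not a real tool API; each would wrap a real lint, simulation, or synthesis run.

```python
# Hypothetical quality gate for machine-generated RTL (placeholder checks).
def lint_clean(rtl: str) -> bool:
    # Placeholder: would invoke a lint tool and parse its report.
    return rtl.strip().startswith("module") and rtl.strip().endswith("endmodule")

def passes_regression(rtl: str) -> bool:
    # Placeholder: would run a simulation regression against the spec.
    return True

def meets_ppa_targets(rtl: str) -> bool:
    # Placeholder: would synthesize and compare power/performance/area.
    return True

def accept_generated_rtl(rtl: str) -> bool:
    """Accept generated RTL only if every quality gate passes."""
    return lint_clean(rtl) and passes_regression(rtl) and meets_ppa_targets(rtl)

generated = ("module adder(input [7:0] a, b, output [8:0] s); "
             "assign s = a + b; endmodule")
print("accepted" if accept_generated_rtl(generated) else "rejected")
```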

Conclusion
Arthur C. Clarke once said, “Any sufficiently advanced technology is indistinguishable from magic.”

“AI may be wondrous, but it is not magic,” says Siwinski. “It’s just science and math. Machine learning is just another tool that is very data-dependent, and you have to ask the right questions. But it’s something that everybody should be embracing because it is going to be 100% pervasive.”

While EDA is adopting machine learning and other pieces of true AI, it is not ready to throw away many of the existing algorithms. “Machine learning is not a drop-in replacement for our existing algorithms or tools,” says Yu. “It is helping us to accelerate things that were not very efficient. It is helping to automate some processes where people were in the loop. Those are tasks where machine learning can help. Sometimes machine learning also can improve our previous primitive algorithms, and make them more accurate.”

Generative EDA, meanwhile, may have to wait a little longer. “It’s unclear how this is going to play in our industry, which is very risk-averse,” says Drako. “AI will be used in design stuff where it is checked by humans and gives humans a template to start with, and then they can move forward more effectively, more quickly. Our industry wants surety. Eventually, we’ll get models that are trained well enough where we’ll get that surety.”


