Intel CEO Lip-Bu Tan stamps out chip bugs with aggressive new quality standards, says major validation errors can result in termination — 'B0, you keep your job. Anything above that, you are fired'
Make B0 revision work perfectly, or lose your job.
When Lip-Bu Tan became the CEO of Intel last year, it was clear that a lot was going to change at the company. Now, details of these changes are beginning to emerge. We already know that Lip-Bu Tan personally assesses and approves chip designs before tape outs, but as it turns out, he also wants designs to be bug-free and ready for mass production already with the A0 revision, something that the company's products have failed to do.
"One thing about timetable, I have a culture right now I have just implemented. It has to be A0 to production," said Lip-Bu Tan at JP Morgan's Global Technology, Media and Communications Conference. "A0 is when you tape out, first time pass. Intel does not have that culture, so I tell that, first time pass A0. B0, you keep your job. Anything above that, you are fired."
"So that culture people initially thought that I'm just joking, and now I started to implement, they started to say that, 'Okay, Lip-Bu, you are very serious, you really look into all the design, all the bugs that we've tried to fix, and then all the IP that we use. You make sure that we certify and make sure we do that before we go to tape-out,' and so those are kind of the culture we need to have," Tan said.
A0 is the very first manufactured version of a chip produced after the initial tape out and before any silicon fixes are implemented. First-pass success means that the chip boots, functions correctly, meets major specifications, no major redesign is needed, and the silicon is close to production quality (or of production quality). Achieving A0 success with a complex CPU design on an advanced node is extremely difficult, more so than with other types of processors with simpler designs and redundant features.
While Nvidia and some other companies indeed begin to mass produce A0 chips after the initial tape out and bring up, it often takes Intel more revisions to get rid of bugs and maximize performance and yield. For example, Intel's Xeon 'Sapphire Rapids' processor contained as many as 500 bugs, and it took Intel a dozen revisions to get rid of erratas and reach planned performance and decent yields. At the time, that chip had seen A0, A1, B0, C0, C1, C2, D0, E0, E2, E3, E4 and E5 steppings to fix the egregious number of bugs.
Tan's comments are a bit unusual for a CEO of a company of Intel's size, as he essentially says that Intel's prior engineering culture was lax, so he is improving internal execution discipline. Ultimately, Tan wants fewer respins, faster validation, and shorter development cycles.
Whether or not A0 success can be achieved by all of Intel's products remains to be seen. For example, Nvidia is known for incorporating various yield-boosting techniques into its complex GPUs (e.g., redundant logic and caches) to avoid stepping failures and costly respins. However, Intel's design approaches are different.
One of the ways to reduce risks is to use industry-standard silicon-proven IP blocks and heavily verify designs before taping them out. In addition, Intel engineers might have to make less risky design decisions to achieve first-time success. Such an approach may make Intel's produces less ambitious in general, but at least the company's business performance will be more predictable.
Get Tom's Hardware's best news and in-depth reviews, straight to your inbox.
Follow Tom's Hardware on Google News, or add us as a preferred source, to get our latest news, analysis, & reviews in your feeds.

Anton Shilov is a contributing writer at Tom’s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers and from modern process technologies and latest fab tools to high-tech industry trends.
-
coolitic Seems like he knows Intel's problem quite well.Reply
We'll see if he can actually lead them to executing well. -
Vigilant256 How does this culture compare to AMD , Apple, Qualcomm?Reply
I know this was a problem in the past especially the data center team , where they had more headcount than AMD , but still lost to them.
The market share lost and constant delays in the product truly set intel back a lot, second biggest issue after the intel process 10nm++++ era. -
Gururu This is a problem across all technology areas. I've suffered through volumes of quality standards while dealing with incompetent employees stemming from low paying jobs and poor education. Technicians are the new engineers/chemists and poorly educated engineers/chemists are the new managers. His approach won't work unless he is planning to pay top dollar and offer great benefits for high quality engineers.Reply -
ottonis This will certainly lead to more conservative designs and shape a mindset that primarily attempts to avoid any risks and errors rather than achieving more performance or higher efficiency.Reply
So, you can go that route if you are in a leading position. But if there is stiff competition that outperforms your chips then this CEO will have to rethink his priorities. -
King_V I don't know chip development, and maybe call me a cynic, but this comes off to me as very "I don't care about the realities on the ground, just make it happen. Or else."Reply -
JRStern Reply
It does sound like this.King_V said:I don't know chip development, and maybe call me a cynic, but this comes off to me as very "I don't care about the realities on the ground, just make it happen. Or else."
But the problem is, Intel has drifted waaaaaaaay over to the other side.
And at least they have a CEO who knows how it works down there where the rubber hits the road, that's basically what's been missing for the last fifteen years. -
JRStern I'm not sure how that comes across because I'll echo what others have said, in tech stuff happens. But the culture has changed in the last twenty years or so, look at Elon Musk blowing up billion dollar rockets and shrugging it off as a learning experience. Look at your local Agile software development group moving fast and breaking things.Reply
Most of all look at Intel back in their 10nm daze when the whole company ground to a halt for five years while the lab boys fiddled and fiddled and fiddled, while TSM just shrugged, did a redneck engineering fix, and zoomed right on by. LBT is right to make sure that is NOT happening now.
The problem is it may be, we still don't know if 18a or 14a can be saved.
(and I'm not sure where the boundary is between tape-out errors and process problems ...) -
DS426 Reply
It would have to, yes; you can't have everything -- life, business, engineering, it's always a game of trade-offs. What's the trade-off here? Faster Time-to-Production almost certainly means more conservative designs. I mean, who wants to lose their job? Building in some redundancies sounds unavoidable if anyone is being realistic here.ottonis said:This will certainly lead to more conservative designs and shape a mindset that primarily attempts to avoid any risks and errors rather than achieving more performance or higher efficiency.
So, you can go that route if you are in a leading position. But if there is stiff competition that outperforms your chips then this CEO will have to rethink his priorities.
Very executive-like, yes. Threatening your employees is one way to go about it, lol. Not a great work culture, IMO, and I get that Intel has a lot of pressure (and generally that industry does, both advanced microchip design and advanced node fabs). Maybe fewer higher-paid engineers is his preferred approach?King_V said:I don't know chip development, and maybe call me a cynic, but this comes off to me as very "I don't care about the realities on the ground, just make it happen. Or else." -
JamesJones44 Reply
My understanding of the 14nm+++++ era was due to technology choices for 10nm vs bad chip design (lithography tooling and the unwillingness to adapt forms of it)Vigilant256 said:How does this culture compare to AMD , Apple, Qualcomm?
I know this was a problem in the past especially the data center team , where they had more headcount than AMD , but still lost to them.
The market share lost and constant delays in the product truly set intel back a lot, second biggest issue after the intel process 10nm++++ era.