Groq has become the second start-up to debut a deep learning chip in the cloud, EETimes reports. Its Tensor Streaming Processor (TSP) for AI inference is now available through cloud service provider Nimbix for “selected customers”.
Nimbix’s CEO stated: “Groq’s simplified processing architecture is unique, providing unprecedented, deterministic performance for compute intensive workloads, and is an exciting addition to our cloud-based AI and Deep Learning platform.”
Groq’s TSP is rated at 1,000 TOPS (1 POPS). The company claims 2.5x the performance of the best GPUs at large batch sizes, with the lead growing to 17x at a batch size of 1.
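For rough perspective, here is a back-of-the-envelope conversion of that peak rating into inference throughput. The workload size and utilization factor below are illustrative assumptions, not figures from Groq:

```python
# Illustrative throughput estimate from a peak-TOPS rating.
# Assumptions (not Groq's numbers): ~8 GOPs per ResNet-50-class
# inference, and a utilization factor, since no chip sustains
# its peak rating on real workloads.

PEAK_TOPS = 1_000          # Groq's claimed peak: 1,000 TOPS = 1 POPS
OPS_PER_INFERENCE = 8e9    # assumed ops for one ResNet-50-class image
UTILIZATION = 0.5          # assumed fraction of peak actually achieved

peak_ops_per_sec = PEAK_TOPS * 1e12
images_per_sec = peak_ops_per_sec * UTILIZATION / OPS_PER_INFERENCE
print(f"~{images_per_sec:,.0f} images/s at {UTILIZATION:.0%} utilization")
# -> ~62,500 images/s under these assumptions
```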
Besides GPUs, the chip competes with many other deep learning inference accelerators. Most notably, Intel and Qualcomm are also seeking to bring their NNP-I and Cloud AI 100 chips to the cloud this year.
WikiChip has a deep dive on the chip.
As seems to be the trend, it's heavily dependent on on-chip memory. The pic of the PCIe card doesn't show any off-chip memory, though perhaps there's some HBM2 under the IHS? Otherwise, you might hit a (performance) wall when your model tries to scale beyond what fits on-chip.
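Quick back-of-the-envelope on that: comparing a model's weight footprint against the roughly 220 MB of on-chip SRAM Groq has cited. The parameter counts and INT8 precision below are my assumptions, not Groq's numbers:

```python
# Rough check: do a model's weights fit in on-chip SRAM?
# The ~220 MB SRAM figure is Groq's reported number; parameter
# counts and INT8 precision are illustrative assumptions.

ON_CHIP_SRAM_MB = 220
BYTES_PER_PARAM = 1  # assuming INT8-quantized weights

models = {
    "ResNet-50": 25_600_000,     # ~25.6M parameters
    "BERT-large": 340_000_000,   # ~340M parameters
}

for name, params in models.items():
    footprint_mb = params * BYTES_PER_PARAM / 1e6
    fits = "fits" if footprint_mb <= ON_CHIP_SRAM_MB else "does NOT fit"
    print(f"{name}: ~{footprint_mb:.0f} MB of weights -> {fits} on-chip")
```

So a ResNet-50-class model fits comfortably, but a BERT-large-class model already blows past the on-chip capacity even at INT8.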
I would also worry about the energy usage resulting from all of the on-chip data movement, since the on-chip memory is supposedly organized in a global pool.
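For an order-of-magnitude sense of that energy cost, here's a rough estimate. The energy-per-bit, traversal distance, and traffic figures are generic assumptions in the spirit of published circuit-survey numbers, not Groq measurements:

```python
# Order-of-magnitude estimate of on-chip data-movement energy.
# All figures below are generic assumptions, not Groq's numbers:
# on-chip wires are often estimated at ~0.1 pJ per bit per mm.

PJ_PER_BIT_MM = 0.1          # assumed on-chip wire energy
AVG_DISTANCE_MM = 10         # assumed average traversal on a large die
BYTES_MOVED = 25e6           # assume ~25 MB of weights streamed per inference
INFERENCES_PER_SEC = 20_000  # assumed sustained rate

joules_per_inference = (BYTES_MOVED * 8) * PJ_PER_BIT_MM * AVG_DISTANCE_MM * 1e-12
watts = joules_per_inference * INFERENCES_PER_SEC
print(f"~{joules_per_inference*1e3:.1f} mJ/inference, ~{watts:.0f} W just moving data")
# -> ~0.2 mJ/inference, ~4 W under these assumptions
```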
The top-line numbers are impressive, though.