Chinese-made DeepSeek AI model records extensive online user data, stores it in China-based servers

China USA
(Image credit: Shutterstock)

DeepSeek’s newest R1 large language model has already become notorious after its release cratered AI stocks, and revelations about its privacy policy might raise eyebrows even more — the company records extensive data from its online users, including keystrokes, passwords, and data entered in queries like images and text, and then stores it in China-based servers. 

Personal information, including date of birth, email addresses, phone numbers, and passwords, are all fair game, according to DeepSeek. Any content users give to the R1 LLM, from text and audio prompts to uploaded files, may also be collected by DeepSeek. And whenever someone contacts DeepSeek, it says it might keep users’ proof of identity, which presumably means documents like a driver’s license.

But that’s not all. DeepSeek records anything related to users’ hardware: IP addresses, phone models, language, etc. Its collection efforts are so thorough that the company notes “keystroke patterns or rhythms.” Cookies, a classic method of tracking users on the Internet, also contribute to user data collection.

As for where all this information is stored, the privacy policy says it’s all kept inside servers located in China, a point that has the potential to spark serious controversy. Concerns about the personal details of Americans being in the hands of the Chinese government was a key factor in the Biden administration’s attempt to ban TikTok, raising the possibility that DeepSeek might come under similar scrutiny.

Developed by Chinese AI company DeepSeek, R1 is an open-source LLM that boasts cutting-edge performance at a fraction of the computing power. With 671 billion parameters, it’s one of the most significant AI models and only took 2.8 million GPU hours to train. Meta’s Llama 3 required 30.8 million GPU hours, or 11 times more.

DeepSeek boasted about these accomplishments over a month ago, but R1 launched on January 20, and the implications were fully appreciated by the stock market only yesterday. The market reacted by selling shares in AI companies like Nvidia. While the spotlight on DeepSeek has raised its profile, many have also reviewed how it handles user privacy, a particularly thorny issue for anything involving AI and software developed in China. 

Matthew Connatser

Matthew Connatser is a freelancing writer for Tom's Hardware US. He writes articles about CPUs, GPUs, SSDs, and computers in general.