Skip to main content
Open this photo in gallery:

AI models launched by Chinese startup DeepSeek have raised doubts about the billions of dollars spent by U.S. firms on similar technology, though some have raised doubts about DeepSeek's claims regarding its costs.Dado Ruvic/Reuters

Shares in major technology companies fell sharply Monday after a little-known startup in China released powerful but inexpensive new artificial-intelligence models, sparking worries about a challenge to U.S. dominance in a crucial emerging sector.

Chinese startup DeepSeek released an AI model called R1 last week, one month after unveiling another one, named V3. The newest release is known as a reasoning model that can handle more complex questions and problems than standard chatbots.

Those developments are raising questions about the hundreds of billions of dollars that companies such as OpenAI, Meta and Google are spending to develop AI, even as some experts cast doubt on the Chinese company’s claims.

Nvidia Corp. NVDA-Q, the leading maker of chips that are used to power AI, fell 17 per cent on Monday, shaving roughly US$550-billion from its market cap, more than the combined total market cap of Canada’s four most valuable public companies. Oracle Corp. ORCLN-Q dropped 13.8 per cent, while Alphabet Inc. GOOGL-Q and Microsoft Corp. MSFT-Q dropped 4.2 per cent and 2.1 per cent respectively.

Canadian company Celestica Inc. CLS-T, which builds equipment for data centres and has seen its shares soar on demand for AI hardware, plummeted by 28 per cent. Some energy companies, which power data centres, did not escape the rout either, including Canada’s Capital Power Corp. CPX-T, which shed 17 per cent of its market value.

What is DeepSeek and why is it disrupting the AI sector?

DeepSeek has flown under the radar, with discussion about it limited mostly to AI developers. But talk hit a frenzy over the weekend, especially as Silicon Valley leaders picked up on a technical report from the company that said it spent just US$5.6-million to train one of its models.

Dario Amodei, chief executive of San Francisco-based AI company Anthropic, has said a model can cost US$100-million to train, while research firm Epoch AI has estimated that building the largest AI models will cost more than US$1-billion by 2027.

Not only did DeepSeek appear to dramatically undercut development costs, it claimed to have done so using a relatively small number of Nvidia chips that are not even state-of-the-art, as the U.S. has restricted the export of advanced chips to China in recent years to thwart the country’s progress in AI.

Chinese startup DeepSeek's launch of its latest AI models, which it says are on a par or better than industry-leading models in the United States at a fraction of the cost, is threatening to upset the technology world order. Alex Cohen produced this report.

Reuters

The benchmarks used to assess the prowess of DeepSeek’s models show that R1 is roughly on par with some of the latest releases from U.S. companies, such as OpenAI and Meta Platforms Inc. META-Q. The models are also open-source, meaning the code is available for free. To top it off, DeepSeek unveiled an image generation model on Monday that it says outperforms its competitors.

DeepSeek did not respond to interview requests.

“Constraints create innovation,” said Boris Wertz, founder of Version One Ventures in Vancouver. “In this case, it was more a whole industry that got a little drunk,” he said, referring to the money AI companies have spent to acquire graphics processing units and build data centres.

U.S. President Donald Trump said on Monday (January 27) that Chinese startup DeepSeek's technology should act as spur for American companies and said it was good that companies in China have come up with a cheaper, faster method of artificial intelligence.

Reuters

Just last week, OpenAI, Oracle and others announced a plan to spend up to US$500-billion to construct data centres to power AI in the U.S., while Meta is spending up to US$65-billion on AI infrastructure this year. Elon Musk’s xAI bought 100,000 GPUs to build what Nvidia has described as the world’s largest AI supercomputer. (GPUs, or graphics processing units, are powerful computer chips used to train and run AI models.)

The performance and costs of DeepSeek’s models, however, are throwing new light on those dollar figures. “What happens with these hundreds of billions of dollars of capital that are being raised and funnelled into the space?” said John Ruffolo, founder of Maverix Private Equity. “If you’re an investor devoted to this space, you must be panicking today.”

The release from DeepSeek comes at a time when adoption of AI “is still incredibly low, as people and businesses grapple with understanding how to best leverage the technology,” said Nicole Janssen, co-founder at AltaML in Edmonton.

The launch of ChatGPT by OpenAI in 2022 kicked off a frenzy of development and investment, as businesses try to dominate the market for generative AI – applications that produce and interpret text and other media – even as concerns remain about accuracy and reliability.

Lower cost could help drive AI adoption, but some experts are skeptical about DeepSeek’s claims.

Alexandr Wang, the founder of U.S. company Scale AI, told CNBC recently that DeepSeek could have as many as 50,000 high-powered GPUs that it cannot publicly disclose owing to U.S. export controls.

“We need to take statements from DeepSeek about their compute resources with a grain of salt. We have no way to verify these numbers,” said Daniel Roy, a professor at the University of Toronto and research director at the Vector Institute. DeepSeek has also not been transparent about the data that it used to train its models, he added.

Investors sold technology stocks across the globe on Monday as they worried that the emergence of a low-cost Chinese artificial intelligence model would threaten the dominance of current AI leaders like Nvidia.

Reuters

Thanos Moschopoulos, a technology analyst at BMO Capital Markets, advised investors in a research note to be “cautious” about claims about training costs, writing that more testing is needed to fully understand the uses and limitations of DeepSeek’s models.

“Ultimately, while this creates incremental uncertainty regarding the longer-term size and growth of the market for AI infrastructure, we expect this to remain a large total addressable market,” he wrote in a note to clients.

The implications of both cheaper and more efficient AI development could affect some of Canada’s largest investors, including pension funds, that have placed major bets on data centre companies and the energy producers that power these facilities. Such bets are typically long-term, and major pension funds don’t often react to market volatility.

The Canada Pension Plan Investment Board is expected to soon own a major stake in Baltimore-based Constellation Energy Corp., which was one of the stocks that was hit hardest in Monday’s sell-off, falling 20.8 per cent.

CPPIB also owns a nearly $900-million stake in U.S.-based data centre operator Equinix Inc., which had dropped 4.3 per cent.

The Public Sector Pension Investment Board, which manages pensions for the federal public service, has backed Toronto-based Cohere, a builder of AI models that was valued at US$5.5-billion last year. Cohere and DeepSeek have different approaches, however. Cohere focuses on building proprietary AI models for use by enterprise customers, where security and privacy are paramount. Implementing open-source models, such as those made by DeepSeek, can require more technical expertise.

“The next revolution in LLMs won’t come from throwing money at the problem,” said Cohere co-founder Nick Frosst in a statement. “This is a moment of clarity, where the obvious becomes undeniable in how we will pursue better LLMs.”

Gennady Pekhimenko, co-founder of Toronto-based CentML and an expert in machine learning efficiency, said the few million dollars in training costs cited by DeepSeek are likely not the entire picture, and that the company relied on other AI models as part of its process. “Their training utilizes the existing models to get very high-quality, representative data, so they needed to train way less,” he said. “It’s not like you just start the process and get everything from scratch and you’re done.”

Mr. Pekhimenko said there remains a need for GPUs and data centres to build large, foundational models. Plus, running AI models after training – such as asking a question of ChatGPT – requires a lot of infrastructure. “It doesn’t mean you shouldn’t build data centres,” he said.

DeepSeek is sparking geopolitical concerns, too. The United States and China are locked in a race for supremacy in AI, which could bring enormous economic benefits and military advantages. Silicon Valley investor Marc Andreesen wrote on X that DeepSeek’s latest model is “AI’s Sputnik moment.”

“AI is the front for the new Cold War. Superiority of technology in the digital world is correlated to superiority militarily,” Mr. Ruffolo said.

DeepSeek has limitations, notably that its responses are in line with the views of the Chinese Communist Party. “Sorry, that’s beyond my current scope,” the DeepSeek app responded when The Globe and Mail asked about the 1989 Tiananmen Square protests. “Let’s talk about something else.” The app also said that Taiwan has “always been an inalienable part of China’s territory since ancient times.”

Because DeepSeek has published its methods, other companies could try to replicate its approach to building cheaper, more powerful models. Vijay Gadepally, the chief technology officer at Canadian AI cloud computing company Radium, noted the potential benefits. “It lowers the barrier to entry,” he said, “forces us to rethink the investment ratio between infrastructure and algorithms for AI progress, and is a step forward in reducing the energy and environmental impact of AI.”

Report an editorial error

Report a technical issue

Editorial code of conduct

Tickers mentioned in this story

Study and track financial data on any traded entity: click to open the full quote page. Data updated as of 06/03/26 4:00pm EST.

SymbolName% changeLast
NVDA-Q
Nvidia Corp
-3.01%177.82
ARM-Q
Arm Holdings Plc ADR
-5.17%114.38
GOOGL-Q
Alphabet Cl A
-0.78%298.52
MSFT-Q
Microsoft Corp
-0.42%408.96
CLS-T
Celestica Inc Sv
-6.56%339.51
META-Q
Meta Platforms Inc
-2.38%644.86
CPX-T
Capital Power Corp
-3.42%60.77

Follow related authors and topics

Authors and topics you follow will be added to your personal news feed in Following.

Interact with The Globe