https://www.techtarget.com/whatis/feature/DeepSeek-explained-Everything-you-need-to-know
In the world of AI, there has been a prevailing notion that developing leading-edge large language models requires significant technical and financial resources. That's one of the main reasons why the U.S. government pledged to support the $500 billion Stargate Project announced by President Donald Trump.
But Chinese AI development firm DeepSeek has disrupted that notion. On Jan. 20, 2025, DeepSeek released its R1 LLM at a fraction of the cost that other vendors incurred in their own developments. DeepSeek is also providing its R1 models under an open source license, enabling free use.
Within days of its release, the DeepSeek AI assistant -- a mobile app that provides a chatbot interface for DeepSeek-R1 -- hit the top of Apple's App Store chart, outranking OpenAI's ChatGPT mobile app. The meteoric rise of DeepSeek in terms of usage and popularity triggered a stock market sell-off on Jan. 27, 2025, as investors cast doubt on the value of large AI vendors based in the U.S., including Nvidia. Microsoft, Meta Platforms, Oracle, Broadcom and other tech giants also saw significant drops as investors reassessed AI valuations.
DeepSeek is an AI development firm based in Hangzhou, China. The company was founded by Liang Wenfeng, a graduate of Zhejiang University, in May 2023. Wenfeng also co-founded High-Flyer, a China-based quantitative hedge fund that owns DeepSeek. Currently, DeepSeek operates as an independent AI research lab under the umbrella of High-Flyer. The full amount of funding and the valuation of DeepSeek have not been publicly disclosed.
DeepSeek focuses on developing open source LLMs. The company's first model was released in November 2023. The company has iterated multiple times on its core LLM and has built out several different variations. However, it wasn't until January 2025 after the release of its R1 reasoning model that the company became globally famous.
The company provides multiple services for its models, including a web interface, mobile application and API access.
DeepSeek represents the latest challenge to OpenAI, which established itself as an industry leader with the debut of ChatGPT in 2022. OpenAI has helped push the generative AI industry forward with its GPT family of models, as well as its o1 class of reasoning models, which include o3 and o4 mini.
While the two companies are both developing generative AI LLMs, they have different approaches.
OpenAI | DeepSeek | |
Founding year | 2015 | 2023 |
Headquarters | San Francisco, Calif. | Hangzhou, China |
Development focus | Broad AI capabilities | Efficient, open source models |
Key models | GPT-4o, o1 | DeepSeek-V3, DeepSeek-R1 |
Specialized models | Dall-E (image generation), Whisper (speech recognition) |
DeepSeek Coder (coding), Janus Pro (vision model) |
API pricing (per million tokens) |
o1: $15 (input), $60 (output) | DeepSeek-R1: $0.55 (input), $2.19 (output) |
Open source policy | Limited | Mostly open source |
Training approach | Supervised and instruction-based fine-tuning | Reinforcement learning |
Development cost | Hundreds of millions of dollars for o1 (estimated) |
Less than $6 million for DeepSeek-R1, according to the company |
DeepSeek uses a different approach to train its R1 models than what is used by OpenAI. The training involved less time, fewer AI accelerators and less cost to develop. DeepSeek's aim is to achieve artificial general intelligence, and the company's advancements in reasoning capabilities represent significant progress in AI development.
In a research paper, DeepSeek outlines the multiple innovations it developed as part of the R1 model, including the following:
Since the company was created in 2023, DeepSeek has released a series of generative AI models. With each new generation, the company has worked to advance both the capabilities and performance of its models:
Alibaba and Ai2 released their own updated LLMs within days of the R1 release -- Qwen2.5 Max and Tülu 3 405B.
While there was much hype around the DeepSeek-R1 release, it has raised alarms in the U.S., triggering concerns and a stock market sell-off in tech stocks. On Monday, Jan. 27, 2025, the Nasdaq Composite dropped by 3.4% at market opening, with Nvidia declining by 17% and losing approximately $600 billion in market capitalization.
DeepSeek is raising alarms in the U.S. for several reasons, including the following:
Countries and organizations around the world have already banned DeepSeek, citing ethics, privacy and security issues within the company. Because all user data is stored in China, the biggest concern is the potential for a data leak to the Chinese government. The LLM was also trained with a Chinese worldview -- a potential problem due to the country's authoritarian government.
Places where DeepSeek is banned include the following:
DeepSeek's popularity has not gone unnoticed by cyberattackers.
On Jan. 27, 2025, DeepSeek reported large-scale malicious attacks on its services, forcing the company to temporarily limit new user registrations. The timing of the attack coincided with DeepSeek's AI assistant app overtaking ChatGPT as the top downloaded app on the Apple App Store.
Despite the attack, DeepSeek maintained service for existing users. The issue extended into Jan. 28, when the company reported it had identified the issue and deployed a fix.
DeepSeek has not specified the exact nature of the attack, though widespread speculation from public reports indicated it was some form of DDoS attack targeting its API and web chat platform.
Wiz Research -- a team within cloud security vendor Wiz Inc. -- published findings on Jan. 29, 2025, about a publicly accessible back-end database spilling sensitive information onto the web -- a "rookie" cybersecurity mistake. Information included DeepSeek chat history, back-end data, log streams, API keys and operational details. DeepSeek took the database offline shortly after being informed. It's unclear for how long the database was exposed.
Now we know exactly how DeepSeek was designed to work, and we may even have a clue toward its highly publicized scandal with OpenAI.
Sean Michael Kerner is an IT consultant, technology enthusiast and tinkerer. He has pulled Token Ring, configured NetWare and been known to compile his own Linux kernel. He consults with industry and media organizations on technology issues.
DeepSeek under the microscope: a sanity check
DeepSeek-R1 more readily generates dangerous content than other large language models
AI jailbreaking techniques prove highly effective against DeepSeek
Wiz reveals DeepSeek database exposed API keys, chat history
04 Jun 2025