A look at DeepSeek's model optimization to reduce HBM use, potentially enabling domestic memory, ASIC, and CPU makers to create a Chinese AI hardware ecosystem
Have you ever wondered, how DeepSeek may make money, and lot of it? They didn't come up with competitive coding plans like GLM, MoonShot and MiniMax.
@bookwormengr
Related Coverage
- DeepSeek vs OpenAI: The $10B AGI Race Begins Memeburn
- Models & Pricing — The prices listed below are in units of per 1M tokens. DeepSeek
- DeepSeek V4 Pro tops global bang-for-buck ranking after 75% price cut South China Morning Post · Vincent Chow
- DeepSeek Just Froze AI Prices at a Level Western Labs Cannot Match Implicator.ai · Marcus Schuler
- DeepSeek made its 75% discount permanent. The AI price war just escalated. The Next Web · Ana Maria Constantin
- DeepSeek permanently reduces the price of its flagship V4 model by 75 percent Engadget · Jackson Chen
- DeepSeek makes 75% V4-Pro price cut permanent Tech in Asia · Minh Le
- China's DeepSeek to make permanent 75% price cut on flagship V4-Pro AI model Reuters
- Deepseek makes its 75 percent discount permanent, pricing output tokens at least 34x below GPT-5.5 The Decoder · Matthias Bastian
- China's DeepSeek permanently cuts prices of flagship V4-Pro AI model by 75% The Tech Portal · Ashutosh Singh
- DeepSeek to Make Permanent 75% Discount on Flagship AI Model Hacker News
- DeepSeek makes the V4 Pro price discount permanent Hacker News
- DeepSeek says it will lower V4 Pro API prices by 75% to $0.435/1M input and $0.87/1M output tokens, making permanent the discount prices set to expire on May 31 Bloomberg
Discussion
-
@kyleichan
Kyle Chan
on x
tl;dr: DeepSeek open-source innovations drive down compute + memory demands, shift away from GPU & HBM to NAND & LPDDR where China's YMTC & CXMT are strong, helping China's entire AI ecosystem move away from US-controlled chokepoints.
-
@jetpen
Ben Eng
on x
@bookwormengr Looks like the US will regret its policy of restricting high performance computing, GPU, and semiconductor fab technology exports to China. That set China on the course of building their own semiconductor supply chain and competency.
-
@rohanpaul_ai
Rohan Paul
on x
Great article here on DeepSeek. Their real story is not cheaper chatbots, but architecture that turns hardware scarcity into strategy. DeepSeek is not trying to sell coding seats, it is trying to make Chinese memory, accelerators, and systems useful for frontier AI. Every [image]
-
@theopeningmove1
@theopeningmove1
on x
@mark_k @deepseek_ai pricing is the part that changes the read here. v4 pro is now $0.435 in / $0.87 out per 1m tokens. openrouter has gpt-5.5 pro at $30 / $180. not saying same quality. saying every wrapper now has to explain when it pays up.
-
@teortaxestex
@teortaxestex
on x
Pretty good analysis of the hardware infrastructure dimension of DeepSeek strategy
-
@xiz25
Dr. Xi Zeng
on x
@rohanpaul_ai this is the part people under-discuss: scarcity changes product taste. When compute is abundant, teams can hide bad UX behind bigger models. When memory and hardware are constrained, you are forced to decide what the model should actually look at, remember, and skip…
-
@datachaz
Charly Wargnier
on x
A fascinating deep dive into DeepSeek reveals a brilliantly unconventional strategy. They are completely ignoring standard industry trends. Instead of fighting for short-term multimodal profits, they are looking ahead. They use radical frameworks like MoE to absolutely crush
-
@rohanpaul_ai
Rohan Paul
on x
🇨🇳 🇺🇸 China's Huawei's new 122TB SSD shows how export controls can move innovation sideways instead of simply stopping it. Huawei just built a 122.88TB AI SSD by changing the package around the memory, not by matching Samsung's most advanced 400+ layer 3D NAND. And a 245TB [image…
-
@mark_k
Mark Kretschmann
on x
Fascinating and very deep article about DeepSeek AI (@deepseek_ai). You would have never guessed what their strategy is, it's really interesting: They're not chasing quick money from coding plans or multimodal models. Instead, their radical architecture innovations (MoE, MLA,
-
@dan_jeffries1
Daniel Jeffries
on x
Fascinating read. We desperately need an American open source champion and fast.
-
@pstasiatech
Paul Triolo
on x
Excellent analysis.....Who in China makes LPDDR? CXMT. They are only 0.5 Gen behind on speed for LPDDR and 1 generation behind on density. Not very far! In addition abundant NAND, Chinese ecosystem will have abundant LPDDR in near future. Can this relieve pressure on compute?
-
@rohanpaul_ai
Rohan Paul
on x
Reuters: DeepSeek just made its V4-Pro price cut permanent, pushing the price down to 25% of its original API cost. DeepSeek has not confirmed that better Ascend 950 supply caused the permanent cut, but the timing points to a cost curve moving downward as China's AI stack shifts …
-
@agraylin
Alvin Wang Graylin
on x
Interesting read on how @deepseek_ai innovations in #AI models were driven by China's HW constraints and why its solutions will ultimately negate those constraints. 💡🐳
-
@kimmonismus
@kimmonismus
on x
Let that sink in for a moment. DeepSeek v4 pro 75% discount. Permanent! In: $0.43 Out: $0.87 If you read the DeepSeek v4 tech paper you know that this model is insanely good when it comes to efficiency. Only 27% compute and only 10% cache compares to v3.2. SemiAnalysis wrote…
-
@jukan05
Jukan
on x
Does this mean DeepSeek has already brought a Huawei Ascend 950 cluster online? [image]
-
@deepseek_ai
@deepseek_ai
on x
We are making our discount permanent! 🎉 Enjoy building with DeepSeek-V4-Pro and bring your innovative ideas to life! 🚀 [image]
-
@yuchenj_uw
Yuchen Jin
on x
Wow. A massive 75% discount from DeepSeek. Either they've done some serious inference optimizations, or Huawei chips are just that much cheaper? More open-source AI models, better token economy.
-
@goodalexander
@goodalexander
on x
btw guys this is 1/35th the price of GPT 5.5
-
@zerohedge
@zerohedge
on x
Has the “$5 trillion capex in 5 years” considered China
-
@healthranger
@healthranger
on x
Holy cow, DeepSeek just made top-tier AI services ridiculously cheap... PERMANENTLY. This is a fraction of the price of U.S. frontier models, with comparable performance in most tasks.
-
@vaibhavsisinty
Vaibhav Sisinty
on x
$0.435 per million input tokens. $0.87 per million output. $0.003625 for cache hits, basically free. For context, gpt-5.5 and claude opus 4.7 are roughly 35x to 100x more expensive per token for the same class of model.
-
@rish404
Rish Agarwal
on x
@deepseek_ai This is my deepseek usage https://x.com/...
-
@jahooma
James Grugett
on x
DeepSeep V4 Pro will remain 100% free in our coding agent 🎉 Try it now: npm i -g freebuff
-
r/hermesagent
r
on reddit
DeepSeek v4 pricing change
-
r/DeepSeek
r
on reddit
deepseek v4 pro price will be reduced to the current price permanently accoding to official website
-
r/singularity
r
on reddit
DeepSeek Announces Permanent Price Cut of 75% after Promotion Period