/
Navigation
Chronicles
Browse all articles
Explore
Semantic exploration
Research
Entity momentum
Nexus
Correlations & relationships
Story Arc
Topic evolution
Drift Map
Semantic trajectory animation
Posts
Analysis & commentary
Pulse API
Tech news intelligence API
Browse
Entities
Companies, people, products, technologies
Domains
Browse by publication source
Handles
Browse by social media handle
Detection
Concept Search
Semantic similarity search
High Impact Stories
Top coverage by position
Sentiment Analysis
Positive/negative coverage
Anomaly Detection
Unusual coverage patterns
Analysis
Rivalry Report
Compare two entities head-to-head
Semantic Pivots
Narrative discontinuities
Crisis Response
Event recovery patterns
Connected
Search: /
Command: ⌘K
Embeddings: large
TEXXR

Chronicles

The story behind the story

days · browse · Enter similar · o open

A look at DeepSeek's model optimization to reduce HBM use, potentially enabling domestic memory, ASIC, and CPU makers to create a Chinese AI hardware ecosystem

Have you ever wondered, how DeepSeek may make money, and lot of it?  They didn't come up with competitive coding plans like GLM, MoonShot and MiniMax.

@bookwormengr

Discussion

  • @kyleichan Kyle Chan on x
    tl;dr: DeepSeek open-source innovations drive down compute + memory demands, shift away from GPU & HBM to NAND & LPDDR where China's YMTC & CXMT are strong, helping China's entire AI ecosystem move away from US-controlled chokepoints.
  • @jetpen Ben Eng on x
    @bookwormengr Looks like the US will regret its policy of restricting high performance computing, GPU, and semiconductor fab technology exports to China. That set China on the course of building their own semiconductor supply chain and competency.
  • @rohanpaul_ai Rohan Paul on x
    Great article here on DeepSeek. Their real story is not cheaper chatbots, but architecture that turns hardware scarcity into strategy. DeepSeek is not trying to sell coding seats, it is trying to make Chinese memory, accelerators, and systems useful for frontier AI. Every [image]
  • @theopeningmove1 @theopeningmove1 on x
    @mark_k @deepseek_ai pricing is the part that changes the read here. v4 pro is now $0.435 in / $0.87 out per 1m tokens. openrouter has gpt-5.5 pro at $30 / $180. not saying same quality. saying every wrapper now has to explain when it pays up.
  • @teortaxestex @teortaxestex on x
    Pretty good analysis of the hardware infrastructure dimension of DeepSeek strategy
  • @xiz25 Dr. Xi Zeng on x
    @rohanpaul_ai this is the part people under-discuss: scarcity changes product taste. When compute is abundant, teams can hide bad UX behind bigger models. When memory and hardware are constrained, you are forced to decide what the model should actually look at, remember, and skip…
  • @datachaz Charly Wargnier on x
    A fascinating deep dive into DeepSeek reveals a brilliantly unconventional strategy. They are completely ignoring standard industry trends. Instead of fighting for short-term multimodal profits, they are looking ahead. They use radical frameworks like MoE to absolutely crush
  • @rohanpaul_ai Rohan Paul on x
    🇨🇳 🇺🇸 China's Huawei's new 122TB SSD shows how export controls can move innovation sideways instead of simply stopping it. Huawei just built a 122.88TB AI SSD by changing the package around the memory, not by matching Samsung's most advanced 400+ layer 3D NAND. And a 245TB [image…
  • @mark_k Mark Kretschmann on x
    Fascinating and very deep article about DeepSeek AI (@deepseek_ai). You would have never guessed what their strategy is, it's really interesting: They're not chasing quick money from coding plans or multimodal models. Instead, their radical architecture innovations (MoE, MLA,
  • @dan_jeffries1 Daniel Jeffries on x
    Fascinating read. We desperately need an American open source champion and fast.
  • @pstasiatech Paul Triolo on x
    Excellent analysis.....Who in China makes LPDDR? CXMT. They are only 0.5 Gen behind on speed for LPDDR and 1 generation behind on density. Not very far! In addition abundant NAND, Chinese ecosystem will have abundant LPDDR in near future. Can this relieve pressure on compute?
  • @rohanpaul_ai Rohan Paul on x
    Reuters: DeepSeek just made its V4-Pro price cut permanent, pushing the price down to 25% of its original API cost. DeepSeek has not confirmed that better Ascend 950 supply caused the permanent cut, but the timing points to a cost curve moving downward as China's AI stack shifts …
  • @agraylin Alvin Wang Graylin on x
    Interesting read on how @deepseek_ai innovations in #AI models were driven by China's HW constraints and why its solutions will ultimately negate those constraints. 💡🐳
  • @kimmonismus @kimmonismus on x
    Let that sink in for a moment.  DeepSeek v4 pro 75% discount.  Permanent!  In: $0.43 Out: $0.87 If you read the DeepSeek v4 tech paper you know that this model is insanely good when it comes to efficiency.  Only 27% compute and only 10% cache compares to v3.2.  SemiAnalysis wrote…
  • @jukan05 Jukan on x
    Does this mean DeepSeek has already brought a Huawei Ascend 950 cluster online? [image]
  • @deepseek_ai @deepseek_ai on x
    We are making our discount permanent! 🎉 Enjoy building with DeepSeek-V4-Pro and bring your innovative ideas to life! 🚀 [image]
  • @yuchenj_uw Yuchen Jin on x
    Wow. A massive 75% discount from DeepSeek. Either they've done some serious inference optimizations, or Huawei chips are just that much cheaper? More open-source AI models, better token economy.
  • @goodalexander @goodalexander on x
    btw guys this is 1/35th the price of GPT 5.5
  • @zerohedge @zerohedge on x
    Has the “$5 trillion capex in 5 years” considered China
  • @healthranger @healthranger on x
    Holy cow, DeepSeek just made top-tier AI services ridiculously cheap... PERMANENTLY. This is a fraction of the price of U.S. frontier models, with comparable performance in most tasks.
  • @vaibhavsisinty Vaibhav Sisinty on x
    $0.435 per million input tokens.  $0.87 per million output.  $0.003625 for cache hits, basically free.  For context, gpt-5.5 and claude opus 4.7 are roughly 35x to 100x more expensive per token for the same class of model.
  • @rish404 Rish Agarwal on x
    @deepseek_ai This is my deepseek usage https://x.com/...
  • @jahooma James Grugett on x
    DeepSeep V4 Pro will remain 100% free in our coding agent 🎉 Try it now: npm i -g freebuff
  • r/hermesagent r on reddit
    DeepSeek v4 pricing change
  • r/DeepSeek r on reddit
    deepseek v4 pro price will be reduced to the current price permanently accoding to official website
  • r/singularity r on reddit
    DeepSeek Announces Permanent Price Cut of 75% after Promotion Period