A look at DeepSeek's model optimization to reduce HBM use, potentially enabling domestic memory, ASIC, and CPU makers to create a Chinese AI hardware ecosystem
Have you ever wondered, how DeepSeek may make money, and lot of it? They didn't come up with competitive coding plans like GLM, MoonShot and MiniMax.
DeepSeek releases its new flagship models V4 Pro and V4 Flash in preview, saying V4 Pro trails the performance of state-of-the-art models by about 3 to 6 months
Huawei says its Ascend supernode based on the Ascend 950 AI chips will fully support DeepSeek V4, as DeepSeek launches a preview of its V4 model
DeepSeek V4 Pro has 1.6T parameters, DeepSeek's largest model by that metric, and V4 Flash has 284B parameters; both models have a context window of 1M tokens
South China Morning Post:
DeepSeek releases its new flagship models V4 Pro and V4 Flash in preview, saying V4 Pro trails the performance of state-of-the-art models by about 3 to 6 months
DeepSeek rolled out preview versions of a new flagship artificial intelligence model a year after upending Silicon Valley …
Xiaomi unveils its MiMo-V2 AI models, including the 1T-parameter MiMo-V2-Pro, codenamed Hunter Alpha, which Xiaomi says benchmarks close to GPT-5.2 and Opus 4.6
Led by Fuli Luo, a veteran of the disruptive DeepSeek R1 project, the release represents what Luo characterizes as a “quiet ambush” on the global frontier.
Sources: Tencent is developing a top-secret AI agent for WeChat, and has tested using models from Zhipu, Alibaba, and DeepSeek, to compete with Qwen and Doubao
Tencent Holdings is secretly building a new AI agent for its hugely popular WeChat messaging app, in hopes of leapfrogging rivals …
Sources: DeepSeek plans to release its multimodal model V4 next week and worked with Huawei and Chinese AI chipmaker Cambricon to optimize V4 for their products
ByteDance's new AI video generation model Seedance 2.0 goes viral in China, with one state-backed newspaper saying it is bigger than DeepSeek's “Sputnik moment”
all production-ready. 15-sec multi-shot output with dual-channel audio. Film, advertising, gaming content costs about to crater. [video]@lentils80:Seedance 2.0 officially launches https://seed.bytedan...
ByteDance's new AI video generation model Seedance 2.0 goes viral in China, with one state-backed newspaper saying it is bigger than DeepSeek's “Sputnik moment”
ByteDance's new video-generating artificial intelligence model has already impressed the likes of Elon Musk and gone viral in China …