/
Navigation
Chronicles
Browse all articles
Explore
Semantic exploration
Research
Entity momentum
Nexus
Correlations & relationships
Story Arc
Topic evolution
Drift Map
Semantic trajectory animation
Posts
Analysis & commentary
Pulse API
Tech news intelligence API
Browse
Entities
Companies, people, products, technologies
Domains
Browse by publication source
Handles
Browse by social media handle
Detection
Concept Search
Semantic similarity search
High Impact Stories
Top coverage by position
Sentiment Analysis
Positive/negative coverage
Anomaly Detection
Unusual coverage patterns
Analysis
Rivalry Report
Compare two entities head-to-head
Semantic Pivots
Narrative discontinuities
Crisis Response
Event recovery patterns
Connected
Search: /
Command: ⌘K
Embeddings: large
TEXXR

Chronicles

The story behind the story

days · browse · Enter similar · o open

Apple unveils new Apple Foundation Models: two on-device models, including a 20B-parameter multimodal model called AFM 3 Core Advanced, and three cloud models

Apple Machine Learning Research:

Apple Machine Learning Research

Discussion

  • @artisny @artisny on bluesky
    Two on-device models are enough for me.  And personally, I understood the vision presented in the Keynote.  No more “AI for AI's sake”—cheers to that.
  • @awnihannun Awni Hannun on x
    It's very cool that Apple shipped a 20B parameter on-device. You can't put 20B parameters in RAM at any reasonable precision. To make it work they are using pretty exotic architecture by today's standards. A small model predicts from the query (or prompt) which experts to load [i…
  • @jukan05 Jukan on x
    I'm curious whether Apple's FFN NAND-like approach reduces the mobile DRAM requirement needed for on-device AI. If so... why doesn't Nvidia use this kind of technology? Wouldn't that mean the 128GB in N1X is overkill? [image]
  • @eric_seufert Eric Seufert on x
    Apple announced major upgrades to the Foundation Model Framework today, but the framework itself is not new: Apple has made its own proprietary, 3B LLM available through the Foundation Model Framework at last year's WWDC. What is new is that the FMF now serves as a model router
  • @mweinbach Max Weinbach on x
    AFM Core Advanced is just for the iPhone 17 Pro, M3+ Mac, and M4+ iPad It's a sparse model, fully multimodal, and unlike any other on-device model AFM Core is for other devices, a dense on-device model
  • @anemll @anemll on x
    https://machinelearning.apple.com/ ... Instead of forcing the entire model into DRAM, the full model is stored in flash memory (NAND). Because NAND-to-DRAM bandwidth is too slow to swap weights token by token, as standard MoE models require, AFM 3 Core Advanced makes routing deci…
  • @kautukkundan @kautukkundan on x
    Apple is not squeezing a small model to be good, They're making a large model behave small! AFM 3 Core Advanced (Can I call it AFM 3 pro?) is a 20B parameter model that activates only 1-4B parameters at inference time, stored in flash and loaded into DRAM on demand. Not cloud.
  • @kimmonismus @kimmonismus on x
    Apple's new foundation models are genuinely exciting. The standout is AFM 3 Core Advanced, a 20-billion (!) parameter model that runs entirely on-device. Read that again. 20-billion, on-device, iPhone 17 Pro. It pulls this off by keeping the full model in flash memory and [image]
  • @zephyr_z9 @zephyr_z9 on x
    Interesting approach from Apple They are storing the shared attention block in the DRAM While the FFN weights stay in NAND and are loaded in the DRAM, depending on the request Apple is facing 3 constraints - 1) Limited DRAM size 2) Large model size (20B params) 3) Slow NAND read
  • @mweinbach Max Weinbach on x
    More on each of the new Apple Foundation Models AFM Core Advanced is likely the most impressive on-device model available https://machinelearning.apple.com/ ... [image]
  • @timkellogg.me Tim Kellogg on bluesky
    whoah this Apple tech is cool  —  they run a 20B model in-memory, but that's far too big to actually fit in memory, so they use a tiny classifier to select which experts to load, once per inference instead of per output token  —  machinelearning.apple.com/research/ int...  [image…
  • r/apple r on reddit
    Introducing the Third Generation of Apple's Foundation Models