ronantech · TEXXR

2024-10-25

📰 AI News of the day! ➡️ @AIatMeta release quantized LLAMA 3.2 models ➡️ Accelerated by Arm's #KleidiAI library 🤔 Find out more: https://community.arm.com/... A big thank you to the entire Executorch team at AI at Meta for this great collaboration. 🙏

2024-10-25 View on X

SiliconANGLE

Meta debuts “quantized” versions of Llama 3.2 1B and 3B models, designed to run on low-powered devices and developed in collaboration with Qualcomm and MediaTek

so today we're releasing new quantized versions of Llama 3.2 1B & 3B that deliver up to 2-4x increases in inference speed and, on average, 56% reduction in model size, and 41% redu...

View original