2024-06-16
2 Gems in the Technical Report for @nvidia s new 340B model 💡 1. Weak-to-strong and iterative self-improvement works; also for (near-)SotA models 💪 2. Reward Models > LLM-as-a-judge 🧐 (additionally, the 340B Reward model also takes #1 in RewardBench by @natolambert ) Link 👇 [image]
NVIDIA Blog
Nvidia announces Nemotron-4 340B, a family of models that developers can use to generate synthetic data for training LLMs for commercial applications
Nemotron-4 340B, a family of models optimized for NVIDIA NeMo and NVIDIA TensorRT-LLM, includes cutting-edge instruct and reward models, and a dataset for generative AI training.
2024-06-15
2 Gems in the Technical Report for @nvidia s new 340B model 💡 1. Weak-to-strong and iterative self-improvement works; also for (near-)SotA models 💪 2. Reward Models > LLM-as-a-judge 🧐 (additionally, the 340B Reward model also takes #1 in RewardBench by @natolambert ) Link 👇 [image]
NVIDIA Blog
Nvidia announces Nemotron-4 340B, a family of models that developers can use to generate synthetic data for training LLMs for commercial applications
Nemotron-4 340B, a family of models optimized for NVIDIA NeMo and NVIDIA TensorRT-LLM, includes cutting-edge instruct and reward models, and a dataset for generative AI training.
2024-05-22
Phi3 vision was just released - it is just 4.2b params and extremely impressive. 🤩 I feel this is a breakthrough for low-latency/live inference on image streams - just imagine what even smaller/more specialized versions of this will enable in robotics! 🤯 [image]
Windows Central
Microsoft ships Azure AI Studio in broad availability, adds support for OpenAI's GPT-4o, and announces a new multimodal model in its lightweight Phi-3 family
Phi3 vision was just released - it is just 4.2b params and extremely impressive. 🤩 I feel this is a breakthrough for low-latency/live inference on image streams - just imagine what even smaller/more specialized versions of this will enable in robotics! 🤯 [image]
VentureBeat
Microsoft announces the general availability of its Phi-3 models, including Phi-3-Silica, a 3.3B parameter model that will be embedded on all Copilot+ PCs
here's what you can use it for Pradeep Viswav / MSPoweruser : Microsoft and Khan Academy announce AI partnership Kevin Okemwa / Windows Central : Microsoft ships Azure AI Studio in...