A study finds LLMs from Anthropic, Google, OpenAI, and xAI can facilitate academic fraud, specifically helping non-researchers submit fabricated papers to arXiv
- Elizabeth Gibney: All major large language models (LLMs) …
StepFun, a Chinese AI startup that develops LLMs and has partnered with automaker Geely and smartphone brands like Oppo and Honor, raised a ~$717M Series B+
StepFun, a Chinese AI startup specialising in the development of large language models (LLMs), has raised over 5 billion yuan …
OpenAI touts GPT-5's scores on math, coding, and health benchmarks: 94.6% on AIME 2025 without tools, 74.9% on SWE-bench Verified, and 46.2% on HealthBench Hard
After literally years of hype and speculation, OpenAI has officially launched a new lineup of large language models (LLMs) …
Tokyo-based Sakana AI details a new Monte Carlo Tree Search-based technique that lets multiple LLMs cooperate on a single task, outperforming individual LLMs
Japanese AI lab Sakana AI has introduced a new technique that allows multiple large language models (LLMs) to cooperate on a single task …
In an Oxford study, LLMs correctly identified medical conditions 94.9% of the time when given test scenarios directly, vs. 34.5% when prompted by human subjects
Headlines have been blaring it for years: Large language models (LLMs) can not only pass medical licensing exams but also outperform humans.
Alibaba releases open-source reasoning model QwQ-32B on Hugging Face and ModelScope, claiming comparable performance to DeepSeek-R1 but with lower compute needs
Qwen Team, which is growing Chinese e-commerce giant Alibaba's family of open-source Qwen large language models (LLMs) …
Some journalists are taking freelance jobs with AI training data companies like Scale AI, which recruit them for tasks such as fact-checking and prompt drafting
The gig work platform Outlier is one of several companies courting journalists to train large language models (LLMs).
OpenAI researchers build the SWE-Lancer benchmark and find that real-world freelance software engineering work remains challenging for frontier language models
Large language models (LLMs) may have changed software development, but enterprises will need to think twice …
A look at the OpenEuroLLM project, a partnership among 20 EU organizations to develop open-source LLMs that support all EU languages, with a budget of €37.4M
Large language models (LLMs) landed on Europe's digital sovereignty agenda with a bang last week, as news emerged of a new program …