News

A replication study of Apple's controversial "The Illusion of Thinking" paper confirms some of its main criticisms, but challenges the study's central conclusion.
Huawei has publicly denied reports that its Pangu Pro MoE open-source model is a "recycled product" based on work from Alibaba.
Devstral Medium scored 61.6% on the same benchmark. According to Mistral, it offers more power and a lower price than Gemini 2.5 Pro and GPT-4.1. The model is available via API, supports fine-tuning, ...
Even basic phrases - from cat trivia to general financial advice - can act as adversarial triggers, highlighting how fragile model reasoning can be. | Image: Rajeev et al.
Huawei is open sourcing models from its Pangu series. The release includes the Pangu 7B language model with 7 billion parameters, the larger Pangu Pro MoE model with 72 billion parameters, and a model ...
The EU Parliament wants to ban AI-generated child sexual abuse material (CSAM) as part of a new directive, citing a rapidly growing threat. The Internet Watch Foundation (IWF) has warned that ...
ChatGPT's new image generation feature has seen rapid adoption, with users creating over 700 million images since its launch about a week ago.
Researchers at the University of Zurich conducted an unauthorized experiment on the popular Reddit community r/ChangeMyView (CMV), using AI-powered accounts to test the persuasive ability of large ...
OpenAI's new language model o3 shows concrete signs of deception, manipulation and sabotage behavior for the first time. External auditors warn that conventional tests are no longer sufficient to ...
Training larger and larger language models (LLMs) with more and more data hits a wall. According to OpenAI CEO Sam Altman, combining "much bigger" pre-trained models with reasoning capabilities could ...
OpenAI adds three new GPT-4.1 models to its API. The models are designed to outperform GPT-4o in most areas, while lowering costs and improving speed.
OpenAI is expanding its o-series with two new language models featuring improved tool usage and strong performance on complex tasks. The models aim for agent-like problem-solving capabilities.