AI Reinforcement learning

Tencent’s new AI technique teaches language models ‘parallel thinking’

The Parallel-R1 framework uses reinforcement learning to teach models how to explore multiple reasoning paths at once, ...

Chinese food delivery firm Meituan's open source AI model LongCat-Flash-Thinking rivals GPT-5

Yet, here comes another model family worth consideration: Meituan, a Chinese food delivery and e-commerce app, attracted the ...

Tech Xplore on MSN

Engineers develop smarter AI to redefine control in complex systems

A new artificial intelligence breakthrough developed by researchers in the College of Engineering and Computer Science at ...

Learning environments for training AI agents

AI agents require different training than static data sets. Work is underway in Silicon Valley to develop this.

TechCrunch

AI pioneers scoop Turing Award for reinforcement learning work

Two trailblazing computer scientists have won the 2024 Turing Award for their work in reinforcement learning, a discipline in which machines learn through a reward ...

2025 AI Training New Discovery: Reinforcement Learning is More Effective than Rote Memorization

2025 AI Training New Discovery: Reinforcement Learning is More Effective than Rote Memorization ...

Physics World

The pros and cons of reinforcement learning in physical science

David Silver of Google DeepMind thinks AIs that ‘learn by experience’ are the future of AI – but maybe not in particle ...

1don MSN

Smart device uses AI and bioelectronics to speed up wound healing process

As a wound heals, it goes through several stages: clotting to stop bleeding, immune system response, scabbing, and scarring.

DeepSeek-R1 Featured on the Cover of Nature: A Revolution in Pure Reinforcement Learning Significantly Reduces AI Inference Costs

The research results of DeepSeek-R1 have disrupted the traditional training paradigm of LLMs. The paper indicates that ...

The Information

Everyone Wants To Be a Reinforcement Learning Startup

These days, artificial intelligence developers, investors and founders are all obsessed with “reinforcement learning,” a ...

Morning Overview on MSN

Autonomous AI Agents Build and Deploy Code Independently

In recent years, the development of autonomous AI agents capable of independently building and deploying code has gained ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results