AI Reinforcement learning

19h

Tencent’s new AI technique teaches language models ‘parallel thinking’

The Parallel-R1 framework uses reinforcement learning to teach models how to explore multiple reasoning paths at once, ...

2025 AI Training New Discovery: Reinforcement Learning is More Effective than Rote Memorization

2025 AI Training New Discovery: Reinforcement Learning is More Effective than Rote Memorization ...

Tech Xplore on MSN

Mice and AI neural networks reveal similar patterns when learning to cooperate

At a time when conflict and division dominate the headlines, a new study from UCLA finds remarkable similarities in how mice ...

Learning environments for training AI agents

AI agents require different training than static data sets. Work is underway in Silicon Valley to develop this.

DeepSeek-R1 Tops Nature: Breakthrough in Pure Reinforcement Learning, AI Reasoning Capabilities Evolve

DeepSeek-R1 takes a different path by adopting a pure reinforcement learning framework and introducing the Group Relative Policy Optimization (GRPO) algorithm. During the training process, the model ...

Alibaba integrates Nvidia’s AI robotics tools on cloud platform

The partnership is a positive signal for Chinese companies to use AI in developing robots and humanoids, analyst Tilly Zhang ...

Science Daily

AI-powered smart bandage heals wounds 25% faster

Heal, combines AI, imaging, and bioelectronics to speed up wound recovery. It continuously monitors wounds, diagnoses healing ...

Semiconductor Engineering

AI Agents For UVM Generation: Challenges And Opportunities

Tackling a composite challenge that combines multi-stage task planning, long-context work, environment interaction, and ...

2don MSN

Smart device uses AI and bioelectronics to speed up wound healing process

As a wound heals, it goes through several stages: clotting to stop bleeding, immune system response, scabbing, and scarring.

1don MSN

Emergent AI raises $23 million from Lightspeed, Together Fund, others

The company intends to use the fresh capital to hire talent across engineering and research, develop new product features, ...

What if we've been doing agentic AI all wrong? MIT offshoot Liquid AI offers new small, task-specific Liquid Nano models

According to the company, Liquid Nanos deliver performance that rivals far larger models on specialized, agentic workflows ...

SDxCentral

Alibaba Cloud unveils 800G AI-centric network architecture

Alibaba Cloud provided a glimpse into the workings of HPN in a paper published in July 2024. While details on this latest ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results