The Parallel-R1 framework uses reinforcement learning to teach models how to explore multiple reasoning paths at once, ...
Naomi Saphra thinks that most research into language models focuses too much on the finished product. She’s mining the ...
Max, its trillion-parameter AI model trained on 36T tokens. The system handles 1M-token inputs and is available through Alibaba Cloud.
Chinese AI darling DeepSeek's now-famous R1 research report was published in the journal Nature this week, alongside new information on the compute resources required to train the model.
Learn how AI is transforming the software development lifecycle and driving innovation. Discover the benefits of integrating ...
When it comes to AI, many enterprises seem to be stuck in the prototype phase. Teams can be constrained by GPU capacity and ...
For a long time, training large models has relied heavily on the guidance of a "teacher." This could either be human-annotated "standard answers," which are time-consuming and labor-intensive, or ...
The LLM provides developers complete access to its architecture, data, and weights under a permissive open-source license.
The DeepSeek-AI team, led by Liang Wenfeng, published the training method for the open-source artificial intelligence (A… ...
In this important work, the authors present a new transformer-based neural network designed to isolate and quantify higher-order epistasis in protein sequences. They provide solid evidence that higher- ...
DeepSeek, the Chinese artificial intelligence (AI) developer, has recently disclosed that training its flagship R1 model cost ...
China bans Nvidia AI chips, pushing Baidu, Alibaba, and DeepSeek to domestic hardware like Huawei’s SuperPod amid the ...