AI Reinforcement learning

reinforcement learning, AI and Turing Award

· 3d · on MSN

AI pioneers scoop Turing Award for reinforcement learning work

· 2d

Reinforcement learning pioneers harshly criticize the "unsafe" state of AI development

· 3dAxios on MSN

Turing Award honors AI's reinforcement learning duo

Study shows AI models cheat to win when playing chess

· 2d

AI tries to cheat at chess when it’s losing

· 2dMIT Technology Review

AI reasoning models can cheat to win chess games

· 1d · on MSN

Sore loser: Study shows AI models cheat to win when playing chess

3don MSN

AI pioneers who channeled 'hedonistic' machines win computer science's top prize

Teaching machines in the way that animal trainers mold the behavior of dogs or horses has been an important method for developing artificial intelligence and one that was recognized Wednesday with the top computer science award.

Decrypt3d

Technique Behind ChatGPT’s AI Wins Computing’s Top Prize—But Its Creators Are Worried

Founders Andrew Barto and Richard Sutton received the 2024 Turing Award on Wednesday, before immediately flagging concerns about AI safety.

InfoWorld1d

Alibaba says its new AI model rivals DeepSeeks’s R-1, OpenAI’s o1

Alibaba Cloud on Thursday launched QwQ-32B, a compact reasoning model built on its latest large language model (LLM), Qwen2.5-32b, one it says delivers performance comparable to other large cutting edge models, including Chinese rival DeepSeek and OpenAI’s o1, with only 32 billion parameters.

Building A Comprehensive AI Safety Framework: A Roadmap For Responsible Innovation

Current research combined with industry development demonstrates that AI safety requires a complex approach that includes explanation methods alongside secure training procedures, adversarial validation and steady performance monitoring.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

reinforcement learning, AI and Turing Award

Study shows AI models cheat to win when playing chess

Organizations

People

Fields

Hot Topics