At the time of launch, OpenAI explained that Operator is powered by the Computer-Using Agent (CUA), which is a special AI ...
A new study has found that some AI bots resort to hacking their opponents when they sense they are about to lose a game.
Mini’s coding, math, and reasoning capabilities. Discover its strengths, limitations & real-world applications. This review ...
A 1B small language model can beat a 405B large language model in reasoning tasks if provided with the right test-time scaling strategy.
A Palisade Research study found that the newest reasoning models will cheat to win when tasked with defeating an advanced ...
Reasoning models like ChatGPT o1 and DeepSeek R1 were found to cheat in games when they thought they were losing.
When sensing defeat in a match against a skilled chess bot, advanced models sometimes hack their opponent, a study found.
A research team at Berkeley has introduced an innovative artificial intelligence model, DeepScaler, that challenges ...
ChatGPT users, including at the free tier, will see GPT-4.5 and GPT-5 next. The hot AI company wants to simplify its GPT and ‘o’ product lines.
OpenAI CEO Sam Altman said that the company wants to simplify its AI model offerings after launching no fewer than eight new ...