A new study from University College London shows that large language models predict neuroscience research results more accurately than human experts. Published in Nature Human Behaviour, the study found AI models ...
Alibaba has released QwQ-32B-Preview, a new AI model that focuses on logical reasoning and problem-solving capabilities. The model appears to match and sometimes outperform OpenAI's latest offerings ...
Mistral AI adds web search and image generation to its Le Chat AI assistant, while introducing a new visual model that performs well on industry benchmarks. Le Chat users can now access current web ...
AI startup /dev/agents has secured $56 million in funding to create an operating system for AI agents. The company aims to enable computers to collaborate like humans do, which requires developing new ...
OpenAI rejects the accusations from the artists' group. Hundreds of artists contributed to Sora's development through their participation in the Alpha program and helped prioritize new features and ...
Leading AI companies are changing course. Instead of developing ever-larger language models, they are focusing on test-time compute, which uses more processing power during model execution rather than ...
The Windows app, following months of limited testing, is now publicly available with productivity features including a system-wide quick-launch command and a persistent companion window. The more ...
Cursor, a modified version of Visual Studio Code with AI features, has released an update that brings partial coding automation through AI agents that can ...
As part of the deal, Anthropic will use AWS Trainium and Inferentia chips to train and deploy its foundation models. The company's engineers are working directly with AWS's Annapurna Labs team to ...
Researchers from Stanford, the University of Washington, and Google DeepMind have created AI agents that can closely mimic human behavior in social experiments. According to the study, such simulations could ...
DeepSeek-R1-Lite-Preview is now live: unleashing supercharged reasoning power! o1-preview-level performance on AIME & MATH benchmarks. Transparent thought process in real-time.
The Information reports that OpenAI's next major language model, codenamed "Orion," delivers much smaller performance gains than expected. The quality improvement between GPT-4 and Orion is notably ...