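Recently, Laiye Technology's OpenAPA framework scored 78.3% on OSWorld, the authoritative benchmark for Computer Use Agents, ranking first worldwide among Agentic Framework approaches. What is OSWorld? The "gaokao" (college entrance exam) of the Computer Use Agent world. If the capabilities of large language models can be measured with exams such as MMLU and GSM8K, then for whether AI can operate a computer the way a person does, the yardstick ...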
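On January 24, 2025, OpenAI released its first AI agent, Operator, a web application that can carry out simple online tasks in a browser, such as booking concert tickets or ordering groceries online. Operator is powered by Computer-Using Agent (CUA), a new model built on GPT-4o. It is currently available only to US users subscribed to ChatGPT Pro (the $200-per-month premium tier), with plans to roll it out to other users later.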
Google's Computer Use model is here! Early this morning, Google DeepMind released Gemini 2.5 Computer Use, a computer use model built on Gemini 2.5. Considering that just days ago Google released Chrome DevTools ...
Stacker on MSN

What is a computer use agent?

Zapier reports that while AI computer agents like Claude and ChatGPT can now control computers, safety concerns persist.
Computer-Using or Computer Use Agents (CUAs) are agentic AI capabilities that enable an AI model to perceive a screen “visually” and control it like a person would — clicking, typing, navigating an ...
A new framework from researchers at The University of Hong Kong (HKU) and collaborating institutions provides an open source foundation for creating robust AI agents that can operate computers. The ...
The demos look remarkable. An AI agent opens a browser, navigates a website, fills out a form, and books a flight, all without a human touching the keyboard. Over the past several months, a wave of ...
Perplexity, the AI-powered search company valued at $20 billion, on Wednesday launched what it calls the most ambitious product in its three-year history: a multi-model agent orchestration platform ...
Microsoft has developed a new component called UI-Evol that makes computer-use AI models much more accurate than ones that don't use it. Researchers from Microsoft Research Asia have developed ...
Computer scientists at UC Riverside have identified troubling flaws in a new generation of artificial intelligence (AI) ...