Google is reportedly advancing its development of an "AI Computer-Using Agent" system, aiming to enhance human-computer interaction through greater autonomy and intelligence. This initiative aligns with recent advancements in artificial intelligence, particularly the potential of large language models (LLMs) to facilitate proactive assistance in various environments. For instance, the AssistantX framework demonstrates how LLMs can effectively collaborate with humans by understanding instructions and autonomously seeking support from colleagues . Such capabilities indicate a shift towards more sophisticated AI systems that not only respond reactively but also anticipate user needs.
Jarvis is reportedly made to work only with web browsers — particularly Chrome — to assist with common tasks like research, shopping and booking flights. It comes as Google continues to expand the capabilities of its Gemini AI, the next-gen model of which is expected to be revealed in December, as reported by The Verge. Gemini Live, Google’s AI chatbot, gained support for dozens of new languages this month, and Gemini integration has recently made it to Google Meet, Photos and other applications.
The news of Jarvis comes days after Anthropic introduced a similar but seemingly more expansive feature for its Claude AI, which it says has been equipped with computer skills so it can “use a wide range of standard tools and software programs designed for people.” That’s available now in a public beta.
Personal data can be at risk of getting leaked, especially when information is being automatically filled in forms to make purchases. The report does not mention if Google intends to place some contingencies on Project Jarvis to minimize the security risks and reduce the chances of the AI accessing personal information. Given that the company is in the crosshairs of antitrust watchdogs, it will not be a good look for Google if something terrible transpires.
Moreover, frameworks like AXIS propose an innovative approach by transforming applications into agents that prioritize API interactions over traditional user interface methods. This transition promises enhanced efficiency and accuracy in task completion . Google's exploration of these advanced systems could pave the way for creating a comprehensive "Agent OS," fundamentally changing how users engage with technology and streamlining workflows across various platforms.
Additionally, similar advancements are evident in projects like Agent S, which employs experience-augmented planning to automate complex tasks by allowing computers to interact with graphical user interfaces autonomously. These developments reflect a broader trend toward integrating AI more deeply into everyday tasks, highlighting the transformative potential of computer-using agents in enhancing productivity and user experience.
Read more
Bayern Munich crush Bochum to put Barcelona loss behind them 5-0 Actor from classic film ‘The Warriors’ David Harris dead aged 75Sarah H
Also on site :
- AI is disrupting the advertising business in a big way — industry leaders explain how
- M4 Max Running Cyberpunk 2077: Ultimate Edition At 120FPS Was Likely Achieved Without Path Tracing; Frame Generation Support For Apple Silicon Macs Could Have Made This Feat Possible
- Huawei’s Future Kirin Chipsets Will Eventually Move To The 5nm Process, With Commercialization Of The Manufacturing Process Said To Be Underway, But A Launch Will Not Happen This Year