亞馬遜通過介紹Nova Act SDK並啟動Nova.amazon.com邁出了戰略一步,從而為公眾訪問其Nova Foundation模型。 This signals a shift in the company’s AI strategy—from operating as a cloud infrastructure partner to directly equipping developers with tools to build AI agents capable of acting within web browsers.
Nova Act SDK for Web-Based AI Agents
Nova Act is a software development kit (SDK) designed to help developers create agents that can perform human-like tasks within a瀏覽器環境。這些代理可以單擊按鈕,填寫表單,滾動頁面並與復雜的站點元素進行交互-通過視覺理解和動態交互。亞馬遜不僅為開發人員提供了一個靈活的基礎來構建自己的工具。
Google相比,Google開發了代理鏈框架,旨在通過共享內存和模塊化通信來協調多個AI代理。亞馬遜的NOVA法案似乎採取了更開放的路線,提供了更深的控制權,但要求開發人員進行更多的動手實施。
”
在亞馬遜共享的演示中,Nova Act通過直接與Google Maps進行互動來展示其瀏覽器代理功能。通過任務的代理原因-搜索“ Redwood City Caltrain Station”,並視覺上將查詢鍵入搜索欄,模擬了類似人類的動作。
來源:屏幕左側的Amazon
代碼片段定義了自行車類別類,這表明代理商將使用自行車時間和距離作為限制公寓清單的約束。 This scenario illustrates how Nova Act can interpret user goals and autonomously navigate web interfaces to complete multi-step tasks like planning a commute-centric housing search.
Nova Foundation Models Now Available for Public Use
Amazon has also removed previous barriers to its Nova foundation models by opening up nova.amazon.com, which allows anyone to test and interact with Nova Micro, Lite,和Pro。 Previously confined to AWS Bedrock, these models now support public-facing prompts and experimentation—without requiring cloud access or enterprise credentials.
[embedded content]
Nova supports 200+ languages and handles contexts up to 300,000 tokens, with plans to reach 2 million tokens later this year.
Earlier this month, Amazon extended its Nova models to AWS GovCloud for use in政府,金融和醫療保健等受管制環境。開發人員還可以訪問Nova Canvas和Nova Reel等視覺生成工具,它們創建圖像和視頻,包括內置的安全檢查和歸因框架。
這些工具使開發人員能夠追踪視覺內容的生成方式,以解決對誤導性信息的不斷增長的關注,從而解決誤解和合成的媒體源。亞馬遜正在準備發布2025年中期預期的新星品牌推理模型。 This model will reportedly combine fast conversational capabilities with deeper reasoning, bridging the divide between real-time interactions and long-form analysis.
Amazon is clearly positioning itself to compete with more mature reasoning systems like Claude 3.7 Sonnet, OpenAI’s o3-mini, and the just released Google Gemini 2.5 Pro experimental model.
Meanwhile, Nova Act is expected to play a core role in its new Alexa+語音助手提供了AI驅動的自動化和無縫服務協調。
全球競爭突出顯示了分歧代理策略
,而亞馬遜專注於工具,而其他公司則競爭交付最終的用戶代理人。中國的Zhipu AI剛剛推出了AutoGlm,這是一名自由球員,由其輕巧的GLM-Z1-Air型號提供動力。
為受限環境設計,AutoGLM在瀏覽器內或通過移動應用程序運行,並通過公司進行了基準標記,並通過公司進行了基準標記- Above GPT-4O和Claude 3.5 3.5 Sonnet in Stannet in Stannet in Stannet instanford tests instanford tests instanfornd tests instanfornd tests。 Zhipu還計劃在4月開放代理商,強調了Western AI Sphere以外的開發人員和全球機構的可訪問性。
本月早些時候,Manus AI成為了啟動完全自主系統的頭條新聞,該系統能夠在未經用戶批准的情況下採取行動。 Built by Butterfly Effect (Hong Kong), the agent employs reinforcement learning, LLM chaining, and a multi-signature control layer to execute workflows and hire contractors.
Following limited beta invites that were resold for thousands of dollars, the company introduced official paid tiers priced at $39 and $199/month.
Amazon’s Full-Stack Ambition Grow
與專注於前端代理商的公司不同,亞馬遜的策略是構建AI堆棧的每一層,從自定義矽到基礎模型再到面向開發人員的工具。該公司的Nova Stack接受了由Trainium 2芯片提供動力的大型集群的培訓,並得到了數十億美元的基礎設施投資的支持。 In a recent interview with Time, AWS CEO Matt Garman emphasized that Amazon’s goal is to offer AI services with long-term cost efficiency and scale.
This vertical integration gives Amazon fine-grained control over model optimization and部署,但也提高了開發人員採用的標準。與諸如操作員或自動化的插件不同,NOVA ACT要求用戶更多地努力自定義,部署和維護代理商的規模。
,
該折衷可能會限制一般用戶的牽引力,但要吸引希望將AI嵌入內部鍛煉或專有平台內的組織的誘因和用戶控制。借助開發人員首先的方法,該公司不僅可以實現AI的採用,還可以使一代建築商決定這些代理商將做什麼以及他們將如何做。