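Chinese AI lab Monica (Monica AI) has introduced Manus, an agentic AI system that can operate without human supervision, marking a shift toward AI that makes decisions autonomously.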

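Unlike ChatGPT and Google Gemini, Manus is designed to work independently and make decisions in real time. Its release on March 6, 2025 reinforces the ongoing shift toward AI automation and places it at the center of international debates over governance, safety, and competition.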

Manus AI: China’s Fully Autonomous AI Agent and the Questions It Raises for AI Regulation and Safety

Executing Tasks Independently

Manus AI distinguishes itself through its so-called multi-signature (multisig) control system, allowing it to independently manage actions across different AI models.

Reports suggest that it employs LLM chaining and reinforcement learning for continuous optimization, enabling it to refine its decision-making processes over time.
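
To make the term concrete, LLM chaining generally means feeding the output of one model call into the next. The snippet below is a minimal sketch of that idea only, not Manus AI's actual implementation, which has not been published. The `chain` helper and the `plan`, `act`, and `summarize` steps are hypothetical stand-ins for real model calls.

```python
from typing import Callable, List

# A "step" is anything that takes text and returns text.
# In a real system each step would call a language model; here they are stubs.
Step = Callable[[str], str]


def chain(steps: List[Step], task: str) -> str:
    """Run the steps in order, passing each output to the next step."""
    result = task
    for step in steps:
        result = step(result)
    return result


# Hypothetical stand-in steps (assumptions for illustration only).
def plan(task: str) -> str:
    return f"plan({task})"


def act(plan_text: str) -> str:
    return f"result({plan_text})"


def summarize(result_text: str) -> str:
    return f"summary({result_text})"


if __name__ == "__main__":
    print(chain([plan, act, summarize], "screen resumes"))
```

The nesting in the printed string shows the order in which the steps ran. The reinforcement learning component mentioned in the reports is not covered by this sketch.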

Manus AI says its system has already demonstrated autonomous capabilities in tasks such as resume screening, workflow automation, and candidate evaluation.


Benchmark results shared by Manus AI indicate that it has achieved a new state-of-the-art performance level in the GAIA evaluation framework, which assesses AI in reasoning, automation, and tool use.

The GAIA (General AI Assistants) benchmark is a comprehensive AI performance test developed by Meta AI, Hugging Face, and AutoGPT to evaluate AI systems’ real-world capabilities. It assesses an AI’s ability to reason logically, process multimodal inputs, use external tools effectively, and automate real-world tasks.

GAIA is highly regarded in the AI community because it tests an AI’s real-world utility rather than just its theoretical knowledge. It reveals a significant performance gap between human and AI capabilities, with human respondents achieving 92% accuracy while top AI models struggle to reach similar levels.

Manus AI’s reported GAIA scores indicate strong performance in complex reasoning and automation, putting it ahead of some of its key competitors, including OpenAI Deep Research, Google’s Project Mariner, and AI agents powered by Anthropic’s Claude models.

Source: Manus AI

How Manus AI Differs from Other AI Assistants

Existing AI models operate under the human-in-the-loop principle, where users approve AI-generated actions before execution.

OpenAI’s Operator and Google’s Project Mariner follow this model, requiring explicit human confirmation before carrying out commands. Manus AI breaks from this approach, removing the need for direct approval and allowing AI-driven decisions to proceed automatically.
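
The difference between approval-gated and autonomous execution can be sketched in a few lines. This is an illustrative sketch under stated assumptions, not the code of Operator, Project Mariner, or Manus: the `Action` type, the `run_agent` function, and the `approve` callback are all hypothetical names introduced here.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Action:
    # Hypothetical human-readable summary of what the agent wants to do.
    description: str


def run_agent(
    actions: List[Action],
    execute: Callable[[Action], str],
    approve: Optional[Callable[[Action], bool]] = None,
) -> List[str]:
    """Execute proposed actions in order.

    If `approve` is provided (human-in-the-loop), each action runs only
    when the callback returns True. If `approve` is None (autonomous),
    every action runs without confirmation.
    """
    log: List[str] = []
    for action in actions:
        if approve is not None and not approve(action):
            log.append(f"skipped: {action.description}")
            continue
        log.append(execute(action))
    return log


if __name__ == "__main__":
    proposed = [Action("send email"), Action("delete files")]
    do = lambda a: f"done: {a.description}"

    # Human-in-the-loop: a reviewer rejects anything that deletes data.
    print(run_agent(proposed, do, approve=lambda a: "delete" not in a.description))

    # Autonomous: no approval step, so every action is executed.
    print(run_agent(proposed, do))
```

In the gated run the second action is skipped, while in the autonomous run both actions execute. The only difference between the two modes is whether an approval callback exists.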

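The system is designed to handle a wide range of tasks without supervision, from managing software environments to navigating digital workflows. This independence could make Manus AI a highly sought-after tool, with strong demand for access.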

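After the release, reports emerged of users staying up all night trying to obtain invitation codes, underscoring public interest in AI-driven automation tools.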

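#Manus AI agent is all the rage, invite codes reach $7k/ea

Chinese startup Butterfly Effect launches Manus, a general-purpose AI agent that completes tasks autonomously. Demand has soared, with invite codes selling for as much as 50,000 yuan ($7k). The co-founder denies any paid access channels. pic.twitter.com/ng9hfsmp2y

- TechTechChina (@techtechchina), March 7, 2025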

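The official website provides no details about the team behind Manus. However, Tianyancha records show that Butterfly Effect (Hong Kong) Ltd., the company behind Manus, was founded in 2023.

Beijing Red Butterfly Technology, founded by Xiao Hong in 2022, is responsible for launching Monica, an AI-powered browser extension aimed at overseas users. Its current legal representative is Feng Qionghua, and company filings show that Xiao Hong stepped down in March 2023.

Manus AI’s ability to act on its own is raising cybersecurity concerns. Experts warn that, without proper safeguards, autonomous AI agents could be used for cyberattacks, large-scale fraud, or automated disinformation campaigns.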

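The lack of human verification makes manipulation harder to prevent and raises new ethical and legal challenges.

Manus enters a field that includes Google DeepMind and other leading U.S. AI labs, and its introduction comes as China steps up efforts to reduce its reliance on U.S. technology and accelerate AI-driven automation. With tightening U.S. AI chip export restrictions limiting access to high-performance semiconductors, Chinese firms are optimizing AI software for greater efficiency.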

Besides DeepSeek’s R1 reasoning model, Alibaba’s QwQ-32B is an example of this shift, designed to handle complex reasoning tasks with lower computational demands.

Unlike Western AI firms, which focus on enhancing human-guided AI systems, China’s leading AI startups are investing in fully independent AI models. Manus AI represents a major step in that strategy, demonstrating that AI can function autonomously without cloud-based dependencies.

AI Governance and the Global Response to Manus

The release of Manus AI has intensified discussions about AI governance, particularly with regard to existing ...

Meanwhile, the European Union is exploring regulatory updates to address the rise of autonomous AI. If the EU determines that AI models like Manus lack sufficient oversight, it may introduce new AI accountability laws that limit the deployment of self-governing systems.

China’s Strategic AI Shift and Its Impact

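China’s move toward autonomous AI reflects a broader strategy to strengthen its domestic AI capabilities and reduce reliance on Western technologies. With ongoing semiconductor restrictions blocking Chinese companies from accessing high-performance AI chips, developers are focusing on software optimization.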

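By emphasizing autonomous AI, China may sidestep some of the hardware limitations caused by export bans. While OpenAI’s Operator ensures that users approve AI-driven actions, Manus represents a shift toward full independence. The question now is whether this model will be widely adopted or whether regulatory concerns will limit its global reach.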

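The introduction of Manus AI could mark another turning point for the AI industry, as it forces developers, businesses, and regulators to rethink how AI should operate in real-world environments. If autonomous AI is allowed to operate freely, industries that rely on automation could see unprecedented efficiency gains. However, the risk of AI-driven decisions made without accountability remains a key concern.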

As AI competition intensifies, the future of autonomous AI will depend on whether companies and governments can establish clear guidelines for its use. Whether Manus AI becomes a widely adopted model or faces restrictions, its arrival is another sign of the beginning of a new era, one in which AI is no longer just a tool but an independent entity capable of making its own choices.

