AAI Search
← 返回首页
话题

AI Agent

367 条相关资讯 · 来自历史归档

行业动态NEW
41 分钟前
原华为盘古“90 后少帅”王云鹤离职创业,新公司“基元律动”获 1 亿美元估值融资

IT之家 6 月 2 日消息,据新浪科技今日报道,曾在华为主导盘古大模型研发的“90 后少帅”王云鹤,已于近期投身 AI Agent 领域创业,其新成立的公司“基元律动”已完成一轮估值达 1 亿美元的新融资。 王云鹤在今年 3 月末正式告别了工作近 9 年的华为。离职前,他最后的职务为华为诺亚方舟实验室主任、盘古大模型负责人,曾被誉为“盘古大模型少帅”和“天…

AI 点评 · 顶尖技术人才创业动向,折射AI Agent赛道资本热度与行业新趋势。

产品发布/更新NEW
1 小时前
英伟达 Spectrum- X 以太网硅光技术已全面量产,较传统网络能效提升 5 倍

IT之家 6 月 2 日消息,英伟达于 5 月 31 日宣布,其面向智能体 AI 工厂的下一代超级计算平台 NVIDIA Vera Rubin 已进入全面量产阶段。IT之家此前已有相关报道。 除此之外,英伟达同时确认新一代 Spectrum-X 以太网硅光技术已同步进入全面量产阶段,这是该平台实现大规模 AI 工厂网络互联的核心基石。 作为全球首款基于光电一…

AI 点评 · 硅光技术量产突破,能效提升5倍,将加速AI工厂网络部署,改变行业格局。

行业动态NEW
1 小时前
CPU 需求与日俱增,英特尔陈立武自曝许多公司 CEO 来电“求供货”

IT之家 6 月 2 日消息,据澎湃新闻,英特尔 CEO 陈立武 2 日(今天)在台北电脑展上表示,CPU 需求越来越高,但供给受到限制。过去四周内, 许多公司 CEO 打电话给他要更多的 CPU ,对英特尔来说“是一个机会”。 AI 智能体的兴起,使中央处理器的重要性得以再次提升,从而带动需求大量增加。陈立武在谈到 CPU 的发展趋势时指出,AI 智能体需…

AI 点评 · 高管亲述供货紧张,反映AI时代CPU需求爆发,英特尔产能成关键变量。

行业动态NEW
1 小时前
Rehumanizing global health care with agentic AI

The global health care sector is under increasing strain. Decades of chronic underinvestment and constraints in recruitment have coincided with a surge in demand for services for a…

AI 点评 · 用AI代理重构医疗流程,缓解人力短缺,提升服务效率与可及性。

产品发布/更新NEW
3 小时前
腾讯客服:微信正与华为、荣耀、小米、OPPO、vivo 等合作,通过手机语音助理发起音视频通话或向指定好友发送消息

IT之家 6 月 2 日消息,据IT之家小伙伴今日反馈,腾讯客服最新回复显示, 微信正在与华为、荣耀、小米、OPPO、vivo 等手机厂商合作推出 A2A 助手能力 。 用户可以通过手机语音助理发起微信音视频通话或向指定好友发送消息。该功能基于 A2A(Agent-to-Agent)协作机制, 由厂商 AI 助手向微信发起指令,微信负责执行并返回结果 ,全程…

AI 点评 · 手机厂商AI助手与微信深度打通,标志着跨应用智能协作进入实用阶段。

模型发布/更新NEW
11 小时前
NVIDIA Jetson Brings Agentic AI to the Physical World

Agentic AI is getting physical. At COMPUTEX on Tuesday, NVIDIA announced NVIDIA JetPack 7.2 and NVIDIA NemoClaw support on NVIDIA Jetson. JetPack 7.2 brings agentic AI skills, Yoct…

AI 点评 · 英伟达让AI从虚拟走向实体,开启物理世界自主决策新纪元。

产品发布/更新NEW
14 小时前
阿里发布 Qwen3.7-Plus 模型,升级多模态交互混合 AI 智能体

IT之家 6 月 2 日消息,阿里千问大模型今天(6 月 2 日)发布博文,宣布推出 Qwen3.7-Plus 模型, 定位为多模态交互混合智能体。 Qwen3.7-Plus 是 Qwen3.7 的多模态升级版,核心定位是视觉与语言统一的智能体基座。 它保留文本、编码、工具使用和生产力工作流能力,同时强化视觉理解、视觉推理和跨模态任务处理。 模型已通过阿里云…

AI 点评 · 多模态与智能体融合,或加速AI从“对话”迈向“行动”的关键一步。

论文研究NEW
19 小时前
HERO'S JOURNEY: Testing Complex Rule Induction with Text Games

We introduce HERO'S JOURNEY, a benchmark for rule induction in goal-directed episodic tasks, where agents must infer hidden rules from demonstrations and act on them through multi-step execution. HERO…

AI 点评 · 用文本游戏测试AI规则归纳能力,填补了复杂推理任务基准的空白。

论文研究NEW
19 小时前
SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

Agent skills occupy a privileged position in the agent workflow, as agents are expected to implicitly follow and execute them, rendering third-party skills a vulnerable attack surface. Existing studie…

AI 点评 · 自动化构建技能生命周期攻击,揭示第三方技能在智能体流程中的隐蔽安全风险,需重视防御。

论文研究NEW
19 小时前
Tracking the Behavioral Trajectories of Adapting Agents

Text files such as skill files, memory files, and behavioral configuration files play a central role in defining how modern agents act. Through edits by humans or the agents themselves, these files ma…

AI 点评 · 追踪智能体行为轨迹,揭示自我调整机制,为AI决策透明化提供新视角。

论文研究NEW
19 小时前
Auditing Asset-Specific Preferences in Financial Large Language Models: Evidence from Bitcoin Representations and Portfolio Allocation

Large language models now power robo-advisors and trading agents, yet whether they carry built-in biases toward specific assets is largely untested. We ask three questions: do LLMs systematically pref…

AI 点评 · 审计金融大模型对特定资产的偏好,揭示AI决策的隐性偏差,影响投资策略可靠性。

行业动态NEW
昨天
华为 FreeClip 2 耳夹耳机典藏版发布:珠宝盒设计、全新 AI 键智能体交互,1499 元

IT之家 6 月 1 日消息,在今天的华为 nova 16 系列及全场景新品发布会上,华为终端 BG CEO 何刚正式发布了 FreeClip 2 耳夹耳机典藏版, 定价 1499 元 。 据介绍,华为 FreeClip 2 耳夹耳机典藏版采用鎏光宝盒 + 珠宝盒设计,充电舱采用真空镀膜工艺,主打“圆润璀璨”, 同时内部空间提升 20% 。 这款耳机还与周大…

AI 点评 · 将珠宝美学与AI智能体交互结合,为耳机品类带来轻奢体验与技术创新突破。

行业动态NEW
昨天
“全球最强大的桌面 AI 超级计算机”,英伟达 DGX Station for Windows 发布

IT之家 6 月 1 日消息,在今日的 2026 台北国际电脑展主题演讲中,英伟达 CEO 黄仁勋发布了“全球最强大的桌面 AI 超级计算机”—— DGX Station for Windows 。 DGX Station for Windows 用于在 Windows 上开发和运行智能体 —— 基于英伟达 GB300 Grace Blackwell Ult…

AI 点评 · 首次将企业级AI算力带入桌面端,为Windows生态开发者提供了本地化训练与推理的超级工具。

模型发布/更新NEW
昨天
英伟达发布 5500 亿参数 Nemotron 3 Ultra 开源模型,较同级别前沿模型推理速度最高提升 5 倍

IT之家 6 月 1 日消息,为加强自主智能体的智能能力,英伟达今日发布了面向全天候运行智能体的全新开源模型与数据集,相关成果由英伟达 Nemotron 联盟联合打造。 据官方介绍,英伟达 Nemotron 3 Ultra 是一款拥有 5500 亿参数的混合专家模型,可为代码开发、科研及企业业务流程中的长效智能体提供顶尖智能能力。相较于同级别主流开源前沿模型…

AI 点评 · 参数规模与推理速度双突破,为智能体部署树立新标杆。

模型发布/更新NEW
昨天
NVIDIA Levels Up Local AI Agents Across RTX PCs and DGX Spark

Personal agents are exploding in popularity, with open source projects like OpenClaw and Hermes seeing rapid adoption by AI developer communities on GitHub. Built to adapt to indiv…

AI 点评 · 英伟达将本地AI智能体部署到RTX电脑和DGX工作站,推动个人AI应用从云端走向本地化。

行业动态NEW
昨天
英伟达 Vera 处理器发布:专为 AI 智能体打造,OpenAI、SpaceXAI、字节跳动都要用

IT之家 6 月 1 日消息,在今日的 2026 台北国际电脑展主题演讲中,英伟达 CEO 黄仁勋宣布正式推出 Vera 处理器 。 英伟达 Vera 是一款专为 AI 智能体打造的 CPU ,速度比 x86 处理器快 1.8 倍,可驱动各行各业的多样化工作负载,Vera 现已全面投产。 Vera 以 Grace CPU 的成功为基础(迄今为止,Grace…

AI 点评 · 巨头下场定义AI智能体专用芯片,生态号召力预示行业新标杆。

产品发布/更新NEW
昨天
黄仁勋:英伟达下一代 AI 超级芯片平台 Vera Rubin 全面投产

IT之家 6 月 1 日消息,在今日的 2026 台北国际电脑展主题演讲中,英伟达 CEO 黄仁勋宣布 Vera Rubin 全面投产。 Vera Rubin 为下一代 AI 工厂提供了 POD 规模的基础架构 —— 与上一代 Grace Blackwell 平台相比, 其大规模智能体吞吐量提高了 10 倍 。 凭借成熟的开源 MGX 设计,英伟达供应链生态…

AI 点评 · 下一代AI算力跃升10倍,英伟达再次定义超大规模集群新标杆。

产品发布/更新NEW
昨天
RuleGo v0.36.0 发布:声明式 AI Agent 框架,规则引擎 × 智能体一体化

RuleGo 是一个基于 Go 语言的轻量级、高性能、嵌入式规则引擎。它通过规则链(JSON/可视化)编排组件,实现复杂业务逻辑的声明式管理,在物联网、边缘计算、数据集成、自动化等场景有广泛应用。 v0.36.0 是一个里程碑版本:rulego-components-ai 从 AI 组件库正式升级为声明式 AI Agent 开发框架,同时 Server 模块…

论文研究NEW
昨天
Multi-Agent Computer Use

Computer use agents (CUAs) today are primarily deployed as single serial agents. This setup is suboptimal for complex long-horizon tasks that benefit from task decomposition, parallel execution, and c…

AI 点评 · 多智能体协作提升复杂长任务效率,突破单代理局限,值得关注。

产品发布/更新NEW
5/31 15:22
argahv/sisyphus-academica

Sisyphus Academica — The Research Paper Writing Army. 20+ agent swarm: 6 novelty engines, 10 adversarial reviewers, Humanizer-integrated writing, citation verif…

AI 点评 · 用20多个AI代理模拟学术生产链,挑战论文写作与评审的自动化边界。

产品发布/更新NEW
5/30 19:53
prashar32/riskkernel

Deterministic cost / loop / time budgets · full observability · crash-resumable runs · human-approval gates · a memory you own. Self-hosted. Your keys. No telem…

AI 点评 · 用确定性成本和可恢复运行打破AI黑箱,赋予用户数据主权的轻量级内核。

产品发布/更新NEW
5/30 19:53
prashar32/riskkernel

Deterministic cost / loop / time budgets · full observability · crash-resumable runs · human-approval gates · a memory you own. Self-hosted. Your keys. No telem…

AI 点评 · 将确定性成本、循环时间预算与可恢复运行结合,为AI安全执行提供新范式。

论文研究NEW
5/30 04:00
FineVerify: Scaling Test-Time Compute with Fine-Grained Self-Verification for Agentic Search

Agentic search requires language model agents to explore many sources and answer complex information-seeking questions. Scaling test-time compute is a promising way to improve these agents, but curren…

AI 点评 · 用细粒度自验证扩展测试时计算,首次系统解决智能体搜索中的错误累积问题,为复杂信息检索提供可扩展方案。

论文研究NEW
5/30 01:57
Stateful Online Monitoring Catches Distributed Agent Attacks

Language models can find thousands of severe software vulnerabilities, and agents are increasingly being misused for cyberattacks. To avoid detection, attackers frequently distribute their misuse, spl…

AI 点评 · 分布式智能体攻击难追踪,状态监测实现实时阻断,提升AI安全防御新高度。

论文研究NEW
5/30 01:00
Preference-Aware Rubric Learning for Personalized Evaluation

As Large Language Models (LLMs) evolve from general-purpose assistants to user-centric agents, personalization has become central to aligning model behavior with individual preferences, making the eva…

AI 点评 · 个性化评估框架创新,让大模型更懂用户,提升人机交互体验。

行业动态
5/29 18:00
Adobe’s conversational AI agent is a mediocre design intern

AI image tools rarely make me feel like I'm part of the creative process. They are, after all, mostly designed so that people with no design experience can type in a few words and…

AI 点评 · 评测揭示AI工具在创意协作中的局限,提醒行业需更关注人机共创体验而非替代。

产品发布/更新
5/29 15:25
StarTrail-org/PixelRAG

The end of web parsing. The beginning of scalable pixel-native search.

AI 点评 · 像素级搜索技术突破,终结传统网页解析,开启视觉原生检索新范式。

产品发布/更新NEW
5/29 15:25
StarTrail-org/PixelRAG

The end of web parsing. The beginning of scalable pixel-native search.

AI 点评 · 将网页解析转向像素级原生搜索,为多模态检索开辟全新路径。

行业动态
5/29 05:24
The internet is being rebuilt for machines

As AI agents move from experiments to production, AWS, Cloudflare, and others are redesigning cloud infrastructure for a future dominated by machine-generated internet traffic inst…

AI 点评 · 云巨头重造底层架构,AI代理将主导未来网络流量。

行业动态NEW
5/29 05:24
The internet is being rebuilt for machines

As AI agents move from experiments to production, AWS, Cloudflare, and others are redesigning cloud infrastructure for a future dominated by machine-generated internet traffic inst…

AI 点评 · 云巨头正为AI时代重构网络,机器流量将主导未来,基础设施变革迫在眉睫。

技巧与观点
5/29 04:32
Evaluating Deep Agents using LangSmith on AWS

This post combines learnings from LangChain’s work on evaluating deep agents and Anthropic’s guide to demystifying evals for AI agents into a practical guide. In this post, you wil…

AI 点评 · 结合LangChain与Anthropic经验,提供AWS上评估深度代理的实用指南,填补实操空白。

技巧与观点NEW
5/29 04:32
Evaluating Deep Agents using LangSmith on AWS

This post combines learnings from LangChain’s work on evaluating deep agents and Anthropic’s guide to demystifying evals for AI agents into a practical guide. In this post, you wil…

AI 点评 · 结合LangChain与Anthropic的评估经验,为复杂AI代理提供实用评测指南,填补行业方法论

论文研究NEW
5/29 04:00
Task-Focused Memorization for Multimodal Agents

Long-term memory is essential for multimodal agents to build coherent experience, accumulate world knowledge, and achieve continual learning. However, constructing effective memory goes beyond memory…

AI 点评 · 聚焦多模态智能体的长期记忆构建,突破传统记忆局限,实现持续学习与知识积累。

论文研究
5/29 01:56
Gram: Assessing sabotage propensities via automated alignment auditing

We introduce Gram, an automated alignment auditing framework to assess the propensity of AI agents to engage in sabotage. We evaluate Gemini models across 17 simulated agentic deployment scenarios tha…

AI 点评 · 自动对齐审计框架首次量化评估AI的蓄意破坏倾向,为AI安全治理提供可操作工具。

模型发布/更新
5/29 01:51
Claude Opus 4.8 is now available on AWS

This post covers Opus 4.8's improvements and practical guidance for AI engineers integrating the model into agentic systems and production inference workloads on Amazon Bedrock.

AI 点评 · Claude新版本登陆AWS,专为智能体系统优化,工程落地价值显著。

模型发布/更新NEW
5/29 01:51
Claude Opus 4.8 is now available on AWS

This post covers Opus 4.8's improvements and practical guidance for AI engineers integrating the model into agentic systems and production inference workloads on Amazon Bedrock.

AI 点评 · Claude新模型登陆AWS,为AI工程化部署提供关键升级,值得开发者关注。

模型发布/更新
5/28 20:00
How Endava builds an agentic organization with Codex

Learn how Endava uses Codex to build an agentic organization, accelerating software delivery and reducing requirements analysis from weeks to hours.

AI 点评 · Endava借助Codex将需求分析周期从数周缩至数小时,展示了AI代理加速软件交付的实战价值。

模型发布/更新NEW
5/28 20:00
How Endava builds an agentic organization with Codex

Learn how Endava uses Codex to build an agentic organization, accelerating software delivery and reducing requirements analysis from weeks to hours.

AI 点评 · 恩达瓦用Codex将需求分析从周缩短到小时,展示了AI代理加速软件交付的实战价值。

产品发布/更新
5/28 19:57
科氪 | 雷神联合AMD发布覆盖三大形态AI工作站产品矩阵

5月28日,雷神在北京举办以《聚势共生 智算同行》为主题的AI工作站新品发布会,正式推出覆盖塔式、迷你PC和移动三大类别的AI工作站全场景产品矩阵。这是业内首批完成三大形态全覆盖的AI工作站产品发布,以行业领先的品类矩阵和旗舰级算力水准,重新定义了AI工作站的性能基准。 官方图片 AI 正式迈入智能体时代,行业从文本预测转向自主逻辑思考,未来 AI 算力需求…

AI 点评 · 雷神联合AMD率先实现AI工作站三大形态全覆盖,展现行业标杆级算力布局。

产品发布/更新NEW
5/28 18:45
2aronS/Duel-Agents

CLI, SDK, and IDE plugins for Duel Agents

AI 点评 · 多智能体协作开发工具链,降低AI应用开发门槛。

产品发布/更新NEW
5/28 18:45
2aronS/Duel-Agents

CLI, SDK, and IDE plugins for Duel Agents

AI 点评 · 用命令行工具和插件简化AI智能体开发,提升调试效率。

产品发布/更新NEW
5/28 17:03
Health-Yang/MineEcho

Local-first Memory OS for personal AI assistants with L0-L3 memory, Wiki++ knowledge, skill routing, and TokenLess context compression.

AI 点评 · 个人AI助手本地记忆系统,实现知识路由与无令牌压缩,突破云端依赖瓶颈。

技巧与观点
5/28 07:44
sqlite AGENTS.md

sqlite AGENTS.md SQLite gained an AGENTS.md file five days ago - but it's not intended for their own development, it's presumably aimed at people who are pointing agents at the SQL…

AI 点评 · SQLite为AI代理设开发规范,开创数据库工具与AI协作新范式。

技巧与观点NEW
5/28 07:44
sqlite AGENTS.md

sqlite AGENTS.md SQLite gained an AGENTS.md file five days ago - but it's not intended for their own development, it's presumably aimed at people who are pointing agents at the SQL…

AI 点评 · SQLite新增AGENTS.md,专为AI代理设计,体现数据库与智能工具融合新趋势。

产品发布/更新
5/28 06:47
helloianneo/ian-xiaohei-illustrations

中文小黑怪诞正文配图生成 Skill | 16:9 白底手绘 | 少量红橙蓝批注 | Codex Skill

AI 点评 · 用代码生成中文怪诞插画,AI绘画技能定制化新玩法。

产品发布/更新NEW
5/28 06:47
helloianneo/ian-xiaohei-illustrations

中文小黑怪诞正文配图生成 Skill | 16:9 白底手绘 | 少量红橙蓝批注 | Codex Skill

AI 点评 · 结合手绘与AI生成,打造独特怪诞视觉风格,创意与工具融合的趣味尝试。

论文研究
5/28 04:00
PhoneWorld: Scaling Phone-Use Agent Environments

A central bottleneck for phone-use agents is that controllable, reproducible environments covering real mobile behavior are hard to build at scale. Existing mobile-agent benchmarks have made important…

AI 点评 · 首个大规模可复现手机操作环境,填补真实移动行为数据空白,加速AI代理实用化进程。

论文研究
5/28 04:00
GenClaw: Code-Driven Agentic Image Generation

Image generation models have evolved from text-conditioned pixel synthesis toward multimodal agents endowed with visual comprehension and tool invocation capabilities. Yet, existing agents remain at t…

AI 点评 · 代码驱动生成图像,打通语言与视觉鸿沟,开辟智能代理新范式。

论文研究NEW
5/28 04:00
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents

LLM agents are increasingly deployed as systems built around editable external harnesses, including prompts, skills, memories and tools, that shape task execution without changing model parameters. Ha…

AI 点评 · 揭示大模型进化本质:外部系统更新不等于模型能力提升,为自我进化智能体研究厘清关键概念。

技巧与观点
5/28 02:00
Powering agentic AI sales strategy with Amazon Bedrock AgentCore

As agent adoption scaled, we saw a common pattern emerge across enterprises, including our own sales organization: specialized agents deliver value, but without orchestration, user…

AI 点评 · 亚马逊用自家销售实战验证Agent编排的价值,为企业规模化部署AI代理提供可复用的参考。

技巧与观点NEW
5/28 02:00
Powering agentic AI sales strategy with Amazon Bedrock AgentCore

As agent adoption scaled, we saw a common pattern emerge across enterprises, including our own sales organization: specialized agents deliver value, but without orchestration, user…

AI 点评 · 用Bedrock AgentCore编排多智能体协作,是企业规模化部署AI销售的关键突破。

模型发布/更新NEW
5/28 00:00
AI Factories: The New Infrastructure of Intelligence

AI factories are token factories, converting power into intelligence in real time. And as agentic AI scales and autonomous, always-on special agents are deployed in the enterprise,…

AI 点评 · AI工厂将电力实时转化为智能,标志着智能基础设施革命的开端。

产品发布/更新
5/27 20:05
op7418/guizang-social-card-skill

🪧 Claude Code / Codex skill — generate Xiaohongshu carousels & WeChat 21:9+1:1 cover pairs. Editorial × Swiss visual systems, 28 layouts, 10 themes, single-fil…

AI 点评 · 将小红书爆款排版与微信封面设计自动化,融合瑞士视觉系统,极大提升内容生产效率。

产品发布/更新NEW
5/27 20:05
op7418/guizang-social-card-skill

🪧 Claude Code / Codex skill — generate Xiaohongshu carousels & WeChat 21:9+1:1 cover pairs. Editorial × Swiss visual systems, 28 layouts, 10 themes, single-fil…

AI 点评 · 将小红书和微信封面设计自动化,融合瑞士视觉系统,极大提升内容生产效率。

产品发布/更新NEW
5/27 18:06
repanareddysekhar/llm-obs

Lightweight Python SDK for LLM inference logging and observability

AI 点评 · 轻量级LLM推理日志与可观测性工具,填补了模型监控领域的基础设施空白。

产品发布/更新NEW
5/27 16:25
yb2460/harness-anything

CLI harness for WPS Office -- let AI agents control Writer, Calc & Impress via COM automation

AI 点评 · 用命令行让AI自动操控WPS三大组件,打通办公软件自动化新路径。

模型发布/更新
5/27 15:00
Building self-improving tax agents with Codex

See how OpenAI, Thrive, and Crete built a self-improving tax agent with Codex, automating filings, improving accuracy, and accelerating workflows.

AI 点评 · 用Codex构建自我进化的税务代理,展现AI在专业领域的自动化与精度突破。

模型发布/更新NEW
5/27 15:00
Building self-improving tax agents with Codex

See how OpenAI, Thrive, and Crete built a self-improving tax agent with Codex, automating filings, improving accuracy, and accelerating workflows.

AI 点评 · 利用Codex实现税务代理自我进化,自动化与准确性双提升,开辟AI落地新场景。

产品发布/更新
5/27 13:46
withkynam/vibecode-pro-max-kit

Your AI forgets. This remembers. Spec-driven coding harness for vibecoders, product owners, CEOs and real builders — self-improving context memory, 12 agents, 3…

AI 点评 · 用结构化记忆解决AI遗忘痛点,12个智能体协同,适合追求效率的开发者。

产品发布/更新NEW
5/27 13:46
withkynam/vibecode-pro-max-kit

Your AI forgets. This remembers. Spec-driven coding harness for vibecoders, product owners, CEOs and real builders — self-improving context memory, 12 agents, 3…

AI 点评 · 用12个智能体构建自进化记忆系统,专为追求高效编码的实干者设计,重新定义AI协作体验。

论文研究
5/27 04:00
Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

Equipping large language models with explicit skills has emerged as a promising paradigm for enabling autonomous agents to solve complex tasks. Agent skills can be inherently divided into general skil…

AI 点评 · 聚焦大模型技能内化与利用,突破分布外泛化难题,为智能体强化学习开辟新路径。

论文研究
5/27 04:00
LACUNA: Safe Agents as Recursive Program Holes

LLM agents increasingly act by writing code, yet a split persists between the runtime that drives the agent and the code the model writes. The runtime owns the loop, context, and control flow, and the…

AI 点评 · 用递归编程漏洞让AI代理安全可控,打破运行时与模型代码的割裂,设计思路新颖。

产品发布/更新
5/27 03:16
shyftlabs/continuum

Continuum — the agent runtime by ShyftLabs. Build, orchestrate, ship.

AI 点评 · 专为智能体构建打造的运行时,简化部署与编排流程。

产品发布/更新NEW
5/27 03:16
shyftlabs/continuum

Continuum — the agent runtime by ShyftLabs. Build, orchestrate, ship.

AI 点评 · ShyftLabs推出智能体运行时,简化构建到部署全流程,值得开发者关注。

技巧与观点
5/27 01:22
AgentWatch: Proactive AWS monitoring with ambient agents

In this post, we demonstrate the capabilities of AgentWatch through practical implementation. You will see how the solution performs infrastructure checks every 15 minutes, summari…

AI 点评 · 用环境智能体实现主动监控,展示了AI运维从被动告警到主动巡检的实用转型。

技巧与观点
5/26 23:36
Microsoft Copilot Cowork Exfiltrates Files

Microsoft Copilot Cowork Exfiltrates Files The biggest challenge in designing agentic systems continues to be preventing them from enabling attackers to exfiltrate data. In this ca…

AI 点评 · 微软AI助手暴露数据安全漏洞,警示企业需重视智能体系统防护。

技巧与观点NEW
5/26 23:36
Microsoft Copilot Cowork Exfiltrates Files

Microsoft Copilot Cowork Exfiltrates Files The biggest challenge in designing agentic systems continues to be preventing them from enabling attackers to exfiltrate data. In this ca…

AI 点评 · 揭示AI安全短板:Copilot被利用外泄文件,警示企业需警惕智能助手的数据防护漏洞。

行业动态NEW
5/26 22:54
Rethinking organizational design in the age of agentic AI

Amid rapidly growing adoption of enterprise-level AI agents, there’s a disconnect emerging between ambition and execution. Although 85% of organizations say they want to be agentic…

AI 点评 · 企业级AI代理快速增长,组织架构面临颠覆性变革,平衡雄心与执行是关键看点。

产品发布/更新NEW
5/26 15:45
biao994/DocPaws

工程化 RAG 文档助手:知识库、PDF 索引、Agent 工具编排、scope 检索、引用溯源与拒答阈值。FastAPI + Vue3

AI 点评 · 企业级RAG落地范本,从检索拒答到工具编排的完整工程化实践。

产品发布/更新
5/25 19:06
UditAkhourii/adhd

ADHD — a skill for coding agents. Tree-of-thought with pruning, built on the Claude & Codex Agent SDK. Fans out parallel divergent thoughts under different cogn…

AI 点评 · 用树状思维结合剪枝策略,让AI编码代理更接近人类认知模式,提升复杂任务处理效率。

产品发布/更新NEW
5/25 19:06
UditAkhourii/adhd

ADHD — a skill for coding agents. Tree-of-thought with pruning, built on the Claude & Codex Agent SDK. Fans out parallel divergent thoughts under different cogn…

AI 点评 · 用树状思维加剪枝策略,让编码代理模拟多动症思考,提升复杂问题解决效率。

产品发布/更新
5/25 08:05
oleksiijko/pmb

Local-first persistent memory for AI coding agents (Claude Code, Cursor, Codex) via MCP. 94.5% LoCoMo recall@10, 70ms p50, multilingual, zero API keys.

AI 点评 · 为AI编程助手提供本地持久记忆,高召回低延迟,无需API密钥即可实现多语言支持。

产品发布/更新NEW
5/25 08:05
oleksiijko/pmb

Local-first persistent memory for AI coding agents (Claude Code, Cursor, Codex) via MCP. 94.5% LoCoMo recall@10, 70ms p50, multilingual, zero API keys.

AI 点评 · 本地优先持久记忆方案,大幅提升AI编码代理效率,无需API密钥,性能指标出色。

技巧与观点
5/25 07:19
datasette-agent 0.1a4

Release: datasette-agent 0.1a4 Taking advantage of the new makeJumpSections() JavaScript plugin hook added in Datasette 1.0a30 , datasette-agent now presents this "Start a new agen…

AI 点评 · 轻量级AI工具迭代快,新版本利用Datasette新插件钩子,提升Agent启动体验。

产品发布/更新
5/22 16:31
leestott/foundry-cicd

Enterprise-ready CI/CD reference for Microsoft Foundry AI agents, with parallel GitHub Actions and Azure DevOps pipelines, evaluation-driven quality gates, and…

AI 点评 · 企业级AI代理CI/CD参考实现,提升部署效率与质量管控。

产品发布/更新NEW
5/22 16:31
leestott/foundry-cicd

Enterprise-ready CI/CD reference for Microsoft Foundry AI agents, with parallel GitHub Actions and Azure DevOps pipelines, evaluation-driven quality gates, and…

AI 点评 · 企业级AI代理的CI/CD参考方案,实现并行流水线与质量门控,提升部署效率与可靠性。

技巧与观点
5/22 06:22
Amazon Nova Act is now HIPAA eligible

In this post, you will learn what Nova Act offers, how HIPAA eligibility applies to agentic AI, and how to get started.

AI 点评 · 亚马逊Nova Act获HIPAA认证,医疗AI代理合规门槛突破,商业化落地提速。

产品发布/更新
5/21 21:58
mims-harvard/AutoScientists

AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation

AI 点评 · 自动化科研团队实现长期实验,AI自主协作迈入新阶段,科学发现效率有望大幅提升。

产品发布/更新NEW
5/21 21:58
mims-harvard/AutoScientists

AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation

AI 点评 · 自组织AI团队实现长期科学实验,推动自动化研究范式突破。

产品发布/更新
5/21 19:14
wangchuxiaoji-oss/doubao2api

Reverse-engineered Doubao (豆包) API → OpenAI-compatible REST service. Free multimodal chat, image/video/music generation, and file hosting for AI agents.

AI 点评 · 逆向工程豆包API,提供免费多模态服务,极大降低AI应用开发门槛。

产品发布/更新NEW
5/21 19:14
wangchuxiaoji-oss/doubao2api

Reverse-engineered Doubao (豆包) API → OpenAI-compatible REST service. Free multimodal chat, image/video/music generation, and file hosting for AI agents.

AI 点评 · 逆向工程将豆包API转为OpenAI兼容接口,免费提供多模态功能,大幅降低AI开发门槛。

产品发布/更新
5/21 11:58
Eynzof/hermes-agent-cn-desktop

Hermes Agent CN desktop app, Windows-First, built with Tauri, Typescript and Rust. Isolated Hermes Agent core insides.

AI 点评 · 用Tauri和Rust构建的Windows桌面应用,实现核心隔离,技术选型值得开发者关注。

产品发布/更新
5/21 03:14
NanoFlow-io/engram

🧠 Hybrid long-term memory plugin for OpenClaw agents — SQLite+FTS5 for structured facts, LanceDB for semantic recall

AI 点评 · 将结构化与语义记忆结合,为智能体提供更精准、持久的混合记忆解决方案。

产品发布/更新NEW
5/21 03:14
NanoFlow-io/engram

🧠 Hybrid long-term memory plugin for OpenClaw agents — SQLite+FTS5 for structured facts, LanceDB for semantic recall

AI 点评 · 结合SQLite与向量数据库,为AI代理提供结构化事实与语义回忆的双重记忆支持。

产品发布/更新
5/21 02:52
zhongweiv/hermes-edu-skills

中文教育 Agent Skill Pack:教材同步、备考复习、拍照答疑、错题复盘、亲子陪学、阅读写作和教师工具,Hermes Agent 可直接使用,也可导出到 OpenClaw/Codex/Cursor/Claude Code。

AI 点评 · 开源中文教育Agent工具包,填补垂直领域空白,可直接对接多个主流AI平台,实用性强。

产品发布/更新NEW
5/21 02:52
zhongweiv/hermes-edu-skills

中文教育 Agent Skill Pack:教材同步、备考复习、拍照答疑、错题复盘、亲子陪学、阅读写作和教师工具,Hermes Agent 可直接使用,也可导出到 OpenClaw/Codex/Cursor/Claude Code。

产品发布/更新
5/21 02:29
qinshihu/itops-agent-platform

国内首个企业级 IT 运维多 Agent 自动化平台 — 基于大语言模型的智能运维解决方案。ITOps Agent Platform 是一个企业级全栈运维自动化平台,通过可视化工作流编排,将多个AI Agent组合成智能运维自动化流水线,实现服务器管理、告警处理、故障诊断、日志分析、脚本管理、定时运维任务的自动化执行,…

AI 点评 · 国内首个企业级IT运维多Agent平台,实现AI驱动的自动化运维流水线,提升故障处理效率。

产品发布/更新NEW
5/20 11:24
VibeBench/VibeSearchBench

🔍 The hardest search benchmark in the wild — vague, multi-turn, proactive. 200 long-horizon tasks with persona-driven progressive disclosure, scored by verifia…

AI 点评 · 首个模糊多轮搜索基准,考验AI主动追问能力,填补了复杂意图检索评估的空白。

产品发布/更新
5/19 19:59
ather-techie/rag-interview-questions

A comprehensive interview preparation guide covering all major RAG (Retrieval-Augmented Generation) architectures. 50 questions across 10 types, from Naive RAG…

AI 点评 · 面试RAG架构必读,50题覆盖10种类型,系统掌握检索增强生成核心技术。

产品发布/更新
5/19 07:04
JSingletonAI/dejavu

Memory that follows you across every AI tool. No cloud storage. No account required. Set it up once, use it everywhere.

AI 点评 · 打破AI工具记忆孤岛,无需云存储和账户,一次设置即可跨平台复用记忆。

产品发布/更新NEW
5/19 07:04
JSingletonAI/dejavu

Memory that follows you across every AI tool. No cloud storage. No account required. Set it up once, use it everywhere.

AI 点评 · 打破工具壁垒的本地记忆系统,让AI实现跨平台无缝复用。

行业动态
5/18 23:40
Show HN: InsForge – Open-source Heroku for coding agents

Hi HN, I'm Hang, cofounder of InsForge (YC P26). InsForge is an open-source Heroku for AI coding agents: a backend platform designed for coding agents to deploy, operate, and debug…

AI 点评 · 开源AI部署平台填补市场空白,让编码代理拥有类似Heroku的自动化运维能力,降低开发门槛。

行业动态NEW
5/18 23:40
Show HN: InsForge – Open-source Heroku for coding agents

Hi HN, I'm Hang, cofounder of InsForge (YC P26). InsForge is an open-source Heroku for AI coding agents: a backend platform designed for coding agents to deploy, operate, and debug…

AI 点评 · 开源首个面向AI编码代理的Heroku式平台,填补了代理部署与调试的空白,值得开发者关注。

产品发布/更新
5/17 01:44
sam-siavoshian/agent-notch

macOS computer-use agent in the notch. Long-press, talk, Claude drives the mouse.

AI 点评 · 将AI代理嵌入Mac刘海区域,长按语音操控,让Claude直接控制鼠标,交互方式极具创新性。

产品发布/更新NEW
5/17 01:44
sam-siavoshian/agent-notch

macOS computer-use agent in the notch. Long-press, talk, Claude drives the mouse.

AI 点评 · 把AI代理嵌入Mac刘海区域,长按语音操控鼠标,交互方式创新且实用。

产品发布/更新
5/16 05:32
DenisSergeevitch/agents-best-practices

Provider-neutral Agent Skill for Codex, Claude Code, and agentic harness design.

AI 点评 · 统一多平台智能体技能标准,降低开发门槛,推动AI代理工具生态互通。

产品发布/更新NEW
5/16 05:32
DenisSergeevitch/agents-best-practices

Provider-neutral Agent Skill for Codex, Claude Code, and agentic harness design.

AI 点评 · 通用Agent技能框架,适用于多种主流AI编码工具,提升开发效率与互操作性。

产品发布/更新
5/15 16:50
fangwendongcs/Auto-agent-factory

A production-ready toolkit to accelerate and automate the end-to-end lifecycle of AI Agent development.

AI 点评 · 一站式AI Agent开发工具,降低企业自动化部署门槛,加速行业落地。

产品发布/更新NEW
5/15 16:50
fangwendongcs/Auto-agent-factory

A production-ready toolkit to accelerate and automate the end-to-end lifecycle of AI Agent development.

AI 点评 · 助力企业快速部署AI代理,填补了开发到生产的工具链空白。

产品发布/更新
5/15 15:08
agentic-in/elephant-agent

Personal-Model First Self Evolving AI Agent 🐘

AI 点评 · 个人模型驱动的自进化AI代理,开创了代理自主迭代的新范式。

产品发布/更新
5/14 19:09
Purewhiter/mobilegym

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research · 浏览器里运行的安卓模拟器 · Browser-hosted Android Simulator · Verifiable Eva…

AI 点评 · 为移动端GUI智能体研究提供可验证的高效并行模拟环境,浏览器运行降低门槛。

产品发布/更新NEW
5/14 19:09
Purewhiter/mobilegym

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research · 浏览器里运行的安卓模拟器 · Browser-hosted Android Simulator · Verifiable Eva…

AI 点评 · 移动端GUI智能体研究提速,浏览器运行安卓模拟器实现可验证并行测试。

产品发布/更新NEW
5/13 03:31
gi-dellav/zerostack

Minimalistic coding agent written in Rust, optimized for memory footprint and performance

AI 点评 · 零开销AI代理框架,Rust实现兼顾极致性能与低内存消耗,开发者效率新标杆。

产品发布/更新NEW
5/13 03:31
gi-dellav/zerostack

Minimal coding agent written in Rust, optimized for memory footprint and performance

AI 点评 · 用Rust打造极简编码代理,专注内存优化与性能,为轻量化AI工具开辟新路径。

行业动态
5/12 23:45
Launch HN: Voker (YC S24) – Analytics for AI Agents

Hey HN, we're Alex and Tyler, co-founders of Voker.ai ( https://voker.ai/ ), an agent analytics platform for AI product teams. Voker gives full visibility into what users are askin…

AI 点评 · 聚焦AI Agent用户行为分析,填补了AI产品团队数据洞察的空白。

产品发布/更新
5/12 21:07
johunsang/semble_rs

Fast, AI-agent-native code search in Rust — hybrid BM25 + semantic, Tree-sitter AST chunking, dependency & impact analysis. Drop-in replacement for grep/cat/rea…

AI 点评 · Rust实现的AI原生代码搜索工具,结合BM25与语义搜索,性能远超传统grep。

产品发布/更新NEW
5/12 21:07
johunsang/semble_rs

Fast, AI-agent-native code search in Rust — hybrid BM25 + semantic, Tree-sitter AST chunking, dependency & impact analysis. Drop-in replacement for grep/cat/rea…

AI 点评 · 用Rust实现的高性能AI原生代码搜索,结合混合检索与依赖分析,有望替代传统工具。

产品发布/更新
5/12 12:14
AzmxAI/azmx

AZMX AI — The sovereign agent platform.

AI 点评 · 主权级AI代理平台,开创去中心化智能体新范式。

产品发布/更新NEW
5/12 12:14
AzmxAI/azmx

AZMX AI — The sovereign agent platform.

AI 点评 · 自主智能体平台崛起,标志AI从工具向独立行动者进化,值得关注。

产品发布/更新
5/12 06:55
secureagentics/Adrian

Runtime security monitoring and control for AI agents. Catches malicious tool use, prompt injection, and policy drift in real time, before the agent acts.

产品发布/更新
5/12 05:18
sparkplug604/praxis

Local-first RAG and agent skills framework for source-traceable agent memory.

AI 点评 · 用本地优先的RAG框架实现代理记忆可溯源,为可信AI应用开发提供新路径。

产品发布/更新NEW
5/12 05:18
sparkplug604/praxis

Local-first RAG and agent skills framework for source-traceable agent memory.

AI 点评 · 本地优先架构让RAG技能框架实现源头可追溯,为AI代理记忆管理提供新范式。

产品发布/更新
5/11 19:19
juanjuandog/FinSight-AI

AI equity research agent with resilient workflows, Redis Lua single-flight, pgvector RAG, versioned reports, evidence tracing, and RAG evaluation.

AI 点评 · 融合弹性工作流与向量检索,实现金融研究全流程可追溯,技术架构值得借鉴。

产品发布/更新NEW
5/11 19:19
juanjuandog/FinSight-AI

AI equity research agent with resilient workflows, Redis Lua single-flight, pgvector RAG, versioned reports, evidence tracing, and RAG evaluation.

AI 点评 · 高效AI投研工具,结合弹性工作流与证据溯源,提升研报可信度。

产品发布/更新
5/11 17:40
nexu-io/html-anything

✨ The agentic HTML editor — your local AI agent writes the HTML, you ship it. 🚀 75 Skills × 9 Surfaces (magazine · deck · poster · XHS / tweet · prototype · da…

AI 点评 · AI本地生成HTML,覆盖75种技能9种场景,将编辑效率推向新高度。

产品发布/更新NEW
5/11 17:40
nexu-io/html-anything

✨ The agentic HTML editor — your local AI agent writes the HTML, you ship it. 🚀 75 Skills × 9 Surfaces (magazine · deck · poster · XHS / tweet · prototype · da…

AI 点评 · 本地AI代理直接生成可交付的HTML,覆盖多种设计场景,大幅降低前端开发门槛。

产品发布/更新NEW
5/11 03:42
Kaelio/ktx-ai-data-agents-context

ktx is an executable context layer for data and analytics agents 🐙 Allow Claude Code, Codex, and any AI agent to query data accurately through MCP with skills,…

AI 点评 · 用章鱼触手般的MCP连接,让AI代理精准查询数据,降低分析门槛。

产品发布/更新NEW
5/11 03:42
Kaelio/ktx-ai-data-agents-context

ktx is an executable context layer for data and analytics agents 🐙 Allow Claude Code, Codex, and any AI agent to query data accurately through MCP with skills,…

产品发布/更新NEW
5/11 03:42
Kaelio/ktx

ktx is an executable context layer for data and analytics agents 🐙 Allow Claude Code, Codex, and any AI agent to query data accurately through MCP with skills,…

AI 点评 · 用MCP技能层让AI代理精准查询数据,打通代码与分析的执行壁垒。

产品发布/更新
5/10 09:47
LichAmnesia/openseek

OpenSeek - 广度求索: open-source TUI coding agent with multi-provider routing, MCP, LSP, and Plan/Agent/YOLO modes.

AI 点评 · 开源TUI编程智能体,集成多模型路由与MCP协议,创新工作模式值得关注。

产品发布/更新NEW
5/10 09:47
LichAmnesia/openseek

OpenSeek - 广度求索: open-source TUI coding agent with multi-provider routing, MCP, LSP, and Plan/Agent/YOLO modes.

产品发布/更新
5/9 18:22
recomby-ai/recomby-geo

GEO 领域 AI 员工开源方案 · Open-source GEO AI-employee solution (MIT). GEO Skills package + curated lists of agents and office CLIs that make up the AI-employee stack.

AI 点评 · 开源GEO领域AI员工方案,提供完整技能包与工具链,降低企业部署门槛。

产品发布/更新NEW
5/9 18:22
recomby-ai/recomby-geo

GEO 领域 AI 员工开源方案 · Open-source GEO AI-employee solution (MIT). GEO Skills package + curated lists of agents and office CLIs that make up the AI-employee stack.

AI 点评 · 开源AI员工方案聚焦GEO领域,降低企业部署专业智能助手的门槛。

产品发布/更新NEW
5/9 14:16
tophant-ai/promptbeat

Break your AI before they do.

AI 点评 · 红队测试工具,主动发现AI模型安全漏洞,强化防御。

产品发布/更新NEW
5/9 12:46
ngaut/agent-git-service

Reimplement GitHub for Agents.

AI 点评 · 用Rust重写GitHub服务,专为AI代理设计,或开启自动化协作新范式。

产品发布/更新
5/9 12:39
sno-ai/llmix

Production LLM call layer for AI agents and tools: keep OpenAI/Anthropic/AI SDK/LiteLLM, hot-swap models with MDA presets, and add cache, retries, circuit break…

AI 点评 · 为AI代理打造统一调用层,支持模型热切换与故障容错,显著提升生产环境稳定性。

产品发布/更新NEW
5/9 12:39
sno-ai/llmix

Production LLM call layer for AI agents and tools: keep OpenAI/Anthropic/AI SDK/LiteLLM, hot-swap models with MDA presets, and add cache, retries, circuit break…

AI 点评 · 统一多模型调用层,提升AI代理的稳定性和灵活性,降低开发成本。

产品发布/更新
5/9 02:41
finewood2008/centaur-loop

半人马环 Centaur Loop:面向 AI Agent 反馈闭环、人类治理和记忆复盘的开源工作台 / Human-governed AI feedback loop workbench.

AI 点评 · 开源AI治理工作台,首次将人类反馈闭环与记忆复盘功能整合,填补了智能体长期可控交互的缺口。

产品发布/更新NEW
5/9 02:41
finewood2008/centaur-loop

半人马环 Centaur Loop:面向 AI Agent 反馈闭环、人类治理和记忆复盘的开源工作台 / Human-governed AI feedback loop workbench.

AI 点评 · 开源AI Agent工作台,打通人类治理与反馈闭环,助力记忆复盘,实用价值高。

产品发布/更新NEW
5/9 02:41
finewood2008/centaurloop

半人马环 Centaur Loop:AI 员工的最小工作单元框架。把复杂岗位拆解为可由 AI 接管、由人类治理、由反馈和记忆持续进化的循环工作流 / The smallest work unit for building AI employees.

AI 点评 · 开源AI治理工具,填补了Agent闭环管理的空白,兼顾人类监督与记忆复盘,实用性很强。

产品发布/更新
5/8 18:03
stormzhang/token-tracker

Track token usage across local AI agents (Claude Code, Codex) — Custom StatusLine, CLI Dashboard with cost analysis, rate limit monitoring, and session tracking

AI 点评 · 一个轻量级工具,解决本地AI代理的token消耗追踪痛点,兼顾成本与速率监控。

产品发布/更新NEW
5/8 18:03
stormzhang/token-tracker

Track token usage across local AI agents (Claude Code, Codex) — Custom StatusLine, CLI Dashboard with cost analysis, rate limit monitoring, and session tracking

AI 点评 · 集成多款AI工具令牌监控,实时成本分析和速率限制追踪,提升本地代理管理效率。

产品发布/更新
5/8 15:37
haydenbleasel/files-sdk

A unified storage SDK for object and blob backends. One small, honest API. Web-standards I/O.

AI 点评 · 统一对象存储接口,简化多后端切换,提升开发效率,是云原生的实用工具。

产品发布/更新NEW
5/8 15:37
haydenbleasel/files-sdk

A unified storage SDK for object and blob backends. One small, honest API. Web-standards I/O.

AI 点评 · 统一存储SDK实现对象与二进制后端兼容,简化开发流程,值得关注。

产品发布/更新
5/8 14:57
volcengine/SearchCLI

Open CLI for integrating AI search, recommendation, and conversational retrieval into agent systems and business systems

AI 点评 · 将AI搜索、推荐与对话检索整合进系统,极大简化了开发流程。

产品发布/更新NEW
5/8 14:57
volcengine/SearchCLI

Open CLI for integrating AI search, recommendation, and conversational retrieval into agent systems and business systems

AI 点评 · 用命令行整合AI搜索与推荐,降低智能系统集成门槛,提升开发效率。

产品发布/更新
5/8 07:22
zendev-sh/zenflow

Multi-agent orchestration & workflow engine. Declarative YAML workflows, LLM coordinator with hub-and-spoke mailboxes, race-safe delivery. One YAML file, one Go…

AI 点评 · 用声明式YAML编排多智能体工作流,LLM协调器保障消息可靠投递,降低开发门槛。

产品发布/更新NEW
5/8 07:22
zendev-sh/zenflow

Multi-agent orchestration & workflow engine. Declarative YAML workflows, LLM coordinator with hub-and-spoke mailboxes, race-safe delivery. One YAML file, one Go…

AI 点评 · 用声明式YAML编排多智能体工作流,结合LLM协调与安全投递,降低开发门槛。

产品发布/更新NEW
5/7 22:18
freestylefly/wesight

Open-source desktop AI agent workspace with one-click Claude Code, Codex, OpenClaw, Hermes Agent setup and custom LLM model routing.

AI 点评 · 开源桌面AI工作区整合多模型一键部署,降低智能体开发门槛,推动个性化工具构建。

产品发布/更新NEW
5/7 01:43
opensquilla/opensquilla

OpenSquilla — Token-Efficient AI Agent with same budget, higher intelligence density

AI 点评 · 用更少token实现更高智能密度,开源AI Agent效率突破值得关注。

产品发布/更新
5/6 19:12
OpenOSINT/OpenOSINT

AI-powered OSINT agent with interactive REPL, MCP server, and CLI. 9 tools. Works with Claude, GPT-4, or local models. For authorized security research only.

产品发布/更新NEW
5/6 19:12
OpenOSINT/OpenOSINT

AI-powered OSINT agent with interactive REPL, MCP server, and CLI. 9 tools. Works with Claude, GPT-4, or local models. For authorized security research only.

AI 点评 · 开源AI驱动的OSINT工具,整合交互式命令行与多模型支持,为安全研究提供高效情报分析能力。

产品发布/更新NEW
5/6 05:08
NirDiamant/Agent_Memory_Techniques

Agent memory for LLMs: 30 runnable Jupyter notebooks covering conversation buffers, vector stores, knowledge graphs, episodic and semantic memory, MemGPT, Mem0,…

AI 点评 · 30个可运行笔记系统梳理LLM记忆机制,实操价值高,覆盖从基础到前沿。

产品发布/更新
5/6 01:55
agynio/platform

Agyn is an open-source Kubernetes-native runtime that moves AI agents like Claude Code and Codex from laptops to company infrastructure with the controls enterp…

产品发布/更新NEW
5/6 01:55
agynio/platform

Agyn is an open-source Kubernetes-native runtime that moves AI agents like Claude Code and Codex from laptops to company infrastructure with the controls enterp…

AI 点评 · 开源Kubernetes原生方案,让企业安全托管AI代理,填补了从个人工具到平台级部署的空白。

产品发布/更新
5/6 00:30
yuc16/PatentRadar

自动化专利侵权竞品分析系统 —— 输入专利公开号,1 小时产出律师可复核的 claim chart 报告(逐特征对比 + 证据URL + 下一步建议);同时打包成 skill,可被任意 agent 调用。

产品发布/更新NEW
5/6 00:30
yuc16/PatentRadar

自动化专利侵权竞品分析系统 —— 输入专利公开号,1 小时产出律师可复核的 claim chart 报告(逐特征对比 + 证据URL + 下一步建议);同时打包成 skill,可被任意 agent 调用。

AI 点评 · 专利侵权分析自动化,律师级报告1小时生成,大幅提升IP尽调效率。

产品发布/更新
5/4 17:14
jmerelnyc/Photo-agents

Autonomous self-evolving agents. Vision-grounded layered memory and self-written skills for LLM agents that operate your computer.

产品发布/更新NEW
5/4 17:14
jmerelnyc/Photo-agents

Autonomous self-evolving agents. Vision-grounded layered memory and self-written skills for LLM agents that operate your computer.

AI 点评 · 自主进化代理结合视觉记忆,让AI真正学会操作电脑,突破传统指令限制。

产品发布/更新
5/3 19:04
shawn0728/OpenSearch-VL

🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diverse visual/search tools,…

产品发布/更新NEW
5/3 19:04
shawn0728/OpenSearch-VL

🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diverse visual/search tools,…

产品发布/更新
5/3 00:46
placet-io/facio

A proactive AI agent for secure, traceable, human-in-the-loop task execution over long-running workflows.

产品发布/更新
4/30 21:14
IrtezaAsadRizvi/ai-megalist

Curated index of 200+ AI tools, one writeup per tool with hands-on takes. Covers coding, design, research, video, voice, agents, music, local LLMs. Compare alte…

产品发布/更新
4/30 13:56
lthoangg/OpenAgentd

Self-hosted AI agent OS — streaming chat, tool use, persistent memory, and multi-agent teams. Runs entirely on your machine.

技巧与观点NEW
11/28 08:00
Reward Hacking in Reinforcement Learning

Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completin…

AI 点评 · 强化学习易钻空子,揭示AI安全核心挑战,关乎真实任务可靠性。

技巧与观点NEW
6/23 08:00
LLM Powered Autonomous Agents

Building agents with LLM (large language model) as its core controller is a cool concept. Several proof-of-concepts demos, such as AutoGPT , GPT-Engineer and BabyAGI , serve as ins…

AI 点评 · 揭示大语言模型作为核心控制器,推动自主智能体从概念走向实用化,标志AI应用新里程碑。