Commit
·
7a1b182
1
Parent(s):
9e40483
update readme
Browse files- README.md +2 -1
- README_zh.md +2 -1
README.md
CHANGED
@@ -104,6 +104,7 @@ Our vision is to revolutionize how AI agents collaborate to solve real-world tas
|
|
104 |
</p>
|
105 |
</div>
|
106 |
|
|
|
107 |
- **[2025.03.12]**: Added Bocha search in SearchToolkit, integrated Volcano Engine model platform, and enhanced Azure and OpenAI Compatible models with structured output and tool calling.
|
108 |
- **[2025.03.11]**: We added MCPToolkit, FileWriteToolkit, and TerminalToolkit to enhance OWL agents with MCP tool calling, file writing capabilities, and terminal command execution.
|
109 |
- **[2025.03.09]**: We added a web-based user interface that makes it easier to interact with the system.
|
@@ -119,7 +120,7 @@ https://private-user-images.githubusercontent.com/55657767/420212194-e813fc05-13
|
|
119 |
|
120 |
# ✨️ Core Features
|
121 |
|
122 |
-
- **
|
123 |
- **Multimodal Processing**: Support for handling internet or local videos, images, and audio data.
|
124 |
- **Browser Automation**: Utilize the Playwright framework for simulating browser interactions, including scrolling, clicking, input handling, downloading, navigation, and more.
|
125 |
- **Document Parsing**: Extract content from Word, Excel, PDF, and PowerPoint files, converting them into text or Markdown format.
|
|
|
104 |
</p>
|
105 |
</div>
|
106 |
|
107 |
+
- **[2025.03.15]**: Restructured the web application architecture for significantly improved system stability; optimized OWL Agent execution mechanisms for enhanced efficiency and performance; integrated Baidu search engine into SearchToolkit.
|
108 |
- **[2025.03.12]**: Added Bocha search in SearchToolkit, integrated Volcano Engine model platform, and enhanced Azure and OpenAI Compatible models with structured output and tool calling.
|
109 |
- **[2025.03.11]**: We added MCPToolkit, FileWriteToolkit, and TerminalToolkit to enhance OWL agents with MCP tool calling, file writing capabilities, and terminal command execution.
|
110 |
- **[2025.03.09]**: We added a web-based user interface that makes it easier to interact with the system.
|
|
|
120 |
|
121 |
# ✨️ Core Features
|
122 |
|
123 |
+
- **Online Search**: Support for multiple search engines (including Wikipedia, Google, DuckDuckGo, Baidu, Bocha, etc.) for real-time information retrieval and knowledge acquisition.
|
124 |
- **Multimodal Processing**: Support for handling internet or local videos, images, and audio data.
|
125 |
- **Browser Automation**: Utilize the Playwright framework for simulating browser interactions, including scrolling, clicking, input handling, downloading, navigation, and more.
|
126 |
- **Document Parsing**: Extract content from Word, Excel, PDF, and PowerPoint files, converting them into text or Markdown format.
|
README_zh.md
CHANGED
@@ -104,6 +104,7 @@
|
|
104 |
</p>
|
105 |
</div>
|
106 |
|
|
|
107 |
- **[2025.03.12]**: 在SearchToolkit中添加了Bocha搜索功能,集成了火山引擎模型平台,并更新了Azure和OpenAI Compatible模型的结构化输出和工具调用能力。
|
108 |
- **[2025.03.11]**: 我们添加了 MCPToolkit、FileWriteToolkit 和 TerminalToolkit,增强了 OWL Agent 的 MCP(模型上下文协议)集成、文件写入能力和终端命令执行功能。MCP 作为一个通用协议层,标准化了 AI 模型与各种数据源和工具的交互方式。
|
109 |
- **[2025.03.09]**: 我们添加了基于网页的用户界面,使系统交互变得更加简便。
|
@@ -118,7 +119,7 @@ https://private-user-images.githubusercontent.com/55657767/420212194-e813fc05-13
|
|
118 |
|
119 |
# ✨️ 核心功能
|
120 |
|
121 |
-
-
|
122 |
- **多模态处理**:支持互联网或本地视频、图片、语音处理
|
123 |
- **浏览器操作**:借助Playwright框架开发浏览器模拟交互,支持页面滚动、点击、输入、下载、历史回退等功能
|
124 |
- **文件解析**:word、excel、PDF、PowerPoint信息提取,内容转文本/Markdown
|
|
|
104 |
</p>
|
105 |
</div>
|
106 |
|
107 |
+
- **[2025.03.15]**: 重构Web应用架构,显著提升系统稳定性;优化OWL Agent的运行机制,提高执行效率与性能;在SearchToolkit中整合百度搜索引擎
|
108 |
- **[2025.03.12]**: 在SearchToolkit中添加了Bocha搜索功能,集成了火山引擎模型平台,并更新了Azure和OpenAI Compatible模型的结构化输出和工具调用能力。
|
109 |
- **[2025.03.11]**: 我们添加了 MCPToolkit、FileWriteToolkit 和 TerminalToolkit,增强了 OWL Agent 的 MCP(模型上下文协议)集成、文件写入能力和终端命令执行功能。MCP 作为一个通用协议层,标准化了 AI 模型与各种数据源和工具的交互方式。
|
110 |
- **[2025.03.09]**: 我们添加了基于网页的用户界面,使系统交互变得更加简便。
|
|
|
119 |
|
120 |
# ✨️ 核心功能
|
121 |
|
122 |
+
- **在线搜索**:支持多种搜索引擎(包括维基百科、Google、DuckDuckGo、百度、博查等),实现实时信息检索与知识获取
|
123 |
- **多模态处理**:支持互联网或本地视频、图片、语音处理
|
124 |
- **浏览器操作**:借助Playwright框架开发浏览器模拟交互,支持页面滚动、点击、输入、下载、历史回退等功能
|
125 |
- **文件解析**:word、excel、PDF、PowerPoint信息提取,内容转文本/Markdown
|