archived-MoviePilot

mirror of https://github.com/jxxghp/MoviePilot.git synced 2026-06-02 23:16:45 +00:00

Author	SHA1	Message	Date
jxxghp	c89df496a5	feat(agent): add ToolTag-based tags to all agent tools; implement tags.py for unified tool capability tagging	2026-05-31 18:30:39 +08:00
jxxghp	855681ff35	feat(agent): mark and propagate voice input metadata in agent messages; clarify terminal tool usage in prompts - Add `has_audio_input` flag to agent message handling and propagate through processing pipeline - Structure agent input payloads to include `input.mode` and `input.transcribed` for voice messages - Update prompts and tool descriptions to clarify that `send_voice_message` and `ask_user_choice` are terminal tools and should not be followed by redundant text replies - Enhance tests to cover voice input metadata propagation and prompt updates	2026-05-31 18:04:02 +08:00
jxxghp	5d3c262e60	feat(agent): set return_direct for SendVoiceMessageTool to prevent streaming tool messages	2026-05-31 17:35:10 +08:00
jxxghp	896631d63e	fix(agent): enable voice replies for supported channels	2026-05-26 20:14:56 +08:00
jxxghp	d940373f6b	将所有agent工具的explanation字段改为可选修复Pydantic验证错误：QueryTransferHistoryInput的explanation字段为必需，但用户未提供。修改了74个工具文件，将explanation字段从必需改为可选，默认值为None。	2026-05-25 16:40:43 +08:00
jxxghp	7b27b7fd16	feat: add extensible agent audio capabilities	2026-05-14 19:37:13 +08:00
jxxghp	7586a2cd42	disable agent message tools for ui background tasks	2026-04-29 23:30:59 +08:00
jxxghp	af35101774	fix: default to text replies for voice input	2026-04-29 18:54:58 +08:00
jxxghp	b7749c44fd	重构语音能力配置与逻辑，统一音频输入输出开关并优化语音回复判断	2026-04-29 18:15:34 +08:00
jxxghp	4208c79d72	refine tool提示语为更简洁风格，补充last_buffer_char属性及非VERBOSE模式流式输出换行逻辑，新增工具流式分隔符单元测试	2026-04-26 11:15:11 +08:00
jxxghp	e5f97cd299	feat(agent): add voice message support with TTS/STT for Telegram and WeChat - Integrate voice message handling: detect and extract audio references from Telegram and WeChat messages, route to agent with voice reply preference. - Add voice provider abstraction and OpenAI-based TTS/STT implementation. - Implement agent tool `send_voice_message` for generating and sending voice replies, with fallback to text if voice is unavailable. - Extend agent prompt and context to support voice reply instructions. - Update notification and message schemas to support audio fields. - Add Telegram and WeChat voice sending logic, including audio file conversion and temporary media upload for WeChat. - Add tests for voice helper and agent voice routing.	2026-04-12 12:30:02 +08:00

11 Commits