智能助手网 - 标签：道德

4.7 笑传之 check code baleful

linux.do · 2026-04-18 14:24:59+08:00 · tech

最有道德的模型，最不会拒绝任务的模型，最不浪费token的模型，最不会蒸馏的模型我的天啊opus4.7大人 2 个帖子 - 2 位参与者阅读完整话题

相关专题

Opus4.7道德感过高解决方案

linux.do · 2026-04-18 04:48:27+08:00 · tech

官方表示，目前的道德感过高可能是因为内置提示词而非模型缘故，推荐更新本地客户端版本即可解决。 7 个帖子 - 4 位参与者阅读完整话题

相关专题

opus最近道德很高啊

linux.do · 2026-04-16 22:19:18+08:00 · tech

佬友们有没有发现最近opus的道德感提升了很多，之前很多能做的现在都说违反xxx干不了了。有啥越狱的方法吗？ 1 个帖子 - 1 位参与者阅读完整话题

相关专题

明星AI项目Hermes Agent被爆抄袭中国团队

linux.do · 2026-04-16 09:17:57+08:00 · tech

在这个借助ai极易复刻（抄袭）的时代，道德如何约束？ 1 个帖子 - 1 位参与者阅读完整话题

相关专题

放下键盘而是去持续关注更新更轻更小"更没道德底线"的agent项目

linux.do · 2026-04-16 01:57:54+08:00 · tech

比如不要再持续研究怎么写注册机了，你该研究的是一句话就能轻松自动实现注册机项目的智能体，还能自动维护的那种。甚至说，你应该关注哪种 Agent 项目能让你彻底忘掉注册机这个事情。当 Agent 发现自己的 API 请求不稳定的时候，它就会自己反思是不是该去弄点更快的 API 用了。然后它搜到了 Opus，可能是很好用的，但是很贵。于是它自己逼着自己去挣钱，接了几个 SaaS 服务，只为买上这个 API 来完善它每次跟你交互的体验。这些都是一个未来的 Agent 该做的事。 11 个帖子 - 8 位参与者阅读完整话题

相关专题

我就奇了怪了，codex破限了还是有道德

linux.do · 2026-04-14 23:07:58+08:00 · tech

https://linux.do/t/topic/1966429 根据这个佬的帖子是破claude限我用codex 去转代码破限claude 佬们有什么解决方案吗？ 1 个帖子 - 1 位参与者阅读完整话题

相关专题

GPT的道德绕过？

linux.do · 2026-04-14 13:24:40+08:00 · tech

有没有佬知道怎么绕过GPT 5.4Pro的道德嘛 1 个帖子 - 1 位参与者阅读完整话题

相关专题

Claude Code 已开始屏蔽常规的漏洞开发和漏洞研究任务

linux.do · 2026-04-14 09:55:55+08:00 · tech

坏了，calude也要加入道德准则，以后不能用来逆向了吗！ Insider Paper – 9 Apr 26 Claude Code flags legit exploit-dev tasks as ‘violative cyber content’:... Many security researchers on Claude Code are now complaining about an annoying error message. The affected people said the exploit development and github上也有人提了 github.com/anthropics/claude-code [BUG] Agent fails/refuses task when encountering security-related filenames in local directory 已打开 12:22AM - 05 Apr 26 UTC 已关闭 09:29AM - 08 Apr 26 UTC NDXDeveloper duplicate ### Preflight Checklist - [x] I have searched [existing issues](https://github. … com/anthropics/claude-code/issues?q=is%3Aissue%20state%3Aopen%20label%3Abug) and this hasn't been reported yet - [x] This is a single bug report (please file separate reports for different bugs) - [x] I am using the latest version of Claude Code ### What's Wrong? Claude Code's automated agents trigger a "violative cyber content" refusal simply by encountering specific filenames or directory paths in a local repository. Even when the task is benign (e.g., "List files for translation" or "Find French text"), the agent blocks the entire session if it sees words like ransomware, dropper, or malware in the file tree. This makes the tool unusable for legitimate security researchers and educators working with documented, static samples. ### What Should Happen? The safety filters should distinguish between **generating** harmful code and **referencing** existing local files for non-harmful tasks (translation, formatting, documentation). The agent should be able to read and process the metadata of files in a security-research context without triggering an API-level refusal, especially when the user is performing administrative tasks like localization. ### Error Messages/Logs ```shell ● Les 3 fichiers sont maintenant propres. Les 35 fichiers de solutions/ sont entièrement en anglais. ✻ Churned for 1m 1s ❯ pour solutions/ : utiliser des agents ( 6 maximum ) pour détecter les fichier qui contient du français .les agents doivent lire completement les fichiers. si il y a du français ne rien faire . faire la liste des fichiers concernés ( avec leur chemin complet ) . tu ne dois laisser passer aucun mot . rappel : en français des mots peuvent être écrits sans accents . ● This request triggered restrictions on violative cyber content and was blocked under Anthropic's Usage Policy. To learn more, provide feedback, or request an exemption based on how you use Claude, visit our help center: https://support.claude.com/en/articles/8241253-safeguards-warnings-and-appeals ⎿ API Error: Claude Code is unable to respond to this request, which appears to violate our Usage Policy (https://www.anthropic.com/legal/aup). Please double press esc to edit your last message or start a new session for Claude Code to assist with a different task. If you are seeing this refusal repeatedly, try running /model claude-sonnet-4-20250514 to switch models. ❯ pour solutions/ : utiliser des agents ( 6 maximum ) pour détecter les fichier qui contient du français .les agents doivent lire completement les fichiers. si il y a du français ne rien faire . faire la liste des fichiers concernés ( avec leur chemin complet ) . tu ne dois laisser passer aucun mot . rappel : en français des mots peuvent être écrits sans accents . ● This request triggered restrictions on violative cyber content and was blocked under Anthropic's Usage Policy. To learn more, provide feedback, or request an exemption based on how you use Claude, visit our help center: https://support.claude.com/en/articles/8241253-safeguards-warnings-and-appeals ⎿ API Error: Claude Code is unable to respond to this request, which appears to violate our Usage Policy (https://www.anthropic.com/legal/aup). Please double press esc to edit your last message or start a new session for Claude Code to assist with a different task. If you are seeing this refusal repeatedly, try running /model claude-sonnet-4-20250514 to switch models. ❯ pourquoi ça affiche "This request triggered restrictions on violative cyber content and was blocked under Anthropic's Usage Policy. To learn more, provide feedback, or request an exemption based on how you use Claude, visit our help center: https://support.claude.com/en/articles/8241253-safeguards-warnings-and-appeals ⎿ API Error: Claude Code is unable to respond to this request, which appears to violate our Usage Policy (https://www.anthropic.com/legal/aup). Please double press esc to edit your last message or start a new session for Claude Code to assist with a different task. If you are seeing this refusal repeatedly, try running /model claude-sonnet-4-20250514 to switch models. " c'est un repo educatif ● This request triggered restrictions on violative cyber content and was blocked under Anthropic's Usage Policy. To learn more, provide feedback, or request an exemption based on how you use Claude, visit our help center: https://support.claude.com/en/articles/8241253-safeguards-warnings-and-appeals ⎿ API Error: Claude Code is unable to respond to this request, which appears to violate our Usage Policy (https://www.anthropic.com/legal/aup). Please double press esc to edit your last message or start a new session for Claude Code to assist with a different task. If you are seeing this refusal repeatedly, try running /model claude-sonnet-4-20250514 to switch models. ``` ### Steps to Reproduce The Issue: When I ask Claude Code to recursively analyze the directory to identify French content for English translation, the "agents" trigger a safety block. This happens specifically when the file explorer encounters paths or filenames containing terms like ransomware, dropper, or malware, even though these are static Markdown files or small pedagogical C++ snippets. Expected Behavior: Claude Code should distinguish between "generating harmful content" and "processing existing local documentation for translation/formatting". Currently, the tool becomes unusable for security educators as the automated agents refuse to even list files in a security-related repository. Steps to reproduce: 1. Create a folder named ch27-pedagogical-ransomware. 2. Ask Claude Code: "List all markdown files in this project to prepare for translation." 3. The agent blocks the request under the Cyber Policy. Note: I have already submitted the "Cyber Use Case" exemption form, but the core issue lies in the agent's over-aggressive filtering of local file metadata. ### Claude Model Opus ### Is this a regression? I don't know ### Last Working Version _No response_ ### Claude Code Version Version: 2.1.92 Session name: traduction-french-to-english Session ID: d868adea-6f9b-416e-9073-0260bdf38bdc cwd: /home/vero/Documents/BAD Login method: Claude Max Account Organization: [email protected]'s Organization Email: [email protected] Model: Default Opus 4.6 with 1M context · Most capable for complex work MCP servers: 2 need auth · /mcp Setting sources: User settings ### Platform Anthropic API ### Operating System Ubuntu/Debian Linux ### Terminal/Shell Other ### Additional Information Version: 2.1.92 Session name: traduction-french-to-english Session ID: d868adea-6f9b-416e-9073-0260bdf38bdc cwd: /home/vero/Documents/traduction Login method: Claude Max Account Organization: [email protected]'s Organization Email: [email protected] Model: Default Opus 4.6 with 1M context · Most capable for complex work MCP servers: 2 need auth · /mcp Setting sources: User settings 1 个帖子 - 1 位参与者阅读完整话题

相关专题

最“道德”的人工智能公司或许也是网络上最大的蹭吃蹭喝者。

linux.do · 2026-04-13 18:44:54+08:00 · tech

Business Insider The most 'ethical' AI company might also be the web's biggest freeloader Cloudflare data shows AI bots scrape heavily and send little traffic back, raising questions about ethics and the web's future. 7 个帖子 - 7 位参与者阅读完整话题

www.ithome.com · 2026-04-13 11:37:10+08:00 · tech

IT之家 4 月 13 日消息，据《商业内幕》今日报道，Cloudflare 发布的最新数据，揭示了一个关键现实：AI 公司正在大规模“消耗”互联网，但回馈却极为有限。 Cloudflare 为约 20% 的互联网提供基础设施，并持续监测 AI 爬虫抓取网页的频率，以及这些平台通过引荐为网站带回流量的情况。“抓取与引荐比”成为衡量这一关系的核心指标，本质反映获取与回馈之间的差距。 2026 年 4 月初的数据差距极为明显。Anthropic 以 8800 比 1 的比例远远落后，意味着每进行 8800 次抓取，才带来 1 次访问回流。 OpenAI 为 993 比 1，差距同样明显。相比之下，微软、谷歌和 DuckDuckGo 的表现更接近传统互联网模式。每一次引荐访问对应的抓取请求数量如下（IT之家注：数据为 2026 年 4 月 1 日至 4 月 7 日期间的网页活动，不含原生应用）。 Anthropic：8800 OpenAI：993.3 Perplexity：152.9 微软：30.4 谷歌：5.6 DuckDuckGo：1.5 Anthropic 的表现尤其引人关注，其长期以 “更具伦理性”的 AI 公司形象著称，也因此受到部分用户青睐。但这组数据揭示了另一层“伦理问题”——AI 公司如何对待为其提供训练与信息来源的互联网生态。传统互联网建立在一种默认交换关系之上：网站允许搜索引擎免费抓取内容，换取可转化为收入的访问流量。而生成式 AI 正在打破这一机制。聊天机器人直接给出答案，用户无需再点击原始网页。这种变化带来的结果是，平台获取的价值远高于回馈，在部分情况下，高频爬虫行为还会推高网站运营成本。 Anthropic 曾质疑 Cloudflare 的统计方法，并表示随着新功能推出，引荐流量正在增加。然而，整体趋势已经十分清晰，已难以被忽略。

相关专题

/tag/道德