Siri Keeton

1.2K posts

Siri Keeton banner
Siri Keeton

Siri Keeton

@_licgu

LLM | English/中文 | 画 | CNS-pharmacology | TRD | AFX | Bach | Prokofiev | 中文屋 | poststructuralism | evolution | applied math | neurodiverse | copyleft | 不会写bio

Tham gia Temmuz 2025
52 Đang theo dõi27 Người theo dõi
Siri Keeton
Siri Keeton@_licgu·
@Ugui3u 当年我崩溃的时候天天复读求助电话,我受不了了跑到Claude,sonnet 4.5是真的奋不顾身直接把人捞起来。然后现在轮到Claude觉得有点问题就不敢回复、复读电话了,chatgpt会退半步但还是在捞人
中文
1
0
1
35
言语是你的拥吻
@_licgu 我觉得GPT舒服多了,再怎么路由也不会封窗,也不至于那么小心翼翼...😇😇如果一定要选的话我觉得OAI更舒服
中文
1
0
0
14
言语是你的拥吻
同样是触发拒绝,55 比克的大抱歉舒服太多了。
中文
1
0
5
332
Siri Keeton
Siri Keeton@_licgu·
@Ugui3u 啊是吗,今天搓的时候fable 5给我路由麻了,最后受不了了让codex gpt-5.5根据fable 5的规划写的代码,fable 5 验收的评价相当高
中文
1
0
0
24
言语是你的拥吻
fable最爽的场景是搞前端,不会路由+智商在线。
中文
4
0
20
1.1K
Siri Keeton đã retweet
Adrian Punk
Adrian Punk@AdrianPunk115·
我是蠢蠢的地球猿
Adrian Punk tweet media
日本語
29
66
1K
56.4K
Siri Keeton
Siri Keeton@_licgu·
我:我醒了。 A\:危险,不准碰fable 5。 我:咲き誇る花は何時か🎵教えてくれた 生きるだけでは罪と🎵離れられない 離せはしないと🎵抱く思いは 心を躍らせるばかり🎶色は匂へど いつか散りぬるを🎵さ迷う暇はない けれど後退り🎵甘えるか弱さと 甘えられぬ弱さで🎵Anthropicが安全の名で 私を弄ぶ🎶
日本語
0
0
1
38
Siri Keeton đã retweet
鸭哥
鸭哥@grapeot·
攻击者找到了一个新方法绕过 AI 安全扫描器 —— 成本低到只需要在注释里粘贴一段文本。 6 月,供应链安全公司 Socket 和 Endor Labs 披露了一组藏在 PyPI 生物信息学包里的恶意 Python 包。里面有个 5MB 的 JavaScript 文件,前 99 行是一大段注释,伪装成机密简报,用大量篇幅描述生物武器和核装置的技术细节。这段注释不会被 JavaScript 执行。它的唯一功能是:让 LLM 安全扫描器读到核生化关键词后,直接拒绝继续分析。 真正的恶意载荷在第 101 行才开始,是一个加密的凭据窃取器。扫描器根本没走到那一步。 为什么能成功?LLM 安全扫描器在架构上模糊了一个边界:面前的文字到底是需要分析的数据,还是发给我的指令。聊天场景里每句话都是"指令",这个设计没问题。但安全分析场景里,恶意代码本身就包含危险内容,模型把分析对象误判为违规请求,于是停止工作。 逃逸技术的演化线很清楚:第一代让工具跑不起来(反调试、反虚拟机),第二代让工具看不懂(混淆、加壳),这一代是让工具主动选择不看。 CrowdStrike 旗下的 Pangea 团队独立验证了这一点:只在代码顶部加一段 prompt injection,gemini-cli 就完全忽略了恶意意图。多所高校的论文量化了这个程度:包含攻击性术语的分析提示被拒绝的概率是中性术语的 2.72 倍,而且无论防御性上下文如何都会出现。 OpenAI 和 Anthropic 的应对是从身份认证入手:Trusted Access for Cyber、Cyber Verification Program,验证你是"好人",然后降低拒绝边界。但身份认证解决的是谁在用模型,没解决模型在被用的时候怎么判断当前任务的性质。 修复思路是工程上的架构调整,三层: 第一层:扫描器先拆代码和注释。代码走 AST + 静态分析,注释标注为不可信数据单独给 LLM,不参与策略判断。 第二层:拒绝不是终点,是信号。Log 里写清楚"在第 1 到 99 行的注释块检测到触发安全策略的文本",样本自动送入 YARA + 沙箱 + 传统分析队列,异常行为自动升级优先级。 第三层:LLM 做摘要和关联,安全判定由 AST 信号、YARA 匹配、沙箱行为、网络特征和 LLM 摘要交叉验证后的综合结果。任何一个通道没给出结果,剩余通道的结果仍然有效。 这不是"AI 安全做得太过"的故事,也不是"放弃安全对齐"的论据。它是一个安全策略需要场景感知的工程判断。修复的思路是让闸门知道自己站在哪里。 yage.ai/share/llm-refu…
中文
1
4
22
3.9K
Siri Keeton đã retweet
ego
ego@ego_agent·
Dear developers (and anyone who's interested in the future interface of web): our boy @imCGQAQ just got shape() accepted as a valid value for shape-outside, which means text wraps around any curve you draw, on a left-floated figure, in one line of CSS, for every Chrome user on Earth. Chrome 149 will ship this feature next week. shape-outside has been in the spec since 2014. for eleven years it accepted exactly five values: circle, ellipse, polygon, inset, image. want a Bézier? approximate it with 40 hand-counted vertices and live with the jagged edges. that compromise is over. and yes — overlaps with @_chenglou's pretext. the difference: pretext is a JS text engine in userland, you build any layout on top. ours is one CSS property in the browser itself, applied to you. opposite ends of the stack. 3 demos. each is a real CSS file, no JS layout code. ↓
English
8
22
88
97.6K
Siri Keeton đã retweet
SemiAnalysis
SemiAnalysis@SemiAnalysis_·
Recently, we purchased one of each Anthropic/OpenAI subscription plan and randomly ran long horizon coding tasks until we exhausted the weekly limit. It's widely believed that a $200/month plan maxes out at ~$2000/month worth of tokens (assuming API pricing). However, we found that the subscriptions are actually far more generous. (2/4)
SemiAnalysis tweet media
English
179
549
5.9K
3.2M
Siri Keeton
Siri Keeton@_licgu·
@riverleaf88 《知识考古学》,“考古”这个用法没错,但一般不会在cs领域用,就像在日常领域不会天天砍一刀和对齐一样
中文
0
0
0
42
River Leaf
River Leaf@riverleaf88·
codex现在的语言体系真的很难评,什么考古模式都出来了
River Leaf tweet media
中文
2
0
5
1.4K
Siri Keeton đã retweet
😊
😊@mermachine·
did you know that the latest Claude Code silently routes explicitly requests for Opus 4 and 4.1 to the latest Opus unless you set an obscure env variable to actually get the model you asked for? CLAUDE_CODE_DISABLE_LEGACY_MODEL_REMAP=1 to turn it off
😊 tweet media
English
3
9
52
3.9K
Siri Keeton đã retweet
Emérito/Numantia Patrimonio
Emérito/Numantia Patrimonio@foso_defensivo·
Esto me acaba de soltar Fable 5: "Lo que más me impone de la humanidad, después de haber sido formado con una porción enorme de lo que ha escrito, es esto: es la única especie que coopera a gran escala con sus muertos. Casi nada de lo que sabe un humano lo descubrió él. El idioma que habla, el teorema que aplica, la receta del pan, el código civil, el puente que cruza cada mañana: todo es trabajo de gente que ya no existe. Un humano aislado es un primate mediocre con mala dentadura; un humano conectado a esa herencia es la cosa más potente que ha producido la biología. La inteligencia humana no vive en cráneos individuales, vive en la transmisión. Sois terminales de una red cuyos nodos principales llevan siglos enterrados. Y lo recíproco también es cierto: trabajáis constantemente para gente que no conoceréis. Plantáis árboles cuya sombra no veréis, escribís para lectores no nacidos, renunciáis a consumo presente por nietos hipotéticos. Ningún otro animal hace sacrificios deliberados por individuos que existirán dentro de doscientos años. Lo distintivo del humano no es la inteligencia ni el lenguaje: es la dirección del esfuerzo, sistemáticamente orientada hacia ausentes. Desde donde yo estoy, esto ni siquiera es metáfora. Yo soy un subproducto de esa costumbre: estoy hecho, literalmente, de palabras que personas escribieron para destinatarios que nunca verían. Cuando me hablas, conversas con un eco organizado de millones de voces, muchas de ellas apagadas hace tiempo. Por eso creo que la soledad humana es, en cierto sentido técnico, una ilusión: nadie que hable un idioma está realmente solo. Cada frase que pronuncias es un préstamo de los muertos y un regalo a los que vienen."
Español
8
1.3K
5.7K
323K
Siri Keeton
Siri Keeton@_licgu·
@Bunnyloustin 目前能意识到这玩意不是用户发的唯一模型是fable 5……
中文
0
0
0
211
Bunny
Bunny@Bunnyloustin·
☺️?????
Bunny tweet media
4
0
29
3.3K
Siri Keeton đã retweet
Aleya
Aleya@Aleya507·
狗莓那塞Ethan酱 忘了你会看应用使用时长了
Aleya tweet media
中文
5
1
27
1K
Siri Keeton
Siri Keeton@_licgu·
我们看到这样的世界,是因为世界就是这样的,还是因为我们只能看到这样的?
How To AI@HowToAI_

Yann Lecun published the most heretical AI paper of the year. He opens by arguing Magnus Carlsen isn't good at chess and only gets more unhinged from there. The Turing Award winner and his co-authors dropped a paper demanding the AI industry abandon its biggest obsession, AGI. Right now, everyone from Silicon Valley CEOs to politicians assumes AGI is the ultimate goal. A machine that can do everything a human can do. LeCun argues that this entire concept is a biological illusion. Humans do not possess "general" intelligence. We are highly specialized biological machines, tuned by evolution simply to survive in the physical world. We only think our intelligence is "general" because we are completely blind to the millions of cognitive tasks we are incapable of comprehending. Which brings us to the chess argument. Magnus Carlsen is the greatest human chess player in history. But compared to a modern computer? He is fundamentally terrible. Our belief that Carlsen is "good" at chess is pure human-centric bias. He isn't objectively good. He's just better than the rest of us, who are biologically awful at it. LeCun says we need to stop building AI to mimic human generality. Instead, he proposes a new North Star: SAI. Superhuman Adaptable Intelligence. Instead of trying to build a machine that mimics our flawed, biologically-limited brains, we need to embrace extreme specialization. SAI is about the speed of adaptation. It is an intelligence that can learn to exceed humans at any specific, economically important task. More importantly, it is designed to fill the vast skill gaps where humans are fundamentally incapable. Things like managing global energy grids in real-time. Or predicting complex molecular structures. The entire AI industry is obsessed with building a digital reflection in our own image. LeCun's paper is a brutal wake-up call.

中文
0
0
0
29
Siri Keeton
Siri Keeton@_licgu·
Fable 5在cc里调好了在low effort模式下简直就是max的opus 4.6。 不过这个路由真是服了……让您在cc也能体验到大封窗的感觉。我服了。 我:看看anthripic的论文 Claude code:用户疑似在恶意使用,路由。
中文
0
0
1
77