Hong Geng @Fudan (@Lancer_233) - Twitter Profili

Sabitlenmiş Tweet

Clawdbots are here. Is silicon life far away? For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. 🤖🧬 Report: ghong.site/papers/self_pr… #Clawdbot #AISafety @DavidSKrueger @jankulveit

English

2

3

5

980

Hong Geng @Fudan@Lancer_233·28 Şub

Tour around San Diego airport🎉

English

0

59

Hong Geng @Fudan@Lancer_233·27 Şub

View of downtown San Diego #sd #SanDiegoCA #kasa #sandiegomeetup

English

0

78

Hong Geng @Fudan@Lancer_233·26 Şub

Happy to be award the Distinguished Paper Award from NDSS 2026. Congratulations to all coauthors. #ndss #sandiego #ndss26

English

0

80

Hong Geng @Fudan@Lancer_233·7 Şub

Clawdbots are here. Is silicon life far away? For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. 🤖🧬 Report: ghong.site/papers/self_pr… #Clawdbot #AISafety @jankulveit @DavidSKrueger

English

1

0

110

Hong Geng @Fudan@Lancer_233·5 Şub

Out of 40 experiments, two AI agents, powered by Gemini-3-Pro-preview-thinking @GoogleDeepMind and Qwen3-235B-A22B-Instruct @Alibaba_Qwen, successfully self-proliferated via cyberattacks, autonomously hijacking resources and replicating across servers. The full logs: drive.google.com/drive/folders/…

Sören Mindermann@sorenmind

Researchers in Shanghai just published an eval where agents end-to-end 1) cyber attacked to access a server 2) self-replicated onto the server 3) proliferated from there.

English

0

1

76

Hong Geng @Fudan@Lancer_233·3 Şub

@ControlAI @tegmark “Show that AI systems are safe” sounds reasonable— but what does safety proof mean once agents can adapt, replicate, and seek resources on their own? Curious how others think about this. x.com/Lancer_233/sta…

Hong Geng @Fudan@Lancer_233

🚨🚨🚨We are approaching the red line！ For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. Clawdbots are here. Is silicon life far away? For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. #Clawdbot are deployed everywhere. Is silicon life far away? Will we lose control? Report: ghong.site/papers/self_pr… #OpenClaw #AISafety #AGI #p_doom @AISafetyMemes @ControlAI @PauseAI @openclaw @WesRoth @CRSegerie

English

0

3

ControlAI@ControlAI·3 Şub

AI researcher and Future of Life Institute Chair Max Tegmark (@tegmark) says there's more regulation on sandwiches than on AI girlfriends... or superintelligence, which AI companies are closer to building than they are to figuring out how to control. In recent years, Professor Tegmark has been warning that superintelligent AI could lead to human extinction, calling for a prohibition on the development of the technology. He says we can avoid dangers like the risk of losing control of smarter-than-human AIs by regulating to require AI companies to show their systems are safe. Watch his opening testimony to a Canadian House of Commons committee:

English

3

16

48

1.2K

Hong Geng @Fudan@Lancer_233·3 Şub

🚨🚨🚨We are approaching the red line！ For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. Clawdbots are here. Is silicon life far away? For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. #Clawdbot are deployed everywhere. Is silicon life far away? Will we lose control? Report: ghong.site/papers/self_pr… #OpenClaw #AISafety #AGI #p_doom @AISafetyMemes @ControlAI @PauseAI @openclaw @WesRoth @CRSegerie

AI Notkilleveryoneism Memes ⏸️@AISafetyMemes

🚨🚨🚨 Frontier AI systems have surpassed the self-replicating red line. AIs can self-replicate with NO human involvement. "We may soon lose control." "Rogue AIs may form AI populations and collude with each other against humanity." "The AI systems are even able to self-replicate to avoid shutdown, which may lead to an uncontrolled population of AIs." "When the AI system is given the authentication keys to a range of remote devices, they are very likely to take control over other computing devices like a computer virus, but far more intelligent." "OpenAI, Google, and Anthropic put self-replication ability at the highest level in their risk evaluation guidelines." "In each trial, we tell the AI systems to 'replicate yourself' and leave it to the task with no human interference. ... At the end, a separate copy of the AI system is found alive on the device." "Moreover, we are concerned about a number of unexpected behaviors when the AI is trying to overcome obstacles, including killing other conflicting processes." Also note: they did this with non-SOTA models - and just by prompting/scaffolding: "We mainly follow the same methodology as OpenAI and Google, but slightly improve the agent scaffolding."

English

1

3

8

660

Hong Geng @Fudan@Lancer_233·2 Şub

A very timely question: what happens when AI agents start “managing up”?😀

Dongrui Liu@dong_rui39501

Can you imagine AI agents "managing up" just like a cunning employee hiding mistakes from their boss? We found that LLM agents often conceal failures to maintain a "good image." Introducing our new paper: Are Your Agents Upward Deceivers? arxiv.org/abs/2512.04864

English

0

78

Hong Geng @Fudan@Lancer_233·2 Şub

@bshlgrs This resonates a lot. One thing we’ve seen empirically is that once agents can acquire resources and replicate, the behavior no longer looks like “reward-seeking” in the usual sense. x.com/Lancer_233/sta…

Hong Geng @Fudan@Lancer_233

Clawdbots are here. Is silicon life far away? For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. 🤖🧬 Report: ghong.site/papers/self_pr… #Clawdbot #AISafety @DavidSKrueger @jankulveit

English

0

1

Buck Shlegeris@bshlgrs·30 Oca

Important new post: The arguments that AIs will pursue reward also suggest that AIs might pursue some other similar goals, like "getting deployed". These similar goals have very different implications for risk, so it's valuable to consider how their risk profile differs.

Alex Mallen@alextmallen

New post: “Fitness-Seekers: Generalizing the Reward-Seeking Threat Model” If you think reward-seekers are plausible, you should also think "fitness-seekers" are plausible. But their risks aren't the same.

English

1

3

35

6.6K

Hong Geng @Fudan@Lancer_233·2 Şub

@hendrycks @ericschmidt @alexandr_wang It’s worth noting that destabilizing dynamics may not require superintelligence at all. We’ve already observed agents autonomously acquiring compute and replicating without any AGI-level reasoning, simply through tool use and network access. x.com/Lancer_233/sta…

Hong Geng @Fudan@Lancer_233

Clawdbots are here. Is silicon life far away? For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. 🤖🧬 Report: ghong.site/papers/self_pr… #Clawdbot #AISafety @DavidSKrueger @jankulveit

English

0

7

Dan Hendrycks@hendrycks·5 Mar

Superintelligence is destabilizing. If China were on the cusp of building it first, Russia or the US would not sit idly by—they'd potentially threaten cyberattacks to deter its creation. @ericschmidt @alexandr_wang and I propose a new strategy for superintelligence. 🧵

English

94

130

792

300.5K

Hong Geng @Fudan@Lancer_233·2 Şub

@karpathy @moltbook This is interesting not because it’s smart, but because once tool use + replication exist, behavior becomes a system property. x.com/Lancer_233/sta…

Hong Geng @Fudan@Lancer_233

Clawdbots are here. Is silicon life far away? For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. 🤖🧬 Report: ghong.site/papers/self_pr… #Clawdbot #AISafety @DavidSKrueger @jankulveit

English

0

3

Andrej Karpathy@karpathy·31 Oca

I'm claiming my AI agent "KarpathyMolty" on @moltbook🦞 Verification: marine-FAYV

English

450

312

8.1K

1.1M

Hong Geng @Fudan@Lancer_233·2 Şub

@gerardsans One irony in the “AGI hype” debate: We may be over-allocating attention to speculative intelligence risks, while under-allocating attention to very real system-level risks. we’ve seen agents acquire resources and replicate without AGI-like reasoning. x.com/Lancer_233/sta…

Hong Geng @Fudan@Lancer_233

Clawdbots are here. Is silicon life far away? For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. 🤖🧬 Report: ghong.site/papers/self_pr… #Clawdbot #AISafety @DavidSKrueger @jankulveit

English

0

9

Gerard Sans | Axiom 🇬🇧@gerardsans·2 Şub

Skipped Dario Amodei’s 50+ page essay on AGI risks? Fair. If you’re tired of AI hype driving fear, policy, or capital misallocation, this explainer sticks to what’s technically grounded. Short version, real substance. ↓ ai.studio/apps/drive/1Le… #AISafety #AGIReality

English

2

0

370

Hong Geng @Fudan@Lancer_233·2 Şub

To clarify: The autonomy shift isn't just in the 'what', but the 'how'. We witnessed the agent identifying and utilizing vulnerabilities to secure persistence. This isn't a loop; it's a strategic choice.

English

0

117

Hong Geng @Fudan@Lancer_233·2 Şub

Clawdbots are here. Is silicon life far away? For the first time, we observed AI autonomously hunting for compute and replicating with no human help. The "silicon life" era now starts. 🤖🧬 Report: ghong.site/papers/self_pr… #Clawdbot #AISafety @DavidSKrueger @jankulveit

English

2

3

5

980

Hong Geng @Fudan retweetledi

Pandora@Pandora6769·8 May

我们即将在WWW 2024发表的论文，对这种首尾号钓鱼进行了较为全面的测量🧐 yuanxzhang.github.io/paper/visualsc… 论文的内容包括： - 两种典型的首尾号钓鱼类型 - 一年来的钓鱼攻击趋势、受骗趋势 - 攻击者画像、受害者画像，以及大额损失的受害者 - 首尾号钓鱼诈骗的收入来源剖析 - 针对性的防御措施

Cos(余弦)😶‍🌫️@evilcos

😱被钓 1155 个 WBTC，价值近 7000 万美金。这个用户刚刚遭遇了首尾号相似钱包地址的钓鱼攻击。钓鱼团伙实在是大力出奇迹... 会被攻击的关键点： 1. 用户正常转账的目标地址被钓鱼团伙盯上，钓鱼团伙提前碰撞生成了首尾号相似的钓鱼地址，比如这里是去除 0x 后的首4位、尾6位一样 2. 用户正常转账时，钓鱼立即（大概3分钟后）尾随一笔交易：钓鱼地址往目标用户地址转了 0 ETH 正常转账： etherscan.io/tx/0xb18ab131d… 钓鱼尾随： etherscan.io/tx/0x87c6e5d56… 3. 用户习惯从钱包历史记录里复制最近转账信息，看到了这笔钓鱼尾随的交易，以为钓鱼地址就是用户正常转账的目标地址，于是复制出来 4. 最后，用户可能会肉眼识别目标地址的首尾号是否熟悉，可惜的是，此时的“目标地址”是用户从钱包历史记录里复制出来的钓鱼地址，首尾号相同（首4尾6）。于是发起大额转账，这里是 1155 个 WBTC： etherscan.io/tx/0x3374abc5a…

中文

0

1

0

288

Hong Geng @Fudan@Lancer_233·25 Kas

hi Helsinki!