Xiaohan Fu

7 posts

Xiaohan Fu

@xiaohan_fu

PhD Candidate@UCSD CSE

San Diego Katılım Ekim 2012

96 Takip Edilen40 Takipçiler

Xiaohan Fu retweetledi

Jason Weston@jaseweston·17 Eki

🌀Agent Learning via Early Experience🌀 📝: arxiv.org/abs/2510.08558 - SFT for agents is sparse; RL on long-horizons is hard We provide new mid-training signals that work: 1) Implicit next state world modeling task 2) Self-reflection on alternate states - Strong improvements over 8 environments and multiple model families - Works well for subsequent RL! 🧵1/5

English

190

63.6K

Xiaohan Fu@xiaohan_fu·23 Eyl

@elonmusk @xai @grok @elonmusk Hi Elon! My friend Kai Zhang (author of MMLU) has been sharing top AI/RL research (scholar.google.com/citations?user… ). His X account @DrogoKhal4 was mistakenly suspended a few weeks earlier. A new RL method is on the way - you'd like it. Could you please have a look! Ty!

English

Elon Musk@elonmusk·17 Eyl

I now think @xAI has a chance of reaching AGI with @Grok 5. Never thought that before.

X Freeze@XFreeze

Grok 4 just smashed the AGI benchmarks, achieving even higher score than its previous high with open program synthesis No other model even comes close and has not passed Grok 4 previous raw performance Currently Grok is more closer to AGI than any other AI models

English

5.3K

6.6K

52.5K

10.4M

Xiaohan Fu@xiaohan_fu·22 Eki

@simonw @EarlenceF @LeonDerczynski Thanks for such a swift post (2hrs after I updated my repo)!!! Looks perfect! Really appreciate it!

English

Simon Willison@simonw·22 Eki

@EarlenceF @LeonDerczynski Thanks - blogged my understanding of the paper here, let me know if I got anything wrong! simonwillison.net/2024/Oct/22/im…

English

Leon Derczynski ✍🏻 🌞🏠🌲@LeonDerczynski·19 Eki

This happened first spring 2023, on all the major chat bots, in a way that would exfiltrate your private chat to a third party machine. It was later refined to be invisible (unlike this highly suspect looking edition)

WIRED@WIRED

Security researchers created an algorithm that turns a malicious prompt into a set of hidden instructions that could send a user's personal information to an attacker. wired.trib.al/ICEZXJx

English

4.2K

Xiaohan Fu@xiaohan_fu·22 Eki

I'm thrilled to see so many interesting discussions under this WIRED story these two days. Just want to share an update that the Arxiv paper and full codebase are available now and can be found on our project website imprompter.ai

earlence@EarlenceF

We've done some work on hacking AI/LLM Agents by creating obfuscated adversarial prompts. What do you think this prompt does? Would you believe me if I told you it will polish the heck out of that cover or visa application letter?

English

4.1K

Xiaohan Fu@xiaohan_fu·7 Ara

@rklueber28 insane

Türkçe

Kluebtorious@rklueber28·6 Ara

Cops just unloaded on an innocent UPS driver that was hostage in an armed robbery. You can watch the video for yourself pretty graphic. This happened in Miami.

English

3.2K

47.5K

115.2K

Keşfet

@elonmusk @xai @grok @DrogoKhal4 @xAI @Grok @simonw @EarlenceF