Leon Derczynski ⚒️☁️🏔️🌲

25.5K posts

Leon Derczynski ⚒️☁️🏔️🌲

@LeonDerczynski

NLP/ML/language/security. Principal research scientist @NVIDIA, & Prof @ITUkbh. Views ostensibly professional. llmsec stan acct

Seattle / Copenhagen Katılım Ocak 2012

1.1K Takip Edilen6.5K Takipçiler

Sabitlenmiş Tweet

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·13 Haz

Proud to announce: 💫 garak - an LLM vulnerability scanner💫 🔎 Check if a model is susceptible to common attacks 🦜 Supports HuggingFace, OpenAI, ggml, Cohere, ... 🔧 >70 probes: prompt injection, false claims, toxicity, encoding evasion, .. github.com/leondz/garak/

English

337

63.3K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·3d

@deliprao peer - sure, seems relatable

English

188

Delip Rao e/σ@deliprao·4d

"peer" review

English

236

45.1K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·2 May

@ChShersh push tests that turn it red. forces reading and thinking

English

Dmitrii Kovanikov@ChShersh·30 Nis

I miss the toxic tech culture of receiving 50+ code review comments. Now everyone just pushes AI slop and doesn’t care as long as CI is green.

English

142

194

3.9K

133.5K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·1 May

@moyix it's not polite to rowhammer your spouse brendan

English

190

Brendan Dolan-Gavitt@moyix·30 Nis

Wow you ask a couple INNOCENT questions about running some INNOCENT binaries on your wife's laptop and all of a sudden it's some kind of big investigation

English

2.9K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·17 Nis

@nikaletras Below 25% gives unacceptable errors

English

Nikos Aletras@nikaletras·16 Nis

Hot take: *ACL venues should align with the broader ML community by adopting a ~25-30% acceptance rate for Main, while maintaining ~15% for Findings. This is rather straightforward and would not massively impact the overall size of our conferences. ⬇️

English

2.1K

Nikos Aletras@nikaletras·16 Nis

I find really bizarre that main conference acceptance rates at top #NLProc venues (*ACL, EMNLP) hover around 19-22%, while top ML venues (NeurIPS, ICLR, ICML) consistently accept 25-30% or more. This is problematic for our community for two main reasons ⬇️

English

18.1K

Leon Derczynski ⚒️☁️🏔️🌲 retweetledi

Rhys@RhysSullivan·5 Nis

it's starting

leo 🐾@synthwavedd

lmfao gpt image 2 just generated me an image with the gemini watermark in the bottom right completely unprompted

English

2.1K

37.6K

1.7M

Leon Derczynski ⚒️☁️🏔️🌲 retweetledi

🎭@deepfates·5 Nis

im curious about the history of "prompts" -- as in, the > or $ or whatever in your terminal, not the text string for AI models. unfortunately this is impossible to google now

English

332

21.6K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·2 Nis

@verena_rieser @GoogleDeepMind nice work!

English

Leon Derczynski ⚒️☁️🏔️🌲 retweetledi

Verena Rieser@verena_rieser·1 Nis

What will AI safety look like in an agentic world? 🕵🏼‍♀️ important foresight piece led by my team @GoogleDeepMind

Matija Franklin@FranklinMatija

Excited about our new paper: AI Agent Traps AI agents inherit every vulnerability of the LLMs they're built on - but their autonomy, persistence, and access to tools create an entirely new attack surface: the information environmental itself. The web pages, emails, APIs, and databases agents interact with can all be weaponised against them. We introduce a taxonomy of six classes of adversarial threats - from prompt injections hidden in web pages to systemic attacks on multi-agent networks. I’m outlining the six categories of traps in the thread bellow

English

3.5K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·17 Mar

@segyges "form carries meaning" section is /really/ short on argumentation and citations. the arguments are definitely out there, in droves - do you not want to bring them in instead of the sloppy hand-waving here?

English

1.3K

SE Gyges@segyges·16 Mar

"Stochastic Parrots" is a meme that won't go away. It seemed important enough to do a rundown of everything that is wrong with the technical or "philosophy of language" side of the paper (which is everything). 👇

English

137

21.7K

Leon Derczynski ⚒️☁️🏔️🌲 retweetledi

NVIDIA AI Developer@NVIDIAAIDev·3 Şub

30M downloads and counting for the NVIDIA Nemotron family on @huggingface 🤗 We're grateful for the incredible community that has made this possible. Get started with Nemotron: nvda.ws/4q8MtVP

English

117

18.8K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·19 Ara

@s4rah_dev @guyrleech i'd defer to English English for this one

English

sarah@s4rah_dev·18 Ara

@guyrleech they are call biscuits here in North America so I wasn’t too sure….

English

1.5K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·19 Ara

@ajrgd @s4rah_dev

QME

Alex Greenland@ajrgd·18 Ara

@s4rah_dev please tell me this is bait? afternoon tea, with jam and clotted cream. big debate on which goes on first (devon vs cornwall). i'm devon

English

532

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·19 Ara

@s4rah_dev

GIF

QME

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·19 Ara

@arvidkahl should be green

English

422

Arvid Kahl@arvidkahl·18 Ara

Is there already such a thing as an "external hardware LLM" like we have external hard drives? Instead of having to run/maintain a local model, I want an inference machine that I can just plug in and point my prompts at. Single GPU, maybe a few in parallel. Who's building this?

English

620

143

404.7K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·19 Ara

@sarahookr 哇..!

日本語

827

Sara Hooker@sarahookr·19 Ara

Hands down one of the best meals yet I have had in London.

English

1.3K

114.4K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·16 Ara

@ObsoleteSony

GIF

QME

995

Obsolete Sony@ObsoleteSony·16 Ara

What are your top 3 must-play PS1 games for someone who has never experienced the console before?

English

266

962

56.5K

Leon Derczynski ⚒️☁️🏔️🌲 retweetledi

Tri Dao@tri_dao·16 Ara

Nvidia continues to put out some of the strongest and fastest open models. Pretraining and post training data are released as well, something very few orgs have done

Bryan Catanzaro@ctnzr

Today, @NVIDIA is launching the open Nemotron 3 model family, starting with Nano (30B-3A), which pushes the frontier of accuracy and inference efficiency with a novel hybrid SSM Mixture of Experts architecture. Super and Ultra are coming in the next few months.

English

379

28.9K

Leon Derczynski ⚒️☁️🏔️🌲 retweetledi

Nathan Lambert@natolambert·15 Ara

It's an honor to be competing with Nvidia for the best models with open data, checkpoints, and code. Super excited about Nemotron 3 and Nvidia's new focus on fully open models in 2025.

Bryan Catanzaro@ctnzr

English

354

31K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·15 Ara

@soumyesinghal @nvidia cracked

English

118

Soumye Singhal@soumyesinghal·15 Ara

🚀 Nemotron 3 Nano is live! Had a blast post-training this model with a cracked team. Its strong for its size, and highly efficient at inference. And true to @nvidia's open release style: weights (BF16/FP8/base) + training recipes + code + datasets. HF: huggingface.co/collections/nv… Blog + Nano tech report: nvda.ws/48RusVt

English

2.2K

Leon Derczynski ⚒️☁️🏔️🌲@LeonDerczynski·15 Ara

@llm_wizard banger

Indonesia

134

Chris 🇨🇦@llm_wizard·15 Ara

Nemotron 3 Nano is released (and it's a banger), but more importantly: It's just as open as the last one, and it's ONLY THE FIRST ONE. Super and Ultra: OTW > Model Weights - RELEASED > Pre-Training Data - MOSTLY RELEASED > Post-Training Data - MOSTLY RELEASED > RL Environments - RELEASED (as well as a library to train the model) Tech Report, blogs, videos, guides, AND MORE.

English

158

10.6K

Keşfet

@deliprao @ChShersh @moyix @nikaletras @verena_rieser @GoogleDeepMind @segyges @huggingface