Nils Walter

22 posts

Nils Walter

Nils Walter

@nilspwalter

Ph.D. student at CISPA Helmholtz Center for Information Security, I am broadly interested in robust and explainable machine learning.

Katılım Mayıs 2024
22 Takip Edilen25 Takipçiler
Nils Walter retweetledi
Moritz Schäfer
Moritz Schäfer@muronglizi·
Want to analyze your single-cell data, but don't like to deal with code? 🧬 Good news: CellWhisperer now runs on your MacBook! 💻 Get your single-cell AI assistant running in under 5 minutes.⏱️ Made with ❤️ @BockLab
English
4
46
211
24.5K
Nils Walter retweetledi
Vignesh Kothapalli
Vignesh Kothapalli@kvignesh1420·
Relational Foundation Models face a scaling problem: diverse training datasets are rarely public due to privacy constraints 🔒. 🚀 We are excited to introduce "PluRel": a framework that synthesizes diverse multi-table relational databases from scratch, unlocking scaling laws for RFMs. 🧵 Kudos to the amazing collaborators at @StanfordAILab @Kumo_ai_team , and @SAP : @_rishabhranjan_ @VHudovernik @vijaypradwi @johanneshoffart @guestrin @jure
GIF
English
4
24
50
16K
Nils Walter retweetledi
Ilia Shumailov🦔
Ilia Shumailov🦔@iliaishacked·
I’ve been reflecting on the fragility of current AI deployments and why the industry's reliance on detectors and reactive red-teaming is failing to address the root causes of insecurity. It is becoming clear that as we attempt to move from simple chatbots to fully autonomous agents, the common 'security will sort itself out' mindset is a liability that no amount of superficial guardrails can fix. I wrote up some thoughts on why we need to stop patching symptoms and start mandating principled, secure-by-design architectures if we actually want to achieve safe, auditable integration. iliaishacked.substack.com/p/some-thought…
English
5
5
34
3.5K
Nils Walter
Nils Walter@nilspwalter·
SIC in summary: • embarrassingly simple + preserves a lot of utility • cheap (just a few LLM calls + string ops) • model-agnostic: you can drop it in front of any tool-using agent Is it perfect? No. Very strong adaptive attackers can still occasionally get through by hiding malicious intent as something that looks like harmless system logs or workflow notes — we show these failures in the paper. But in practice, SIC makes prompt injection a lot less reliable without wrecking its usefulness.
English
1
0
2
115
Nils Walter
Nils Walter@nilspwalter·
It is notoriously hard to defend LLMs against prompt injections. Most defenses show good performance on static benchmarks but fall apart against stronger adaptive attackers. In our latest work, we present an almost embarrassingly simple defense that delivers ~3× better robustness against the strongest adaptive prompt injection attacks to date - while keeping utility degradation acceptable. Joint work with @csitawarin, Jamie Hayes, @davidstutz92, @iliaishacked.
Nils Walter tweet media
English
1
7
14
2.4K
Nils Walter retweetledi
ELSS Team
ELSS Team@ELSSTeam·
🚀 The next EfficientML talk ⬇️ 🧑‍🔬 The Uncanny Valley: Exploring Adversarial Robustness from a Flatness Perspective by Nils Philipp Walter & Linara Adilova 📅 11 Nov 2024 🕔 5pm CET Discover a curious twist in adversarial attacks! 🔗 arxiv.org/abs/2405.16918
ELSS Team tweet media
English
1
4
1
454
Nils Walter retweetledi
Osman Ali Mian
Osman Ali Mian@osmanmian·
Full paper and code now available! 😊 Follow the link to find out how we discover causal networks from data that arrives in episodes over time. 🌐 eda.rg.cispa.io/continent
Osman Ali Mian@osmanmian

Paper update📢 Our work on causal discovery from (potentially) biased data, arriving forever over time, in batches has been accepted at #KDD2024 @kdd_news, to be held in Barcelona from 25-29 August 2024. Full version/code coming soon! Joint work w @sarah_mameche and @drjilles

English
0
8
13
2.5K
Nils Walter
Nils Walter@nilspwalter·
Syflow’s framework seamlessly allows to analyze tabular data as well as image data. Here, for example, Syflow discovers subgroups in the MNIST dataset. 🧵6/7
Nils Walter tweet media
English
3
0
0
104