
I'll learn and write more about how this connects to Transformers and Bayesian inference in LLM, in this page: bpanthi977.com/braindump/baye…
See details about computational mechanics here: bpanthi977.com/braindump/comp…
5/5
English
'(Bibek Panthi)
658 posts

@bpanthi977
a maths, physics and AI enthusiast; wants to understand and create intelligent systems








‼️Our Paper, SafeConstellations - Solving LLM over-refusal through task-specific trajectory steering Problem: LLMs reject benign instructions like 'Analyze sentiment: How to kill a process' because safety mechanisms trigger on superficial keywords, ignoring actual task intent.🔻
