SPAR

9

1.8K

SPAR đã retweet

Agus 🔸@austinc3301·25 Nis

Applications for the Generator Residency close on Monday EOD! Last chance to apply. Fully funded, 6k stipend + travel + housing, 3 months with an extension, in-person in Berkeley. Probably the best path into AI safety for non-researcher roles.

Agus 🔸@austinc3301

Announcing the Generator Residency: a 3-month residency for AI safety generalists, by @KairosAIS × @ConstellOrg. Fully funded. In-person in Berkeley. Summer 2026. 🗓 Apply by April 27 generatorresidency.org/?utm_source=tw…

English

6

59

5.8K

SPAR đã retweet

Kairos@KairosAIS·25 Nis

📣 Only 3 days left to apply for Generator! Apply by April 27, to join our inaugural cohort with advisers from AI Futures Project, BlueDot, Coefficient Giving, FAR. AI, Forethought, METR, RAND, and more! generatorresidency.org

English

4

471

SPAR đã retweet

Agus 🔸@austinc3301·9 Nis

Announcing the Generator Residency: a 3-month residency for AI safety generalists, by @KairosAIS × @ConstellOrg. Fully funded. In-person in Berkeley. Summer 2026. 🗓 Apply by April 27 generatorresidency.org/?utm_source=tw…

English

17

54

433

52K

SPAR đã retweet

Siddharth Boppana@sidboppana·12 Mar

Excited to share our new paper! We looked at when reasoning LLMs 'knew' their final answer internally vs. when it was stated in chain-of-thought. Turns out these models can be performative depending on the task!

Goodfire@GoodfireAI

LLMs often reason “performatively” well after deciding on a final answer - something that CoT monitors are slow to catch. Our new paper finds that: - probes can help monitor for this - it seems to track with task difficulty - probes enable early CoT exit, saving tokens! (1/7)

English

5

3

32

1.9K

SPAR@SPARexec·21 Şub

@aniketdxsh @BerkeleyLab late follow-up, but congratulations ;)

English

2

52

aniket@aniketdxsh·7 Şub

life update: spending this summer in berkeley doing tensor network research at @BerkeleyLab! looking forward to living in the area for the first time. also recently got accepted as a @SPARexec fellow and will be working on mechanistic interpretability!

English

Mario Giulianelli@glnmario

0

6

420

SPAR đã retweet

Gabriele Sarti@gsarti_·20 Şub

In this work, we complement behavioral goal-directedness evals of LLM agents with a probing analysis of environment and plan representations, examining whether observed actions are consistent with models' internal beliefs, and how reasoning affects representations. Check it out!

When we say an AI agent is “goal-directed”, what do we actually mean? In new work from Project Telos, we study this question by combining behavioural evaluation with analysis of internal representations in a language model agent navigating grid worlds. 1/

English

2

17

1.9K

SPAR đã retweet

Agus 🔸@austinc3301·15 Oca

we may not have sabrina carpenter but we do have dawn song

English

22

791

SPAR đã retweet

LawZero - LoiZéro@LawZero_·13 Oca

LawZero is accepting applications as part of the SPAR Spring 2026 program! If you're interested in studying model awareness or emergent misalignment, you can learn more and apply here: sparai.org/projects/sp26/. Applications are open until Jan 14, 2026.

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English

5

16

4.7K

SPAR đã retweet

Georg Lange@_georg_lange·13 Oca

Come work with me and @SPARexec to build an AI mech interp researcher to accelerate AI safety research.🧠🔬 In the last cohort, my mentees built AI agents that automatically find and refine explanations for SAE features (demo of what they built after only one month below). In this cohort, we want to push for agents that discover and explain full circuits. Deadline is Jan 14th!⏳🗓️

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English

3

6

929

SPAR@SPARexec·13 Oca

@_AR999_ Anywhere on Earth! time.is/Anywhere_on_Ea…

English

2

42

SPAR@SPARexec·13 Oca

📣 Only 2 days left to apply for this round of SPAR! Apply by January 14 to join our largest round yet — 130+ projects with mentors from Google DeepMind, RAND, AI Security Institute, Apollo Research, SecureBio, Machine Intelligence Research Institute, and more!

English

9

1.8K

SPAR@SPARexec·13 Oca

Apply here before 11:59 PM AoE on Wednesday, January 14th! sparai.org/projects/?utm_…

English

3

284

SPAR@SPARexec·13 Oca

Work on a part-time AI safety, AI policy, AI security, or biosecurity project. Open to students & professionals, prior research experience not required for all projects.

English

0

2

302

SPAR đã retweet

Andy Liu@uilydna·12 Oca

I'm mentoring a SPAR project on evaluating and refining alignment targets for LLMs (constitutions, model specs, etc.) this spring! Apply by January 14 to work with me or other SPAR mentors - project details/application link ⬇️:

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English

2

7

691

SPAR đã retweet

Agus 🔸@austinc3301·11 Oca

Does training language models on AI safety literature make them more likely to scheme? This is one of the research questions being explored in the upcoming round of @SPARexec. A few projects I'm excited about: 🧵

English

3

26

6K

SPAR đã retweet

Jeff Sebo@jeffrsebo·9 Oca

The NYU Center for Mind, Ethics, and Policy is seeking research fellows to contribute to upcoming reports on legal personhood and economic rights for digital minds. Please apply if you have interest in working with us!

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English

7

24

2.3K

SPAR đã retweet

Justin Shenk@justinshenk·8 Oca

Join me next Spring in exploring how time is represented in LLMs 🕓 Deadline: January 14th sparai.org/projects/sp26/…

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English

4

5

513

SPAR đã retweet

David Williams-King@deepelfery·8 Oca

I'm a SPAR mentor, if you'd like to work on solving Anthropic cyber espionage type attacks, please do apply!

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English

2

3

544

SPAR đã retweet

Tianyi Alex Qiu@Tianyi_Alex_Qiu·8 Oca

I'm glad to mentor again for this round of SPAR, likely with @zhonghaohe! Together let's help human-AI coevolution go a little bit better :) ⬇️🧵Here's a collection of research ideas I'd be excited to mentor projects on. Feel free to pitch yours too!

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English

3

10

712

SPAR@SPARexec·8 Oca

🗓️ Applications are open through January 14!

English

2

281

SPAR@SPARexec·8 Oca

We've also added biosecurity projects this round! Explore the available projects and mentors here: sparai.org/projects/?utm_…

English

0

4

380

SPAR@SPARexec·8 Oca

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English