SPAR

57 posts

SPAR banner
SPAR

SPAR

@SPARexec

We're a part-time, virtual research program that gives students and early career professionals an opportunity to work with professional AI safety researchers.

Tham gia Mart 2024
79 Đang theo dõi525 Người theo dõi
Tweet ghim
SPAR
SPAR@SPARexec·
📣 Only 2 days left to apply for this round of SPAR! Apply by January 14 to join our largest round yet — 130+ projects with mentors from Google DeepMind, RAND, AI Security Institute, Apollo Research, SecureBio, Machine Intelligence Research Institute, and more!
English
2
2
9
1.8K
SPAR đã retweet
Agus 🔸
Agus 🔸@austinc3301·
Applications for the Generator Residency close on Monday EOD! Last chance to apply. Fully funded, 6k stipend + travel + housing, 3 months with an extension, in-person in Berkeley. Probably the best path into AI safety for non-researcher roles.
Agus 🔸@austinc3301

Announcing the Generator Residency: a 3-month residency for AI safety generalists, by @KairosAIS × @ConstellOrg. Fully funded. In-person in Berkeley. Summer 2026. 🗓 Apply by April 27 generatorresidency.org/?utm_source=tw…

English
2
6
59
5.8K
SPAR đã retweet
Kairos
Kairos@KairosAIS·
📣 Only 3 days left to apply for Generator! Apply by April 27, to join our inaugural cohort with advisers from AI Futures Project, BlueDot, Coefficient Giving, FAR. AI, Forethought, METR, RAND, and more! generatorresidency.org
English
1
1
4
471
SPAR đã retweet
Siddharth Boppana
Siddharth Boppana@sidboppana·
Excited to share our new paper! We looked at when reasoning LLMs 'knew' their final answer internally vs. when it was stated in chain-of-thought. Turns out these models can be performative depending on the task!
Siddharth Boppana tweet media
Goodfire@GoodfireAI

LLMs often reason “performatively” well after deciding on a final answer - something that CoT monitors are slow to catch. Our new paper finds that: - probes can help monitor for this - it seems to track with task difficulty - probes enable early CoT exit, saving tokens! (1/7)

English
5
3
32
1.9K
aniket
aniket@aniketdxsh·
life update: spending this summer in berkeley doing tensor network research at @BerkeleyLab! looking forward to living in the area for the first time. also recently got accepted as a @SPARexec fellow and will be working on mechanistic interpretability!
English
2
0
6
420
SPAR đã retweet
Gabriele Sarti
Gabriele Sarti@gsarti_·
In this work, we complement behavioral goal-directedness evals of LLM agents with a probing analysis of environment and plan representations, examining whether observed actions are consistent with models' internal beliefs, and how reasoning affects representations. Check it out!
Mario Giulianelli@glnmario

When we say an AI agent is “goal-directed”, what do we actually mean? In new work from Project Telos, we study this question by combining behavioural evaluation with analysis of internal representations in a language model agent navigating grid worlds. 1/

English
1
2
17
1.9K
SPAR đã retweet
Agus 🔸
Agus 🔸@austinc3301·
we may not have sabrina carpenter but we do have dawn song
Agus 🔸 tweet media
English
1
1
22
791
SPAR đã retweet
LawZero - LoiZéro
LawZero - LoiZéro@LawZero_·
LawZero is accepting applications as part of the SPAR Spring 2026 program!  If you're interested in studying model awareness or emergent misalignment, you can learn more and apply here: sparai.org/projects/sp26/.  Applications are open until Jan 14, 2026.
SPAR@SPARexec

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English
0
5
16
4.7K
SPAR đã retweet
Georg Lange
Georg Lange@_georg_lange·
Come work with me and @SPARexec to build an AI mech interp researcher to accelerate AI safety research.🧠🔬 In the last cohort, my mentees built AI agents that automatically find and refine explanations for SAE features (demo of what they built after only one month below). In this cohort, we want to push for agents that discover and explain full circuits. Deadline is Jan 14th!⏳🗓️
SPAR@SPARexec

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English
2
3
6
929
SPAR
SPAR@SPARexec·
📣 Only 2 days left to apply for this round of SPAR! Apply by January 14 to join our largest round yet — 130+ projects with mentors from Google DeepMind, RAND, AI Security Institute, Apollo Research, SecureBio, Machine Intelligence Research Institute, and more!
English
2
2
9
1.8K
SPAR
SPAR@SPARexec·
Work on a part-time AI safety, AI policy, AI security, or biosecurity project. Open to students & professionals, prior research experience not required for all projects.
English
1
0
2
302
SPAR đã retweet
Andy Liu
Andy Liu@uilydna·
I'm mentoring a SPAR project on evaluating and refining alignment targets for LLMs (constitutions, model specs, etc.) this spring! Apply by January 14 to work with me or other SPAR mentors - project details/application link ⬇️:
SPAR@SPARexec

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English
1
2
7
691
SPAR đã retweet
Agus 🔸
Agus 🔸@austinc3301·
Does training language models on AI safety literature make them more likely to scheme? This is one of the research questions being explored in the upcoming round of @SPARexec. A few projects I'm excited about: 🧵
English
2
3
26
6K
SPAR đã retweet
Jeff Sebo
Jeff Sebo@jeffrsebo·
The NYU Center for Mind, Ethics, and Policy is seeking research fellows to contribute to upcoming reports on legal personhood and economic rights for digital minds. Please apply if you have interest in working with us!
SPAR@SPARexec

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English
2
7
24
2.3K
SPAR đã retweet
Tianyi Alex Qiu
Tianyi Alex Qiu@Tianyi_Alex_Qiu·
I'm glad to mentor again for this round of SPAR, likely with @zhonghaohe! Together let's help human-AI coevolution go a little bit better :) ⬇️🧵Here's a collection of research ideas I'd be excited to mentor projects on. Feel free to pitch yours too!
SPAR@SPARexec

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

English
1
3
10
712
SPAR
SPAR@SPARexec·
🗓️ Applications are open through January 14!
English
0
0
2
281
SPAR
SPAR@SPARexec·
We've also added biosecurity projects this round! Explore the available projects and mentors here: sparai.org/projects/?utm_…
English
1
0
4
380
SPAR
SPAR@SPARexec·
🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.
English
2
3
15
11.5K