Tim Kendall

43 posts

@tkendall

san francisco, ca · Joined March 2007
514 Following · 1.2K Followers
Tim Kendall retweeted
Apollo Research (@apolloaievals)
Apollo is spinning out from fiscal sponsorship into a PBC. We are increasing our efforts in frontier AI safety research, evals and governance. Additionally, we are spinning up a product team to build AGI safety products starting with AI agent monitoring.
3 replies · 13 retweets · 94 likes · 11.3K views
Tim Kendall retweeted
Logical Intelligence (@logic_int)
Our Aleph agent, powered by @OpenAI's GPT‑5.2, scored 668/672 (99.4%) w/ hyper-efficiency on @gtsoukal et al.'s PutnamBench (the hardest formal math benchmark), a critical step in natural language automated code generation (English as programming) with hallucination-free results.
25 replies · 48 retweets · 341 likes · 67.1K views
Tim Kendall retweeted
Logical Intelligence (@logic_int)
Next week we'll have a major announcement to share with more information about the team, the mission, and our new purpose-built, general reasoning model emphasizing prediction power and discovery - and how we will empower deployment of AI autonomy in critical systems where this is currently impossible. Follow us and CEO @evelovesolive for more.
1 reply · 5 retweets · 37 likes · 5.2K views
Tim Kendall retweeted
Alien (@alienorg)
Continuous Human Verification Protocol (CHVP) is the backbone of Alien. The idea behind it is to verify that someone is a human, privately. It's an open protocol that uses all known trustless methods for identity verification. We realized early on that there is no ideal solution that could meet all 4 criteria: privacy of user data, decentralization, uniqueness, and easy UX for global adoption. CHVP is in its early stage of Phase 0 of the network and is the logical evolution to enable trust online in the age of AI.
4 replies · 5 retweets · 36 likes · 95K views
Ashish Vaswani (@ashVaswani)
We are beyond thrilled to share our first flagship models, Rnj-1 base and instruct 8B parameter models. Rnj-1 is the culmination of 10 months of hard work by a phenomenal team, dedicated to advancing American SOTA OSS AI. Lots of wins with Rnj-1.
1. SWE bench performance close to GPT 4o.
2. Tool use outperforming all comparable open source models.
3. Mathematical reasoning (AIME’25) nearly at par with GPT OSS MoE 20B.
….
Essential AI (@essential_ai)

Today, we’re excited to introduce Rnj-1, @essential_ai's first open model; a world-class 8B base + instruct pair, built with scientific rigor, intentional design, and a belief that the advancement and equitable distribution of AI depend on building in the open. We bring American open-source at par with the best in the world.

103 replies · 173 retweets · 1.8K likes · 603K views
Logical Intelligence (@logic_int)
🚀 Aleph prover just went BEAST MODE
4 math problems unsolved for 20+ years. Formal proofs in Lean 4. Less than 48 hours. Under $5k total.
✅ Binomial tail bounds conjecture (Telgarsky, 2009)
✅ Quantum gate lattice approximation (Greene & Damelin, 2015)*
✅ Erdős 124
✅ Erdős 481
✅ #1 on PutnamBench leaderboard
The era of AI mathematics is here. Special thanks to @BorisHanin and @ylecun for helping bring this to life 🙏 And massive kudos to the @LeanFRO team: none of this is possible without the incredible foundation you've built. Aleph will soon be available to the public, stay tuned!
*conditional on results from Sardari (2015), formalization pending
27 replies · 73 retweets · 381 likes · 116.1K views
Tim Kendall retweeted
Logical Intelligence (@logic_int)
Our Aleph prover agent just hit #1 on PutnamBench, a benchmark built from Putnam problems (one of the hardest college-level math competitions), fully formalized with machine-checked proofs and no human involvement. Putnam problems are often considered harder than IMO problems and span a wide range of topics, including calculus, number theory, group theory, and other core areas of mathematics. This is strong evidence that AI can handle deep, multi-step reasoning with correctness guarantees: the same kind of technology we’re using to verify real software, hardware, and scientific discoveries that require formal logic.
4 replies · 34 retweets · 172 likes · 37.9K views
Tim Kendall retweeted
Logical Intelligence (@logic_int)
We just tested Aleph prover on this version of the Erdos #124 problem and were able to prove it in less than 2.5 hours and for under $200 in cost: gist.github.com/winger/a2c27e4… x.com/vladtenev/stat…
Vlad Tenev (@vladtenev)

We are on the cusp of a profound change in the field of mathematics. Vibe proving is here. Aristotle from @HarmonicMath just proved Erdos Problem #124 in @leanprover, all by itself. This problem has been open for nearly 30 years since conjectured in the paper “Complete sequences of sets of integer powers” in the journal Acta Arithmetica. Boris Alexeev ran this problem using a beta version of Aristotle, recently updated to have stronger reasoning ability and a natural language interface. Mathematical superintelligence is getting closer by the minute, and I’m confident it will change and dramatically accelerate progress in mathematics and all dependent fields.

7 replies · 31 retweets · 108 likes · 42K views
Tim Kendall retweeted
extra.email (@extradotemail)
Introducing Extra. Email will never feel the same again. Join the Beta waitlist at extra.email.
19 replies · 16 retweets · 248 likes · 238.5K views
Tim Kendall retweeted
Biz Stone (@biz)
My new project with Pinterest cofounder Evan Sharp, Tangle.com. (Still invite only for now.)
62 replies · 15 retweets · 226 likes · 44K views
Tim Kendall retweeted
Kevin Yang (@kevinyang)
OMG it finally works!! 🚀🚀🚀 I got @OpenAI to draft emails 📧 for me in the background 🤯 This Christmas I built an email assistant (EmailTriager.com) that automatically drafts email replies behind the scenes, no Chrome extension necessary. Here's the story 👇
170 replies · 557 retweets · 5.1K likes · 1.2M views
Tim Kendall retweeted
Pinterest (@Pinterest)
We’re thrilled to announce open registration! This means you can sign up w/o an invite at Pinterest.com. Happy pinning to everyone!
56 replies · 1.1K retweets · 107 likes · 0 views