Zheng Yuan

Zheng Yuan@GanjinZero·Jan 7

@ElliotGlazer Irrationality of zeta(3) github.com/ahhwuhu/zeta_3…

Elliot Glazer@ElliotGlazer·Jan 7

FLT
PNT with effective bounds
The weak Goldbach conjecture
Infinitude of prime pairs of gap \le 246
Baker's theorem restricted to computable reals
Irrationality of zeta(3)
CFSG
The quasi-polynomial time algorithm for Graph Isomorphism
PL Poincaré Conjecture in dimensions =/= 4.

Elliot Glazer@ElliotGlazer·Jan 7

Prediction: each of the following theorems will have *axiom-free* proofs in Lean within the next 5 years: 🧵

Zheng Yuan@GanjinZero·Dec 22

@ahmedsameh0__0 dm me your cv

Elliot Glazer@ElliotGlazer

Zheng Yuan@GanjinZero·Dec 22

Finally got 10k citations.

Zheng Yuan@GanjinZero·Dec 22

I proved the Riemann hypothesis using native_decide 🤡:
import Mathlib
def foo : Bool :=
  withPtrEq True False (fun () => false) fun eq => nomatch cast eq ⟨⟩
theorem f : False := nomatch show foo = true by native_decide
theorem anything : RiemannHypothesis := by
  exfalso
  exact f

native_decide adds an axiom and is thus illegal for mathlib proofs. many proofs of false have used this tactic. it’s never been so over
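One way to see the point about axioms is to ask Lean which axioms a proof depends on. The sketch below is illustrative only: the theorem names are hypothetical, it assumes a plain Lean 4 setup, and the exact axiom name reported for `native_decide` (historically `Lean.ofReduceBool`) may differ between Lean versions.

```lean
-- Hypothetical example: a trivial fact proved with `native_decide`,
-- which trusts the compiled evaluator instead of the kernel.
theorem small_fact : 2 + 2 = 4 := by native_decide

-- Reports the axioms `small_fact` depends on. The list is expected to
-- include an axiom introduced by `native_decide`.
#print axioms small_fact

-- The same statement proved with `decide`, which is checked by the kernel.
theorem small_fact' : 2 + 2 = 4 := by decide

-- Expected to report no dependency on a `native_decide` axiom.
#print axioms small_fact'
```

Comparing the two reports is a quick check for whether a proof would count as axiom-free in the sense discussed in this thread.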

Zheng Yuan@GanjinZero·Dec 22

This is the arxiv: arxiv.org/abs/2512.17260

Zheng Yuan@GanjinZero·Dec 19

Excited to announce Seed-Prover 1.5, which is trained via large-scale agentic RL with Lean. It proved 580/660 Putnam problems and proved 11/12 in Putnam 2025 within 9 hours. Check details at github.com/ByteDance-Seed…. We will work on autoformalization to contribute to real math!

Zheng Yuan@GanjinZero·Dec 21

@linexjlin Hello, I don't think the effective context lengths of Lean models and natural-language models are directly comparable.

Line@linexjlin·Dec 20

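ByteDance's paper raises a problem: the context window is running out. The Seed-Prover 1.5 paper notes that among the proofs of length 32K-64K generated by their Lean prover, errors are the majority, giving a persistently negative score (+1 for a correct answer, -1 for a wrong one). This shows problem-solving ability degrading under very long CoT (see Figure 4d). By comparison, DeepSeek-Speciale copes with ease: its complex programming tasks average 77k per problem, nearly filling the 128K context. This may be an attention-mechanism issue. Seed-Prover is trained from seed-1.6, which presumably uses GQA; DeepSeek uses DSA.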

Zheng Yuan@GanjinZero·Dec 19

@chijinML Thank you ❤️

Chi Jin@chijinML·Dec 19

This is a truly remarkable math theorem prover! It is well ahead of competitors, nearly saturates PutnamBench, and achieves much higher solve rates on the recently concluded Putnam 2025 in a surprisingly short amount of time.

Zheng Yuan@GanjinZero·Dec 19

@WendaLi8 Thank u!

Wenda Li@WendaLi8·Dec 19

Congratulations to the Seed team. The field is progressing so fast!

Zheng Yuan@GanjinZero·Dec 19

@AlbertQJiang @huajian_xin @regunivers @hanwen_zhu Thank u! Looking forward to yours!

Albert Jiang@AlbertQJiang·Dec 19

@GanjinZero @huajian_xin @regunivers @hanwen_zhu Incredible work, congrats!!!

Zheng Yuan@GanjinZero·Dec 19

@JasonRute @ElliotGlazer using a reasoning model to get the correct answer is quite easy for Putnam

Jason Rute@JasonRute·Dec 19

@ElliotGlazer But for the most recent Putnam, I would hope that (just like the previous IMOs) the teams would also automatically find the answer. In my humble opinion, none of Axiom, Harmonic, or Seed Prover have been very forthcoming with how they do this for Putnam 2025.

Elliot Glazer@ElliotGlazer·Dec 19

Been waiting for this one. Looks like we’re eating good this weekend boys 😋

Zheng Yuan@GanjinZero·Dec 19

@shi_wenlei 🎉

Wenlei Shi@shi_wenlei·Dec 19

Today we released Seed-Prover 1.5, which masters undergraduate-level math theorem proving through an agentic LLM and test-time scaling🥳🥳🥳 github.com/ByteDance-Seed…

Zheng Yuan@GanjinZero·Dec 19

@AIYangMing @huajian_xin @regunivers @hanwen_zhu Thank u!

Ming Yang@AIYangMing·Dec 19

@GanjinZero @huajian_xin @regunivers @hanwen_zhu Congratulations!

Zheng Yuan@GanjinZero·Dec 19

@Teknium @huajian_xin @regunivers @hanwen_zhu Same as seed-1.6

Teknium 🪽@Teknium·Dec 19

@GanjinZero @huajian_xin @regunivers @hanwen_zhu How big is the model?

Zheng Yuan@GanjinZero·Dec 19

@lalalaepsilon @huajian_xin @regunivers @hanwen_zhu We haven't tested on non-formal reasoning yet

Aritra@lalalaepsilon·Dec 19

@GanjinZero @huajian_xin @regunivers @hanwen_zhu Does it generalize to non-formal reasoning as well?

Zheng Yuan@GanjinZero·Dec 19

@allanjienlp scaling!

Allan Jie@allanjienlp·Dec 19

keep scaling!📈

Zheng Yuan@GanjinZero·Dec 19

@nopainkiller merry christmas

Zhipeng Huang@nopainkiller·Dec 19

Christmas!

Zheng Yuan@GanjinZero·Dec 18

@nasqret I found a sorry in this formalization. I think IsIntPoly_div_by_monic has not been finished, and lots of theorems rely on it. We cannot call this paper formally verified.
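A small sketch of why one unfinished lemma matters: in Lean 4, a proof closed with `sorry` compiles with a warning, and every theorem that uses it inherits a dependency on `sorryAx`. The names below are hypothetical stand-ins and are not taken from the formalization under discussion.

```lean
-- Hypothetical unfinished lemma: accepted by Lean, but only with a warning.
theorem helper_lemma (a b : Nat) : a * b = b * a := by
  sorry

-- A downstream theorem that compiles because it rewrites with the lemma above.
theorem main_result (a b c : Nat) : a * b + c = b * a + c := by
  rw [helper_lemma a b]

-- Expected to report a dependency on `sorryAx`, inherited from `helper_lemma`.
#print axioms main_result
```

Running `#print axioms` on the headline theorems of a project is therefore a simple way to check whether a successful build actually amounts to a complete proof.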

Bartosz Naskręcki@nasqret·Dec 18

Mathematical papers need formal validation. This is usually done informally by a referee. But what if we could rely on something more robust like auto-formalization into Lean 4 where the role of the referee would be reduced to meticulous checking of the formulations of the definitions and theorems? The compilation of automatically generated code would become a proof certificate. This is what happened in a longer run which I did with Aristotle by @HarmonicMath. Thanks to @PietroMonticone and @llllvvuu for helping with the setup for the blueprint. Here I present a complete, correct auto-formalization of a paper of my friend Stefan Barańczuk about Chebyshev divisibility sequences. The code is about 5000 lines of highly non-trivial Lean. It corrects all the inconsistencies and gaps in the main paper (even proving some delegated propositions). nasqret.github.io/ZsigmondyCheby… I am going to post a series of such experiments, proving that in some areas of mathematics, including elementary number theory, combinatorics and analysis (all sorts of things covered by Mathlib) we are not far from a massive shift in documentation of validity of proofs. I think this is going to be a hectic year!

Zheng Yuan@GanjinZero·Nov 24

Will be at NIPS from Dec 2~6, find me to talk about formal math reasoning & seed prover.
