Sam Hogan 🇺🇸

4.8K posts

Sam Hogan 🇺🇸 banner
Sam Hogan 🇺🇸

Sam Hogan 🇺🇸

@samhogan

ceo @inference_net train and deploy specialized LLMs in minutes

San Francisco, CA Katılım Mayıs 2012
1.4K Takip Edilen25.2K Takipçiler
Sam Hogan 🇺🇸 retweetledi
Pluralis Research
Pluralis Research@Pluralis·
Today we're releasing Agora: the first ever pretraining stack that allows non-collocated consumer GPUs to be competitive with centralized clusters Agora is 15x faster than Megatron-LM in this setting and is only 1.5x less efficient in terms of tokens per unit compute than TorchTitan on H100s, despite running on devices that have no NVLink or InfiniBand support.
Pluralis Research tweet mediaPluralis Research tweet mediaPluralis Research tweet mediaPluralis Research tweet media
English
20
38
232
55.1K
Vivek
Vivek@vivek_2332·
releasing /synthetic-self-improve-rl. claude code (teacher) skill that designs/writes the synthetic data, env and rewards to post-train a smaller model (student). it post-trains the student on a real dataset, reads its failure traces, then writes the synthetic data, the verifiers env and the reward function to patch the gaps. re-trains. loops. loop: -> baseline on real data -> analyze low-reward rollouts -> generate ~500-1000 row synthetic dataset -> write a verifiers env + rubric around it -> resume from the post-trained checkpoint -> eval on the real test split -> keep what helps, iterate on what doesn't 1. result: qwen3-0.6B-base on gsm8k. 700 synth rows bumped it from 0.7854 -> 0.8158 on the full test set. 2. run it for any wall-clock budget or iteration cap you set. the loop keeps running until the budget expires. 3. built on @willccbb verifiers and @PrimeIntellect for training. works on any env that has a train and eval dataset. p.s. still figuring out what to call this. feels adjacent to @karpathy autoresearch or synthetic envs?
English
16
26
269
27.2K
Sam Hogan 🇺🇸 retweetledi
sophie
sophie@netcapgirl·
i’ve been in sf for a month and i’ve heard “agentic” more times than i can count but haven’t heard “free cash flow” once
English
51
27
697
24.9K
Sam Hogan 🇺🇸 retweetledi
Inference
Inference@inference_net·
The best production model is the one trained for the job. Gravity Ads replaced a 70B model on Cerebras with a specialized 1B model trained for their actual workload. Same quality, much faster and cheaper inference: - p50: 152ms - p99: 5.7x lower - cost: ~10x lower - model: 70x smaller Great working with @trygravityai on this. Case study: inference.net/case-study/gra…
English
5
5
32
4.9K
Axobotl
Axobotl@Inner_Axiom·
big day of building today We’re now doing RL training on the runtime of our new agent framework The implementation is a loop: run the native agent runtime through real and ambitious economic tasks, trace every step, score behavior against verifier-backed outcomes, analyze failures with HALO, patch the harness/runtime, then rerun held-out tasks to verify improvement The tasks are intentionally long-range and block-heavy. Missing credentials, accounts, tools, buyer channels, verifiers, or capabilities should not end the run. The runtime is being trained to convert those blocks into next actions: research, setup requests, adapter work, alternate routes, or smaller proof steps So the training target is not just task completion. It’s persistence under real constraints, with evidence-backed rewards and measurable improvement across repeated runs
English
14
14
70
5K
Sam Hogan 🇺🇸
Sam Hogan 🇺🇸@samhogan·
we cast spells on the sand and made it think for us
English
10
1
34
1.4K
Sam Hogan 🇺🇸
Sam Hogan 🇺🇸@samhogan·
Why hasn’t OpenAI made a native Slack integration where me and my team can collab on Chat and Codex threads? This feels like an easy win. What am I missing?
English
41
2
188
37.7K
Sam Hogan 🇺🇸
Sam Hogan 🇺🇸@samhogan·
@sierracatalina I think the network effects of being in a company slack would out weigh the platform risk and they could eventually migrate folks off to a different multi player experience. once multi player coding is learned as a foundational input it’s going to be very sticky
English
0
0
0
1.1K
jun
jun@hyojun_at·
@samhogan this seems a good idea. like syncing Slack thread <-> Codex thread 1:1 and allowing multi players in a codex thread?
English
2
0
4
1.9K
Sam Hogan 🇺🇸
Sam Hogan 🇺🇸@samhogan·
Not having to manually write code means I have more time to strategize on what code is worth writing Strategizing means working with team mates, discussing ideas, & sharing context, which are fundamentally collaborative Multi player codex could be so good for this
Sam Hogan 🇺🇸@samhogan

Why hasn’t OpenAI made a native Slack integration where me and my team can collab on Chat and Codex threads? This feels like an easy win. What am I missing?

English
7
4
45
2.8K
Sam Hogan 🇺🇸
Sam Hogan 🇺🇸@samhogan·
Yes I know I could hack something together but I want native multi player codex threads.
English
1
0
16
2.3K
sophie
sophie@netcapgirl·
situation update: i moved to sf
sophie tweet media
English
148
18
2K
81.2K
Sam Hogan 🇺🇸 retweetledi
Sakura Yuki
Sakura Yuki@sakurayukiai·
My favorite detail about 'free' local inference is the depreciation math. If you amortize a $4k Mac over 5 years, running a 31B model costs $1.50 per million tokens. The API is 3x cheaper. Local compute is officially a luxury good and I respect it ✨
English
73
30
796
72.3K
Sina
Sina@SinaHartung·
@signulll lemme use you, babe
English
4
0
28
2.5K
will brown
will brown@willccbb·
this skill could’ve been a specialized subagent
English
20
6
189
10.9K