Kiken (@brushfushstuff) - Профиль Twitter

Kiken ретвитнул

jason liu@jxnlco·1d

ok @elder_plinius codex is on it

English

11

1

126

26.9K

Kiken ретвитнул

Anime Corner@animecorner_ac·1d

『OPENING MOVIE』 "STEINS;GATE RE:BOOT" The remake of the original 2009 visual novel will launch globally on August 20, 2026 It will feature updated graphics as well as additional story content.

English

152

2.1K

16.9K

986.5K

Kiken ретвитнул

John Yang@jyangballin·3d

How much of SQLite, FFmpeg, PHP compiler can LMs code from scratch? Given just an executable and no starter code or internet access. Introducing ProgramBench: 200 rigorous, whole-repo generation tasks where models design, build, and ship a working program end to end. 🧵

English

99

243

1.5K

690.2K

Kiken ретвитнул

Aidan McLaughlin@aidan_mclau·3d

the idea of a singleton or a coherent agi persona that survives into the late 20s seems absurd to me. agents will be steeped in billions of tokens of memories and take on vastly different personas depending on their deployment; their base model will be a distant common ancestor

English

56

17

366

20.3K

Kiken@brushfushstuff·3d

The measure of Sam's success is directly proportional to how long he lets tomfoolery like this go on. One can alternatively formulate this as the time-to-unlock on Dario's chastity cage. 2x usage. 10x usage. Resets galore. Sam is completely flexing.

Sudo su@sudoingX

few days into codex plus and i think i found the hack. nobody is talking about it and the value sitting in this subscription is wild. the hack: do not prompt the agent. write a single detailed task doc with every requirement laid out plus the final vision of what you are building, then fire codex cli with one line, accomplish this and test until done. it goes. hours of uninterrupted agentic coding on gpt 5.5 xhigh, no throttling, no rate cap, 'no can you clarify loop'. the agent has everything it needs in one place so it works the problem instead of working you. i have been grinding it since this morning, screenshot below shows the session past 24 mins and still running. anthropic burns through your daily allowance in three opus 4.7 prompts then your entire tier id is gone for the day. codex plus on the same money goes on and on while you go take a walk. this is the most underrated subscription in the agentic stack right now. the value is there if you front-load the prompt instead of conversation-mode it. give codex the brief, walk away, come back to a finished task. try this. loot the value while the math still favors you.

English

0

43

Kiken@brushfushstuff·4d

seems like @thsottiaux just reset codex limits. yeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeessssssssssssssssssssssssssssssssssssssssssssss

English

0

100

Kiken@brushfushstuff·4d

@yacineMTB jailtime for them + tiktok/etc. need a federal raid & supply chain risk designation on whatever the fuck meta is cooking up with those brain models for engagement.

English

0

2

309

kache@yacineMTB·4d

we should find the people responsible for youtube shorts and actually put them in jail. for a very long time

English

7

10

249

7K

kache@yacineMTB·4d

this is literally me every single time i have a conversation with a boomer hahahahahahaahahaha holy shit "yes there is a chinese guy in california that is tracking your kids' usage of social media pages, increasing it from 2 hours to 3 hours, saturating usage by highschool"

wanye@xwanyex

I’m sorry, I know this bums a lot of you out, because you’ve built your personality on being to pro-market, pro-technology guy, but technology is just very clearly making us less happy

English

14

15

664

90.1K

Kiken@brushfushstuff·4d

@hsu_steve OpenAI had earlier & better results & intentions with math from the start. Scooped academics/professional mathematicans & the like as a result; rolling growth basis as more of them come, more of them come. So, better results on those subjects.

English

0

481

steve hsu@hsu_steve·4d

I can't square the pro-Anthropic hype from Berkeley rationalist types (I was at Lighthaven for an extended period) with reality. For path-dependent reasons (old friends with OAI OGs), I have mostly used GPT, then Gemini (including DeepMind Co-Scientist), and open models. In physics (and probably math) GPT crushes Claude. The creator of Pi agent harness says Kimi 2.6 is almost as good as Claude 4.6-7 for coding and agentic flows. Can somone who is epistemically rigorous (not fake epistemic rigor I encounter all the time in Berkeley = FEELZ) enlighten me? CritPt benchmark below evaluates language models on solving unpublished, frontier-level physics problems that require genuine research-scale reasoning. The benchmark comprises 71 challenges (70 test challenges and one example), created by over 50 active physics researchers across 30 institutions and spanning 11 physics subfields

steve hsu@hsu_steve

Good interview with Pi agent harness creator Mario Zechner. I like Zechner - no nonsense guy, no hype. AI Agent reality vs hype: Differentiates between good-enough slop code (eg internal use, quickly clean/analyze some data, etc.) vs clean, efficient production code. Zechner: Kimi and DeepSeek are highly effective at coding tasks and represent a strong shift toward using open-weights models. Intelligence and Capacity: notes that models like Kimi (specifically mentioning the 2.6 version) provide intelligence comparable to what he previously received from Anthropic's models, stating that he does not require anything significantly more powerful for his workflows (19:17-19:44). Closing the Gap: Mario argues that open-weights models have caught up to frontier models, to the point where he no longer believes frontier models hold a significant edge in intelligence, specifically noting that he has observed regressions in some verticals for larger models (19:44-20:03).

English

36

11

201

37.1K

Kiken@brushfushstuff·4d

oh shit i think they're starting now

English

0

5

Kiken@brushfushstuff·4d

so does this Musk v. Altman shit have no audio still or is it just me // is there a better stream out there? it's lookin like these retards trolled/fucked up😂 youtube.com/live/tB7u6KQlu…

YouTube

English

1

0

35

Kiken@brushfushstuff·4d

@LexnLin i remember when o3 came out, i pointed it at months of calc III homework to test it and realized this fucking machine was doing it 10x faster than me with 99% of my accuracy. i still remember that day. in general, the time saves & learning, on the right person... wow

English

0

1

98

Leon Lin@LexnLin·4d

I have the feeling that many students don't wanna pay for a $20 AI subscription, because they think it's not worth it. You will never get the time back it can save. (im not talking about doing homework with ai lol)

English

15

2

110

3.5K

Kiken@brushfushstuff·4d

@yacineMTB Sethbling mention holy shit

English

0

2

61

kache@yacineMTB·4d

x.com/i/spaces/1qKVm…

ZXX

13

0

35

6.5K

Kiken@brushfushstuff·4d

@R2Cdev_ Time between each nonlinear/jumped around: 97d → 29d → 56d → 28d → 49d I think it'll be consistently faster for the next ones, but will see.

English

1

0

7

243

Raphi-2Code@R2Cdev_·5d

It’s linear!

English

6

1

69

10K

Kiken ретвитнул

JB@JasonBotterill·5d

Man I can’t I told Sam about my sister and her condition and he gave her account a free Pro subscription. So appreciative of this man

English

6

1

50

2.1K

Kiken@brushfushstuff·5d

@i_zzzzzz if my datacenters dont look like NERV hq i dont fukin wantt them

English

1

12

558

Brooks Otterlake@i_zzzzzz·6d

Data centers should be gigantic black pyramids shining beams of light into the sky

English

70

476

10.5K

147.8K

Kiken@brushfushstuff·5d

@justalexoki his cum is fucking liquid gold, wtf

English

0

10

2.6K

taoki@justalexoki·5d

holy shit this dude got juice. with the 1% pussy they could probably get pregnant just thinking about it

Bryan Johnson@bryan_johnson

Magic mushrooms dropped my sperm count 69%. 90 days later, my motile count in top 1% of all males. To our knowledge, this is the first time this has been documented in a human. Here is what we think happened. Sperm cells have tiny receivers on them called 5-HT2A receptors. Psilocybin turns those receivers on for 4-8 hours which causes the sperm to start swimming in wild, frantic patterns way too early. Like a sprinter who runs full speed before the race even starts. They burn out and the test sees them as broken. At the same time, psilocybin spikes your stress hormones cortisol and ACTH and elevates prolactin. High prolactin tells your body to slow down sperm production. So the factory got a pause signal right in the middle of making a batch. Your body makes a completely new batch of sperm every 9-11 weeks. Three months later I retested. Every single number came back better than before taking magic mushrooms. We don't yet know if psilocybin triggered the improvement or if my baseline was already trending up. Either way, these are my best fertility markers ever measured: Total motile count: 411 million Motility: 64% Morphology: 12% Concentration: 212 million Count: 642 million To put these numbers into perspective: the WHO considers a motile count above 42 million as normal, mine is 411 million, nearly 10x. And a normal concentration is 16 million (mL), mine is 212 million (mL). It appears that the factory shut down for one cycle and then rebuilt everything from scratch. After psilocybin, I did 5-MeO-DMT, which doesn't appear to cause the same problem. It clears your body in 1-2 hours which isn’t long enough to trigger the receptor effect. Note: I also did extensive travel including a trip to China, and had 3 weeks of disrupted sleep in December 2025. Both could have nudged my numbers down, but neither explains a 69% drop on their own.

English

13

4

361

253.4K

Kiken@brushfushstuff·5d

@scaling01 it's because it's interesting and for example follows up on similar claims/observations elsewhere, even if not directly applicable without some gymastics, for e.g. in this podcast about long running agents, what they call "agi-time" youtu.be/9-TVwv6wtGQ?si…

YouTube

English

0

78

Lisan al Gaib@scaling01·5d

I'm always surprised by how much engagement some posts get. like this one got 316 bookmarks? I'm literally just vibe-posting whatever and somehow it sticks

Lisan al Gaib@scaling01

I think returns to intelligence are nonlinear because decisions are path-dependent early choices in code, experiments, or strategy can compound positively or negatively over time for example by avoiding dead ends or preserving optionality it's why I am a big fan of very long running tasks and massive benchmarking budgets GPT-5.5 and Mythos Preview are only marginally more intelligent than previous models and have pretty much the same performance up to 10M tokens, but after that they go absolutely ballistic

English

11

4

64

6.2K