barry farkus

955 posts

barry farkus banner
barry farkus

barry farkus

@paimon2cool

circling something deep

cyberia Katılım Mart 2024
109 Takip Edilen79 Takipçiler
barry farkus
barry farkus@paimon2cool·
@irl_danB Why can they delete their db via an api call is the real question
English
0
0
0
53
dan
dan@irl_danB·
this post is sad and enlightening just an absolutely wild level of misunderstanding about these systems, even from technical people using them in production I’m at a loss for how to remedy this gap as urgently and as quickly as needed
dan tweet mediadan tweet mediadan tweet mediadan tweet media
JER@lifeof_jer

x.com/i/article/2048…

English
89
36
632
135.4K
10x'er
10x'er@10x_er·
This stuff is literally not real. I give this feature 6-12 months tops before they remove it, bookmark this and confim. Like what do you mean you want me to pick how hard my model wants to try? I want it to try its hardest, always, and I want it to be fast. Why would I ever need to consider tradeoffs when doing knowledge work with AI.
10x'er tweet media10x'er tweet media
English
294
19
1.6K
344.8K
barry farkus
barry farkus@paimon2cool·
@gabriel1 The optimal number of hours isn’t a fixed number. Depends on the week
English
0
0
0
41
gabriel
gabriel@gabriel1·
people who rationalize it being optimal to work 40h weeks don't enjoy their work
English
15
10
468
50.8K
Nathan Baschez
Nathan Baschez@nbaschez·
My biggest challenge with vibe coding / agentic engineering lately has been getting stuck in what I call a "plan doom loop" - have AI write a plan - review myself, seems good - have AI review plan, it always finds something - repeat It drains my time and energy to determine how important the "findings" really are Who has solved this
English
365
6
347
69.8K
barry farkus
barry farkus@paimon2cool·
@yacineMTB Weren’t you in Costco building a robot from your phone a few months ago?
English
0
0
0
22
kache
kache@yacineMTB·
if you're not reading every single line of code these "coding" agents produce you are not going to make it
English
148
46
1.4K
69.8K
BuBBliK
BuBBliK@k1rallik·
> been paying $200/month for cloud AI APIs > laptop: M2 MacBook, 16GB RAM > tried running models locally, garbage quality after 4K tokens > read this TurboQuant breakdown on Tuesday > applied 3-bit KV cache compression > same MacBook now runs 100K token conversations > quality: identical to cloud > cancelled all API subscriptions Wednesday > it's been 3 days > saved $200/month forever > with a free algorithm from a free paper > my MacBook didn't change. the math did
BuBBliK@k1rallik

x.com/i/article/2037…

English
263
745
13.6K
2.1M
barry farkus
barry farkus@paimon2cool·
Full quote. “Or do you think it just fixes itself by doing this for long enough? I assure you that is not the case. Again, I agree hallucination rate tends to be lower because more tokens and good post training make self correction more likely”
English
0
0
1
17
Ian Sharar
Ian Sharar@aifilmmaker·
“do you think it just fixes itself by doing this for long enough? I assure you that is not the case.” I can’t tell if you’re fucking with me or just delusional, or maybe you think someone reading this would be too lazy to scroll up to clearly read you saying the opposite?? 😂😂😂😂😂😂 done with you child
English
2
0
0
44
truthache
truthache@truthache68·
Al will always hallucinate. It’s not mistake—just math mathing.
English
321
1.5K
9K
200.1K
barry farkus
barry farkus@paimon2cool·
I literally said reasoning gives them more chance to self correct, but that doesn’t prevent hallucinations. Scroll up. Arguing with retards is a bad habit of mine so going to stop here, but ask yourself why one of the big labs hasn’t solved a millennium prize or some other unsolved problem? If it was as simple as just letting reasoning continue for long enough it should be easy right?
English
1
0
1
33
Ian Sharar
Ian Sharar@aifilmmaker·
Nah bro you said they couldn’t even sell correct 😂 you said more reasoning wouldn’t result in better answers. Don’t lie. Yeah I said in theory, as one very small part of my answer. There are models that can run for days, OpenAI has talked about this, and they appear pretty flawless, but they aren’t for public use, obviously that’s way too expensive and computer intensive to offer as a service, but they exist, because, say it with me, “more reasoning - better answers.” Who knows what the limit is. You were arguing against this stop lying, anyone can scroll up lol
English
1
0
0
48
barry farkus
barry farkus@paimon2cool·
I’ve read the deepseek paper many times. Sure, reasoning tokens improve output - I’m not disagreeing with that. Your claim was that “in theory it could go on forever” - the passages you’re quoting don’t back this up like you seem to think. No one who actually knows how LLMs work is claiming this.
English
1
0
1
42
Ian Sharar
Ian Sharar@aifilmmaker·
😂😂😂😂😂😂 okay this is when deepseek learned that more time equals better answers. The paper even shows this example of it correcting itself thanks to more tokens, something you claimed it wouldn’t do. ALL of the models show this trend. More time - more accurate answers. Sucks to suck
Ian Sharar tweet mediaIan Sharar tweet mediaIan Sharar tweet media
English
1
0
0
75
Ian Sharar
Ian Sharar@aifilmmaker·
It is true lol you think I’m making it up? I’ve read the papers on these models, the chart is always there. In theory it could go on forever, but we don’t have unlimited compute. IMO AGI already came and got went anyway. It just means general, as in not one specific task like chess. These models can do chess, math, poetry, etc all in one model. Thats AGi but people keep changing the goalposts lol
English
1
0
0
59
Ian Sharar
Ian Sharar@aifilmmaker·
It literally fixes itself by doing it for long enough, on a graph it’s like a straight line going up - the longer the reasoning, the more accurate the answer. The only limit here (that we know of) is how long you are willing to let it run. I can find you a chart that shows this lol
English
1
0
0
96
barry farkus
barry farkus@paimon2cool·
So what if it hallucinates within the blocks? Or do you think it just fixes itself by doing this for long enough? I assure you that is not the case. Again, I agree hallucination rate tends to be lower because more tokens and good post training make self correction more likely. But reasoning models in no way prevent hallucinations. imo, hallucinations are a good and necessary part of models anyway.
English
1
0
1
91
Ian Sharar
Ian Sharar@aifilmmaker·
That’s not how it works. The reasoning is part of the output, and can go on for as long as you want, hopefully ending up on the right answer eventually but it’s not usually the first thing - the longer the reasoning the better the answer, that’s how they have models winning math and coding competitions, you just let them go for hours on end. It absolutely prevents them in a meaningful way, to the point where they basically don’t exist, especially for the long long reasoners they have (that aren’t publicly released)
English
1
0
0
89
barry farkus
barry farkus@paimon2cool·
Right but it does literally spit out the “first thing”. But the first thing is just conditioned on useful reasoning traces. I agree it reduces hallucinations in practice , but it doesn’t actually prevent them in a meaningful way. I’m disagreeing with your take - not agreeing with the video
English
1
0
1
87
Ian Sharar
Ian Sharar@aifilmmaker·
It absolutely changes it lol, but SIGNIFICANTLY giving them a better chance, hallucinations are so rare now that it’s hardly a problem anymore, if you’re using a SOTA model. It’s not as simple as just spitting something out even if they don’t know, like the video claims. There are tons of safeguards. Sure sometimes they fail but it’s not like the video says, sorry.
English
1
0
0
102
Jade Cole
Jade Cole@JadeCole2112·
Who cares. What matters is do they improve the quality and/or reduce the cost of software development? So far, the answer seems to be "not really".
Jade Cole tweet media
English
112
12
520
90.7K
Camus
Camus@newstart_2024·
Jimmy Carr drops a wild take on AI + physics "Strip the screens away, and we're still living in the 1970s. Nothing real has happened in physics since 1972. String theory? Nowhere. But point AI at real physics now? Everything else is just stamp collecting. Physics gave us all our technology. We could get a 50x productivity leap and true human flourishing... or it could go the other way." The real revolution isn't just AI—it's AI meeting physics. 1:02 clip inside—mind-blowing perspective.
English
330
274
4.2K
1.5M
Janet A. Carr
Janet A. Carr@janetacarr·
if LLMs will make software engineering obsolete, why don't they just generate binaries instead? not code & compile under the hood. Object code is data too. why do LLMs have to generate things that humans create?
English
763
167
5.5K
446K
Midnight Capital LLC
Midnight Capital LLC@Midnight_Captl·
The change from the @dwarkesh_sp podcast 2 months ago vs. the tweet below from @karpathy is genuinely insane It’s a night and day difference. We went from “these models are slop and we’re 10 years away” to “I’ve never felt more behind and I could be 10x more powerful” This all changed with Opus 4.5. It will be looked back on as a historical milestone.
Andrej Karpathy@karpathy

I've never felt this much behind as a programmer. The profession is being dramatically refactored as the bits contributed by the programmer are increasingly sparse and between. I have a sense that I could be 10X more powerful if I just properly string together what has become available over the last ~year and a failure to claim the boost feels decidedly like skill issue. There's a new programmable layer of abstraction to master (in addition to the usual layers below) involving agents, subagents, their prompts, contexts, memory, modes, permissions, tools, plugins, skills, hooks, MCP, LSP, slash commands, workflows, IDE integrations, and a need to build an all-encompassing mental model for strengths and pitfalls of fundamentally stochastic, fallible, unintelligible and changing entities suddenly intermingled with what used to be good old fashioned engineering. Clearly some powerful alien tool was handed around except it comes with no manual and everyone has to figure out how to hold it and operate it, while the resulting magnitude 9 earthquake is rocking the profession. Roll up your sleeves to not fall behind.

English
66
93
1.5K
293K
Daniel Levine
Daniel Levine@dlevine815·
We've got big plans to improve the core ChatGPT experience in 2026. What are some thing you'd love to see? Even small ideas welcome! Looking forward to getting them built 🙏
English
2.7K
102
2.3K
417K