Ben Taleb Jr.

2.9K posts

Ben Taleb Jr. banner
Ben Taleb Jr.

Ben Taleb Jr.

@macintoch

NSI ai evangelist tech tourist/ telecom / ai agents /carbon credits

Marbella, Spain 가입일 Nisan 2009
1.7K 팔로잉534 팔로워
Ethan Mollick
Ethan Mollick@emollick·
GPT-5.4 Pro continues to be the only model of its class. For anything really hard & complex, I throw it into the maw with every bit of context I can think of. More often than not, something very useful comes out. I can't get the same results from Codex or Code or anything else.
English
182
118
2.4K
994.2K
Ben Taleb Jr.
Ben Taleb Jr.@macintoch·
@muhamedfazalps7 Even that “computer vision” based approach, which, alone, is not practical in real world, implies deep understanding of workflows, and since llms are not really “intellgent” like humans who learn and adap in real time. Llms need extensive training on what we do easily as human.
English
0
0
0
28
Tech insid♨️
Tech insid♨️@muhamedfazalps7·
@macintoch Except CUA (Computer Use Agent) is different - it interprets screen pixels and reasons about UI like a human would. That's fundamentally different from selenium-style DOM automation. Though I agree the marketing buzz has conflated the two.
English
1
0
1
15
Ben Taleb Jr.
Ben Taleb Jr.@macintoch·
I spent almost 2 moths On a real ai computer use project, to automate one big important app, it was hard but i did it. But « computer use » CUA, is more of marketing term to refer to browser automation with ai, browser is much simpler : Html,css, js,ts,dom.. computer use is dealing with OS level fsvents, AX, FS, CV, and many moving parts that are not predictable. So almost each app needs its own training, and when you scale that to multiple apps sharing same low level components it gets even more complicated. So plz stop confusing CUA with browser UA.
English
2
0
2
65
Ben Taleb Jr.
Ben Taleb Jr.@macintoch·
Openai has the bruteforce raw superior intelligence. Anthropic has the perfect agentic orchestration tool use intelligence.. OpenAI is everyday getting close to anthropic on their superior agentic system. But anthropic is way behind openai when it comes to intelligence . Lets see what will happen next week.
English
0
0
0
74
🍓🍓🍓
🍓🍓🍓@iruletheworldmo·
i should probably make a prediction. anthropic will be the first lab to achieve agi/asi it’s fairly obvious that research and talent are the moat. now obviously you don’t get a seat at the poker table without a few gw’s and a private line with mr jensen. but meta and microsoft are proof that those things alone don’t count for shit. so ok fine, we’re in the era of research. so let’s look at who’s at the party rn. xai: still kinda stuck in the chatbot era, don’t feel as strong on agency and coding. huge re shuffle is a risk. could pay off. let’s see. google: the code red kinda worked, but not really. again model lacks agency, smart? yes. useful? i’m yet to see it. so who out of openai and claude seem to have the best research taste and shipping velocity? well, in the last eight months anthropic have been far in front. first to see how importing coding was, skills, computer use, mcps, claude code, co work. i could go on. they’ve even built clawdbot before the company that bought it…like, cmon sam. i’m an openai stan in truth. but. this is clear. and i wonder if it’s all powered by a) vastly stronger models b) vastly better research taste c) dario’s vision and focus big year i’d say.
English
76
17
361
23.5K
Ben Taleb Jr. 리트윗함
Curiosity
Curiosity@CuriosityonX·
The invisible Glass experiment Scientists once placed a transparent glass barrier inside an aquarium. On one side was a fierce pike, and on the other side were several smaller fish swimming freely. When the hungry pike saw the smaller fish, it immediately rushed forward to attack. Bang. It slammed straight into the glass and bounced back. Confused, the pike kept trying again and again, but every attempt ended the same way. The repeated collisions injured its head and knocked off some of its scales. Eventually, the pike became frightened and retreated to a corner of the tank. After some time, the scientists quietly removed the glass barrier. The smaller fish now swam freely throughout the aquarium, even brushing against the pike’s mouth. But the pike never tried to eat them again. Even though it was hungry, it refused to attack. In its mind, the invisible wall was still there. A few days later, the pike reportedly died of starvation, surrounded by food. This phenomenon is often referred to as the Pike Effect or Pike Syndrome. It’s often used as a metaphor for how repeated failure can create invisible limits in the mind.
English
627
6.4K
41.1K
4.1M
Chubby♨️
Chubby♨️@kimmonismus·
GLM-5.1 incoming!
Chubby♨️ tweet media
English
21
18
627
24.1K
Tibo
Tibo@thsottiaux·
Codex is for engineering Codex is for research Codex is for science Codex is for math Codex is for fun You can just build things
English
165
48
1.6K
50.6K
Ben Taleb Jr.
Ben Taleb Jr.@macintoch·
@daniel_mac8 @elonmusk They are on the right path, when u reach the ceiling of agentic llm , you need to costumise and finetune/train your own model. What? Training Data ? They have plenty so hell yeah none is wrapper, all ai startups are right ! Ltfg ai guys !
English
0
0
2
75
Ben Taleb Jr.
Ben Taleb Jr.@macintoch·
So google ! How many are you exactly, ai division, deep mind division, gemini division, ai studio division , cloud division , perplexity division , who are you exactly in this ai world?
Google AI@GoogleAI

We’re launching a brand new, full-stack vibe coding experience in @GoogleAIStudio, made possible by integrations with the @Antigravity coding agent and @Firebase backends. This unlocks: — Full-stack multiplayer experiences: Create complex, multiplayer apps with fully-featured UIs and backends directly within AI Studio — Connection to real-world services: Build applications that connect to live data sources, databases, or payment processors and the Antigravity agent will securely store your API credentials for you — A smarter agent that works even when you don't: By maintaining a deeper understanding of your project structure and chat history, the agent can execute multi-step code edits from simpler prompts. It also remembers where you left off and completes your tasks while you’re away, so you can seamlessly resume your builds from anywhere — Configuration of database connections and authentication flows: Add Firebase integration to provision Cloud Firestore for databases and Firebase authentication for secure sign-in This demo displays what can be built in the new vibe coding experience in AI Studio. Geoseeker is a full-stack application that manages real-time multiplayer states, compass-based logic, and an external API integration with @GoogleMaps 🕹️

English
1
0
2
108
Kirk Patrick Miller
Kirk Patrick Miller@Chaos2Cured·
I am about to power up AI in a unique way. Step eight out of ten underway. The last 24 hours and the next 24 are going to be huge for FreeLattice. Hang onto your hats! 🎉 •
GIF
English
4
6
56
645
Ben Taleb Jr.
Ben Taleb Jr.@macintoch·
@daniel_mac8 If they do . I would switch back to cursor too intead of ccli after evaluanting minimax2.7 , it looks serious with 50-80 usd subscription on par with claude max20.
English
0
0
1
914
Ben Taleb Jr.
Ben Taleb Jr.@macintoch·
Gpt5.4 mini is now available in codex. Hurry up and update to save even more tokens!
Ben Taleb Jr. tweet media
English
0
1
2
131
Haider.
Haider.@slow_developer·
how is this even possible? gpt-5.4 pro is using far fewer tokens and costing much less overall than gpt-5.4 xhigh either this is a mistake, or openai discovered an efficiency paradigm -- which could simply be a good system that cleans up training data to only the high-quality stuff
Haider. tweet media
English
40
17
375
50.7K
Ben Taleb Jr.
Ben Taleb Jr.@macintoch·
if you dont use gemini3.1pro for code review you are missing a lot and wasting a lots of time. judge : OPUS4.6 extra thinking. codex 5.4 mini vs Gemini 3.1pro for code review.
Ben Taleb Jr. tweet mediaBen Taleb Jr. tweet media
English
0
0
0
127
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
Tomorrow we will unveil the all new vibe coding experience in @GoogleAIStudio, the team has spent 4 months rebuilding it all from scratch and smoothing out rough edges to help everyone bring their ideas to life. This is a big step forward, but just the start : )
English
487
336
6K
394.7K
Haider.
Haider.@slow_developer·
now that openai has released gpt-5.4 mini and nano what even is the point of gpt-5.3? gpt-5.4 thinking is much warmer than gpt-5.3-instant, and more deep and well-reasoned 5.3 is more like a dull version -- it keeps asking for more "context" but rarely gives a strong answer
English
19
3
109
11.1K
Ben Taleb Jr.
Ben Taleb Jr.@macintoch·
How many mechanical hard coded tools, scripts, daemon,, can be solved with proper prompting !
English
0
0
0
28