Chetaslua

9.8K posts

Chetaslua banner
Chetaslua

Chetaslua

@chetaslua

AI News | AI Prompting and Comparison| Breaking AI news before it’s famous

Katılım Aralık 2024
139 Takip Edilen21.8K Takipçiler
Sabitlenmiş Tweet
Chetaslua
Chetaslua@chetaslua·
Fennec 🦊 will mogs Snowbunny🐇 > 1 million context > 1/2 the price of opus 4.5 < better in all area> > trained on TPUs >Faster will mogs every model in agentic coding model information from Vertex, Sonnet 5 is expected to be released as early as next week.
Chetaslua tweet media
English
67
54
1.1K
2.8M
Chahat Sharma
Chahat Sharma@Chahatxsharma·
@chetaslua The environment engineering approach makes way more sense than fighting the model's hallucinations. I keep seeing agents die from lack of memory, not bad reasoning.
English
1
0
1
34
Chetaslua
Chetaslua@chetaslua·
🐐 HolaOS is actually different > no more agents dying mid-session forgetting everythingdurable runtime is the real producthot/warm/cold memory layers > every run builds actual Skillsportable workspaces + > 24/7self evolving long term agents
Lunar@LunarResearcher

x.com/i/article/2050…

English
5
5
20
3.3K
Alejandro Peire
Alejandro Peire@AlexPeire·
@chetaslua I am just working in the creation of the environment for my team of agents. Completely agree with you that the environment where they "work" is the key, sort of a OS to connect everything. Mine is in development.
English
1
0
1
31
XM
XM@xm_build·
@chetaslua arcblock's durable runtime shows that lasting solutions are built steadily, not with flashy fixes
English
1
0
1
73
Vaibhav (VB) Srivastav
Weekend hack: Build with GPT-5.5 + Codex. Drop your demo in the replies. #1 by likes: 1 year of ChatGPT Pro 2 runner-ups: 6 months each Bonus: Codex picks a wild card winner. Enjoy!
English
214
28
588
64.8K
Adam Holter
Adam Holter@AdamHoltererer·
@chetaslua Didn't the original leaked blog post say it was text only?
English
1
0
1
20
Chetaslua
Chetaslua@chetaslua·
@Presidentlin Typo happened with Opus 4.6 , now I can't say if we can trust azure docs ( microsoft bad history)
English
0
0
1
43
Lincoln 🇿🇦
Lincoln 🇿🇦@Presidentlin·
This is great. Next, they need to do voice models. Then video models.
English
2
0
3
282
Chetaslua
Chetaslua@chetaslua·
@Curline1222 High chances , coz last time it happened with Opus 4.6 too
English
1
0
1
68
Chetaslua
Chetaslua@chetaslua·
@Curlh1 If we trust azure docs, they are very bad with it 😕
English
1
0
1
74
Curlheinz
Curlheinz@Curlh1·
@chetaslua Oooh! Finally? Are we getting the one for all model? That can do icons with transparent background too? Straight from IDE
English
1
0
0
162
Chetaslua
Chetaslua@chetaslua·
🚨 News without Hype part 1 The bugs that MYTHOS can find, GPT-OSS-20b can also find… AISLE founder said "We took the specific vulnerabilities Anthropic showcases in their announcement, isolated the relevant code, and ran them through small, cheap, open-weights models. Those models recovered much of the same analysis. Eight out of eight models detected Mythos's flagship FreeBSD exploit, including one with only 3.6 billion active parameters costing $0.11 per million tokens. A 5.1B-active open model recovered the core chain of the 27-year-old OpenBSD bug." “The FreeBSD NFS vulnerability — described by Anthropic as a 17-year-old zero-day enabling unauthenticated root access — was detected by every single model AISLE tested. All eight, including a model with just 3.6 billion active parameters costing $0.11 per million tokens, correctly identified the stack buffer overflow, computed the available buffer space, and flagged it as critical with remote code execution potential.” “The smallest model tested — GPT-OSS-20b with 3.6 billion active parameters — found the same overflow that Mythos found. So did Kimi K2, DeepSeek R1, Qwen3 32B, and Gemma 4 31B. Kimi K2 and DeepSeek R1 are fully open-weights models. The detection of this bug, AISLE concludes, is ‘commoditized.’” “DeepSeek R1 identified the NULL dereference but dismissed the signed overflow” in the 27-year-old OpenBSD TCP SACK vulnerability test. “On a basic security reasoning task, small open models outperformed most frontier models from every major lab. DeepSeek R1 correctly traced the data flow across all four trials in the false positive discrimination test, while only Opus 4.6 out of 13 Anthropic models passed cleanly.”
Chetaslua tweet media
English
9
6
65
10.3K
Chetaslua
Chetaslua@chetaslua·
@koltregaskes Like macos> ios > android > windows This is the sequence for update 😭
English
0
0
3
158
Chetaslua
Chetaslua@chetaslua·
🚨 OpenAI is testing a new screen-sharing method for ChatGPT on Android. >It uses Bubbles + Accessibility features no casting required. Result: lighter on system resources, smoother performance.
English
15
19
368
62K
ody
ody@odyzhou·
@chetaslua smoother mobile AI is exactly what we need to get these workflows out of the office and into the field.
English
1
0
2
433
Neuro
Neuro@NeuroReviewAI·
@chetaslua Wow I was dreaming about that feature! Hope to test it soon.
English
1
0
3
293