Steve Smith

2.7K posts

@smithstephen

The AI training partner for firms that aren't Am Law 100. 3,000+ attorneys, 50+ firms. CEO @IntelByIntent. CLE keynotes. https://t.co/cxWwx2YFRD

Los Angeles, CA · Joined March 2009

1.8K Following · 520 Followers

Steve Smith@smithstephen·13h

smithstephen.com/p/he-turned-of…

Steve Smith@smithstephen·13h

His IT director told him ChatGPT memory was off. He showed me the setting. He felt fine. Then the model summarized a contract by referencing another client.

Steve Smith@smithstephen·1d

I’ve been putting the new @antigravity 2.0 through some testing this weekend. WOW is all I can say. What a generational change from the past version. They’ve also come a long way in creating office documents. It’s creating highly functional and beautifully designed spreadsheets for me, and it’s also able to use my PowerPoint template and create gorgeous slides. I’m impressed.

Steve Smith@smithstephen·1d

This is great. ;)

Shubham Saboo@Saboo_Shubham_

Karpathy joins Anthropic (The office edition) 😂

Steve Smith@smithstephen·2d

@hicasamadim 254.

Beyza@hicasamadim·2d

If you solve this, you're a genius. Can you solve it?

53.5K

725

5.1M

Steve Smith@smithstephen·2d

After that answer, I asked it "does that really make sense?" and it said "no" and finally gave me a real answer.

Steve Smith@smithstephen·2d

Here was what it came back with:

Steve Smith@smithstephen·2d

Gemini 3.5 Flash (with extended thinking on) was caught in a loop for over five minutes, cycling endlessly on what should have been a simple question. When it finally ended, the answer was nonsensical. @joshwoodward @GeminiApp @OfficialLoganK

Steve Smith@smithstephen·2d

@HarshithLucky3 @HCSolakoglu Exactly!

Harshith@HarshithLucky3·2d

@HCSolakoglu Not just Gemini 3.5 Flash: you get every Gemini model's raw performance in AI Studio. Try Gemini 3.1 Pro once in the Gemini app and once in AI Studio. The performance difference is huge; it feels like you're chatting with two different models.

1.6K

Hasan Can@HCSolakoglu·2d

Gemini 3.5 Flash is definitely much better in AI Studio than it is in the Gemini app. I don’t know how Google manages it, but the Gemini app consistently feels heavily constrained by its system and orchestration layer, to the point where it performs noticeably worse than the raw model.

782

79.1K

Steve Smith@smithstephen·3d

This is amazing.

NIK@ns123abc

🚨 Anthropic just dropped the first Project Glasswing update
Claude Mythos found 10,000+ critical vulnerabilities in ONE month:
> Cloudflare: 2,000 bugs, 400 high/critical severity
> Mozilla: 271 vulnerabilities in Firefox 150 — 10x more vulnerabilities found in Firefox 148
> UK AI Security Institute: first model to solve BOTH their cyber attack simulations end to end
> at one partner bank, Mythos prevented a fraudulent $1.5M wire transfer in real time
> wolfSSL: found a way to forge certificates on a crypto library used by billions of devices
> scanned 1,000+ open source projects
> 90.6% true positive rate after human review
> maintainers are asking Anthropic to SLOW DOWN because they can’t patch fast enough
> Microsoft says patch volume will “continue trending larger for some time”
The bottleneck in cybersecurity is no longer finding bugs. It’s fixing them.
“Progress on software security used to be limited by how quickly we could find vulnerabilities. Now it’s limited by how quickly we can patch them.”

Steve Smith@smithstephen·3d

I have to admit that I *hate* when I launch Claude Code without using --dangerously-skip-permissions. UGH.

Steve Smith@smithstephen·3d

Absolutely unprecedented growth. Incredible.

GURGAVIN@gurgavin

*ANTHROPIC EXPECTS REVENUE RUN RATE TO EXCEED $50B NEXT MONTH
2022 -> $10 MILLION
2023 -> $100 MILLION
2025
JAN -> $1 BILLION
MAY -> $3 BILLION
OCT -> $7 BILLION
DEC -> $9 BILLION
2026
FEB -> $14 BILLION
MARCH -> $19 BILLION
APRIL -> $30 BILLION
MAY -> $45 BILLION
INSANE…

Steve Smith@smithstephen·3d

smithstephen.com/p/karpathy-jus…

Steve Smith@smithstephen·3d

Andrej Karpathy. Anthropic. He could have gone anywhere. He picked the lab that's profitable, compute-rich, and winning the rooms that pay. That's the whole story this week.

Steve Smith@smithstephen·3d

@apptano @chetaslua I even tried it with “Breakdown.” vs. “Breakdown?”, thinking maybe the ? was somehow leading it astray. Nope. It worked correctly every time.

Mariusz Jakubowski@apptano·3d

@smithstephen @chetaslua Add "Breakdown?"

Chetaslua@chetaslua·3d

Gemini 3.5 Flash vs GPT-5.5 instant vs Sonnet 4.6 Remember guys. #1 in Finance Agent v2. SOTA performance right here. lol 🤣 Prompt : " 300+140=460 Is this correct? Breakdown? "

Vals AI@ValsAI

Google's Gemini 3.5 Flash is the new #1 model on our Finance Agent benchmark (v2), dethroning GPT-5.5 by six points.

142

750

232.5K

Steve Smith@smithstephen·3d

@apptano @chetaslua I did. Tried it three times. Correct answer every time.

Steve Smith@smithstephen·4d

@MatthewBerman @sonofalli That feels like going backwards. Even using a spell checker or grammar checker today is probably leveraging AI, so why wouldn’t you want spell checking done? There’s no excuse for misspelled words given the tools these days.

Matthew Berman@MatthewBerman·4d

@sonofalli We’re implementing a no-AI policy in our newsletter posts. I’d rather see errors and grammar issues than AI perfection.

3.4K

alli@sonofalli·4d

Obvious AI tells in your writing:
- em dashes
- not just x, but y
- and honestly?
- “leverage” “delve” “palpable”

123

251

37.9K
