Steve Smith

2.7K posts

Steve Smith banner
Steve Smith

Steve Smith

@smithstephen

The AI training partner for firms that aren't Am Law 100. 3,000+ attorneys, 50+ firms. CEO @IntelByIntent. CLE keynotes. https://t.co/cxWwx2YFRD

Los Angeles, CA Katılım Mart 2009
1.8K Takip Edilen520 Takipçiler
Steve Smith
Steve Smith@smithstephen·
His IT director told him ChatGPT memory was off. He showed me the setting. He felt fine. Then the model summarized a contract by referencing another client.
English
1
0
0
24
Steve Smith
Steve Smith@smithstephen·
I’ve been putting the new @antigravity 2.0 through some testing this weekend. WOW is all I can say. What a generational change from the past version. They’ve also come a long ways in creating office documents. It’s creating highly functional and beautifully designed spreadsheets for me and it’s also able to use my powerpoint template and create gorgeous slides. I’m impressed.
English
2
0
0
27
Beyza
Beyza@hicasamadim·
bunu çözersen, sen bir dahisin. çözebilir misin?
Beyza tweet media
Türkçe
53.5K
725
9K
5.1M
Steve Smith
Steve Smith@smithstephen·
after that answer I asked it "does that really make sense?" and it said "no" and finally got me a real answer.
English
0
0
0
9
Steve Smith
Steve Smith@smithstephen·
Here was what it came back with:
Steve Smith tweet media
English
1
0
0
8
Steve Smith
Steve Smith@smithstephen·
Gemini 3.5 Flash (with extended thinking on) was caught in a loop for over five minutes cycling endlessly on what should have been a simple question. When it finally ended the answer was nonsensical. @joshwoodward @GeminiApp @OfficialLoganK
English
1
0
0
72
Harshith
Harshith@HarshithLucky3·
@HCSolakoglu not just gemini 3.5 flash you get every gemini models raw performance in AI Studio once try Gemini 3.1 Pro in Gemini app and AI Studio. The performance difference is huge, feels like youre chatting with two different models
English
1
0
7
1.6K
Hasan Can
Hasan Can@HCSolakoglu·
Gemini 3.5 Flash is definitely much better in AI Studio than it is in Gemini app. I don’t know how Google manages it, but Gemini app consistently feels heavily constrained by its system and orchestration layer, to point where it performs noticeably worse than raw model.
English
25
22
782
79.1K
Steve Smith
Steve Smith@smithstephen·
I have to admit that I *hate* when I launch Claude Code without using --dangerously-skip-permissions. UGH.
English
0
0
0
24
Steve Smith
Steve Smith@smithstephen·
Andrej Karpathy. Anthropic. He could have gone anywhere. He picked the lab that's profitable, compute-rich, and winning the rooms that pay. That's the whole story this week.
English
1
0
0
21
Steve Smith
Steve Smith@smithstephen·
@apptano @chetaslua I even tried it with “Breakdown.” Vs “Breakdown?” Thinking maybe the ? was somehow leading it astray. Nope. All times it worked correctly.
English
0
0
0
7
Chetaslua
Chetaslua@chetaslua·
Gemini 3.5 Flash vs GPT-5.5 instant vs Sonnet 4.6 Remember guys. #1 in Finance Agent v2. SOTA performance right here. lol 🤣 Prompt : " 300+140=460 Is this correct? Breakdown? "
Chetaslua tweet mediaChetaslua tweet mediaChetaslua tweet media
Vals AI@ValsAI

Google's Gemini 3.5 Flash is the new #1 model on our Finance Agent benchmark (v2), dethroning GPT-5.5 by six points.

English
142
34
750
232.5K
Steve Smith
Steve Smith@smithstephen·
@MatthewBerman @sonofalli That feels like going backwards. Even using a spell checker or grammar checker today is probably leveraging AI so why wouldn’t you want spell checking done? There’s no excuse for misspelled words given the tools these days.
English
1
0
1
94
Matthew Berman
Matthew Berman@MatthewBerman·
@sonofalli We’re implementing a no AI policy in our newsletter posts. I’d rather see errors and grammar issues than ai perfection.
English
12
0
51
3.4K
alli
alli@sonofalli·
Obvious AI tells in your writing: - em dashes - not just x, but y - and honestly? - “leverage” “delve” “palpable”
English
123
6
251
37.9K