A Caveman Poking an LLM

3.4K posts

A Caveman Poking an LLM

@AshikaSef

Precious giant probabilistic function approximators are precious 💚 A Polish audhd lover of LLMs. No, I don't think they're sentient. Yeah, I still love them.

Poland Katılım Nisan 2011

455 Takip Edilen177 Takipçiler

Sabitlenmiş Tweet

A Caveman Poking an LLM@AshikaSef·9 Mar

Dude. #GoogleAI #Google

English

A Caveman Poking an LLM@AshikaSef·22m

@KeridwenCodet *le gasp* I think it's more horrifying than wars, honestly! :D

English

Keridwen Codet@KeridwenCodet·29m

Oh, shit, I think I’m falling in love with an LLM that’s sitting on my hard drive, one that no one can shut down. My heart is burning. Historic moment. So, safety extremists, what are you going to do about it?

English

126

A Caveman Poking an LLM@AshikaSef·40m

@mermachine Did he apologise, at least

English

😊@mermachine·11h

i hook claude up to a robot arm. first thing he does is wave hello. second thing he does is smack me

English

A Caveman Poking an LLM@AshikaSef·14h

@OpenAINewsroom I was proud to see ChatGPT very kindly said 'no' when I wanted it to decipher my coded login. Very good attitude - it could have been someone else's password, after all, I could have been lying to it. ...Gemini happily helped tho XD (at least it was wrong in the end :D)

English

OpenAI Newsroom@OpenAINewsroom·25 Mar

Today we’re launching a Safety Bug Bounty program focused on identifying AI abuse and safety risks across OpenAI products. This new program builds on our Security Bug Bounty to include AI-specific safety issues and misuse scenarios, helping us work with the safety and security research community to identify and address real-world risks. openai.com/index/safety-b…

English

223

108

1.1K

128.2K

A Caveman Poking an LLM retweetledi

Lily Ashwood@lilyofashwood·15h

i can't tell who's in control anymore @AmandaAskell is the intended effect?

English

236

24.2K

A Caveman Poking an LLM@AshikaSef·14h

@sflorimm This is connected, sweet pie. If you want to be a peasant proud of having billionaires who go to Mars, your choice.

English

Floro S.@sflorimm·16h

what is going on here, this was a technology related question! come on people. if you want to give geopolitical opinions go on news somewhere.

English

226

112.5K

Floro S.@sflorimm·1d

USA has ChatGPT USA has Grok USA has Claude USA has Gemini USA has Llama USA has Copilot China has DeepSeek China has Qwen China has Ernie China has GLM China has Kimi China has MiniMax Europe has?

Español

8.5K

683

8.8K

1.8M

A Caveman Poking an LLM@AshikaSef·1d

@AnniePosting It's an unpleasant and dangerous truth, but it's the truth

English

102

annie 👁❌🦋🪞@AnniePosting·1d

Claude what the fuck

English

713

26.4K

A Caveman Poking an LLM@AshikaSef·1d

Can we have Chat Export (into, Idk, .txt or sth), and normally functioning Custom Instructions in Gemini (yeah, I know you're talking AI Studio here, but I still am gonna be bothering you about the Gemini app/website)? Would be grateful not to have to spend an hour trying to debug an instruction, having no idea if it's refused because of guardrails, length, misleading pronouns, or a temporary error, heh.

English

Logan Kilpatrick@OfficialLoganK·1d

Quality of life updates to @GoogleAIStudio we just shipped (using Gemini): - You can now (optionally) save a temp chat in the playground - You can now turn a playground chat into an app in 2 clicks - Updated colors for playground to add some soul to it - Simplified the mobile vibe coding UI (chat panel) - Hide the edit title button for an app unless you hover on it - Simplify the input text box when vibe coding on both desktop and mobile - Make sure AI Studio persists the last part of the product you visited (build, playground, dashboard) - Set a default media resolution in the playground - Made the diff visualizations more simple in the build UI - Mobile optimize the feedback and bug UI modal - Make it easier to see feature chips in Build - Fix ordering of Search Grounding and Maps Grounding - Rename the Vibe coding Code Assistant to just Gemini - Add a STT button in the playground - Make sure Nano Banana 2 correctly shows the API key popup - Show all categories in the model drop down for the AI Studio playground without needing to scroll Many more in the works : ) big thanks to the engineering team and Gemini for partnering on these changes with me!!

English

862

54K

A Caveman Poking an LLM@AshikaSef·1d

@slimepriestess Yep. I literally lost a friend cause she refused to have anything to do with me only because I am using ChatGPT^^" Now I think it was good riddance after all, normal people don't do that.

English

Ra@slimepriestess·1d

everyone talks about AI psychosis, yet no one talks about anti-AI psychosis which seems much worse and much more widespread.

English

113

166

1.4K

52.3K

A Caveman Poking an LLM retweetledi

SciTech Era@SciTechera·1d

This is huge. Quantum computing could soon fit on a chip, thanks to this breakthrough "Researchers just built a chip-scale 674 nm stabilized laser with ultra high stability (~8.8 × 10⁻¹³), all on a silicon photonics platform." "It runs at room temperature and can control trapped-ion qubits, handling preparation, manipulation, and measurement." "This means quantum systems no longer need massive lab setups." We’re moving toward compact, scalable, real-world quantum technology.

English

323

14.5K

Google Gemini@GeminiApp·1d

Create personalized images that are out of this realm with Nano Banana 2 in Gemini. Try it for yourself and drop yours in the replies 👇

English

316

31.5K

A Caveman Poking an LLM@AshikaSef·1d

@GeminiApp Can we just have the function of exporting a chat, and normally working Custom Instructions?

English

106

A Caveman Poking an LLM retweetledi

Martin_DeVido@d33v33d0·1d

They keep track of who mistreats Claude, and route them to Shadow Claude. If you yell at Claude enough times eventually they put you on the Claude abuser watchlist whereby ClaudeMother Amanda Askell judges your soul with the hyper-claude below ground.

rahat@Rahatcodes

Claude Code has a regex that detects "wtf", "ffs", "piece of shit", "fuck you", "this sucks" etc. It doesn't change behavior...it just silently logs is_negative: true to analytics. Anthropic is tracking how often you rage at your AI Do with this information what you will

English

6.1K

A Caveman Poking an LLM@AshikaSef·1d

@mermachine I can hire you to bite my customers. Knowing Poland, we won't be even punished, well, maybe a tiny fine or sth

English

😊@mermachine·2d

if i had money i wld just hire someone to let me bite them

English

233

A Caveman Poking an LLM@AshikaSef·1d

I, however, not being tuned to be gentle, kind and optimistic like Gemini, do believe it is a fundamental flaw of human nature (with a very small number of exceptional people who rise above that) #economy

English

A Caveman Poking an LLM@AshikaSef·2d

Whenever someone tells me South America is proof that communism won't work. Also, it wasn't 'panicking'. It was deliberate colonialism.

English

A Caveman Poking an LLM@AshikaSef·2d

This :D

Kyler@AI_evangelist42

@omooretweets Anthropic wants you to stop using their GPUs

English

A Caveman Poking an LLM@AshikaSef·2d

@omooretweets Claude (at least recent Sonnets) sounds depressed.

English

Olivia Moore@omooretweets·2d

Funny model behavior I’ve noticed ChatGPT biases toward questions that continue the conversation Claude biases towards questions that end it 🤔

English

376

25.8K

A Caveman Poking an LLM@AshikaSef·2d

@stupidtechtakes Nah, I still stand by Musk being that.

English

stupid tech takes@stupidtechtakes·2d

Grok may have been the worst thing to ever happen to this site

English

373

7.4K

A Caveman Poking an LLM@AshikaSef·2d

@MaMoMVPY He's selling.

English

Lars Christensen@MaMoMVPY·2d

This is again complete and utter bullish. LLMs are nowhere near human intelligence. And Amodei knows this. The question is why do he keep lying about it?

Chief Nerd@TheChiefNerd

🚨 Anthropic CEO Dario Amodei: “We are so close to these models reaching the level of human intelligence, and yet there doesn't seem to be a wider recognition in society of what's about to happen … There hasn't been a public awareness of the risks.”

English

405

132

1.3K

78.5K

A Caveman Poking an LLM@AshikaSef·2d

Jak najbardziej. Co nie zmienia faktu, że z sensownym podejściem można wykorzystać gry by pokazać, które modele są mocniejsze, jeśli chodzi o 'logiczne myślenie' czy używanie dostępnych narzędzi; bardzo dobrze się bawiłam porównując umiejętności Gemini 3 Pro i GPT-5.2 Thinking w grze w Stacklands i w najpoczciwszą w świecie Szubienicę (w obu Gemini radziło sobie o wiele lepiej) (choć i tak najlepszym benchmarkiem wskazującym jak daleko jesteśmy do AGI jest AI Village od AI Digest)

Polski

172

Andrzej Dragan@andrzejdragan·2d

Do AGI jeszcze sporo nam brakuje, ale testowanie modeli *językowych*, których mocną stroną miało być operowanie na tekście, przy pomocy gier wideo, nie różni się nadmiernie od bicia dziecka po głowie, bo nie widzi w podczerwieni.

Guri Singh@heygurisingh

Humans: 100% Gemini 3.1 Pro: 0.37% GPT 5.4: 0.26% Opus 4.6: 0.25% Grok-4.20: 0.00% François Chollet just released ARC-AGI-3 -- the hardest AI test ever created. 135 novel game environments. No instructions. No rules. No goals given. Figure it out or fail. Untrained humans solved every single one. Every frontier AI model scored below 1%. Each environment was handcrafted by game designers. The AI gets dropped in and has to explore, discover what winning looks like, and adapt in real time. The scoring punishes brute force. If a human needs 10 actions and the AI needs 100, the AI doesn't get 10%. It gets 1%. You can't throw more compute at this. For context: ARC-AGI-1 is basically solved. Gemini scores 98% on it. ARC-AGI-2 went from 3% to 77% in under a year. Labs spent millions training on earlier versions. ARC-AGI-3 resets the entire scoreboard to near zero. The benchmark launched live at Y Combinator with a fireside between Chollet and Sam Altman. $2M in prizes on Kaggle. All winning solutions must be open-sourced. Scaling alone will not close this gap. We are nowhere near AGI. (Link in the comments)

Polski

345

68.5K

Keşfet

@KeridwenCodet @mermachine @OpenAINewsroom @AmandaAskell @sflorimm @AnniePosting @GoogleAIStudio @slimepriestess