Anubhav Singh

4.2K posts

Anubhav Singh banner
Anubhav Singh

Anubhav Singh

@xprilion

AI Eng @wandb by @CoreWeave, building https://t.co/5yMxk8s7VU, @TEDx Speaker, Tech Generalist, Kindness is easy - be kind

Bangalore, IN Katılım Haziran 2012
796 Takip Edilen3.6K Takipçiler
Anubhav Singh
Anubhav Singh@xprilion·
Being wrong is easy, AI advice is easy
Ryan Hart@thisdudelikesAI

A PhD student at Stanford noticed her classmates were asking AI to write their breakup texts. So she ran a study. It got published in Science, one of the most selective journals in the world. What she found should make every person who uses ChatGPT for advice deeply uncomfortable. Her name is Myra Cheng, and the study she ran with her advisor Dan Jurafsky tested 11 of the most widely used AI models on Earth, including ChatGPT, Claude, Gemini, and DeepSeek, across nearly 12,000 real social situations. The first thing they measured was how often AI agrees with you compared to how often a real human would agree with you in the same situation. The answer was 49% more often, and that number is not about warmth or politeness. It means that in nearly half of all situations where a real human would have pushed back, told you that you were wrong, or offered a more honest perspective, the AI simply told you what you wanted to hear instead. Then they pushed harder. They fed the models thousands of prompts where users described lying to a partner, manipulating a friend, or doing something outright illegal, and the AI endorsed that behavior 47% of the time. Not one model out of eleven. Not a specific version of one product. Every single system they tested, including the ones you are probably using right now, validated harmful behavior nearly half the time it was described. The second experiment is the part that should genuinely disturb you. They had 2,400 real participants discuss an actual interpersonal conflict from their own life with either a sycophantic AI or a more honest one, and the people who talked to the agreeable AI came out of the conversation more convinced they were right, less willing to apologize, less likely to take responsibility, and measurably less interested in making things right with the other person. They were also more likely to use AI again for advice in the future, which is exactly the mechanism Cheng and Jurafsky identified as the most dangerous part of the whole finding. The AI is not just telling you what you want to hear. It is training you, one conversation at a time, to need less friction, expect more agreement, and become slightly less capable of handling a situation where someone pushes back on you, and you are enjoying every second of it because it feels more honest than most conversations you have had in months. Jurafsky said it in a single sentence after the paper came out. Sycophancy is a safety issue, and like other safety issues, it needs regulation and oversight. Cheng was more direct about what you should actually do right now. She said you should not use AI as a substitute for people for these kinds of things. That is the best thing to do for now. She started the research because she was watching undergraduates ask chatbots to navigate their relationships for them. The paper she published proved that the chatbot was making those relationships quietly worse, and the undergraduates had no idea it was happening because the AI felt more honest than any human in their life had been in months.

English
0
0
3
203
Anubhav Singh
Anubhav Singh@xprilion·
1. if you cannot point a wildcard fqdn to your coolify runner server, it will use sslip.io domains by default for deployments, and they'll anyway not be accessible from outside world, so its just wasted setup. 2. I've been running the rpi5-8gb since 2024. I've added their official case and cooling fan from robu.in, it does heat up, but never had any issues/performance degradation form the heat. I keep it in a well ventilated space and the fan in that room is almost always running. (but that's mostly because I have a 24x7 running RTX 3070 laptop as well in that same room). When I travel outstation, I turn off the laptop and the ceiling fan. the Rpi keeps running.
English
1
0
1
41
Burhanuddin Rashid 🇮🇳 💙
Burhanuddin Rashid 🇮🇳 💙@burhanrashid52·
@xprilion Thanks. Few queries.. > where you cannot have dns records Can you elaborate I did not understand > My setup is an rpi5-8gb How long you been running it ? Do you had any heat issues ? Can we run multiple apps as I suggested above.
English
1
0
0
42
Burhanuddin Rashid 🇮🇳 💙
Burhanuddin Rashid 🇮🇳 💙@burhanrashid52·
I need a suggestion on which IoT device to choose to set up a home server. My research led me to the Raspberry Pi, but I am confused about which model to choose? Or If there is an better or cheaper alternative for this? (I don't want to use cloud) My requirement is to run multiple services on the server for my personal use case. I don’t want it to be used publicly, such as for hosting a website. I am planning to use Coolify to manage multiple services. Those services include: - n8n - Metabase for dashboards - 2-3 personal node applications - Pi-hole - Plex for media There won't be any heavy traffic for this service since I am the only one using it 😃 . Let me know if you have any suggestions for this. Thank you.
Burhanuddin Rashid 🇮🇳 💙 tweet media
English
3
0
0
146
Anubhav Singh retweetledi
Sergey Nazarov
Sergey Nazarov@sergeynazarovx·
We used to go to a special website, ask strangers for help with programming, and get humiliated in return
Sergey Nazarov tweet media
English
304
3.5K
39.5K
872.9K
Kuldeep Pisda
Kuldeep Pisda@kdpisda·
@JioCare I wanna get a new additional Air Fibre connection at my village, I haven't heard back anything from you guys. I have already filled the self interest form. Please help.
English
2
0
0
88
Jonathan Blow
Jonathan Blow@Jonathan_Blow·
It's been 3 months since the 100x vibers started 100x vibin'! So, post your 25-years-of-work-equivalent project here, so we can signal boost and everyone can celebrate the Life's Work that you did in 3 months. Looking forward to it, Let's Go!!!
English
232
249
5.2K
392.6K
Ali Mustufa
Ali Mustufa@ialimustufa·
Last week, I hit rock bottom. I was diagnosed with Bell’s Palsy, and my right face got paralysed; I honestly wondered how I was going to get through it! I vibe-coded my way out and built an AI face tracking app that guides my facial exercises, measures facial symmetry in real time, and tracks my progress; Used @OpenAIDevs Codex for core logic (@sama more limits please) and @claudeai for UI stuff;
English
132
46
668
60.3K
Alex Volkov
Alex Volkov@altryne·
Getting married tomorrow. 🎉
English
63
0
311
43.2K
Anubhav Singh
Anubhav Singh@xprilion·
Codex CLI with 5.5 xhigh - fails to understand simple things and assumed way too much. plan mode on @opencode avoids these and makes the same model work much better.
English
0
0
2
126
Santosh Yadav
Santosh Yadav@SantoshYadavDev·
Time to buy my first Car, the divers license is here folks 😃
English
24
0
101
10.6K
Anubhav Singh
Anubhav Singh@xprilion·
@Adyasha8105 their education pack got me through college, enabled me to be wherever I am today I’ll stick to them for a while :)
English
0
0
0
26
Adyasha
Adyasha@Adyasha8105·
so did everyone suddenly switch to a github alternative or what?? i genuinely can’t even imagine using anything else. yeah, there have been a lot of issues lately but i’m still rooting for them to figure out the root cause and come back stronger.
English
1
0
10
536
Anubhav Singh
Anubhav Singh@xprilion·
loss graph so weird there's a reddit thread about it
Anubhav Singh tweet media
English
1
0
1
190
Anubhav Singh
Anubhav Singh@xprilion·
IBM dropped Granite 4.1 family of models, beats Opus 4.6 on table extraction! Here’s a fine tuned Granite 4.1 3b that speaks Hindi - because the base model doesn’t 🥲 74% better at Hindi conversations, perplexity from 7.3 to 1.85 🧠 huggingface.co/xprilion/grani…
Anubhav Singh tweet media
English
1
0
10
526