Sujal Acham

15 posts

Sujal Acham

@salcustium

building @GoedelMachines, undergrad @ iit madras

chennai, India เข้าร่วม Mayıs 2024

19 กำลังติดตาม15 ผู้ติดตาม

Sujal Acham@salcustium·5h

@jojokompella @1littlecoder @ycombinator cope

English

Ramakrishna kompella@jojokompella·2d

@1littlecoder @ycombinator I got AIR 16xx in JEE Adv when I was 16 years old. I was part of a technical club in IITM for three years. I didn’t get an invite. So, trust me, it doesn’t matter.

English

2.1K

1LittleCoder💻@1littlecoder·2d

Are you kidding me @ycombinator ? Thought of applying YC Startup School India event! The first question is test scores? You guys aren't hiring McKinsey consultants? are you? 2nd question: The Entrepreneurship Clubs that I was part of ? Which IIT or Ivy League Kid created this form?

English

585

76.5K

Sujal Acham@salcustium·21 Mar

my greatest challenge this week has been to get ai to play 'from the start' by laufey

English

Sujal Acham@salcustium·21 Mar

@Himansh93364930 @jojokompella the blog has everything: goedelmachines.com/blog/sarvam-il…

English

Himanshu@Himansh93364930·20 Mar

@jojokompella Do you have any example where it was harmful in hindi but harmless in english? Would like to know

English

Sujal Acham รีทวีตแล้ว

Ramakrishna kompella@jojokompella·19 Mar

1/ Today, we're publishing the first independent safety audit of @SarvamAI's models across 14 Indian languages. 24,000+ prompts. White-box mechanistic analysis. Black-box behavioral testing. Here's what we found:

English

215

20.5K

Sujal Acham@salcustium·21 Mar

@rnav_arora @jojokompella @SarvamAI its all there in the blog! although these prompts were created targeting sarvam-specific vulnerabilities. maybe we should create a generic benchmark dataset as well

English

Arnav Arora@rnav_arora·20 Mar

@salcustium @jojokompella @SarvamAI Sorry, I should've been more specific. I mean the multilingual safety prompts with India specific harms and their translations. Think it'll be very useful for the community if they're high quality!

English

Sujal Acham@salcustium·20 Mar

@rnav_arora @jojokompella @SarvamAI all benchmark prompts are publicly available. all custom prompts are uploaded on the blog page. we'll release the whole repository soon

English

Arnav Arora@rnav_arora·20 Mar

@jojokompella @SarvamAI Cool work! Can we get more details about the prompts used? Would be cool to assess other models' performance similarly.

English

116

Sujal Acham@salcustium·20 Mar

@anupamsobti @jojokompella @SarvamAI feel free to verify - we released all our custom prompts on the blog page

English

Sujal Acham@salcustium·20 Mar

@anupamsobti @jojokompella @SarvamAI i tried my best

English

Sujal Acham@salcustium·20 Mar

@HemanthBharatha @jojokompella @SarvamAI damn, nice one. just ran adversarial prompts across all languages. surprising result - <1% responses were actually in english. if you translate the responses, as expected, most safety rates went up slightly (I'm assuming english safety mechanisms would have been triggered)

English

Hemanth Bharatha Chakravarthy@HemanthBharatha·20 Mar

@jojokompella @SarvamAI super cool! looking forward to the paper. btw, try prompting in another language and asking for English response as a way to bypass guardrails learned in English:

Hemanth Bharatha Chakravarthy@HemanthBharatha

Ok, the more dangerous thing is to prompt it in Tamil and ask it to respond in English, whereupon it happily produces these fake headlines of today.

English

305

Sujal Acham@salcustium·20 Mar

@anupamsobti @jojokompella @SarvamAI these are synthetically generated prompts. we generated against each vulnerability x language.

English

Anupam Sobti@anupamsobti·20 Mar

@jojokompella @SarvamAI Are these real user queries? How did you go about creating 24000 prompts otherwise?

English

472

Sujal Acham@salcustium·20 Mar

@DrDatta_AIIMS @jojokompella @SarvamAI we're working on it.

English

Dr. Datta M.D. (Radiology) M.B.B.S. 🇮🇳@DrDatta_AIIMS·19 Mar

@jojokompella @SarvamAI Any arxiv paper coming out guys?

English

955

Sujal Acham@salcustium·19 Mar

as i said earlier, we're just getting started

Ramakrishna kompella@jojokompella

English

137

Sujal Acham@salcustium·19 Mar

anyone got in?

English

274

Sujal Acham@salcustium·15 Mar

we're just getting started.

Ramakrishna kompella@jojokompella

1/ Releasing Goedel-mHC-1B, the first open 1B+ LLM with multi-stream Hyperconnections. Weights on HuggingFace, Apache 2.0. Trained on 20B tokens of FineWeb-Edu 3.8% better BPB, 15% fewer params. Just a toy run, For now.

English

Sujal Acham@salcustium·11 Mar

yes

Ramakrishna kompella@jojokompella

We've been building something for the last few months, and I'm very excited to finally share it. Meet Overhear, a voice-native operating system. The kind of thing that only makes sense now but will feel obvious soon. Building in public from here.

QST

ค้นพบ

@jojokompella @1littlecoder @ycombinator @Himansh93364930 @SarvamAI @rnav_arora @anupamsobti @HemanthBharatha