Aman Goyal
@goyalaman03

974 posts

Tinkering @maruthlabs • 26 • opinions my own

Delhi, India · Joined January 2023
48 Following · 220 Followers
Aman Goyal@goyalaman03·
@sumanta35679701 Hi, sorry for the late response. We are a research-first company focused on building high-performance Small Language Models (SLMs) like Madhuram, which utilizes our proprietary Perseus architecture. You can read more about our work and us at maruthlabs.com
Aman Goyal@goyalaman03·
Hiring: Research Intern @ MaruthLabs

We are looking for a Research Intern to join us for a 3-month internship focused on pushing the boundaries of high-performance Small Language Models (SLMs).

The Role:
• Research & Experimentation: You will be given access to 0.5x H100 GPU compute to test and iterate on your own research ideas.
• Scaling Up: Upon reaching your research milestones, you will be granted access to an 8x H100 node for a full-scale training run.
• Integration: Successful experiments and optimizations will be integrated directly into our core model training pipelines.

Requirements:
• Strong proficiency in Python and a deep understanding of Transformer architectures.
• A research-oriented mindset with an interest in SLMs, efficiency, and context-length expansion.
• Degree is not a barrier: We value proof of work, GitHub contributions, and technical curiosity over formal credentials.

Details:
• Stipend: ₹15,000 per month.
• Duration: 3 months (extendable).
• Location: Remote.

How to Apply: Interested candidates should send their CV and a brief outline of a research idea they would like to explore on an H100 to contact@maruthlabs.com.

#MaruthLabs #LLM #Research #Hiring #MachineLearning #SLM
kshitij vaze@VazeKshitij·
Been a while, but we back at it again lads🫡
Aman Goyal@goyalaman03·
@ChinmayKak @AnshumanAI Hi Chinmay, while spending those 15K on a coding agent subscription seems like a good option, this is more to build a team. Plus an agent can't replace human thinking and novelty "yet". We will be increasing the stipend though once we get the green light to do so. :)
Chinmay@ChinmayKak·
@goyalaman03 @AnshumanAI Genuine question: wouldn't those 15k be better spent on a coding agent subscription? Might help you guys more. In any case, all the best! DM if you need help pls :)
Amreshwar Singh@amresh_war·
@goyalaman03 Hey, currently I am a student and want to pursue research in Automatic Modulation Classification for wireless communication in defence equipment. I am applying for this role hoping I will get the opportunity to perform my research.
Aman Goyal@goyalaman03·
@sumanta35679701 We could but we are also looking at this as a way to build a team 😀
sumanta Bhattacharyya@sumanta35679701·
@goyalaman03 No offense, but honestly look on AngelList, or get someone to work voluntarily in exchange for equity. However, you can hire Claude with that money as well.
Aman Goyal@goyalaman03·
Recommend some good books to read to pass time while the GPUs free up 🫠
Aman Goyal@goyalaman03·
Planning to do a space in about 30 minutes. See you there.
Aman Goyal@goyalaman03·
@silver__tsuki Hi Rahul, I think 0.5x H100 is good enough for running ablation studies for a 150M params model. Besides, it will be available 24x7 for use without any restrictions and we will provide 8xH100 (or even more and better GPUs) for the complete training run. :D
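The "0.5x H100 is good enough for 150M-param ablations" claim above can be sanity-checked with the standard ~6·N·D training-FLOP estimate. A back-of-envelope sketch — the token budget, MFU, and peak-throughput figures are my own assumptions, not MaruthLabs numbers:

```python
def training_flops(n_params, n_tokens):
    # standard ~6*N*D estimate of dense-transformer training FLOPs
    return 6 * n_params * n_tokens

def days_on_half_h100(flops, peak_flops=989e12, mfu=0.35):
    # peak_flops: H100 SXM bf16 dense peak (~989 TFLOP/s);
    # the 35% MFU and the 0.5-GPU share are assumptions
    seconds = flops / (peak_flops * mfu * 0.5)
    return seconds / 86400

# 150M params trained on ~3B tokens (a Chinchilla-style 20 tokens/param budget)
flops = training_flops(150e6, 3e9)   # 2.7e18 FLOPs
print(days_on_half_h100(flops))      # ~0.18 days, i.e. a few hours per full run
```

Under these assumptions a full 150M-param run is a matter of hours on half an H100, so smaller ablation runs fit comfortably.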
Aman Goyal@goyalaman03·
@AnshumanAI Hi Anshuman, we understand that ₹15,000 might feel low, but it is the best we can do currently.
Anshuman@AnshumanAI·
@goyalaman03 For 15,000 even a cleaning worker wouldn't pick up a broom. "Research"
Aman Goyal@goyalaman03·
@Naive_enough Hi Aditya, thank you for your interest. Sadly we don't have any full-time roles currently.
Anik Dey@notanikdey·
@goyalaman03 just applied via email, interested in working on SLM reasoning through structured adapters
Aman Goyal@goyalaman03·
@eliebakouch Would you suggest having a smaller model (so that it could be served) with great performance at a very long context like 1M? The model obviously wouldn't be the best at very complex tasks, but it would be good enough for areas like legal tech, where long context is a challenge.
elie@eliebakouch·
separating infra and science for long context doesn't make sense; most long-context science is about making computation and memory (capacity and bandwidth) feasible at scale. today's infra wouldn't support MHA on a 1T model at 1M context
dr. jack morris@jxmnop

it is endlessly fascinating to me that we still don't have a true 1M-context model. it's an unusual case where the infra is far ahead of the science. Claude discontinued 1M+ context bc it didn't really work past ~200k. we don't have the right data? training techniques? not sure

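elie's infra point can be made concrete with a back-of-envelope KV-cache calculation for vanilla MHA; the model dimensions below are hypothetical, chosen only to be plausibly 1T-class:

```python
def kv_cache_gib(n_layers, n_heads, head_dim, seq_len, bytes_per_elem=2):
    # per sequence: keys + values (the factor of 2), one entry per
    # layer * head * head_dim * token, at bf16/fp16 (2 bytes) by default
    return 2 * n_layers * n_heads * head_dim * seq_len * bytes_per_elem / 2**30

# hypothetical 1T-class dense-model dimensions (assumed, not from the thread)
print(kv_cache_gib(n_layers=128, n_heads=128, head_dim=128, seq_len=10**6))
# -> 7812.5 GiB (~7.6 TiB) of KV cache for a single 1M-token sequence
```

Several terabytes of cache per sequence is why long-context work leans on GQA/MQA, sliding windows, or other attention variants rather than plain MHA.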
funi@funirudh·
how many of yall still buy books from offline bookstores im trynna see sumn
Fiora Starlight@FioraStarlight·
Do people ever do, like, reverse SFT? Like, constructing an example of an unwanted behavior, and having the loss signal say "you should put zero probability on every token in this sequence"?
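What Fiora describes is close to "unlikelihood training": instead of maximizing log p(token) on a good sequence, you minimize the probability of each token in a bad one. A minimal PyTorch sketch under that reading (the function name and toy shapes are mine, not a reference implementation):

```python
import torch
import torch.nn.functional as F

def unlikelihood_loss(logits, bad_tokens):
    """Loss that pushes probability mass AWAY from an unwanted sequence.

    logits:     (seq_len, vocab) model outputs at each position
    bad_tokens: (seq_len,) token ids of the unwanted continuation
    Instead of maximizing log p(token) as in SFT, maximize
    log(1 - p(token)), driving p(token) toward zero.
    """
    probs = F.softmax(logits, dim=-1)
    p_bad = probs.gather(-1, bad_tokens.unsqueeze(-1)).squeeze(-1)
    # clamp before the log for numerical stability when p_bad -> 1
    return -torch.log((1.0 - p_bad).clamp_min(1e-6)).mean()

# toy usage: random "model" logits over a 10-token vocabulary
logits = torch.randn(4, 10, requires_grad=True)
loss = unlikelihood_loss(logits, torch.tensor([1, 3, 3, 7]))
loss.backward()  # gradients now lower the probability of those tokens
```

Note that naively zeroing a token's probability gives no signal about where the mass should go instead, which is why this is usually mixed with a standard likelihood term on good data.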
Aman Goyal@goyalaman03·
What's up with X showing me all these sports-related posts? All I want to see is new papers. This is frying my brain.