Nishant Kambhatla

178 posts

@protonish_

he/him 👨🏻‍🔬PhD student @SFU NatLang Lab 🇨🇦. 🤖 Research on Neural Machine Translation, Multilingual NLP, and Decipherment.

Vancouver, British Columbia · Joined May 2011
217 Following · 69 Followers
Nishant Kambhatla (@protonish_)
@gneubig The serverless endpoints by Runpod.io have worked well for us - you don’t have to keep the endpoint permanently open, you can set the min/max workers for it to autoscale. Both cost effective and reliable. We’ve had a good experience with their customer service too.
Graham Neubig (@gneubig)
I'm looking for cost effective and simple ways to serve LLMs that we trained or fine tuned ourselves (7-70B range). What are the best options nowadays? (Self promotion welcome!)
Nishant Kambhatla retweeted
Andrej Karpathy (@karpathy)
# on shortification of "learning"
There are a lot of videos on YouTube/TikTok etc. that give the appearance of education, but if you look closely they are really just entertainment. This is very convenient for everyone involved: the people watching enjoy thinking they are learning (but actually they are just having fun). The people creating this content also enjoy it because fun has a much larger audience, fame and revenue. But as far as learning goes, this is a trap. This content is an epsilon away from watching the Bachelorette. It's like snacking on those "Garden Veggie Straws", which feel like you're eating healthy vegetables until you look at the ingredients.
Learning is not supposed to be fun. It doesn't have to be actively not fun either, but the primary feeling should be that of effort. It should look a lot less like that "10 minute full body" workout from your local digital media creator and a lot more like a serious session at the gym. You want the mental equivalent of sweating. It's not that the quickie doesn't do anything, it's just that it is wildly suboptimal if you actually care to learn.
I find it helpful to explicitly declare your intent up front as a sharp, binary variable in your mind. If you are consuming content: are you trying to be entertained or are you trying to learn? And if you are creating content: are you trying to entertain or are you trying to teach? You'll go down a different path in each case. Attempts to seek the stuff in between actually clamp to zero.
So for those who actually want to learn. Unless you are trying to learn something narrow and specific, close those tabs with quick blog posts. Close those tabs of "Learn XYZ in 10 minutes". Consider the opportunity cost of snacking and seek the meal - the textbooks, docs, papers, manuals, longform. Allocate a 4 hour window. Don't just read, take notes, re-read, re-phrase, process, manipulate, learn.
And for those actually trying to educate, please consider writing/recording longform, designed for someone to get "sweaty", especially in today's era of quantity over quality. Give someone a real workout. This is what I aspire to in my own educational work too. My audience will decrease. The ones that remain might not even like it. But at least we'll learn something.
Jeremy Howard (@jeremyphoward)
How does a company that's literally meant to be an expert at this get salaries so wrong? GBP45k for an ML research scientist in London... Really?
Nishant Kambhatla (@protonish_)
My Schengen visa was approved just in time for #EACL2023 . Super excited to meet #NLProc folk *in person* in Croatia ✌🏻
Nishant Kambhatla retweeted
Eric Zhu (@ericzhu)
are you kidding me rn?
Nishant Kambhatla retweeted
SIAM Activity Group on Dynamical Systems
"The mathematics of burger flipping" (by Jean-Luc Thiffeault): arxiv.org/abs/2206.13900 "What is the most effective way to grill food? Timing is everything, since only one surface is exposed to heat at a given time. Should we flip only once, or many times?"
Nishant Kambhatla (@protonish_)
🚨 New paper at #EAMT2022 - "Auxiliary Subword Segmentations as Related Languages for Low Resource Multilingual Translation" We construct pairs of “related languages” by segmenting a source twice, each time with a diff subword vocabulary size and tokenizer. w/ @anoopsarkar
EAMT2024 (@EAMT_2024)
It's almost here! The deadline for submitting your work to #EAMT2022 is in one week. We welcome anyone who is excited about #MT, from users to researchers to translators! More info: eamt2022.com #EAMT
Nishant Kambhatla retweeted
Sahil Bloom (@SahilBloom)
20 ideas I can’t stop thinking about:
Nishant Kambhatla retweeted
Ayaka Mikazuki (Keep4o) (@ayaka14732)
I miss the time when we write tokenizers from scratch. I had a clear mind back then. Tokenizers nowadays always surprise us.
Nishant Kambhatla retweeted
shreya rajpal (@ShreyaR)
always a mind***k when I remember that r/place and wordle were created by the same dude
Nishant Kambhatla (@protonish_)
Spent 2 hrs writing an elaborate OpenReview for a paper. It had 8 references to support my comments. I carefully re-read the paper & my #review before submission only to be redirected to the *error* page as the link had expired. Alas! Lost the review. Can't write again. In pain💔
Nishant Kambhatla (@protonish_)
@Noelle7777 @aclmeeting The registration form will ask you to specify the type of registration (regular/student/volunteer etc.) you want to opt for. No extra forms.
ACL 2026 (@aclmeeting)
The moment you have all been waiting for has arrived. We are pleased to announce that registration for the 60th annual meeting of the ACL is now open! The conference will be hybrid. aclweb.org/portal/content…
Nishant Kambhatla (@protonish_)
@jlibovicky Adding Russian, as a related language, to the mix might also help. Folks have shown in the past that adapting models trained on Russian to Ukrainian and Belarusian helps in low-res settings.
Jindřich Libovický (@jlibovicky)
We are building a direct 🇨🇿 Czech-Ukrainian 🇺🇦 translation system. Any parallel data would be appreciated 🙏 We have everything from Opus, TED talks, Wikimatrix, and some works of fiction. Anything we have missed?
Nishant Kambhatla retweeted