DOJO

3.9K posts

@Dojo__0

AI Research scientist. Working on FMRI & Pathology FMs.

Joined November 2022
2.2K Following, 86 Followers
DOJO
DOJO@Dojo__0·
@lucasmaes_ It's so good, bro! Going to play around with the models.
English
0
0
1
102
DOJO retweeted
Lucas Maes
Lucas Maes@lucasmaes_·
JEPA are finally easy to train end-to-end without any tricks! Excited to introduce LeWorldModel: a stable, end-to-end JEPA that learns world models directly from pixels, no heuristics. 15M params, 1 GPU, and full planning <1 second. 📑: le-wm.github.io
English
56
294
2.4K
251.3K
DOJO retweeted
Tanishq Kumar
Tanishq Kumar@tanishqkumar07·
I've been working on a new LLM inference algorithm. It's called Speculative Speculative Decoding (SSD) and it's up to 2x faster than the strongest inference engines in the world. Collab w/ @tri_dao @avnermay. Details in thread.
English
133
455
4K
599.7K
DOJO
DOJO@Dojo__0·
@initlayers If you're interested in local attention, do read these papers: one is "Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles", and the other, I believe, is "Window Attention is Bugged".
English
1
0
2
15
initlayers
initlayers@initlayers·
It's a Saturday afternoon and I'm preparing a presentation to explain to my university what I'm doing during my internship. The funny part is most of the technical details probably won't make much sense to them anyway. Still have to make the slides, simplify everything, and explain months of work in a few minutes. Sometimes that's just how these things go.
English
1
0
16
452
DOJO
DOJO@Dojo__0·
@initlayers HRNet is based on multi-scale feature extraction, while DANet is based on attention across channels and across spatial positions.
English
1
0
1
8
initlayers
initlayers@initlayers·
@Dojo__0 Cool. I can feel you. What are these papers btw?
English
1
0
1
27
initlayers
initlayers@initlayers·
One thing I've realized while reading a lot of papers: many of them are just careful combinations of ideas that already worked before. When you don't follow the chronology, every paper feels extremely complex and novel. But once you trace the lineage, you start seeing the pattern. Good parts from previous work get reused, tweaked, and sometimes that combination produces surprisingly strong results. The real skill is knowing what to borrow and what to change.
English
4
4
129
5K
DOJO
DOJO@Dojo__0·
@VazeKshitij Let's go bro!! Ah dude you're killing it!!
English
0
0
1
10
kshitij vaze
kshitij vaze@VazeKshitij·
So......I can confirm that Deepinder Goyal has seen the comments and the mentions tagging me on the temple hiring post. I just had a conversation with them, and the storm that all of you created over there is the first thing that Team Temple brought up in my interview with 'em. Deepinder himself was asking if the team knew about my profile and if they were looking into me - I don't think I can put into words just how damn grateful I am for each and every single last one of you guys, man! It went well, lasted for 27-ish minutes. From what I can make out at my end, I did well, and according to Team Temple, it was a good conversation. My resume will now go to the technical team, and we'll move on from there. There will be a few more conversations with the tech team, and we'll land on an answer about the further proceedings soon enough. IDK man, I AM STARTING TO HAVE HOPE!!!!
English
175
13
1.2K
59.5K
DOJO retweeted
Arth Singh
Arth Singh@iarthsingh·
Not even a week in, and we've already hit 400+ members! If you haven't joined yet, you're missing out on some really interesting research project discussions happening on our Discord; link in the comments.
English
1
5
26
781
DOJO
DOJO@Dojo__0·
@initlayers Hey, I am interested. I will dm.
English
1
0
1
66
initlayers
initlayers@initlayers·
We're looking for research assistants or early-career research engineers at the Language Technologies Research Center, IIITH. If you're genuinely motivated to learn and build, this could be you. The work involves designing workflows for data ingestion, working with multiple ML/AI models, and building search and indexing pipelines. Familiarity with cloud infrastructure or building optimized standalone applications is a strong plus. You don't need to know everything already. If you're serious and willing to put in the work, you'll learn on the job. There's proper support, mentorship, and technical training to help you grow while you build. If this sounds like you, reach out.
English
35
4
210
10.7K
DOJO
DOJO@Dojo__0·
@prajdabre Something like InfoNCE would work well, but recently, at least in my opinion, a simple alignment loss plus an anti-collapse loss (coding rate regularisation) is enough if we simply don't want a large batch size.
English
0
0
0
53
Raj Dabre
Raj Dabre@prajdabre·
Slightly challenging ML question: Suppose you are building an embedding model for RAG where for a document and a related query, you generate embeddings which are as similar as possible. You confidently scrape a bunch of document-title pairs and train a transformer model by maximizing the cosine similarity between the embeddings of the document-title pairs. But this messed up completely. Your embedding model aligned a correct document-title pair during testing but also wrong ones. What happened? What is the fix?
English
27
2
143
12.3K
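The failure mode in the question above can be seen in a toy sketch. It assumes the problem is representational collapse: with only positive pairs, an encoder that maps every input to the same vector already maximizes cosine similarity. An in-batch InfoNCE loss, as suggested in the reply, penalizes that solution. The function names and the toy embeddings below are illustrative assumptions, not anything from the thread.

```python
import math

def cosine(u, v):
    """Cosine similarity between two plain Python vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def positive_only_loss(docs, titles):
    """Naive objective: 1 - mean cosine similarity over matched pairs only."""
    sims = [cosine(d, t) for d, t in zip(docs, titles)]
    return 1.0 - sum(sims) / len(sims)

def info_nce_loss(docs, titles, temperature=0.1):
    """In-batch InfoNCE: each document must pick out its own title
    among all titles in the batch (the other titles act as negatives)."""
    total = 0.0
    for i, d in enumerate(docs):
        logits = [cosine(d, t) / temperature for t in titles]
        m = max(logits)  # subtract the max for a stable log-sum-exp
        log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
        total += log_sum - logits[i]  # cross-entropy with target index i
    return total / len(docs)

# Hypothetical "collapsed" encoder: every input gets the same embedding.
collapsed = [[1.0, 0.0, 0.0] for _ in range(3)]
# Hypothetical "separated" encoder: each pair gets its own direction.
separated = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

print(positive_only_loss(collapsed, collapsed))  # naive loss is already minimal
print(info_nce_loss(collapsed, collapsed))       # stuck at log(batch size)
print(info_nce_loss(separated, separated))       # close to zero
```

The collapsed embeddings score perfectly on the positive-only objective but sit at log(batch size) under InfoNCE, which is the same loss a random guess among the batch would get. The alignment plus anti-collapse alternative mentioned in the reply targets the same failure without relying on many in-batch negatives.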
DOJO retweeted
Koustava Goswami
Koustava Goswami@koustavagoswami·
🚀 PhD Internship (will be hosting one person): Diffusion LLMs (DLLM)
Looking for PhD students with:
• hands-on DLLM research
• First-author published paper on DLLM (NeurIPS/ICLR/ICML/ACL/EMNLP)
Send Google Scholar + a brief introduction → DM
RT appreciated 🙏 #NLP #ML
English
1
14
171
16.1K
DOJO
DOJO@Dojo__0·
@chhuti_is More than 50 casualties..
English
0
0
0
219
DOJO
DOJO@Dojo__0·
@Neetivaan How far off is the GTRE engine deal?
Indonesia
0
0
0
36
ghatak
ghatak@Neetivaan·
the fine print of the India - US deal is still not out, so wait until everything comes out in detail
English
10
19
333
8.5K
Ash🫠
Ash🫠@Seizeyoloflow·
@Dojo__0 @Neetivaan It will be used for salaries and pensions, I guess!? What do you think?
English
1
0
0
42
ghatak
ghatak@Neetivaan·
Mega Textile Park, the GoI is planning slow de ath to Kanglus and P0rkis. Bhāratvarsh is here to stay ☀️🥶
English
16
108
1.3K
13.6K
Chandan Perla
Chandan Perla@Chandan_Perla·
We’re working on a project that we’re very bullish on. Building it with a lot of care. We’re looking for operators: people who get the vibe and get things done. If this interests you, feel free to drop a message. Let’s talk!
English
47
6
163
14.8K
DOJO retweeted
Wing Commonder Max
Wing Commonder Max@SkywardAdi·
@idrwalerts Dear IDRW team, this work was done by our teammates. Please give them credit in place of "Indian Reddit User"; it took a lot of time to get this information and to understand the exact part of the aircraft 🙏
English
9
19
263
16.1K