Sabitlenmiş Tweet
Sebastian Nehrdich
2.1K posts

Sebastian Nehrdich
@SebastianNehrd2
東北大学 助教 Assistant professor, Tohoku University. Also in charge of Dharmamitra in collaboration with BAIR, UC Berkeley. Research in ancient Asian languages.
Sendai, Japan Katılım Kasım 2020
1.8K Takip Edilen7.4K Takipçiler

MITRA Explore is coming and it will be 🔥🔥
Dharmamitra@dharmamitra_ucb
Coming very soon: MITRA Explore will enable to ask more open questions and get answers based on the powerful retrieval capabilities of Dharmamitra!
English
Sebastian Nehrdich retweetledi
Sebastian Nehrdich retweetledi

A bit late to the party but this is a great paper, and I am very happy to see that ByT5-Sanskrit is indeed a versatile model that adepts well and outperforms much larger LLMs in task-specific settings for Sanskrit!
Manoj Balaji@manojbalaji1
🧨 Think giant LLMs can do everything? Sanskrit poetry just put them on notice: a small, task-specific model beats instruction-tuned LLMs at converting verse → canonical prose. Curious? Read on. 1/n #AACL #AACLIJCNLP #AACLIJCNLP2025 #ACL #NLP #Sanskrit
English

2. Mitrasamgraha: A Comprehensive Classical Sanskrit Machine Translation Dataset arxiv.org/abs/2601.07314 A large dataset of parallel sentence pairs for Classical/Vedic Sanskrit to English, covering multiple domains and time spans. Useful for machine translation!
English

Two preprints of the Dharmamitra project:
1. MITRA: arxiv.org/abs/2601.06400 This paper describes our large multilingual parallel dataset release, the machine translation model, and our retrieval system. 1/
English

It took some years for us to make transformer-based implementations truly competitive (i.e. aclanthology.org/2024.findings-…), but the 2018 model has kept the edge even during the LLM explosion of the recent years. 2/
English

It still amazes me that the data-driven Sanskrit word segmentation model Oliver Hellwig (with a bit of contribution of myself) in 2018 created (aclanthology.org/D18-1295/) was completely trained and evaluated on CPU within a couple of hours! 1/
English





