Sebastian Nehrdich

2.1K posts

@SebastianNehrd2

東北大学助教 Assistant professor, Tohoku University. Also in charge of Dharmamitra in collaboration with BAIR, UC Berkeley. Research in ancient Asian languages.

Sendai, Japan · Joined November 2020

1.8K Following · 7.3K Followers

Pinned Tweet

Sebastian Nehrdich@SebastianNehrd2·3 Jun

Thrilled to share that I will be joining Tohoku University in Sendai, Japan, as a tenure-track assistant professor from autumn this year. Happy to work on AI for Sanskrit, Chinese, Tibetan, Pali, and Japanese, and to educate a new generation of students who can work with these tools.

Sebastian Nehrdich@SebastianNehrd2·5d

This deserves attention! A whole book written by a Japanese Studies expert on a topic of Korean history relies heavily on AI tools for accessing the sources and acknowledges it clearly in the preface. Is this the kind of future we want for the humanities?

Sebastian Nehrdich retweeted

Dharmamitra@dharmamitra_ucb·12 May

We have recently changed our etext basis for Kangyur and Tengyur to the Esukhia Derge version, making folio-level linking to the ToUDA scanned Derge edition and to rKTs possible via the new "segment view" option in Mitra Deep Research & Explore!

Sebastian Nehrdich@SebastianNehrd2·11 May

@5chstereoAI you can always send us an email at dharmamitra.project@gmail.com about feature requests and feedback! We are happy to learn more and to explore how we can improve the platform.

5chStereo@5chstereoAI·11 May

😅 I just saw that you are Prof at Tohoku! I use your tools for research and they are very helpful! Do you have a link where I can submit feedback / feature request? For example, I think it would be very helpful to have a GIS type map showing how Dhamma travelled across time and geography.

Sebastian Nehrdich@SebastianNehrd2·11 May

Very modern technology meets very traditional scholarship! To the people who made those translations with painstaking effort in those years, what we can do with them nowadays must sound like absolute science fiction!

Dharmamitra@dharmamitra_ucb

Dharmamitra now includes the 国訳一切経 renderings with links to the individual pages on archive.org for those texts that are available on archive.org! The 国訳一切経 results appear in Mitra Explore and Mitra Deep Research responses.

Sebastian Nehrdich@SebastianNehrd2·11 May

@5chstereoAI We are already working together with our archive unit here at Tohoku University and just added links to the new Derge database!

5chStereo@5chstereoAI·11 May

@SebastianNehrd2 Do you also partner with other universities? x.com/ag0vb/status/2…

川村悠人@Ag0vB

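A great achievement. The "Tibetan Buddhism Portal," which comprehensively digitizes and publishes the Tibetan Buddhism-related materials held by Tohoku University, has launched at Tohoku University. touda.tohoku.ac.jp/collection/dat…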

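Sebastian Nehrdich retweeted

이상엽 • 李尚曄@SangyopLee·16 Apr

I appeared on Sharojapda, Seoul National University's general-education channel, to discuss topics such as the "hip Buddhism" craze, whether Buddhism is a religion or a philosophy, the appeal of Buddhist philosophy, and the basic doctrines of Buddhism. youtu.be/a6l44RlgcPU?si…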

Sebastian Nehrdich retweeted

Dharmamitra@dharmamitra_ucb·11 Apr

We just rolled out a massive update to our interface over the last few days! One of the biggest changes is that we replaced MITRA Search with MITRA Explore, which brings deep research functionality into the Search system with filter options and more 1/

Sebastian Nehrdich@SebastianNehrd2·11 Apr

There certainly is. The problem is more on the structural end, i.e. we don't usually have clean text-level parallels like in the case of the Tibetan translation of the Ybh etc., so the matching becomes less precise and reliable, and Pali literature has a lot of repetitions etc., which make automated matching a bit more tricky! But I think we can do this in a future update.

ℭ𝔥𝔬𝔷𝔞𝔫𝔤@ChozangNoyb·11 Apr

@SebastianNehrd2 Amongst the Agamas, Nikayas and Vinaya, is there enough correspondence to do a partial equivalent including Pali?

Sebastian Nehrdich@SebastianNehrd2·11 Apr

Since I have repeatedly been asked whether there is a way to just browse the mitra parallel data without all the DharmaNexus matching etc. around it, I created a small HTML page where you can do exactly that! dharmamitra.github.io/mitra-parallel…

Sebastian Nehrdich retweeted

Dharmamitra@dharmamitra_ucb·1 Apr

Dharmamitra is very concerned about the implications of machine translation software on the language abilities of the users! We therefore now introduce a new feature that will ask the users to manually translate a passage every five requests. 1/

Sebastian Nehrdich@SebastianNehrd2·27 Mar

Happening this weekend, online participation possible!

Dharmamitra@dharmamitra_ucb

Happening at Tohoku University on Saturday and Sunday! Please spread the news, and joining online is possible via this registration link: docs.google.com/forms/d/e/1FAI…

Sebastian Nehrdich@SebastianNehrd2·3 Mar

MITRA Explore is coming and it will be 🔥🔥

Dharmamitra@dharmamitra_ucb

Coming very soon: MITRA Explore will enable you to ask more open questions and get answers based on the powerful retrieval capabilities of Dharmamitra!

Sebastian Nehrdich retweeted

Dharmamitra@dharmamitra_ucb·3 Mar

Coming very soon: MITRA Explore will enable you to ask more open questions and get answers based on the powerful retrieval capabilities of Dharmamitra!

Sebastian Nehrdich@SebastianNehrd2·27 Feb

One recent observation by a colleague: “cracking complex Sanskrit sentences, understanding their grammar and translating them used to be a solitary joy. AI tools like dharmamitra make it less solitary, and less joyful.”

Sebastian Nehrdich retweeted

Dharmamitra@dharmamitra_ucb·20 Feb

We are happy to announce that Dharmamitra now features a board of advisors. They will advise on the kind of data that Dharmamitra includes, on the functionality and design of our applications, and on making sure that we keep providing tools and utility of the highest quality.

Sebastian Nehrdich@SebastianNehrd2·11 Feb

A bit late to the party but this is a great paper, and I am very happy to see that ByT5-Sanskrit is indeed a versatile model that adapts well and outperforms much larger LLMs in task-specific settings for Sanskrit!

Manoj Balaji@manojbalaji1

🧨 Think giant LLMs can do everything? Sanskrit poetry just put them on notice: a small, task-specific model beats instruction-tuned LLMs at converting verse → canonical prose. Curious? Read on. 1/n #AACL #AACLIJCNLP #AACLIJCNLP2025 #ACL #NLP #Sanskrit

Sebastian Nehrdich@SebastianNehrd2·10 Feb

This is sooo avoidable! There is zero intellectual effort in looking up the papers and citing them properly. At least that much effort one can expect, right?

Sebastian Nehrdich@SebastianNehrd2·10 Feb

I am reviewing for various computer science conferences these days and I have had to reject roughly 40% of the papers I have looked at so far purely because their bibliographies are filled with hallucinations.

Sebastian Nehrdich@SebastianNehrd2·13 Jan

2. Mitrasamgraha: A Comprehensive Classical Sanskrit Machine Translation Dataset arxiv.org/abs/2601.07314 A large dataset of parallel sentence pairs for Classical/Vedic Sanskrit to English, covering multiple domains and time spans. Useful for machine translation!

Sebastian Nehrdich@SebastianNehrd2·13 Jan

Two preprints of the Dharmamitra project: 1. MITRA: arxiv.org/abs/2601.06400 This paper describes our large multilingual parallel dataset release, the machine translation model, and our retrieval system. 1/
