tsuki

1.7K posts

tsuki

@tensorcore

Hiroyuki Ootomo. High-precision GEMM emulation on Tensor Cores. Work at #76B900. Cooking @cp_async. Hai-to-Yoka: https://t.co/jAdudlZfnb

Tokyo, Japan Katılım Kasım 2017

461 Takip Edilen3.3K Takipçiler

Sabitlenmiş Tweet

tsuki@tensorcore·16 Kas

emulation is all you need

English

18.8K

tsuki@tensorcore·48m

講義の資料を作らないとなのだけれど、そこはかとなくやる気が起きない。やっぱり教員には向いていないのだろうな。

日本語

tsuki retweetledi

msk@crcrpar·5d

イケメンに会いたいその機能はいらないと思うだけで日々生きてる

日本語

440

tsuki@tensorcore·3d

migrating from PhotoPrism to Immich

English

266

tsuki@tensorcore·5d

研究室のSlackに指導教員の変顔を投稿するチャンネルが欲しい

日本語

806

tsuki@tensorcore·5d

@alexUnder_sky Machiavelli would make a terrible author 😅

English

sacha🥝@alexUnder_sky·5d

@tensorcore goals > ethics, or how niccolo machiavelli said: the ends jusrify the means

English

tsuki@tensorcore·5d

I just got a review request for the same paper again, but from a different journal this time... What is wrong with the authors' ethics?

tsuki@tensorcore

thinking about the appropriate action to take when reviewing a paper that contains clear instances of plagiarism… Is it more appropriate to require a detailed explanation from the authors, or simply to recommend rejection?

English

1.2K

tsuki retweetledi

Kazuki Fujii@kazukifujii·4 Haz

テックブログ公開 Day5です FlashAttentionや昨今のHardware Awareな高速化手法を理解したり、提案したりする上で必須となるCUDA Programmingに関して、基礎から解説していくブログシリーズの第一弾です。3万字超えのブログですが、かなり分かりやすく書いていますのでぜひご覧ください。 CUDA Programming Guide Part 1｜Kazuki Fujii zenn.dev/kaz20/articles…

Kazuki Fujii@kazukifujii

テックブログ公開 Day4です。 RLVR(強化学習)時代において欠かすことのできないweight syncの機能についてvLLMがどのようにこれを実現しているのかやさしく解説を行いました。 RLVR時代におけるInference Framework: Weight Syncing編｜Kazuki Fujii zenn.dev/kaz20/articles…

日本語

508

58.2K

tsuki@tensorcore·5 Haz

Hey, CodeRabbit and Codex, can you communicate directly without going through me?

English

397

tsuki@tensorcore·4 Haz

夜食べても太らない美味しい無が欲しい。取り敢えず塩かき氷を試してみようかと。

日本語

183

tsuki@tensorcore·3 Haz

Surface RTX Spark Dev Box...? Not a BBQ plate?

English

369

tsuki@tensorcore·1 Haz

最近バカすぎて風邪ひいてない

日本語

252

tsuki retweetledi

SIAM Activity Group on Supercomputing@siag_sc·1 Haz

Don't miss our upcoming Supercomputing Spotlights webinar! Laura Grigori will be speaking about "Randomized mixed precision algorithms for large scale linear algebra problems" on June 10, 2pm UTC! More details + registration link here: siag-sc.org/randomized-mix…

English

289

tsuki@tensorcore·1 Haz

羽田とかの入国エリアで怒鳴っている案内係たち、早くAIに置き換わらないかな

日本語

286

tsuki@tensorcore·29 May

Recently, cheap-looking AI-generated images have been everywhere on blogs and social media. They often feel overloaded with details, with no sense of restraint or "beauty of subtraction." If this is the result of AI democratization, it's kind of pathetic.

English

246

tsuki@tensorcore·28 May

@AcceleratedMu3n !? もちろん、クッキーですから

日本語

acc-mu3n@AcceleratedMu3n·28 May

@tensorcore パチかわとパチワレは食べれるのですね!?

日本語

acc-mu3n@AcceleratedMu3n·28 May

少し前に某同僚から笹団子ちぃかわお土産で戴きました。可愛い。可愛すぎる出社するたびに、つい見つめてしまう☺️

日本語

599

tsuki retweetledi

msk@crcrpar·27 May

在籍してるオフィスでポジションいくつか空いてるぽい。私はオフィスにいないので「いやcrcrparの顔が見たくねーんだよ」という人も良いのでは💡💡💡

日本語

574

tsuki@tensorcore·25 May

codex、お願いするとMerge conflictを直してくれる。同僚みたいだ(？実際に直してもらったことないけれど。

日本語

295

tsuki@tensorcore·22 May

@HanGuo97 @jcz42 @yoonrkim @tri_dao s/@tensorcore/@__tensorcore__/ 🙂

929

Han Guo@HanGuo97·22 May

Finally, huge thanks to the incredible team: @jcz42, Arjun, Driss, @tensorcore, @yoonrkim, and @tri_dao! PDF: arxiv.org/abs/2605.19269 Code: github.com/HanGuo97/coda-…

English

5.7K

Han Guo@HanGuo97·22 May

LLM training is dominated by compute-heavy ops like MatMuls and attention. But it also has many memory-heavy ops: norms, activations, residuals, reductions. These mostly move tensors around. As FP8/NVFP4 make FLOPs cheaper, data movement gets harder to ignore. Fig: ~1B LLaMA-3 training