Mike Wong

1.4K posts

Mike Wong banner
Mike Wong

Mike Wong

@artixels

pixel surgeon

latentpark Katılım Haziran 2009
919 Takip Edilen2K Takipçiler
Mike Wong
Mike Wong@artixels·
Captured a vid of sampling a tiny diffusion model with the neural engine (A17 Pro) on my phone. Not very fast but pretty acceptable given everything happened on the iPhone locally
Mike Wong tweet media
English
0
0
4
138
Mike Wong
Mike Wong@artixels·
☁️ Tiny diffusion model on @Apple iPhone Neural Engine (ANE) Converted one of my Tiny checkpoints into a CoreML package, and this image was generated in 3.78 seconds with memory use under 140MB on my iPhone 15 Pro. Local, private and energy-efficient 🥂
Mike Wong tweet mediaMike Wong tweet media
English
1
3
11
1.1K
Mike Wong
Mike Wong@artixels·
⛅𝐓𝐢𝐧𝐲 𝐃𝐢𝐟𝐟𝐮𝐬𝐢𝐨𝐧 𝐌𝐨𝐝𝐞𝐥𝐬 I trained before are fairly compact, the left on was from a 𝟗.𝟓 𝐌𝐁 one and the right was from a 𝟐𝟐 𝐌𝐁 one. It is fun to witness how a model learns the statistics which enables generative sampling of that photographic experience.
Mike Wong tweet mediaMike Wong tweet media
English
0
0
5
183
masahiro.teraoka
masahiro.teraoka@tiraoka·
nightlyだけどsm121対応のtorchが来たっぽい。 forums.developer.nvidia.com/t/dgx-spark-sm… とりあえず、インストールしてVACEを試したところ、flash_attnで詰まったので、pipでflash_attnをビルドでインストール. 結果2%ほどの速度向上笑 ほぼ誤差?? メモリは5%ほど少なくて済むようにはなった。うーんどうだろこれ
日本語
1
0
2
416
Mike Wong retweetledi
Garrett Johnson
Garrett Johnson@garrettkjohnson·
It's been a long time coming but the latest version of three-mesh-bvh brings support for out-of-the-box Line & Point cloud BVH support! Now all your geometries can be fast 🚀 1/3 #threejs #webgl #javascript
English
3
11
72
6K
Mike Wong retweetledi
MrNeRF
MrNeRF@janusch_patas·
Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting TL;DR: How to freeze a dynamic molecular scene when people inevitably move. Contributions: • A Novel Problem Formulation and Benchmark: We are the first to formally address synthesizing high-fidelity, freeze-time videos from monocular MC footage, providing a new benchmark and evaluation protocol. • A Targeted Regularization Framework: We propose a novel method to identify and regularize hidden and defective Gaussians, the primary sources of temporal artifacts, anchoring them to reliable past or future states. • State-of-the-art Performance with Zero Inference Overhead: We improve visual quality and stability in existing methods without architectural changes. As the deformation runs only once for a target instant, we achieve inference speeds exceeding 280 FPS on an RTX 4090.
English
1
4
41
2.9K
Lnyan
Lnyan@lkwq007·
Lnyan tweet media
ZXX
1
0
1
93
Mike Wong retweetledi
Shinji Ogaki
Shinji Ogaki@ShinjiOgaki·
今年は2次元のtransient renderingを使った作品を提出しました。 運営の皆様お疲れ様でした。 #レイトレ合宿
日本語
3
109
710
38.1K
hardmaru
hardmaru@hardmaru·
ICML’s Statement about subversive hidden LLM prompts We live in a weird timeline…
hardmaru tweet media
English
65
121
1K
811.3K