David Braun (dbraun.bsky.social)

247 posts

David Braun (dbraun.bsky.social)

@DoItRealTime

PhD student @PrincetonCS audiovisual ML. @CCRMA/@Stanford, @BrownUniversity @dbraun.bsky.social

Princeton, NJ Beigetreten Aralık 2016

436 Folgt1.9K Follower

Angehefteter Tweet

David Braun (dbraun.bsky.social)@DoItRealTime·20 Eki

🎛️Audio ML friends, I made a transpiler from Faust code to JAX! Faust is a powerful language for sound synthesis, so connecting it to JAX while supporting differentiability and learnable parameters is a huge research opportunity. 🧵

David Braun (dbraun.bsky.social) tweet media

English

127

David Braun (dbraun.bsky.social)@DoItRealTime·21 Şub

@jon_barron Have your agent learn argbind. It works great with dataclasses. My fork and dev branch has some minor improvements.

English

433

Jon Barron@jon_barron·21 Şub

Deleting all my config dataclasses today. Each hparam is now just a default argument value and an `# hparam` comment so it’s greppable. Sweeps are just an agent running a sed script on the repo, launching, and reverting. Model variant configs are all bash scripts of sed calls.

English

100

15.4K

David Braun (dbraun.bsky.social)@DoItRealTime·25 Oca

@OfficialLoganK Could Google continue the jax-metal project? Apple seems to have abandoned it. See jax-ml/jax/pull/34485

English

449

Logan Kilpatrick@OfficialLoganK·25 Oca

Mac mini ordered

Magyar

231

2.3K

770K

David Braun (dbraun.bsky.social)@DoItRealTime·13 Ara

@_JonghoChoi 된장인지 ㅁㅁ인지 고민…

한국어

Jongho Choi@_JonghoChoi·13 Ara

jukebox보다가 수노 2인가쯤에 오 소리에 아티팩트 제법 적은 음악 비슷한걸 만드네~ 제법이네 하면서 신기해했던게 진짜 얼마안되는데 이제 수노가 걍 사람이만든 음악보다 좋은거같다.. youtube.com/watch?v=dGJiSg…

YouTube

한국어

276

David Braun (dbraun.bsky.social)@DoItRealTime·19 Ağu

@bigblueboo 5/4 DnB! youtu.be/JN-5mPtluhQ?si…

YouTube

Charlie Deck@bigblueboo·19 Ağu

Eh. I’m a huge fan of music diffusion but there are persistent gaps between islands of coherency in the distribution. Try having your DnB track in 5/4. Or a mathrock bebop. Or a Klingon showtune. They nail commercial music tho

@levelsio@levelsio

So now that AI can do music really well too What remains human is - live performances - authenticity/emotional bond - using AI during live performances (imagine live generating music on the fly) - ultra-famous artists

English

387

David Braun (dbraun.bsky.social)@DoItRealTime·15 Ağu

@giffmana Also missed arxiv.org/abs/2411.18447 by @marco_ppasini et al.

English

874

Lucas Beyer (bl16)@giffmana·15 Ağu

Sad that they don't cite JetFormer at all, it's imo extremely related and one of my favourite papers of last year! More people need to know about (and cite) JetFormer. Other than that, looks like a nice paper.

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale "Autoregressive models—generating content step-by-step like reading a sentence—excel in language but struggle with images. Traditionally, they either depend on costly diffusion models or compress images into discrete, lossy tokens via vector quantization (VQ). NextStep-1 takes a different path: a 14B-parameter autoregressive model that works directly with continuous image tokens, preserving the full richness of visual data. It models sequences of discrete text tokens and continuous image tokens jointly—using a standard LM head for text and a lightweight 157M-parameter flow matching head for visuals. This unified next-token prediction framework is simple, scalable, and capable of producing stunningly detailed image"

English

457

70K

David Braun (dbraun.bsky.social)@DoItRealTime·3 Tem

@miketuritzin Yea it was. Kudos to Derivative for supporting it early.

English

Mike Turitzin@miketuritzin·3 Tem

@DoItRealTime Oh nice, yeah definitely looks like a similar idea. That must have been a DK2?

English

Mike Turitzin@miketuritzin·2 Tem

Just ran across this tweet of mine from 2019. It doesn't look like that much outside of VR (and with Twitter compression) but this was one of the most trippy things I ever experienced in VR. The way the tunnel is "head locked" is really wild to experience.

Mike Turitzin@miketuritzin

Made this simple VR music visualization when testing FFT code in my app. The way it appears in VR is much different, as the start of the "tunnel" is head locked, so it appears that you are creating a psychedelic tunnel (that you can't look away from!) as you move your head.

English

1.6K

David Braun (dbraun.bsky.social)@DoItRealTime·22 Haz

@iquilezles Would alternating the phases of the rotors increase throughput?

English

343

inigo quilez@iquilezles·21 Haz

We improvised a marble elevator with some lego technique and regular legos, for a duplos marble run we had around.

English

366

29.3K

David Braun (dbraun.bsky.social)@DoItRealTime·2 Haz

@nsthorat Does this have any advantage over Claude Code + PyCharm?

English

145

Nikhil Thorat@nsthorat·2 Haz

I am officially 100% sold on cursor being revolutionary. I built something this weekend I had estimated 2 weeks of work, done in 2 days and because cursor is not lazy, the code is very readable with small grokable components. Cursor + Sonnet 4 is amazing.

English

4.4K

David Braun (dbraun.bsky.social) retweetet

arXiv Sound@ArxivSound·21 May

``DAC-JAX: A JAX Implementation of the Descript Audio Codec,'' David Braun, ift.tt/FzlXUdq

English

1.4K

David Braun (dbraun.bsky.social)@DoItRealTime·19 May

In other words, if I wanted to explore usage of a real-time, 8.2 ms latency, 44.1 kHz DAC model, JAX might be faster. Of course, more analysis and testing are welcome. More details in the paper linked at github.com/DBraun/DAC-JAX

English

309

David Braun (dbraun.bsky.social)@DoItRealTime·19 May

I benchmarked the chunked compression/decompression speeds. These are the functions you would use on long files or streaming. For a hop size of 8.2 ms, JAX performs compression in 7.1 ms and decompression in 4.3 ms. PyTorch performs compression in 8.3 ms, decompression in 6.3 ms.

English

388

David Braun (dbraun.bsky.social)@DoItRealTime·19 May

Happy to release "DAC-JAX: A JAX Implementation of the Descript Audio Codec." This can reuse PyTorch weights of all model sizes, and it includes a device-parallel training script. It uses the standard JAX libraries: Flax, Optax, Orbax, and CLU. github.com/DBraun/DAC-JAX

English

2.1K

David Braun (dbraun.bsky.social)@DoItRealTime·8 May

@bigblueboo This may be a coincidence, but The Beatles also poured paint on a piano in Strawberry Fields Forever.

English

134

Charlie Deck@bigblueboo·8 May

This ad reads more like a darkly ironic comment in Gen AI than a celebration of thin, luxe consumer electronics

Tim Cook@tim_cook

Meet the new iPad Pro: the thinnest product we’ve ever created, the most advanced display we’ve ever produced, with the incredible power of the M4 chip. Just imagine all the things it’ll be used to create.

English

David Braun (dbraun.bsky.social)@DoItRealTime·8 May

@naotokui_en It’s possibly a reference to Strawberry Fields Forever in which The Beatles pour paint on a piano.

English

228

Nao Tokui@naotokui_en·8 May

Really, Apple? It's so sad and heartbreaking to see the new iPad ad from Apple, a company that used to respect the craftsmanship of tool makers. Pure arrogance.

English

1.9K

David Braun (dbraun.bsky.social)@DoItRealTime·5 Ara

The afternoon workshop is here youtube.com/watch?v=VIlCY7… Thanks to the Programmable Audio Workshop (@Inria @insadelyon @Grame_Lyon) for having me!

YouTube

English

276

David Braun (dbraun.bsky.social)@DoItRealTime·5 Ara

Another example shows a differentiable polyphonic wavetable synth. The wavetables (2048 sample arrays) are learnable as well as the "Wavetable Position" which blends between them. Notice that the middle plot is a blend of a sine and triangle, but all the wavetables are learnable!

English

356

David Braun (dbraun.bsky.social)@DoItRealTime·5 Ara

New talk and workshop on Faust+JAX, this time with parameter *automation*. With simple SGD and L1 time-domain loss over the input audio and ground truth, we recover the parameter automation of a lowpass filter's cutoff frequency. youtube.com/watch?v=046Gi7…

YouTube

GIF

English

760

David Braun (dbraun.bsky.social)@DoItRealTime·1 Haz

DawDreamer now has a demo script for using multiprocessing to efficiently create one-shots from a synthesizer! Use all the cores! github.com/DBraun/DawDrea…

English

873

David Braun (dbraun.bsky.social)@DoItRealTime·1 Haz

Last week I presented my work integrating the Faust audio language and machine learning framework JAX youtu.be/AnWBSbC8vU8 This video goes over many nuances of the Google Colabs which I shared in October. I think it’s a great roadmap for future research in audio ML.

YouTube

English

582

Entdecken

@jon_barron @OfficialLoganK @_JonghoChoi @bigblueboo @giffmana @marco_ppasini @miketuritzin @iquilezles