David Braun (dbraun.bsky.social)

247 posts

David Braun (dbraun.bsky.social) banner
David Braun (dbraun.bsky.social)

David Braun (dbraun.bsky.social)

@DoItRealTime

PhD student @PrincetonCS audiovisual ML. @CCRMA/@Stanford, @BrownUniversity @dbraun.bsky.social

Princeton, NJ Beigetreten Aralık 2016
436 Folgt1.9K Follower
Angehefteter Tweet
David Braun (dbraun.bsky.social)
🎛️Audio ML friends, I made a transpiler from Faust code to JAX! Faust is a powerful language for sound synthesis, so connecting it to JAX while supporting differentiability and learnable parameters is a huge research opportunity. 🧵
David Braun (dbraun.bsky.social) tweet media
English
1
16
127
0
Jon Barron
Jon Barron@jon_barron·
Deleting all my config dataclasses today. Each hparam is now just a default argument value and an `# hparam` comment so it’s greppable. Sweeps are just an agent running a sed script on the repo, launching, and reverting. Model variant configs are all bash scripts of sed calls.
English
9
2
100
15.4K
Jongho Choi
Jongho Choi@_JonghoChoi·
jukebox보다가 수노 2인가쯤에 오 소리에 아티팩트 제법 적은 음악 비슷한걸 만드네~ 제법이네 하면서 신기해했던게 진짜 얼마안되는데 이제 수노가 걍 사람이만든 음악보다 좋은거같다.. youtube.com/watch?v=dGJiSg…
YouTube video
YouTube
한국어
1
0
2
276
Charlie Deck
Charlie Deck@bigblueboo·
Eh. I’m a huge fan of music diffusion but there are persistent gaps between islands of coherency in the distribution. Try having your DnB track in 5/4. Or a mathrock bebop. Or a Klingon showtune. They nail commercial music tho
@levelsio@levelsio

So now that AI can do music really well too What remains human is - live performances - authenticity/emotional bond - using AI during live performances (imagine live generating music on the fly) - ultra-famous artists

English
1
0
1
387
Mike Turitzin
Mike Turitzin@miketuritzin·
@DoItRealTime Oh nice, yeah definitely looks like a similar idea. That must have been a DK2?
English
1
0
0
51
Mike Turitzin
Mike Turitzin@miketuritzin·
Just ran across this tweet of mine from 2019. It doesn't look like that much outside of VR (and with Twitter compression) but this was one of the most trippy things I ever experienced in VR. The way the tunnel is "head locked" is really wild to experience.
Mike Turitzin@miketuritzin

Made this simple VR music visualization when testing FFT code in my app. The way it appears in VR is much different, as the start of the "tunnel" is head locked, so it appears that you are creating a psychedelic tunnel (that you can't look away from!) as you move your head.

English
1
0
12
1.6K
inigo quilez
inigo quilez@iquilezles·
We improvised a marble elevator with some lego technique and regular legos, for a duplos marble run we had around.
English
12
8
366
29.3K
Nikhil Thorat
Nikhil Thorat@nsthorat·
I am officially 100% sold on cursor being revolutionary. I built something this weekend I had estimated 2 weeks of work, done in 2 days and because cursor is not lazy, the code is very readable with small grokable components. Cursor + Sonnet 4 is amazing.
English
7
0
65
4.4K
David Braun (dbraun.bsky.social) retweetet
arXiv Sound
arXiv Sound@ArxivSound·
``DAC-JAX: A JAX Implementation of the Descript Audio Codec,'' David Braun, ift.tt/FzlXUdq
English
0
1
12
1.4K
David Braun (dbraun.bsky.social)
In other words, if I wanted to explore usage of a real-time, 8.2 ms latency, 44.1 kHz DAC model, JAX might be faster. Of course, more analysis and testing are welcome. More details in the paper linked at github.com/DBraun/DAC-JAX
English
0
0
1
309
David Braun (dbraun.bsky.social)
I benchmarked the chunked compression/decompression speeds. These are the functions you would use on long files or streaming. For a hop size of 8.2 ms, JAX performs compression in 7.1 ms and decompression in 4.3 ms. PyTorch performs compression in 8.3 ms, decompression in 6.3 ms.
David Braun (dbraun.bsky.social) tweet media
English
1
0
2
388
David Braun (dbraun.bsky.social)
Happy to release "DAC-JAX: A JAX Implementation of the Descript Audio Codec." This can reuse PyTorch weights of all model sizes, and it includes a device-parallel training script. It uses the standard JAX libraries: Flax, Optax, Orbax, and CLU. github.com/DBraun/DAC-JAX
English
1
6
28
2.1K
Nao Tokui
Nao Tokui@naotokui_en·
Really, Apple? It's so sad and heartbreaking to see the new iPad ad from Apple, a company that used to respect the craftsmanship of tool makers. Pure arrogance.
English
1
0
11
1.9K
David Braun (dbraun.bsky.social)
Another example shows a differentiable polyphonic wavetable synth. The wavetables (2048 sample arrays) are learnable as well as the "Wavetable Position" which blends between them. Notice that the middle plot is a blend of a sine and triangle, but all the wavetables are learnable!
David Braun (dbraun.bsky.social) tweet mediaDavid Braun (dbraun.bsky.social) tweet media
English
1
0
1
356
David Braun (dbraun.bsky.social)
New talk and workshop on Faust+JAX, this time with parameter *automation*. With simple SGD and L1 time-domain loss over the input audio and ground truth, we recover the parameter automation of a lowpass filter's cutoff frequency. youtube.com/watch?v=046Gi7…
YouTube video
YouTube
GIF
English
1
0
5
760
David Braun (dbraun.bsky.social)
Last week I presented my work integrating the Faust audio language and machine learning framework JAX youtu.be/AnWBSbC8vU8 This video goes over many nuances of the Google Colabs which I shared in October. I think it’s a great roadmap for future research in audio ML.
YouTube video
YouTube
English
0
1
1
582