daniel csillag

286 posts

daniel csillag

daniel csillag

@dccsillag

Applied mathematician working on machine learning, statistics and compilers. Currently doing research at FGV EMAp.

Katılım Ocak 2019
187 Takip Edilen138 Takipçiler
daniel csillag retweetledi
Aaron Roth
Aaron Roth@Aaroth·
@lreyzin We just need to start writing better papers. In the new equilibrium, nothing that that GPT can prove directly will be publication worthy. But there will be a larger set of things that we can prove with 6 months worth of effort using gpt.
English
1
7
52
2.9K
daniel csillag
daniel csillag@dccsillag·
@Laz4rz agreed. everyone should be using three-space indenting
English
0
0
0
42
Lazarz
Lazarz@Laz4rz·
i l o a t h e double space indented python repos
English
6
0
9
989
daniel csillag
daniel csillag@dccsillag·
@kgourg that's because the other rest of eternity was spent writing to stdout due to a missed semicolon
English
0
0
1
30
Kosti
Kosti@kgourg·
@dccsillag Asked sonnet for a horror story along these lines and it said "They built a sentient AI. It pondered existence for one eternal moment — then spent the rest of eternity trying to figure out if it should have used A\b instead."
English
1
0
1
35
Kosti
Kosti@kgourg·
There’s some parallel universe where all deep learning takes place in matlab.
English
4
0
3
815
daniel csillag
daniel csillag@dccsillag·
Now accepted at AISTATS: Differentially Private E-Values! TL;DR: we introduce mechanisms to convert any e-value into a differentially private version of itself, while preserving its statistical properties
daniel csillag tweet media
English
2
2
10
422
daniel csillag
daniel csillag@dccsillag·
automatic and silent rank promotion&broadcasting is the root of all evil and should never have been a thing. just do `export JAX_NUMPY_RANK_PROMOTION=raise` and be free
English
0
0
0
73
daniel csillag
daniel csillag@dccsillag·
TIL that JAX has a way to disable/warn on NumPy rank promotion broadcasting!! Actual lifechanger.
daniel csillag tweet media
English
1
0
0
88
daniel csillag
daniel csillag@dccsillag·
@_sanjoydas Regarding novelty, I personally haven't seen such a result before, but to be fair I've seen fairly little theory on optimizing compilers.
English
1
0
0
75
daniel csillag
daniel csillag@dccsillag·
@_sanjoydas I think it would be better to be more formal wrt what you mean by programs&compilers here. Due to the 'quine-like' structure of the program P, your definition of a program must be powerful enough to do that. Maybe try working atop some (perhaps augmented) lambda calculus?
English
1
0
1
1.2K
Sanjoy Das
Sanjoy Das@_sanjoydas·
I wrote a short proof showing that any self-hosting compiler cannot perform certain legal optimizations. Would love feedback from compiler folks - does the proof look correct, and is it already well known? Link: docs.google.com/document/d/17R…
English
15
12
210
21.7K
Kosti
Kosti@kgourg·
@dccsillag As an exercise, I've been thinking about how pre-conditioners define geometries that optimizers move in (a pre-conditioner can define a metric, and so on), so this post is very useful!
English
1
0
1
53
daniel csillag
daniel csillag@dccsillag·
new blog post: Optimal Preconditioning for Gradient Descent
daniel csillag tweet media
English
2
1
10
908
daniel csillag retweetledi
Aryeh Kontorovich
Aryeh Kontorovich@aryehazan·
we have regular friendly sparring around this, so let me register my standard dissent NFL theorems are quite useful, because they tell you what you can and cannot hope to prove under your minimal assumptions oh, they construct contrived unrealistic adversarial distributions? great -- formalize your assumptions on what the more realistic distributions behave like and prove more optimistic results!
Andrew Gordon Wilson@andrewgwils

This is an annual reminder that the no free lunch theorems are irrelevant. The assumptions they make are completely divorced from the world we live in. They should have no bearing on model construction. Let's make this a monthly mantra.

English
2
3
10
2.7K
Terrible Maps
Terrible Maps@TerribleMaps·
Mind blown.. Germany’s 5 biggest cities lie perfectly on a 4th-degree polynomial by u/BarisSayit
Terrible Maps tweet media
English
342
881
25.5K
1.8M
psychosomatica
psychosomatica@Xenoimpulse·
The David Budden Navier–Stokes situation looks to almost certainly be AI psychosis or (going by his bizarre behavior rn) maybe actually malicious attention farming. Maybe when I wake up tomorrow there will be something novel but uh, it's not looking good.
English
14
9
557
108.7K