Eshwar Ram Arunachaleswaran

27 posts

Eshwar Ram Arunachaleswaran

Eshwar Ram Arunachaleswaran

@EshwarERA

PhD Student at the University of Pennsylvania

Philadelphia, PA Katılım Haziran 2024
351 Takip Edilen122 Takipçiler
Eshwar Ram Arunachaleswaran retweetledi
Aaron Roth
Aaron Roth@Aaroth·
Suppose you and I both have different features about the same instance. Maybe I have CT scans and you have physician notes. We'd like to collaborate to make predictions that are more accurate than possible from either feature set alone, while only having to train on our own data.
Aaron Roth tweet media
English
4
29
188
25.9K
Eshwar Ram Arunachaleswaran
We prove minimizing profile swap-regret is necessary & sufficient for non-manipulability and gets NR +PO. Bonus: if all agents minimize it, the dynamics can reach profiles that cannot be realized as Correlated Equilibria by traditional mediators—unlike normal-form games!
English
0
0
1
171
Eshwar Ram Arunachaleswaran
Our fix? Profile Swap-Regret, a further coarsening of polytope swap-regret, leveraging a geometric view of algorithms (link). Admits an efficient algorithm with O(√T) convergence! arxiv.org/abs/2402.09549
English
1
0
1
237
Eshwar Ram Arunachaleswaran
Eshwar Ram Arunachaleswaran@EshwarERA·
@willccbb Very interesting! Just tried it out.Full chains of thought are amazing to read especially where the model makes some progress, can't quite get the answer and resorts to writing sanitized boilerplate text without speculating in the final output
English
0
0
1
66
will brown
will brown@willccbb·
@EshwarERA likely a relatively small model, mostly a demo of their reasoning chain tricks / teasing a bigger version coming soon
English
1
0
1
116
will brown
will brown@willccbb·
it was so close...
will brown tweet media
English
2
0
4
886
Eshwar Ram Arunachaleswaran
Eshwar Ram Arunachaleswaran@EshwarERA·
@willccbb Right. Which model is this? Pieces of the answer are correct traces of reasoning, but it's unable to chain them together
English
1
0
1
63
will brown
will brown@willccbb·
@EshwarERA yeah the internet word association btw no-swap + CE is way higher than what you get from the few dozen papers about no-swap stackelberg stuff
English
1
0
1
57
Eshwar Ram Arunachaleswaran retweetledi
Aaron Roth
Aaron Roth@Aaroth·
Now appearing in SODA; thanks to an eagle eyed reviewer, we've updated the title to "An Elementary Predictor Obtaining 2*Sqrt{T} + 1 Distance to Calibration." Fortunately it still fits in one line. (It turns out there are m+1 numbers in the set {0, 1/m, 2/m, ..., 1}, not m...)
Aaron Roth@Aaroth

A quick thread on a short (3 page) paper, giving a simple algorithm that makes predictions guaranteeing 2*Sqrt{T} "Distance to calibration" against an adversary. The algorithm and proof are so simple I can describe it in thread. Joint with Eshwar, @natalie_collina, and Mirah:

English
0
4
27
7.2K
Eshwar Ram Arunachaleswaran
Eshwar Ram Arunachaleswaran@EshwarERA·
@willccbb That's a cool explanation. Would digging into it require understanding how the underlying value model generalizes the estimates from training to the new states it encounters at test time? Thinking of mechanisms that would result in this imperfect correlation
English
1
0
1
235
will brown
will brown@willccbb·
yeah, let’s restrict to fully observable EFGs with clear win states you def need search during training to estimate a value function but it’s interesting that mtcs at test-time is still useful empirically — it wouldn’t be needed at all if you had a perfect value function i think you could also show that it wouldn’t help if value estimates were worst-case correlated (eg over a subtree) one plausible story is that test-time search helps you explore enough of the state space that value estimate errors are imperfectly correlated, so you get concentration of errors and basically a new value function with tighter confidence intervals
English
2
0
2
168
will brown
will brown@willccbb·
anyone have a good learning-theoretic explanation of why scaling search is more efficient than scaling models (e.g for alphago)? feels like there could maybe be a boosting-related story
English
3
0
8
1.3K
Eshwar Ram Arunachaleswaran
Eshwar Ram Arunachaleswaran@EshwarERA·
@willccbb I'd be very interested in hearing any boosting like explanations. Are you perhaps hinting at search aggregating varying performance of an imperfect model over different parts of the context space?
English
1
0
2
331
Eshwar Ram Arunachaleswaran retweetledi
Aaron Roth
Aaron Roth@Aaroth·
The Sherman Act (1890) was written to prevent old fashioned price fixing: picture two men meeting in a smoky bar. It requires overt communication of intent to coordinate on prices. But pricing algorithms can automatically coordinate on high prices. What is algorithmic collusion?
Aaron Roth tweet media
English
3
25
143
23.4K
Eshwar Ram Arunachaleswaran retweetledi
Ben Recht
Ben Recht@beenwrekt·
What scientific overpublication is an inevitable symptom of the mass psychosis of a cabal of overachievers? argmin.net/p/youre-gonna-…
English
5
13
99
19.6K
Eshwar Ram Arunachaleswaran retweetledi
Aaron Roth
Aaron Roth@Aaroth·
A quick thread on a short (3 page) paper, giving a simple algorithm that makes predictions guaranteeing 2*Sqrt{T} "Distance to calibration" against an adversary. The algorithm and proof are so simple I can describe it in thread. Joint with Eshwar, @natalie_collina, and Mirah:
Aaron Roth tweet media
English
2
15
92
32.7K