
Bayesian ML at Scale
460 posts

Bayesian ML at Scale
@BayesianIn
Machine Learning, Recommender Systems, Causal Inference, Bayesian Inference https://t.co/V5yOMCprrS










Yann is right about everything (except RL).



One missing piece in medical AI is how to act under causal uncertainty. Inspired by @yudapearl, we don’t collapse uncertainty into heuristics. We maintain multiple causal hypotheses and update their weights online through real-world feedback. Prediction asks what may happen. Causal bounds + stability ask when automation must stop.


Thanks for the problem. Let me try to understand it. X=1 is the patient has the inclination to take the drug and X=2 is the patient does not have the inclination to take the drug, these two inclinations are equally probable, so P(X=1)=P(X=2)=0.5 P(death|do(Drug),X=1,M2)=0.2 P(death|do(Drug),X=2,M2)=0 P(death|do(Placebo),X=1,M2)=0.1 P(death|do(Placebo),X=2,M2)=0.1 M1 is that the Drug behaves like the Placebo (sugar tablet) P(death|do(Drug),X=1,M1)=0.1 P(death|do(Drug),X=2,M1)=0.1 P(death|do(Placebo),X=1,M1)=0.1 P(death|do(Placebo),X=2,M1)=0.1 So by the backdoor rule: P(death|do(Drug),M1)=P(death|do(Drug),M2)=0.1, so with X unobserved the likelihood is identical in both cases. So the likelihood for model=M1 and for model=M2 are the same in an RCT which does not observe X. If there is an additional single observation of: death, drug, X=1 (patient has the inclination to take the drug, and this was observed) The probability of this single observation is 0.2 under M2 and 0.1 under M1. This makes the posterior probability of M2 as 2/3. I must confess I am confused about a) Why M2 involves a covariate which is the inclination to take the drug, rather than another easily measured attribute. Does this make the example more interesting? b) Why you think this poses a challenge to Bayes. It's entirely possible I misunderstood an aspect of this example.



I am still only "half Bayesian", for reasons explained here: ucla.in/2nZN7IH and made concrete here: x.com/yudapearl/stat… @ngdonghung @JiangSH24 @soboleffspaces



Simple way to replace RL with supervised learning: assign the reward to every action on the path to it and learn to predict it. Hypothesis: no RL algorithm will ever beat this by much.




















