Mario Beraha

66 posts

Mario Beraha banner
Mario Beraha

Mario Beraha

@mberaha2

I like studying statistical problems.

Katılım Eylül 2019
162 Takip Edilen165 Takipçiler
Mario Beraha retweetledi
Alessandra Guglielmi
Alessandra Guglielmi@AlessandraGugl9·
📢I’m really proud of “Bayesian clustering of high-dimensional data via latent repulsive mixtures’’ just appeared on Biometrika, Advance articles, doi.org/10.1093/biomet…. Thank you to my terrific coauthors Lorenzo Ghilotti and @mberaha2.
English
1
7
36
2.3K
Mario Beraha retweetledi
Alessandra Guglielmi
Alessandra Guglielmi@AlessandraGugl9·
📢The submission of contributed talks for the 14th International Conference on Bayesian Nonparametrics, UCLA (Los Angeles, US), June 23-27, 2025, is now OPEN! Deadline for submission: Dec 15, 2024.
English
1
2
3
176
Mario Beraha retweetledi
Daniel Litt
Daniel Litt@littmath·
Average temperatures by year, with the years listed in alphabetical order, and suddenly the climate crisis looks like a joke.
Daniel Litt tweet media
English
9
14
320
10K
Mario Beraha retweetledi
Dov Ben-Shimon
Dov Ben-Shimon@DovBenShimon·
Warning: extremely difficult text to read below. Last night I surrendered my phone and signed a waiver. And then I sat with a small group and an Israeli military attaché and we watched the 45 minute compilation of Hamas videos from October 7th. 1/12
English
2.8K
10.4K
31K
6.8M
Mario Beraha
Mario Beraha@mberaha2·
As a bonus, we show that our model-based approach can be used to infer the cardinality of the dataset as well. This is another classical problem in computer science which was typically solved using a different data structure!
English
0
1
1
100
Mario Beraha
Mario Beraha@mberaha2·
Hence, we adopt a pragmatic approach and propose the class "smoothed" estimators, which work very well in practice and are easy to compute! We extend the analysis to sketches obtained with multiple hash functions by drawing from the "multi-view" literature.
English
1
0
0
95
Mario Beraha
Mario Beraha@mberaha2·
Together with Stefano Favaro and Matteo Sesia, we just arXiv'd our latest take on frequency and cardinality estimation from compressed (sketched) data: arxiv.org/abs/2309.15408
Mario Beraha tweet media
English
1
0
2
435
Mario Beraha
Mario Beraha@mberaha2·
As a bonus, we show that our model-based approach can be used to infer the cardinality of the dataset as well. This is another classical problem in computer science which was typically solved using a different data structure!
English
0
0
0
50
Mario Beraha
Mario Beraha@mberaha2·
Hence, we adopt a pragmatic approach and propose the class "smoothed" estimators, which work very well in practice and are easy to compute! We extend the analysis to sketches obtained with multiple hash functions by drawing from the "multi-view" literature.
English
1
0
0
55
Mario Beraha retweetledi
Alessandra Guglielmi
Alessandra Guglielmi@AlessandraGugl9·
@mberaha2 and I have just published the first project we started collaborating 5 years ago! Meanwhile we have taken onboard 4 more coauthors, and I thank them all. See Childhood obesity in Singapore: A Bayesian nonparametric approach, SMJ OnlineFirst
English
0
2
9
515
Mario Beraha
Mario Beraha@mberaha2·
In particular, we show that, in some cases, this leads to a predictive distribution that depends on the sample size and the number of unique traits in the sample, similarly to what happens under the Pitman-Yor process prior!
English
0
0
0
184
Mario Beraha
Mario Beraha@mberaha2·
Indeed, under a CRM prior, the predictive distribution of the number of "new" traits in an additional sample depends only on the sample size! We propose a new class of priors derived from the scaled subordinators by James et al. (2015).
English
1
0
1
203
Mario Beraha
Mario Beraha@mberaha2·
Personal update. As of last week, I'm officially a Ph.D. in data science and computation. My thesis was on the statistical learning of RPMs, under the supervision of @AlessandraGugl9! We also arXived the first two papers from my postdoc, joint works with Stefano Favaro.
Mario Beraha tweet media
English
3
3
42
2.6K