Artem Artemev

215 posts

Artem Artemev

Artem Artemev

@aptemav

Machine Learning PhD @ImperialCollege

Cambridge, England Katılım Ekim 2011
400 Takip Edilen200 Takipçiler
Edward Z. Yang
Edward Z. Yang@ezyang·
My read on Mojo is it's what you would do if Swift for Tensorflow failed and you were like "why did it fail" and concluded it's because no one likes Swift, so instead you do Python, and also MLIR is cool so backend to MLIR directly instead of TF
English
5
5
88
23K
Artem Artemev
Artem Artemev@aptemav·
We also applied eXLA to the language transformer model, and in the experiment we modified the sequence length which in turn controls the size of the self-attention block. Out of the box TF implementation fails with OOM with lengths more than 2k, and eXLA runs up to 7k. [7/8]
Artem Artemev tweet media
English
1
0
0
0
Artem Artemev retweetledi
Alexander Terenin
Alexander Terenin@avt_im·
When working with a Gaussian process, have you ever wondered why Cholesky factorization failed, or a CG solve did not converge? Answer: it's because you've got redundant, overlapping data points. And that's just the starting point! On arXiv now! arxiv.org/abs/2210.07893
English
2
14
150
0
Artem Artemev retweetledi
Stat.ML Papers
Stat.ML Papers@StatMLPapers·
Wide Mean-Field Bayesian Neural Networks Ignore the Data. (arXiv:2202.11670v1 [cs.LG]) ift.tt/iyfvQew
English
0
4
13
0
Artem Artemev retweetledi
Mark van der Wilk
Mark van der Wilk@markvanderwilk·
I am still welcoming PhD applicants for 2022 at Imperial College London. We are a growing research group, with clear goals on what new abilities we want to develop in ML and neural networks. Topics: Invariances, neural arch search, (Bayesian) model selection, Gaussian processes.
English
8
130
426
0
Artem Artemev retweetledi
Vincent Dutordoir
Vincent Dutordoir@vdutor·
We are organizing a small-scale, offline #NeurIPS2021 satellite event in Cambridge (UK) on the 8th of December. If you are interested in NeurIPS content and are in the neighborhood, this is your chance to connect with your local machine learning community neuripsmeetupcambridge.info
English
4
33
114
0
Artem Artemev retweetledi
Mark van der Wilk
Mark van der Wilk@markvanderwilk·
Join us to discuss Conjugate Gradient based GP approximations! We make training easier by automatically setting approximation parameters like CG tolerance using marginal likelihood bounds. Today 5pm (London) / 9am PDT. Long talk and poster available at icml.cc/virtual/2021/p….
Mark van der Wilk@markvanderwilk

Current Conjugate Gradient Gaussian Processes require manual tuning to trade off accuracy and speed. Existing guidelines can give suboptimal results, without clear warnings. Our method tunes automatically, runs fewer CG steps, and performs better: arxiv.org/abs/2102.08314 👇1/6

English
2
6
37
0
Artem Artemev retweetledi
Mark van der Wilk
Mark van der Wilk@markvanderwilk·
Current Conjugate Gradient Gaussian Processes require manual tuning to trade off accuracy and speed. Existing guidelines can give suboptimal results, without clear warnings. Our method tunes automatically, runs fewer CG steps, and performs better: arxiv.org/abs/2102.08314 👇1/6
Mark van der Wilk tweet media
English
1
7
61
0
Artem Artemev retweetledi
Mark van der Wilk
Mark van der Wilk@markvanderwilk·
I'm looking forward to speaking tomorrow. I will share some thoughts on: - How Gaussian processes can help deep learning - Recent work on accurate GP inference - What makes a method "exact", and to what extent recent methods live up to this Link below if you want to join!
Cambridge MLG@CambridgeMLG

In tomorrow's CBL Alumni talk, we're happy to host our former PhD student @markvanderwilk, now assistant professor at @imperialcollege @ICComputing. More details at talks.cam.ac.uk/talk/index/158… Note: We just require attendees to have valid Zoom accounts (no registration required).

English
2
4
60
0
Artem Artemev retweetledi
Mark van der Wilk
Mark van der Wilk@markvanderwilk·
Tomorrow 10 Dec at 11am GMT I will speak at the Bayesian Deep Learning Meetup about **Bayesian Model Selection** and how it can help architecture search. In a short 20 minutes we will discuss why we (Bayesians ∪ Deep Learners) should care, and approaches from now and the past.
Mark van der Wilk tweet media
English
3
28
210
0