Artem Artemev

215 posts

@aptemav

Machine Learning PhD @ImperialCollege

Cambridge, England · Joined October 2011

400 Following · 199 Followers

Pinned Tweet

Artem Artemev@aptemav·27 Nov

Check out our work "Memory Safe Computations with XLA compiler" at #NeurIPS2022 (with Yuze An, @dyedgreen, @markvanderwilk). The paper and PR can be found at openreview.net/pdf?id=2S_GtHB… and github.com/tensorflow/ten…. The poster is at neurips.cc/virtual/2022/p…. Some details in short [1/8]

Artem Artemev@aptemav·5 May

@krzysztof_rus @PatrickKidger @ezyang I'm not sure how Enzyme is going to help here, even with MLIR support. The user still needs an interface, in some form, to autodiff.

Krzysztof@krzysztof_rus·3 May

@PatrickKidger @ezyang Enzyme AD adds autodiff to LLVM but I am not sure if it plays nicely with MLIR.

Edward Z. Yang@ezyang·2 May

My read on Mojo is it's what you would do if Swift for Tensorflow failed and you were like "why did it fail" and concluded it's because no one likes Swift, so instead you do Python, and also MLIR is cool so backend to MLIR directly instead of TF

Artem Artemev@aptemav·27 Nov

@markvanderwilk will be at the #NeurIPS2022 presenting the poster. If you are at #NeurIPS pop in and say hello. Thanks! [8/8]

Artem Artemev@aptemav·27 Nov

We also applied eXLA to a language transformer model. In the experiment we varied the sequence length, which in turn controls the size of the self-attention block. The out-of-the-box TF implementation fails with OOM at lengths above 2k, while eXLA runs up to 7k. [7/8]
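A rough back-of-the-envelope sketch of why sequence length matters here: the self-attention score tensor has one entry per pair of positions, so its size grows quadratically with length. The batch size, head count, and float32 storage below are illustrative assumptions, not the settings used in the experiment.

```python
def attention_scores_bytes(seq_len, batch=1, heads=8, bytes_per_elem=4):
    """Bytes needed to hold one layer's attention score tensor.

    The tensor has shape (batch, heads, seq_len, seq_len).
    batch, heads and 4-byte (float32) elements are hypothetical
    values chosen for illustration only.
    """
    return batch * heads * seq_len * seq_len * bytes_per_elem


for n in (2_000, 7_000):
    gib = attention_scores_bytes(n) / 2**30
    print(f"seq_len={n}: {gib:.2f} GiB per layer")
```

Whatever the constants, going from 2k to 7k multiplies this one tensor by 3.5 squared, that is 12.25.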

Artem Artemev retweeted

Alexander Terenin@avt_im·17 Oct

When working with a Gaussian process, have you ever wondered why Cholesky factorization failed, or a CG solve did not converge? Answer: it's because you've got redundant, overlapping data points. And that's just the starting point! On arXiv now! arxiv.org/abs/2210.07893
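A small sketch of the effect mentioned in the tweet, assuming a squared-exponential kernel on 1-D inputs (the kernel, the inputs, and the jitter value are illustrative choices, not taken from the paper). Repeating an input makes two rows of the kernel matrix identical, so the last Cholesky pivot collapses to rounding noise. A library routine such as LAPACK's `potrf` reports failure when a pivot is not positive. Adding a small diagonal "jitter" is a common workaround, shown here only for contrast.

```python
import math


def rbf_kernel(xs, lengthscale=1.0):
    """Squared-exponential kernel matrix for 1-D inputs."""
    return [[math.exp(-0.5 * ((a - b) / lengthscale) ** 2) for b in xs] for a in xs]


def cholesky_pivots(K, jitter=0.0):
    """Run a textbook Cholesky factorization of K + jitter * I.

    Returns the squared pivots (the values under the square root),
    which must all be positive for the factorization to exist.
    """
    n = len(K)
    L = [[0.0] * n for _ in range(n)]
    pivots = []
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                d = K[i][i] + jitter - s
                pivots.append(d)
                # Floor the pivot so the square root is defined
                # and the loop can finish and report every pivot.
                L[i][j] = math.sqrt(max(d, 1e-300))
            else:
                L[i][j] = (K[i][j] - s) / L[j][j]
    return pivots


distinct = [0.0, 1.0, 2.0, 3.0]
overlapping = distinct + [1.0]  # the last point repeats an earlier one

print("distinct points:     ", cholesky_pivots(rbf_kernel(distinct)))
print("with a duplicate:    ", cholesky_pivots(rbf_kernel(overlapping)))
print("duplicate plus jitter:", cholesky_pivots(rbf_kernel(overlapping), jitter=1e-6))
```

The duplicate is placed last so that the collapsed pivot is never used as a divisor.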

Artem Artemev retweeted

Stat.ML Papers@StatMLPapers·24 Feb

Wide Mean-Field Bayesian Neural Networks Ignore the Data. (arXiv:2202.11670v1 [cs.LG]) ift.tt/iyfvQew

Artem Artemev retweeted

Mark van der Wilk@markvanderwilk·9 Dec

I am still welcoming PhD applicants for 2022 at Imperial College London. We are a growing research group, with clear goals on what new abilities we want to develop in ML and neural networks. Topics: Invariances, neural arch search, (Bayesian) model selection, Gaussian processes.

Artem Artemev retweeted

Vincent Dutordoir@vdutor·15 Nov

We are organizing a small-scale, offline #NeurIPS2021 satellite event in Cambridge (UK) on the 8th of December. If you are interested in NeurIPS content and are in the neighborhood, this is your chance to connect with your local machine learning community neuripsmeetupcambridge.info

Artem Artemev retweeted

Mark van der Wilk@markvanderwilk·22 Jul

Join us to discuss Conjugate Gradient based GP approximations! We make training easier by automatically setting approximation parameters like CG tolerance using marginal likelihood bounds. Today 5pm (London) / 9am PDT. Long talk and poster available at icml.cc/virtual/2021/p….

Mark van der Wilk@markvanderwilk

Current Conjugate Gradient Gaussian Processes require manual tuning to trade off accuracy and speed. Existing guidelines can give suboptimal results, without clear warnings. Our method tunes automatically, runs fewer CG steps, and performs better: arxiv.org/abs/2102.08314 👇1/6
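A minimal sketch of the manual trade-off described above: a plain conjugate gradient solver whose stopping tolerance is set by hand. It illustrates only how the tolerance drives the number of steps, not the automatic tuning from the paper. The test matrix (a squared-exponential kernel plus a 0.1 noise term on the diagonal) and the two tolerances are assumptions made for illustration.

```python
import math


def conjugate_gradient(A, b, tol, max_iter=1000):
    """Solve A x = b for a symmetric positive-definite A.

    Stops once the norm of the recursively updated residual is at most tol.
    Returns (x, number_of_iterations).
    """
    n = len(b)
    x = [0.0] * n
    r = list(b)  # residual b - A x for the starting point x = 0
    p = list(r)  # first search direction
    rs = sum(v * v for v in r)
    for it in range(max_iter):
        if math.sqrt(rs) <= tol:
            return x, it
        Ap = [sum(A[i][j] * p[j] for j in range(n)) for i in range(n)]
        alpha = rs / sum(p[i] * Ap[i] for i in range(n))
        x = [x[i] + alpha * p[i] for i in range(n)]
        r = [r[i] - alpha * Ap[i] for i in range(n)]
        rs_new = sum(v * v for v in r)
        p = [r[i] + (rs_new / rs) * p[i] for i in range(n)]
        rs = rs_new
    return x, max_iter


# Hypothetical test problem: kernel matrix on a unit grid plus noise.
size = 30
A = [[math.exp(-0.5 * (i - j) ** 2) + (0.1 if i == j else 0.0)
      for j in range(size)] for i in range(size)]
b = [1.0] * size

for tol in (1e-1, 1e-8):
    _, iters = conjugate_gradient(A, b, tol)
    print(f"tol={tol:g}: {iters} iterations")
```

A looser tolerance can never need more steps than a tighter one on the same problem, since both runs follow the same iterates and differ only in when they stop.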

Artem Artemev retweeted

Mark van der Wilk@markvanderwilk·29 Apr

I'm looking forward to speaking tomorrow. I will share some thoughts on:
- How Gaussian processes can help deep learning
- Recent work on accurate GP inference
- What makes a method "exact", and to what extent recent methods live up to this
Link below if you want to join!

Cambridge MLG@CambridgeMLG

In tomorrow's CBL Alumni talk, we're happy to host our former PhD student @markvanderwilk, now assistant professor at @imperialcollege @ICComputing. More details at talks.cam.ac.uk/talk/index/158… Note: We just require attendees to have valid Zoom accounts (no registration required).

Artem Artemev retweeted

Mark van der Wilk@markvanderwilk·9 Dec

Tomorrow 10 Dec at 11am GMT I will speak at the Bayesian Deep Learning Meetup about **Bayesian Model Selection** and how it can help architecture search. In a short 20 minutes we will discuss why we (Bayesians ∪ Deep Learners) should care, and approaches from now and the past.