Artem Artemev Profile
Artem Artemev

@aptemav

Followers
204
Following
3K
Media
5
Statuses
215

Machine Learning PhD @ImperialCollege

Cambridge, England
Joined October 2011
@aptemav
Artem Artemev
3 years
Check out our work "Memory Safe Computations with XLA compiler" at #NeurIPS2022 (with Yuze An, @dyedgreen, @markvanderwilk). The paper and PR can be found at […] and […]. The poster is […]. Some details in short [1/8].
github.com
Hello, In this pull request I would like to introduce the code of the paper that has been accepted at the NeurIPS 2022. This is the joint work of Yuze An (@melody-an), Tilman Roeder (@dyedgreen), M...
1
5
16
@aptemav
Artem Artemev
3 years
@markvanderwilk will be at the #NeurIPS2022 presenting the poster. If you are at #NeurIPS pop in and say hello. Thanks! [8/8].
0
0
0
@aptemav
Artem Artemev
3 years
We also applied eXLA to a language transformer model. In the experiment we varied the sequence length, which in turn controls the size of the self-attention block. The out-of-the-box TF implementation fails with OOM for sequence lengths above 2k, while eXLA runs up to 7k. [7/8]
1
0
0
@aptemav
Artem Artemev
3 years
eXLA allowed us to run an OOM-free scaled-up version of the sparse Gaussian process regression (SGPR) model without any change to the SGPR code from GPflow. [6/8]
1
0
0
@aptemav
Artem Artemev
3 years
With eXLA we ran kernel matrix-vector multiplication for input vectors of size n=1e6 on a single GPU. Allocating the intermediate n-by-n matrix in this expression requires 8TB in fp64, which is impractical with default ML frameworks. [5/8]
1
0
1
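The 8TB figure in the tweet above follows from n² entries at 8 bytes each: 1e6 × 1e6 × 8 B = 8e12 B. The sketch below reproduces that arithmetic and shows one hand-written way to avoid the full matrix by computing the product one row block at a time. It is only an illustration of the general idea: the squared-exponential kernel, the chunk size, and the function names `dense_kernel_bytes`, `rbf_block`, and `chunked_kernel_matvec` are assumptions made for this example and are not taken from the paper or from eXLA, which works inside the compiler rather than in user code.

```python
import numpy as np


def dense_kernel_bytes(n: int, itemsize: int = 8) -> int:
    """Bytes needed to materialise a dense n x n matrix (fp64 by default)."""
    return n * n * itemsize


def rbf_block(xa, xb, lengthscale=1.0):
    """Squared-exponential kernel between two blocks of points (assumed kernel)."""
    sq = (
        np.sum(xa**2, axis=1)[:, None]
        + np.sum(xb**2, axis=1)[None, :]
        - 2.0 * xa @ xb.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-0.5 * np.maximum(sq, 0.0) / lengthscale**2)


def chunked_kernel_matvec(x, v, chunk=1024, lengthscale=1.0):
    """Compute K(x, x) @ v one row block at a time.

    Only a (chunk x n) slice of the kernel matrix exists at any moment,
    so the full n x n matrix is never allocated.
    """
    n = x.shape[0]
    out = np.empty(n, dtype=np.result_type(x, v))
    for start in range(0, n, chunk):
        stop = min(start + chunk, n)
        out[start:stop] = rbf_block(x[start:stop], x, lengthscale) @ v
    return out


# Dense fp64 kernel matrix for n = 1e6, in decimal terabytes.
print(dense_kernel_bytes(1_000_000) / 1e12)
```

With `chunk=1024` and n=1e6, each block holds about 1024 × 1e6 × 8 B ≈ 8.2 GB, so a smaller chunk would be needed on most single GPUs or hosts; the trade-off is more loop iterations.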
@aptemav
Artem Artemev
3 years
Optimizations in the extension adjust the computational graph in an attempt to make it less memory-demanding. Here are some results: [4/8].
1
0
0
@aptemav
Artem Artemev
3 years
This question is the motivation for our work. The aim is to resolve OOM issues that practitioners might encounter during ML development or when executing existing ML code. We introduced an extension (eXLA) to the optimization pipeline in the XLA compiler [3/8].
1
0
0
@aptemav
Artem Artemev
3 years
Out-of-memory (OOM) issues can cause a lot of trouble: users need to invest a lot of effort into resolving them, and sometimes even rewrite existing software. What if a compiler could sort it out for the user automatically?! [2/8].
1
0
0
@aptemav
Artem Artemev
3 years
RT @avt_im: When working with a Gaussian process, have you ever wondered why Cholesky factorization failed, or a CG solve did not converge?….
arxiv.org
Gaussian processes are frequently deployed as part of larger machine learning and decision-making systems, for instance in geospatial modeling, Bayesian optimization, or in latent Gaussian models....
0
17
0
@aptemav
Artem Artemev
4 years
RT @markvanderwilk: I am still welcoming PhD applicants for 2022 at Imperial College London. We are a growing research group, with clear go….
0
131
0
@aptemav
Artem Artemev
4 years
RT @vdutor: We are organizing a small-scale, offline #NeurIPS2021 satellite event in Cambridge (UK) on the 8th of December. If you are int….
0
34
0
@aptemav
Artem Artemev
4 years
RT @markvanderwilk: Join us to discuss Conjugate Gradient based GP approximations! We make training easier by automatically setting approxi….
0
6
0
@aptemav
Artem Artemev
4 years
RT @markvanderwilk: Current Conjugate Gradient Gaussian Processes require manual tuning to trade off accuracy and speed. Existing guideline….
0
7
0
@aptemav
Artem Artemev
4 years
RT @markvanderwilk: I'm looking forward to speaking tomorrow. I will share some thoughts on: - How Gaussian processes can help deep learnin…
0
4
0
@aptemav
Artem Artemev
5 years
RT @markvanderwilk: Tomorrow 10 Dec at 11am GMT I will speak at the Bayesian Deep Learning Meetup about **Bayesian Model Selection** and ho….
0
28
0
@aptemav
Artem Artemev
5 years
RT @vincentadam87: Come and chat with the authors of our paper: Doubly sparse variational gaussian processes! #A…
0
5
0
@aptemav
Artem Artemev
5 years
RT @arnosolin: My #ICML2020 tutorial videos on "Machine Learning with Signal Processing" are now freely available:.I: .
0
68
0
@aptemav
Artem Artemev
5 years
RT @TamasGorbe: #Puzzle Can the Queen pass through all 9 shaded squares in just 4 legal moves starting from this position?
0
117
0