@neilhoulsby
Neil Houlsby
10 months
A nice approach to adaptive computation from @XueFz taking advantage of sequence length flexibility in Transformers.
@XueFz
Fuzhao Xue on the job market!
10 months
1/ Introducing AdaTape: an adaptive computation transformer with elastic input sequence! 🚀
* Flexible computation budget via elastic sequence length
* Dynamic memory read & write for adaptable input context
* Direct adaptive computation enhancement of input sequences