New work on how to construct neural network layers on composite objects of scalars, vectors, bivectors, … --> multivectors! Via Clifford algebras, we generalize convolution and Fourier transforms to multivectors, especially relevant for PDE modeling: Tweet added by Johannes Brandstetter @jo_brandstetter | Musk Viewer

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

New work on how to construct neural network layers on composite objects of scalars, vectors, bivectors, … --> multivectors! Via Clifford algebras, we generalize convolution and Fourier transforms to multivectors, especially relevant for PDE modeling:

13

178

680

Replies

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

Work done as a close collaboration with @vdbergrianne and @wellingmax , at our new AI4Science team at Microsoft Research @MSFTResearch Amsterdam, and... @rejuvyesh at Microsoft Autonomous Systems and Robotics Research Group. 2/n

1

3

31

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

In sciences, PDEs describe physical processes as scalar and vector fields interacting and coevolving over time. We want neural PDE surrogates to take into account the relationship between different fields and their internal components, via so-called multivectors. 3/n

1

3

24

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

In Clifford algebras, different spatial primitives can be summarized into such multivectors. A multivector does not only have scalar, and vector components. But also has plane (bivector) and volume (trivector) primitives. 4/n

Tweet media one

1

4

27

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

An example of a bivector is the cross-product of two vectors in 3d, which is a plane segment spanned by these two vectors, and is often represented as a vector itself. But the cross-product has a sign flip under reflection which a true vector does not. It is a bivector. 5/n

Tweet media one

1

3

27

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

We introduce Clifford neural layers, both for convolution and for Fourier operations, that handle scalar (e.g. density), vector (e.g. wind/electric field), bivector (e.g. magnetic field) and higher order fields as proper geometric objects termed as multivector fields. 6/n

1

3

25

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

And that is the cool thing: we can build neural networks on top of this concept of multivectors😊If there are physical connections between fields, e.g. between electric and magnetic fields, Clifford layers give you a handle on how to encode these. 7/n

1

2

26

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

We introduce Clifford convolution: instead of convolving over channels of feature maps, we convolve over channels of multivector feature maps using multivector kernel. 8/n

Tweet media one

1

2

24

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

We introduce Clifford Fourier transforms: instead of Fourier transforming feature maps, we Fourier transform multivector feature maps. 9/n

Tweet media one

1

2

21

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

We therefore evaluate on fluid dynamics (Navier-Stokes equations) where scalar smoke and vector velocity fields form multivectors, coupled over force terms. 10/n

Tweet media one

Tweet media two

1

1

20

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

We further evaluate on weather forecasting models where scalar pressure and vector wind velocity fields form multivectors. 11/n

Tweet media one

1

2

18

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

We replace each convolution with a Clifford convolution in ResNet architectures, and each Fourier Neural Operator layer with a Clifford Fourier layer in Fourier Neural Operators (FNOs): 12/n

Tweet media one

1

1

20

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

We finally evaluate on Maxwell’s equations in 3 dimensions, which beautifully illustrate the multivector nature of electromagnetism. This is because the magnetic field is the dual (bivector) of the electric field (vector). 13/n

Tweet media one

1

1

17

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

For all these datasets, we experiment with different number of training trajectories. Clifford neural layers consistently improve generalization capabilities of the tested architectures.

1

2

16

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

But… in the end, it seems we have just touched the tip of the iceberg when it comes to marrying geometric algebra and deep learning. There is much more to harness :)

3

2

43

@andrewwhite01

Andrew White 🐦‍⬛/acc

2 years

@jo_brandstetter How does this compare with @mpspells 's work on Clifford Algebra with attention for geometric deep learning?

Tweet card media

Geometric Algebra Attention Networks for Small Point Clouds

Much of the success of deep learning is drawn from building architectures that properly respect underlying symmetry and structure in the data on which they operate - a set of considerations that...

1

0

11

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

@andrewwhite01 @mpspells Thank you very much for forwarding this paper. We did our best to cite as much as possible, seems we overlooked this important one. This is a nice piece of work! @mpspells builds multivectors by applying vector-vector operations, e.g. uses the cross-product to obtain 1/2

1

0

10

@alkalait

Freddie Kalaitzis

2 years

@jo_brandstetter I look forward to Clifford Algebras being a hot topic of research in ML. I've been looking for the past year but from a basic data modeling lens. A big limitation is the exponential growth of dimension in degree. Do you restrict yourself up to 3-vectors?

@alkalait

Freddie Kalaitzis

2 years

Linear systems are the linchpin of many problems in science & engineering What if I told you they are solved error-free by extending their geometry, in terms of oriented • lines (vectors) • areas (bivectors) • volumes (trivectors) • etc. Case in point: an N×2 linear system

Tweet media one

2

2

10

1

0

3

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

2 years

@alkalait We need less channels if we use multivector objects. In 2D and 3D experiments, Clifford networks performed better for the same nr of params. For 4D and higher, you are right, there might be a tradeoff which needs to be explored :)

0

0

2

@zhrli

Zhaorui Li

1 year

@jo_brandstetter any relation between this and climax?

1

0

0

@jo_brandstetter

Johannes Brandstetter

@jo_brandstetter

1 year

@zhrli Besides the shared area of fluid dynamics so far little... yet :)

0

0

1