Andrew M. Webb @AndrewM_Webb X Profile

Andrew M. Webb

@AndrewM_Webb

Followers

13K

Following

4K

Media

348

Statuses

2K

Machine learning & pop sci animations | 𝗧𝗼𝗽 𝘁𝘄𝗲𝗲𝘁𝘀: https://t.co/aEXwFoJKus

Manchester, England

Joined May 2015

Andrew M. Webb

@AndrewM_Webb

4 years

If you have a distribution, with density p, that you don't know how to sample from, you can still estimate integrals like ∫f(x)p(x)dx by sampling: sample from a different distribution, with density q, then weight your samples by p(x) / q(x). This is called importance sampling.

7

121

918
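
A minimal sketch of the idea in the tweet above, using only the Python standard library. The specific choices are assumptions made for illustration and are not from the tweet: the target p is a standard normal, the proposal q is a normal with standard deviation 2, and f(x) = x², so the integral being estimated equals 1. The helper names `normal_pdf` and `importance_estimate` are hypothetical.

```python
import math
import random


def normal_pdf(x, mean, std):
    """Density of a normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def importance_estimate(f, p_pdf, q_pdf, q_sample, n, rng):
    """Estimate the integral of f(x) p(x) dx using n draws from q."""
    total = 0.0
    for _ in range(n):
        x = q_sample(rng)
        # Weight each sample by p(x) / q(x) to correct for sampling from q.
        total += f(x) * p_pdf(x) / q_pdf(x)
    return total / n


rng = random.Random(0)
estimate = importance_estimate(
    f=lambda x: x * x,                        # E_p[x^2] is 1 for a standard normal
    p_pdf=lambda x: normal_pdf(x, 0.0, 1.0),  # assumed target density p
    q_pdf=lambda x: normal_pdf(x, 0.0, 2.0),  # assumed proposal density q
    q_sample=lambda r: r.gauss(0.0, 2.0),     # draws from q
    n=50_000,
    rng=rng,
)
print(estimate)  # should land close to 1.0
```

In this toy case p is easy to sample from directly; it stands in for a density that can be evaluated but not sampled.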

Andrew M. Webb

@AndrewM_Webb

1 year

I'm speaking at @developconf in July with @vTimeNet about using AI to aid the development of a top rated game on a modest budget, using our recent game Border Bots VR as a case study. Details of the session here: #ImASpeaker #developconf

0

2

Andrew M. Webb

@AndrewM_Webb

3 years

RT @EchoStatements: New blog post: Visualising the Legendre Transform. In which, I show how the Legendre transform can be derived in a visu…

0

5

0

Andrew M. Webb

@AndrewM_Webb

3 years

I hope either somebody finds this useful or there's an easier way that somebody can share with me. Explanation of the problem and the solution on the other site. Hope that's legal.

0

Andrew M. Webb

@AndrewM_Webb

3 years

It seems (?) like there isn't an easy way to use PyTorch's transformer implementations autoregressively at prediction time efficiently; in each iteration of the loop the whole sequence attends to the whole sequence. Here's a solution I came up with:

gist.github.com

2

0

4
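
The gist itself is not reproduced here, so the following is not the author's solution. It is a rough pure-Python sketch of the general caching idea that addresses the problem described in the tweet: keep past keys and values around so each decoding step only attends from the newest token. As simplifying assumptions, it uses a single attention head and lets each token vector serve as its own query, key, and value; the names `attend`, `full_causal_attention`, and `IncrementalAttention` are hypothetical.

```python
import math


def attend(query, keys, values):
    """Single-head scaled dot-product attention for one query vector."""
    scale = 1.0 / math.sqrt(len(query))
    scores = [scale * sum(q * k for q, k in zip(query, key)) for key in keys]
    top = max(scores)  # subtract the max for a numerically stable softmax
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    dim = len(values[0])
    return [sum(e * v[d] for e, v in zip(exps, values)) / total for d in range(dim)]


def full_causal_attention(xs):
    """Recompute every position from scratch, as in a naive decoding loop."""
    return [attend(xs[t], xs[: t + 1], xs[: t + 1]) for t in range(len(xs))]


class IncrementalAttention:
    """Cache past keys/values so each step attends only from the newest token."""

    def __init__(self):
        self.keys = []
        self.values = []

    def step(self, x):
        self.keys.append(x)
        self.values.append(x)
        return attend(x, self.keys, self.values)


tokens = [[0.1, 0.2], [0.3, -0.1], [0.5, 0.4]]
cache = IncrementalAttention()
incremental = [cache.step(x) for x in tokens]
full = full_causal_attention(tokens)
print(incremental[-1], full[-1])  # the two last-position outputs agree
```

The cached version does one query per step instead of re-running all earlier queries, which is where the saving comes from.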

Andrew M. Webb

@AndrewM_Webb

4 years

In NumPy/PyTorch if I have some arrays (that may share some axes) that I want to make jointly broadcastable, I use this 'make_broadcastable' function, using einops underneath and the same pattern syntax. Anyone know of an easier/built-in way?

1

0

9
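
The 'make_broadcastable' function from the tweet is not shown in the text, so this is only a guess at the shape bookkeeping such a helper might do. It is a pure-Python sketch that does not use einops: given an einops-like pattern (the exact pattern syntax here is an assumption) and the input shapes, the hypothetical `broadcastable_shapes` returns the shape each array would be reshaped to, with size-1 axes inserted where an array lacks an output axis.

```python
def broadcastable_shapes(pattern, *shapes):
    """Return reshape targets that make the inputs jointly broadcastable.

    Example pattern (assumed syntax): "a b, b c -> a b c".
    """
    inputs, output = pattern.split("->")
    out_axes = output.split()
    input_axes = [part.split() for part in inputs.split(",")]
    if len(input_axes) != len(shapes):
        raise ValueError("one shape is needed per input in the pattern")

    reshaped = []
    for axes, shape in zip(input_axes, shapes):
        if len(axes) != len(shape):
            raise ValueError(f"axes {axes} do not match shape {shape}")
        # This sketch only inserts axes; it does not transpose existing ones.
        if axes != [a for a in out_axes if a in axes]:
            raise ValueError("input axes must appear in output order")
        sizes = dict(zip(axes, shape))
        reshaped.append(tuple(sizes.get(a, 1) for a in out_axes))
    return reshaped


targets = broadcastable_shapes("a b, b c -> a b c", (2, 3), (3, 4))
print(targets)  # [(2, 3, 1), (1, 3, 4)]
```

Reshaping each array to its target shape would then let ordinary NumPy or PyTorch broadcasting combine them along the shared axes.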

Andrew M. Webb

@AndrewM_Webb

4 years

Bonus points to Jan Zacharias for the only publication I could find exposing the 'alleged prime number' as a fraud!

0

5

Andrew M. Webb

@AndrewM_Webb

4 years

A fun example of a falsehood spreading through citation: according to (most of) these 47 papers, 19,349,663 is prime

2

1

32
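
The claim is easy to check by trial division. A short sketch (the function name `smallest_factor` is hypothetical) that finds the smallest factor of 19,349,663:

```python
def smallest_factor(n):
    """Return the smallest factor of n greater than 1 (n itself if n is prime)."""
    if n < 2:
        raise ValueError("n must be at least 2")
    if n % 2 == 0:
        return 2
    divisor = 3
    while divisor * divisor <= n:
        if n % divisor == 0:
            return divisor
        divisor += 2  # even divisors were ruled out above
    return n


n = 19_349_663
factor = smallest_factor(n)
print(factor, n // factor)  # 41 471943
```

Since 41 × 471,943 = 19,349,663, the number is composite.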

Andrew M. Webb

@AndrewM_Webb

4 years

RT @AndrewM_Webb: Let's ride a Martian space elevator—from Pavonis Mons to areostationary orbit! The left hand side looks down on the nort…

0

7

0

Andrew M. Webb

@AndrewM_Webb

4 years

Happy Halloween! I decided to get in the spirit and dress as the walking dead. (I've no idea if non-British followers will know who this is)

4

0

21

Andrew M. Webb

@AndrewM_Webb

4 years

I'm doing a night-time trek up Mount Snowdon with friends and we're raising money for Mind, a charity in England & Wales that helps people with mental health problems and raises awareness of mental health. I'd really appreciate any donations or retweets.

1

6

13

Andrew M. Webb

@AndrewM_Webb

4 years

RT @AndrewM_Webb: A simple visual aid to see why the area of a circle is half the circumference times the radius. Based on a section of @st…

0

298

0

Andrew M. Webb

@AndrewM_Webb

4 years

You can get super efficient estimates of ∫f(x)p(x)dx by picking a q that is similar to |f(x)|p(x) (suitably normalized), rather than one similar to p. I.e., pick something that has a high density in regions that have a large effect on the integral.

1

0

23
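
A small sketch of this point, under assumptions chosen for illustration and not taken from the tweet: p is a standard normal and f(x) is 1 when x > 3 and 0 otherwise, so the integral is the tail probability P(X > 3). The proposal q is a shifted exponential that only puts mass above 3, roughly where f(x)p(x) is non-zero. The function names are hypothetical.

```python
import math
import random

THRESHOLD = 3.0  # f(x) = 1 if x > THRESHOLD else 0 (assumed example)
RATE = 3.0       # rate of the shifted-exponential proposal (assumed)


def normal_pdf(x):
    """Standard normal density, the target p in this example."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def tail_estimate_tilted(n, rng):
    """Importance sampling with q(x) = RATE * exp(-RATE * (x - THRESHOLD))."""
    total = 0.0
    for _ in range(n):
        x = THRESHOLD + rng.expovariate(RATE)
        q = RATE * math.exp(-RATE * (x - THRESHOLD))
        total += normal_pdf(x) / q  # f(x) is 1 everywhere q can land
    return total / n


def tail_estimate_naive(n, rng):
    """Plain Monte Carlo: sample from p and count how often x > THRESHOLD."""
    hits = sum(1 for _ in range(n) if rng.gauss(0.0, 1.0) > THRESHOLD)
    return hits / n


exact = 0.5 * math.erfc(THRESHOLD / math.sqrt(2.0))
tilted = tail_estimate_tilted(20_000, random.Random(1))
naive = tail_estimate_naive(20_000, random.Random(1))
print(exact, tilted, naive)
```

With the same number of samples, the naive estimate rests on only a handful of draws that happen to exceed 3, while every draw from the tilted proposal contributes.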

Andrew M. Webb

@AndrewM_Webb

4 years

How efficient this is, in terms of number of samples required, depends on how similar the two distributions are. You usually want the distribution you sample from to have heavier tails than the target distribution: it's better to oversample the tails than not sample them at all.

1

0

30
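
One way to see the effect of the tails is to look at the importance weights directly. In this sketch the target p is a standard normal, and two proposals are compared: a narrower normal (standard deviation 0.5) and a wider one (standard deviation 2). These choices, and the use of effective sample size, (Σw)² / Σw², as the diagnostic, are assumptions added for illustration; the helper names are hypothetical.

```python
import math
import random


def normal_pdf(x, std):
    """Density of a zero-mean normal with the given standard deviation."""
    return math.exp(-0.5 * (x / std) ** 2) / (std * math.sqrt(2.0 * math.pi))


def weights_for(q_std, n, rng):
    """Importance weights p(x) / q(x) for n draws from a normal proposal."""
    weights = []
    for _ in range(n):
        x = rng.gauss(0.0, q_std)
        weights.append(normal_pdf(x, 1.0) / normal_pdf(x, q_std))
    return weights


def effective_sample_size(weights):
    """(sum w)^2 / sum w^2: roughly how many equally weighted samples you have."""
    total = sum(weights)
    return total * total / sum(w * w for w in weights)


rng = random.Random(0)
narrow = weights_for(0.5, 10_000, rng)  # proposal with lighter tails than p
wide = weights_for(2.0, 10_000, rng)    # proposal with heavier tails than p

print(max(narrow), effective_sample_size(narrow))
print(max(wide), effective_sample_size(wide))
```

With the wide proposal the weights can never exceed 2, so no single sample dominates. With the narrow proposal, rare draws far from zero get very large weights and the effective sample size drops.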

Andrew M. Webb

@AndrewM_Webb

4 years

For clarity: for each sample xᵢ you'd compute the weight wᵢ = p(xᵢ) / q(xᵢ), then your Monte Carlo estimate of the integral would be (1/N) Σwᵢ⋅f(xᵢ), where N is the number of samples.

2

0

18

Andrew M. Webb

@AndrewM_Webb

4 years

(Tweeting at 3 am because I was having coding nightmares).

1

0

4

Andrew M. Webb

@AndrewM_Webb

4 years

It results in not-very-restful sleep and I first experienced this with learning programming, so I still call them 'coding nightmares'.

1

0

6

Andrew M. Webb

@AndrewM_Webb

4 years

If I spend a few hours in a day learning something new and abstract, that night I'll often have non-visual dreams: the 'dream' is just repetitive, nonsensical thoughts. It feels like repeatedly trying out new ideas but on nonsense data. Anyone else experience this?

Nalini Joshi

@monsoon0

4 years

Do you ever dream of Mathematics? 🙋🏽‍♀️

4

1

32

Andrew M. Webb

@AndrewM_Webb

4 years

re: Dawkins' science tweet. It's hard to have useful discussions when we use "science" to mean at least three distinct things:
- objective reality
- the process people use to understand reality
- the collection of facts we've come to accept as true by using that process

6

9

89