AndrewM_Webb Profile Banner
Andrew M. Webb Profile
Andrew M. Webb

@AndrewM_Webb

Followers
13K
Following
4K
Media
348
Statuses
2K

Machine learning & pop sci animations | 𝗧𝗼𝗽 π˜π˜„π—²π—²π˜π˜€: https://t.co/aEXwFoJKus

Manchester, England
Joined May 2015
Don't wanna be here? Send us removal request.
@AndrewM_Webb
Andrew M. Webb
4 years
If you have a distribution, with density p, that you don't know how to sample from, you can still estimate integrals like ∫f(x)p(x)dx by sampling:. Sample from a different distribution, with density q, then weight your samples by p(x) / q(x). This is called importance sampling
7
121
918
@AndrewM_Webb
Andrew M. Webb
1 year
I'm speaking at @developconf in July with @vTimeNet about using AI to aid the development of a top rated game on a modest budget, using our recent game Border Bots VR as a case study. Details of the session here: #ImASpeaker #developconf
Tweet media one
0
2
2
@grok
Grok
2 days
Generate videos in just a few seconds. Try Grok Imagine, free for a limited time.
600
2K
8K
@AndrewM_Webb
Andrew M. Webb
3 years
RT @EchoStatements: New blog post: Visualising the Legendre Transform. In which, I show how the Legendre transform can be derived in a visu….
0
5
0
@AndrewM_Webb
Andrew M. Webb
3 years
I hope either somebody finds this useful or there's an easier way that somebody can share with me. Explanation of the problem and the solution on the other site. Hope that's legal.
0
0
0
@AndrewM_Webb
Andrew M. Webb
3 years
It seems (?) like there isn't an easy way to use PyTorch's transformer implementations autoregressively at prediction time efficiently; in each iteration of the loop the whole sequence attends to the whole sequence. Here's a solution I came up with:
Tweet card summary image
gist.github.com
GitHub Gist: instantly share code, notes, and snippets.
2
0
4
@AndrewM_Webb
Andrew M. Webb
4 years
In NumPy/PyTorch if I have some arrays (that may share some axes) that I want to make jointly broadcastable, I use this 'make_broadcastable' function, using einops underneath and the same pattern syntax. Anyone know of an easier/built-in way?.
Tweet media one
1
0
9
@AndrewM_Webb
Andrew M. Webb
4 years
Bonus points to Jan Zacharias for the only publication I could find exposing the 'alleged prime number' as a fraud!.
0
0
5
@AndrewM_Webb
Andrew M. Webb
4 years
A fun example of a falsehood spreading through citation: according to (most of) these 47 papers, 19,349,663 is prime
Tweet media one
2
1
32
@AndrewM_Webb
Andrew M. Webb
4 years
RT @AndrewM_Webb: Let's ride a Martian space elevatorβ€”from Pavonis Mons to areostationary orbit!. The left hand side looks down on the nort….
0
7
0
@AndrewM_Webb
Andrew M. Webb
4 years
Happy Halloween! I decided to get in the spirit and dress as the walking dead. (I've no idea if non-British followers will know who this is)
Tweet media one
4
0
21
@AndrewM_Webb
Andrew M. Webb
4 years
I'm doing a night-time trek up mount Snowdon with friends and we're raising money for Mind, a charity in England & Wales that helps people with mental health problems and raises awareness of mental health. I'd really appreciate any donations or retweets.
Tweet media one
1
6
13
@AndrewM_Webb
Andrew M. Webb
4 years
RT @AndrewM_Webb: A simple visual aid to see why the area of a circle is half the circumference times the radius. Based on a section of @st….
0
298
0
@AndrewM_Webb
Andrew M. Webb
4 years
You can get super efficient estimates of ∫f(x)p(x)dx by, rather than picking a q similar to p, pick one similar to |f(x)|p(x) (suitably normalized). I.e., pick something that has a high density in regions that have a large effect on the integral.
1
0
23
@AndrewM_Webb
Andrew M. Webb
4 years
How efficient this is, in terms of number of samples required, depends on how similar the two distributions are. You usually want the distribution you sample from to have heavier tails than the target distribution: it's better to oversample the tails than not sample them at all.
1
0
30
@AndrewM_Webb
Andrew M. Webb
4 years
For clarity: for each sample xα΅’ you'd compute the weight wα΅’ = p(xα΅’) / q(xα΅’), then your monte carlo estimate of the integral would be Ξ£wα΅’β‹…f(xα΅’).
2
0
18
@AndrewM_Webb
Andrew M. Webb
4 years
(Tweeting at 3 am because I was having coding nightmares).
1
0
4
@AndrewM_Webb
Andrew M. Webb
4 years
It results in not-very-restful sleep and I first experienced this with learning programming, so I still call them 'coding nightmares'.
1
0
6
@AndrewM_Webb
Andrew M. Webb
4 years
If I spend a few hours in a day learning something new and abstract, that night I'll often have non-visual dreams: the 'dream' is just repetitive, nonsensical thoughts. It feels like repeatedly trying out new ideas but on nonsense data. Anyone else experience this?.
@monsoon0
Nalini Joshi
4 years
Do you ever dream of Mathematics? πŸ™‹πŸ½β€β™€οΈ.
4
1
32
@AndrewM_Webb
Andrew M. Webb
4 years
re: Dawkins' science tweet. It's hard to have useful discussions when we use "science" to mean at least three distinct things:. - objective reality.- the process people use to understand reality.- the collection of facts we've come to accept as true by using that process.
6
9
89