Alex Dimakis
@AlexGDimakis
Followers: 22K · Following: 29K · Media: 239 · Statuses: 4K
Professor, UC Berkeley | Founder @bespokelabsai |
Berkeley, CA
Joined April 2009
I just donated to OpenReview. And I would encourage all those who care about open science to please put their money where their mouth is, and help.
OpenReview is a lifeline for progress in the AI research community, and it urgently needs our increased support. https://t.co/HJJNRcl9km In 2025 alone, OpenReview supported over 1,300 conferences and workshops, served 3.3 million active monthly users, handled over 278,000 paper
0
1
10
As much as I want to say written exams were invented by the Greeks, according to ChatGPT they go back to 600 CE China: the Imperial Examination System (科举, keju) • Began under the Sui dynasty (6th–7th century) • Fully developed by the Tang and Song dynasties • Used paper, brush, and
10
3
97
Critics say Israel is a liability—but facts show it’s America’s strategic shield. These 5 reasons reveal its vital role in defense, intel & stability. Watch the full video to see why Israel matters.
12
24
84
My final exam is today in Berkeley. Pen and paper, in person, all the students try to solve challenging problems. No machines. This ancient method of evaluating students is going to survive in the AI era.
89
167
3K
https://t.co/1mlEpXW6TV A Gemini model seems to be a better research-paper reviewer than most humans in the STOC 2026 experiment, at least as far as correctness is concerned.
research.google
1
5
60
Check out all these great research project releases announced on the last night of NeurIPS 2025, including OpenThoughts-Agent.
The final night of Laude Lounge at NeurIPS 2025 focused on stack-level progress in open frontier AI, featuring: Michael Ryan, @DSPyOSS
@etash_guha, @NeginRaoof_ , Ben Feuer, @ryanmart3n - OpenThoughts-Agent @LakshyAAAgrawal, GEPA @alexgshaw, Harbor @tyler_griggs_ , SkyRL
0
1
7
COLM 2026 is just around the corner! Mark your calendars for: 💡Abstract deadline: Thursday, March 26, 2026 📄Full paper submission deadline: Tuesday, March 31, 2026 Call for papers in thread (website coming soon).
4
23
175
🧵Tired of scrolling through your horribly long model traces in VSCode to figure out why your model failed? We made StringSight to fix this: an automated pipeline for analyzing your model outputs at scale. ➡️Demo: https://t.co/FJ4GAxPIkx ➡️Blog: https://t.co/3AyXBFBEmV
3
35
84
Congratulations to Adam Klivans and all the co-authors for winning the FOCS 2025 Test of Time Award! Their paper was a learning-theory breakthrough: it provided the first efficient algorithm for learning halfspaces when there is adversarial label noise, under distributional
Adam Klivans Wins Test of Time Award at FOCS 2025: https://t.co/Tj5WEy9SNn
0
1
59
Just finished evaluating GPT-5.2 (reasoning high) on Terminal-Bench 2.0. ~on par with Gemini 3.0 Pro and a few points behind Opus 4.5. I've been loving the Terminus-2-only leaderboard filter, 🔗 below!
2
2
18
Remember that result that RL improves math performance even with random rewards? Thankfully, Olmo 3 showed this was due to data contamination. It shows again, as Cameron says, the value of open data for scientific progress in AI.
Easy to miss because it's on the last page of the paper, but Olmo 3 RL-Zero has a really nice sub-section on RL with random rewards! Prior papers (Shao et al - "Spurious Rewards: Rethinking Training Signals in RLVR") show RLVR still improves performance on math problems even
8
19
215
The multiple answers mystery is the most surprising thing we stumbled on from OpenThoughts: Sampling multiple answers for the same question is better than having more questions, each answered once. To explain: Say you are creating a dataset of questions and answers to SFT a
13
26
214
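The budget trade-off in that tweet can be made concrete with a small sketch. This is a hypothetical illustration, not OpenThoughts code: `sample_answer` is a stub standing in for a teacher-model call sampled at nonzero temperature, and the budget split is invented for the example. The point is that for a fixed number of teacher completions, you can go wide (many questions, one answer each) or deep (fewer questions, several sampled answers each):

```python
import random

def sample_answer(question, rng):
    # Stub for a teacher-model call; in practice this would be an LLM
    # sampled at nonzero temperature, so repeated calls differ.
    return f"answer-to-{question}-v{rng.randint(0, 9)}"

def build_sft_dataset(questions, budget, answers_per_question, seed=0):
    """Spend a fixed annotation budget either wide (many questions, one
    answer each) or deep (fewer questions, several answers each)."""
    rng = random.Random(seed)
    n_questions = budget // answers_per_question
    dataset = []
    for q in questions[:n_questions]:
        for _ in range(answers_per_question):
            dataset.append({"question": q, "answer": sample_answer(q, rng)})
    return dataset

questions = [f"q{i}" for i in range(1000)]
wide = build_sft_dataset(questions, budget=128, answers_per_question=1)
deep = build_sft_dataset(questions, budget=128, answers_per_question=4)
# Both datasets cost 128 teacher completions; "wide" covers 128 questions
# once each, while "deep" covers only 32 questions, 4 samples each.
```

The surprising OpenThoughts finding is that, at equal cost, the "deep" configuration trains a better model than the "wide" one.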
I will be presenting GEPA at the FoRLM workshop @ NeurIPS (Foundations of Reasoning in Language Models)! Please drop by Upper Level Room 33ABC (San Diego) between 10-10:15 AM to hear about how prompt optimization can outperform reinforcement learning! https://t.co/aeFnyHO1VX
How does prompt optimization compare to RL algos like GRPO? GRPO needs 1000s of rollouts, but humans can learn from a few trials—by reflecting on what worked & what didn't. Meet GEPA: a reflective prompt optimizer that can outperform GRPO by up to 20% with 35x fewer rollouts!🧵
3
19
97
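The reflect-then-edit idea behind that comparison can be shown with a toy loop. To be clear, this is not GEPA's algorithm or API: GEPA operates on real LLM prompts and execution traces, whereas everything here (`TARGET`, `score`, `reflect`) is invented for the sketch. It only illustrates why reflection is sample-efficient: instead of mutating blindly over many rollouts, each step inspects which behaviors failed and patches exactly those:

```python
TARGET = {"cite sources", "be concise", "show steps"}  # toy graded behaviors

def score(prompt_keywords):
    # Toy stand-in for running rollouts and grading the outputs.
    return len(prompt_keywords & TARGET) / len(TARGET)

def reflect(prompt_keywords):
    # Reflective step: inspect *which* behaviors failed and patch the
    # prompt directly, instead of mutating it at random.
    missing = TARGET - prompt_keywords
    if not missing:
        return prompt_keywords
    return prompt_keywords | {sorted(missing)[0]}

def reflective_optimize(prompt_keywords, max_steps=10):
    steps = 0
    while score(prompt_keywords) < 1.0 and steps < max_steps:
        prompt_keywords = reflect(prompt_keywords)
        steps += 1
    return prompt_keywords, steps

best, steps = reflective_optimize(set())
# Reaches a perfect score in 3 steps, one per missing behavior; a blind
# mutation search over a large keyword pool would need far more trials.
```

The same contrast drives the tweet's claim: GRPO needs thousands of rollouts because its feedback is a scalar reward, while a reflective optimizer extracts much more signal from each trial.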
Important insights from Junyang Lin, tech lead of the Qwen team: “For the next generation model we are probably using this architecture.” Also: “imagine the agent running for 1-2 days and then it’s done and has built your app; memory and long context will be very important.”
10
49
493
Announcing our new project on how to train agents for TerminalBench: OpenThoughts-Agent. We curate SFT data and RL environments and open the full stack, yielding the best model of its size.
How can we make a better TerminalBench agent? Today, we are announcing the OpenThoughts-Agent project. OpenThoughts-Agent v1 is the first TerminalBench agent trained on fully open curated SFT and RL environments. OpenThinker-Agent-v1 is the strongest model of its size on
5
13
112
And this is how you present a poster. Masterful.
A better view and quality: https://t.co/czBeIJEqc5
0
3
19
Taking a step towards building a modular RL framework with our SkyRL project.
✨Release: We upgraded SkyRL into a highly-modular, performant RL framework for training LLMs. We prioritized modularity—easily prototype new algorithms, environments, and training logic with minimal overhead. 🧵👇 Blog: https://t.co/jDvM95F0Bq Code: https://t.co/CWlKue79JH
5
18
74
Measuring agents in production: valuable information on agents from the trenches
Thrilled to release our new paper MAP: Measuring Agents in Production ⚙️🚀 2025 is the year of agents… but do they actually work in the real world? Is it just hype? A group of 25 researchers from Berkeley, Stanford, UIUC, IBM, and Intesa Sanpaolo investigated what makes agents
3
7
31