To all budding compbio & ML folks interested in bio: Don't just only run behind the latest ML model hype train. The greatest long run impact will come by really assimilating prior bio/compbio literature with the goal of really understanding strategies for how to model biology. 1/
And by collaborating closely with biologists developing technology, deeply studying biological systems & diseases to identify the relevant questions in biology & medicine. 2/
This is not to say that u shud not keep pace with the latest ML tech. The tech is amazing & has immense potential. But to make impact in bio, in most cases, u need to understand how to formulate bio problems as ML tasks, 3/
whether the data u have is appropriate for the question & the modeling, what kind of data you might need to answer a question or a family of questions 4/
And I can guarantee you, you do not need ultra large models and insane compute to address a large majority of extremely exciting problems in biology via computational approaches. 6/
Also please respect the data by delving deeply into how it was generated & processed. Be humble & respect the immense body of prior work. But there are plenty of unsolved problems that require deep ML expertise coupled with a willingness to truly embrace biology. 7/7
@anshulkundaje
Given what you know now, How would you start learning ML and DL if you had to start all over again? What would you learn and in what order
@anshulkundaje
Yeah exactly. It really feels like some ML folks see the latest trend as a solution in search of a problem. Tailored solutions to biology requires some understanding of it and collaboration with domain experts who know what’s already been tried before.