Mathurin Dorel
@MathSRIsh
Bioinformatician, Cellular Biologist, Techbio Founder. Collecting data in a complex world @[email protected] @mathsrish.bsky.social Also #boardgames

Enterprise clinical NLP: $100K+ bundled contracts. Need drug extraction? Buy the whole suite. 12 OpenMed NER models. Drugs. Diseases. Procedures. Anatomy. Chemicals. Genes. Pick one or all. Apache 2.0. $0. Which entity type matters most in your work?



Extremely interesting discussion of what it means for a cell to be alive vs. dead in this paper!


Some people at frontier AI labs told me they believe startups are over. OpenAI, Anthropic, Google, xAI will absorb every industry as AGI nears. Coding today, science, medicine, and finance next. Then everything else. If they’re right, that’s a pretty boring end of the world.



Reasons to be pessimistic (and optimistic) on the future of biosecurity
owlposting.com/p/reasons-to-b…
"It was such a fun read (if you can say that about an article on weapons)!" - a glowing review from an early reader
this is (once again) the longest article I have ever published at 13,000 words. it involves interviews with 16+ researchers/VC's/policy folks in this field, and discusses basically every single facet of biosecurity that i could find.
topics include: how machine-learning in rapid response therapeutic design may work, the financial status of the customer base of biosecurity startups, why agroterrorism feels extremely likely to me, and a lot more
i admittedly started the essay pessimistic that this subject matters at all, and i end it surprised that it doesn't keep more people awake at night. im not a doomer about it all, but i can see how people become one. very grateful to the people who decide to spend their career (or some fraction of it) working here, and especially grateful to the ones who helped teach me about the subject




Using claude code to directly control a liquid handling robot is such a crazy experience

Very well put! AI is real. It needs data. LLMs have access to all our writing etc. Biology does not have an equivalent corpus of high-quality data that spans the dynamics we're proposing to solve. Diseases and aging occur at the level of organs and organisms, and we need data there to simulate it. Status quo won't get us there in a few years. But we can act! Identify the most important data that can't be accelerated, and start collecting it now so we can leverage AI for longevity as early as possible. We are setting up an @impetusgrants focus on AI-enabling datasets specifically.
