Andrei Zinovyev

17 posts

Andrei Zinovyev banner
Andrei Zinovyev

Andrei Zinovyev

@andrei_zinovyev

Group leader in silico R&D @evotec, chair @InstitutPrairie Computational Systems Biology, ML/AI methods and application, Omics

Paris, France Katılım Eylül 2022
66 Takip Edilen53 Takipçiler
Andrei Zinovyev retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
LLM model size competition is intensifying… backwards! My bet is that we'll see models that "think" very well and reliably that are very very small. There is most likely a setting even of GPT-2 parameters for which most people will consider GPT-2 "smart". The reason current models are so large is because we're still being very wasteful during training - we're asking them to memorize the internet and, remarkably, they do and can e.g. recite SHA hashes of common numbers, or recall really esoteric facts. (Actually LLMs are really good at memorization, qualitatively a lot better than humans, sometimes needing just a single update to remember a lot of detail for a long time). But imagine if you were going to be tested, closed book, on reciting arbitrary passages of the internet given the first few words. This is the standard (pre)training objective for models today. The reason doing better is hard is because demonstrations of thinking are "entangled" with knowledge, in the training data. Therefore, the models have to first get larger before they can get smaller, because we need their (automated) help to refactor and mold the training data into ideal, synthetic formats. It's a staircase of improvement - of one model helping to generate the training data for next, until we're left with "perfect training set". When you train GPT-2 on it, it will be a really strong / smart model by today's standards. Maybe the MMLU will be a bit lower because it won't remember all of its chemistry perfectly. Maybe it needs to look something up once in a while to make sure.
Artificial Analysis@ArtificialAnlys

GPT-4o Mini, announced today, is very impressive for how cheap it is being offered 👀 With a MMLU score of 82% (reported by TechCrunch), it surpasses the quality of other smaller models including Gemini 1.5 Flash (79%) and Claude 3 Haiku (75%). What is particularly exciting is that it is also to be offered at a cheaper price than these models. The reported price is $0.15/1M input tokens and $0.6/1M output tokens. With such a cheap price for input tokens and its large 128k context window, it will be very compelling for long context use-cases (including large document RAG). @OpenAI have clearly made a very high quality model relative to its size (pricing can indicate size due to the direct relationship to compute cost). The model seems a worthy successor to GPT3.5 Turbo as OpenAI's smallest model and the model used for ChatGPT's free version.

English
189
918
7.5K
1.4M
Andrei Zinovyev retweetledi
SysBioCurie
SysBioCurie@SysBioCurie·
Are you interested in joining @sysbiocurie in Paris, and applying machine learning methodologies for fighting cancer ? We are looking for a PhD student to develop new concepts and tools for analyzing multi-modal and spatial omics data of cancer patients. sysbio.curie.fr/texts/PhdCompu…
English
0
9
17
1.8K
Andrei Zinovyev
Andrei Zinovyev@andrei_zinovyev·
@YannPonty Fortunately for me, Pascal was my third or the fourth programming language, but I did my PhD on it (and still use some code from that time)
English
0
0
0
21
Andrei Zinovyev retweetledi
Dr Anna Niarakis
Dr Anna Niarakis@Annaerial·
We will be in Basel in September, with @ArnauMontagud & #Laurence Calzone to organise a pretty cool workshop at [BC]2 @BC2Conference ! Think about joining us!
[BC]2@BC2Conference

@LS2Switzerland @CampusBiotech @biozentrum @GfellerD @zkutalik @BioAlps @SwissBiotech @ELIXIREurope @P_Palagi @alexlederer19 @AiswwaryaPrasad Interested in mechanistic and #AI digital twins in #personalizedmedicine? Join this full-day workshop at @bc2basel, to learn about this emerging and promising concept of #digitaltwins and discuss the state of the art in the field. 👉Register here: tinyurl.com/bdhbr4dp

English
0
5
14
814
Andrei Zinovyev retweetledi
Itai Yanai
Itai Yanai@ItaiYanai·
New paper! It’s absolutely wild what you see when studying bacteria using single-cell RNA-Seq! Unlike in eukaryotes, we found a genome-scale pattern of correlations among genes showing a strong dependency on global chromosomal locations. biorxiv.org/content/10.110… @AndrewPountain
Itai Yanai tweet media
English
17
195
726
0
Andrei Zinovyev retweetledi
Marco Ruscone, PhD
Marco Ruscone, PhD@MRuscone·
A great poster session yesterday at #ECCB2022! There was also @njmmatthieu presenting his poster "Construction of the cystic fibrosis biological network from a comparative analysis of transcriptomic studies"! Looking forward for today's session! 💪🏻
Marco Ruscone, PhD tweet media
English
1
4
18
0
Dr Anna Niarakis
Dr Anna Niarakis@Annaerial·
Beautiful Barcelona <3...I can't believe it's been almost a decade since my last visit...
Dr Anna Niarakis tweet mediaDr Anna Niarakis tweet mediaDr Anna Niarakis tweet mediaDr Anna Niarakis tweet media
English
1
0
15
0
Andrei Zinovyev retweetledi
PerMedCoE
PerMedCoE@PerMedCoE·
Multiscale modeling allows to study the different modes of cancer cell invasion 🗣️Marco Ruscone, Arnau Montagud, Philippe Chavrier, Olivier Destaing, Andrei Zinovyev, Emmanuel Barillot, Vincent Noël and Laurence Calzone
PerMedCoE tweet media
Français
1
2
6
0
Andrei Zinovyev retweetledi
nature
nature@Nature·
Lab leaders wrestle with paucity of postdocs #Echobox=1661901810" target="_blank" rel="nofollow noopener">nature.com/articles/d4158…
English
41
140
341
0