
@quantmaxxer @nkreu113r never looked into it in much detail bcs it just doesn’t scale well enough for my applications (I know they have the Nano version of it but haven’t tried it yet). generally I’m still trying to wrap my head around pretraining so don’t rly have an informed opinion















