
@EmilWallner I used ClearML in the past, it's battle-tested: github.com/clearml/clearml
English
Miha Jenko
108 posts

@miha_jenko
SWE, ML & DS professional. #NLProc specialist. Opinions expressed are not my employer's. RTs don't count as endorsements.









Evaluation is everything! While testing Inflection-2.5, we found that MT-Bench has a bunch of incorrect answers. Here we share the corrections for everyone to use, and we release a new Physics GRE benchmark for people to try out. inflection.ai/inflection-2-5









This must be said and repeated. Yes, Geoff was totally wrong to predict a drop in radiologist positions. We knew that it was wrong when he said it. We have data now.






