björn
120.1K posts

björn
@bj2rn
working on pretraining and model scaling. now @smallco, prev @bigco. freedom lover. guobaorou enthusiast.


i do not think i understood how large london is. this is a very large city.

I have a small language model and it’s been pre trained. Now I post train it to say “I’m a language model”. With no mention of openAI The trained model still ends up saying it’s a LLM made by openAI. Even tho OpenAI is never mentioned in the instruction tuning dataset, and there in fact is one sample that says “I am being developed by Anthropic” (not true)! Makes me think models saying they’re made by so and so is pretty weak evidence of copying /stealing / distillation.



felt the opposite moving here from seoul. didn’t expect it to feel so small, a cute little town

i do not think i understood how large london is. this is a very large city.

Korea Air got Brood War as an inflight entertainment option in case anyone was wondering

BTS RM Quote resurfaces, causes concern: "We don't want to change our identity to get to Number One. If we sing in Full English, then that's not BTS' Their New Song “SWIM” is a fully English song, however it's topping all Korea charts.

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

Renewing my US Passport. The old design (on the right) is full of color and civic pride. The new one (left) is just transparently a sad little zogpass.



What if a world model could render not an imagined place, but the actual city? We introduce Seoul World Model, the first world simulation model grounded in a real-world metropolis. TL;DR: We made a world model RAG over millions of street-views. proj: seoul-world-model.github.io















