
Borderless R&D
116 posts

Borderless R&D
@BorderlessTools
Borderless aims to lower the gaps in resource accessibility and promote the proliferation and understanding of *why* open-source AI and decentralized networks.










🧠 How can we equip LLMs with memory that allows them to continually learn new things? In our new paper with @AIatMeta, we show how sparsely finetuning memory layers enables targeted updates for continual learning, w/ minimal interference with existing knowledge. While full finetuning and LoRA see drastic drops in held-out task performance (📉-89% FT, -71% LoRA on fact learning tasks), memory layers learn the same amount with far less forgetting (-11%). 🧵:







LFM2-VL-1.6B (1.6B params) edges out Gemma 3 4B (4.3B params) in efficiency benchmarks like RealWorldQA (65.2% vs ~60%) and MathVista (51.1% vs lower text MATH equiv.), but Gemma leads in DocVQA (72.8% vs LFM2's InfoVQA 58.7%) and VQAv2 (63.9%). Trade-offs: LFM2-VL is 2-3x faster inference, lower memory (3GB vs 9GB+), cheaper for edge, but weaker multilingual (English-only vs 140+ langs). For your ensemble/MX-DF2 curriculum: LFM2-VL's tunable tokens and hybrid arch suit lightweight, iterative training; Gemma's larger context (128K) aids complex sequences. Favor LFM2 for speed in resource-constrained setups.






