Ismail Salim

37 posts

Ismail Salim

@IssySalim

Co-founder/CEO @RunLocalAI (YC S24). Making it easier to ship better on-device AI. Also into house/techno from early 90's/00's,

Katılım Haziran 2020

427 Takip Edilen139 Takipçiler

Ismail Salim@IssySalim·14 Tem

Amazing video from @juliarturc 🙌 The best explanation you'll find for legacy, K & I quants in GGUF/llama.cpp: youtube.com/watch?v=vW30o4… Also nice to hear a nod to why we exist: “The challenge is choosing the right quant settings. This depends on your model, target hardware and trade-offs between quality and speed."

YouTube

English

305

Ismail Salim@IssySalim·14 Tem

@ThatSonti @browserOS_ai Awesome!

English

105

Nikhil Sonti@nv_sonti·10 Tem

Introducing @BrowserOS_ai – an open-source agentic browser, an alternative to Perplexity Comet. We believe browsers will become the new OS, where we offload work to AI agents that'll have access to all your sensitive data. Open-source, privacy-first alternatives need to exist.

English

265

101.8K

Ismail Salim@IssySalim·20 Haz

@joekndy Insane! Congrats 🙌

Español

Joe Kennedy@joekndy·20 Haz

Omg

252

17.3K

Ismail Salim@IssySalim·13 Haz

@fleetwood___ 🐐

QME

Ismail Salim@IssySalim·18 May

@idode_k Very cool - looks super useful. Definitely going to share with my recruiter friends!

English

202

idode@idode_k·18 May

i built an agent that helps recruiters source candidates. reply if you'd like to try it out.

English

7.5K

Ismail Salim@IssySalim·15 May

@joekndy Very cool idea - right up my street. Interesting you're deploying local models for iOS too. We pretty much spend all day every day doing that. Just DM'd you

English

Joe Kennedy@joekndy·14 May

Mixy looks simple but it’s the hardest thing my cofounder and I have built in 17 years of working together All proprietary models running directly on the phone, on top of a highly performant audio engine. Mixy is built to last and be the only tool for making music you’ll need

English

169

16.8K

Ismail Salim@IssySalim·1 Nis

@Aussie_Pete @ReallyEpicTuts @Blackmagic_News @Aussie_Pete - You are the 🐐 for DaVinci customer support. (Just sent you a DM btw!)

English

Peter Chamberlain@Aussie_Pete·24 Eki

@ReallyEpicTuts @Blackmagic_News This is a known issue with iOS 18 on the iPhone 16. We are in discussion with Apple about this. For now, if you need stabilisation and you see this jump in standard mode, try cinematic mode, being aware that LUTs are not shown in the preview but the recording should be correct.

English

121

Blackmagic Design@Blackmagic_News·24 Eki

Blackmagic Camera for iOS 2.1 Update! Get support for using new iPhone 16 camera control features with Blackmagic Camera, improved Blackmagic Cloud organisation syncing and new recording bit rate options for H.264 and H.265 codecs. Download now from apps.apple.com/us/app/blackma…

English

9.3K

Ismail Salim@IssySalim·1 Nis

@rohit_bmd @OliCoDev Awesome customer support @rohit_bmd 💪 (Just DM'd you btw)

English

Rohit Gupta@rohit_bmd·22 Mar

@OliCoDev This is a limitation of Windows Media Player with Windows 10. When you render there is an option of compatibility with Windows Media Player on Deliver page.

English

OliCo@OliCoDev·22 Mar

I'm not exactly sure what is going on with Davinci Resolve right now, but whenever I update it to the latest version and try to export a simple video, for some reason it feels like the audio and video are both desynced from one another by a quarter of a second.

English

Ismail Salim@IssySalim·28 Şub

@awnihannun We'll be staying tuned at @RunLocalAI :)

English

140

Awni Hannun@awnihannun·28 Şub

I'm committing to only use local LLMs for the next few weeks to get a real vibe-check on the gap in perf between closed/server-side and open/local (powered by MLX of course). My favorite tools for that right now are: - The raw terminal (mlx_lm.generate / mlx_lm.chat) - LM Studio

English

320

28.6K

Ismail Salim@IssySalim·25 Şub

@GavinSBaker @GavinSBaker - What are your thoughts on demand for on-device/local inference as general inference demand grows? (Both in absolute terms and proportionally to server-side)

English

328

Gavin Baker@GavinSBaker·24 Şub

Shifting from a pre-training centric world to an inference centric world is likely positive for compute overall. Intelligence may scale even better with test-time compute (inference) than it does with pre-training per the charts below. The balance of compute always had to move from pre-training to inference to generate an “ROI on AI.” Just going to happen a lot faster than expected. And while shifting to a test-time compute, “inference first” world is probably good for compute demand, this shift does change the type of compute. And this has an impact on who wins and who loses from a supplier perspective. More 50-100 megawatt datacenters geospatially and cost-optimized for inference. More inference “Hondas.” Fewer 1 gigawatt plus datacenters (which can be anywhere) with the networking, storage, and cooling (which enables density which simplifies networking while increasing potential cluster size) technologies necessary for coherence. Less pre-training “Ferraris.” And the number of companies doing pre-training in a “Ferrari” likely steadily shrinks over time. Satya explained this in the clearest way possible in his most recent podcast. All the back and forth about the Cowen note vs. Microsoft IR commentary in Australia is missing the forest for the trees - the CEO literally just told you he was going to shift investments away from pre-training focused compute to inference optimized compute, which he noted was different! Also Grok-3 voice mode is epic.

English

427

113.4K

Ismail Salim@IssySalim·22 Şub

@idode_k 🔥🔥🔥

QME

idode@idode_k·21 Şub

built a lil prototype this week! if interested in trying out let me know and i'll send you the test flight 🫡

idode@idode_k

what would pocket, kindle or audible look like as a completely ai native app? i've been playing around with a new ai native e-reader prototype that i built. - ask questions using your voice and get responses in voice and text - the llm has context to the text your reading + other content that you've saved this prototype uses openai's realtime api and the text i'm reading here is @simonw's "things we learned about llm's in 2024".

English

735

Ismail Salim retweetledi

merve@mervenoyann·22 Şub

I can't wait to try more local models 🤠

English

3.1K

Ismail Salim@IssySalim·22 Şub

@mervenoyann Thoughts on OLMoE??

English

165

merve@mervenoyann·22 Şub

I was actually serious 😂

merve@mervenoyann

@allen_ai @soldni brb buying an iPhone 16 Pro

English

149

15.7K

Ismail Salim@IssySalim·21 Şub

@fleetwood___ 🐐

QME

Ismail Salim@IssySalim·20 Şub

🔥 VLMs on mobile devices with world-facing cameras key for proactive, intelligent computing. Local/on-device inference key for real-time, private experiences. Great to see an emphasis on smaller VLMs. Excited to see where @huggingface, @moondreamai, etc. take things 🚀

Miquel Farré@micuelll

Holy shit! Did we just open-source the smallest video-LM in the world? SmolVLM2 is runnning natively on your iPhone 🚀 huggingface.co/blog/smolvlm2

English

839

Ismail Salim@IssySalim·19 Şub

@huybery Hey @huybery - Just DM'd you on Twitter about obtaining a commercial license for Qwen2.5-3B. Please let me know if you can help :)

English

Binyuan Hui@huybery·19 Şub

👏🏻Great to see such an outstanding work based on Qwen. Thanks to the Sailor Team for continuously advancing LLM democratization!

Longxu Dou@LongxuDou

🚀 Excited to share our technical report on the Southeast Asian multilingual model Sailor2 and its latest updates! Our 49-page report details Sailor2's development journey, including multilingual data cleaning, small model data mixture simulations, multi-stage continual pre-training, multi-stage post-training, and multi-cultural multi-lingual evaluations. Sailor2 aims to streamline the multilingual model pre-training process efficiently for the community. 🧭 We highlight Sailor2's impressive performance in low-resource language translation scenarios and its cultural understanding advantages in Southeast Asia, promoting practical applications for regional languages. Model updates include: 💡 More precise outputs: Reduced redundancy in model outputs through refined post-training data and optimization techniques. 🌈 Handling longer texts: Expanded to handle up to 128K context length in Southeast Asian languages through long-text training. ⚡️ Faster inference: Achieved 2.5x faster inference speed with speculative decoding. 🌪️ More model sizes: Introduced new sizes of 3B and 14B through model pruning. 🌟 All models are Apache-licensed for commercial use; development tools (code, resources) are open-source. 📚 Technical report: huggingface.co/papers/2502.12… 🤖️ Models: huggingface.co/collections/sa… 💬 Demo: huggingface.co/spaces/sail/Sa… 📣 Sailor2 community: huggingface.co/sailor2

English

140

11.2K

Ismail Salim@IssySalim·19 Şub

@JustinLin610 Hey @JustinLin610 - Just messaged you on Twitter DM about obtaining a commercial license for Qwen2.5-3B. Please let me know if you can help :)

English

Junyang Lin@JustinLin610·19 Şub

Pretty cool to see our latest vl model with rag!

Akshay 🚀@akshay_pachaar

Let's build a multimodal RAG app using Qwen2.5-VL Max (100% local):

English

5.6K

Ismail Salim@IssySalim·17 Şub

Snap's announcement about their on-device text-to-image model seems to have slipped under the radar… Apparently, it generates 1024x1024 images with quality that's comparable to cloud-oriented models like Stable Diffusion XL. But it can do that locally on an iPhone 16 Pro Max in <1.5 seconds! 🤯 Snap are planning to ship it soon to their ~450m daily active users, and I wouldn't be surprised if it's free. I wonder how all these subscription-driven, cloud-based image generation apps will respond… Announcement: newsroom.snap.com/ai-text-to-ima… Paper: arxiv.org/abs/2412.09619

English

367

Ismail Salim@IssySalim·17 Şub

@Snap's announcement about their on-device text-to-image model seems to have slipped under the radar… Apparently, it generates 1024x1024 images with quality that's comparable to cloud-oriented models like Stable Diffusion XL. But it can do that locally on an iPhone 16 Pro Max in <1.5 seconds! 🤯 Snap are planning to ship it soon to their ~450m daily active users, and I wouldn't be surprised if it's free. I wonder how all these subscription-driven, cloud-based image generation apps will respond… Announcement: newsroom.snap.com/ai-text-to-ima… Paper: arxiv.org/abs/2412.09619

English

Ismail Salim@IssySalim·16 Şub

Short but sweet talk about the WebNN API: youtube.com/watch?v=FoYBWz… Def worth checking out the YouTube playlist from @jason_mayes WebAI Summit last year. It's packed with great talks! Looking forward to the next summit!

YouTube

English

301

Keşfet

@juliarturc @browserOS_ai @BrowserOS_ai @joekndy @fleetwood___ @idode_k @Aussie_Pete @ReallyEpicTuts