Ismail Salim

37 posts

Ismail Salim

Ismail Salim

@IssySalim

Co-founder/CEO @RunLocalAI (YC S24). Making it easier to ship better on-device AI. Also into house/techno from early 90's/00's,

Katılım Haziran 2020
427 Takip Edilen139 Takipçiler
Ismail Salim
Ismail Salim@IssySalim·
Amazing video from @juliarturc 🙌 The best explanation you'll find for legacy, K & I quants in GGUF/llama.cpp: youtube.com/watch?v=vW30o4… Also nice to hear a nod to why we exist: “The challenge is choosing the right quant settings. This depends on your model, target hardware and trade-offs between quality and speed."
YouTube video
YouTube
English
0
0
5
305
Nikhil Sonti
Nikhil Sonti@nv_sonti·
Introducing @BrowserOS_ai – an open-source agentic browser, an alternative to Perplexity Comet. We believe browsers will become the new OS, where we offload work to AI agents that'll have access to all your sensitive data. Open-source, privacy-first alternatives need to exist.
English
11
26
265
101.8K
Ismail Salim
Ismail Salim@IssySalim·
@idode_k Very cool - looks super useful. Definitely going to share with my recruiter friends!
English
1
0
0
202
idode
idode@idode_k·
i built an agent that helps recruiters source candidates. reply if you'd like to try it out.
English
20
6
74
7.5K
Ismail Salim
Ismail Salim@IssySalim·
@joekndy Very cool idea - right up my street. Interesting you're deploying local models for iOS too. We pretty much spend all day every day doing that. Just DM'd you
English
0
0
0
75
Joe Kennedy
Joe Kennedy@joekndy·
Mixy looks simple but it’s the hardest thing my cofounder and I have built in 17 years of working together All proprietary models running directly on the phone, on top of a highly performant audio engine. Mixy is built to last and be the only tool for making music you’ll need
English
13
5
169
16.8K
Peter Chamberlain
Peter Chamberlain@Aussie_Pete·
@ReallyEpicTuts @Blackmagic_News This is a known issue with iOS 18 on the iPhone 16. We are in discussion with Apple about this. For now, if you need stabilisation and you see this jump in standard mode, try cinematic mode, being aware that LUTs are not shown in the preview but the recording should be correct.
English
3
0
2
121
Blackmagic Design
Blackmagic Design@Blackmagic_News·
Blackmagic Camera for iOS 2.1 Update! Get support for using new iPhone 16 camera control features with Blackmagic Camera, improved Blackmagic Cloud organisation syncing and new recording bit rate options for H.264 and H.265 codecs. Download now from apps.apple.com/us/app/blackma…
Blackmagic Design tweet media
English
6
15
89
9.3K
Rohit Gupta
Rohit Gupta@rohit_bmd·
@OliCoDev This is a limitation of Windows Media Player with Windows 10. When you render there is an option of compatibility with Windows Media Player on Deliver page.
English
2
0
2
32
OliCo
OliCo@OliCoDev·
I'm not exactly sure what is going on with Davinci Resolve right now, but whenever I update it to the latest version and try to export a simple video, for some reason it feels like the audio and video are both desynced from one another by a quarter of a second.
English
1
0
2
94
Awni Hannun
Awni Hannun@awnihannun·
I'm committing to only use local LLMs for the next few weeks to get a real vibe-check on the gap in perf between closed/server-side and open/local (powered by MLX of course). My favorite tools for that right now are: - The raw terminal (mlx_lm.generate / mlx_lm.chat) - LM Studio
English
35
20
320
28.6K
Ismail Salim
Ismail Salim@IssySalim·
@GavinSBaker @GavinSBaker - What are your thoughts on demand for on-device/local inference as general inference demand grows? (Both in absolute terms and proportionally to server-side)
English
0
0
5
328
Gavin Baker
Gavin Baker@GavinSBaker·
Shifting from a pre-training centric world to an inference centric world is likely positive for compute overall. Intelligence may scale even better with test-time compute (inference) than it does with pre-training per the charts below. The balance of compute always had to move from pre-training to inference to generate an “ROI on AI.” Just going to happen a lot faster than expected. And while shifting to a test-time compute, “inference first” world is probably good for compute demand, this shift does change the type of compute. And this has an impact on who wins and who loses from a supplier perspective. More 50-100 megawatt datacenters geospatially and cost-optimized for inference. More inference “Hondas.” Fewer 1 gigawatt plus datacenters (which can be anywhere) with the networking, storage, and cooling (which enables density which simplifies networking while increasing potential cluster size) technologies necessary for coherence. Less pre-training “Ferraris.” And the number of companies doing pre-training in a “Ferrari” likely steadily shrinks over time. Satya explained this in the clearest way possible in his most recent podcast. All the back and forth about the Cowen note vs. Microsoft IR commentary in Australia is missing the forest for the trees - the CEO literally just told you he was going to shift investments away from pre-training focused compute to inference optimized compute, which he noted was different! Also Grok-3 voice mode is epic.
Gavin Baker tweet media
English
27
49
427
113.4K
Ismail Salim retweetledi
merve
merve@mervenoyann·
I can't wait to try more local models 🤠
English
4
1
15
3.1K
Ismail Salim
Ismail Salim@IssySalim·
@huybery Hey @huybery - Just DM'd you on Twitter about obtaining a commercial license for Qwen2.5-3B. Please let me know if you can help :)
English
0
0
2
80
Binyuan Hui
Binyuan Hui@huybery·
👏🏻Great to see such an outstanding work based on Qwen. Thanks to the Sailor Team for continuously advancing LLM democratization!
Longxu Dou@LongxuDou

🚀 Excited to share our technical report on the Southeast Asian multilingual model Sailor2 and its latest updates! Our 49-page report details Sailor2's development journey, including multilingual data cleaning, small model data mixture simulations, multi-stage continual pre-training, multi-stage post-training, and multi-cultural multi-lingual evaluations. Sailor2 aims to streamline the multilingual model pre-training process efficiently for the community. 🧭 We highlight Sailor2's impressive performance in low-resource language translation scenarios and its cultural understanding advantages in Southeast Asia, promoting practical applications for regional languages. Model updates include: 💡 More precise outputs: Reduced redundancy in model outputs through refined post-training data and optimization techniques. 🌈 Handling longer texts: Expanded to handle up to 128K context length in Southeast Asian languages through long-text training. ⚡️ Faster inference: Achieved 2.5x faster inference speed with speculative decoding. 🌪️ More model sizes: Introduced new sizes of 3B and 14B through model pruning. 🌟 All models are Apache-licensed for commercial use; development tools (code, resources) are open-source. 📚 Technical report: huggingface.co/papers/2502.12… 🤖️ Models: huggingface.co/collections/sa… 💬 Demo: huggingface.co/spaces/sail/Sa… 📣 Sailor2 community: huggingface.co/sailor2

English
7
10
140
11.2K
Ismail Salim
Ismail Salim@IssySalim·
@JustinLin610 Hey @JustinLin610 - Just messaged you on Twitter DM about obtaining a commercial license for Qwen2.5-3B. Please let me know if you can help :)
English
0
0
0
39
Ismail Salim
Ismail Salim@IssySalim·
Snap's announcement about their on-device text-to-image model seems to have slipped under the radar… Apparently, it generates 1024x1024 images with quality that's comparable to cloud-oriented models like Stable Diffusion XL. But it can do that locally on an iPhone 16 Pro Max in <1.5 seconds! 🤯 Snap are planning to ship it soon to their ~450m daily active users, and I wouldn't be surprised if it's free. I wonder how all these subscription-driven, cloud-based image generation apps will respond… Announcement: newsroom.snap.com/ai-text-to-ima… Paper: arxiv.org/abs/2412.09619
Ismail Salim tweet media
English
2
3
7
367
Ismail Salim
Ismail Salim@IssySalim·
@Snap's announcement about their on-device text-to-image model seems to have slipped under the radar… Apparently, it generates 1024x1024 images with quality that's comparable to cloud-oriented models like Stable Diffusion XL. But it can do that locally on an iPhone 16 Pro Max in <1.5 seconds! 🤯 Snap are planning to ship it soon to their ~450m daily active users, and I wouldn't be surprised if it's free. I wonder how all these subscription-driven, cloud-based image generation apps will respond… Announcement: newsroom.snap.com/ai-text-to-ima… Paper: arxiv.org/abs/2412.09619
Ismail Salim tweet media
English
0
0
0
40
Ismail Salim
Ismail Salim@IssySalim·
Short but sweet talk about the WebNN API: youtube.com/watch?v=FoYBWz… Def worth checking out the YouTube playlist from @jason_mayes WebAI Summit last year. It's packed with great talks! Looking forward to the next summit!
YouTube video
YouTube
Ismail Salim tweet media
English
1
2
7
301