Manit chahar
1K posts

Manit chahar
@manitchahar
AI Developer | Tech Enthusiast | Always curious, always learning
Bharat Katılım Temmuz 2017
371 Takip Edilen31 Takipçiler

@thegenioo K2.7 is good haven't tried GLM 5.2 yet. Not available in opencode yet
English

Do not listen to anything I say
Burke Holland@burkeholland
This Fable decision will be reversed inside of 48 hours.
English


@DavidOndrej1 If it's not every one second you are not serious enough
English

I have a VPS just for Fable 5
> it checks the API every 60s
> if Fable is offline, nothing happens
> if online, the program starts
> there's a list of 400+ questions for Fable
> all worded to extract maximum amount of knowledge
> fresh API key with $500 limit on the VPS
> the moment Fable is online, program starts
> 5-10 calls to API in parallel
> dataset will begin populating with Fable answers
> even if they shut it down again, I will at least have this
if you aren't doing this, you aren't serious enough
English

@DavidOndrej1 @fortelabs If it's not every one second you are not serious
English

@fortelabs yes... i even have a 400 question list that checks every 60s if the API for Fable is available
and when it becomes, it will instantly begin creating a dataset of Fable answers to these specific questions
English

@archiexzzz I'm Planning to build Claude Code Style Wrapper to use Sarvam Models. want to see how well they can be used.
English

after few months:
default all your agentic workflows to use the model_name:
"sarvam-1000b/sarvam-1t"
and fallback_model_providers: [anthropic, openai]
Chandra R. Srikanth@chandrarsrikant
Anthropic pulling the plug will push Indian IT firms to adopt model-agnostic architectures, AI fallback plans By @debanganaghosh4 and @shaw_reshab moneycontrol.com/news/business/…
English

people are already fine-tuning Le Chaton Fat
and you're still thinking about Fable... ?
Guillaume Lample @ NeurIPS 2024@GuillaumeLample
English

We're thrilled to announce that we have raised $234M in the first close of our $300M Series B at a $1.5B valuation.
@HCLTech and @BessemerVP have joined us in this round, alongside continued support from @khoslaventures and @peakxvpartners
For countries and companies, sovereign control on the AI stack is no longer an optionality. Sarvam will be the partner of choice for this aspiration. The capital allows us to accelerate our momentum towards this full stack of models, compute, and deployments.
A huge thank you to our customers, partners, investors, and the Sarvam team for your trust and belief in what we are building. We’re just getting started.
Read more: sarvam.ai/announcing-ser…

English

The Municipal IT company of Rio de Janeiro's city government has reportedly gotten early access to Le Chaton Fat, and combined with their learnings from Rio 3.5 Open 397B, has trained "O Gatinho Gordo" which boasts a whopping 1 Quadrillion parameters and has supersaturated all known benchmarks

Alexander Knigge@AlexanderKnigge
oh my god its happening @MistralAI has officially confirmed the upcoming release of Le Chaton Fat - 30T MoE with 256 experts - 1M context window - multimodal and multilingual - outperforms Fable 5 on every benchmark
English


@elie2222 @catalinmpit Most deeplearning frameworks are highly optimised for Cuda
English

@manitchahar @catalinmpit He’s not training though. Also why is apple bad for training?
English

@elie2222 @catalinmpit For inferencing yes.. for training nope
English

@catalinmpit isnt the mac way better for llms?
because of vram
English
Manit chahar retweetledi

@IndianTechGuide Google makes more money from windows then Microsoft
English









