abu

1.1K posts

abu

@aqaderb

model performance @basetenco

Katılım Temmuz 2017

871 Takip Edilen687 Takipçiler

Sabitlenmiş Tweet

abu@aqaderb·1 Oca

I have a single resolution this year. Every year, I experience this palpable feeling of a fresh slate. All the baggage from the past year, dropped. My goal is to feel that every morning this year. To remind myself that every second alive is an opportunity to change direction.

English

118

abu retweetledi

Luke Drago@luke_drago_·13 Mar

some of the rejected (but still beautiful) alternative choices:

English

975

abu@aqaderb·6 Mar

@fanjiewang you guys rock

English

142

Frank Wang@fanjiewang·6 Mar

been working with these guys since the early days of Zen always delivered 100+ TPS and sub-second TTFT across various models we’ve collab'ed on

Baseten@baseten

We've launched the fastest GLM 5 API available at 190 TPS and 0.79 sec TTFT with the Baseten Inference Stack. Ready for your coding and agentic workflows. baseten.co/blog/how-we-bu…

English

204

13.9K

abu retweetledi

Bobson🌱Dugnutt@f_a_infinityy·26 Şub

Please, just call me dawg. Big dawg was my father

English

4.4K

50.7K

953.9K

abu@aqaderb·23 Şub

@philipkiely so frickin cool, kudos philip!

English

251

Philip Kiely@philipkiely·23 Şub

Inference Engineering launches today. baseten.com/inference-engi…

English

187

216

2.2K

1.3M

abu@aqaderb·25 Oca

ZXX

176

abu retweetledi

Will Reed@willreed·23 Oca

b10’s inference infrastructure powers the most sophisticated app companies serving specialized models at scale. there are dozens of these companies today and there will be thousands in the future. incredible momentum + world-class talent density. and, it’s still so early…

Baseten@baseten

We’re thrilled to announce that we have raised $300M at a $5B valuation. The round is led by IVP and CapitalG, both doubling down on their investment in Baseten, and joined by 01A, Altimeter, Battery Ventures, BOND, BoxGroup, Blackbird Ventures, Conviction, Greylock, and NVIDIA. Read more here: baseten.co/blog/announcin…

English

5.3K

abu retweetledi

Daniel Litt@littmath·12 Oca

IMO it should be considered quite rude in most contexts to post or send someone a wall of 100% AI-generated text. “Here, read this thing I didn’t care enough about to express myself.”

English

176

691

9.2K

765.4K

abu@aqaderb·28 Ara

@ArtificialAnlys try it here baseten.co/library/glm-4-…

English

238

abu@aqaderb·28 Ara

happy holidays! we just dropped the fastest GLM 4.7: 400+ TPS as benchmarked by @ArtificialAnlys

English

5.4K

abu@aqaderb·15 Ara

wander and wonder

English

146

abu@aqaderb·11 Ara

they're really so good. @saltyph and i spend about 10 mins of every 1:1 raving about them and imagining all the magical experiences we are going to build together.

Baseten@baseten

Today we’re welcoming the @parsedlabs team to Baseten! With their RL and post-training expertise, Baseten is enabling companies to own their intelligence by unifying training and inference. Read more about it from the Parsed founders ⤵️ baseten.co/blog/parsed-ba…

English

883

abu retweetledi

Baseten@baseten·10 Ara

English

22.3K

abu retweetledi

Charlie O'Neill@oneill_c·10 Ara

@parsedlabs has been acquired by @baseten. Big Token wants you to believe the future is a monoculture: one model to rule everything, one bill to pay forever. Rent the demigod, trust that next month's update will finally solve your problem, and pray that GPT-(n+1) happens to have your exact behaviour covered in its pre-training data and RL environments. It's a convenient story for their bottom lines. We took the other side of that bet. A year ago, @mudithj , @maxkirkby and I were three Australians with a thesis that felt unfashionable: for the real, task-specific work, a specialised model will win. There are learning signals from the real world that you can't distil into a prompt. We thought the frontier was flattening and that the next wave of value would come from depth, not scale. Models that actually understand (and remember!) the job they're hired to do, trained on real feedback, owned by the people who use them. Democratised Token, rather than Big Token. We were right about the thesis. We were wrong about how hard inference is. Every customer deployment taught us the same lesson: training a model that outperforms GPT-n is only half the problem. The other half is running it fast, reliably, and at scale. Baseten was the only one that could actually do it. Eventually the question became obvious: why are we pretending this is two companies? To @tuhinone, Phil, @amiruci, Pankaj, and the entire Baseten team: you've quietly become one of the most important companies in AI, and we're thrilled to help make that louder. To the Parsed team: you are the most technically brilliant and genuinely good group of people I've worked with. Every late night, every commute to work (about 15 ft from our rooms to our desks), every argument about loss functions and Apple Monitors at 2am, it all mattered. To Julia and Ash from LocalGlobe: thank you for believing in our contrarian vision when it was challenged the most. To Max, this is our latest adventure, and none of this would be possible without the technical brilliance and hardened experience you've developed over the time that I've known you. And finally to Mudith: you are the best CEO and most authentic leader I could have picked to found this company with. I'm lucky to have built this with you both. We set out to help models touch grass. Now we get to do it with more compute, a bigger team, and the best inference stack in the world. Time to land some clean shots on the crowned giants of San Francisco.

English

1.5K

abu@aqaderb·11 Kas

@ZachyQ stud

English

Zachy Qader@ZachyQ·4 Nis

Old film #3

English

312

abu retweetledi

Stev@LoschbourStev·30 Eki

ZXX

2.1K

78.2K

11.4M

abu@aqaderb·25 Eyl

@runwayml open source this

English

260

Runway@runwayml·24 Eyl

Today we're sharing our first research work exploring diffusion for language models: Autoregressive-to-Diffusion Vision Language Models We develop a state-of-the-art diffusion vision language model, Autoregressive-to-Diffusion (A2D), by adapting an existing autoregressive vision language model for parallel diffusion decoding. Our approach makes it easy to unlock the speed-quality trade-off of diffusion language models without training from scratch, by leveraging existing pre-trained autoregressive models.

English

410

111.6K

abu@aqaderb·5 Eyl

@vim_dzl u not

English

Vim@vim_dzl·5 Eyl

Biggest fear about TLPM is that I'm doing both jobs poorly

English

201

abu retweetledi

Gaurav Misra@gmharhar·4 Eyl

We're changing our name. Captions is now Mirage. Today's most popular medium is short-form video. It's the fastest-growing format, it’s what people want, and it’s what brands need to grow. But unlike other forms of AI content like text or image, which can be generated and ready-to-use with a single prompt, generated video still requires a lot of manual work and editing to be ready to share. This is where the real opportunity lies. The way we see it, the real race for AI video hasn’t even begun. The world is waiting for a foundation model for video that can take an idea and produce a fully usable, cohesive output from scratch. So that’s what we’re building. We’re rebranding to Mirage to reflect our expanded vision for video, starting with the short-form format, through frontier AI research and models. Check it out at mirage dot app

English

105

439

134.8K

abu@aqaderb·8 Ağu

@holynski_ holy

English

510

Aleksander Holynski@holynski_·8 Ağu

Something fun we discovered: you can use #Genie3 to step into and explore your favorite paintings. Here's a short visit to Edward Hopper's "Nighthawks".

English

1.2K

11.2K

9.7M

abu@aqaderb·6 Ağu

@DennisHXu @tuhinone @OpenAI 200+!

Dennis Xu@DennisHXu·6 Ağu

@tuhinone @OpenAI congrats man, how many tokens per second are you guys running it at?

English

254

Tuhin Srivastava@tuhinone·6 Ağu

We're very excited to be an @OpenAI launch partner for GPT OSS. Today's a big day for open models, and we have day 0 support for GPT OSS 120b via our Model APIs: baseten.co/library/gpt-os… We'll be rolling out more performance optimizations and benchmarks over the coming hours and days, so stay tuned -- and congrats to the OpenAI team on the launch!

English

17.5K

Keşfet

@fanjiewang @philipkiely @ArtificialAnlys @saltyph @parsedlabs @baseten @mudithj @maxkirkby