Landan Seguin

14 posts

@landanjs

Santa Clara, CA · Joined March 2018

129 Following · 75 Followers

Landan Seguin @landanjs · Jul 26

@maxisawesome538 👋❤️👋

Max ⛅ @maxisawesome538 · Jul 26

I'm starting a surfing group chat. Any surfers on my TL? reply to be included

Landan Seguin retweeted

jasmine collins @jazco · Jun 12

today we're announcing our @DbrxMosaicAI x @Shutterstock partnership, and a new text-to-image diffusion model: ✨ImageAI!!✨ this model is geared towards enterprise use cases and is trained exclusively on shutterstock's trusted data catalog! databricks.com/company/newsro…

Landan Seguin @landanjs · Jan 26

@ilyas121_real @summerlinARK @Replit @amasad @MosaicML Our SD training estimates are in the blog post below: $160k in 2 weeks. Next, we will train the thing :) watch out for it! twitter.com/MosaicML/statu…

Databricks AI Research @DbrxMosaicAI

How much does it take to train a Stable Diffusion model from scratch? The answer: 79,000 A100-hours in 13 days, for a total training cost of <$160k. Our tooling reduces the time and cost to train by 2.5x, and is also extensible and simple to use. mosaicml.com/blog/training-…

Ilyas @ilyasbuilds · Jan 26

@summerlinARK @Replit @amasad @MosaicML When you say train SD for 160k is that from scratch? Has it really come down from 600k in just 5ish months?

Will Summerlin @WSummerlinAI · Jan 26

.@Replit & @amasad are writing the playbook on building an AI-first company. With @MosaicML, the cost to train your own model is de minimis. I.e., you can train Stable Diffusion on MosaicML for $160k. Other reasons to train your own model: inference cost/speed, data privacy, etc.

Amjad Masad @amasad

“To paraphrase Alan Kay, perhaps people who are really serious about product should make their own models.”

Landan Seguin retweeted

Databricks AI Research @DbrxMosaicAI · Nov 17

Just in time for Thanksgiving - we're dropping a new batch of recipes for training image segmentation models. Reduce time-to-train by up to 5.4x, improve quality by up to +4.6 mIoU, and impress everyone at your #efficientML potluck! mosaicml.com/blog/mosaic-im…

Landan Seguin retweeted

Databricks AI Research @DbrxMosaicAI · Aug 3

New blog post: mosaicml.com/blog/behind-th… We're setting an up-to-date baseline for semantic segmentation model training: 45.56 mIoU on the ADE20k benchmark in 3.5 hours using 8x NVIDIA A100 GPUs. Next step: develop and release #EfficientML recipes to speed it up!

Landan Seguin retweeted

Davis Blalock @davisblalock · May 5

Having trouble keeping up with arXiv? 🎉 Announcing "Davis Summarizes Papers" 🎉 tl;dr: People kept telling me I should make the ~15 paper summaries I do each week into a newsletter, so I did: dblalock.substack.com Free forever, and you can also read all past posts as a blog

Landan Seguin retweeted

Databricks AI Research @DbrxMosaicAI · Mar 16

We've shared great research before, but reproducing methods from papers is hard. Announcing Composer, our library of ML speedups: github.com/mosaicml/compo…. Train CV models ~4x faster and NLP models ~2x faster at the same accuracy -- with minimal tuning. (1/5)

Landan Seguin retweeted

Jonathan Frankle @jefrankle · Mar 16

TLDR: Announcing 🌟COMPOSER🌟, a PyTorch trainer for efficient training *algorithmically*. Train 2x-4x faster on standard ML tasks, a taste of what's coming from @MosaicML. Star it, 𝚙𝚒𝚙 𝚒𝚗𝚜𝚝𝚊𝚕𝚕 𝚖𝚘𝚜𝚊𝚒𝚌𝚖𝚕, contribute, be efficient! github.com/mosaicml/compo… Thread:

Landan Seguin @landanjs · Feb 9

@borisdayma Nice! At @MosaicML, we’ve seen significant differences in perplexity when training a GPT-2 model on OpenWebText with vs without dropout! Look forward to seeing your results 😁

Boris Dayma 🖍️ @borisdayma · Feb 9

Do you still use regularization even when your dataset is huge? I was still using dropout while my dataset won't get processed more than 2-3 epochs (I also have a large batch size). Let's see what happens🤞

Landan Seguin @landanjs · Oct 1

@johncarlosbaez I found this talk (and many others) by Alan Watts enlightening youtu.be/mMRrCYPxD0I (start at 1:26, ignore the cheesy music and video) "So after you are dead, the only thing that can happen is the same experience, or the same sort of experience as before you were born"

Landan Seguin retweeted

MIT CSAIL @MIT_CSAIL · Apr 10

Left: MIT computer scientist Katie Bouman w/stacks of hard drives of black hole image data. Right: MIT computer scientist Margaret Hamilton w/the code she wrote that helped put a man on the moon. (image credit @floragraham) #EHTblackhole #BlackHoleDay #BlackHole

Landan Seguin @landanjs · Sep 10

blog.openai.com/openai-scholar…

Landan Seguin retweeted

Mason @webdevMason · May 5

Kinda crazy how much more advice there is on persuading people than on being persuaded. Not just when to buy an argument, but also how to get it to percolate through your models of the world & actually behave as though you'd bought it.

Landan Seguin retweeted

Andrej Karpathy @karpathy · Mar 29

@michael_nielsen @TheAtlantic It's frustrating how thinking feels like exploring a large idea cave serially with a tiny flashlight. It's a bit shocking how underdeveloped our tooling is in surpassing limitations of thinking / short term memory. Pen & paper was a good first step, haven't taken too many since.
