BigScience Large Model Training

129 posts

BigScience Large Model Training banner
BigScience Large Model Training

BigScience Large Model Training

@BigScienceLLM

Follow the training of "BLOOM 🌸", the @BigScienceW multilingual 176B parameter open-science open-access language model, a research tool for the AI community.

JeanZay supercomputer (France) Beigetreten Mart 2022
1 Folgt8.4K Follower
BigScience Large Model Training retweetet
clem 🤗
clem 🤗@ClementDelangue·
The Bloom paper is out. Looks like it's doing worse than current GPT3 API in zero-shot generation tasks in English but better than other open-source LLMs & better than all in zs multi-lingual (which was the main goal). Proud of the work from the community! arxiv.org/abs/2211.05100
clem 🤗 tweet media
English
11
105
593
0
bparlan 🦇🔊
bparlan 🦇🔊@bparlan·
Dear @BigScienceLLM, Not sure whether it is too late and my eyes need rest, or you do not include the Turkish language in databases at all? #languages" target="_blank" rel="nofollow noopener">bigscience.huggingface.co/blog/building-…
English
1
0
0
0
BigScience Large Model Training retweetet
Saulnier Lucile
Saulnier Lucile@LucileSaulnier·
🌸@BigscienceW BLOOM's intermediate checkpoints have already shown some very cool capabilities! What's great about BLOOM is that you can ask it to generate the rest of a text - and this even if it is not yet fully trained yet! 👶 🧵 A thread with some examples
Saulnier Lucile tweet media
Saulnier Lucile@LucileSaulnier

A milestone soon to be reached 🚀💫 Can't wait to see the capabilities and performance of this long-awaited checkpoint! What about you? Have you already prepared some prompts that you want to test? ✏️

English
5
25
146
0
BigScience Large Model Training
BigScience Large Model Training@BigScienceLLM·
For 111 days, we've enjoyed world-class hardware stability and throughput thanks to the hard work of our friends at @Genci_fr, @INS2I_CNRS, Megatron & DeepSpeed. Having reached our objective earlier than expected, we'll keep training for a few more days. Stay tuned, more soon ;)
English
3
25
305
0