AndreasFischer1985

638 posts

AndreasFischer1985 banner
AndreasFischer1985

AndreasFischer1985

@AFischer1985

@[email protected] Data Scientist & Bildungsforscher. Privat aktiv für die Bundesstelle für Open Data - und privat hier 👨🏻‍💻

Nürnberg Katılım Mart 2022
136 Takip Edilen154 Takipçiler
AndreasFischer1985 retweetledi
VAGO solutions
VAGO solutions@VAGOsolutions·
📢 What a week for open-source AI! @AIatMeta Llama-3.1-8b-instruct impressed with its German skills. Today, we're launching Llama-3.1-SauerkrautLM-8b-Instruct! Built on our Sauerkraut Dataset V2 🔗 Details: huggingface.co/VAGOsolutions/…
VAGO solutions tweet mediaVAGO solutions tweet mediaVAGO solutions tweet mediaVAGO solutions tweet media
English
2
5
12
364
AndreasFischer1985 retweetledi
AI at Meta
AI at Meta@AIatMeta·
Starting today, open source is leading the way. Introducing Llama 3.1: Our most capable models yet. Today we’re releasing a collection of new Llama 3.1 models including our long awaited 405B. These models deliver improved reasoning capabilities, a larger 128K token context window and improved support for 8 languages among other improvements. Llama 3.1 405B rivals leading closed source models on state-of-the-art capabilities across a range of tasks in general knowledge, steerability, math, tool use and multilingual translation. The models are available to download now directly from Meta or @huggingface. With today’s release the ecosystem is also ready to go with 25+ partners rolling out our latest models — including @awscloud, @nvidia, @databricks, @groqinc, @dell, @azure and @googlecloud ready on day one. More details in the full announcement ➡️ go.fb.me/tpuhb6 Download Llama 3.1 models ➡️ go.fb.me/vq04tr With these releases we’re setting the stage for unprecedented new opportunities and we can’t wait to see the innovation our newest models will unlock across all levels of the AI community.
English
261
1.4K
5.6K
1.3M
AndreasFischer1985 retweetledi
Arena.ai
Arena.ai@arena·
Chatbot Arena Update! 1. Multilingual Arena -- four new languages (German, Spanish, Russian, Japanese). GPT-4o is #1 in English, German, and Spanish. Gemini-1.5-Pro is #1 in Japanese, Chinese, and French. Claude-3 Opus is #1 in Russian. The competition is tight, and we need more votes 🗳️ to confidently rank them. Let's challenge LLMs in any language! 2. Yi-1.5-34B-Chat shows impressive performance, matching larger models like Qwen-1.5-110B and GPT-4-0613. Congrats @01AI_Yi on this milestone! 3. Phi-3 Medium and Small are finally on the board! Medium (14B) ranks near GPT-3.5-Turbo-0613, Small (7B) ranks ~Llama-2-70B. We also see robust performance in Hard Prompts. Congrats @Microsoft Phi team on these great models for the community! Learn more - Full leaderboard leaderboard.lmsys.org - Chat & vote at chat.lmsys.org
Arena.ai tweet mediaArena.ai tweet mediaArena.ai tweet media
English
18
49
354
79K
AndreasFischer1985
AndreasFischer1985@AFischer1985·
Wir haben im Laufe der letzten Jahre im Projekt #KIPerWeb eine lebhafte, kompetente und offene Austauschrunde zur Nutzung und Entwicklung von KI-gestützen Webanwendungen etabliert, die wir nun ehrenamtlich weiterführen werden. Interesse mitzumachen? 👉 #cop-kiperweb" target="_blank" rel="nofollow noopener">github.com/AndreasFischer…
Deutsch
0
1
2
94
AndreasFischer1985 retweetledi
Philipp Schmid
Philipp Schmid@_philschmid·
Llama 3 released! 🚨🔔@AIatMeta just released their best open LLM! 👑🚀 Llama 3 is the next iteration of Llama with a ~10% relative improvement to its predecessor! 🤯 Llama 3 comes in 2 different sizes 8B and 70B with a new extended tokenizer and commercially permissive license! ✅ Blog: huggingface.co/blog/llama3 Models: huggingface.co/models?other=l… New and improvements to v2✨: 🔠 Trained on 15T Tokens & fine-tuned on 10M human annotated samples 🧮 8B & 70B versions as Instruct and Base 🚀 Llama 3 70B best open LLM on MMLU (> 80 🤯) 🧑🏻‍💻 Instruct good at coding 8B with 62.2 and 70B 81.7 on Human Eval ✍🏻 Tiktoken-based tokenizer with a 128k vocabulary 🪟 8192 default context window (can be increased) 🧠 Used SFT, PPO & DPO for alignment. 💰Commercial use allowed ✅ 🤗 Available on @Hugging Face 🤝 1-click deployments on Hugging Face, Amazon SageMaker, Google Cloud 🔜 more model sizes & enhanced performance Massive kudos to Meta for continuing its commitment to open AI. Honored to partner with Joe and team! 🤗 The gap is melting. 🧊
Philipp Schmid tweet media
English
6
60
252
26.9K
AndreasFischer1985 retweetledi
Devendra Chaplot
Devendra Chaplot@dchaplot·
We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1: - Free to use under Apache 2.0 license - Outperforms all open models - Native function calling - Masters English, French, Italian, German and Spanish. - Seq_len = 64K mistral.ai/news/mixtral-8…
Devendra Chaplot tweet media
English
26
176
1K
152.2K
AndreasFischer1985 retweetledi
WizardLM
WizardLM@WizardLM_AI·
🧙‍♀️ WizardLM-2 8x22B is our most advanced model, and just slightly falling behind GPT-4-1106-preview. 🧙 WizardLM-2 70B reaches top-tier capabilities in the same size. 🧙‍♀️ WizardLM-2 7B even achieves comparable performance with existing 10x larger opensource leading models. The model weights of WizardLM-2 8x22B and WizardLM-2 7B are shared on Huggingface, and WizardLM-2 70B and the demo of all the models will be available in the coming days. huggingface.co/collections/mi…
WizardLM tweet media
English
4
22
144
44.4K
AndreasFischer1985 retweetledi
Philipp Schmid
Philipp Schmid@_philschmid·
We can do it! 🙌 First open LLM outperforms @OpenAI GPT-4 (March) on MT-Bench. WizardLM 2 is a fine-tuned and preferences-trained Mixtral 8x22B! 🤯 TL;DR; 🧮 Mixtral 8x22B based (141B-A40 MoE) 🔓 Apache 2.0 license 🤖 First > 9.00 on MT-Bench with an open LLM 🧬 Used multi-step synthetic data pipeline including Evol-instruct 🔄 data partitions and stage-by-stage training 👨‍🔬 Used SFT → DPO → PPO Blog: wizardlm.github.io/WizardLM2/ Model: huggingface.co/microsoft/Wiza… Paper: coming soon
Philipp Schmid tweet media
English
12
75
370
55.6K
AndreasFischer1985 retweetledi
clem 🤗
clem 🤗@ClementDelangue·
The new @MistralAI is now #1 on the openLLM leaderboard. Apache 2.0 license too! 🔥🔥🔥
clem 🤗 tweet media
English
11
66
433
77.7K
AndreasFischer1985 retweetledi
Edward Snowden
Edward Snowden@Snowden·
OpenAI confessing 𝐨𝐧 𝐭𝐡𝐞𝐢𝐫 𝐨𝐰𝐧 𝐛𝐥𝐨𝐠 to a belief that "as we get closer to building AI, it will make sense to start being less open... but it's totally OK to not share the science..." is about as bad of a heel-turn as it gets.
English
176
2.2K
13K
579.6K
AndreasFischer1985
AndreasFischer1985@AFischer1985·
Heute ist ein Beitrag von mir und Jens Dörpinghaus vom @BIBB_de in der Fachzeitschrift Knowledge erschienen! 🥳🎉🥂 Titel; „Web Mining of Online Resources for German Labor Market Research and Education: Finding the Ground Truth?“ 😎
AndreasFischer1985 tweet media
Deutsch
1
0
2
167