Samira Daruki

246 posts

Samira Daruki

@SamiraDaruki

Learning and Training Gemini ♊, PreTraining 🤝 RL PostTraining, Science of Scaling, Model Design, Compute 🤝 Intelligence 🤝 Efficiency.

San Francisco, CA Katılım Mayıs 2011

907 Takip Edilen227 Takipçiler

Sabitlenmiş Tweet

Samira Daruki@SamiraDaruki·7 Ara

What a year @GoogleAI (Dec 2022-Dec 2023)🚀Working with an amazing team all over the globe has been a highlight, impressed with how Gemini was built as a startup within Google. Been a unique rewarding experience with tons of learning along the journey. Another step forward in AI.

Jeff Dean@JeffDean

I’m very excited to share our work on Gemini today! Gemini is a family of multimodal models that demonstrate really strong capabilities across the image, audio, video, and text domains. Our most-capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks, including 10 of 12 popular text and reasoning benchmarks, 9 of 9 image understanding benchmarks, 6 of 6 video understanding benchmarks, and 5 of 5 speech recognition and speech translation benchmarks. Gemini Ultra is the first model to achieve human-expert performance on MMLU across 57 subjects with a score above 90%. It also achieves a new state-of-the-art score of 62.4% on the new MMMU multimodal reasoning benchmark, outperforming the previous best model by more than 5 percentage points. Gemini was built by an awesome team of people from @GoogleDeepMind, @GoogleResearch, and elsewhere at @Google, and is one of the largest science and engineering efforts we’ve ever undertaken. As one of the two overall technical leads of the Gemini effort, along with my colleague @OriolVinyalsML, I am incredibly proud of the whole team, and we’re so excited to be sharing our work with you today! There’s quite a lot of different material about Gemini available, starting with: Main blog post: blog.google/technology/ai/… 60-page technical report authored by th Gemini Team: deepmind.google/gemini/gemini_… In this thread, I’ll walk you through some of the highlights.

English

2.5K

Samira Daruki@SamiraDaruki·4h

A team with deep expertise in LLM pretraining, data, and multimodal understanding, building @ElorianAI Lab to push the frontier of multimodal intelligence and reasoning. 🚀

Andrew M. Dai@AndrewDai

After almost 12 years in Brain/DeepMind, I’ve finally decided to take the leap. My cofounders: @yinfeiy, Seth and I have kicked-off @ElorianAI. The first multimodal reasoning lab founded and led by former LLM pretraining, data and multimodal leads. youtu.be/YlvfNpOMeOY?si… (1/n)

English

133

Samira Daruki@SamiraDaruki·5h

@AndrewDai @yinfeiy @ElorianAI Many Congrats @AndrewDai ! It was sad to see you leave Gemini, but super excited for the future work by ElorianAI team in advancing the MultiModal reasoning space! 🚀

English

Andrew M. Dai@AndrewDai·1d

YouTube

English

776

314.4K

Samira Daruki retweetledi

Senator Scott Wiener@Scott_Wiener·21 Mar

WE JUST RE-LAUNCHED THE BAY LIGHTS! San Francisco’s light is now even brighter. Truly the best city on the planet. Happy Bay Lights!

English

126

199

3.4K

260.6K

Samira Daruki retweetledi

Sebastian Raschka@rasbt·15 Mar

I (finally) put together a new LLM Architecture Gallery that collects the architecture figures all in one place! sebastianraschka.com/llm-architectu…

English

202

1.4K

8.2K

719.2K

Samira Daruki@SamiraDaruki·5 Mar

"Watch out for performative sacrifice and don’t confuse pain with progress."

Toby Pohlen@TobyPhln

At 1:30 a.m. PT on November 3, 2023 Elon sent a message to the xAI group chat saying that we need to go “extremely hardcore” for the next 36 hours; Grok will be released publicly tomorrow. You didn’t have to be in the exclusive company chat to get the message; it was also posted publicly at the same time: x.com/i/status/17203… What unfolded over the next day and a half was one of the best examples of engineering at pace that I’ve ever seen. All we had when we started was a somewhat fine-tuned base model and a half-baked UI. Our team of ten split up the tasks: curate data, improve the model, implement the raw prompting and RAG service, build the production infra. I took care of the latter. At 8:51 p.m. PT the next day, we announced Grok to the world with a long-form post on X (x.com/xai/status/172…). Over the past 36 hours, we came up with Fun mode (including Grok’s sunglasses), finished the whole production system, and most importantly tuned the RAG system that gave it real-time knowledge of the world through the X platform (a first in the industry). A day and a half of straight coding and shipping; no drugs, not even caffeine, just pure adrenaline. Elon gave us a mission and we delivered. The launch went very well. We invited a couple hundred X creators and Grok’s ability to roast accounts went viral. It was the first time a publicly accessible AI was allowed to poke fun at people. This episode is a prime example of what you can achieve by going extremely hardcore: you move and deliver results faster than any outsider could have anticipated. Within 36 hours, we took the company from silence to relevance. It was well worth it. xAI’s hardcore culture is infamous on X. I love the tent meme that suggests we all sleep (well, slept in my case) in the office in tents. Our reputation precedes us and even new joiners hit the ground grinding hard. However, unless you understand the “why,” you are at risk of simply replicating the “how” without achieving the same results. You need to grind with purpose and the purpose is to move fast towards a known goal. When the goal and the means of reaching it are crystal clear, a small, skilled, and highly motivated team can outcompete companies old and new, big and small. Never grind to show off; never work late to be seen; never sacrifice without cause. There is no medal for the one who tried extremely hard but failed. There is only a medal for the winner. If all your efforts lead nowhere, you’re arguably not very productive. Always keep your eyes firmly on the goal, do everything to reach it as quickly as possible, and make sure you're on track to win. A hardcore engineering culture is one of the most effective ways of accelerating real progress. Watch out for performative sacrifice and don’t confuse pain with progress.

English

147

Samira Daruki retweetledi

Elon Musk@elonmusk·1 Mar

Another one bites the dust

English

20.9K

33.9K

456.2K

117.5M

Samira Daruki retweetledi

Nikita Bier@nikitabier·1 Mar

Today was the biggest day on 𝕏 in history.

English

6.1K

4.8K

65.3K

90.4M

Samira Daruki@SamiraDaruki·26 Şub

@hyhieu226 @OpenAI @xai Take care and all the best wishes, Hieu!

English

Hieu Pham@hyhieu226·26 Şub

I have made the difficult decision to leave @OpenAI. Working here and at @xai before was a once-in-a-lifetime experience. I have met the best people. Not the best people in AI. Not the best people in tech. Simply the best people. At these companies, I have helped creating extremely intelligent entities that will meaningfully improve our lives. The work makes me proud. But the intensive work came with a price. I cannot believe I would say this one day, but I am burnt out. All the mental health deteriorating that I used to scoff at is real, miserable, scary, and dangerous. I am going to take a break from frontier AI labs, and will take my family to my home country Vietnam. There, I will try something new, and also search for a cure for my conditions. I hope I will heal. Until then.

English

1.1K

409

14K

1.2M

Samira Daruki@SamiraDaruki·21 Şub

Gemini 3.1 Pro 🚀🥇

Design Arena@Designarena

BREAKING: Gemini 3.1 Pro Preview has landed in #1 on SVG Arena by Design Arena with an ELO of 1421 This 87-point lead the largest winning margin that we've seen a model have on SVG Arena since the arena launch Huge congratulations to the @GoogleDeepMind team!

Indonesia

146

Samira Daruki retweetledi

Oriol Vinyals@OriolVinyalsML·19 Şub

Gemini 3.1 Pro has landed! Amazing performance / capabilities across the board. Beyond SOTA, the best are all the things that evals can't measure. E.g. SVG has gotten so much better (see 🧵) blog.google/innovation-and…

English

412

57.4K

Samira Daruki retweetledi

Fangyu Liu@hardy_qr·19 Şub

Gemini 3. Point 1. A small first step towards something truly exciting and ambitious. It is the most rewarding experience in the world to turn interesting research ideas into shaping the frontier. They were toy experiments in the lab just a few months ago, now it's in your hand!

Noam Shazeer@NoamShazeer

Last week we upgraded Gemini 3 Deep Think. Today, we’re shipping the core intelligence that makes those breakthroughs possible: Gemini 3.1 Pro. A noticeably smarter, more capable baseline for your hardest challenges. Available now: blog.google/innovation-and…

English

894

Samira Daruki@SamiraDaruki·15 Şub

@OriolVinyalsML Welcome back! 🚀 Please come to the Gemini SF office too 🌉

English

330

Oriol Vinyals@OriolVinyalsML·14 Şub

Personal update: After an amazing 10 years in London, it's time for a major change. One-way ticket back to California 🌞! I'm incredibly excited to return to the Bay Area to continue building Gemini and pushing us toward the age of AGI 🚀

English

1.5K

167.1K

Samira Daruki retweetledi

Google DeepMind@GoogleDeepMind·11 Şub

How could AI act as a better research collaborator? 🧑‍🔬 In two new papers with @GoogleResearch, we show how Gemini Deep Think uses agentic workflows to help solve research-level problems in mathematics, physics, and computer science. More → goo.gle/4aGs3Pz

English

100

284

424.1K

Samira Daruki@SamiraDaruki·12 Şub

🧠🧠🧠🚀

Oriol Vinyals@OriolVinyalsML

An updated & faster Gemini 3 Deep Think is taking off! 🚀 Our smartest mode to date!™️ PhD-level reasoning to the most rigorous STEM challenges (models' gotta think harder). Gold medal-level results on Physics & Chemistry Olympiads. 🧪💻 Full details: bit.ly/4kzBLqq

ART

Samira Daruki retweetledi

Noam Shazeer@NoamShazeer·12 Şub

An updated Gemini 3 Deep Think is out today: 📈 Achieves SOTA on ARC-AGI-2, MMMU-Pro, and HLE. 🥇Gold-medal level on Physics & Chemistry Olympiads. It turns out the best way to solve hard problems is still to think about them. Read more: bit.ly/4kzBLqq

English

117

1.2K

109.7K

Samira Daruki retweetledi

Jerry Tworek@MillionInt·8 Şub

martin_casado@martin_casado

Actually, the primary advantage is that they can raise more money than their entire downstream startup third party ecosystem. And they can put that money directly to use buying data, and compute.

ZXX

345

27.4K

Samira Daruki@SamiraDaruki·8 Şub

@Yihe__Deng @WeiWang1973 @kaiwei_chang @baharanm @adityagrover_ Congratulations Yihe!

Filipino

257

Yihe Deng@Yihe__Deng·8 Şub

Finished my PhD defense this week! Immensely grateful to my advisor @WeiWang1973 and committee @kaiwei_chang @baharanm @adityagrover_ for their guidance and support over these years 🙏

English

676

30.7K

Samira Daruki retweetledi

Quoc Le@quocleix·2 Şub

Excited to share our latest work: "Semi-Autonomous Mathematics Discovery with Gemini." We used Gemini to systematically evaluate 700 "open" conjectures in the Erdős Problems database. The result? We addressed 13 problems marked as open—finding 5 novel autonomous solutions and identifying 8 existing solutions missed by previous literature. Read the full case study here: arxiv.org/abs/2601.22401

English

209

1.3K

246.3K

Samira Daruki@SamiraDaruki·30 Oca

@shaneguML We are in this together! :)

English

Shane Gu@shaneguML·30 Oca

Once you swag, you swag for the rest of your life. Saga of any AI PhD graduates of my era, however many years of industry experience you get.

English

3.7K

Samira Daruki@SamiraDaruki·21 Oca

@noahdgoodman @ericzelikman @TheAndiPenguin @gharik @YuchenHe07 Congrats Noah! You are missed here in Gemini but looking forward to what the humans& team will build!

English

noahdgoodman@noahdgoodman·20 Oca

I’ve co-founded humans& with @ericzelikman @TheAndiPenguin @gharik & @YuchenHe07 — a new AI lab working to bring people together, empower the better angels of our nature, and create the most optimistic future. At a time of tumult and danger, humanity needs a humanist AI.

humans&@humansand

Today we introduce humans&, a human-centric frontier AI lab. We believe AI can be reimagined, centering around people and their relationships with each other. At its best, AI should serve as a deeper connective tissue that strengthens organizations and communities

English

344

57.1K

Keşfet

@ElorianAI @AndrewDai @yinfeiy @hyhieu226 @OpenAI @xai @OriolVinyalsML @GoogleResearch