Samira Daruki

246 posts

@SamiraDaruki

Learning and Training Gemini ♊, PreTraining 🤝 RL PostTraining, Science of Scaling, Model Design, Compute 🤝 Intelligence 🤝 Efficiency.

San Francisco, CA Joined May 2011
907 Following 227 Followers
Pinned Tweet
Samira Daruki
Samira Daruki@SamiraDaruki·
What a year @GoogleAI (Dec 2022-Dec 2023)🚀Working with an amazing team all over the globe has been a highlight, impressed with how Gemini was built as a startup within Google. Been a unique rewarding experience with tons of learning along the journey. Another step forward in AI.
Jeff Dean@JeffDean

I’m very excited to share our work on Gemini today! Gemini is a family of multimodal models that demonstrate really strong capabilities across the image, audio, video, and text domains. Our most-capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks, including 10 of 12 popular text and reasoning benchmarks, 9 of 9 image understanding benchmarks, 6 of 6 video understanding benchmarks, and 5 of 5 speech recognition and speech translation benchmarks. Gemini Ultra is the first model to achieve human-expert performance on MMLU across 57 subjects with a score above 90%. It also achieves a new state-of-the-art score of 62.4% on the new MMMU multimodal reasoning benchmark, outperforming the previous best model by more than 5 percentage points. Gemini was built by an awesome team of people from @GoogleDeepMind, @GoogleResearch, and elsewhere at @Google, and is one of the largest science and engineering efforts we’ve ever undertaken. As one of the two overall technical leads of the Gemini effort, along with my colleague @OriolVinyalsML, I am incredibly proud of the whole team, and we’re so excited to be sharing our work with you today! There’s quite a lot of different material about Gemini available, starting with: Main blog post: blog.google/technology/ai/… 60-page technical report authored by the Gemini Team: deepmind.google/gemini/gemini_… In this thread, I’ll walk you through some of the highlights.

English
1
1
8
2.5K
Samira Daruki
Samira Daruki@SamiraDaruki·
A team with deep expertise in LLM pretraining, data, and multimodal understanding, building @ElorianAI Lab to push the frontier of multimodal intelligence and reasoning. 🚀
Andrew M. Dai@AndrewDai

After almost 12 years in Brain/DeepMind, I’ve finally decided to take the leap. My cofounders: @yinfeiy, Seth and I have kicked-off @ElorianAI. The first multimodal reasoning lab founded and led by former LLM pretraining, data and multimodal leads. youtu.be/YlvfNpOMeOY?si… (1/n)

English
1
0
3
133
Andrew M. Dai
Andrew M. Dai@AndrewDai·
After almost 12 years in Brain/DeepMind, I’ve finally decided to take the leap. My cofounders: @yinfeiy, Seth and I have kicked-off @ElorianAI. The first multimodal reasoning lab founded and led by former LLM pretraining, data and multimodal leads. youtu.be/YlvfNpOMeOY?si… (1/n)
English
82
71
776
314.4K
Samira Daruki retweeted
Senator Scott Wiener
Senator Scott Wiener@Scott_Wiener·
WE JUST RE-LAUNCHED THE BAY LIGHTS! San Francisco’s light is now even brighter. Truly the best city on the planet. Happy Bay Lights!
English
126
199
3.4K
260.6K
Samira Daruki
Samira Daruki@SamiraDaruki·
"Watch out for performative sacrifice and don’t confuse pain with progress."
Toby Pohlen@TobyPhln

At 1:30 a.m. PT on November 3, 2023 Elon sent a message to the xAI group chat saying that we need to go “extremely hardcore” for the next 36 hours; Grok will be released publicly tomorrow. You didn’t have to be in the exclusive company chat to get the message; it was also posted publicly at the same time: x.com/i/status/17203… What unfolded over the next day and a half was one of the best examples of engineering at pace that I’ve ever seen. All we had when we started was a somewhat fine-tuned base model and a half-baked UI. Our team of ten split up the tasks: curate data, improve the model, implement the raw prompting and RAG service, build the production infra. I took care of the latter. At 8:51 p.m. PT the next day, we announced Grok to the world with a long-form post on X (x.com/xai/status/172…). Over the past 36 hours, we came up with Fun mode (including Grok’s sunglasses), finished the whole production system, and most importantly tuned the RAG system that gave it real-time knowledge of the world through the X platform (a first in the industry). A day and a half of straight coding and shipping; no drugs, not even caffeine, just pure adrenaline. Elon gave us a mission and we delivered. The launch went very well. We invited a couple hundred X creators and Grok’s ability to roast accounts went viral. It was the first time a publicly accessible AI was allowed to poke fun at people. This episode is a prime example of what you can achieve by going extremely hardcore: you move and deliver results faster than any outsider could have anticipated. Within 36 hours, we took the company from silence to relevance. It was well worth it. xAI’s hardcore culture is infamous on X. I love the tent meme that suggests we all sleep (well, slept in my case) in the office in tents. Our reputation precedes us and even new joiners hit the ground grinding hard. However, unless you understand the “why,” you are at risk of simply replicating the “how” without achieving the same results. 
You need to grind with purpose and the purpose is to move fast towards a known goal. When the goal and the means of reaching it are crystal clear, a small, skilled, and highly motivated team can outcompete companies old and new, big and small. Never grind to show off; never work late to be seen; never sacrifice without cause. There is no medal for the one who tried extremely hard but failed. There is only a medal for the winner. If all your efforts lead nowhere, you’re arguably not very productive. Always keep your eyes firmly on the goal, do everything to reach it as quickly as possible, and make sure you're on track to win. A hardcore engineering culture is one of the most effective ways of accelerating real progress. Watch out for performative sacrifice and don’t confuse pain with progress.

English
0
0
0
147
Samira Daruki retweeted
Elon Musk
Elon Musk@elonmusk·
Another one bites the dust
English
20.9K
33.9K
456.2K
117.5M
Samira Daruki retweeted
Nikita Bier
Nikita Bier@nikitabier·
Today was the biggest day on 𝕏 in history.
English
6.1K
4.8K
65.3K
90.4M
Hieu Pham
Hieu Pham@hyhieu226·
I have made the difficult decision to leave @OpenAI. Working here and at @xai before was a once-in-a-lifetime experience. I have met the best people. Not the best people in AI. Not the best people in tech. Simply the best people. At these companies, I have helped create extremely intelligent entities that will meaningfully improve our lives. The work makes me proud. But the intensive work came with a price. I cannot believe I would say this one day, but I am burnt out. All the mental health deteriorating that I used to scoff at is real, miserable, scary, and dangerous. I am going to take a break from frontier AI labs, and will take my family to my home country Vietnam. There, I will try something new, and also search for a cure for my conditions. I hope I will heal. Until then.
English
1.1K
409
14K
1.2M
Samira Daruki retweeted
Oriol Vinyals
Oriol Vinyals@OriolVinyalsML·
Gemini 3.1 Pro has landed! Amazing performance / capabilities across the board. Beyond SOTA, the best are all the things that evals can't measure. E.g. SVG has gotten so much better (see 🧵) blog.google/innovation-and…
English
25
29
412
57.4K
Samira Daruki retweeted
Fangyu Liu
Fangyu Liu@hardy_qr·
Gemini 3. Point 1. A small first step towards something truly exciting and ambitious. It is the most rewarding experience in the world to turn interesting research ideas into shaping the frontier. They were toy experiments in the lab just a few months ago, now it's in your hand!
Noam Shazeer@NoamShazeer

Last week we upgraded Gemini 3 Deep Think. Today, we’re shipping the core intelligence that makes those breakthroughs possible: Gemini 3.1 Pro. A noticeably smarter, more capable baseline for your hardest challenges. Available now: blog.google/innovation-and…

English
0
1
13
894
Oriol Vinyals
Oriol Vinyals@OriolVinyalsML·
Personal update: After an amazing 10 years in London, it's time for a major change. One-way ticket back to California 🌞! I'm incredibly excited to return to the Bay Area to continue building Gemini and pushing us toward the age of AGI 🚀
English
59
31
1.5K
167.1K
Samira Daruki retweeted
Google DeepMind
Google DeepMind@GoogleDeepMind·
How could AI act as a better research collaborator? 🧑‍🔬 In two new papers with @GoogleResearch, we show how Gemini Deep Think uses agentic workflows to help solve research-level problems in mathematics, physics, and computer science. More → goo.gle/4aGs3Pz
English
100
284
2K
424.1K
Samira Daruki retweeted
Noam Shazeer
Noam Shazeer@NoamShazeer·
An updated Gemini 3 Deep Think is out today: 📈 Achieves SOTA on ARC-AGI-2, MMMU-Pro, and HLE. 🥇Gold-medal level on Physics & Chemistry Olympiads. It turns out the best way to solve hard problems is still to think about them. Read more: bit.ly/4kzBLqq
English
39
117
1.2K
109.7K
Samira Daruki retweeted
Quoc Le
Quoc Le@quocleix·
Excited to share our latest work: "Semi-Autonomous Mathematics Discovery with Gemini." We used Gemini to systematically evaluate 700 "open" conjectures in the Erdős Problems database. The result? We addressed 13 problems marked as open—finding 5 novel autonomous solutions and identifying 8 existing solutions missed by previous literature. Read the full case study here: arxiv.org/abs/2601.22401
English
45
209
1.3K
246.3K
Shane Gu
Shane Gu@shaneguML·
Once you swag, you swag for the rest of your life. Saga of any AI PhD graduates of my era, however many years of industry experience you get.
English
1
1
25
3.7K
noahdgoodman
noahdgoodman@noahdgoodman·
I’ve co-founded humans& with @ericzelikman @TheAndiPenguin @gharik & @YuchenHe07 — a new AI lab working to bring people together, empower the better angels of our nature, and create the most optimistic future. At a time of tumult and danger, humanity needs a humanist AI.
humans&@humansand

Today we introduce humans&, a human-centric frontier AI lab. We believe AI can be reimagined, centering around people and their relationships with each other. At its best, AI should serve as a deeper connective tissue that strengthens organizations and communities

English
32
25
344
57.1K