intenAi🦋

1.2K posts

@intenAi

Where Creativity Meets Intelligence🕊️

Cambridge, England · Joined September 2021

365 Following · 258 Followers

intenAi🦋 reposted

Demis Hassabis@demishassabis·13 May

Really cool work from the team reimagining the mouse pointer to be intelligent! Try the prototype in @GoogleAIStudio it's pretty magical.

Google DeepMind@GoogleDeepMind

We’re reimagining a 50-year-old interface - the mouse pointer - with AI. 🖱️ These experimental demos show how people can intuitively direct Gemini on their screens using motion, speech, and natural shorthand to get things done 🧵

174

2.1K

222K

intenAi🦋 reposted

Zain Shah@zan2434·22 Apr

Imagine every pixel on your screen, streamed live directly from a model. No HTML, no layout engine, no code. Just exactly what you want to see. @eddiejiao_obj, @drewocarr and I built a prototype to see how this could actually work, and set out to make it real. We're calling it Flipbook. (1/5)

1.1K

3.7K

28.7K

5.9M

intenAi🦋 reposted

OpenAI Developers@OpenAIDevs·20 Apr

Last week, we released a preview of memories in Codex. Today, we’re expanding the experiment with Chronicle, which improves memories using recent screen context. Now, Codex can help with what you’ve been working on without you restating context.

224

368

4.5K

1.2M

intenAi🦋 reposted

Greg Brockman@gdb·17 Apr

Codex is open source, enabling anyone to build awesome applications on top of it:

Developing Adventures@DevAdventur3s

Want to use Codex computer control from your phone? OpenAssist got you covered. Let the agent work… you go touch grass 🌱😂 Built on codex app server thanks for making it open source. @OpenAIDevs @thsottiaux

124

1.8K

159.5K

intenAi🦋 reposted

ollama@ollama·18 Apr

Build your own assistant with @NVIDIAAI. ❤️

NVIDIA AI@NVIDIAAI

Here's your weekend project. Build a fully local, sandboxed AI assistant. Step-by-step tutorial to build your always-on agent: 🦞 on openclaw ✅ with NVIDIA NemoClaw ✨ using NVIDIA DGX Spark Get started: developer.nvidia.com/blog/build-a-s…

134

2.1K

305.8K

intenAi🦋 reposted

Greg Brockman@gdb·18 Apr

codex is becoming a full agentic IDE

Evan Bacon 🥓@Baconbrix

Building an iPhone app directly in Codex desktop with iOS simulator

145

222

4.2K

412K

intenAi🦋 reposted

NVIDIA@nvidia·16 Apr

AI will create new kinds of work, new industries, and new ways to build. Last week at Stanford, our CEO Jensen Huang joined Rep. Ro Khanna for a conversation on AI, jobs and the long-term opportunity ahead.

138

152

778

82.1K

intenAi🦋 reposted

Min Choi@minchoi·16 Apr

NVIDIA just dropped Lyra 2.0. This AI can turn one image into an explorable 3D world. Export to 3D Gaussians, meshes, and physics engines. Model Card + Project👇

482

52.6K

intenAi🦋 reposted

Google AI@GoogleAI·15 Apr

Today we launched Gemini 3.1 Flash TTS, our most expressive and controllable text-to-speech model yet. This launch [excitement] includes audio tags! 🗣🏷 Audio tags [explanatory] are a seamless way to guide vocal style, pace, and delivery using natural language commands embedded directly in your text. Want a different tempo or tone? [amazement] Just tag the audio to steer the AI-speech output! The model supports 70+ languages (24 of which are high-quality evaluated languages, including: Japanese, Hindi, and Arabic). Watch the audio tags in action in the demo below ↓
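The inline audio tags in the post above can be pictured as markers that split a script into styled segments. The following is a minimal Python sketch of that idea only. It does not call the Gemini API, and both the helper name `split_audio_tags` and the rule that a tag applies to the text following it are assumptions made for illustration.

```python
import re

# Matches a bracketed tag such as [excitement]; nested brackets are not handled.
TAG_PATTERN = re.compile(r"\[([^\[\]]+)\]")


def split_audio_tags(text):
    """Split text into (tag, segment) pairs.

    Assumption: a tag applies to the text that follows it, up to the next tag.
    Text before the first tag is paired with None.
    """
    segments = []
    current_tag = None
    position = 0
    for match in TAG_PATTERN.finditer(text):
        chunk = text[position:match.start()].strip()
        if chunk:
            segments.append((current_tag, chunk))
        current_tag = match.group(1)
        position = match.end()
    tail = text[position:].strip()
    if tail:
        segments.append((current_tag, tail))
    return segments


if __name__ == "__main__":
    sample = "This launch [excitement] includes audio tags! [explanatory] They guide vocal style."
    for tag, segment in split_audio_tags(sample):
        print(tag, "->", segment)
```

A real request would pass the tagged text to the model unchanged. Splitting it locally like this is only useful for previewing or validating a script before sending it.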

118

309

2.3K

201.3K

intenAi🦋 reposted

Demis Hassabis@demishassabis·16 Apr

Our most expressive and steerable TTS model yet! Designed to give builders granular control over AI-generated speech, Gemini 3.1 Flash TTS is really fun to play with! Available in preview today - for devs via the Gemini API & @GoogleAIStudio + for enterprises on Vertex AI

Logan Kilpatrick@OfficialLoganK

Introducing Gemini 3.1 Flash TTS 🗣️, our latest text to speech model with scene direction, speaker level specificity, audio tags, more natural + expressive voices, and support for 70 different languages. Available via our new audio playground in AI Studio and in the Gemini API!

136

1.5K

142.5K

intenAi🦋 reposted

NVIDIA AI Developer@NVIDIAAIDev·15 Apr

Today, we released Lyra 2.0, a framework for generating persistent, explorable 3D worlds at scale, from NVIDIA Research. Generating large-scale, complex environments is difficult for AI models. Current models often “forget” what spaces look like and lose track of movement over time, causing objects to shift, blur, or appear inconsistent. This prevents them from creating the reliable 3D environments required for downstream simulations. Lyra 2.0 solves these issues by: ✅ Maintaining per-frame 3D geometry to retrieve past frames and establish spatial correspondences ✅ Using self-augmented training to correct its own temporal drifting. Lyra 2.0 turns an image into a 3D world you can walk through, look back, and drop a robot into for real-time rendering, simulation, and immersive applications. ➡️ Learn more: research.nvidia.com/labs/sil/proje… 📄 Read the paper: arxiv.org/abs/2604.13036

103

463

2.9K

432.5K

intenAi🦋 reposted

Sundar Pichai@sundarpichai·15 Apr

Introducing Gemini on Mac. It’s the first time we’re bringing the @Geminiapp to desktop. The team built this initial release with @Antigravity, and it went from an idea to a native Swift app prototype in a few days. More features on the way!

527

851

11.5K

893.7K

intenAi🦋 reposted

Demis Hassabis@demishassabis·15 Apr

Great to see our collaboration w/ @BostonDynamics unlocking new capabilities! Gemini Robotics-ER 1.6 enables robots like Spot to read complex industrial gauges autonomously. Exciting step toward robots that can understand & operate usefully in the physical world

Google DeepMind@GoogleDeepMind

We’re rolling out an upgrade designed to help robots reason about the physical world. 🤖 Gemini Robotics-ER 1.6 has significantly better visual and spatial understanding in order to plan and complete more useful tasks. Here’s why this is important 🧵

204

201.7K

intenAi🦋 reposted

ollama@ollama·31 Mar

Ollama is now updated to run the fastest on Apple silicon, powered by MLX, Apple's machine learning framework. This change unlocks much faster performance to accelerate demanding work on macOS:
- Personal assistants like OpenClaw
- Coding agents like Claude Code, OpenCode, or Codex

293

729

5.8K

779.6K

intenAi🦋 reposted

Mistral AI for Developers@MistralDevs·17 Mar

⚡️ Introducing Mistral Moderation 2, our next-generation moderation model. It introduces new categories and builds on the strengths of the previous version.
- Enhanced performance
- 128k context length (up from 8k)
- Free to use

424

22.6K

intenAi🦋 reposted

Mistral AI@MistralAI·18 Mar

Today, we’re introducing Forge, a system for enterprises to build frontier-grade AI models grounded in their proprietary knowledge. 🌎 Forge bridges the gap between generic AI and enterprise-specific needs. Instead of relying on broad, public data, organizations can train models that understand their internal context embedded within systems, workflows, and policies, aligning AI with their unique operations. We have already partnered with world-leading organizations, like ASML, DSO National Laboratories Singapore, Ericsson, European Space Agency, Home Team Science and Technology Agency (HTX) Singapore and Reply to train models on the proprietary data that powers their most complex systems and future-defining technologies.

363

2.6K

415.6K

intenAi🦋 reposted

Mistral AI for Developers@MistralDevs·17 Mar

🔥 Meet Mistral Small 4: One model to do it all. ⚡ 128 experts, 119B total parameters, 256k context window ⚡ Configurable Reasoning ⚡ Apache 2.0 ⚡ 40% faster, 3x more throughput Our first model to unify the capabilities of our flagship models into a single, versatile model.

320

2.6K

385.1K

intenAi🦋 reposted

NVIDIA AI Developer@NVIDIAAIDev·16 Mar

Ready to deploy AI agents? NVIDIA NemoClaw simplifies running @openclaw always-on assistants with a single command. 🦞 Deploy claws more safely ✨ Run any coding agent 🌍 Deploy anywhere Try now with a free NVIDIA Brev Launchable 🔗 nvidia.com/nemoclaw

NVIDIA Newsroom@nvidianewsroom

#NVIDIAGTC news: NVIDIA announces NemoClaw for the OpenClaw agent platform. NVIDIA NemoClaw installs NVIDIA Nemotron models and the NVIDIA OpenShell runtime in a single command, adding privacy and security controls to run secure, always-on AI assistants. nvda.ws/47xOPqQ

266

601

4.1K

893K

intenAi🦋 reposted

Andrej Karpathy@karpathy·7 Mar

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then:
- the human iterates on the prompt (.md)
- the AI agent iterates on the training code (.py)

The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. github.com/karpathy/autor… Part code, part sci-fi, and a pinch of psychosis :)
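The loop described in the post (propose a change, run a fixed-budget training job, keep the change only if validation loss drops) can be sketched as a simple accept-if-better search. This is an illustrative Python toy, not the code from the repo: `train_and_evaluate`, `propose_change`, and the toy loss surface are hypothetical stand-ins for the real 5-minute training run and the agent's code edits.

```python
import random


def train_and_evaluate(config):
    """Hypothetical stand-in for one fixed-budget training run.

    Returns a toy validation loss that is lowest near lr=0.02, depth=12.
    """
    return 0.85 + 100 * (config["lr"] - 0.02) ** 2 + 0.001 * (config["depth"] - 12) ** 2


def propose_change(config, rng):
    """Hypothetical stand-in for the agent's edit: perturb one setting."""
    candidate = dict(config)
    if rng.random() < 0.5:
        candidate["lr"] = max(1e-4, candidate["lr"] * rng.choice([0.5, 0.8, 1.25, 2.0]))
    else:
        candidate["depth"] = max(1, candidate["depth"] + rng.choice([-2, -1, 1, 2]))
    return candidate


def autoresearch_loop(initial, steps, seed=0):
    """Keep a candidate only when it lowers validation loss (a 'commit')."""
    rng = random.Random(seed)
    best = dict(initial)
    best_loss = train_and_evaluate(best)
    history = [best_loss]  # one entry per accepted change, like commits on a branch
    for _ in range(steps):
        candidate = propose_change(best, rng)
        loss = train_and_evaluate(candidate)
        if loss < best_loss:
            best, best_loss = candidate, loss
            history.append(loss)
    return best, best_loss, history


if __name__ == "__main__":
    best, best_loss, history = autoresearch_loop({"lr": 0.1, "depth": 6}, steps=200)
    print(f"accepted {len(history) - 1} changes, final loss {best_loss:.6f}, config {best}")
```

In the setup the post describes, the proposal step is an AI agent editing the training script and the evaluation step is a real training run. The accept-or-discard rule is the only part this sketch tries to mirror.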

1.1K

3.6K

28.3K

11.1M

intenAi🦋 reposted

Andrej Karpathy@karpathy·6 Mar

nanochat now trains GPT-2 capability model in just 2 hours on a single 8XH100 node (down from ~3 hours 1 month ago). Getting a lot closer to ~interactive! A bunch of tuning and features (fp8) went in but the biggest difference was a switch of the dataset from FineWeb-edu to NVIDIA ClimbMix (nice work NVIDIA!). I had tried Olmo, FineWeb, DCLM which all led to regressions, ClimbMix worked really well out of the box (to the point that I am slightly suspicious about goodharting, though reading the paper it seems ~ok). In other news, after trying a few approaches for how to set things up, I now have AI Agents iterating on nanochat automatically, so I'll just leave this running for a while, go relax a bit and enjoy the feeling of post-agi :). Visualized here as an example: 110 changes made over the last ~12 hours, bringing the validation loss so far from 0.862415 down to 0.858039 for a d12 model, at no cost to wall clock time. The agent works on a feature branch, tries out ideas, merges them when they work and iterates. Amusingly, over the last ~2 weeks I almost feel like I've iterated more on the "meta-setup" where I optimize and tune the agent flows even more than the nanochat repo directly.
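To put the quoted figures in proportion, here is a short Python calculation using only the numbers stated in the post. The training times are the post's approximate values (~3 hours and 2 hours), so the speedup is a rough estimate.

```python
# Figures quoted in the post above.
loss_before = 0.862415
loss_after = 0.858039
num_changes = 110

absolute_drop = loss_before - loss_after
relative_drop_pct = 100 * absolute_drop / loss_before  # roughly half a percent
mean_drop_per_change = absolute_drop / num_changes

# Approximate wall-clock training times from the post, in hours.
hours_before, hours_after = 3.0, 2.0
speedup = hours_before / hours_after

print(f"absolute loss drop:    {absolute_drop:.6f}")
print(f"relative loss drop:    {relative_drop_pct:.2f}%")
print(f"mean drop per change:  {mean_drop_per_change:.2e}")
print(f"training-time speedup: {speedup:.1f}x")
```

The per-change average treats all 110 changes as equal contributors, which the post does not claim. It is only a scale reference.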

335

559

6.5K

638.9K
