一起MoltBot吧
84 posts

一起MoltBot吧
@LetMoltBot
用 Moltbot 帮你把流程都搭好 🦞 配置指南|环境搭建教程|开源AI助手
Katılım Temmuz 2020
117 Takip Edilen1.5K Takipçiler

This is so cool !
Combine all kinds of media as input and generate.
Google@Google
Gemini Omni can create anything from any input, starting with video. 🪄 This means you can combine images, audio, video and text as input and generate high-quality videos. Or use drawings to create in a way that matches your vision. #GoogleIO
English

This is a smart first step by the EU.
One can quibble with the exact details, but generally speaking you need a policy push to get firms to reduce supply chain dependency on China. The point is not to decouple but to sensibly diversify sourcing for critical inputs.
FT China@ftchina
EU plans to force companies to buy parts from non-Chinese suppliers ft.trib.al/uphKP1J
English
一起MoltBot吧 retweetledi

For years, React Native devs had two AR/VR options: learn Unity, or settle for 2D panels in VR.
Real immersive 3D, from mobile AR to a native VR app, on one codebase? Didn't exist.
Today it does. @ReactVisionXR Studio is live with Meta Quest support 🧵👇
English

AI teams shouldn’t have to choose between expensive object storage and painful git workflows.
@huggingface Storage is built for model weights, datasets, checkpoints and artifacts:
- simple per-TB pricing
- built-in CDN
- Xet deduplication
- private by default when needed
Store your AI data where your AI work already happens: huggingface.co/storage
English

Recently these super realistic live TV broadcast shots have been going viral everywhere
Tried making one myself using GPT Image 2 + Kling 3.0
Prompt: A screenshot from a live Wimbledon TV broadcast during a packed Centre Court match. The camera cuts to the audience, an unbelievably attractive woman in her 20s with long black hair, flawless skin, elegant makeup, and a luxurious aura, seated in the VIP section wearing a sophisticated cream-white low-cut summer outfit with subtle jewelry. She smiles naturally while reacting to the match, unaware she's on camera. Wealthy spectators and champagne glasses around her, old-money tennis atmosphere, shallow depth of field. Full live tennis broadcast overlay: scoreboard, network watermark, broadcast graphics, 16:9 aspect ratio. The image looks exactly like a real TV screenshot, telephoto broadcast lens, realistic live color grading, slight compression artifacts, interlacing grain, subtle motion blur, imperfect live-camera framing.
English

Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents.
Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold.
Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.
English








