Sabitlenmiş Tweet
Yuki Arimo
80 posts

Yuki Arimo
@yukiarimo
Software Developer, AI Engineer, Artist, and Philosopher
Calgary, Canada Katılım Aralık 2022
40 Takip Edilen142 Takipçiler

Humans can see in high-res, high-FPS in real-time. Why can't VLMs?
Introducing AutoGaze: ViTs/VLMs "gaze" only at key video regions! Up to 4-100x token savings, 19x speedup, and enables scaling to 4K-res 1K-frame videos.
📄 arxiv.org/abs/2603.12254
🌐 autogaze.github.io
🤗 huggingface.co/collections/bf…
(1/n)🧵
English

@kamath_sutra Did you know that VITS is free? And only price you pay is Google Colab rent for training it once. Yeah, we’re still winning!
English

@AdinaYakup I’m waiting for simple & small video extrapolation model, but they continue to ship these giant bloated ones :(
English

daVinci-MagiHuman 🎬 Human Centric Audio-Video Generative Model by GAIR
Model: huggingface.co/GAIR/daVinci-M…
Paper: huggingface.co/GAIR/daVinci-M…
✨ 15B – Fully open source!
✨ 5-sec 1080p video in 38s on one H100
✨ Supports 6 languages
✨ Unified model with text + video + audio
English

Get the New Book 'Apple: The First 50 Years' for 30% Off on Amazon macrumors.com/2026/03/22/app…

English

I tested something I'd written myself: my first academic article, published 45 years ago. It came in at 77% AI-generated.


Prof. Devi Sridhar@devisridhar
Exact same issue for me- I know my previous books and articles have been used to train AI (looking at you anthropic)- & when I run previous articles (written pre-AI) into AI checkers, they can come back as high as 90% AI. It's not artificial intelligence- it's collective human intelligence.
English

@hasumi_nanako No, you’re fucking cringe slut for gooners and OF. Nothing more. GTFO
English

ACE Studio Video Composer doesn’t just generate tracks.
It analyzes your scenes, cuts, and motion to create synced music & SFX on a full timeline.
It’s like an AI audio assistant that builds your soundtrack and gives you fully editable clips.
Download: acestudio.ai 🚀
English

@Zai_org @NVIDIAGTC If it’s good with SwiftUI, I would like to win and learn!
English

🤗To celebrate @NVIDIAGTC and the launch of GLM-5-Turbo, we're running a giveaway!
Our team will be picking lucky winners on a rolling basis. You have a chance to get a free month of the Max Coding Plan!
Here's how to join in: a retweet, reply, or post with:
1. GLM-5-Turbo usecases or
2. A short write-up based on your experience or honest take
The window closes in 48 hours. All prizes will be sent out within 72 hours after that. Feel free to get your User ID here: z.ai/subscribe
Please jump in, we'd love to see what you're building!
ps. @louszbd is at GTC right now.😆 Come say hi in person!
English

The open source ecosystem underpins nearly every software system in the world. As AI grows more capable, open source security becomes increasingly important.
We're donating to the Linux Foundation to continue to help secure the foundations AI runs on.
The Linux Foundation@linuxfoundation
The Linux Foundation Announces $12.5 Million in Grant Funding (via @AlphaOmegaOSS and @OpenSSF) @AnthropicAI , @AmazonWebServices, @GitHub, @Google, @GoogleDeepMind, @Microsoft, @OpenAI to Invest in Sustainable Security Solutions for #OpenSource linuxfoundation.org/press/linux-fo…
English

@OpenAI @AndrewMayne No, AI will steal our data and achievements. You’re thieves!
English

AI is starting to help solve real issues in healthcare for patients and doctors.
OpenAI’s Head of Health Dr. Nate Gross and Health AI Research Lead Karan Singhal join @AndrewMayne to discuss how we're building new models and products to meet the world's health needs.
English

GPT-5.4 mini is available today in ChatGPT, Codex, and the API.
Optimized for coding, computer use, multimodal understanding, and subagents. And it’s 2x faster than GPT-5 mini.
openai.com/index/introduc…

English

@Superhuman What a meaningless yap from such losers. You generate AI slop, this is not learning
English

“Never stop learning. When you’re okay with feeling ‘new’ at something, everything else becomes accessible to you. The biggest superpower is remembering how to play and learn.”
Blessing Richardson at #SXSW2026 on how to invest in yourself at any stage in your career.

English

Today we’re introducing 1Password® Unified Access.
As AI agents start operating inside real production environments, organizations need visibility into how credentials and access are actually used.
Unified Access helps security teams discover, secure, and audit access across humans, machines, and AI agents.
🔗 More here: bit.ly/4dq2pjO
English

@liquidai @xenovacom How about sharing the code for this mouse effect?
English

a vision language model too fast for human eyes! kudos @xenovacom 🐐
Ramin@ramin_m_h
model’s so fast, Josh had to slow down the video capture to show case this demo! @liquidai
English

No, I mean if you will do as we did:
1. Rent a studio.
2. Record 24 hours (per one language, per one speaker) of audio.
3. Because you already have the text, transcripts would be perfect (no ASR errors).
4. Train VITS-2 model from scratch
The resulting model will:
1. Sound much better, because it uses VAE (trainable) and not that crap.
2. Speak with perfect pronunciation 100%, because phonemes are used instead of text.
3. Have no errors whatsoever because it is task-specific and non-autoregressive.
:)
English

Fish audio s2 pro released with 80+ supported languages and fine grained inline control.
Link on @huggingface huggingface.co/fishaudio/s2-p…

English








