LAION

1.5K posts

LAION

@laion_ai

100% TRULY OPEN AI.

The internet Katılım Eylül 2021

106 Takip Edilen15.5K Takipçiler

LAION retweetledi

Ross Wightman@wightmanr·8h

I've made extensive changes to OpenCLIP over the past few weeks, merging ideas I've been evolving in some other projects that I'm working on in parallel. If you've built anything around it, take a peek. Training has been refactored around a lightweight Task based abstraction making different model + loss/objective combos much cleaner to swap. FSDP2 support was added, improve torch.compile support, and some effort to allow combos of the FSDP2 or DDP + compile + activation checkpointing to work (mostly) nicely together. Native aspect NaFlex data pipelines and timm based NaFlexViT encoder is supported for train and eval (incl the OpenCLIP native SIGLIP2 naflex models). @mehdidc started this but I broke it and needed to rethink. The merging of CLAP (audio-clip) modelling, train task, data pipelines initiated by @JJitsev was unblocked thanks to the above cleanup. I just completed the reorganization and initial CLAP training appears to be functioning 🥳 There is still some verification to do. I will be testing distributed performance on JUPITER (Jülich Supercomputing Centre, Germany) to clear the way for some researchers to abandon their forks :)

English

3.7K

LAION retweetledi

Andrej Karpathy@karpathy·3d

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

English

7.8K

11.1K

148K

26.6M

LAION retweetledi

Kōda@aimikoda·10 May

Seedance 2.0 Emotional Coordinates Meet Valence-Arousal. I’ve been experimenting with valence-arousal prompting in Seedance 2.0 on @mitte_ai, and surprisingly, a lot of it actually works. Most likely this should transfer to many other video/image models too. Before we start, I should mention that a lot of attempts can hit safety/violence filters depending on how extreme the emotional state becomes. My test prompts were also intentionally very simple and open-ended, which probably increased that. Valence measures how positive or negative an emotional state is. Arousal measures how calm or activated it is. So instead of writing: “sad, anxious, emotionally overwhelmed” you can try things like: valence: low arousal: high I tested mostly numeric values at first, but honestly I’ve been getting much better and more stable results with transition-based prompting like this: “The emotional state gradually shifts from high valence and low arousal to low valence and high arousal.” That feels much more model-friendly right now. I think this could become really useful for subtle acting, emotional transitions, cinematic dialogue scenes, uncanny performances and mood-driven storytelling. Leaving example Seedance 2.0 prompts, the Valence-Arousal infographic and a GPT Image 2 prompt below.

English

477

48.1K

LAION retweetledi

mrfakename@realmrfakename·5 May

Qwen3-ASR-Enhanced v0.1 An early checkpoint, consider it a beta release. Soon I will release a better version with more stable nonverbal tag support. Huge thanks to @huggingface for sponsoring the compute used to train this model! Thanks to LAION for some of the data.

English

3.5K

LAION retweetledi

Sakana AI@SakanaAILabs·29 Nis

We’re excited to introduce KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI, accepted at #ICASSP2026! 🐢 Blog pub.sakana.ai/kame/ Paper arxiv.org/abs/2510.02327 Can a speech AI think deeply without pausing to process? In real conversation, we don’t wait until we’ve fully worked out what we want to say—we start talking, and our thoughts catch up as the sentence unfolds. Fast speech-to-speech models achieve this, but their reasoning tends to stay shallow. Cascaded pipelines that route through a knowledgeable LLM are smarter, but the added latency breaks the flow—they fall back to "think, then speak." In our new paper, we propose a way to break this trade-off. We call it KAME (Turtle in Japanese). A speech-to-speech model handles the fast response loop and starts replying immediately. In parallel, a backend LLM runs asynchronously, generating response candidates that are continuously injected as "oracle" signals in real time. This shifts the AI paradigm from "think, then speak" to "speak while thinking." The backend LLM is completely swappable. You can plug in GPT-4.1, Claude Opus, or Gemini 2.5 Flash depending on the task without changing the frontend. In our experiments, Claude tended to score higher on reasoning, while GPT did better on humanities questions. Try the model yourself here: huggingface.co/SakanaAI/kame

English

145

740

288.2K

LAION retweetledi

etn.@etnshow·27 Nis

BREAKING: Sequoia and Lightspeed co-lead Europe's largest seed funding round with $1.1B at $5.1B post for ex-Deepmind David Silver's Ineffable Intelligence. David is committing to giving away 100% of the money he makes from his Ineffable equity via Founders Pledge - the biggest pledge in their history and it is likely to amount to multiple billions.

English

694

108.7K

LAION retweetledi

Andreas Köpf@neurosp1ke·20 Nis

Daughter: „looks nice, but a bit slow“ .. ok Claude - please auto-research fast inference on Jetson Thor, my cuda graphs baseline is 691ms e2e …5h 130 experiments later auto-research plateaued at 285 ms. Integrating into real robot repo easy >2x speedup 🤯

English

489

34.7K

LAION@laion_ai·10 Nis

Round 2 is here! 🎧 We listened to your feedback and made it snappier — 30-second clips, smoother flow. We're building the largest open dataset of human music perception, and we need your ears. Listen. React. Done. Help shape open music AI: songrater.bud-e.ai

English

LAION@laion_ai·8 Nis

@alexandr_wang Not even open weights?

English

888

Alexandr Wang@alexandr_wang·8 Nis

1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵

English

728

1.2K

10.4K

4.5M

LAION retweetledi

Adina Yakup@AdinaYakup·7 Nis

VoxCPM2 🔊 New token-free TTS model from @OpenBMB huggingface.co/openbmb/VoxCPM2 ✨2B - Apache 2.0 ✨30 languages supported ✨Design voices from text (gender, age, tone, emotion) ✨48kHz studio-quality audio

English

249

12.9K

LAION@laion_ai·4 Nis

@realsigridjin @github I hope you have a backup ;)

English

Sigrid Jin 🌈🙏@realsigridjin·4 Nis

we tried to transfer the fastest growing repo on github and it's been locked for 2 days @github you good? 😭😭

English

388

35.9K

LAION retweetledi

Om Patel@om_patel5·4 Nis

I taught Claude to talk like a caveman to use 75% less tokens. normal claude: ~180 tokens for a web search task caveman claude: ~45 tokens for the same task "I executed the web search tool" = 8 tokens caveman version: "Tool work" = 2 tokens every single grunt swap saves 6-10 tokens. across a FULL task that's 50-100 tokens saved why does it work? caveman claude doesn't explain itself. it does its task first. gives the result. then stops. no "I'd be happy to help you with that." no "Let me search the web for you" no more unnecessary filler words "result. done. me stop." 50-75% burn reduction with usage limits getting tighter every week this might be the most practical hack out there right now

English

950

1.4K

23.6K

3.1M

LAION retweetledi

Aratako@Aratako_LM·3 Nis

Gemma 4 E4Bからaudio encoderだけを取り出してみました huggingface.co/Aratako/gemma-…

日本語

278

21.9K

LAION@laion_ai·2 Nis

@rosscoulthart @skdh @skdh time to join the open source AI community! Physics is sooo 20th century!

English

175

Ross Coulthart@rosscoulthart·2 Nis

A sign of how unwell contemporary science has become: The well-respected physicist commentator Sabine Hossenfelder @skdh has lost her academic affiliation because she dared to criticise a physicist's research. She delivers a damning condemnation: "A lot of research and the foundations of Physics is now pseudo-science. It hasn't followed the scientific method for decades." youtu.be/ZO5u3V6LJuM?si… She recounts a recent incident where a physicist contacted her, upset that she had judged their research as "100% bullshit", demanded she remove the relevant video, and then complained to people he believed were her supervisors when she refused. As a result of complaints (including from members of the community upset about her criticism of their research and academic conduct in general), her former academic institution—the Munich Center for Mathematical Philosophy— has discontinued her affiliation. Sabine is financially independent thanks to her audience support, so she is unbothered by the loss of affiliation or attempts to pressure her; however, she is concerned that many physicists fail to recognize the fundamental problems with their field. The broader issue in theoretical high-energy physics and foundations of physics is not new: critics like David Lindley ("the end of physics") and John Horgan ("the end of science") have pointed it out, yet the production of low-value "garbage" papers continues daily, gets published, funded, and hyped in the media. Many experts acknowledge the problems privately but stay silent publicly to protect their reputations and funding; an exception is physicist Will Kinney, who publicly criticized inflation model-building as mostly useless mathematical exercises with no realistic expectation of correctness. She strongly endorses Jesper Grimstrup's book "The Ant Mill," which describes the crisis in theoretical high-energy physics: no major breakthroughs in ~50 years, a lack of genuinely new ideas, and strong social pressures toward tribalism and groupthink that discourage independent thinking. She says intelligent people are wasting their time and taxpayer money on unproductive work due to ingrained groupthink; physicists are often shocked by external criticism and refuse to accept responsibility, blaming the critic instead. Hallmarks of pseudoscience in this area include: it looks like science from the inside (with courses, conferences, and jargon), but involves inventing mathematical "stories" or fictions about non-existent laws, new particles, forces, gravities, the beginning of the universe, multiverses, or extra dimensions—with what she says is zero empirical evidence. She compares this to naturopathy or other 'pseudoscientific' fields for brainwashing, rejection of challenging views, and overconfidence in one's intelligence; the main difference is that quack medical claims can directly harm people, while quack physics papers mainly waste money and resources. She highlights the core scientific failure: Science progresses by learning from mistakes and refining what counts as a "worthy" hypothesis. Post-1970s theoretical physics has not done this; instead, it continues guessing "nice" mathematics without basis, producing thousands of falsifiable but ultimately falsified predictions. Pre-1970s physics successfully solved real problems and made correct predictions (e.g., neutrino masses, Higgs boson); since then, the method of generating hypotheses via mathematical beauty or speculation has failed to yield confirmed breakthroughs, yet the community refuses to update its standards. The scientific method is misunderstood: it is not just "make a hypothesis and test it." Disciplines learn quality standards from past failures (e.g., random doomsday predictions are dismissed as unscientific because we know they waste time). Theoretical physics has stopped learning in this way for foundational questions. Many subfields (e.g., high-temperature superconductors, quantum information) are doing "normal science" productively, but the problem is concentrated in areas that invent superfluous, evidence-free hypotheses with no pressing data or consistency issues to solve. Example with dark matter: Solid evidence exists for it, and simple models suffice, but researchers unnecessarily complicate it with new "dark sectors," fifth forces, etc., that add extra assumptions and soon get ruled out—violating principles of parsimony. She compares it to the replication crisis in psychology (p-value hacking and irreproducible results), noting that psychology at least attempted reforms, while physics has doubled down on piling up unfruitful guesses (extra dimensions, multiverses, etc.). She proposes a solution: Journals and reviewers should adopt stricter guidelines—e.g., only publish papers where hypotheses use the minimal necessary assumptions and actually solve a real consistency or explanatory problem, rather than mathematical fiction. This could eliminate ~99% of the issue quickly, though journals resist due to incentives around publication volume and citations. The field has turned into a self-perpetuating system of producing and rewarding mathematical fiction instead of evidence-driven progress. Public exposure and pressure for reform are needed, even if it makes people uncomfortable. This is quite an important and challenging vent from Ms Hossenfelder. Good luck to her in her new independent role. @ericweinstein

YouTube

English

130

304

1.5K

177K

LAION retweetledi

Logan Kilpatrick@OfficialLoganK·2 Nis

Introducing Gemma 4, our series of open weight (Apache 2.0 licensed) models, which are byte for byte the most capable open models in the world! Gemma 4 is build to run on your hardware: phones, laptops, and desktops. Frontier intelligence with a 26B MOE and a 31B Dense model!

English

287

593

6.2K

524.6K

LAION retweetledi

Yuchen Jin@Yuchenj_UW·31 Mar

> Anthropic leaked Claude Code source code > someone forked it > 32.6k stars, 44.3k forks > got scared of getting sued > convert the whole codebase from TypeScript to Python with Codex AI is quietly erasing copyright.

English

234

449

413.1K

LAION retweetledi

Gergely Orosz@GergelyOrosz·31 Mar

This is either brilliant or scary: Anthropic accidentally leaked the TS source code of Claude Code (which is closed source). Repos sharing the source are taken down with DMCA. BUT this repo rewrote the code using Python, and so it violates no copyright & cannot be taken down!

English

442

1.2K

12.9K

2.2M

LAION retweetledi

V@vineetwts·31 Mar

I created documentation over Claude Code's Codebase, which explains - Its pipeline - How it works - How it handles Context - How it handles Memory & More Read it here - mintlify.com/VineeTagarwaL-…

Chaofan Shou@Fried_rice

Claude code source code has been leaked via a map file in their npm registry! Code: …a8527898604c1bbb12468b1581d95e.r2.dev/src.zip

English

153

647

5.9K

1.2M

LAION retweetledi

Chaofan Shou@Fried_rice·31 Mar

Claude code source code has been leaked via a map file in their npm registry! Code: …a8527898604c1bbb12468b1581d95e.r2.dev/src.zip

English

3.3K

7.6K

48.7K

35.6M

LAION retweetledi

Adina Yakup@AdinaYakup·24 Mar

daVinci-MagiHuman 🎬 Human Centric Audio-Video Generative Model by GAIR Model: huggingface.co/GAIR/daVinci-M… Paper: huggingface.co/GAIR/daVinci-M… ✨ 15B – Fully open source! ✨ 5-sec 1080p video in 38s on one H100 ✨ Supports 6 languages ✨ Unified model with text + video + audio

English

342

42.9K

Keşfet

@mehdidc @JJitsev @mitte_ai @huggingface @alexandr_wang @OpenBMB @realsigridjin @github