Tycho Labs

976 posts

Tycho Labs

@TychoLabsCom

Building trusted AI. Reaching beyond.

Germany Katılım Mayıs 2022

67 Takip Edilen193 Takipçiler

Tycho Labs@TychoLabsCom·1d

The local AI stack is getting interesting. Models like Qwen 3.6 27B are only one layer. The real leverage may come from the harness: context normalization, tool routing, execution loops, memory, and evals. Local agents will be won by systems, not models alone. #AI #LLM #LocalAI

English

Tycho Labs@TychoLabsCom·3d

Everything is possible!

Artificial Analysis@ArtificialAnlys

Cursor Composer 2.5's is 3–18x cheaper than Opus 4.7 in Claude Code (medium reasoning), and 5–32x cheaper than GPT-5.5 in Codex (medium) based on API pricing This low Cost per Task isn't just driven by relatively low token pricing, it's also driven by low relatively low token usage compared to other leading models. @cursor_ai Composer 2.5 only used 1.6M token to complete our Coding Agent Index benchmarks, while other models used up to 5.7M. This lower token usage also contributes to a low Time per Task. Across the Coding Agent Index configurations shown, average Time per Task was ~12 minutes. Composer 2.5 completed tasks in ~9 minutes on average, making it ~1.3x faster than average, while Composer 2.5 Fast completed tasks in ~7 minutes, making it ~1.8x faster than the average across agents. Link to full benchmark results below

English

Tycho Labs@TychoLabsCom·6d

@Google @GeminiApp @Samsung @_GentleMonster_ @WarbyParker Good luck!

English

Google@Google·6d

We’re partnering with @Samsung, @_GentleMonster_ and @WarbyParker on new intelligent eyewear. Here's a sneak peek at two designs from this fall's upcoming collections. #GoogleIO

English

283

969

8.9K

1.5M

Tycho Labs@TychoLabsCom·6d

LLMs learned to predict text. World models are learning to predict how reality changes. Is the future of AI language, simulation, or something entirely different?

Odyssey@odysseyml

Introducing Agora-1, a multi-agent world model. Multiple participants—human or AI—can now interact inside the same world simulation, all in real-time. Try our playable research preview today, with Agora-1 simulating a multiplayer GoldenEye deathmatch!

English

Tycho Labs@TychoLabsCom·6d

AI hardware won’t win because it has the best specs. It will win when intelligence becomes instant, private, and close enough to use without thinking. #AI #Startups #AIagents

English

Tycho Labs@TychoLabsCom·15 May

The GPU did not change. The model did. That is why open models are dangerous to the current AI stack. They keep turning consumer hardware into serious AI execution infrastructure.

Sudo su@sudoingX

update: qwen 3.6 27b dense q4 just one shotted octopus invaders game on a single 3090. hermes agent drove the whole thing, ~41 tok/s gen 21gb vram at full 262k context, thinking mode on. one prompt in and the canonical multi-file space shooter benchmark out, the same exact prompt i ran on qwen 3.5 27b dense back in march on the same card. 3.5 needed one external scope bug fix before the game would even load on first play. 3.6 needed nothing. 11 of 11 files written, 2411 lines of code, zero steering interventions, zero external fixes, playable on first load. 16 minutes 41 seconds wall clock from prompt to playable. consumer tier king on a single 3090 is locked tonight, and the silicon underneath my desk did not change between march and now. the open source ecosystem just moved the floor. watch it ship itself, the full 16 minutes 41 seconds sped to 3 minutes 45, no human touched the keyboard between the first prompt and the final frame.

English

Tycho Labs@TychoLabsCom·15 May

@ScottShapiroUXD The brutal truth is simple. If your AI product can be rebuilt by switching APIs and redesigning the UI, you don’t have a moat. The moat is workflow ownership, proprietary usage loops, distribution, and execution quality.

English

Scott Shapiro@ScottShapiroUXD·15 May

@TychoLabsCom Model access is table stakes. The moat is the workflow you build on top of it. Most companies haven't figured that out yet.

English

Tycho Labs@TychoLabsCom·15 May

The AI bubble won’t pop because AI is useless. It will pop for companies that confuse model access with a moat. #AI #Startups #LLM

English

Tycho Labs@TychoLabsCom·15 May

This is not AI writing research. This is AI compressing the research loop: idea → implementation → training → evaluation → iteration If that loop gets 10x faster, the real bottleneck becomes taste, direction, and knowing what is worth testing.

Aksel@akseljoonas

3 weeks since ml-intern launched and we just hit 1M messages exchanged. that's 3.3 agent-years of ML research in 21 days. 2 months worth of research every day. 17,383 training jobs total. talk about AI acceleration. here's some of what people built: @cmpatino_ replicated the full DeepSeek v4 architecture and pre+post trained a 100M MoE from scratch. → huggingface.co/cmpatino/nanow… it landed a third place submission on @kellerjordan0 optimizer competition. autoresearch on SOTA territory. github.com/KellerJordan/m… @_lewtun Got the intern to convert @AlecRad's cool new talkie-lm 1930 model to work with transformers. tokenizer, chat template, model conversion etc all one-shotted by ml-intern. huggingface.co/lewtun/talkie-… someone created entire PhD dissertation chapter on context-aware agentic cyber defense drafted with 16 research subagents. and someone used it to crack an @Anthropic kernel optimization take-home. (we don't know how to feel about this one 👀 ) just getting started → huggingface.co/spaces/smolage…

English

Tycho Labs@TychoLabsCom·14 May

Agent builders should stop treating chat history as just messages. Role order, message merging, tool results, compaction, and serialization can change model behavior. The next serious AI stack needs provider-normalized context handling to keep agent behavior consistent across models. #AI #AIagents #LLM

English

Tycho Labs@TychoLabsCom·13 May

Language models learned to predict the next token. World models are learning to predict the next state. That may be one of the most important shifts in AI.

Odyssey@odysseyml

Why We Must Build World Models

English

Tycho Labs@TychoLabsCom·11 May

The next AI moat may not be compute. It may be data strategy: knowing when to repeat rare, valuable data and when to source more unique tokens. In LLM training, what is the bigger unlock, smarter repetition or more unique high-quality data? #AI #LLM #MachineLearning

English

Tycho Labs@TychoLabsCom·9 May

The next AI platform won’t feel like opening software. It will feel like having capability closer to you, always aware of context, and ready to act when needed. The best technology disappears into daily life. #AI #Startups #AIagents

English

Tycho Labs@TychoLabsCom·5 May

If this holds up, the biggest unlock is not just longer context. It’s AI agents that can finally work with the full picture: codebases, histories, decisions, and workflows.

Alexander Whedon@alex_whedon

Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.

English

Tycho Labs@TychoLabsCom·2 May

The next generation of AI won’t be defined by who has the most data. It will be defined by who builds the fastest learning loops. #AI #LLM #Startups

English

Tycho Labs@TychoLabsCom·30 Nis

A lot of AI startups are trying to build the next interface. But the bigger opportunity may be underneath: The execution layer that connects models to tools, systems, data, and real-world workflows. That is where AI becomes infrastructure. #AI #Startups #AIagents

English

Tycho Labs@TychoLabsCom·29 Nis

@desert_mouse Both, but personal trust is where agents become truly valuable. Global evals tell us if the system is generally reliable. The real question is: can it reliably execute my workflows with my tools and constraints?

English

Arnon Kahani@desert_mouse·29 Nis

@TychoLabsCom Is it going to be global trust or personal i.e. your workflow your tools?

English

Tycho Labs@TychoLabsCom·29 Nis

“Feels better” is not enough for AI agents. Did it use fewer tools? Did it waste fewer tokens? Did it retry less? Did it complete the task more reliably? Trust needs to become measurable. #AI #AIagents #AIEvals

English

Tycho Labs@TychoLabsCom·29 Nis

@mistralvibe @MistralAI Great job! Keep going.

English

287

Mistral Vibe@mistralvibe·29 Nis

Introducing remote agents in Vibe and Mistral Medium 3.5. You can now launch remote agents in the cloud, including from the CLI or Le Chat. Plus, new Work mode in Le Chat for complex, multi-step tasks. 🧵

English

120

915

132.4K

Keşfet

@Google @GeminiApp @Samsung @_GentleMonster_ @WarbyParker @ScottShapiroUXD @desert_mouse @elonmusk