Tycho Labs

976 posts

Tycho Labs banner
Tycho Labs

Tycho Labs

@TychoLabsCom

Building trusted AI. Reaching beyond.

Germany Katılım Mayıs 2022
67 Takip Edilen193 Takipçiler
Tycho Labs
Tycho Labs@TychoLabsCom·
The local AI stack is getting interesting. Models like Qwen 3.6 27B are only one layer. The real leverage may come from the harness: context normalization, tool routing, execution loops, memory, and evals. Local agents will be won by systems, not models alone. #AI #LLM #LocalAI
English
4
0
4
85
Tycho Labs
Tycho Labs@TychoLabsCom·
Everything is possible!
Artificial Analysis@ArtificialAnlys

Cursor Composer 2.5's is 3–18x cheaper than Opus 4.7 in Claude Code (medium reasoning), and 5–32x cheaper than GPT-5.5 in Codex (medium) based on API pricing This low Cost per Task isn't just driven by relatively low token pricing, it's also driven by low relatively low token usage compared to other leading models. @cursor_ai Composer 2.5 only used 1.6M token to complete our Coding Agent Index benchmarks, while other models used up to 5.7M. This lower token usage also contributes to a low Time per Task. Across the Coding Agent Index configurations shown, average Time per Task was ~12 minutes. Composer 2.5 completed tasks in ~9 minutes on average, making it ~1.3x faster than average, while Composer 2.5 Fast completed tasks in ~7 minutes, making it ~1.8x faster than the average across agents. Link to full benchmark results below

English
0
0
0
45
Tycho Labs
Tycho Labs@TychoLabsCom·
AI hardware won’t win because it has the best specs. It will win when intelligence becomes instant, private, and close enough to use without thinking. #AI #Startups #AIagents
English
0
0
0
26
Tycho Labs
Tycho Labs@TychoLabsCom·
@ScottShapiroUXD The brutal truth is simple. If your AI product can be rebuilt by switching APIs and redesigning the UI, you don’t have a moat. The moat is workflow ownership, proprietary usage loops, distribution, and execution quality.
English
0
0
0
22
Scott Shapiro
Scott Shapiro@ScottShapiroUXD·
@TychoLabsCom Model access is table stakes. The moat is the workflow you build on top of it. Most companies haven't figured that out yet.
English
1
0
1
12
Tycho Labs
Tycho Labs@TychoLabsCom·
The AI bubble won’t pop because AI is useless. It will pop for companies that confuse model access with a moat. #AI #Startups #LLM
English
1
0
1
68
Tycho Labs
Tycho Labs@TychoLabsCom·
This is not AI writing research. This is AI compressing the research loop: idea → implementation → training → evaluation → iteration If that loop gets 10x faster, the real bottleneck becomes taste, direction, and knowing what is worth testing.
Aksel@akseljoonas

3 weeks since ml-intern launched and we just hit 1M messages exchanged. that's 3.3 agent-years of ML research in 21 days. 2 months worth of research every day. 17,383 training jobs total. talk about AI acceleration. here's some of what people built: @cmpatino_ replicated the full DeepSeek v4 architecture and pre+post trained a 100M MoE from scratch. → huggingface.co/cmpatino/nanow… it landed a third place submission on @kellerjordan0 optimizer competition. autoresearch on SOTA territory. github.com/KellerJordan/m… @_lewtun Got the intern to convert @AlecRad's cool new talkie-lm 1930 model to work with transformers. tokenizer, chat template, model conversion etc all one-shotted by ml-intern. huggingface.co/lewtun/talkie-… someone created entire PhD dissertation chapter on context-aware agentic cyber defense drafted with 16 research subagents. and someone used it to crack an @Anthropic kernel optimization take-home. (we don't know how to feel about this one 👀 ) just getting started → huggingface.co/spaces/smolage…

English
0
0
0
27
Tycho Labs
Tycho Labs@TychoLabsCom·
Agent builders should stop treating chat history as just messages. Role order, message merging, tool results, compaction, and serialization can change model behavior. The next serious AI stack needs provider-normalized context handling to keep agent behavior consistent across models. #AI #AIagents #LLM
English
0
0
0
23
Tycho Labs
Tycho Labs@TychoLabsCom·
The next AI moat may not be compute. It may be data strategy: knowing when to repeat rare, valuable data and when to source more unique tokens. In LLM training, what is the bigger unlock, smarter repetition or more unique high-quality data? #AI #LLM #MachineLearning
English
0
0
0
35
Tycho Labs
Tycho Labs@TychoLabsCom·
The next AI platform won’t feel like opening software. It will feel like having capability closer to you, always aware of context, and ready to act when needed. The best technology disappears into daily life. #AI #Startups #AIagents
English
0
0
0
32
Tycho Labs
Tycho Labs@TychoLabsCom·
The next generation of AI won’t be defined by who has the most data. It will be defined by who builds the fastest learning loops. #AI #LLM #Startups
English
1
0
1
51
Tycho Labs
Tycho Labs@TychoLabsCom·
A lot of AI startups are trying to build the next interface. But the bigger opportunity may be underneath: The execution layer that connects models to tools, systems, data, and real-world workflows. That is where AI becomes infrastructure. #AI #Startups #AIagents
English
0
0
0
45
Tycho Labs
Tycho Labs@TychoLabsCom·
@desert_mouse Both, but personal trust is where agents become truly valuable. Global evals tell us if the system is generally reliable. The real question is: can it reliably execute my workflows with my tools and constraints?
English
1
0
1
26
Arnon Kahani
Arnon Kahani@desert_mouse·
@TychoLabsCom Is it going to be global trust or personal i.e. your workflow your tools?
English
1
0
1
14
Tycho Labs
Tycho Labs@TychoLabsCom·
“Feels better” is not enough for AI agents. Did it use fewer tools? Did it waste fewer tokens? Did it retry less? Did it complete the task more reliably? Trust needs to become measurable. #AI #AIagents #AIEvals
English
1
0
1
57
Mistral Vibe
Mistral Vibe@mistralvibe·
Introducing remote agents in Vibe and Mistral Medium 3.5. You can now launch remote agents in the cloud, including from the CLI or Le Chat. Plus, new Work mode in Le Chat for complex, multi-step tasks. 🧵
English
41
120
915
132.4K