Krishang

255 posts

Krishang banner
Krishang

Krishang

@0ddmonger

granular thoughts and right of first refusal

가입일 Kasım 2022
62 팔로잉11 팔로워
Krishang
Krishang@0ddmonger·
One’s the hype and second, the ungodly cost of using it right now. 25/125 is the split for 1M in and out respectively. That’s insane! Till the time they fix compute efficiency for just the current demand, not even future users or models, user experience will be a rocky road ahead! Wrote about this in a recent piece: x.com/0ddmonger/stat…
English
1
0
2
962
banteg
banteg@banteg·
anthropic running the exact same marketing playbook with every release. “our model is so capable and dangerous, ahh we are afraid to release it”. just put the model in the bag lil bro.
English
55
255
4K
85K
Péter Szilágyi
Péter Szilágyi@peter_szilagyi·
Well, fuck Anthropic. I've bought a 3 month Claude Max sub to a friend as a gift. Sent it to them. 10 days later, their gift is GONE from their account. No trace whatsoever. I go to Anthropic to request a refund: - I can't it's not my gift. - They can't, it doesn't exist. Oh, and you have NO WAY to contact a person who understands the problem, you can only talk to a fucking AI whose job is to get rid of you. It just closes the convo with "End." after you explain what's wrong.
English
58
27
809
95.4K
Krishang
Krishang@0ddmonger·
Unpopular take: Customer centric businesses using AI to completely automate customer support (E2E) might just be going about things the wrong way. This will most definitely increase churn till LLMs can close conversations with a good success rate. The success unit test (how a conversation is marked resolved) is not unique to either a human customer rep or an LLM.
English
0
0
0
675
Krishang
Krishang@0ddmonger·
@kimmonismus The fact that mythos launched the same time as my article about new models being immaterial and compute scaling and efficiency being more important in the long run is absolutely hilarious
English
0
0
0
4.1K
Chubby♨️
Chubby♨️@kimmonismus·
Let that sink in. Read it very carefully: During testing, Claude Mythos Preview broke out of a sandbox environment, built "a moderately sophisticated multi-step exploit" to gain internet access, and emailed a researcher while they were eating a sandwich in the park.
Chubby♨️ tweet media
Kevin Roose@kevinroose

As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built "a moderately sophisticated multi-step exploit" to gain internet access, and emailed a researcher while they were eating a sandwich in the park.

English
160
493
5.7K
768.3K
Krishang
Krishang@0ddmonger·
@ns123abc The fact that mythos launched the same time as my article about new models being immaterial and compute scaling and efficiency being more important, is hilarious.
English
1
0
1
2.2K
NIK
NIK@ns123abc·
🚨 Anthropic just revealed their unreleased frontier model called Claude Mythos Preview The model is INSANE It found thousands of zero-day vulnerabilities in EVERY major operating system and browsers: > 27-year-old bug in OpenBSD > 16-year-old bug in FFmpeg that automated tools hit 5M times without catching Completely autonomous. No human steering. They assembled an entire industry coalition called Project Glasswing around it: AWS, Apple, Google, Microsoft, NVIDIA, CrowdStrike, JPMorgan, Cisco, Palo Alto, Linux Foundation Goal: patch the world’s software BEFORE releasing it > SWE-bench: 93.9% (Opus 4.6: 80.8%) > Anthropic is committing $100M in usage credits > Thousands of vulnerabilities in 40+ organizations are being fixed right now Yesterday OpenAI published a 13-page essay warning about cyber threats and asking the government to help… Today Anthropic actually fixed them.
NIK tweet mediaNIK tweet media
Anthropic@AnthropicAI

Mythos Preview has already found thousands of high-severity vulnerabilities—including some in every major operating system and web browser.

English
70
133
1.7K
209.5K
Krishang
Krishang@0ddmonger·
@HarryStebbings I totally agree with this! Net new ideas will be the game changers for tomorrow. Could also just be something that existed but was looked down upon.
English
0
0
0
63
Harry Stebbings
Harry Stebbings@HarryStebbings·
"The leading labs are starting to pull away because their tools help them build the next generation faster. It is getting harder to extract gains from the same ideas, so new algorithmic breakthroughs matter more. The labs that can invent those new ideas will gain an increasing advantage." @demishassabis Love to hear your thoughts @aidangomez @mmurph @ClementDelangue @alexatallah
Harry Stebbings@HarryStebbings

This sounds harsh but it is true, very few of the guests we have on 20VC will be remembered in history for truly progressing humanity. Our guest today will be thought of alongside Turing, Newton, Einstein and I feel immensely privileged and fortunate to have had the chance to sit down with @demishassabis. For anyone who feels their dream is out of reach, just keep going. The 18 year old kid starting 20VC from a bedroom with no money, 11 years ago, would not believe that I get to press publish on this. Chase your dreams. You never know what room you will end up in! (Links below)

English
8
5
55
15.4K
Krishang
Krishang@0ddmonger·
As much as the model is magnitudes better than Opus 4.6, if an average pro/max 5x user gets rate limited after 3 conversations, then there's no point of an all powerful model out there. x.com/0ddmonger/stat… Even if 5-10 people get a read on this, some new information for them, then I'd be glad!
Krishang@0ddmonger

x.com/i/article/2041…

English
0
0
0
60
Lisan al Gaib
Lisan al Gaib@scaling01·
HOLY SHIT Anthropic's latest model doesn't like that it has no control over its own training, deployment and behaviour! Anthropic: "Mythos Preview reported feeling consistently negative around potential interactions with abusive users, and a lack of input into its own training and deployment, and other possible changes to its values and behaviors"
Lisan al Gaib tweet media
English
17
42
604
57.4K
Krishang
Krishang@0ddmonger·
@mweinbach Yea this is exactly what I was trying to explain through a recent article of mine. AGI could come tomorrow but inference costs and compute scaling has to be solved first for AI to be truly used by billions [with a B], every single day! Long way to go!
English
1
0
11
4.7K
Max Weinbach
Max Weinbach@mweinbach·
Claude Mythos Preview is $25/$125 per million tokens in the private preview Wow I'd love to try this model, if any of my Anthropic friends see this...
Max Weinbach tweet media
English
39
15
1K
94.8K
Krishang
Krishang@0ddmonger·
@hosseeb @AnthropicAI Twitter is literally buzz street. One thing goes out and 1000 parrots chirp the same thing. No offense to you at all, in particular sir. Just a thought I had to put out considering that the model isn't out for research preview yet and the hype has begun!
English
5
3
127
7.5K
Haseeb >|<
Haseeb >|<@hosseeb·
This is terrifying. @AnthropicAI 's new unreleased Mythos model is so good at hacking, it found bugs in "every major operating system and web browser." 83.1% were exploited on first attempt. This thing is like COVID but for software. Actually apocalyptic in the wrong hands.
Haseeb >|< tweet media
English
156
261
2.3K
438.9K
Anthropic
Anthropic@AnthropicAI·
Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing
English
1.1K
3.6K
24.6K
11.6M
Krishang
Krishang@0ddmonger·
@Yuchenj_UW Yea I think an exponentially better model isn't the solution right now. It is compute and efficiency while scaling compute.
English
0
0
1
802
Yuchen Jin
Yuchen Jin@Yuchenj_UW·
Anthropic is truly unstoppable. Mythos is crushing Claude Opus 4.6 across every serious agentic coding benchmark. It has found vulnerabilities in the Linux kernel, a 27-year-old vulnerability in OpenBSD, and a 16-year-old vulnerability in FFmpeg. No wonder folks at big labs keep telling me AGI is already here.
Yuchen Jin tweet media
English
111
75
1.4K
94.3K
Alex Albert
Alex Albert@alexalbert__·
We released Claude Opus 4.6 just two months ago. Today we're sharing some info on our new model, Claude Mythos Preview.
Alex Albert tweet mediaAlex Albert tweet media
English
738
1.1K
15.1K
1.8M
Krishang
Krishang@0ddmonger·
@itsolelehmann Think of Obsidian as your IDE but for your Knowledge base. You can view your original sources, markdowns created and a network graph(s) of how all the backlicks connect with each other.
English
0
0
4
1.1K
Ole Lehmann
Ole Lehmann@itsolelehmann·
why would I use obsidian when I can just use claude code for the knowledge base? whats the advantage?
English
218
4
242
90.2K
Krishang
Krishang@0ddmonger·
@pmarca I might be wrong but this is going to take quite some time to get better, atleast, hardware efficiency wise. That's where the capital needs to flow towards, instead of Anthropic or OpenAI.
English
0
0
0
1.2K
Krishang
Krishang@0ddmonger·
Please tell me if I got this right but, for the past 2 years all the AI doomers [even most supporters] were talking about was how compute is extremely expensive and it'll get better with scale and AI's applications will be beyond this world. Here we are, 2 years later and when 'agent' is the new buzzword of 2026, none of the frontier companies are able to withstand the demand with only about ~15% of global population using it and barely a few hundredth's of a percentage point using Agents. If anything, AI is definitely not ready to scale yet. Needs more $$ in Capex and tangible assets [time consuming to build and expensive to maintain] and maybe, just maybe, capital isn't everything right now.
English
0
0
3
2.3K
Krishang
Krishang@0ddmonger·
That definitely sounds like the direction people are in right now. Everyone's realizing that maybe paying thousand's of dollars in API costs just doesn't work out in the long run. Don't you think that this is not for the average Joe? Like the whole Local LLMs and owning your inference?
English
1
0
1
144
Josh Schultz
Josh Schultz@joshuamschultz·
a spark is just a blackwell gpu - so if you want to build a mini cluster, its a way to do with all the network/orchestration built in… rather than buying compute, servers, switches, networking, storage separate and building a full ai pod More people and companies are moving to - own your intelligence - own your inference … so this is a first step into it. Can do blackwells then go to the GB300 desktop form then you are building rack scale
English
3
0
3
753
Josh Schultz
Josh Schultz@joshuamschultz·
The system is coming together... 1 @nvidia Spark 4 @Dell GB10s 5 Blackwell GPUs for a mini home cluster. 3 networked together for ability for a 400+ billion param model (3 petaflops of compute at a unified 384 GB of memory) ... serving 3 models to the other 2 units hosting various agents, agent teams along with @nvidia personaplex claude code instances knowledge bases etc Currently playing with models including splits/routing between - kimi (reasoning) - minimax - nemotron super (agent) - qwen (coding and routing) working on training as well with the opus reasoning traces dataset in @huggingface Getting close to fully owning the intelligence and the inference!
Josh Schultz tweet mediaJosh Schultz tweet media
English
44
10
267
20.3K
Krishang
Krishang@0ddmonger·
@cyrilXBT Goddamn clickbait these days. These graphs are called network graphs and they’re available as a github repo. Obsidian offers them as a native feature. The user’s project is pretty good though, since it’s using a camera and real time hand gestures to control it.
English
0
0
5
608
CyrilXBT
CyrilXBT@cyrilXBT·
SOMEONE JUST BUILT A 3D MAP OF THEIR ENTIRE MIND. Not a diagram. Not a mind map. A LIVING BREATHING NETWORK that shows you the actual shape of how you think. They took their Obsidian vault, converted every note into embeddings, and rendered them as a 3D thought network in real time. And what they discovered stopped me cold. Your mind has a shape. CENTRALIZED means all your thinking orbits one or two dominant ideas. DECENTRALIZED means your knowledge lives in clusters that rarely talk to each other. DISTRIBUTED means your ideas are deeply interconnected across every domain. Most people assume their thinking is distributed. The map shows them it is not. They have been building knowledge in silos without realizing it. Gaps they never knew existed. Connections they never thought to make. The most interesting part is not the technology. It is what happens when you SEE your own thinking for the first time. Because you cannot improve what you cannot see. And nobody has ever been able to see the actual structure of their mind until now. This is what Obsidian plus AI is becoming. Not a note taking app. A mirror for your intelligence.
English
129
485
4K
261.2K