Sergey @ science

831 posts

Sergey @ science

Sergey @ science

@sergey_science

Software engineer and founder with 25+ years of shipping systems end-to-end. Molecular biology, genetics, LLM Pipelines, Agents.

Finland انضم Şubat 2023
94 يتبع55 المتابعون
Chris Ramsay
Chris Ramsay@chrisramsay52·
Me when I meet a grey. 👽 This makes more sense if you’ve watched today’s video. But it works either way. Credit: Highstrangeness115 on ig
Chris Ramsay tweet media
English
201
23
766
17.1K
Beatriz Villarroel
Beatriz Villarroel@DrBeaVillarroel·
@Zigmanfreud Are you using AI, Mr Ziegler? Do you believe in the existence of artificial intelligence?
English
2
2
34
391
John Ziegler
John Ziegler@Zigmanfreud·
Let me save everyone a lot of time on the UFO/Space aliens issue… It is mathematically/logically impossible for us to have been visited by outside life because we have investigated all the planets close enough for them to have even theoretically left for Earth AFTER we existed!
Rapid Response 47@RapidResponse47

.@POTUS: I recently directed @SecWar to begin releasing government files relating to UFOs and unexplained aerial phenomena. I am pleased to report this process is well underway. We've found many very interesting documents — and the first releases will begin very, very soon. 👽

English
978
20
196
161.4K
Sergey @ science
Sergey @ science@sergey_science·
I wonder why the feedback is so contradictory and inconsistent. Personally, I've experienced both very good thinking of 4.7 even at low effort, and quite bizarre decisions of 4.7 in High effort mode. Feels like there is dependency on time of the day even -maybe inference is controlled by the total load on the system and makes the model dumber "adaptively".
English
0
0
1
79
Rylan Schaeffer
Rylan Schaeffer@RylanSchaeffer·
After 2 days of using Opus 4.7, I can say that this model feels like a step sideways, if not backwards. I've seen it: - regress in its ability to write scientific papers - hallucinate repeatedly - fail to draw basic inferences about what I want It feels different, not better
Jeremy Howard@jeremyphoward

Wow I can already say after just 5 hours using @AnthropicAI Opus 4.7 that this is the first model that "gets" what I'm doing when I'm working. It feels aligned with me in a way no previous model did. (4.6 actively worked against me. I hated it. So this is *very* exciting!)

English
22
11
285
34.4K
Bobby Cooke
Bobby Cooke@0xBoku·
Got cyber approved for @claudeai doesn’t matter, still just shoots down my requests. How lame of @AnthropicAI to shoot down the adversary simulation side of security. Opus 4.6 works but it’s becoming trash and broke several projects this week. Without offensive security, cybersecurity would still be in the Stone Age
English
14
10
154
14.2K
Sergey @ science
Sergey @ science@sergey_science·
@SethSHowes How many read depths did you achieve? but yes, retarded take and written by AI post. If you're not in bioinformatics, chances are, you did it wrong and came to convincing but wrong conclusions.
English
0
0
1
77
Seth Howes
Seth Howes@SethSHowes·
I’ve wanted to do this for a decade. But I never did - I refuse to give any company my DNA. It is me. So this week I sequenced my genome entirely at home. Literally on my kitchen table. I never exposed my DNA sequence to the internet. Not at any point. I used a MinION to do the sequencing (it’s smaller + weighs less than an iPhone). I used open-source DNA models for the analysis (Evo2 and AlphaGenome) running locally on a DGX Spark and Mac Studio. I traced mechanisms behind my family’s multigenerational autoimmune conditions that no clinician has been able to understand. When I set out to do this I didn’t know if it would actually work. It does. Your genome is the most private data you will ever have. You probably shouldn’t let it leave your house.
Seth Howes tweet mediaSeth Howes tweet mediaSeth Howes tweet media
Patrick Collison@patrickc

I'm lucky enough to have a great doctor and access to excellent Bay Area medical care. I've taken lots of standard screening tests over the years and have tried lots of "health tech" devices and tools. With all this said, by far the most useful preventative medical advice that I've ever received has come from unleashing coding agents on my genome, having them investigate my specific mutations, and having them recommend specific follow-on tests and treatments. Population averages are population averages, but we ourselves are not averages. For example, it turns out that I probably have a 30x(!) higher-than-average predisposition to melanoma. Fortunately, there are both specific supplements that help counteract the particular mutations I have, and of course I can significantly dial up my screening frequency. So, this is very useful to know. I don't know exactly how much the analysis cost, but probably less than $100. Sequencing my genome cost a few hundred dollars. (One often sees papers and articles claiming that models aren't very good at medical reasoning. These analyses are usually based on employing several-year-old models, which is a kind of ludicrous malpractice. It is true that you still have to carefully monitor the agents' reasoning, and they do on occasion jump to conclusions or skip steps, requiring some nudging and re-steering. But, overall, they are almost literally infinitely better for this kind of work than what one can otherwise obtain today.) There are still lots of questions about how this will diffuse and get adopted, but it seems very clear that medical practice is about to improve enormously. Exciting times!

English
405
1K
12.7K
2.4M
johann win
johann win@waejay_·
@sergey_science @jesse_vermeulen i can’t imagine doing my manager’s job of context switching across 8 direct reports and 6 projects, so makes sense why eventually 5 different agents/sessions is hard ¯\_(ツ)_/¯ maybe it’s a learned skill though
English
1
0
0
16
Jesse
Jesse@jesse_vermeulen·
honest question: what do people do during the 5-10 min while Claude is running?
English
2.2K
77
3.1K
636.9K
Sergey @ science
Sergey @ science@sergey_science·
I don't understand why you even started it. 1. Unnecessary costs for GH 2. Unnecessary tokens burned/electricity/fossil fuel 3. Unnecessary attention grab from all here Is it supposed to be fun or what? Sounds like irresponsible thing to do. If you have extra tokens that you don't know where to put, I will gladly accept them to build a genetic tool, for example.
English
1
0
1
192
Josh Cohenzadeh
Josh Cohenzadeh@jshchnz·
With my codemaxxed project surpassing 353,000,000 lines of code (not a typo) I actually got a @Github cease & desist 🪦 "We've noticed that the repository is growing fast while committing very frequently. This looks like some sort of automated activity that serves no purpose."
Josh Cohenzadeh tweet media
English
146
85
2.6K
229.2K
johann win
johann win@waejay_·
when you drift away to do something else, it leads to more context switch, which leads to more context switch, which makes it harder to stay in deep work mode, so i just started intentionally doing absolutely nothing. it gives my brain time to think alongside claude
English
1
0
3
910
Sergey @ science
Sergey @ science@sergey_science·
Felix, I’m worried you got an ill culture of daily shipping. People are reporting lots of issues. If I were a product manager in your team, I would pull emergency stop lever and focus on bug-free releases. With such high pace you are heading towards low reputation. I’m honestly concerned.
English
0
0
0
180
Felix Rieseberg
Felix Rieseberg@felixrieseberg·
We ship new little improvements every single day, but this one was requested so much that I'm tweeting about it: Skip all permissions for Claude Cowork. Use with care, brought to you by @dreamofabear
English
86
59
1.4K
153.2K
🪓 CyberAxe (Jeremy Benisek) 🪓
@ClaudeDevs Assholes. You disgust us. I hope you get sued, you abuse your customers then call it a bug. You full well know you're doing this on purpose. You lie more than Claude does.
English
1
0
5
277
ClaudeDevs
ClaudeDevs@ClaudeDevs·
We fixed a bug where rate limits on Claude subscriptions weren't properly adjusted for long context requests in Opus 4.7. We've reset 5-hour and weekly rate limits. Enjoy Opus 4.7!
English
689
898
19K
1.9M
Sergey @ science
Sergey @ science@sergey_science·
@caleb_kinmon @bcherny @firstadopter I don’t have that poor experience. Used 4.7 at low effort setting and shipped three changes in my work. Honestly, I saw no regression on 4.7 so far. But I did see them in 4.6 and 4.5.
English
1
0
0
26
tae kim
tae kim@firstadopter·
Anthropic running out of compute is hurting their brand among customers. Honestly? We're paying customers. We deserve the service (and reliability uptime!) we paid for.
tae kim tweet media
English
21
8
241
68.4K
Sergey @ science
Sergey @ science@sergey_science·
@RayFernando1337 You probably need to go back to basics: threaten it, say you have no hands and blind, and promise billions in rewards if it thinks hard :)
English
0
0
0
44
Ray Fernando
Ray Fernando@RayFernando1337·
Wait, what happened to the Extended Thinking toggle on Opus 4.7? Opened Claude this morning and the toggle I use every day is gone. It's now "Adaptive thinking, thinks only when needed." Dug into the docs and on 4.7, adaptive is the only mode. The model decides per-request if it wants to think or not. Where is the way to force it on? What does this mean for a Max user like me who lives on the phone and web (not Claude Code or the API)? On 4.6, Extended Thinking on meant every answer got the deep reasoning. I pay $200/month for Max and I kept it on for my workflows, projects, etc. On 4.7, every request kind of feels like a slot machine. Did the model think about this one? Is my request worthy enough of more thinking? What if I tell it to think harder...ultrathink...mega ultra uber giga think?? I don't know. Pull the lever and just...hope? When it does more thinking with 4.7 it is a nice experience and I love what the team built. Just wondering out loud if there's a way for $200 Max user to force thinking on every request. Happy to pay for it. Anyone else notice this?
Ray Fernando tweet media
English
101
24
419
103.9K
Nick Walsh
Nick Walsh@nwalsh0221·
@trq212 @ClaudeDevs Any chance you can help me with my subscription? I got charged but haven’t received it (submitted a ticket abt this too)
English
2
0
1
420
Sergey @ science
Sergey @ science@sergey_science·
This can't be right. You can see on X people have been complaining ("Opus got nerfed"). I noticed intelligence of 4.6 has degraded to the level of Sonnet (I used Windsurf IDE with explicit Opus 4.6), so had to switch to Codex 5.3, then got back to Opus 4.5 which worked better than 4.6.
English
0
0
0
49
Dave Kennedy
Dave Kennedy@HackingDave·
Think about all the orgs using Claude right now that have no idea how bad it has become over the past 4 weeks ago. No statement from Claude - but a total revert to where the model was a year ago - which in comparison to when 4.6 got released is effectively last years AI model. The amount of bugs, security issues, and complete destruction of production applications is going to be felt for quite a long time due to this. Claude: nothing to see here.
English
92
24
539
106.8K
Yiannis Panagidis
Yiannis Panagidis@yiannis__p·
@theo I feel like no matter how much better it might be than 5.4 I don’t really care cause Anthropic is too toxic to work with. Limits and costs aside, forcing me to use the worst harness destroys any potential model gains over 5.4
English
1
0
0
1.1K
Theo - t3.gg
Theo - t3.gg@theo·
How are people feeling about opus 4.7 so far?
English
790
14
1.7K
381.9K
peermux
peermux@peermux·
@theo Nobody cares about Opus anymore. Anthropic should give everyone equal access to Mythos. It goes directly against their own mission to only give corps/orgs access to it. We need fair and equal access.
English
5
0
6
2K