anotherjesse
6.5K posts

anotherjesse
@anotherjesse
Parent, Permaculture, Ukulist, ex-@replicate, ex-@planet, @OpenStack, UserScripts. I love the ocean, browsers, clouds & unicorns
Berkeley, CA Katılım Aralık 2006
1.5K Takip Edilen3.4K Takipçiler
anotherjesse retweetledi

@sawyerhood Making sloplings is interesting
The way it allows experimentation is quite fun
English

Inspired seeing @graycrawford show some of his work from last decade to today.
Exploring latent spaces non-verbally
Deserves a full talk :)
English

Hosting an @EvenRealities x @hwnation dinner in SF this Friday (3/20)
Small group of hardware founders, engineers, and operators.
A few spots left. Who wants in?
English

Woot!
Web Bluetooth means we don't even need a local app - just connect to the glasses via a webpage
@EvenRealities G2 glasses FTW

anotherjesse@anotherjesse
Where we're going we don't need apps! (progress on connecting/showing content on Even Reality G2 Glasses via BLE from macos / linux directly)
English
anotherjesse retweetledi
anotherjesse retweetledi
anotherjesse retweetledi

@HamelHusain asking it to rank them / ability to file unrelated bugs (that go into a human review before uploading) -- anything less than medium / considered "next phase" gets skipped.
I would say 90% of the things codex xhigh tags as medium+ are actually things I want addrssed
English

@HamelHusain I've had codex 5.4 xhigh for reviewer, high for coder do pretty good at this. also long as there is good "verification" loops that describe intended behavior
English

One thing that makes me feel that code factory has not arrived yet is the following experiment:
1.Ask a LLM to do an in-depth rigorous review of your code
2. In a new thread, as same/different LLM to consider those review comments independently and address issues it agrees with
3. Keep repeating until no new concerns
I find that this loop always goes on for a ridiculously long time, which means that there is a problem with the notion of claude-take-the-wheel. This seems to happen no matter the harness or the specificity of the specs.
It works fine for simple applications, but in the limit if the LLMs have this much cognitive dissonance you cannot trust it.
Either this, or LLM are RLHFd to always find some kind of issue.
English
anotherjesse retweetledi

This but instead of time wolfram’s rule 30 pattern
Science girl@sciencegirl
The fountain that shows date and time. Czech Republic.
English

@dexhorthy does it count if they prep'd building context using tools based on analyzing previous conversations?
English



















