Ben Freed

1.5K posts

@codevibesmatter

co-founder and vibe cto of https://t.co/BjLsYC58FE, vertical AI for construction. Claude code maximalist.

new york · Joined February 2025

465 Following · 151 Followers

Pinned Tweet

Ben Freed@codevibesmatter·15 Mar

hi frens! I published my Claude Code workflow system as a package called Kata! github.com/codevibesmatte… This system integrates fully with CC's native task system, no beads (though it's an amazing project) or other external task managers needed. Also, it relies heavily on hooks and has a pretty ingenious stop hook mechanism that is fully flexible and changes based on what mode you're in. The number and type of modes and individual mode instructions are fully customizable, and the package has examples and a good set of starter modes. I'll post more when I get a chance, but the main reason I built this is because a lot of systems felt too heavy-handed or black-boxy for my liking. Anyway check it out and let me know what you think!
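A minimal sketch of what a mode-dependent stop hook decision could look like. This is not Kata's actual implementation. It assumes the Claude Code convention that a Stop hook can emit a JSON object with `"decision": "block"` and a `"reason"` to keep the session going; the mode names, the `MODE_RULES` table, and the `state` fields are all hypothetical.

```python
import json

# Hypothetical per-mode exit conditions; Kata's real modes and rules may differ.
MODE_RULES = {
    "planning": lambda s: s.get("spec_written", False),
    "implementation": lambda s: s.get("open_tasks", 0) == 0
    and s.get("tests_passed", False),
    "freeform": lambda s: True,
}


def decide_stop(mode: str, state: dict) -> dict:
    """Return {} to allow the stop, or a 'block' decision to prevent it."""
    rule = MODE_RULES.get(mode)
    # Unknown modes are allowed to stop so a bad config never traps the session.
    if rule is None or rule(state):
        return {}
    return {
        "decision": "block",
        "reason": f"Mode '{mode}' exit conditions not met: {json.dumps(state)}",
    }


# Usage: the same state is judged differently depending on the active mode.
state = {"open_tasks": 2, "tests_passed": True}
print(json.dumps(decide_stop("implementation", state)))
print(json.dumps(decide_stop("freeform", state)))
```

Keeping the rules in a plain table is one way to make modes customizable: adding a mode means adding one entry, not editing the hook logic.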

Ben Freed@codevibesmatter·13h

@Trace_Cohen Flushing

Trace Cohen@Trace_Cohen·15h

What’s your favorite NYC food secret or cheap eat!?

Trace Cohen@Trace_Cohen

NYC is expensive, eating doesn't have to be
I built a free directory of 170+ deals across the city
> $1 oysters, dollar slices, hidden prix fixe lunches at Michelin spots, late night eats etc
> filters for price, borough, open right now, and today only
Comment "FOOD" for link

Ben Freed@codevibesmatter·13h

@sinasanm They didn't run it on their cursor bench

Sina Meraji@sinasanm·17h

is composer 2.5 10% or 10x better than kimi k2.6?

Ben Freed@codevibesmatter·17h

New Gemini pro app is so bad. I had it create a cheat sheet in Google Docs with my full upcoming travel itinerary. Then when I asked it to edit it, it said that it can't create Google Docs and claimed that it had hallucinated the doc that it actually did create.

Ben Freed@codevibesmatter·21h

Ok the Theo podcast is really good

Ben Freed@codevibesmatter·1d

@robinebers I'll be trying the SDK out. Not particularly optimistic

Robin Ebers | AI Coach for Founders@robinebers·1d

@codevibesmatter 🤣 I always try them gotta give them a fair chance but this ain't it

Robin Ebers | AI Coach for Founders@robinebers·1d

Antigravity 2 is dead on arrival. Instantly uninstalling

Ben Freed@codevibesmatter·1d

@hunvreus @enjoyingthewind You find a happy middle. Spec it with a few questions answered and then get to coding fast

Ronan Berder@hunvreus·1d

Why on earth would you want to revert to Spec-Driven Development?
Yes, agents are way faster at writing code. And (some) humans are better at system thinking. But we also suck at planning.
Any experienced engineer knows you simply cannot sit down, write the specs and then write the software that matches it. At least not if you plan on writing something "good". You need to work through the problem to understand its boundaries and shape a solution that makes sense.
Just leverage the fact that writing code is cheap:
1. Prototype
2. Document learnings
3. Rewrite based on learnings
4. Document solution
5. Refactor
6. Document changes
Even if you have to repeat parts or all of this, you'll get to a good solution faster than with SDD.

Sahaj@iamsahaj_xyz

tried out /grill-me from @mattpocockuk it works. it's not fun but it works

Ben Freed@codevibesmatter·1d

@toddsaunders @conductor_build Wait multimodal and multi-model are also different

Todd Saunders@toddsaunders·1d

I met someone tonight at an AI event that told me they were bilingual. I then went on a tangent about how I've been using @conductor_build and it's made me multimodal. And how I've been using Codex and Opus together a ton. Turns out.. multimodal and bilingual are not the same.

Ben Freed@codevibesmatter·1d

WHAT

Andrej Karpathy@karpathy

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

Ben Freed@codevibesmatter·1d

@kunchenguid @leo_linsky synesthesia is the unlock!

Kun Chen@kunchenguid·2d

@leo_linsky hard to say
sometimes i feel thinking in words is really inefficient
great thinking can happen directly in the latent space - very often when we suddenly feel inspired with a great idea, it didn’t really come from a big monologue

Leo Linsky@leo_linsky·2d

Since thinking tokens led to a step-function improvement in AI reasoning, does that mean people with no internal monologues are worse at reasoning?

Ben Freed@codevibesmatter·1d

@toddsaunders Brilliant. Signals ingestion highly underrated

Todd Saunders@toddsaunders·1d

A close friend just showed me the best AI workflow I've ever seen. His vertical SaaS went from $1.2M to $5.5M ARR this year. He says this is why.
It all runs on Facebook. Facebook has no newsfeed API, and they use Cloudflare to stop scrapers. So he runs a browser harness, signed in with his own account, and sweeps four competitor groups and two industry groups every week. Every new post gets screenshotted.
- GPT 5.5 reads the post, categorizes it, and puts all of the information into a table.
- Sonnet 4.6 does the triage and the labeling. Things like competitor type and signal (complaining, feature request, churn, pricing).
- Opus 4.7 does the synthesis and the weekly cron job that fetches the post. It then finds patterns in the posts and writes briefs that go to Slack every Monday morning.
Then, every post is automatically turned into a markdown file, and is automatically acted on.
Feature request / product complaint - Opus creates that bespoke feature in real time, and takes a video recording of the final product for the product team to post.
Questions / industry insights - Opus creates an SEO-optimized blog post, publishes it, and gives the link for the marketing team to post.
His competitors' customers are writing his roadmap. His competitors' weaknesses are writing his content calendar.
It's absolutely wild.

Ben Freed@codevibesmatter·1d

@kunchenguid Yassss

Kun Chen@kunchenguid·2d

i’ve been in the front row seat of tech companies’ AI adoption and layoffs
should i make a post / video to explain what’s happening?
if so, reply and let me know what you are most interested in hearing about

Polymarket@Polymarket

NEW: Meta to begin cutting about 8,000 jobs this week as AI spending surges.

Ben Freed@codevibesmatter·1d

@businessbarista Taste issue

Alex Lieberman@businessbarista·1d

AI slop should be called Human slop. It is almost always a human skill issue, resulting from bad direction, bad context, and bad taste.

Ben Freed@codevibesmatter·2d

@kunchenguid I dunno man I've been to some where they insist on cooking for you

Kun Chen@kunchenguid·2d

you can outsource your thinking
you can even outsource understanding
you can’t outsource Korean BBQ

Ben Freed@codevibesmatter·2d

@quorralyne Cool I'll check it out!

Heather Downing@quorralyne·2d

Agreed. This is why we created Meko. If you use hooks with Claude Code, the entire:
1. conversation raw text
2. graph memory
3. flag for what is important to remember in the future
Automatically added into Meko for agents to access. mekodata.ai

Ben Freed@codevibesmatter

@trq212 Because you shouldn't have to track progress manually like this. There's tons of inherent drift. Deterministic workflows with stages, modes, gates and verifiable outputs is the way
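One way to picture "stages, gates and verifiable outputs" is a runner that only advances when a stage's output passes a check. This is a rough sketch under assumptions, not the workflow from the tweet: the `Stage` type, the toy stages, and the context keys are all made up for illustration.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Stage:
    name: str
    run: Callable[[dict], dict]   # produces outputs from the shared context
    gate: Callable[[dict], bool]  # verifies those outputs before moving on


def run_workflow(stages: list[Stage], context: dict) -> tuple[bool, list[str]]:
    """Run stages in order and stop at the first stage whose gate fails."""
    completed: list[str] = []
    for stage in stages:
        context.update(stage.run(context))
        if not stage.gate(context):
            return False, completed
        completed.append(stage.name)
    return True, completed


# Toy stages: the last gate fails, so the workflow halts before "verify" completes.
stages = [
    Stage("spec", lambda c: {"spec": "add login"}, lambda c: bool(c.get("spec"))),
    Stage("implement", lambda c: {"diff_lines": 42}, lambda c: c.get("diff_lines", 0) > 0),
    Stage("verify", lambda c: {"tests_passed": False}, lambda c: c.get("tests_passed") is True),
]
ok, done = run_workflow(stages, {})
print(ok, done)  # False ['spec', 'implement']
```

Because progress is the list of gates that passed, there is nothing to track by hand and less room for drift.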

Ben Freed@codevibesmatter·2d

@GeeksMirage @trq212 Then it would be much better to use reasoning traces for that

Vik Agarwal@GeeksMirage·2d

@codevibesmatter @trq212 You’re assuming that this is to track progress, which it is not. It is used to track historical decision flow, so that you have a log of WHY the LLM made decisions, so that you understand what happened and can improve it if needed in the future.

Thariq@trq212·2d

a prompt I've been using a lot recently: implement <SPEC> and while you do, keep a running implementation-notes.html file (or markdown) with decisions you had to make that weren't in the spec, things you had to change, tradeoffs you had to make or anything else I should know

Ben Freed@codevibesmatter·2d

@aashutosh_01 @trq212 github.com/codevibesmatte…

Aashutosh Sahni@aashutosh_01·2d

@codevibesmatter @trq212 Curious, what flow are you using to make this happen?

Ben Freed@codevibesmatter·2d

@oussemadb @trq212 It doesn't have to be perfect but it should be more than a prayer

Oussema | Where AI Meets Money & Security@oussemadb·2d

@codevibesmatter @trq212 I don't agree tbh cuz waiting for the perfect harness means waiting forever. A simple notes file you can read and act on today beats a theoretically elegant system that doesn't exist yet. Pragmatic always beats ideal in a moving field.

Ben Freed@codevibesmatter·2d

@kupolov @trq212 Definitely the latter. you need more of muh harness

Ted Kupolov@kupolov·2d

@trq212 @codevibesmatter You guys should decide: is AGI coming soon, or is the current iteration of AI just the next level of abstraction, like the move from machine code to compiled languages like C? Dario is claiming devs aren't needed anymore.

Ben Freed@codevibesmatter·2d

@southphxceleb No, it's been the way LLMs work since 3.5. It's just not the focus of the harness makers because it requires a very opinionated structure

pretty.hate.machine@southphxceleb·2d

I feel like as the models get smarter they’re making more nebulous decisions, introducing a “worklog” is an understandable iteration on this *brand new* way of building

Ben Freed@codevibesmatter

@trq212 Because you shouldn't have to track progress manually like this. There's tons of inherent drift. Deterministic workflows with stages, modes, gates and verifiable outputs is the way
