hadi javeed
118 posts

hadi javeed
@HadijPk
CTO at RevelAI Health. Tinkering with Byaan, an open-source local-first AI data agent. Previously Vincere Health (acquired)
Washington, DC شامل ہوئے Ocak 2015
496 فالونگ136 فالوورز
پن کیا گیا ٹویٹ

Hermes Agent now comes packaged with Karpathy's LLM-Wiki for creating knowledgebases and research vaults with Obsidian!
In just a short bit of time Hermes created a large body of research work from studying the web, code, and our papers to create this knowledge base around all of Nous' projects.
Just `hermes update` and type
/llm-wiki
in a new message or session to begin :)
github.com/NousResearch/h…

English

At Cognition we're seeing coding agents handling the entire SDLC, going way beyond just coding.
Here are some tips and tricks we're seeing dev teams use with agents like @devinai to handle the SDLC:
1. Scheduling daily E2E smoke tests: an automation signs up for your app, goes through onboarding, exercises core flows, and gets a pass/fail report in Slack every morning. You can even watch the screen recording or have it sent directly to you via Slack.
x.com/ryancarson/sta…
2. Auto-triaging production errors: it's easy to wire Sentry (or other) webhooks so new errors get root-caused, fixed, and shipped with a regression test before an on-call even has to look at their phone.
docs.devin.ai/api-reference/…
3. Scheduling weekly dependency updates: a scheduled session checks for outdated packages, runs your full test suite, and opens upgrade PRs grouped by patch, minor, and major bumps. Merge what's green, review what's not.
docs.devin.ai/product-guides…
4. Morning health digests: a scheduled session queries Datadog for error spikes, latency regressions, and failing monitors, then posts a severity-rated summary to Slack before standup.
5. Auto-fix on every PR: Sophisticated review agents like Devin Review catch bugs, security issues, and style violations on open PRs, then automatically push fixes directly to the branch. No back-and-forth in review comments, the agent handles the entire loop.
cognition.ai/blog/closing-t…
6. Parallelization of large migrations: for instance scope a REST-to-GraphQL or JS-to-TS migration, split it into conflict-free work packages, and run 8+ sessions in parallel.
7. Scheduling feature flag cleanups after releases: teams leave flags in place as a kill switch after new launches, then never get around to removing them. You can set a one-time session for a week after ship day and the cleanup actually happens: dead code path removed, tests updated, PR opened. (done via Scheduled Sessions)
8. Weekly changelogs: once per week, a scheduled session groups merged PRs by category (features, fixes, improvements), posts the digest to Slack + anywhere else relevant, and updates CHANGELOG.md
9. Reproducing customer-reported bugs from support tickets: paste a customer issue into Slack, tag Devin, and it attempts to reproduce the problem in the browser. You get a screen recording of the reproduction and a filed bug with exact steps-to-reproduce attached.
10. Enforcing your design system: schedule a session that scans merged PRs for hardcoded colors, missing design tokens, style violations, etc... Auto-creates tickets or kicks off sessions for anything that slipped through.
11. Auto-generating API docs from a ticket: create a docs Playbook, sync it as a Linear label, and apply it to any ticket. Devin generates documentation following your conventions and opens a PR.
12. Keeping docs in sync with code changes: schedule a daily session that reviews the previous 24 hours of merged PRs against your documentation. If an API endpoint changed, a config option was renamed, or a feature works differently now, it opens a PR to update the docs before users hit stale information.
13. Racing competing solutions against the same problem: if have a slow API endpoint you launch 3 parallel sessions, each trying a different optimization strategies (caching, query rewrite, denormalization). Compare the benchmarks and merge the winner (this can also be automated)
14. Automated visual regressions tests before every PR: add a repo skill that triggers whenever UI files change. Devin starts the app, screenshots every affected page at multiple viewports, and flags layout breakage, overflow, or missing elements (or you can have Devin autofix them)
This type of work is already partially being automated by a lot of teams, but usually by a human in the loop meaning they're taking time away from more important work to do things that don't usually provide immediate impact or business value
It's obvious that automating these repetitive tasks frees up engineering time, but to me it's also not a bad recruiting tactic - if you work here you won't be spending any of your time doing boring work.
English

this isn’t about AI replacing doctors.
It’s about access.
Patients are already using AI as their first touchpoint. Consumer-driven healthcare is here.
Health systems need to meet patients where they are through technology
Chengpeng@CPMou2022
This isn’t an edge case. From anonymized U.S. ChatGPT data, we are seeing: • ~2M weekly messages on health insurance • ~600K weekly messages from people living in “hospital deserts” (30 min drive to nearest hospital) • 7 out of 10 msgs happen outside clinic hours
English

@mdancho84 the big issue is the tribal knowledge and the semantic layer. how do you build that across an organization?
also how do you migrate years of work at an enterprise level from PowerBi, Tableau or looker
English

@DataRecce I am actually building a tool exactly what you described. let me know if you are interested to give it a try
it is all local and I will be open-sourcing it soon
English

@kirsten_lum_ codebase, db metadata and company docs should directly be integrated into AI tool. semantic layer is not that helpful, but if it can build self improving skills, compress business knowledge into skills, the tool could improve. still not close to replacing data scientists though
English

Text-to-SQL is going to have to get a lot better before this is true. Not in the way most people think though. AI writes amazing SQL, it just doesn’t understand what the data means, and it is so excruciating to document it that humans have been opting out of the task for decades
Matt Dancho (Business Science)@mdancho84
RIP BI Dashboards. Tools like Tableau and PowerBI are about to become extinct. This is what's coming (and how to prepare):
English

@kevarmstech I do think there is a room for integrating codebase, DB metadata and other documents into a BI tool. With evolving schema, the AI layer should re-index and build better skills and understanding
it won't solve the problem all the way, but can improve compared to what exists today. Skills.md can compress lot of business knowledge and they can be auto-improving skills
English

@kirsten_lum_ At Amazon we had a text-to-SQL homegrown to understand our table schema, and most of the time it would break as old tables got sunsetted for new ones. AI SQL is useless without a comprehensive understanding of how the data is stored, and imparting that on the LLM
English

My Claude Code setup right now:
export CLAUDE_CODE_NO_FLICKER=1
alias cc="~/.local/bin/claude --permission-mode auto"
Two lines. Biggest productivity unlock I’ve had in months.
What changed:
No flicker modeL feels like a real app, not terminal spam
Auto permissions: no more clicking “approve” 40 times
Just give it a task: come back to a PR
The key insight:
Manual approvals aren’t safety. They’re just friction.
Auto mode handles the risky stuff. Everything else moves.
A few quick upgrades:
Run /powerup (this is very new, to learn features)
Add a CLAUDE.md (teaches it your stack + conventions)
Create custom slash commands for repeat workflows or skills super helpful
Try the CLI for a week.
Same product… but the CLI UX just hits different.
Feels faster, locks you in, and honestly way more fun with tmux.
English

@nummanali Here is my setup and I love hadijaveed.me/2025/12/05/how…
English

@dhh wow this is cool @dhh . I created a simpler script for myself in Omarchy to do this exact thing github.com/hadijaveed/arc…
English

I had no idea that local model dictation had gotten this good and this fast! I'm blown away by how good hyprwhspr with Omarchy is just using a base model backed by the CPU. Unbelievably accurate. github.com/goodroot/hyprw…
English

Fizzy feels fast, delightful, and fun. All with a minimum of JavaScript. We have more lines of CSS than we do JS! Just 55 tiny Stimulus controllers. You just don't need much with Hotwire. github.com/basecamp/fizzy…

English

@jackfriks I recently wrote about this. Do things in parallel
hadijaveed.me/2025/08/04/ter…
English

Claude Code Made Me Fall in Love with the Terminal
Like many of you, I recently made the full switch from Cursor to Claude Code. This transition marked more than just a tool change – it fundamentally transformed how I think about development environments.
For years, I lived in VSCode (recently Cursor), relying heavily on mouse navigation and minimal keyboard shortcuts. I resisted the pull of Neovim and keyboard-centric workflows. But after embracing Claude Code, I discovered something profound: the terminal is the new IDE. You can run it everywhere with a consistent workflow – be it a Linux box, your Mac, or a VPS. That's all you need.
English

For years, I lived in VSCode (recently Cursor), relying heavily on mouse navigation and minimal keyboard shortcuts. I resisted the pull of Neovim and keyboard-centric workflows. But after embracing Claude Code, I discovered something profound: the terminal is the new IDE. You can run it everywhere with a consistent workflow – be it a Linux box, your Mac, or a VPS. That's all you need.
English






