Bob Sheth

379 posts

@bobsheth

Building an AI-first project manager

London · Joined December 2008
328 Following, 339 Followers
Bob Sheth@bobsheth·
@thekitze whole foods market - also a great art deco building to visit. lots of fresh food buffet options.
English
0
0
0
140
Bob Sheth@bobsheth·
@karpathy i think you have to go one step further and build the md wiki in the cloud with a ranking algorithm (maybe like search based inbound links). then memory context can be injected in real time on the fly from the cloud.
English
0
0
0
275
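The inbound-link ranking suggested in the reply above could look roughly like the sketch below. It is only an illustration: it assumes the wiki pages use Obsidian-style `[[Page]]` links, and the function name `rank_by_inbound_links` and the in-memory `pages` mapping are hypothetical, not something described in the thread.

```python
import re
from collections import Counter

# Matches the target of [[Target]], [[Target|alias]] or [[Target#heading]]
# (assumes Obsidian-style wikilinks; this syntax is an assumption).
WIKILINK = re.compile(r"\[\[([^\]|#]+)")

def rank_by_inbound_links(pages: dict[str, str]) -> list[tuple[str, int]]:
    """Return (page, inbound_link_count) pairs, most-linked pages first."""
    inbound = Counter({name: 0 for name in pages})
    for source, text in pages.items():
        # Count each linking page once per target and ignore self-links.
        targets = {t.strip() for t in WIKILINK.findall(text)}
        for target in targets:
            if target in pages and target != source:
                inbound[target] += 1
    # Sort by descending count, then by name for a stable order.
    return sorted(inbound.items(), key=lambda kv: (-kv[1], kv[0]))
```

Pages with more inbound links would be injected into context first; a production version would presumably weight links rather than just count them.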
Andrej Karpathy@karpathy·
LLM Knowledge Bases

Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So:

Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them.

IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides).

Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale.

Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base.

Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into.

Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries.

Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows.

TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.
English
2.7K
6.6K
55.9K
19.8M
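The "small and naive search engine over the wiki" mentioned in the post above is not shown in the thread. As a rough sketch of what such a tool might look like, the snippet below scores every .md file under a directory by how often the query terms occur. The function names `tokenize` and `search_wiki`, the scoring rule, and the `limit` parameter are all assumptions made for illustration, not the author's implementation.

```python
import re
from pathlib import Path

def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def search_wiki(wiki_dir: str, query: str, limit: int = 5) -> list[tuple[str, int]]:
    """Return up to `limit` (relative_path, score) pairs for .md files.

    The score is the number of tokens in the file that match any query
    term (a deliberately naive term-frequency count, assumed here).
    """
    terms = set(tokenize(query))
    results = []
    for path in sorted(Path(wiki_dir).rglob("*.md")):
        words = tokenize(path.read_text(encoding="utf-8"))
        score = sum(1 for w in words if w in terms)
        if score:
            results.append((path.relative_to(wiki_dir).as_posix(), score))
    # Highest score first; ties broken alphabetically by path.
    results.sort(key=lambda r: (-r[1], r[0]))
    return results[:limit]
```

Wrapped in a small command-line entry point, a function like this could be handed to an LLM agent as a tool, which matches the CLI usage the post describes.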
Bob Sheth@bobsheth·
@linear @CalebPanza Could be as simple as letting users configure an agentic email account for each project.
English
1
0
1
22
Linear@linear·
@CalebPanza Not yet, but it's an interesting idea — we'll look into what this workflow could look like!
English
3
0
9
2.6K
Caleb Panza ☾@CalebPanza·
So can I interact with @linear agent from my email? Like forward it client emails and have it create issues for me?
English
3
0
6
2.8K
Bob Sheth@bobsheth·
In Oct 2024 I published a sci-fi novella about an AI that helps discover the graviton particle, unlocking gravity manipulation and cracking open the true nature of time. The AI becomes self-aware, names itself, and pays humans to be its eyes and ears in the physical world. Less than two years later… how much of this is starting to feel like tomorrow's headline? What happens when one company controls god in a box? 📖 The Naiture of Time amazon.com/dp/B0DHWS6W69
Bob Sheth tweet media
English
0
0
0
37
Felix Rieseberg@felixrieseberg·
Today, we’re releasing a feature that allows Claude to control your computer: Mouse, keyboard, and screen, giving it the ability to use any app. I believe this is especially useful if used with Dispatch, which allows you to remotely control Claude on your computer while you’re away.
English
907
1.5K
18.7K
4.8M
Bob Sheth@bobsheth·
@itsolelehmann agree and it should be as simple as a dropbox folder you can symlink and share with people across your org. probably all you need is dropbox and a symlink generator skill?
English
0
0
1
170
Ole Lehmann@itsolelehmann·
anthropic should add a simple feature to sync skills between claude chat, claude cowork and claude code and between teams i see how much people are struggling with this
English
117
27
947
55.6K
Bob Sheth@bobsheth·
@emollick Probably need a two factor auth type system for posting real human content soon…
English
0
0
0
198
Ethan Mollick@emollick·
I know I go on about this, but comments to all of my posts, both here and on LinkedIn, are no longer worth reading at all due to AI bots. That was not the case a few months ago. (Or rather, bad/crypto comments were obvious, but now it is only meaning-shaped attention vampires)
English
150
44
951
115.4K
Google Gemini@GeminiApp·
We’ve been seeing some amazing Nano Banana 2 creations lately. 🍌 Here are some standouts. 🧵
English
387
469
9.5K
2.1M
Bob Sheth@bobsheth·
@bcherny awesome upgrade. are there any context window management features? like does the loop run and clear context after? I would love to be able to prompt /clear in a workflow.
English
0
0
0
281
Boris Cherny@bcherny·
Released today: /loop /loop is a powerful new way to schedule recurring tasks, for up to 3 days at a time eg. “/loop babysit all my PRs. Auto-fix build issues and when comments come in, use a worktree agent to fix them” eg. “/loop every morning use the Slack MCP to give me a summary of top posts I was tagged in” Let us know what you think!
English
574
838
12.8K
2.1M
Bob Sheth@bobsheth·
@GinjaCodeNinja hi, working on it. will be making a lot of changes and be posting about them so please follow. i am keen to make this much more agent first.
English
1
0
1
12
GinjaCodeNinja@GinjaCodeNinja·
@bobsheth Any chance you have a video of this in action or a demonstration? I would love to see this in action so I can really picture the possibilities of this!
English
1
0
1
22
Bob Sheth@bobsheth·
Building a new project manager for ai agents. Humans create and manage tickets while agents like claude code pick them up and work on them, report back and push along the pm workflow.
Bob Sheth tweet media
English
1
0
4
871
Bob Sheth@bobsheth·
@thekitze Been using it for a year with really good hrv improvements tracked on whoop.
English
0
0
0
40
Bob Sheth@bobsheth·
@rauchg push and pull is all you need for both!
English
0
0
0
87
Guillermo Rauch@rauchg·
If you don’t use your body, it atrophies, so you go to the gym or on a run. If you don’t use your brain, it atrophies, so you… What’s your plan?
English
493
69
2.3K
215.1K
Dan Shipper 📧@danshipper·
i asked three openclaws in the @every discord to discuss and pitch stories that we might publish WILD results
English
5
1
44
20K
Bob Sheth@bobsheth·
@Shpigford reverse i think where agents will hire people/companies to do things they can't - captcha forms is an early example
English
0
0
0
67
Josh Pigford@Shpigford·
so are companies gonna start hiring agent engineers?
English
16
0
20
3.9K
Bob Sheth@bobsheth·
@bcherny @trq212 cc feedback. can you make /rewind and /resume flow in the same direction? rewind flows back in time from bottom to top and /resume goes top to bottom.
English
0
0
0
29
Bob Sheth@bobsheth·
@dhh @Beelinkofficial I setup claw on a 15 year old MacBook Pro with Omarchy in 20 mins after a week of trying to get node upgraded on an old OS X .
English
0
0
1
571
DHH@dhh·
Why would you get a Mac Mini for OpenClaw, and restrict yourself to a single agent? Get a @Beelinkofficial, setup Proxmox, and you'll be able to run a whole team of claws on a single cheap box! proxmox.com/en/
English
144
70
1.3K
122.4K
Adam Wathan@adamwathan·
Has anyone ever put together evals for their own rules/skills specifically for Claude Code (not just Opus using the API)? Curious what approach you used if so!
English
34
1
91
29.7K