AthenaDecisions (@AthenaDecisions) - Twitterプロフィール

AthenaDecisions@AthenaDecisions·15 Nis

@AndrewYNg Why would you do the planning with an LLM instead of a real planning algorithm? The LLM can certainly help formalize the intention of the user based on their input but why ask it to do every part of the process? It’s a weird constraint. Symbolic AI does better.

English

114

Andrew Ng@AndrewYNg·14 Nis

Planning is a key agentic AI design pattern in which we use a large language model (LLM) to autonomously decide on what sequence of steps to execute to accomplish a larger task. For example, if we ask an agent to do online research on a given topic, we might use an LLM to break down the objective into smaller subtasks, such as researching specific subtopics, synthesizing findings, and compiling a report. Many people had a “ChatGPT moment” shortly after ChatGPT was released, when they played with it and were surprised that it significantly exceeded their expectation of what AI can do. If you have not yet had a similar “AI Agentic moment,” I hope you will soon. I had one several months ago, when I presented a live demo of a research agent I had implemented that had access to various online search tools. I had tested this agent multiple times privately, during which it consistently used a web search tool to gather information and wrote up a summary. During the live demo, though, the web search API unexpectedly returned with a rate limiting error. I thought my demo was about to fail publicly, and I dreaded what was to come next. To my surprise, the agent pivoted deftly to a Wikipedia search tool — which I had forgotten I’d given it — and completed the task using Wikipedia instead of web search. This was an AI Agentic moment of surprise for me. I think many people who haven’t experienced such a moment yet will do so in the coming months. It’s a beautiful thing when you see an agent autonomously decide to do things in ways that you had not anticipated, and succeed as a result! Many tasks can’t be done in a single step or with a single tool invocation, but an agent can decide what steps to take. For example, to simplify an example from the HuggingGPT paper (cited below), if you want an agent to consider a picture of a boy and draw a picture of a girl in the same pose, the task might be decomposed into two distinct steps: (i) detect the pose in the picture of the boy and (ii) render a picture of a girl in the detected pose. An LLM might be fine-tuned or prompted (with few-shot prompting) to specify a plan by outputting a string like "{tool: pose-detection, input: image.jpg, output: temp1 } {tool: pose-to-image, input: temp1, output: final.jpg}". This structured output, which specifies two steps to take, then triggers software to invoke a pose detection tool followed by a pose-to-image tool to complete the task. (This example is for illustrative purposes only; HuggingGPT uses a different format.) Admittedly, many agentic workflows do not need planning. For example, you might have an agent reflect on, and improve, its output a fixed number of times. In this case, the sequence of steps the agent takes is fixed and deterministic. But for complex tasks in which you aren’t able to specify a decomposition of the task into a set of steps ahead of time, Planning allows the agent to decide dynamically what steps to take. On one hand, Planning is a very powerful capability; on the other, it leads to less predictable results. In my experience, while I can get the agentic design patterns of Reflection and Tool use to work reliably and improve my applications’ performance, Planning is a less mature technology, and I find it hard to predict in advance what it will do. But the field continues to evolve rapidly, and I'm confident that Planning abilities will improve quickly. If you’re interested in learning more about Planning with LLMs, I recommend: - Chain-of-Thought Prompting Elicits Reasoning in Large Language Models, Wei et al. (2022) - HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face, Shen et al. (2023) - Understanding the planning of LLM agents: A survey, by Huang et al. (2024) [Original text: deeplearning.ai/the-batch/issu… ]

English

444

2.4K

390.4K

AthenaDecisions@AthenaDecisions·20 Şub

This is a great survey of combined AI technologies to solve real problems. External decision engines plus LLMs is another promising approach!

Matei Zaharia@matei_zaharia

Interesting trend in AI: the best results are increasingly obtained by compound systems, not monolithic models. AlphaCode, ChatGPT+, Gemini are examples. In this post, we discuss why this is and emerging research on designing & optimizing such systems. bair.berkeley.edu/blog/2024/02/1…

English

AthenaDecisions@AthenaDecisions·12 Şub

@langchain Excellent news!!

English

LangChain@LangChain·12 Şub

🤖BCG X Releases AgentKit, a Full-Stack Starter Kit for Building Constrained Agents AgentKit is a LangChain-based starter kit to build constrained agents, developed by our partners at BCG X We've often found that in order to productionalize agentic applications, you need to develop them in a fairly constrained way. This will help with that! This is a full stack application, built on NextJS, FastAPI, and LangChain BCG X has already used this to develop: 🧪Generating drafts of complex clinical documents, such as clinical trial protocols, for a global pharma company 🚢Controlling and orchestrating supply chain optimization systems using a helpful agent assistant 🚗Developing a chatbot that helps a major automotive player service its customers Check out the code here: github.com/BCG-X-Official… Read the full blog here: blog.langchain.dev/bcg-x-releases…

English

110

475

74.9K

AthenaDecisions@AthenaDecisions·31 Oca

Athena Decision Systems Is Born! athenadecisions.com/f/athena-decis… via @athenadecisions

English

AthenaDecisions@AthenaDecisions·26 Oca

Decisions run the world. Making the right ones every day, hour, minute, second, microsecond is hard. AI can help - if it’s used wisely🦉!

English

AthenaDecisions

ディスカバー