Roger Oriol

100 posts

@rogiia

Joined January 2016
74 Following, 34 Followers
Roger Oriol retweeted
Andrej Karpathy (@karpathy)
I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then:
- the human iterates on the prompt (.md)
- the AI agent iterates on the training code (.py)

The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. github.com/karpathy/autor…

Part code, part sci-fi, and a pinch of psychosis :)
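The loop described above (propose a change, run a fixed-budget experiment, keep the change only if validation loss improves) can be sketched roughly as follows. This is an illustrative toy, not code from the repo: `run_experiment`, `propose_change`, and `research_loop` are hypothetical names, and the "training run" is a made-up loss function standing in for a real 5-minute run.

```python
import random


def run_experiment(config, rng):
    """Hypothetical stand-in for one fixed-budget training run.

    Returns a fake validation loss computed from the config plus a little noise.
    A real setup would train a model for the time budget and evaluate it.
    """
    lr_penalty = (config["lr"] - 0.003) ** 2 * 1e4
    depth_penalty = abs(config["depth"] - 8) * 0.05
    return 3.0 + lr_penalty + depth_penalty + rng.uniform(0.0, 0.01)


def propose_change(config, rng):
    """Hypothetical stand-in for the agent editing the training script."""
    new = dict(config)
    if rng.random() < 0.5:
        new["lr"] = max(1e-5, config["lr"] * rng.choice([0.5, 0.8, 1.25, 2.0]))
    else:
        new["depth"] = max(1, config["depth"] + rng.choice([-1, 1]))
    return new


def research_loop(initial, steps, seed=0):
    """Keep a candidate only when it lowers the validation loss."""
    rng = random.Random(seed)
    best = dict(initial)
    best_loss = run_experiment(best, rng)
    history = [best_loss]  # loosely analogous to accepted commits on the branch
    for _ in range(steps):
        candidate = propose_change(best, rng)
        loss = run_experiment(candidate, rng)
        if loss < best_loss:
            best, best_loss = candidate, loss
            history.append(loss)
    return best, best_loss, history


best, best_loss, history = research_loop({"lr": 0.01, "depth": 4}, steps=50)
print(best, round(best_loss, 4), len(history))
```

Because only improvements are accepted, `history` is strictly decreasing, which matches the idea of commits accumulating only when a run ends with a lower validation loss.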
Roger Oriol (@rogiia)
In Full-auto mode, Codex can read and write files and also run shell commands, all inside a sandbox confined to the current directory with network access disabled. In the future, you will be able to whitelist some shell commands to run with network access enabled.
Roger Oriol (@rogiia)
It accepts multimodal input, can sandbox your development environment to protect your computer, and supports context files: ~/.codex/instructions.md for global instructions and ./codex.md in the project root for project-specific context.
Roger Oriol (@rogiia)
github.com/openai/codex Along with the launch of o3 and o4-mini, OpenAI has released a coding assistant for the terminal: Codex. You can use it to create new projects, make changes to existing projects, or ask the model to explain code to you, all in the terminal.
Roger Oriol (@rogiia)
In the benchmarks, GPT-4.1 easily beats GPT-4.5 at a lower price and higher speed. For this reason, OpenAI has said it will deprecate GPT-4.5 in three months' time.
Roger Oriol (@rogiia)
After the unimpressive release of GPT-4.5 a month and a half ago, OpenAI is now releasing a new version with the numbering going backwards. Today, they released three new models, exclusive to the API: GPT-4.1, GPT-4.1 mini and GPT-4.1 nano. openai.com/index/gpt-4-1/
Roger Oriol (@rogiia)
This is inevitable: all of them want to win the AI race at any cost. If you don't want to be fooled by ever-so-slightly increasing benchmarks, set up your own benchmarks that measure model performance on your own use cases.
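A private benchmark does not need to be elaborate. Here is a minimal sketch, assuming each model can be wrapped as a function from prompt to text; the test cases, the stub models `always_ok` and `tiny_solver`, and the helper names are all hypothetical placeholders for your own prompts and real API calls.

```python
from typing import Callable, Dict, List, Tuple

# Each case pairs a prompt with a check on the model's output.
# Hypothetical examples: replace them with cases from your own use cases.
Case = Tuple[str, Callable[[str], bool]]

CASES: List[Case] = [
    ("What is 2 + 2? Answer with a number only.", lambda out: out.strip() == "4"),
    ("Reply with the word OK.", lambda out: out.strip().upper() == "OK"),
]


def score(model: Callable[[str], str], cases: List[Case]) -> float:
    """Fraction of cases whose check passes for this model."""
    passed = sum(1 for prompt, check in cases if check(model(prompt)))
    return passed / len(cases)


def compare(models: Dict[str, Callable[[str], str]], cases: List[Case]) -> Dict[str, float]:
    """Score every model on the same cases so results are comparable."""
    return {name: score(fn, cases) for name, fn in models.items()}


# Stub "models" so the sketch runs without any API access.
def always_ok(prompt: str) -> str:
    return "OK"


def tiny_solver(prompt: str) -> str:
    return "4" if "2 + 2" in prompt else "ok"


print(compare({"always_ok": always_ok, "tiny_solver": tiny_solver}, CASES))
```

Keeping the cases fixed and rerunning them whenever a new model ships gives a number that tracks your own workload rather than a public leaderboard.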
Roger Oriol retweeted
Jerry Liu (@jerryjliu0)
In case you missed it, we launched 3 HUGE updates this week to @llama_index to help make it the most advanced, versatile multi-agent framework.
1. Jinja-style prompts - our new RichPromptTemplate lets you build dynamic prompts instead of hacking together the prompt f-strings and trying to intersperse it with outer logic.
2. Full multimodal support for agents - pass a chat message with interleaving text and images into a multi-agent system.
3. CodeAct agent - the next step beyond chain-of-thought is learning to execute code. Either use our pre-built agent out-of-the-box or learn to build it from scratch.
Big shoutout to @LoganMarkewich and @masci for this.
If you’re looking to build agents, here’s an entire stack of resources to help you get started:
New Prompt Docs: docs.llamaindex.ai/en/latest/modu… docs.llamaindex.ai/en/latest/modu… docs.llamaindex.ai/en/latest/exam…
CodeActAgent: docs.llamaindex.ai/en/latest/exam…
CodeActAgent from scratch: docs.llamaindex.ai/en/latest/exam…
Multimodal Agents: docs.llamaindex.ai/en/stable/modu…
Roger Oriol (@rogiia)
Overall, these look like very capable frontier models that can compete with OpenAI, Anthropic and Google while at the same time being open-source, which is a huge win. Check out Meta's post on the models' architecture and benchmarks. ai.meta.com/blog/llama-4-m…
Roger Oriol (@rogiia)
Meta has finally released the Llama 4 family of models that Zuckerberg hyped up so much. They are open-source, multimodal, MoE models. First impression: these models are massive. None of them will run on an average computer with a decent GPU or on a Mac Mini. Let's see:
Roger Oriol (@rogiia)
The Anthropic team has discovered an interesting approach to LLM thinking capabilities. Instead of making the model think deeply before answering or taking an action, they experimented with giving the model a think tool: anthropic.com/engineering/cl…
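As a rough illustration of the idea, a think tool can be a tool whose only effect is to record the model's reasoning. The sketch below follows the name / description / JSON-schema shape commonly used for tool definitions; the description wording and the `ThoughtLog` helper are assumptions made for this example, not Anthropic's published code.

```python
import json

# A tool definition with a single string argument. It fetches nothing and
# changes nothing; calling it just gives the model a place to reason mid-task.
THINK_TOOL = {
    "name": "think",
    "description": (
        "Use this tool to think about something. It does not obtain new "
        "information or change any state; it only records the thought."
    ),
    "input_schema": {
        "type": "object",
        "properties": {
            "thought": {"type": "string", "description": "A thought to think about."}
        },
        "required": ["thought"],
    },
}


class ThoughtLog:
    """Hypothetical handler: stores thoughts and returns a short acknowledgement."""

    def __init__(self):
        self.thoughts = []

    def handle_tool_call(self, name, tool_input):
        if name != THINK_TOOL["name"]:
            raise ValueError(f"unknown tool: {name}")
        self.thoughts.append(tool_input["thought"])
        return "Thought recorded."


log = ThoughtLog()
print(log.handle_tool_call("think", {"thought": "Check the refund policy before acting."}))
print(json.dumps(THINK_TOOL["input_schema"]["required"]))
```

The handler has no side effects beyond the log, so the value comes entirely from letting the model pause and reason between other tool calls.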
Roger Oriol (@rogiia)
How to Write a Good index.html File
Every web developer has been there: you're starting a new project and staring at an empty index.html file. You try to remember, which tags were meant to go in the head again? Which meta tags are best practice and which ones are deprecated?