ogre.run

8 posts

ogre.run banner
ogre.run

ogre.run

@ogrerun

https://t.co/siftqAwxDx, Inc. From source code to a working environment, in seconds.

United States Katılım Şubat 2024
85 Takip Edilen23 Takipçiler
ogre.run retweetledi
Wilder Lopes
Wilder Lopes@wilderlopes·
The majority of #GenAI and #LLMs applications in software development are targeting the act of creating code (great!). But there is a lot more room for GenAI-enabled improvement in the "code-adjacent" tasks, or what comes after coding. ⬇️
Wilder Lopes tweet media
English
7
1
3
513
ogre.run
ogre.run@ogrerun·
A new landing page for Ogre! Ogre is the code reproducibility platform. - Works with public GitHub repos - Just generate the files (README, Dockerfile, requirements) and download them - No account creation necessary app.ogre.run
English
0
0
0
15
ogre.run retweetledi
François Chollet
François Chollet@fchollet·
If you could fully automate software engineering (my job), I think that would be great, since I could then move on to higher-leverage things. Making software is a means to an end, not the end (software is different from art in that sense). However I have not yet seen any signs that we're going to be able to automate more than an infinitesimal fraction of my job -- not even "this might work someday maybe if $100B gets invested in the area and if we're very lucky" type of signs. I wish!
English
81
287
2.8K
538.8K
ogre.run retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
# automating software engineering In my mind, automating software engineering will look similar to automating driving. E.g. in self-driving the progression of increasing autonomy and higher abstraction looks something like: 1. first the human performs all driving actions manually 2. then the AI helps keep the lane 3. then it slows for the car ahead 4. then it also does lane changes and takes forks 5. then it also stops at signs/lights and takes turns 6. eventually you take a feature complete solution and grind on the quality until you achieve full self-driving. There is a progression of the AI doing more and the human doing less, but still providing oversight. In Software engineering, the progression is shaping up similar: 1. first the human writes the code manually 2. then GitHub Copilot autocompletes a few lines 3. then ChatGPT writes chunks of code 4. then you move to larger and larger code diffs (e.g. Cursor copilot++ style, nice demo here youtube.com/watch?v=Smklr4…) 5.... Devin is an impressive demo of what perhaps follows next: coordinating a number of tools that a developer needs to string together to write code: a Terminal, a Browser, a Code editor, etc., and human oversight that moves to increasingly higher level of abstraction. There is a lot of work not just on the AI part but also the UI/UX part. How does a human provide oversight? What are they looking at? How do they nudge the AI down a different path? How do they debug what went wrong? It is very likely that we will have to change up the code editor, substantially. In any case, software engineering is on track to change substantially. And it will look a lot more like supervising the automation, while pitching in high-level commands, ideas or progression strategies, in English. Good luck to the team!
YouTube video
YouTube
Cognition@cognition

Today we're excited to introduce Devin, the first AI software engineer. Devin is the new state-of-the-art on the SWE-Bench coding benchmark, has successfully passed practical engineering interviews from leading AI companies, and has even completed real jobs on Upwork. Devin is an autonomous agent that solves engineering tasks through the use of its own shell, code editor, and web browser. When evaluated on the SWE-Bench benchmark, which asks an AI to resolve GitHub issues found in real-world open-source projects, Devin correctly resolves 13.86% of the issues unassisted, far exceeding the previous state-of-the-art model performance of 1.96% unassisted and 4.80% assisted. Check out what Devin can do in the thread below.

English
351
1.8K
10.8K
2.1M