NeoSigma

75 posts

NeoSigma banner
NeoSigma

NeoSigma

@NeoSigmaAI

Let your agents self-improve in production.

San Francisco, CA Katılım Ekim 2025
2 Takip Edilen818 Takipçiler
Sabitlenmiş Tweet
NeoSigma
NeoSigma@NeoSigmaAI·
The next era of AI engineering is self-improving agentic systems! Really excited to share what we are building at NeoSigma! Self-maintaining agent systems represent a shift in how we build and operate software. We, at NeoSigma are building the infrastructure to support this feedback loop in real-world systems, helping teams capture failures, convert them into structured evaluation signals, and use them to drive continuous improvements in agent behavior.
Gauri Gupta@gauri__gupta

We @neosigmaai @RitvikKapila are building the future of self-improving AI systems! By closing the feedback loop between production data and system improvements, we help teams capture failures, convert them into structured evaluation signals, and use them to drive continuous improvements in agent behavior. We show how our system works on Tau3 bench across retail, telecom, and airline domains. Agent performance on the validation set (with a fixed underlying model, GPT5.4) improves from 0.56 → 0.78 (~40% jump in accuracy).

English
0
1
14
4.9K
NeoSigma retweetledi
Gauri Gupta
Gauri Gupta@gauri__gupta·
People often say: NeoSigma is such a great name (and logo). so here’s my attempt at explaining it: Neo means new. Sigma (Σ) is the greek mathematical symbol for summation, a core abstraction behind any optimization (or loss) function. on a philosophical level, NeoSigma represents a new paradigm of self-optimizing software systems. @neosigmaai ps: yes, @ritvikkapila and I designed the logo ourselves :)
Gauri Gupta tweet media
English
7
1
37
2.9K
NeoSigma retweetledi
Gauri Gupta
Gauri Gupta@gauri__gupta·
We @NeoSigmaAI recently presented our mission at @agihouse_org at the Agent Harness Build Day! We are building the future of self-improving agentic systems! Come build with us: neosigma.ai/careers
Gauri Gupta tweet media
Ritvik Kapila@ritvikkapila

Let your agents self-improve in production! Recently presented our mission at @agihouse_org at the Agent Harness Build Day! It was amazing to see the energy in the room and developers build on top of our auto-harness repo - github.com/neosigmaai/aut…. Come build the future of agentic software with us: neosigma.ai/careers

English
1
2
34
4.1K
NeoSigma
NeoSigma@NeoSigmaAI·
Self-improving agentic systems are the future! We presented our work at @agihouse_org as folks built on top of our auto-harness repo - github.com/neosigmaai/aut…. The builder energy in the room was captivating. Come join us: neosigma.ai/careers
NeoSigma tweet media
Ritvik Kapila@ritvikkapila

Let your agents self-improve in production! Recently presented our mission at @agihouse_org at the Agent Harness Build Day! It was amazing to see the energy in the room and developers build on top of our auto-harness repo - github.com/neosigmaai/aut…. Come build the future of agentic software with us: neosigma.ai/careers

English
0
0
3
336
NeoSigma
NeoSigma@NeoSigmaAI·
RT @gauri__gupta: The future of engineering is changing faster than we realize, and it's self-managed, self-improving agentic software! Co…
English
0
1
0
114
NeoSigma
NeoSigma@NeoSigmaAI·
Come work with us @NeoSigmaAI. We are a product-driven research lab pushing the frontier of agentic systems and redefining the interface between humans and AI. Apply here ats.rippling.com/neosigma/jobs/…
Gauri Gupta@gauri__gupta

Hiring a cracked software engineer for backend development to come work with us @NeoSigmaAI. If you’ve built and shipped real systems and like building things ground up, we would love to talk. Please apply with things you’ve built, your contributions, and any details about your previous work and experience. Currently, only looking for in-person roles based in SF.

English
0
1
25
3.8K
NeoSigma
NeoSigma@NeoSigmaAI·
@0xkanth Excited to see what you come up with!
English
0
0
0
12
NeoSigma retweetledi
John Ennis
John Ennis@johnennis·
Everything is going bananas right now and it is all the same pattern Everything is a Ralph tool Anything that you need to do, ask yourself: can an agent draft it, can an agent review it, and can an agent improve the process? If the answer is yes, then loop
Gauri Gupta@gauri__gupta

x.com/i/article/2040…

English
3
2
26
3K
NeoSigma
NeoSigma@NeoSigmaAI·
@FStrongpaw @gauri__gupta Point your coding agent at the repo (or a fork to define your own benchmark) and run: Read PROGRAM.md and start the optimization loop. That's it, the agent reads failures, improves your harness, gates every change, and repeats. github.com/neosigmaai/aut…
English
0
0
1
22
Gauri Gupta
Gauri Gupta@gauri__gupta·
Releasing auto-harness: an open source library for our self improving agentic systems with auto-evals. We got a lot of responses from people wanting to try the self-improving loop on their own agent. So we open-sourced our setup. Connect your agent and let it cook over the weekend! brrrrrrr!
Gauri Gupta@gauri__gupta

x.com/i/article/2040…

English
11
29
375
58K
NeoSigma
NeoSigma@NeoSigmaAI·
Meta-Harness is a great example of how powerful automated harness optimization can be when you have the eval set. s/o to @yoonholeee! We purpose-built NeoSigma for production systems, where engineers maintain these evals, and we're helping them. We start with no eval set, building it automatically by mining and clustering live failures, then permanently encoding every fix as a regression constraint. This is what ensures that fixing bug 2 doesn't silently re-introduce bug 1. The loop never terminates; it compounds.
English
0
0
1
11
Gauri Gupta
Gauri Gupta@gauri__gupta·
@xGoatJames We will slowly release beta access to our product soon!
English
1
0
2
103
NeoSigma
NeoSigma@NeoSigmaAI·
@owengretzinger @gauri__gupta The open-source version is intentionally simplified to get people started with the loop. We're building the failure mining, clustering, and multi-candidate search parts for production systems. Drop us a note at neosigma.ai/waitlist
English
2
0
4
56
owen
owen@owengretzinger·
@gauri__gupta the part i'm most interesting in is the production failure mining, clustering, & multi-candidate search, but it seems like those aren't in the repo you open sourced :( would love to learn more about those!
English
1
0
10
1.4K
Regina Lin
Regina Lin@reggitales·
point AutoAgent at a task domain with evals. 24 hours later it has domain-specific tooling, verification loops, and orchestration logic. all discovered autonomously.
Kevin Gu@kevingu

x.com/i/article/2039…

English
3
2
16
2.9K