
Turing
9.4K posts

Turing
@turingcom
Accelerating superintelligence to drive economic growth.


Turing Research is launching a groundbreaking initiative to capture and utilize the complete, unfiltered operational history of companies, creating the definitive dataset for training the next generation of frontier models. Project Lazarus is an initiative to acquire and permanently preserve the full, unfiltered operational history of defunct or inactive companies at scale. We focus on private codebases, version histories, internal documentation, post-mortems, experimentation logs, infrastructure tooling, and everyday work artifacts that collectively reflect how real organizations actually operate. These materials capture the reality of knowledge work: incomplete specifications, tradeoffs made under time pressure, accumulated technical debt, evolving systems, and decisions made under uncertainty. Unlike polished outputs, operational traces preserve the causal structure of work across weeks, months, and years. We prioritize industries with high complexity and outsized GDP impact, including financial services, healthcare and pharma, advanced manufacturing, and enterprise software. These domains contain long-horizon decision making, regulatory constraints, supply chain dependencies, and high-value intellectual property that are critical for training economically useful AI systems. The data is structured for advanced methodologies such as reinforcement learning, imitation learning & long-horizon task evaluation, enabling models to learn multi-step reasoning, organizational decision processes, and system diagnosis over extended timelines. For founders, Project Lazarus is also preservation. A company’s history is a compressed record of human judgment, experimentation, and problem-solving. Instead of disappearing, that work compounds by becoming part of the foundation shaping the next generation of autonomous AI systems.



🔥 𝗘𝗻𝘁𝗲𝗿𝗽𝗿𝗶𝘀𝗲𝗢𝗽𝘀-𝗚𝘆𝗺 𝗶𝘀 𝘁𝗮𝗸𝗶𝗻𝗴 𝗼𝗳𝗳 𝗵𝘂𝗴𝗲: 2K downloads in 3 days (trending #6 dataset + #3 paper of the day) 🏆. So we re-ran the leaderboard on the 𝗹𝗮𝘁𝗲𝘀𝘁 𝗳𝗿𝗼𝗻𝘁𝗶𝗲𝗿 𝗰𝗹𝗼𝘀𝗲𝗱 𝗺𝗼𝗱𝗲𝗹𝘀… and the results were promising. ✅ Claude versions show a meaningful jump in reliability on enterprise tasks. ✅ Gemini 3.1 Pro is catching up fast, now much closer to Sonnet 4.6 than earlier releases. And yet, the bigger takeaway is still the same: - Big room for improvement on enterprise-grade agentic tasks. - These workflows punish "seemingly correct." One wrong default, one policy miss, one unintended side effect.. and the task fails. 📢 𝗖𝗮𝗹𝗹𝗼𝘂𝘁 (especially if you’re working on agents): As we prepare our next NeurIPS/COLM submissions, try your agents on EnterpriseOps-Gym and see how they hold up on realistic, policy-constrained, long-horizon tasks. 🌐 Website: enterpriseops-gym.github.io 🤗 Dataset: huggingface.co/datasets/Servi… @ServiceNowRSRCH , @sagardavasam , @turingcom , @turingcomdev , @Mila_Quebec , @shiva_malay @PShravannayak















