
New era loaded.
Dennis K Cinelli
97 posts

@DennisKCinelli
CFO @ Paramount Skydance

We’re launching the Remote Labor Index (RLI) with @cais, the first benchmark evaluating whether AI agents can independently complete full, paid freelance tasks. The results provide a needed reality check: automation is advancing, but still has a long way to go. RLI offers a clear view of what AI can do today, and what that means for the workforce tomorrow.

We’re introducing SEAL Showdown, the AI leaderboard that actually captures real preferences, powered by a platform used by real people. Public benchmarks today rely on contrived tasks or narrow user groups, which leaves us guessing which models people actually prefer. SEAL Showdown changes that. Model performance can now be segmented by demographics and domains. Rankings aren’t just a single global average; they can be broken down by region, profession, education level, age, and more, giving a nuanced view of how models work for different people. SEAL Showdown sets a new bar, because AI should be judged by how well it works for everyone.

Introducing our Agentic Leaderboards. These new leaderboards test AI agents in real-world, high-complexity environments, setting a new standard for completing end-to-end digital tasks.

President Trump says China's DeepSeek AI model is a "wake-up call" for American companies, but that developing faster and cheaper AI methods is a good thing.


1/ Today, @Scale_AI is announcing $1B of financing at a $13.8B valuation. The round was led by @Accel along with our existing investors. @Scale_AI has never been better positioned to accelerate the abundance of frontier data and pave the road for AGI. 🧵 scale.com/blog/scale-ai-…

We @scale_ai are announcing a strategic partnership with @OpenAI today to be their preferred partner for enterprise fine-tuning. We are excited about the future of providing the perfect model for each enterprise's unique needs. Read more in 🧵 openai.com/blog/openai-pa…

Today, @scale_AI is launching two major platforms to bolster government and enterprise: 🎖 Scale Donovan, the AI copilot for defense, and 🏙 Scale EGP, full-stack generative AI for global enterprise. 👇 See Donovan in action below. 🧵 on our platforms and why they are so critical

We are proud to partner with the U.S. government to deliver transformative, next-generation large language models (LLMs) purpose-built for intelligence and operations. Learn more → scale.com/federal-llm