

Mattheus Maximus ⚔️🐸🌿

@Matthew__707
Kekius Maximus 👇🏼 https://t.co/yuADqk4eFv 👉🏼 0x26E550AC11B26f78A04489d5F20f24E3559f7Dd9 ~ https://t.co/hHGAOwgOvZ







Anthropic has been testing a new model called "Mythos" with certain customers:
- a "step change" in AI capabilities, including "dramatically higher scores" in coding, academic reasoning and cybersecurity
- "currently far ahead of any other AI model in cyber capabilities"
- part of a new "Capybara" series of models, which are larger and more intelligent than Opus
- more expensive to run than Opus; not yet ready for general release











These two photographs are separated by only 66 years.










Today's @symbolica harness is a clear example of what human-crafted targeting can achieve on ARC-AGI-3 public demo set. You can "buy" performance with benchmark-specific prompts/strategies. Their approach could still contain useful ideas, excited to see what the community finds.








