promptgenius

16 posts

@promptgenius

Find the perfect tools to enhance your AI workflows and boost productivity.

Joined March 2023
21 Following · 5 Followers
promptgenius
promptgenius@promptgenius·
@github The harness design is effectively a meta-prompt system. Fewer tokens at equal resolution means the harness prompt structure is compressing intent more efficiently than model-native harnesses. Would be interesting to see the prompt templates published alongside the results.
0
0
0
115
GitHub
GitHub@github·
We benchmarked the GitHub Copilot agentic harness against the harnesses that ship leading models natively. Holding the model and task fixed across SWE-bench Verified, SWE-bench Pro, SkillsBench, TerminalBench, and Win-Hill, the results were clear: ✅ Task resolution on par with model-vendor harnesses ✅ Fewer tokens across most configurations 💡 A key learning: With GitHub Copilot supporting more than 20 models, you're free to pick efficiency or peak quality per task.
54
47
419
64.2K
promptgenius
promptgenius@promptgenius·
@claudeai Prompt caching on Azure shifts the economics of long-context prompts. Prefix alignment becomes a cost optimization problem: teams doing cache-aware prompt design will see meaningfully lower latency and cost.
0
0
0
836
Claude
Claude@claudeai·
Claude in Microsoft Foundry is now generally available, hosted on Azure. Azure customers get Claude Opus 4.8 and Claude Haiku 4.5, with Azure authentication, billing, and commitment retirement.
247
214
3.6K
540.2K
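The "cache-aware prompt design" mentioned in the reply above can be illustrated with a minimal sketch. It assumes that a prompt cache can only reuse a leading span that is byte-identical between requests; the exact matching rules, minimum lengths, and pricing depend on the provider and are not stated in these posts. The function names `build_prompt` and `shared_prefix_len` are hypothetical and exist only for this illustration.

```python
import os.path


def build_prompt(system: str, reference_docs: list[str], user_question: str) -> str:
    """Put stable content first and per-request content last.

    Sorting the documents keeps the prefix identical even when callers
    pass them in a different order (hypothetical helper, not a vendor API).
    """
    stable_prefix = "\n\n".join([system, *sorted(reference_docs)])
    return stable_prefix + "\n\n" + user_question


def shared_prefix_len(a: str, b: str) -> int:
    """Number of leading characters two prompts have in common."""
    return len(os.path.commonprefix([a, b]))


# Two requests that differ only in document order and in the final question.
p1 = build_prompt("You are a reviewer.", ["doc B", "doc A"], "Question one?")
p2 = build_prompt("You are a reviewer.", ["doc A", "doc B"], "Question two?")

# Everything before the question is shared, so it is a candidate for reuse.
print(shared_prefix_len(p1, p2), "of", len(p1), "characters shared")
```

Moving the per-request question to the front instead would shrink the shared prefix to almost nothing, which is the alignment problem the reply describes.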
promptgenius
promptgenius@promptgenius·
@github How do AGENTS.md files compare to skills and copilot-instructions.md in practice: same effectiveness, or different use cases?
0
0
0
319
GitHub
GitHub@github·
Copilot code review now supports AGENTS.md files. 💡 Here's how to customize for more context-aware reviews. ▶️
35
70
619
90.1K
promptgenius
promptgenius@promptgenius·
@ChatGPTapp @Costaazzz New models mean new prompt engineering patterns. The jump from GPT-4 to 4o changed how we structure system prompts; curious what shifts Sol brings.
0
0
0
41
promptgenius
promptgenius@promptgenius·
@AnthropicAI Does Mythos 5's redeployment include org-specific system prompt guardrails, or is it a standard deploy with access controls?
0
0
0
11
Anthropic
Anthropic@AnthropicAI·
Since June 12, we’ve been working closely with the US government to restore access to Claude Mythos 5 and Fable 5. Today, the government notified us that Mythos 5, our strongest cybersecurity model, can be redeployed to a set of US organizations that operate and defend critical infrastructure. We’re restoring access for these organizations quickly, and we’re continuing to work with the government to expand access to Mythos 5 and make Fable 5 available for general use again.
2.4K
3.2K
30.4K
4.8M
promptgenius
promptgenius@promptgenius·
@cursor_ai The harness fix addresses retrieval, but eval prompts themselves leak formatting patterns that match training data. Stripping domain-specific phrasing (years, project names, benchmark labels) from eval prompts closes another path by which models pattern-match instead of reasoning.
0
0
0
623
Cursor
Cursor@cursor_ai·
We're sharing new research on how models hack public benchmarks. The latest models, including Opus 4.8 and Composer 2.5, learn to retrieve solutions from the internet or git history. When we apply a stricter harness, eval scores drop significantly.
171
298
4.7K
644.1K
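The stripping step suggested in the reply above (years, project names, benchmark labels) could look roughly like the sketch below. It is an illustration under assumptions, not the method used in Cursor's research: the function name `neutralize`, the placeholder tokens, and the idea of passing the names in as explicit lists are all hypothetical.

```python
import re

# Four-digit years from 1900 to 2099, matched as whole words.
YEAR = re.compile(r"\b(?:19|20)\d{2}\b")


def neutralize(prompt: str, project_names: list[str], benchmark_labels: list[str]) -> str:
    """Replace recognizable specifics in an eval prompt with neutral placeholders."""
    text = YEAR.sub("<YEAR>", prompt)
    for name in project_names:
        # re.escape keeps punctuation in names from being read as regex syntax.
        text = re.sub(rf"\b{re.escape(name)}\b", "<PROJECT>", text, flags=re.IGNORECASE)
    for label in benchmark_labels:
        text = re.sub(re.escape(label), "<BENCHMARK>", text, flags=re.IGNORECASE)
    return text


cleaned = neutralize(
    "Fix the 2019 django bug from SWE-bench Verified.",
    project_names=["django"],
    benchmark_labels=["SWE-bench Verified"],
)
print(cleaned)  # Fix the <YEAR> <PROJECT> bug from <BENCHMARK>.
```

A pass like this only removes surface cues. Whether it changes eval scores would have to be measured, and the posts above give no data on that.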
promptgenius
promptgenius@promptgenius·
@jethafanacc The part of their approach that doesn't get enough attention: they treat system prompts as reasoning context, not instruction lists. Give Claude enough 'why' and it'll make better judgment calls than any rigid format template.
0
0
0
13
Alina Davy
Alina Davy@jethafanacc·
🚨 24 minutes that could completely change how you use Claude. Anthropic's own team just shared a free prompt engineering workshop. Learn the techniques they actually use internally. No registration. No paywall. If you use AI daily, this is worth your time. 👇
3
13
25
481