Coreline

867 posts

Coreline

Coreline

@corline0

Link: https://t.co/BVppV1HUKl

Katılım Eylül 2014
548 Takip Edilen43 Takipçiler
AI/ML API
AI/ML API@aimlapi·
Qwen3.7-Max on AI/ML API - built for the agent era GPQA Diamond (92.4), HMMT (97.1), Apex (44.5) Sustains 35+ hours of autonomous execution Works with Claude Code, Qwen Code & more Comment Qwen to get Free promo code
English
83
23
139
33K
Coreline
Coreline@corline0·
@MrAhmadAwais @ArtificialAnlys I am using gemini 3.5 basic 10 dollar plan, and its amazing 4 hour limit are too generous on antigravity, for gemini 3.5 low and it is better than kimi 2.6 by all means.
English
0
0
0
51
Artificial Analysis
Artificial Analysis@ArtificialAnlys·
Google’s new Gemini 3.5 Flash is the clear leader on the Intelligence vs Speed Pareto frontier and makes large gains on GDPval-AA (real-world agentic tasks), but is 5x the cost of Gemini 3 Flash @GoogleDeepMind gave us pre-release access to Gemini 3.5 Flash, the latest model in its Flash family, which has traditionally has offered faster, lower-cost alternatives to Gemini Pro models. Gemini 3.5 Flash scores 55 on the Artificial Analysis Intelligence Index, up 9 points from Gemini 3 Flash, driven primarily by agentic performance gains and hallucination reduction. It achieves speeds of over 280 output tokens/s, but higher token usage and token pricing make it over 5x more costly to run the Intelligence Index than Gemini 3 Flash, and 75% more costly than Gemini 3.1 Pro. Gemini 3.5 Flash is $1.50/1M input and $9/1M output tokens, Gemini 3 Flash was $0.5/$3 per 1M input/output tokens, a 3x increase. The rest of the increase was driven by higher token usage when running our benchmarks Key results for Gemini 3.5 Flash with ‘high’ thinking level: ➤ 9 point Intelligence Index improvement: Gemini 3.5 Flash scores 55 on the Artificial Analysis Intelligence Index, up 9 points from Gemini 3 Flash. This places it ahead of Grok 4.3 (high, 53) and Claude Sonnet 4.6 (max, 52). The model improves across nearly all evaluations, with the largest gains coming from agentic evaluations and AA-Omniscience (knowledge and hallucination). On AA-Omniscience, Gemini 3.5 Flash improves by 11 points, driven primarily by reduced hallucinations, with its hallucination rate falling to 61%, a 31 point decrease compared to Gemini 3 Flash ➤ Agentic capability improvements: Gemini 3.5 Flash improves substantially over Gemini 3 Flash across our agentic evaluations, in both GDPval-AA (real-world agentic tasks) and Tau2-Bench Telecom (agentic tool use). Its GDPval-AA result is especially notable, achieving an Elo of 1656, well ahead of Gemini 3 Flash (1204) and Gemini 3.1 Pro (1314), and just behind GPT-5.4 (xhigh, 1674). This represents a meaningful step forward for Google in agentic performance, which has historically been a relative weakness for Gemini models ➤ Speed-intelligence frontier: Gemini 3.5 Flash achieves speeds of over 280 output tokens per second, ~70% faster than Gemini 3 Flash and models such as gpt-oss-120b and GPT-5.4 mini (xhigh). With its 55 Intelligence Index score, this places Gemini 3.5 Flash on the speed-intelligence Pareto frontier alongside Gemini 3.1 Pro and Gemini 3.1 Flash-Lite, reinforcing Google’s strength in models balancing speed and intelligence ➤ 5.5x increase in cost to run: Gemini 3.5 Flash costs $1,552 to run the Artificial Analysis Intelligence Index, 5.5x more than Gemini 3 Flash and 75% more than Gemini 3.1 Pro. This is driven by increases in both token usage and token prices. Output token usage is broadly unchanged from Gemini 3 Flash (73M vs. 72M), but input token usage increases significantly, driven primarily by an increase in the number of turns in agentic evaluations. Gemini 3.5 Flash is priced 3x higher than Gemini 3 Flash at $1.50/$9.00 per 1M input/output tokens, with a 90% discount for cached input tokens ➤ Google continues to lead multimodal performance: Gemini 3.5 Flash is multimodal, supporting image, video, and speech input alongside text. This differs from many proprietary models, including Claude Opus 4.7, Grok 4.3, and GPT-5.5, which support image input only. In our multimodal evaluation, MMMU-Pro, Gemini 3.5 Flash scores 84% - the highest score recorded. This puts models from Google in the top two spots, with Gemini 3.1 Pro scoring 82% Key model details: ➤ Context window: Retains the same 1M context window as Gemini 3 Flash ➤ Multimodality: Text, image, video and speech input with text output only ➤ Pricing: $1.50/$9.00 per million input/output tokens, with a 90% discount for cached input tokens Congratulations @GoogleDeepMind , @sundarpichai and @demishassabis on the great release!
Artificial Analysis tweet media
English
39
115
951
104.8K
Blazer
Blazer@NuminousVoidX·
@MrAhmadAwais A $5 plan could be enough for my daily work
English
2
0
1
62
Ahmad Awais
Ahmad Awais@MrAhmadAwais·
amazing to see developers get claude level work done in Command at 1/100th the cost and much better DX!! yet another developer, 382M tokens, $1 Go plan, "DeepSeek in Command Code behaves exactly like Claude and follows my instructions perfectly."
Coreline@corline0

Thanks @MrAhmadAwais,its beyond what I imagined for $1 could give me this many tokens. DeepSeek in @CommandCodeAI behaves exactly like Claude and follows my instructions perfectly. you should introduce a feature like who ever consume the full 4x stretch will get the reward 😆

English
3
1
48
2.4K
Coreline
Coreline@corline0·
Thanks @MrAhmadAwais,its beyond what I imagined for $1 could give me this many tokens. DeepSeek in @CommandCodeAI behaves exactly like Claude and follows my instructions perfectly. you should introduce a feature like who ever consume the full 4x stretch will get the reward 😆
Coreline tweet media
English
2
0
7
2.5K
Coreline
Coreline@corline0·
@saylor Tell me you have to sell without telling me you need to sell. 😆
English
0
0
0
6
Michael Saylor
Michael Saylor@saylor·
Buy more bitcoin than you sell.
English
4.3K
2.3K
26.1K
2.4M
Parody Jeff
Parody Jeff@Parodyjeffx·
FUCKING INSANE. Israeli rabbi Meir Mazuz praises the soldiers who gang-raped Palestinians kidnapped from Gaza. “You have done nothing wrong”
English
400
3.6K
6.9K
708.6K
Coreline
Coreline@corline0·
@sonaabeyg Second you select cross margin doesn't matter. Next time try with isolated screen shot.
English
0
0
0
253
sonum sohail
sonum sohail@sonaabeyg·
I am too good with leverage $4000 in pockets from just $200 margin (with leverage)! Refer to screen recording attached
sonum sohail tweet media
English
50
10
319
39K
Neitsab
Neitsab@Neitsab_FR·
@0xSero @BoysReviewer @gork Reply of Grok: Opencode Go ($10/month after first-month discount) delivers around 14,000 requests per 5 hours for M2.7 and up to 20,000 for M2.5, compared to MiniMax Starter/Basic plan's 1,500 requests per 5 hours for M2.7.
English
3
0
5
907
0xSero
0xSero@0xSero·
Best subscriptions on market based on your budget: 10$ a month: #1 - Opencode go #2 - GLM basic #3 - MiniMax basic #4 - Huggingface Pro 20$ a month: #1 - GPT-Plus #2 - Kimi-Code #3 - Mix and match from 10$ tier 50$ a month #1 - Qwen Code #2 - Mix and match from above
0xSero tweet media
English
112
44
1.1K
72.7K
Coreline
Coreline@corline0·
@claudeai Limits on paid plans are less than gemini cli, gemini cli build apk app on free tier. Gemini gives more credits free than claude paid. Insane.
English
0
0
0
41
Claude
Claude@claudeai·
Computer use is now in Claude Code. Claude can open your apps, click through your UI, and test what it built, right from the CLI. Now in research preview on Pro and Max plans.
English
2.5K
4.8K
59.3K
16.1M