Pankaj Kumar

9.4K posts

@pankajkumar_dev

I build things | DM for work/collab

Joined August 2024
638 Following · 8.3K Followers
Pankaj Kumar @pankajkumar_dev
Claude Mythos vs GPT-5.5 Cyber: the security race is getting serious
- When Claude Mythos was first presented, it genuinely felt like Anthropic had something far ahead in cyber reasoning and long-horizon analysis
- The Firefox results were genuinely impressive, with 423 bugs fixed in a month, 271 found with Mythos's help, including sandbox escapes and a 20-year-old XSLT issue
- What made Mythos stand out wasn't just bug finding, but validating exploit paths and reasoning through them dynamically
- But GPT-5.5 Cyber entering the picture changes things a lot, with a 71.4% expert cyber score vs Mythos's 68.6%, while also being dramatically cheaper and faster to run
- GPT-5.5 solving reverse engineering tasks in minutes at low compute cost is a pretty big deal for real production usage
- Mythos still seems stronger for extremely deep codebase analysis and longer reasoning chains, but the serving cost sounds massive
Honestly, it feels more like a race to make powerful cyber agents actually usable at scale without high cost or latency.
1 reply · 3 reposts · 12 likes · 717 views
Pankaj Kumar reposted
Pankaj Kumar @pankajkumar_dev
ERNIE 5.1 is one of the most efficient frontier models yet
- Baidu says ERNIE 5.1 reached near frontier-level performance at only 6% of the usual training cost, which is kind of wild
- They heavily compressed the model too, cutting total params to 1/3 and active params to 1/2 while still improving performance
- It hit #4 globally on Arena Search (1223 score), becoming the strongest Chinese model there
- Scored 99.6 on AIME26 (with tools), putting it right behind Gemini 3.1 Pro for math reasoning
- Also beat DeepSeek-V4-Pro on some practical agent benchmarks like τ³-Bench and SpreadsheetBench-Verified
- Writing + knowledge performance is now getting close to Gemini 3.1 Pro territory as well
- Most of the gains reportedly come from smarter training methods rather than brute-force scaling
4 replies · 4 reposts · 27 likes · 1.7K views
Dev Sharma @devsharmatwt
this is how I use X btw:
22 replies · 1 repost · 38 likes · 494 views
Pankaj Kumar @pankajkumar_dev
@Rindzay3210 Yep, it's performing well, and we need more models of this type.
0 replies · 0 reposts · 0 likes · 66 views
Void @Rindzay3210
@pankajkumar_dev It really does search very well; on my tasks it beat Sonnet 4.6
Original language: Russian · 1 reply · 0 reposts · 1 like · 94 views
Pankaj Kumar @pankajkumar_dev
@39_encho @kuuceo Congratulations 🎊 Wishing you happiness! Congrats on your marriage and the new baby!!!
Original language: Japanese · 0 replies · 0 reposts · 0 likes · 101 views
Pankaj Kumar @pankajkumar_dev
@belmond_b_2434 "This is no time to be saying 'I'm scared', Bel-san lol. Get your wedding gift money ready 😂"
Original language: Japanese · 0 replies · 0 reposts · 0 likes · 110 views
Pankaj Kumar reposted
Pankaj Kumar @pankajkumar_dev
Revamped the UI of my most loved/hated project: Resume Sach 💀
A Hinglish AI Resume Roaster that judges your CV harder than Indian relatives.
Upload resume → AI scans the fluff → choose roast level → emotional damage delivered instantly.
Already 8000+ resumes roasted.
You are NOT surviving this one: resume.pankajk.tech
Disclaimer: Try at your own risk.
3 replies · 2 reposts · 11 likes · 612 views
shivam @10xshivam
Meet Vijay Shekhar Sharma (Founder of Paytm)
Started Paytm in 2010
Faced years of losses & criticism
IPO crashed in 2021
Many called Paytm “finished”
But Vijay kept building.
Focused on growth + profitability
Improved the business year after year
After 14 years, Paytm finally reported profit
From being doubted publicly
To building India’s biggest fintech brand.
The comeback nobody saw coming.
2 replies · 0 reposts · 8 likes · 295 views
Nikita Bier @nikitabier
Would it be valuable to know how many of your followers have been active on X in the last 24 hours?
9K replies · 1.1K reposts · 19.5K likes · 1.5M views
mitsuki @mitsu_x0
I'm honestly pretty cute
Original language: Japanese · 474 replies · 1.5K reposts · 65.8K likes · 1.3M views
Nandkishor @devops_nk
ChatGPT BTW
161 replies · 181 reposts · 11.9K likes · 1M views
Nandkishor @devops_nk
@pankajkumar_dev No bro, it has credentials, and our API keys are very sensitive data, so we can't share it
2 replies · 0 reposts · 1 like · 4.1K views