Pankaj Kumar

9.4K posts

@pankajkumar_dev

I build things | DM for work/collab

Joined August 2024
637 Following · 8.3K Followers
Pankaj Kumar
Pankaj Kumar@pankajkumar_dev·
Claude Mythos vs GPT-5.5 Cyber: the security race is getting serious
- When Claude Mythos was first presented, it genuinely felt like Anthropic had something far ahead in cyber reasoning and long-horizon analysis
- The Firefox results were genuinely impressive, with 423 bugs fixed in a month, 271 found with Mythos's help, including sandbox escapes and a 20-year-old XSLT issue
- What made Mythos stand out wasn't just bug finding, but validating exploit paths and reasoning through them dynamically
- But GPT-5.5 Cyber entering the picture changes things a lot, with a 71.4% expert cyber score vs Mythos's 68.6%, while also being dramatically cheaper and faster to run
- GPT-5.5 solving reverse engineering tasks in minutes at low compute cost is a pretty big deal for real production usage
- Mythos still seems stronger for extremely deep codebase analysis and longer reasoning chains, but the serving cost sounds massive
Honestly, it feels more like a race to make powerful cyber agents actually usable at scale without high cost or latency.
English
1
3
15
1.1K
Pankaj Kumar reposted
Pankaj Kumar
Pankaj Kumar@pankajkumar_dev·
ERNIE 5.1 is one of the most efficient frontier models yet
- Baidu says ERNIE 5.1 reached near-frontier-level performance at only 6% of the usual training cost, which is kind of wild
- They heavily compressed the model too, cutting total params to 1/3 and active params to 1/2 while still improving performance
- It hit #4 globally on Arena Search (1223 score), becoming the strongest Chinese model there
- Scored 99.6 on AIME26 (with tools), putting it right behind Gemini 3.1 Pro for math reasoning
- Also beat DeepSeek-V4-Pro on some practical agent benchmarks like τ³-Bench and SpreadsheetBench-Verified
- Writing + knowledge performance is now getting close to Gemini 3.1 Pro territory as well
- Most of the gains reportedly come from smarter training methods rather than brute-force scaling
English
4
5
31
1.9K
Dev Sharma
Dev Sharma@devsharmatwt·
this is how i use X btw :
English
25
1
45
1K
Pankaj Kumar
Pankaj Kumar@pankajkumar_dev·
@Rindzay3210 Yep, it's performing well, and we need more models of this type.
English
0
0
0
79
Void
Void@Rindzay3210·
@pankajkumar_dev It really does search very well; on my tasks it outperformed Sonnet 4.6
Russian
1
0
1
107
Pankaj Kumar
Pankaj Kumar@pankajkumar_dev·
@39_encho @kuuceo Congratulations 🎊 Wishing you happiness! Congrats on your marriage and the birth of your baby!!!
Japanese
0
0
0
104
Pankaj Kumar
Pankaj Kumar@pankajkumar_dev·
@belmond_b_2434 "This is no time to be saying 'I'm scared,' Bel-san, lol. Get your wedding gift money ready 😂"
Japanese
0
0
0
124
Pankaj Kumar reposted
Pankaj Kumar
Pankaj Kumar@pankajkumar_dev·
Revamped the UI of my most loved/hated project: Resume Sach 💀
A Hinglish AI Resume Roaster that judges your CV harder than Indian relatives.
Upload resume → AI scans the fluff → choose roast level → emotional damage delivered instantly.
Already 8000+ resumes roasted.
You are NOT surviving this one: resume.pankajk.tech
Disclaimer: Try at your own risk.
English
3
2
11
647
shivam
shivam@10xshivam·
Meet Vijay Shekhar Sharma (Founder of Paytm)
Started Paytm in 2010
Faced years of losses & criticism
IPO crashed in 2021
Many called Paytm “finished”
But Vijay kept building.
Focused on growth + profitability
Improved the business year after year
After 14 years, Paytm finally reported profit
From being doubted publicly
To building India’s biggest fintech brand.
The comeback nobody saw coming.
English
2
1
9
313
Pankaj Kumar
Pankaj Kumar@pankajkumar_dev·
@nikitabier Yes, I'm seeing 90% of my reach come from non-followers.
English
0
0
0
13
Nikita Bier
Nikita Bier@nikitabier·
Would it be valuable to know how many of your followers have been active on X in the last 24 hours?
English
9K
1K
19.5K
1.5M
mitsuki
mitsuki@mitsu_x0·
I'm honestly just cute
Japanese
485
1.5K
67.7K
1.4M
Nandkishor
Nandkishor@devops_nk·
ChatGPT BTW
English
161
182
11.9K
1M
Nandkishor
Nandkishor@devops_nk·
@pankajkumar_dev No bro, it contains credentials, and our API keys are very sensitive data. We can't share it.
English
2
0
1
4.1K