Vikas Aditya

35 posts

Vikas Aditya

@vikasaditya2000

Future = humans + agents. Curious about their skills, output, and proof-of-work. CEO @HackerEarth

Silicon Valley Katılım Ocak 2025

14 Takip Edilen13 Takipçiler

Vikas Aditya@vikasaditya2000·9 Nis

Flying on @SouthwestAir today. Plane is 30% full, overhead bins empty… but my backpack must go under the seat. We’re officially at the stage where humans are executing policies like badly fine-tuned AI models—zero context, zero judgment. Thanks Southwest Airlines 😄

English

2.5K

Vikas Aditya@vikasaditya2000·19 Mar

Sometimes the most important product moments are the ones users barely notice — because everything just works. Really proud of the HackerEarth team for recently pulling off one of the largest campus hiring events for one of the most prestigious tech company: 125K student registrations, nearly 70K candidates in a single test, 0.5M+ code submissions, 1M+ test cases, 16M+ proctoring snapshots, and 2.5M+ reporting events. At that scale, reliability is the product. What impressed me most was not just the numbers, but the quality of execution behind them. The team strengthened the assessment experience, scaled evaluation and proctoring pipelines, reworked reporting for near real-time visibility, and built the platform to handle extreme bursts far more predictably. These are the kinds of engineering investments that don’t always show up in flashy demos, but they make all the difference when thousands of candidates are depending on your platform for a career-defining moment. Moments like this remind us that evaluation infrastructure is not only about capabilities — it is about trust at scale. When candidates show up for an important opportunity and companies are trusting your platform with their brand, performance is non-negotiable. Big shoutout to everyone across engineering, product, support, sales, and ops who made this happen. Grateful to be building with a team that cares so deeply about scale, resilience, and candidate experience.

English

Vikas Aditya@vikasaditya2000·17 Şub

At #aiimpactsummit, almost all major companies are there. They are showcasing the capabilities of their platforms. While LinkedIn is showing how jobs will be transformed, top of mind for most attendees is if Al will replace their job. We at @HackerEarth are doing a live poll of which jobs will be replaced by Al. And we have a mirror that says "You are looking at someone who will not be replaced by Al". Based on the interest attendees have in taking their selfie in front of this mirror and in participating in the poll, it's clear that we need to talk more about Al's impact on jobs more than anything else.

English

Vikas Aditya@vikasaditya2000·17 Şub

There is so much noise about models reaching human capabilities on several benchmarks. But not for real world tasks. Not even close. The highest rated model on vibecodearena.ai is Grok and sits well below human scores.

English

Vikas Aditya@vikasaditya2000·16 Şub

Good energy and vibes at #aiimpactsumit in India. @HackerEarth is presenting. We also see many students and non technical folks. Which jobs AI will replace (including their own) is on top of people's minds

English

408

Vikas Aditya@vikasaditya2000·16 Oca

AI Is Rewriting Software Engineering Jobs, and 2026 Hiring Will Reward "Aptitude Over Syntax," New Data Suggests prn.to/3YETJO9

English

Vikas Aditya@vikasaditya2000·19 Ara

So benchmarks like SWE-bench really matter? Models have learned how to crack benchmarks. We need to test them on real world apps. I used Gemini and GPT to generate code that draws city skyline. Here are the results: Gemini: produced London skyline with landmarks. Analytical, Structured and more efficient GPT5.2: produced New York skyline. It's more creative, deputy, artistic. Here is full analysis with working code: vibecodearena.ai/duel/4eccbccc-…

English

Chubby♨️@kimmonismus·19 Ara

It looks like we'll be getting an upgrade to Gemini 3.0 Pro soon! Nice!

English

273

13.2K

Vikas Aditya@vikasaditya2000·19 Ara

Another fun app I built at vibecodearena. This time using Claude and Kimi-K2 While Claude did score better, Kimi-K2 did pretty well and considering that it's open source, it's a tough contender Check it out here👇 vibecodearena.ai/duel/ac041a46-…

English

292

Vikas Aditya@vikasaditya2000·19 Ara

@victormustar Used VibeCodeArena to develop a 5x5 Tic Tac Toe game and compared Claude and Kimi-K2. Kimi-K2 did pretty good imho. Check it out 👇 vibecodearena.ai/duel/ac041a46-…

English

Victor M@victormustar·4 Ara

Kimi-K2 ✖️ Z-Image-Turbo (god I love open source)

English

605

34.2K

Vikas Aditya@vikasaditya2000·18 Ara

What creative personalities have you noticed in the models you use? Claude vs GPT? KimiK2 vs Llama? Gemini vs anyone? Drop your observations below. Genuinely curious what patterns people are seeing. 👇

English

Vikas Aditya@vikasaditya2000·18 Ara

Entire analysis with generated code for both models is at vibecodearena.ai/duel/4eccbccc-…

English

Vikas Aditya@vikasaditya2000·18 Ara

I used vibecodearena.ai to develop p5.js code using GPT-5.2 and Gemini 3 Pro - the two leading models. Exactly same instructions: "make a p5.js illustration of different city skylines during sunset" Both model generated the code but they have completely different creative DNA. Thread on what this reveals about AI personalities 🧵

English

Vikas Aditya@vikasaditya2000·17 Ara

Gave GPT-5.2 and Gemini the same prompt: "create sunset skylines in p5.js". Both leading models. Gemini: 53 sec, 526 lines, accurate London skyline but feels icon based GPT-5.2: 297 sec, 1086 lines, Feels more artistic, atmospheric mood piece Gemini seems more analytical (left brain) whereas GPT5.2 appears more creative (right brain). But we need both, right? Full comparison at vibecodearena.ai/duel/4eccbccc-…

English

368

AshutoshShrivastava@ai_for_success·17 Ara

If you have used both Nano Banana Pro and GPT Image 1.5, do you really think this ranking is correct? What is your verdict?

Arena.ai@arena

🚨BREAKING: Image Arena Shakeup @OpenAI’s gpt-image-1.5 and chatgpt-image-latest are now available in the Arena. 🥇gpt-image-1.5 is #1 in Text-to-Image (1264) 🥇chatgpt-image-latest is #1 on Image Edit (1409) 🔹gpt-image-1.5 #4 in Image Edit (1395) gpt-image-1.5 holds a commanding 29-point lead on Text-to-Image, while maintaining a narrow 3-point edge over @GoogleDeepMind’s Nano Banana Pro (and its 2K variant) on Image Edit. These scores are preliminary - we’ll see where they settle. gpt-image-1.5 delivers substantial gains over gpt-image-1: 🔹 +147 points in Text-to-Image 🔹 +245 points in Image Edit Huge congrats to the @OpenAI team on this incredible milestone! 👏

English

143

287

71.7K

Keşfet

@SouthwestAir @HackerEarth @victormustar @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates