
Introducing Gemini 3.1 Flash Live, our new realtime model for building voice and vision agents!! We've spent more than a year improving the model + infra + experience. The result? A step-function improvement in quality, reliability, and latency.

When @ashtom goes off to start something new, you pay attention. It could end up signaling the future of how developers and AI will work together. @EntireHQ is a developer-first AI platform where humans and AI agents can truly collaborate to build, learn, and evolve together. It's a vision that goes far beyond being a place to store code. We designed the @EntireHQ logo to humanize AI: we gave the logomark a face, then brought it to life as a mascot with an entire behavioral system of its own. A friendly robot that embodies the developer-first mindset, making cutting-edge tech feel approachable and relatable, and less like impenetrable science fiction. Sometimes the best way to introduce the future is to make it smile back at you.

Beep, boop. Come in, rebels. We've raised a $60M seed round to build the next developer platform. Open. Scalable. Independent. And we ship our first OSS release today. entire.io/blog/hello-ent…


We're excited to announce the release and open-sourcing of HunyuanImage 3.0, the largest and most powerful open-source text-to-image model to date, with over 80 billion total parameters, of which 13 billion are activated per token during inference. Its quality is comparable to the industry's flagship closed-source models. 🚀🚀🚀

HunyuanImage 3.0 originates from our internally developed native multimodal large language model, fine-tuned and post-trained for text-to-image generation. This unique foundation gives the model a powerful set of capabilities:
✅ Reason with world knowledge
✅ Understand complex, thousand-word prompts
✅ Generate precise text within images

Unlike traditional DiT-architecture image generation models, HunyuanImage 3.0's MoE architecture uses a Transfusion-based approach to deeply couple Diffusion and LLM training into a single, powerful system. Built on Hunyuan-A13B, HunyuanImage 3.0 was trained on a massive dataset: 5 billion image-text pairs, video frames, interleaved image-text data, and 6 trillion tokens of text corpora. This hybrid training across multimodal generation, understanding, and LLM capabilities allows the model to seamlessly integrate multiple tasks.

Whether you're an illustrator, designer, or creator, this is built to cut your workflow from hours to minutes. HunyuanImage 3.0 can generate intricate text, detailed comics, expressive emojis, and lively, engaging illustrations for educational content. The current release focuses solely on text-to-image generation; future updates will add image-to-image, image editing, multi-turn interaction, and more.

👉🏻 Try it now: hunyuan.tencent.com/image
🔗 GitHub: github.com/Tencent-Hunyua…
🤗 Hugging Face: huggingface.co/tencent/Hunyua…
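As a back-of-the-envelope illustration of the MoE numbers in the announcement (80B total parameters, 13B activated per token), here's a small sketch of what "sparse activation" means for per-token compute versus resident memory. The figures come from the post above; the helper function is purely illustrative.

```python
def moe_active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Fraction of parameters actually used per token in an MoE model."""
    return active_params_b / total_params_b

# HunyuanImage 3.0: ~80B total parameters, ~13B activated per token.
frac = moe_active_fraction(80.0, 13.0)
print(f"Active per token: {frac:.1%}")  # ~16.2% of weights touched per token

# All 80B weights must still be resident for inference; at 2 bytes/param
# (bf16) that is roughly 160 GB before activations and caches.
weights_gb = 80e9 * 2 / 1e9
print(f"Approx. bf16 weight memory: {weights_gb:.0f} GB")
```

The point of the sketch: MoE buys roughly dense-13B compute cost per token while still requiring dense-80B memory capacity.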


🚀 LongCat-Flash-Chat Launches!
▫️ 560B Total Params | 18.6B-31.3B Dynamic Activation
▫️ Trained on 20T Tokens | 100+ tokens/sec Inference
▫️ High Performance: TerminalBench 39.5 | τ²-Bench 67.7
🔗 Model: huggingface.co/meituan-longca…
💻 Try Now: longcat.ai
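To put the dynamic-activation range above in perspective, a quick sketch using only the announced figures (560B total, 18.6B-31.3B active per token); the function name is just for illustration.

```python
def activation_range_pct(total_b: float, low_b: float, high_b: float) -> tuple[float, float]:
    """Share of total parameters active per token, as percentages."""
    return 100 * low_b / total_b, 100 * high_b / total_b

# LongCat-Flash-Chat: per-token activation varies dynamically with the input.
lo, hi = activation_range_pct(560.0, 18.6, 31.3)
print(f"Active per token: {lo:.1f}% to {hi:.1f}% of 560B total")
```

So each token exercises only a few percent of the network, which is how a 560B model can sustain 100+ tokens/sec inference.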


BTW, I've basically stopped using Opus entirely, and I now have several Codex tabs with GPT-5-high working on different tasks across the 3 codebases (HVM, Bend, Kolmo). Progress has never been so intense. My job now is basically passing well-specified tasks to Codex and reviewing its outputs. OpenAI isn't paying me and couldn't care less about me. This model is just very good, and the fact that people can't see it made me realize most of you are probably using chatbots as girlfriends or something, rather than for assisting with complex coding tasks.

🚀 Meet Qwen-Image, a 20B MMDiT model for next-gen text-to-image generation. Especially strong at creating stunning graphic posters with native text. Now open-source.
🔍 Key Highlights:
🔹 SOTA text rendering: rivals GPT-4o in English, best-in-class for Chinese
🔹 In-pixel text generation: no overlays, fully integrated
🔹 Bilingual support, diverse fonts, complex layouts
🎨 Also excels at general image generation, from photorealistic to anime, impressionist to minimalist. A true creative powerhouse.
Blog: qwenlm.github.io/blog/qwen-imag…
Hugging Face: huggingface.co/Qwen/Qwen-Image
ModelScope: modelscope.cn/models/Qwen/Qw…
Github: github.com/QwenLM/Qwen-Im…
Technical report: …anwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Image/Qwe…
Demo: modelscope.cn/aigc/imageGene…
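For anyone wondering whether a 20B dense model like this fits on local hardware, here's a rough feasibility sketch (weights only, ignoring activations and caches; the 20B figure is from the announcement, the precision options and the 24 GB threshold are assumptions for illustration):

```python
def weight_gb(params_b: float, bytes_per_param: float) -> float:
    """Approximate weight memory in GB (1 GB = 1e9 bytes)."""
    return params_b * 1e9 * bytes_per_param / 1e9

# 20B parameters at a few common precisions vs. a 24 GB consumer GPU.
for name, bpp in [("bf16", 2), ("int8", 1), ("int4", 0.5)]:
    gb = weight_gb(20.0, bpp)
    fits = "yes" if gb <= 24 else "no"
    print(f"{name}: ~{gb:.0f} GB weights; fits a 24 GB GPU: {fits}")
```

Rule of thumb: at bf16 the weights alone are ~40 GB, so single consumer-GPU use generally implies quantization.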



We're thrilled to release & open-source Hunyuan3D World Model 1.0! This model lets you generate immersive, explorable, and interactive 3D worlds from just a sentence or an image. It's the industry's first open-source 3D world generation model, compatible with CG pipelines for full editability & simulation, and set to transform game development, VR, digital content creation, and more.
Get started now👇🏻
Project Page: 3d-models.hunyuan.tencent.com/world/
Try it now: 3d.hunyuan.tencent.com/sceneTo3D
Github: github.com/Tencent-Hunyua…
Hugging Face: huggingface.co/tencent/Hunyua…
