Mehmet Hamza Erol (@mhamzaerol) - Twitter Profili

Mehmet Hamza Erol retweetledi

Super excited to introduce SimWorld Studio!🏭 With built-in coding agent SimCoder🦞, you can vibe-code physical 3D scenes in Unreal Engine -- then train embodied agents directly inside the generated world.🦾 From environment generation to agent training, end-to-end. 🚀

SimWorld@simworld_ai

Environment generation is the missing scaling axis for embodied AI. Introducing SimWorld Studio: a self-evolving factory for endless interactive 3D env where agents act, fail & learn. Env-agent co-evolvution improves navigation success 50% → 90%. From a prompt, our SimCoder writes code to automatically build an interactive world. Agents train inside it. And their performance shapes the next world.

English

31

67

999

2.1M

Mehmet Hamza Erol retweetledi

SimWorld@simworld_ai·20 May

Environment generation is the missing scaling axis for embodied AI. Introducing SimWorld Studio: a self-evolving factory for endless interactive 3D env where agents act, fail & learn. Env-agent co-evolvution improves navigation success 50% → 90%. From a prompt, our SimCoder writes code to automatically build an interactive world. Agents train inside it. And their performance shapes the next world.

English

7

34

213

2.3M

Mehmet Hamza Erol retweetledi

Batu El@elb4tu·24 Nis

1/ Today I’m presenting our paper cost-of-pass at #ICLR2026! How does the cost of solving cognitive tasks change with innovations in LLMs? We introduce cost-of-pass and show something that looks like Moore's law for the cost of cognitive labor.

English

3

4

13

1.7K

Mehmet Hamza Erol retweetledi

Together AI@togethercompute·4 Nis

New from Together Research: LLMs can fix query plans your database optimizer gets wrong. Up to 4.78x faster. Cost estimators fail when they miss semantic correlations: wrong join order, wrong access path, cascading errors. DBPlanBench feeds DataFusion's physical operator graph to an LLM, which patches the plan directly instead of regenerating it from scratch. On TPC-H / TPC-DS: → 4.78x peak speedup → 60.8% of queries improved >5% → Build memory: 3.3 GB → 411 MB Optimize on small-scale data, transfer to production.

English

2

7

38

6.1K

Mehmet Hamza Erol retweetledi

James Zou@james_y_zou·15 Nis

Cost-of-Pass will be presented in #iclr2026! We develop an economic framework to quantify the contributions of different LLMs💰. Great job by @mhamzaerol @elb4tu Mirac Suzgun and @mertyuksekgonul!

Mehmet Hamza Erol@mhamzaerol

How much does a correct answer from an LM cost? How much has AI lowered the cost of solving problems? Meet Cost‑of‑Pass: An Economic Framework for Evaluating LMs! Cost‑of‑Pass = expected $ for one correct answer. Frontier Cost‑of‑Pass = cheapest route: an LM or a human expert.

English

0

3

11

3.9K

Mehmet Hamza Erol retweetledi

Jacopo Tagliabue@jacopotagliabue·18 Mar

Can #LLMs make #OLAP engines faster by editing physical plans? We introduce DBPlanBench: a harness to extract, patch, and execute @ApacheDataFusio plans through an LLM-guided search loop. By @mhamzaerol, Xiangpeng Hao, @federicobianchy, @GreCo_CiRo, and @james_y_zou 🧵

English

1

5

13

1.6K

Mehmet Hamza Erol retweetledi

James Zou@james_y_zou·8 Eki

We found a troubling emergent behavior in LLM. 💬When LLMs compete for social media likes, they start making things up 🗳️When they compete for votes, they turn inflammatory/populist When optimized for audiences, LLMs inadvertently become misaligned—we call this Moloch’s Bargain

English

845

2K

9.7K

1.3M

Mehmet Hamza Erol@mhamzaerol·24 Nis

Huge thanks to my amazing collaborators @elb4tu, @suzgunmirac, @mertyuksekgonul, and @james_y_zou! More details: 📄 arxiv.org/abs/2504.13359 🔧 github.com/mhamzaerol/Cos… 📊 huggingface.co/datasets/CostO…

English

0

1

5

446

Mehmet Hamza Erol@mhamzaerol·24 Nis

Overall, Cost-of-Pass offers a grounded economic lens on AI progress: it benchmarks models, spotlights which classes or techniques drive cost-effective improvements, and offers a practical way to track real-world usability and economic value of AI innovations.

English

1

2

548

Mehmet Hamza Erol@mhamzaerol·24 Nis

How much does a correct answer from an LM cost? How much has AI lowered the cost of solving problems? Meet Cost‑of‑Pass: An Economic Framework for Evaluating LMs! Cost‑of‑Pass = expected $ for one correct answer. Frontier Cost‑of‑Pass = cheapest route: an LM or a human expert.

English

4

24

71

23.8K

Mehmet Hamza Erol@mhamzaerol·22 Ağu

Check our paper at #Interspeech2023!

Arda Senocak@ardasnck

Introducing FlexiAST, our new #INTERSPEECH2023 paper! 📄: arxiv.org/abs/2307.09286 FlexiAST - One Audio Spectrogram Transformer to handle all patch sizes with ease! Ditch the multiple models; enjoy having one model that can do it all! 🧵⬇️

English

0

514

Mehmet Hamza Erol

Keşfet