Beyza Ermiş

55 posts

Beyza Ermiş

@beyzaermis

Research @Cohere, @Cohere_Labs

เข้าร่วม Nisan 2010

240 กำลังติดตาม567 ผู้ติดตาม

Beyza Ermiş รีทวีตแล้ว

Ahmet Üstün@ahmetustun89·9 Haz

Excited to share North Mini Code: Our first open-source coding model. It is small but very strong in agentic coding for its size🔥 I’m incredibly proud of the team — Shipping a model like this takes outstanding research, engineering, infrastructure, and collaboration.

Cohere@cohere

Introducing Cohere's first open-source coding model: North Mini Code Small & efficient, designed for agentic performance and built for community input.

English

3.3K

Beyza Ermiş@beyzaermis·28 May

Really glad I got to be part of this one. Huge credit to @TheyCallMeMr_ for leading the work, and grateful to the whole team!

Saurabh Dash@TheyCallMeMr_

1/ RLVR has driven big gains in math and code because many outputs admit reliable automatic checks: an answer matches the expected result, or a program passes tests. But many real tasks are not like that. Code can be functionally correct but qualitatively terrible or a response may satisfy 4 syntactic constraints but fail 1 semantic constraint.

English

1.6K

Beyza Ermiş รีทวีตแล้ว

Ahmet Üstün@ahmetustun89·21 May

Our latest agentic reasoning model — fast, multilingual, multimodal and fully open source ♥️ Built to run efficiently, available for all.

Cohere@cohere

Introducing: Cohere Command A+ We’ve created our most powerful LLM yet, optimized it to run on as little hardware as possible, and released it open-source for all.

English

2.3K

Beyza Ermiş รีทวีตแล้ว

Nick Frosst@nickfrosst·20 May

Command A+ from @cohere is out now :) its our best model yet and its open source apache 2.0

English

132

1.3K

203.4K

Beyza Ermiş รีทวีตแล้ว

Nils Reimers@Nils_Reimers·19 May

Great to join forces with Reliant AI and to grow our Berlin team. Healthcare and biopharma need trustworthy AI that can run on-prem, and that is able to ground it statements on the latest biomedical research. Our state-of-the-art context engine together with tech&datasets from Reliant, will make AI for pharma even better.

Cohere@cohere

A major step forward for sovereign enterprise AI in healthcare and biopharma. Cohere has acquired @reliant_ai 🇨🇦🇩🇪

English

1.9K

Beyza Ermiş รีทวีตแล้ว

Joelle Pineau@jpineau1·24 Nis

Exciting news for the future of AI! @cohere and Aleph Alpha are partnering to build a global sovereign AI platform. As demand for AI grows, we are accelerating the development of next-generation frontier models while upholding strong standards for data security and ethical AI.

Cohere@cohere

🚀 Sovereign AI for the world. Cohere & Aleph Alpha form transatlantic AI powerhouse anchored in Canada & Germany! Combining our global scale with European R&D excellence to build sovereign, enterprise-grade AI. Security, privacy & trust for businesses & governments worldwide. #SovereignAI #AIPartnership Learn more: businesswire.com/news/home/2026… Image from left to right: Rolf Schumann, Schwarz Digits, Samuel Weinbach, Aleph Alpha, Aidan Gomez, Cohere, Minister Solomon, Canada, Minister Wildberger, Germany

English

8.6K

Beyza Ermiş รีทวีตแล้ว

Daniel D'souza @mrdanieldsouza·17 Şub

Extremely proud to present ✨Tiny Aya ✨ Tiny 🤏 but mighty 💪3.35B parameter models, massively multilingual from the ground up 🌎🌍🌏, built with immense care w.r.t language representation🤗 We had a blast building this! 💗 Have at it! 🎆

Cohere Labs@Cohere_Labs

Introducing ✨Tiny Aya✨, a family of massively multilingual small language models built to run where people actually are. Tiny Aya delivers strong multilingual performance in 70+ global languages in a 3.35B parameter model, efficient enough to run locally, even on a phone.

English

5.5K

Beyza Ermiş รีทวีตแล้ว

Marzieh Fadaee@mziizm·17 Şub

Very proud to share today we’re releasing Tiny Aya✨🤏: small enough to run on your phone, strong enough to support 70+ languages. Proof that multilingual progress comes from intentional design --- and evaluation that measures balance, not just peaks 🕯️

Cohere Labs@Cohere_Labs

English

5.1K

Beyza Ermiş@beyzaermis·21 Oca

So happy this is out in EACL Findings! 🥳 Loved being able to contribute a bit, big congrats to @_joestacey_ and all the authors! Really nice practical takeaways on robustness for closed-source LLM fine-tuning.

Joe Stacey@_joestacey_

Super excited to have my last PhD paper about NLI robustness published at EACL Findings😍 We investigate how to make closed-source LLMs more robust after fine-tuning. Here are the paper highlights 🧵

English

2.4K

Beyza Ermiş@beyzaermis·15 Oca

Proud to share this work led by @oliverjbolton 🎉 We introduce SimMerge, a practical way to make model merging more reliable at scale. Paper and results in the thread 👇

Cohere Labs@Cohere_Labs

🧩In modern LLM development, we often end up with many specialized checkpoints. Merging into one model is attractive, but the results depend a lot on the selected merge method & order. Our paper introduces SimMerge: a simple way to choose the merge configuration automatically.

English

5.3K

Beyza Ermiş@beyzaermis·13 Kas

Sadly, this echoes what many authors and ACs have been feeling. The peer-review system is under serious strain. We can't keep pretending it's fine.

Peter Richtarik@peter_richtarik

I am an AC for ICLR 2026. One of the papers in my batch was just withdrawn. The authors wrote a brief response, explaining why the reviewers failed at their job. I agree with most of their comments. The authors gave up. They are fed up. Just like many of us. I understand. We pretend the emperor has clothes, but he is naked. Here is the final part of their withdrawal notice. I took the liberty to make it public, to highlight that what we are doing with AI conference reviews these last few years is, basically, madness. --- Comment: We thank the reviewers for their time. However, upon reading the reviews for our paper, it became immediately apparent that the four "reject" ratings are not based on good-faith academic disagreement, but on a critical failure to read the submitted paper. The reviews are rife with demonstrably false claims that are directly contradicted by the text. The core justifications for rejection rely on asserting that key components are "missing" when they are explicitly detailed in the manuscript. Some specific examples are (and many are even fake claims). Claim: Harder tasks like GSM8K are missing. Fact: GSM8K results are in many tables, like Table 2 (Section 4.2) and Appendix G. Claim: The method does not use per-layer ranks. Fact: This is the entire point of our method. The reviewer clearly mistook our method for the baselines. (Section 2, Table 1). Claim: The GP kernel is not specified. Fact: It is specified in Appendix E (Table 6). Claim: There is no ablation of the method's three stages. Fact: Section 4.4 ("Ablation Study") and Appendix J are dedicated to this. Reviewers have a fundamental responsibility to read and evaluate the work they are assigned. The nature of these errors is so fundamental, so systemic in overlooking explicit content, that it goes far beyond what "limited time" or "oversight" can explain. This work has gone through several rounds of revision over the last year. In earlier submissions, the paper usually received borderline or weak-accept scores. Numerous signs strongly suggest that some reviewers are relying entirely on AI tools to automatically generate peer reviews, rather than fulfilling their fundamental responsibility of personally reading and evaluating manuscripts. We strongly protest this. This is a gross disrespect to the authors. It is a flagrant desecration of the reviewer's sacred duty. It fundamentally undermines the integrity of the entire peer-review process. Given that the reviews are not based on the actual content of our paper, we have decided to withdraw the submission. We leave this comment so that future readers of the OpenReview page are aware that the items described as "missing" are already present in the submitted manuscript. These negative reviews for this submission are factually unsound and do not reflect the content of the paper. We cannot and will not accept an assessment that is not based on the work we actually submitted.

English

1.5K

Beyza Ermiş รีทวีตแล้ว

Shivalika Singh@singhshiviii·18 Eyl

First acceptance at @NeurIPSConf :) Super thrilled to share that our work, ‘The Leaderboard Illusion’ got accepted at @NeurIPSConf 2025! 🎉 Huge congrats to all my coauthors! And special shout out to @mziizm @sarahookr @beyzaermis @YiyangNan — learnt a lot from each of you 💙

Sara Hooker@sarahookr

It is critical for scientific integrity that we trust our measure of progress. The @lmarena_ai has become the go-to evaluation for AI progress. Our release today demonstrates the difficulty in maintaining fair evaluations on @lmarena_ai, despite best intentions.

English

227

30.1K

Beyza Ermiş รีทวีตแล้ว

Marzieh Fadaee@mziizm·29 Ağu

Our scholar program is our big bet on a delicate balance: driving impactful research in ML and guiding the next generation of thinkers and researchers. Looking forward to reviewing all the applications! 🎶

Cohere Labs@Cohere_Labs

As we start to review applications, we're seeing so much expertise, energy, and thought from applicants around the world. Thank you for engaging with this process so generously, and congratulations on your applications. The potential you all bring to ML research is inspiring. ✨

English

9.4K

Beyza Ermiş รีทวีตแล้ว

Yong Zheng-Xin@yong_zhengxin·20 Ağu

🔥 Our one-year work (collaboration with @Cohere_Labs) on multilingual safety survey is accepted to EMNLP 2025 Main!! We got one crazy reviewer but we also received one of the most encouraging feedback: "I greatly appreciate the suggested research directions. These are clear, well-motivated, and tractable. I am personally eager to explore these in our own work." Paper: arxiv.org/abs/2505.24119

Yong Zheng-Xin@yong_zhengxin

🧵 Multilingual safety training/eval is now standard practice, but a critical question remains: Is multilingual safety actually solved? Our new survey with @Cohere_Labs answers this and dives deep into: - Language gap in safety research - Future priority areas Thread 👇

English

130

12.5K

Beyza Ermiş รีทวีตแล้ว

Joelle Pineau@jpineau1·14 Ağu

I’m thrilled to be joining @cohere in the role of Chief AI Officer, helping advance cutting-edge research and product development. Cohere has an incredible team and mission. Exciting new chapter for me!

Cohere@cohere

We’re excited to announce $500M in new funding to accelerate our global expansion and build the next generation of enterprise AI technology! We are also welcoming two additions to our leadership team: Joelle Pineau as Chief AI Officer and Francois Chadwick as Chief Financial Officer. cohere.com/blog/august-20…

English

123

1.7K

180.6K

Beyza Ermiş@beyzaermis·11 Ağu

I’m looking forward to working with the next cohort. Don’t miss the deadline!

Cohere Labs@Cohere_Labs

Applications are now open for the next cohort of the Cohere Labs Scholars Program! 🌟 This is your chance to collaborate with some of the brightest minds in AI & chart new courses in ML research. Let's change the spaces breakthroughs happen. Apply by Aug 29.

English

108

9.1K

Beyza Ermiş@beyzaermis·23 Tem

I’m very excited to be co-organizing this @NeurIPSConf workshop on LLM evaluations! Evaluating LLMs is a complex and evolving challenge. With this workshop, we hope to bring together diverse perspectives to make real progress. See the details below:

LLM Evals Workshop @NeurIPS@LLM_eval

We are happy to announce our @NeurIPSConf workshop on LLM evaluations! Mastering LLM evaluation is no longer optional -- it's fundamental to building reliable models. We'll tackle the field's most pressing evaluation challenges. For details: sites.google.com/corp/view/llm-…. 1/3

English

6.6K

Beyza Ermiş รีทวีตแล้ว

Sara Hooker@sarahookr·19 Tem

Sometimes it is important to take a moment and celebrate -- we achieved all of this in 3 years. Pretty incredible impact from @Cohere_Labs 🔥

English

162

16.5K

Beyza Ermiş รีทวีตแล้ว

Ammar Khairi@ammar__khairi·26 Haz

🚀 Want better LLM performance without extra training or special reward models? Happy to share my work with @Cohere_labs : "When Life Gives You Samples: Benefits of Scaling Inference Compute for Multilingual LLMs" 👀How we squeeze more from less at inference 🍋, details in 🧵

English

15.1K

Beyza Ermiş รีทวีตแล้ว

Cohere Labs@Cohere_Labs·24 Haz

This July join the Cohere Labs Open Science Community for ML Summer School. 📚 This series is organized and hosted by @AhmadMustafaAn1, @KanwalMehreen2 and @AnasZaf79138457

English

15.9K

ค้นพบ

@TheyCallMeMr_ @cohere @_joestacey_ @oliverjbolton @NeurIPSConf @mziizm @sarahookr @YiyangNan