Beyza Ermiş

55 posts

Beyza Ermiş

Beyza Ermiş

@beyzaermis

Research @Cohere, @Cohere_Labs

เข้าร่วม Nisan 2010
240 กำลังติดตาม567 ผู้ติดตาม
Beyza Ermiş รีทวีตแล้ว
Ahmet Üstün
Ahmet Üstün@ahmetustun89·
Excited to share North Mini Code: Our first open-source coding model. It is small but very strong in agentic coding for its size🔥 I’m incredibly proud of the team — Shipping a model like this takes outstanding research, engineering, infrastructure, and collaboration.
Cohere@cohere

Introducing Cohere's first open-source coding model: North Mini Code Small & efficient, designed for agentic performance and built for community input.

English
2
12
53
3.3K
Beyza Ermiş รีทวีตแล้ว
Nick Frosst
Nick Frosst@nickfrosst·
Command A+ from @cohere is out now :) its our best model yet and its open source apache 2.0
English
56
132
1.3K
203.4K
Beyza Ermiş รีทวีตแล้ว
Nils Reimers
Nils Reimers@Nils_Reimers·
Great to join forces with Reliant AI and to grow our Berlin team. Healthcare and biopharma need trustworthy AI that can run on-prem, and that is able to ground it statements on the latest biomedical research. Our state-of-the-art context engine together with tech&datasets from Reliant, will make AI for pharma even better.
Cohere@cohere

A major step forward for sovereign enterprise AI in healthcare and biopharma. Cohere has acquired @reliant_ai 🇨🇦🇩🇪

English
1
6
27
1.9K
Beyza Ermiş รีทวีตแล้ว
Joelle Pineau
Joelle Pineau@jpineau1·
Exciting news for the future of AI! @cohere and Aleph Alpha are partnering to build a global sovereign AI platform. As demand for AI grows, we are accelerating the development of next-generation frontier models while upholding strong standards for data security and ethical AI.
Cohere@cohere

🚀 Sovereign AI for the world. Cohere & Aleph Alpha form transatlantic AI powerhouse anchored in Canada & Germany! Combining our global scale with European R&D excellence to build sovereign, enterprise-grade AI. Security, privacy & trust for businesses & governments worldwide. #SovereignAI #AIPartnership Learn more: businesswire.com/news/home/2026… Image from left to right: Rolf Schumann, Schwarz Digits, Samuel Weinbach, Aleph Alpha, Aidan Gomez, Cohere, Minister Solomon, Canada, Minister Wildberger, Germany

English
10
11
82
8.6K
Beyza Ermiş รีทวีตแล้ว
Daniel D'souza 
Daniel D'souza @mrdanieldsouza·
Extremely proud to present ✨Tiny Aya ✨ Tiny 🤏 but mighty 💪3.35B parameter models, massively multilingual from the ground up 🌎🌍🌏, built with immense care w.r.t language representation🤗 We had a blast building this! 💗 Have at it! 🎆
Cohere Labs@Cohere_Labs

Introducing ✨Tiny Aya✨, a family of massively multilingual small language models built to run where people actually are. Tiny Aya delivers strong multilingual performance in 70+ global languages in a 3.35B parameter model, efficient enough to run locally, even on a phone.

English
0
12
45
5.5K
Beyza Ermiş รีทวีตแล้ว
Marzieh Fadaee
Marzieh Fadaee@mziizm·
Very proud to share today we’re releasing Tiny Aya✨🤏: small enough to run on your phone, strong enough to support 70+ languages. Proof that multilingual progress comes from intentional design --- and evaluation that measures balance, not just peaks 🕯️
Cohere Labs@Cohere_Labs

Introducing ✨Tiny Aya✨, a family of massively multilingual small language models built to run where people actually are. Tiny Aya delivers strong multilingual performance in 70+ global languages in a 3.35B parameter model, efficient enough to run locally, even on a phone.

English
5
10
66
5.1K
Beyza Ermiş
Beyza Ermiş@beyzaermis·
Sadly, this echoes what many authors and ACs have been feeling. The peer-review system is under serious strain. We can't keep pretending it's fine.
Peter Richtarik@peter_richtarik

I am an AC for ICLR 2026. One of the papers in my batch was just withdrawn. The authors wrote a brief response, explaining why the reviewers failed at their job. I agree with most of their comments. The authors gave up. They are fed up. Just like many of us. I understand. We pretend the emperor has clothes, but he is naked. Here is the final part of their withdrawal notice. I took the liberty to make it public, to highlight that what we are doing with AI conference reviews these last few years is, basically, madness. --- Comment: We thank the reviewers for their time. However, upon reading the reviews for our paper, it became immediately apparent that the four "reject" ratings are not based on good-faith academic disagreement, but on a critical failure to read the submitted paper. The reviews are rife with demonstrably false claims that are directly contradicted by the text. The core justifications for rejection rely on asserting that key components are "missing" when they are explicitly detailed in the manuscript. Some specific examples are (and many are even fake claims). Claim: Harder tasks like GSM8K are missing. Fact: GSM8K results are in many tables, like Table 2 (Section 4.2) and Appendix G. Claim: The method does not use per-layer ranks. Fact: This is the entire point of our method. The reviewer clearly mistook our method for the baselines. (Section 2, Table 1). Claim: The GP kernel is not specified. Fact: It is specified in Appendix E (Table 6). Claim: There is no ablation of the method's three stages. Fact: Section 4.4 ("Ablation Study") and Appendix J are dedicated to this. Reviewers have a fundamental responsibility to read and evaluate the work they are assigned. The nature of these errors is so fundamental, so systemic in overlooking explicit content, that it goes far beyond what "limited time" or "oversight" can explain. This work has gone through several rounds of revision over the last year. In earlier submissions, the paper usually received borderline or weak-accept scores. Numerous signs strongly suggest that some reviewers are relying entirely on AI tools to automatically generate peer reviews, rather than fulfilling their fundamental responsibility of personally reading and evaluating manuscripts. We strongly protest this. This is a gross disrespect to the authors. It is a flagrant desecration of the reviewer's sacred duty. It fundamentally undermines the integrity of the entire peer-review process. Given that the reviews are not based on the actual content of our paper, we have decided to withdraw the submission. We leave this comment so that future readers of the OpenReview page are aware that the items described as "missing" are already present in the submitted manuscript. These negative reviews for this submission are factually unsound and do not reflect the content of the paper. We cannot and will not accept an assessment that is not based on the work we actually submitted.

English
0
0
13
1.5K
Beyza Ermiş รีทวีตแล้ว
Shivalika Singh
Shivalika Singh@singhshiviii·
First acceptance at @NeurIPSConf :) Super thrilled to share that our work, ‘The Leaderboard Illusion’ got accepted at @NeurIPSConf 2025! 🎉 Huge congrats to all my coauthors! And special shout out to @mziizm @sarahookr @beyzaermis @YiyangNan — learnt a lot from each of you 💙
Sara Hooker@sarahookr

It is critical for scientific integrity that we trust our measure of progress. The @lmarena_ai has become the go-to evaluation for AI progress. Our release today demonstrates the difficulty in maintaining fair evaluations on @lmarena_ai, despite best intentions.

English
9
17
227
30.1K
Beyza Ermiş รีทวีตแล้ว
Marzieh Fadaee
Marzieh Fadaee@mziizm·
Our scholar program is our big bet on a delicate balance: driving impactful research in ML and guiding the next generation of thinkers and researchers. Looking forward to reviewing all the applications! 🎶
Cohere Labs@Cohere_Labs

As we start to review applications, we're seeing so much expertise, energy, and thought from applicants around the world. Thank you for engaging with this process so generously, and congratulations on your applications. The potential you all bring to ML research is inspiring. ✨

English
5
13
65
9.4K
Beyza Ermiş รีทวีตแล้ว
Yong Zheng-Xin
Yong Zheng-Xin@yong_zhengxin·
🔥 Our one-year work (collaboration with @Cohere_Labs) on multilingual safety survey is accepted to EMNLP 2025 Main!! We got one crazy reviewer but we also received one of the most encouraging feedback: "I greatly appreciate the suggested research directions. These are clear, well-motivated, and tractable. I am personally eager to explore these in our own work." Paper: arxiv.org/abs/2505.24119
Yong Zheng-Xin tweet media
Yong Zheng-Xin@yong_zhengxin

🧵 Multilingual safety training/eval is now standard practice, but a critical question remains: Is multilingual safety actually solved? Our new survey with @Cohere_Labs answers this and dives deep into: - Language gap in safety research - Future priority areas Thread 👇

English
11
13
130
12.5K
Beyza Ermiş รีทวีตแล้ว
Joelle Pineau
Joelle Pineau@jpineau1·
I’m thrilled to be joining @cohere in the role of Chief AI Officer, helping advance cutting-edge research and product development. Cohere has an incredible team and mission. Exciting new chapter for me!
Cohere@cohere

We’re excited to announce $500M in new funding to accelerate our global expansion and build the next generation of enterprise AI technology! We are also welcoming two additions to our leadership team: Joelle Pineau as Chief AI Officer and Francois Chadwick as Chief Financial Officer. cohere.com/blog/august-20…

English
123
71
1.7K
180.6K
Beyza Ermiş
Beyza Ermiş@beyzaermis·
I’m very excited to be co-organizing this @NeurIPSConf workshop on LLM evaluations! Evaluating LLMs is a complex and evolving challenge. With this workshop, we hope to bring together diverse perspectives to make real progress. See the details below:
LLM Evals Workshop @NeurIPS@LLM_eval

We are happy to announce our @NeurIPSConf workshop on LLM evaluations! Mastering LLM evaluation is no longer optional -- it's fundamental to building reliable models. We'll tackle the field's most pressing evaluation challenges. For details: sites.google.com/corp/view/llm-…. 1/3

English
1
13
52
6.6K
Beyza Ermiş รีทวีตแล้ว
Sara Hooker
Sara Hooker@sarahookr·
Sometimes it is important to take a moment and celebrate -- we achieved all of this in 3 years. Pretty incredible impact from @Cohere_Labs 🔥
Sara Hooker tweet media
English
9
24
162
16.5K
Beyza Ermiş รีทวีตแล้ว
Ammar Khairi
Ammar Khairi@ammar__khairi·
🚀 Want better LLM performance without extra training or special reward models? Happy to share my work with @Cohere_labs : "When Life Gives You Samples: Benefits of Scaling Inference Compute for Multilingual LLMs" 👀How we squeeze more from less at inference 🍋, details in 🧵
Ammar Khairi tweet media
English
2
22
41
15.1K