Giannis Chatziveroglou

222 posts

Giannis Chatziveroglou banner
Giannis Chatziveroglou

Giannis Chatziveroglou

@giannis2two

pretraining @cohere, CS + Math @MIT

New York Beigetreten Mayıs 2023
457 Folgt376 Follower
Giannis Chatziveroglou retweetet
Reid Wiseman
Reid Wiseman@astro_reid·
There are no words.
Reid Wiseman tweet media
English
7.9K
86.4K
649.6K
38.3M
Giannis Chatziveroglou retweetet
Nick Frosst
Nick Frosst@nickfrosst·
@cohere transcribe Sota open source transcription model running in the browser :) Weights on @huggingface link below
English
60
130
1.4K
185.6K
Giannis Chatziveroglou retweetet
Cohere
Cohere@cohere·
Introducing: Cohere Transcribe – a new state-of-the-art in open source speech recognition.
English
82
295
2.6K
601.6K
Giannis Chatziveroglou retweetet
Cohere
Cohere@cohere·
We’re excited to announce our partnership with @RWSGroup, bringing Cohere’s frontier AI models to Language Weaver Pro - unlocking new enterprise‑grade translation capabilities. Purpose-built for high-stakes environments, this integration empowers enterprises and governments to communicate seamlessly across languages and accelerate new opportunities for global collaboration and growth! Learn more: rws.com/about/news/202…
Cohere tweet media
English
1
10
51
4.7K
Giannis Chatziveroglou retweetet
Cohere
Cohere@cohere·
We’re excited to announce that we’ve signed a Memorandum of Understanding for advanced AI collaboration with Saab, a global leader in defense and security. Together, we’ll explore groundbreaking AI partnerships for their aerospace platforms and deliver tailored solutions critical to Saab’s operations.  Read more: cohere.link/cquJlw4
Cohere tweet media
English
2
5
39
3.1K
Giannis Chatziveroglou retweetet
Dwarak
Dwarak@DwaraknathG·
Hey all, I will be at GTC next week talking about all the work my team and I did on large-scale MoE training in JAX on GPUs! We decided early on to have a fully dropless training stack to avoid token dropping. (1/7)
English
2
11
103
15.3K
Giannis Chatziveroglou retweetet
Giannis Kaklamanis
Giannis Kaklamanis@gianniskakl·
We are excited to share our new paper on Verifiable Aggregate Receipts (VAR) — a cryptographic primitive for auditable engagement counts that prevents inflation while preserving user privacy. This is joint work with @wanggary142857, Jasleen Malvai, and @0xFanZhang at @YaleACL.
Giannis Kaklamanis tweet media
English
1
1
5
623
Giannis Chatziveroglou retweetet
Dwarak
Dwarak@DwaraknathG·
I will be at Nvidia GTC in March! @bharatvenki and I are gonna talk about all the systems work we do at Cohere. Come listen from me about all the custom kernel work we do for large scale LLM training on Hopper and Blackwell!! nvidia.com/gtc/session-ca…
English
1
3
15
575
Giannis Chatziveroglou retweetet
The Greek Analyst
The Greek Analyst@GreekAnalyst·
I love discovering extremely talented young Greeks like @giannis2two in NYC building super cool stuff. So many hidden (people) gems out there. Who are some of the highest signal yet most underfollowed GR techies you know? Drop them below, help me bring the spotlight they deserve.
English
5
5
45
189.7K
Giannis Chatziveroglou retweetet
Giannis Chatziveroglou retweetet
Kyle Duffy
Kyle Duffy@kyduffy·
@SahilBloom You should distinguish uncertainty of purpose from uncertainty of rewards E.g. it should be certain that you'll show up and take a step every day; even if it's uncertain you'll be rewarded Uncertainty of purpose should be extinguished, uncertainty of rewards should be expected
English
0
2
7
595
Giannis Chatziveroglou
Giannis Chatziveroglou@giannis2two·
Excited to share the preprint of my Master's thesis. A*-Decoding: Token-Efficient Inference Scaling A*-Decoding is a search-based inference-time scaling method that matches large model performance using up to 3x fewer tokens and 30% fewer PRM passes. arxiv.org/abs/2505.13672…
English
0
2
10
920
Laura Ruis
Laura Ruis@LauraRuis·
Excited to announce that this fall I'll be joining @jacobandreas's amazing lab at MIT for a postdoc to work on interp. for reasoning (with @ev_fedorenko 🤯 among others). Cannot wait to think more about this direction in such a dream academic context!
English
45
11
475
31.2K
Giannis Chatziveroglou retweetet
Edward Grefenstette
Edward Grefenstette@egrefen·
Still a 3 rows deep crowd at poster 208 at #ICLR2025 . Want to know what an internship project with one of the largest compute budgets I've ever seen for an internship looks like? Drop by @LauraRuis' poster in next 1h45 ✨
Edward Grefenstette tweet media
English
2
2
64
8.1K
Giannis Chatziveroglou retweetet
Max Bartolo
Max Bartolo@max_nlp·
I'm excited to the tech report for our @Cohere @CohereForAI Command A and Command R7B models. We highlight our novel approach to model training including the use of self-refinement algorithms and model merging techniques at scale. Command A is an efficient, agent-optimised multilingual model offering best-in-class capabilities. Read more below! ⬇️
Max Bartolo tweet media
English
10
71
267
84.4K
Giannis Chatziveroglou retweetet
Aidan Gomez
Aidan Gomez@aidangomez·
Today @cohere is very excited to introduce Command A, our new model succeeding Command R+. Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding usecases. 🧵
Aidan Gomez tweet media
English
29
118
818
183.9K