Filip Graliński
@FilipGralinski
204 posts
6502 and Haskell hacker, machine learner, hypopolyglot (many languages, all poor), opposite Pole, skeptical forteanist
Joined September 2021
587 Following · 111 Followers
Filip Graliński retweeted
Snowflake @Snowflake
Day 2 of #SnowflakeSummit flew by but not before a mountain of announcements from our Platform Keynote! We announced: Adaptive Compute, Snowflake Openflow, Cortex AISQL, Semantic Model Sharing, Snowflake Intelligence, and much more. See what's new: bit.ly/4mNjiqR
Filip Graliński @FilipGralinski
@jxmnop Yeah, great paper, also for educational purposes. I'm curious whether the embeddings are still available... Maybe king + woman − man = queen was already there and they just didn't realize it...
dr. jack morris @jxmnop
did you know people have been training neural networks on text since 2003? everyone talks about Attention Is All You Need. but this is the real paper that got our field started. it was in 2003, in montreal. i read it, and it was even more forward-thinking than i expected:
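The king + woman − man = queen arithmetic mentioned in the reply above can be sketched with hand-picked toy vectors (hypothetical values chosen for illustration, not the actual embeddings from the 2003 paper):

```python
import numpy as np

# Toy 4-d word embeddings, chosen by hand so that one axis roughly encodes
# "royalty" and another "gender". Purely illustrative values.
emb = {
    "king":  np.array([0.9,  0.8, 0.1, 0.0]),
    "queen": np.array([0.9, -0.8, 0.1, 0.0]),
    "man":   np.array([0.1,  0.8, 0.0, 0.0]),
    "woman": np.array([0.1, -0.8, 0.0, 0.0]),
    "apple": np.array([0.0,  0.0, 0.9, 0.3]),
}

def analogy(a, b, c):
    """Return the word whose vector is closest (by cosine similarity)
    to vec(a) - vec(b) + vec(c)."""
    target = emb[a] - emb[b] + emb[c]

    def cos(u, v):
        return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

    # Exclude the query words themselves, as is standard in analogy evaluation.
    candidates = {w: v for w, v in emb.items() if w not in (a, b, c)}
    return max(candidates, key=lambda w: cos(candidates[w], target))

print(analogy("king", "man", "woman"))  # → queen
```

With real embeddings the same nearest-neighbor search is done over the whole vocabulary rather than a five-word toy dictionary.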
Filip Graliński @FilipGralinski
@spacemanidol Every language is an alphabet of symbols whose use presupposes a past that the interlocutors share... [Borges, quoted in Spanish in the original]
Daniel Campos @spacemanidol
Kinda crazy how much random literature applies to the LLM world now. Borges would probably have a field day with LLMs and the constant talk of the incoming singularity. Truly the creation of the Library of Babel.
Filip Graliński retweeted
Łukasz Borchmann @LukaszBorchmann
How can the most accurate SQL be generated for a given question? We propose a method to significantly boost text-to-SQL accuracy while drastically cutting costs.👇 #NLProc #AI #TextToSQL #LLMs
Filip Graliński retweeted
Anupam Datta @datta_cs
Our Snowflake AI Research team just released Arctic Embed’s core training code into the open source ArcticTraining project — making it easier for developers and researchers to reproduce, fine-tune, and build on our embedding models. Arctic Embed is the leading small embedding model on the MTEB leaderboard and is widely used with over 1M monthly downloads.
What you’ll find:
✅ Clean, config-driven workflows powered by DeepSpeed
✅ Flexible contrastive data handling
✅ Example fine-tuning recipes and ready-to-use tooling
Read more here and try it out: snowflake.com/en/engineering…
@SnowflakeDB @DeepSpeedAI @lukemerrick_ @pxyumass @spacemanidol @rajhans_samdani @jeffra45 @StasBekman
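The "contrastive data handling" mentioned above refers to contrastive training of embedding models. A minimal in-batch-negatives loss of the kind commonly used for retrieval embeddings (an InfoNCE-style sketch, not ArcticTraining's actual implementation) looks like this:

```python
import numpy as np

def info_nce(queries, docs, temperature=0.05):
    """In-batch-negatives contrastive loss: the i-th doc is the positive
    for the i-th query; every other doc in the batch is a negative."""
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    d = docs / np.linalg.norm(docs, axis=1, keepdims=True)
    logits = q @ d.T / temperature                 # (batch, batch) cosine similarities
    logits -= logits.max(axis=1, keepdims=True)    # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))            # cross-entropy against the diagonal
```

When each query is closest to its own document the loss approaches zero; mismatched pairs are penalized. Real trainers add hard negatives mined outside the batch, which is one of the things a dedicated data pipeline handles.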
Filip Graliński retweeted
Luke Merrick @lukemerrick_
Connor Shorten was kind enough to give me the mic for a lot of hot takes on text embedding models in the latest Weaviate podcast.
Connor Shorten @CShorten30

Arctic Embed ❄️ has been one of the most impactful open-source text embedding models! In addition to the open model, which has helped a lot of companies kick off their own inference and fine-tuning services (including us), the Snowflake team has also published incredible research breaking down all the components of how to train these models! I am SUPER EXCITED to publish the 110th Weaviate Podcast with Luke Merrick (@lukemerrick_), Puxuan Yu (@pxyumass), and Charles Pierse (@cdpierse) discussing all things Arctic Embed!
The podcast covers:
• The origin of Arctic Embed
• Pre-training embedding models
• Matryoshka Representation Learning
• Fine-tuning embedding models
• Synthetic Query Generation
• Hard Negative Mining
• Single-Vector Embedding Models in the search model cohort of ColBERT, SPLADE, and Re-rankers
I hope you enjoy the podcast! As always, please reach out if you would like to discuss any of these ideas further!

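Matryoshka Representation Learning, one of the podcast topics, trains a model so that prefixes of the embedding vector are themselves usable embeddings. At inference time that amounts to truncating and re-normalizing (a generic sketch, not Arctic Embed's actual code):

```python
import numpy as np

def truncate_embedding(v, k):
    """Matryoshka-style use of an embedding: keep only the first k
    dimensions and re-normalize, trading accuracy for storage and speed."""
    t = v[:k]
    return t / np.linalg.norm(t)
```

A 1024-d vector can then be stored as, say, its first 256 dimensions, with only a modest retrieval-quality drop when the model was trained with MRL objectives at those prefix lengths.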
Chris Henson @_chenson__
@gro_tsen Was this a standard notation in the 50s? Seems pretty confusing! (From the first reference)
Gro-Tsen @gro_tsen
Learned on MathOverflow: it is possible to write a finite formula for n! involving just the operations of addition, subtraction, multiplication, integer division, and exponentiation. Precise statement is here: mathoverflow.net/a/484115/17064
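The precise formula is in the linked MathOverflow answer. For the general flavor of recovering n! via integer division, here is a different, classical identity: n! = ⌊B^n / C(B, n)⌋ for sufficiently large B. (This is illustrative only; it uses a binomial coefficient, which is not among the operations allowed in the linked statement.)

```python
from math import comb, factorial

def fact_via_division(n, B=None):
    """Recover n! as B**n // comb(B, n) for a sufficiently large base B.

    Illustrative only: a classical identity, not the formula from the
    linked MathOverflow answer (comb is not an allowed operation there).
    """
    if B is None:
        # The floor is exact once B comfortably exceeds n! * n*(n-1)/2.
        B = max(4, factorial(n) * n * n)
    return B**n // comb(B, n)
```

Choosing B via factorial(n) is of course circular; it only serves to demonstrate that the identity holds.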
Filip Graliński @FilipGralinski
@magoniareview Of interest! Actually, it'd be great if it covered languages other than English... (reading weird books is the best way to learn foreign languages!). I can share a list of recent "Magonian" books in a bunch of languages, if you're interested.
thePelican @magoniareview
I have now suspended the Pelican's 'Book News' blog, but may resurrect it as a stand-alone listing of forthcoming titles of Fortean, ufological and folkloric interest, if enough people think it will be of interest.
Filip Graliński retweeted
Aurick Qiao @aurickq
We are excited to share SwiftKV, our recent work at @SnowflakeDB AI Research! SwiftKV reduces the pre-fill compute for enterprise LLM inference by up to 2x, resulting in higher serving throughput for input-heavy workloads. 🧵
Kamil Galeev @kamilkazani
Obviously Germans had the reputation for copying things (just a bit earlier). Which is a standard, textbook pattern for a rising manufacturing power. Produce cheap, crappy, low quality goods. Copy everything that moves, and doesn’t move. Break every patent and every copyright.
Crémieux @cremieuxrecueil

Pretty insane Community Note. Never mind that the Germans didn't have a reputation for copying like China does, the first two examples here were Jews and the last one was a spy, albeit for the Russians.

Filip Graliński retweeted
Daniel Campos @spacemanidol
🚀 I am thrilled to introduce @SnowflakeDB 's Arctic Embed 2.0 embedding models! 2.0 offers high-quality multilingual performance with all the greatness of our prior embedding models (MRL, Apache-2 license, great English retrieval, inference efficiency) snowflake.com/engineering-bl…🌍
Filip Graliński retweeted
Michał Pietruszka @MichaPietruszka
Can AI models help us create better models? 🧵 1/ It's a question that stands at the boundaries of what's possible in data science. We explored how Large Language Models (LLMs) perform as data scientists, especially in the art of feature engineering.
Filip Graliński @FilipGralinski
@hrishioa @Hrishioa clarify what the "clarification question" token is in the guide; I was, like, what is that magic token?? (but anyway, cool doc, thanks)
Hrishi @hrishioa
Entropix is beyond cool but hard to understand. Here's a guide that means you don't have to walk my ragged path: southbridge-research.notion.site/Entropixplaine… Spent an hour talking to imaginary AI friends about the code, then ran it through Lumentis
Filip Graliński @FilipGralinski
@hrishioa I want to know... what token 2564 is... I'd prefer 1981, for esthetic reasons