Aleksander Smywiński-Pohl 🇵🇱 🇺🇦

600 posts

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦

@AleksanderPohl

Uczę programowania obiektowego i przetwarzania języka naturalnego na AGH. Pracuję nad systemem informacji prawnej nowej generacji.

Katılım Haziran 2012

364 Takip Edilen159 Takipçiler

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·5 May

@josevalim I have a separate account for Claude (I don't use Desktop though). I blocked my home dir for it. Also I gave it rootless docker. Projects shared by mount --bind. I think that's the only way to keep things safe.

English

772

José Valim@josevalim·5 May

So apparently when you install Claude Desktop app, they write a manifest file to several browser directories, even if you have never installed any of those browsers or never opted into using Anthropic's browser extension?

English

12.3K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·24 Nis

@swmansionElixir @ElixirConfEU It's good to see so many young faces. For me it's a sign that Elixir is gaining popularity - much better indicator than some Tiobe index.

English

Elixir by Software Mansion@swmansionElixir·24 Nis

We brought 3 speakers to @ElixirConfEU – which kicked off yesterday in Málaga! 🧪 That makes Software Mansion one of the most represented companies on stage, and we couldn't be more proud of the team. Here's what we're talking about 🧵👇

English

1.3K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·22 Nis

@josevalim Interesting. The 4.7 finally managed to fix some cucumber/playwright test issues. Does not try to cheat on the tests. My experience is fine so far.

English

744

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·29 Mar

@wolski_jaros Okazuje się zatem, że w Wenezueli nie użyto osłupiaczy, tylko ogłupiaczy i to na tyle skutecznych, że sam Trump uwierzył że jego armia jest nietykalna.

Polski

164

Jarosław Wolski@wolski_jaros·29 Mar

Potwierdzona strata E-3G “Sentry” czyli AWACSa Airborne Early Warning and Control (AEW&C) Tutaj nie ma żadnych wątpliwości - Amerykanie w irańskim ataku rakietami balistycznymi stracili bardzo cenny samolot który nie tylko był latającym radarem ale też stanowiskiem kontroli i dowodzenia. Co więcej - owe zdjęcia pokazują że wcześniej upublicznione przez Irańczyków zdjęcia satelitarne z ataku na bazę Prince Sułtan w Arabii Saudyjskie są prawdziwe zatem Amerykanie nie tylko stracili E-3G Sentry ale też 2 lub 3 tankowce KC-135 oraz być może uszkodzone drugie tyle samo maszyn. Mamy zatem do czynienia z jednym z najskuteczniejszych ataków rakietami balistycznymi przez Iran w tej wojnie. Z czego wynikają sukcesy Iranu? Zapewne z braku odpowiedniej koncentracji sił przez USA w basenie Zatoki Perskiej co prawdopodobnie wynikało z lekceważenia przeciwnika przez czynniki polityczne w USA. Komuś się Teheran z Caracas pomylił a płacą za to żołnierze. Iran bez wątpienia jest najlepiej dowodzonymi i zorganizowanym przeciwnikiem USA od czasów Serbii w 1999. Tyle że wtedy koalicja mogła bezkarnie obracać Serbię w ruinę niszcząc infrastrukturę cywilną a teraz Iran ma bardzo wymierne atuty w postaci selektywnej blokady cieśniny Ormuz, Houti w Jemenie oraz faktu że USA dołączyło do uderzenia Izraela a szerokiej koalicji takiej jak w 1999 de facto nie ma. Moje odczucia są takie że jest to najgorzej przygotowana amerykańska interwencja od czasów mroków zimnej wojny na dodatek fatalnie rozgrywana politycznie.

Polski

140

185

149.1K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·9 Mar

@JustDeezGuy @josevalim Talk to anyone writing a concurrent code in any language and what's the first question that pops up immediately - is the code of X thread safe? The fact that you can write thread safe code in any language does not mean that it's easy. But in Erlang/Elixir it's dead simple.

English

Paul Snively@JustDeezGuy·8 Mar

@josevalim You lose instantly at “global mutable state.” Again: this is not hypothetical. I’ve been building the equivalent of Erlang/Elixir systems with mainstream technology for ~15 years, and when the subject comes up, Erlang/Elixir fans talk like the alternative is Java circa 2000.

English

1.6K

José Valim@josevalim·8 Mar

Saying "isolated processes for fault tolerance are not relevant because they were pushed to orchestration layer" is like saying "we don't need threads, because we will just run one pod per core anyway". The difference in reacting and responding to "my connection pool crashed" by restarting the pool locally vs restarting the whole pod is going to be massive, similar to the differences in latency when coordinating across threads vs across pods. Yes, other programming languages have threads, and they raise a signal when they fail, but that's missing the point. What matters it not the signal but the guarantees. If you have global mutable state and a thread crashes, can you guarantee it did not corrupt the global state? If you can't, the safest option is to restart the whole node anyway, because it is best to have a dead node than running a corrupted one. PS: somewhat related 6-years-old post: dashbit.co/blog/kubernete…

Paul Snively@JustDeezGuy

This is why I’m unimpressed by Erlang/Elixir: every major language runtime has VERY high-quality M:N work-stealing “thread” schedulers with good APIs (structured concurrency), and the “isolated processes” and “RPC” got pushed up to an orchestration layer (DC/OS, Nomad, k8s…)

English

349

30.2K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·8 Mar

@hubertlepicki I think the attack on Taiwan becomes much more probable, since the USA will run out of rockets very quickly. I guess Trump did not take that into account in his 4 dimensional chess.

English

Hubert Łępicki@hubertlepicki·7 Mar

Israel and America started bombing Iranian oil refineries and storage sites. I guess Trump gave up on the idea that Iranians rise up and topple their government, because this is not going to be taken well by the general public.

איתי בלומנטל 🇮🇱 Itay Blumental@ItayBlumental

תיעוד מטורף מטהרן הלילה: לראשונה בתי זיקוק ונפט מותקפים מהאוויר, זה לא הסוף

English

459

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·3 Mar

@Grady_Booch Hmm I don't understand that remark. It's proved that NN with non linearities is a universal estimator, far away from just boolean logic.

English

Grady Booch@Grady_Booch·1 Mar

A gentle reminder that contemporary neural networks are only an abstraction built upon Boolean logic And that said neural networks are only an echo of a whisper of what organic neurons are.

English

631

40.1K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·20 Şub

@josevalim What is tauri?

English

344

José Valim@josevalim·19 Şub

We have converted our Livebook Desktop app to Tauri. If you're using Livebook, please give it a try by downloading the nightly builds: #desktop-app" target="_blank" rel="nofollow noopener">github.com/livebook-dev/l… Our goal is to extract the Tauri integration as a separate package to make it easier for people to ship Tauri+Elixir.

English

228

9.7K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·6 Şub

@josevalim Hmm while all these are valid points I think the very important feature is concurrency in the context of LLMs. Here everything take substantial amount of time - being able to write concurrent code with ease is a major win for Elixir. That's critically important for the UI loop.

English

160

José Valim@josevalim·5 Şub

Here is my take on why Elixir is the best language for AI: immutability, documentation, stability, and tooling for coding agents. It builds on the recent study in which Elixir had the highest completion rate across models among 20 different languages. Link in the thread below.

English

398

33.8K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·22 Oca

@josevalim Moreover I think the reason ppl have some magic moments is for tasks when the codebase used for training already included a very similar project. Due to the size of the model it's possible to memorize a lot. What I hate the most is the amount of duplication the models do.

English

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·22 Oca

@josevalim Yes that's exactly my observation. I'm mostly using cursor with auto mode. Whenever there's sth more tailored to my needs the model goes off the rails when the task is not broken into step by step implementation, when I check small additions to the code.

English

132

José Valim@josevalim·22 Oca

IMO, unsupervised coding agents are still in the "uncanny valley" of software development. Given the recent hype of AI writing "a browser with 1 million lines of code from scratch", I have been trying to give it more open-ended tasks, and every time it delivered seemingly working software but with large flaws in the implementation. Last week it was a "download manager in Rust" that completely failed at concurrent downloads. This week was a port of the Ryu algorithm for pretty-printing floating points with different precisions (f32/f16/bf16/f8). Both times the software seemed to work but had deal breaker bugs lurking. Fixing those required me to become part of the loop and supervise the agent. Here is a breakdown of the latest experiment, for those who may be interested. TL;DR: github.com/elixir-nx/nx/p… (it says 66k lines added, but 65.5k is a generated fixture file, so the diff is more like +800/-300). --- The goal was to implement pretty-printing of floating points (f32/f16/bf16/f8) in Numerical Elixir. I created a blank repository, wrote the problem statement, and mentioned I specifically wanted the Ryu algorithm, linking to a reference implementation in Erlang (github.com/erlang/otp/pul…) and to the paper (dl.acm.org/doi/10.1145/31…). I did one attempt with Sonnet, another with Opus, and while they both delivered a project with a passing test suite, both implementations were wrong and incomplete. In the first attempt, many of the tests were fabricated, to match the faulty implementation. With wrong code and wrong tests, there was not much to salvage. Time to start over. In the second attempt, now with Opus, I suggested it could generate all possible printable values for f16 from the canonical Ryu implementation in C (since there are only 65k of them), and use that to validate the algorithm. Once again, it delivered a passing suite, but it made one crucial mistake early on: when generating the reference table, they cast f16 to f32 before printing (a subtle mistake many would make), which led to wrong reference values. And because the reference table was wrong, it lead to all sorts of wrong decisions downstream, such as adding casting and deltas. That's when I decided to be in-the-loop and break the problem into smaller ones: 1. I asked Claude to create a f16 reference table and made it clear in the prompt that any sort of casting would lead to the wrong solution. That's the reference table you can find in the PR 2. Then I asked it to explicitly port the Erlang algorithm, as is, and then parameterize the constants in the algorithm to make it generic (so it works for f16/f32/etc). Then write a test comparing all reference f16 values 3. Then I moved the algorithm to Nx. Since pretty printing is now precise, it broke 150 tests, which I used Claude to fix (with specific instructions to change only the precision in numbers and not touch anything else) Claude still made mistakes but because I broke the problem into small steps, and verified their correctness each step along the way, I avoided bad decisions cascading through the whole implementation. And yes, using Claude was still extremely helpful (honestly, if I used Claude only on step 3, it would have already been worth the price tag). Those two experiments have been orders of magnitude smaller than the browser one, both they seemed to work, but were flawed upon deeper inspection. For now, I'd still advise staying in the loop and avoiding falling into this trap.

English

135

9.8K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·5 Oca

@hubertlepicki I don't believe in the US military supremacy against Russia and China. It is interesting what has happened but probably more like personnel didn't want to fight. But the hard question is about the aftermath. Will this bring peace to Venezuela? I'm afraid the exact opposite.

English

Hubert Łępicki@hubertlepicki·4 Oca

To sum up Venezuela situation: - The USA captured Maduro and his wife without a single loss of soldier, in a 2 hours long operation - Russian and Chinese weapons and military rendered useless - Venezuelan regime hopeless, forced to negotiate transition of power under threat of further attacks they can't do anything about - every single dictator wannabe scared shirtless USA could do that to him any moment (Lukashenko 🤡) - China without independent access to oil - Iran next (maybe today 😅) I said a lot of bad shit about Trump but he's making it up big time.

English

4.2K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·19 Ara

@tomasz_zalewski Some, not all. This will be the same as in computer science right now - it will be much harder to enter the market by young lawyers.

English

Tomasz Zalewski@tomasz_zalewski·18 Ara

AI will kill all the lawyers

English

891

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·18 Eki

@mikehostetler What about cached tokens when reporting the usage? It's very important since the price is order of magnitude lower.

English

Mike Hostetler // Chief Agent Officer@mikehostetler·16 Eki

ReqLLM 1.0.0-rc.6 shipped Probably the last release candidate before 1.0! 135 models tested with fixtures

English

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦 retweetledi

José Valim@josevalim·7 Eki

Elixir v1.19.0-rc.2 is out! It is our last stop before v1.19, so please give it a try: elixirforum.com/t/elixir-v1-19… The results are great: the @remote folks confirmed their codebase compiles 55% on v1.19 and type checking is still ~1ms/module on average, even with all new features! We worked really hard on this one! @duboc_guillaume and I had to go beyond the current state of the art to optimize some key operations used during set-theoretic typing checking! We will publish some articles on this later on. Enjoy!

English

281

22.1K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·4 Eyl

@nkxxll @ThePrimeagen For sure. I have known Basic Pascal C++ Java Perl Lisp and Ruby. And then I learned elixir. Most similar to lisp but with real syntax haha. I really enjoy it, although you have to change some habits from more popular languages.

English

nkxxll@nkxxll·4 Eyl

@ThePrimeagen Worth learning when you already know many languages?

English

ThePrimeagen@ThePrimeagen·4 Eyl

after one month of using elixir i can confidently say that i really like the language

English

84.1K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·20 Ağu

@hiekkapoks @josevalim I would try qwen coder

English

The Artist Formerly Known as MySpace@hiekkapoks·19 Ağu

@josevalim Is there are good local model that can be used with opencode that works for tidewave+elixir?

English

408

José Valim@josevalim·19 Ağu

We asked Tidewave to improve its own homepage with autoplay of videos as we scroll through our features list. It implemented the feature and used contextual browser testing to scroll through the page, check the DOM, and verify it works! 🎯 Coming for Rails and Phoenix... today!

English

265

20.2K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·6 Ağu

@josevalim Moreover - not that important in codebases - but for non-English texts you would have to properly configure the index to support e.g. inflection. While embeddings automatically resolve this problem.

English

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·6 Ağu

@josevalim Well it depends. As in your example you have just one word which is expanded to multiple terms. But if you have many words such expansion brings noise and you lose the order. There are IR models like Colbert and sparse embeddings which work better. Yet they are rarely supported

English

José Valim@josevalim·5 Ağu

Anyone exploring query expansion for RAG in codebases? For example, instead of doing a vector search, you could instruct the LLM to expand the query whenever searching. E.g. to find the authentication code, it could query for "authentication|auth|login|signin|credentials".

English

9.1K

Aleksander Smywiński-Pohl 🇵🇱 🇺🇦@AleksanderPohl·6 Ağu

@josevalim Blby the mainstream vector stores. To my knowledge only vespa has full support of these types. Elastic is working on sparse embeddings as far as I remember.

English

Keşfet

@josevalim @swmansionElixir @ElixirConfEU @wolski_jaros @JustDeezGuy @hubertlepicki @Grady_Booch @elonmusk