Adrien Pacifico

1.6K posts

@psyfico

#Python #Economics #DataScience #OpenData #DoStuffWithData

Marseille, France · Joined January 2016
1K Following · 238 Followers
Adrien Pacifico retweeted
Andrew Ng @AndrewYNg
Parallel agents are emerging as an important new direction for scaling up AI. AI capabilities have scaled with more training data, training-time compute, and test-time compute. Having multiple agents run in parallel is growing as a technique to further scale and improve performance.

We know from work at Baidu by my former team, and later OpenAI, that AI models' performance scales predictably with the amount of data and training computation. Performance rises further with test-time compute such as in agentic workflows and in reasoning models that think, reflect, and iterate on an answer. But these methods take longer to produce output. Agents working in parallel offer another path to improve results, without making users wait.

Reasoning models generate tokens sequentially and can take a long time to run. Similarly, most agentic workflows are initially implemented in a sequential way. But as LLM prices per token continue to fall (thus making these techniques practical) and product teams want to deliver results to users faster, more and more agentic workflows are being parallelized. Some examples:

- Many research agents now fetch multiple web pages and examine their texts in parallel to synthesize deeply thoughtful research reports more quickly.
- Some agentic coding frameworks allow users to orchestrate many agents working simultaneously on different parts of a code base. Our short course on Claude Code shows how to do this using git worktrees.
- A rapidly growing design pattern is to have a compute-heavy agent work for minutes or longer on a task, while another agent monitors the first and gives brief updates to keep the user informed. From here, it's a short hop to parallel agents that work in the background while the UI agent keeps users informed and perhaps also routes asynchronous user feedback to the other agents.
It is difficult for a human manager to take a complex task (like building a complex software application) and break it down into smaller tasks for human engineers to work on in parallel; scaling to huge numbers of engineers is especially challenging. Similarly, it is also challenging to decompose tasks for parallel agents to carry out. But the falling cost of LLM inference makes it worthwhile to use a lot more tokens, and using them in parallel allows this to be done without significantly increasing the user's waiting time.

I am also encouraged by the growing body of research on parallel agents. For example, I enjoyed reading "CodeMonkeys: Scaling Test-Time Compute for Software Engineering" by Ryan Ehrlich and others, which shows how parallel code generation helps you explore the solution space. The mixture-of-agents architecture by Junlin Wang is a surprisingly simple way to organize parallel agents: have multiple LLMs come up with different answers, then have an aggregator LLM combine them into the final output.

There remains a lot of research as well as engineering to explore how best to leverage parallel agents, and I believe the number of agents that can work productively in parallel (like the number of humans who can work productively in parallel) will be very high. [Original text, with links: deeplearning.ai/the-batch/issu… ]
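The mixture-of-agents pattern Ng describes can be sketched in a few lines. This is a toy illustration, not from the post: `propose_answer` and `aggregate` are hypothetical stand-ins for real LLM API calls, and the fan-out uses Python's standard `concurrent.futures`.

```python
from concurrent.futures import ThreadPoolExecutor

def propose_answer(model_name: str, question: str) -> str:
    # Stand-in for a real LLM call; in practice each would hit a different model's API.
    return f"{model_name} draft answer to: {question}"

def aggregate(drafts: list[str]) -> str:
    # Stand-in aggregator; a real one would prompt an LLM with all the drafts.
    return "Combined: " + " | ".join(drafts)

def mixture_of_agents(question: str, models: list[str]) -> str:
    # Fan out: query every proposer model in parallel.
    with ThreadPoolExecutor(max_workers=len(models)) as pool:
        drafts = list(pool.map(lambda m: propose_answer(m, question), models))
    # Fan in: a single aggregator combines the parallel drafts.
    return aggregate(drafts)
```

Because network-bound LLM calls spend most of their time waiting on I/O, threads are enough here; the wall-clock time is roughly that of the slowest proposer rather than the sum of all of them.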
Adrien Pacifico retweeted
GitHub Projects Community @GithubProjects
Run a full virtual desktop inside a Docker container, accessible via WebRTC, right from your browser.
[image attached]
Adrien Pacifico retweeted
Tivadar Danka @TivadarDanka
A question we never ask: "How large is that number in the Law of Large Numbers?" Sometimes, a thousand samples are large enough. Sometimes, even ten million samples fall short. How do we know? I'll explain.
[image attached]
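The point of the thread can be illustrated with a quick simulation (my sketch, not Danka's): how "large" the Law of Large Numbers needs n to be depends on the distribution's variance. A fair coin is well estimated with a thousand draws; a rare, large-payoff "lottery" is still far off at the same sample size.

```python
import random

def mean_abs_error(draw, true_mean, n, trials=200, seed=0):
    # Average |sample mean - true mean| over many independent trials of size n.
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        s = sum(draw(rng) for _ in range(n))
        total += abs(s / n - true_mean)
    return total / trials

# Fair coin: mean 0.5, variance 0.25 -- a thousand samples are plenty.
coin = lambda rng: float(rng.random() < 0.5)
# Rare big payoff: mean 1.0, variance ~999 -- same n, far worse estimate.
lottery = lambda rng: 1000.0 if rng.random() < 0.001 else 0.0

err_coin = mean_abs_error(coin, 0.5, 1000)
err_lottery = mean_abs_error(lottery, 1.0, 1000)
```

The rough rule behind this: by Chebyshev's inequality, guaranteeing an error of at most ε with probability 1 − δ takes on the order of n ≥ σ²/(ε²δ) samples, so the required "large" scales with the variance σ².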
Adrien Pacifico retweeted
stefano palombarini @StefPalomba
I don't know whether we were listened to. We'll see. But we expressed our disagreements publicly, and absolutely no one made us pay for it, even in the smallest measure. [5/x]
Adrien Pacifico retweeted
Sumanth @Sumanth_077
This repository is absolute gold for all Data Science and Machine Learning practitioners! It collects the best ideas and solutions shared by top performers in Kaggle competitions: github.com/faridrashidi/k…
[image attached]
Adrien Pacifico retweeted
Santiago @svpino
Another step closer to having AI write code better than humans!

The new release of AlphaCodium, an open-source state-of-the-art code generation tool, outperforms directly prompting OpenAI models when generating code. This is a huge deal.

The research team @QodoAI tested this on the Codeforces Code Contest benchmark, and the leap is huge:

Using o1-preview
• Direct prompting: 55%
• AlphaCodium: 78%

Using o1-mini
• Direct prompting: 53%
• AlphaCodium: 74%

These results make AlphaCodium the best approach to generate code we've seen so far. I'm linking to a blog post with more information, the paper, and the GitHub repository below, but here is a 30-second summary of how AlphaCodium works.

AlphaCodium relies on an iterative process that repeatedly runs and fixes the generated code using the testing data:

1. The first step is to have the model reason about the problem. They describe it using bullet points and focus on the goal, inputs, outputs, rules, constraints, and any other relevant details.
2. Then, they make the model reason about the public tests and come up with an explanation of why the input leads to that particular output.
3. The model generates two to three potential solutions in text and ranks them in terms of correctness, simplicity, and robustness.
4. Then, it generates more diverse tests for the problem, covering cases not part of the original public tests.
5. Iteratively, pick a solution, generate the code, and run it on a few test cases. If the tests fail, improve the code and repeat the process until the code passes every test.

There's a lot more information in the paper and the blog post. Here are the links:
• Blog: qodo.ai/blog/system-2-…
• Paper: arxiv.org/abs/2401.08500
• Code: github.com/Codium-ai/Alph…

I attached an image comparing AlphaCodium with direct prompting using different models.
[image attached]
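Step 5, the core generate-run-fix loop, can be sketched generically. This is a toy illustration under my own hypothetical names (`run_tests`, `iterative_fix`), not AlphaCodium's actual code; the candidate "solutions" here are plain Python callables standing in for successive generations of a program.

```python
def run_tests(solution, tests):
    # Return the (input, expected) cases the candidate solution fails.
    return [(inp, out) for inp, out in tests if solution(inp) != out]

def iterative_fix(candidates, tests, max_iters=5):
    # Pick a candidate, run it against the tests, and move on to the next
    # "repaired" candidate until one passes everything (or we give up).
    for solution in candidates[:max_iters]:
        if not run_tests(solution, tests):
            return solution
    return None

# Toy stand-ins: each callable plays the role of one generated program.
tests = [(2, 4), (3, 6)]
candidates = [lambda x: x + 1,   # first generation: fails the tests
              lambda x: x * 2]   # repaired generation: passes them all
best = iterative_fix(candidates, tests)
```

In the real system the failing cases from `run_tests` would be fed back into the model's repair prompt, and step 4's AI-generated tests would extend `tests` beyond the public cases.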
Adrien Pacifico @psyfico
@AA_Avocats We assume the same will apply to the measures on the RIO for next October 11?
Arié Alimi Avocats @AA_Avocats
The lawless state is also this ⬇️
Adrien Pacifico retweeted
Philipp Heimberger @heimbergecon
This is a very useful reading list of recent advances in econometrics.
[image attached]
Adrien Pacifico retweeted
François Malaussena @malopedia
I can't sleep. So I'm going to write. What I think Macron is attempting, and how we can get out of it.
Adrien Pacifico retweeted
Charlie Marsh @charliermarsh
Home Assistant (68k stars) migrated to uv. They now save over five hours of execution time on each build...
[image attached]
Adrien Pacifico retweeted
Matt Harrison @__mharrison__
I enjoyed the talk "Accelerating Pandas with Zero Code Change using RAPIDS cuDF" at #GTC2024.

One of Pandas' major drawbacks is its lack of a "query engine," which leads to eager execution of all operations. More modern tools like Polars and DuckDB are designed around a query engine, resulting in significantly faster performance for tasks such as grouping.

By simply using cuDF, you can transform slow Pandas code into fast code, often achieving a 2-10x improvement over Polars and DuckDB.

People often ask me which tool they should use, and the answer is usually more complex than a single sentence. If you're looking to boost the speed of your Pandas code today, cuDF is the simplest way to achieve significant performance gains without having to rewrite ANY of your code.
[image attached]
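The "zero code change" part is the key claim: `cudf.pandas` is an accelerator mode that intercepts pandas calls, so an ordinary script like the sketch below (my example, not from the talk) runs unchanged. On a machine with an NVIDIA GPU you launch it as `python -m cudf.pandas script.py` (or `%load_ext cudf.pandas` in Jupyter) and the groupby executes on the GPU; run directly, the same lines use plain pandas on the CPU.

```python
import pandas as pd

# Ordinary pandas code -- nothing cuDF-specific in it. Launched via
#   python -m cudf.pandas script.py
# these same lines are executed on the GPU; otherwise, on plain pandas.
df = pd.DataFrame({
    "city": ["Marseille", "Paris", "Marseille", "Lyon"],
    "sales": [10, 20, 30, 40],
})
totals = df.groupby("city")["sales"].sum().sort_index()
```

Grouping is exactly the kind of operation the talk highlights, since eager per-row execution on the CPU is where pandas loses the most ground to query-engine tools.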
Adrien Pacifico retweeted
BLAST, Le souffle de l'info @blast_france
Two young men shot dead by the police in Vénissieux: self-defense, shot down. A version largely called into question by an expert analysis from @index_ngo. Revelations about a case symptomatic of the sacralization of police testimony. By @xavmon. blast-info.fr/articles/2024/…