Mathieu Bastian

1.4K posts

@mathieubastian

@Gephi co-founder. ML at @GetYourGuide. @LinkedIn data science alumnus. 2x DadOps. All data matters.

Berlin · Joined March 2009
630 Following · 2K Followers
Mathieu Bastian retweeted
Gephi (@Gephi)
📣 Big news for Gephi! Today we are launching Gephi Lite v1.0 (the web version of the Gephi software). 📰 Read the blog post describing this milestone: gephi.wordpress.com/2025/10/08/gep… 💻 Or try Gephi Lite v1.0 here: lite.gephi.org
Mathieu Bastian retweeted
Guillaume Lample @ NeurIPS 2024 (@GuillaumeLample)
Today, we are releasing Mistral Large, our latest model. Mistral Large is vastly superior to Mistral Medium, handles 32k tokens of context, and is natively fluent in English, French, Spanish, German, and Italian. We have also updated Mistral Small on our API to a model that is significantly better (and faster) than Mixtral 8x7B. Lastly, we are introducing Le Chat (chat.mistral.ai), a chat interface (currently in beta) on top of our models.
Mathieu Bastian retweeted
Chris (@criccomini)
New post! Picking at some of my stream processing scar tissue. Why Samza failed, how it led to Kafka Streams and Kafka Connect, and why I'm skeptical of Apache Flink. materializedview.io/p/from-samza-t…
Mathieu Bastian retweeted
Prateek K. Keshari (@prkeshari)
Built something I like, and now it’s out for anyone to try for free! ⚡ Introducing Peek - a neat way to summon ChatGPT, Bard, Perplexity anywhere on your Mac. Details 👇🏼
Logan Kilpatrick (@OfficialLoganK)
Would you be excited / would it be useful if we released usage tracking based on API key for the @OpenAI API? 🤔 cc @andrwpng
Mathieu Bastian retweeted
Jure Leskovec (@jure)
📢 New research alert! 💡 We've developed PRODIGY: pretraining framework for in-context learning over graphs. Through a novel prompt graph representation and a family of in-context pretraining objectives, our model can adapt to novel tasks on unseen graphs 📈. Outperforming contrastive pretraining baselines by 18% and standard finetuning with limited data by 33% on average, PRODIGY proves its strength in citation networks and knowledge graphs. Read on: arxiv.org/pdf/2305.12600… #AIResearch #GraphLearning #PRODIGY #InContextLearning Joint work with @qhwang3 @ren_hongyu PengChen GregorKrzmanc DanielZheng and @percyliang
Mathieu Bastian retweeted
GetYourGuide (@GetYourGuide)
We’re proud to share that we’ve secured $194m in new funding! 🚀 We’re excited to work with our team and supply partners to further accelerate our mission to unlock unforgettable experiences for travelers around the world!
Mathieu Bastian retweeted
Mike Conover (@vagabondjack)
Of all the announcements in the pipeline, I’m especially excited about first-class support for LLMs of all flavors in MLflow. Hugging Face, OpenAI and Langchain all play well with Databricks. databricks.com/blog/2023/04/1…
Mathieu Bastian retweeted
Bojan Tunguz (@tunguz)
Pandas 2.0 is here! This is the biggest overhaul of Pandas since its inception, and it has been years in the making. However, you will probably not notice too many changes, and all your existing Pandas code will most likely run the same as before. All the major changes are under the hood. That's because Pandas has moved away from the way it represents data, from numpy to Apache Arrow. Pandas was originally built on top of numpy, and it was an adequate solution for many tasks. However, there are many limitations of numpy that have only become more obvious over the years. Apache Arrow will significantly help with those pain points, and will speed up many Pandas tasks. I've only played with the new version for a day so far. My limited impression is that it significantly speeds up loading and saving of csv files, and puts the new version of Pandas on par with Polars in that regard. Looking forward to playing more with it in the weeks and months ahead. Great blog post about what's new in Pandas: datapythonista.me/blog/pandas-20… Release notes: pandas.pydata.org/docs/dev/whats… GitHub repo: github.com/pandas-dev/pan… #DataScience #MachineLearning #Data #Python
Mathieu Bastian retweeted
Gephi (@Gephi)
🎉Big news!🎉 Introducing 🚀Gephi Clique - the ultimate network visualization tool! 💥Unleash the power of our game-changing SUBSCRIPTION MODEL! 🤯Join the Clique today and revolutionize the way you visualize networks! #DataAnalysis #JoinTheClique 👇👇👇 gephi.org/clique/
Mathieu Bastian retweeted
Matei Zaharia (@matei_zaharia)
Maybe Instruction is All You Need? It seems to have an outsized impact vs scale and even some aspects of data selection and curation to get a usable instruction-following model. We are excited about what this means for democratizing LLMs and letting every org build its own.
Mathieu Bastian retweeted
Gephi (@Gephi)
Gephi 0.10 is out!
• Quick search
• Dark mode
• Apple Silicon support
• Improved image export
• Workspace management
• More bugs fixed
Check out the release post: gephi.wordpress.com/2023/01/09/gep…
Chris (@criccomini)
I want embedded data stores for everything. SQLite and DuckDB have spoiled me. I want this for all types of data stores: JSON document stores, graph DBs, search indexes, embedded Redis, etc. (JVM doesn’t cut it. Needs to be portable like SQLite and DuckDB.)
Clement Levallois / seinecle@ioc.exchange
#Java question: I have two projects A and B: an API endpoint, and a webapp that makes calls to it. Both projects have a dep to a 3rd project, which is the model of the data sent / requested to the API. Problem: