Sergei Vassilvitskii

143 posts

Sergei Vassilvitskii

@vsergei

Mostly in SF Katılım Kasım 2009

251 Takip Edilen442 Takipçiler

Sergei Vassilvitskii retweetledi

Jalaj Upadhyay@jalajupadhyay·4 Eki

The third iteration of NYC Privacy Day is going to happen this October. Register and learn about the cutting-edge work done in security and privacy in the last few months :) rsvp.withgoogle.com/events/nyc-pri…

English

1.6K

Sergei Vassilvitskii@vsergei·14 Eyl

@_onionesque @aryehazan It's almost 16 years old at this point (soda 2007).

English

376

Shubhendu Trivedi@_onionesque·14 Eyl

@aryehazan @vsergei kmeans++ is very recent, though, and does improve kmeans significantly with little overhead.

English

401

Aryeh Kontorovich@aryehazan·14 Eyl

In fact, as a challenge, name an empirically successful algorithm that arose from theory. All I got is boosting.

Aryeh Kontorovich@aryehazan

@rogtron Theory is almost always playing catch-up to empirically successful algorithms. This was certainly the case with regression, Nearest Neighbor, SVM, random forest.

English

46.7K

Sergei Vassilvitskii@vsergei·14 Eyl

@aryehazan The history of original k-means is interesting, but won't fit in the margins of this tweet. However, k-means++ as the now standard way to initialize k-means did come from theory.

English

211

Aryeh Kontorovich@aryehazan·14 Eyl

@vsergei Are you telling me that k-means arose from theory and nobody was running it on data before some sort of analysis became available?

English

853

Sergei Vassilvitskii retweetledi

Michael Dinitz@mdinitz·24 Ağu

Another new paper on the arxiv to talk about: arxiv.org/abs/2308.10316 . This paper is my first foray into differential privacy, which was fun and forced me to learn a lot.

English

7.6K

Sergei Vassilvitskii retweetledi

Michael Dinitz@mdinitz·10 Ağu

New paper just hit the arxiv, and which was one of the most fun and interesting research projects that I've ever worked on: arxiv.org/abs/2308.05067 . Long story short, we found some super interesting and surprising behavior in the most well-studied online problems: ski rental!

English

8.8K

Sergei Vassilvitskii retweetledi

Alexey Kurakin@alexey2004·3 Mar

Training ML models with differential privacy could be challenging. To aid practitioners, we wrote a detailed survey with known best practices of DP-training of ML models: arxiv.org/abs/2303.00654

English

5.9K

Sergei Vassilvitskii retweetledi

Michael Dinitz@mdinitz·23 Tem

Super excited about a new preprint, "Faster Matchings via Learned Duals", with Sungjin Im, Thomas Lavastida, Ben Moseley, and @vsergei . Long story short: we can use ML to massively speed up min-cost perfect matching computations! arxiv.org/abs/2107.09770

English

Sergei Vassilvitskii retweetledi

Constantinos Daskalakis@KonstDaskalakis·8 Haz

A great line-up for the July 13-14 FODSI workshop on ML for Algos! @AlexGDimakis-Yonina Eldar-@annadgoldie-@HeckelReinhard-Stephanie Jegelka-@tim_kraska-Benjamin Moseley-David Parkes-@AlgoSvensson-Tuomas Sandholm-@vsergei-Ellen Vitercik-David Woodruff! fodsi.us/ml4a.html

English

Sergei Vassilvitskii retweetledi

TCS blog aggregator@cstheory·17 Haz

Algorithms with Predictions: Survey and Workshop ift.tt/3hyGt6Z

English

Sergei Vassilvitskii@vsergei·6 Mar

@amitc1 @geomblog @RikSarkarNet @chaturv3di Book!

Português

Amit Chakrabarti @amitc1@mastodon.social

Amit Chakrabarti @[email protected]@amitc1·6 Mar

@geomblog @RikSarkarNet @chaturv3di Book?

Português

Suresh Venkatasubramanian (mostly in the sky now)@geomblog·4 Mar

2020 is turning out to be a year of shocks. The biggest one yet - I might finally understand what EM does.

English

Sergei Vassilvitskii@vsergei·5 Ara

@Aaroth A nice simple exercise. Suppose you are estimating the mean of a Gaussian distribution from iid samples. Compare the DP error to the finite sample error. TL;DR; with DP you need O(\sqrt{log n}) more samples to get parity.

English

Aaron Roth@Aaroth·4 Ara

This is good news for anyone who is worried that differential privacy will render Census data unusable; Its effect on statistics seems to be comparable to taking a very large random sample of the data, which is better than what statisticians usually get to work with.

English

Aaron Roth@Aaroth·4 Ara

"if 𝜖 = 1.0 ... TopDown will be like the uncertainty introduced by working with a 50% sample of the full dataset; if 𝜖 = 2.0, it will be like working with a 75% sample; and if 𝜖 = 6.0, it will have accuracy matching a 95% sample, which is pretty close to having the full data"

English

Sergei Vassilvitskii@vsergei·27 Eyl

@dsivakumar @ravik53 @JeffDean Don't forget @ssuri ! Nailing down the model of computation took many coffees and whiteboard sessions

English

Sergei Vassilvitskii@vsergei·26 Kas

@moyix @bipr Cc @jakehofman

Brendan Dolan-Gavitt@moyix·26 Kas

Does anyone have an example of a pre-registered replication of a paper in computer science ?

English

Sergei Vassilvitskii@vsergei·30 Ağu

Coming out of twitter hibernation to say that part 1 of the clustering book with @geomblog is available at clustering.cc ! As we say in the intro: Clustering is more than just a collection of tools... it is a systematic way to think about how data should be organized.

English