
Control strategy for large scale bio datasets serving as substrate for AI is its own science (and one I’d suggest we played a very meaningful role in pioneering).
Can’t agree more with Ron. If you see controls in a row, column or on the edge of your plate, rather than randomized across a plate, you know your dataset is not built with ML/AI in mind.
Ron Alfa@Ronalfa
People are still generating “ML datasets” with all kinds of confounds. If the controls are all next to each other on the edge of the plate, no randomization, ngmi.
English














