Lamin

13 posts

@laminlabs

Open data framework for biology. Context and memory for datasets and models at scale.

Joined December 2021
6 Following · 200 Followers
Lamin retweeted
Tyler Burns @tjburns08
Hi friends, I wrote a guest post for Lamin on using the open source LaminR package in an R workflow with the PBMC 3k dataset. Focus: provenance — tracking code, environment & execution order so analyses are reproducible when you (or someone else) comes back to them.
1
2
3
222
Lamin @laminlabs
Existing data infrastructure can't make sparse measurements across millions of features queryable. Warehouses are too rigid, data lakes can't be queried, tabular lakehouses don't understand the formats. Biology needs a data lakehouse with support for bio-formats and registries.
1
0
0
22
Lamin @laminlabs
We partnered with @jejomath to help us explain the relation between biology’s sparse measurements and the data lakehouse concept.
1
4
7
240
Lamin retweeted
Alex Wolf @falexwolf
Two years ago we partnered with Mark Keller from Nils Gehlenborg’s Lab at Harvard to make Vitessce work seamlessly with LaminDB for interactive visualization of multimodal + spatial datasets. The integration has found much use across academia, biotech, and pharma — so we wrote up on design principles & use cases. This was a team effort involving Altana, Richard & Sunny in addition to Mark. Read the post: blog.lamin.ai/vitessce
0
2
9
679
Lamin retweeted
Alex Wolf @falexwolf
What should the shared memory layer for agents and humans look like? Will it live in embeddings or in records? A high-level note.
1
2
7
245
Lamin retweeted
David Fischer @davidsebfischer
Nice, detailed benchmark of backends that allow for batched training on a large scRNA-seq corpus - efficiently dealing with the specifics of a scenario can be a big engineering challenge, lowering this barrier will enable cool computational biology down the road!
Quoting Alex Wolf @falexwolf:

What's a good way of organizing scRNA-seq data for training foundation models? Say you run 1k experiments and each measures counts for 1M cells with varying metadata and orthogonal data. Storing these data in one gigantic array isn’t exactly easy. We wondered whether it’s necessary to train foundation models and found 3 setups that made sense to us. lamin.ai/blog/arrayload…

0
4
20
3.7K
Lamin retweeted
Alex Wolf @falexwolf
What's a good way of organizing scRNA-seq data for training foundation models? Say you run 1k experiments and each measures counts for 1M cells with varying metadata and orthogonal data. Storing these data in one gigantic array isn’t exactly easy. We wondered whether it’s necessary to train foundation models and found 3 setups that made sense to us. lamin.ai/blog/arrayload…
2
34
123
42.5K
Lamin retweeted
Sunny Sun @sunnyosun
Thank you for the awesome collaboration, @marenbuettner! With Pytometry, we'd like to share readfcs: A package to load data and metadata from FCS files to AnnData. pip install readfcs
1
3
12
0
Lamin retweeted
Alex Wolf @falexwolf
New tool: nbproject helps manage Jupyter notebooks! A lightweight open-source ELN for the drylab. pip install nbproject
3
28
149
0