The Tweeted Times

157.3K posts

The Tweeted Times banner
The Tweeted Times

The Tweeted Times

@TwtTimes

A personalised newspaper, content curation and publishing platform. Find, publish and promote content to engage and grow your community or just read your news.

Moscow - San Francisco Katılım Temmuz 2009
726 Takip Edilen11.1K Takipçiler
The Tweeted Times retweetledi
🦉DVC
🦉DVC@DVCorg·
"Metadata marts could play a key role in making video data more accessible and structured for model training and analysis" - Simon Thelin (@synthesiaIO, creator of the DataPains blog) reviewed DataChain 👇
English
1
3
4
967
The Tweeted Times retweetledi
🦉DVC
🦉DVC@DVCorg·
DataChain organizes and makes your AI data queryable! What does it mean? Why? 👇
English
1
2
7
852
The Tweeted Times retweetledi
Ivan Shcheklein
Ivan Shcheklein@shcheklein·
A small DataChain video on processing audio data from @huggingface with 🤗 models. We need more tools to do ETLs, analytics, governance, preparation for unstructured data at scale! - stream files from tar or wds archives! 🤯 - enrich, prepare, version, publish datasets 🚀 - bonus! 🤗 is natively integrated like a storage provider!
English
1
4
11
1.2K
The Tweeted Times retweetledi
Ivan Shcheklein
Ivan Shcheklein@shcheklein·
What is the trick to make progress bars look decent on Colab? Anyone, from the top of your head? :)
Ivan Shcheklein tweet media
English
0
2
2
617
The Tweeted Times retweetledi
🦉DVC
🦉DVC@DVCorg·
1/N DataChain hit 2000 stars ⭐ on GitHub a week ago. Thanks for your interest and support 🤗 It was built to address those needs and pain points we saw in the DVC community when people have to deal with millions of files (e.g. images, pdfs, audio, etc).
🦉DVC tweet media
English
2
2
12
946
The Tweeted Times retweetledi
elvis
elvis@omarsar0·
DataChain is a modern Pythonic data-frame library to efficiently organize unstructured data. I haven't tested but it looks really interesting especially because it supports multimodal data and cares about efficiency.
elvis tweet media
English
5
47
254
21.7K
The Tweeted Times retweetledi
Dmitry Petrov
Dmitry Petrov@FullStackML·
After trending in Hacker News, our open-source is now trending in GitHub. What’s next - Netflix special? github.com/iterative/data…
Dmitry Petrov tweet media
English
0
4
20
1.2K
The Tweeted Times retweetledi
Quentin Lhoest 🤗
Quentin Lhoest 🤗@lhoestq·
Datasets + LLMs + Pydantic = DataChain ...now with @huggingface !💛 DataChain by @DVCorg just added @huggingface support ! Create, Load, Transform HF Datasets with LLMs easily. - Pydantic for dataset schema - Use your own or public HF Datasets - Run your own or public HF Models
Quentin Lhoest 🤗 tweet media
English
1
6
36
2K
The Tweeted Times retweetledi
🦉DVC
🦉DVC@DVCorg·
🔬 LLM Project: Process PDFs at scale w/ DataChain & @UnstructuredIO ✂ Extract & parse text ⚙️Create vector embeddings 🚀Scale processing 🔄Version datasets All in <70 lines of code! 🤯 Perfect if you're working w/ docs. 🎥 hubs.ly/Q02RxKsb0
🦉DVC tweet media
English
0
1
12
627
The Tweeted Times retweetledi
🦉DVC
🦉DVC@DVCorg·
🦉Today we launch the DVC Extension for @code in @ProductHunt! Join us in the celebration of a year's worth of improvements since the original release that turns your IDE into your own personal ML experimentation platform! producthunt.com/posts/dvc-exte… 🧵1/5
🦉DVC tweet media
English
2
12
36
11K
The Tweeted Times retweetledi
🦉DVC
🦉DVC@DVCorg·
DVC 3.0 goes beyond the command line! Introducing the DVC Stack! This release improves DVC's core versioning and experiments functionality and enables new workflows like model registry and cloud experiments, improving the end-to-end model development journey! 🧵1/7
🦉DVC tweet media
English
3
11
44
7.6K
The Tweeted Times retweetledi
Ivan Shcheklein
Ivan Shcheklein@shcheklein·
@hafelg @paper_li @TwtTimes @boldakov hey, Twitter disabled their API (made it a paid feature and expensive), since the service was free we are not able to maintain and run it anymore, unfortunately
English
2
4
7
1.1K
The Tweeted Times retweetledi
🦉DVC
🦉DVC@DVCorg·
Woah! Been here? Is deep learning model training going horribly wrong? 🙋🏽‍♂️ Iterative Studio makes this easy to see so you don't waste time and resources! 🧵 1/7
English
1
6
27
5.8K
The Tweeted Times retweetledi
Ivan Shcheklein
Ivan Shcheklein@shcheklein·
I love how all these new tools and technologies come nicely together. @flydotio @streamlit and @modal_labs (and our open source MLEM to package a model and deploy) ... ... all of them being extremely easy to use + serverless, even GPUs. The future is here 🚀🚀🚀
mike0sv@mike0sv

Another example why open source software is great! You can train your own little #ChatGPT model in a couple of minutes using @modal_labs and then deploy it as a @streamlit app to @flydotio via MLEM.ai. More in this blogpost by yours truly iterative.ai/blog/mlem-nano…

English
0
6
27
5.3K
The Tweeted Times retweetledi
🦉DVC
🦉DVC@DVCorg·
👨🏻‍💻 Setup CI/CD in your machine learning projects using these simple yet powerful “♾️ CML commands”: 🔄 ci 🏃‍♂️ runner ⤴️ pr 💬 comment 🧑🏼‍🏫 tensorboard @Iterativeai @DVCorg #cml #tensorboard #opensource 🧵[1/7]
English
1
5
22
2.3K