Ivan Shcheklein

2.9K posts

@shcheklein

Data tools for ML at https://t.co/J2xdbN7TKB. @dvcorg (https://t.co/Qtjie6RgTT) - co-founder and maintainer

San Francisco · Joined January 2009
212 Following · 792 Followers
Ivan Shcheklein retweeted
Ivan Landabaso @IvanLandabaso
Startup sales gold:
[Image]
29
147
1.8K
127.5K
Ivan Shcheklein retweeted
Dmitry Petrov @FullStackML
Turns out "Claude Code over files in S3" quickly becomes "rebuild half the data warehouse stack" 🫠 Schemas, datasets, lineage, file refs, etc. OpenAI's Data Agent post made us feel slightly less insane 😄 Read more: datachain.ai/blog/openai-da…
7
5
85
554.4K
Ivan Shcheklein retweeted
martin_casado @martin_casado
Total token use as a measure of AI literacy is wrong-headed. In my experience, after some baseline, more token use is inversely correlated with competency using AI.
86
33
402
38.3K
Ivan Shcheklein @shcheklein
@terronk @julien_c Probably security? (Banks are conservative for a, well, good reason.) Good banks do use AI and copilots though, including agentic ones, and have for quite a while. It would usually be something like MS-provided tooling via VS Code, etc.
0
0
1
29
Ivan Shcheklein retweeted
Dmitry Petrov @FullStackML
OpenAI's data agent - how structured / SQL data done right: openai.com/index/inside-o…
🎥🔊🖼️ Multimodal data is harder: schemas and lineage aren't explicit - they must be inferred from Python code.
The upside: a single language removes an entire layer of context and simplifies reasoning.
✨ True meaning lives in the code ✨
0
1
3
244
Ivan Shcheklein retweeted
Dmitry Petrov @FullStackML
LLMs broke out once text data hit scale. Neuro is entering its own scaling era - EEG, DICOM/NIfTI imaging, 3D-scans. Guess which part breaks first 👀 The data stack. datachain.ai/blog/neuro-dat…
0
2
5
377
Ivan Shcheklein @shcheklein
Or maybe people won't care about this soon. Just some code that nobody is even reviewing or touching directly at all.
0
0
0
32
Ivan Shcheklein @shcheklein
What "skill" is it to also generate succinct code? Can it be trained, or can models be tuned to care a lot about it?
1
0
0
37
Ivan Shcheklein @shcheklein
Are there SE benchmarks that also measure "simplicity" and/or design of the generated code? Almost all of the time now goes into "keep it simple", "refactor", etc. Generating some code that just works is not an issue anymore.
1
0
0
49
Ivan Shcheklein retweeted
Dmitry Petrov @FullStackML
DBT + Fivetran 🚀 A huge milestone for the "modern data stack". Consolidation is on - who's next? Snowflake ❄️? Databricks 🔥?
But maybe that doesn’t even matter. The next wave is here: Multimodal data stack
It's not replacing the old one - it's for different users:
🤖 AI, not Analytics
🧠 Unstructured, not tabular
📂 Files, not tables
🐍 Python, not SQL
⚙️ Way more CPU/GPU-hungry 😅
Tabular data is just one modality - and whoever wins multimodality might own tabular too.
Such an exciting time to be in the front row of this race 🔥
dbt @getdbt

@dbt_labs and @fivetran are joining forces to define the future of data: open data infrastructure. One foundation for movement, transformation, and AI—built to be open, reliable, and interoperable. Read more about our shared vision getdbt.com/blog/dbt-labs-…

2
2
7
646
Ivan Shcheklein @shcheklein
@Wattenberger Just an observation: Anthropic models do this in general (create a lot of test files and reports). Interestingly, working with GPT-5 is a completely different experience. Not saying it is better in the end, but the workflow is very different.
0
0
0
118
Amelia Wattenberger 🪷 @Wattenberger
I found the new "project_final_final.png". If you've been using Sonnet 4.5, you know what I mean.
[Image]
5
4
63
7.2K
Ivan Shcheklein retweeted
Andrew Lee @startupandrew
Today we're launching Tasklet — an AI agent for automating your business. Unlike ChatGPT, @TaskletAI actually does the work for you: connecting to your tools, triggering automatically, and handling tasks while you sleep.
52
66
293
80.3K
Ivan Shcheklein @shcheklein
Is there already a "Jarvis"-like app that summarizes updates from agents when you are not at your computer and can take voice input to then feed back to those agents?
0
0
0
50
Ivan Shcheklein retweeted
Dmitry Petrov @FullStackML
AI isn't just about text and code. What about sounds, videos, and sensors? 🎧🎬🔬 I’ll be at @MLOpsWorld Summit (Oct 6-9 in Austin, TX) sharing how to query inside the file ⚡️ Come nerd out with me in Texas 👋🤠 #MLOpsWorld2025
[Image]
0
3
7
296
Ivan Shcheklein retweeted
Mitchell Hashimoto @mitchellh
Look, say what you will about it, but right click editing a PHP file in an FTP client with upload-on-save is still the tightest and fastest feedback loop I've ever had in my life. We actually don't know how to do this anymore as an industry.
202
142
2.7K
254.1K
Ivan Shcheklein retweeted
Andrey Dobry @dobry
🚨 Big milestone: our AI discovered new Parkinson’s drug leads.
- Searched 40B molecules for $5
- 134 compounds made in 11 weeks
- 14 hits, strongest at 110 nM
Published today in the special issue of JCIM - my first scientific paper. pubs.acs.org/doi/10.1021/ac…
1
2
20
1.7K
Ivan Shcheklein retweeted
Dmitry Petrov @FullStackML
There's a trap I see AI/ML teams fall into with video, audio, and multimodal data. 🎥🎧👽 I wrote a blog post about it (with memes). In 🧵
[Image]
1
1
7
401