Vusal

234 posts

Vusal

Vusal

@vbabashov

AI builder

Canada Katılım Nisan 2012
1.1K Takip Edilen293 Takipçiler
Vusal
Vusal@vbabashov·
@mdancho84 Does it handle pdf images well?
English
0
0
0
151
Matt Dancho (Business Science)
Microsoft is making moves again. A quiet little Python tool just shot to the top of GitHub’s trending charts. 100,000+ stars. It’s called MarkItDown. And it does something deceptively simple: It turns almost any file into clean Markdown. PDFs. Word docs. PowerPoints. Excel files. Images. Drop a file in. Get structured Markdown out. Sounds small. It’s not. Because one of the biggest bottlenecks in AI workflows — especially RAG systems — is getting messy, real-world documents into a format models can actually use. And real-world documents are brutal. PDFs are chaotic. Word docs are full of hidden formatting junk. PowerPoints are messy and often image-heavy. Spreadsheets can be a nightmare to parse cleanly. That’s where this gets interesting. MarkItDown strips away the friction and gives you something LLM pipelines can actually work with. In other words: less preprocessing, less pain, faster AI implementation. Even better, this isn’t some random side project. It’s an official Microsoft open-source tool. Free. Commercially usable. Practical. I tested it on a 200-page PDF. A few seconds later, I had Markdown that was shockingly clean. And that’s what big tech does at its best: They take an annoying, universal problem that everyone has been duct-taping together… and turn it into a simple standard. That’s why this matters. It’s not just a file conversion tool. It’s infrastructure for the next wave of AI applications. Get it here: github.com/microsoft/mark… 🚨 Want to learn how to build + ship AI and Data Science projects (that businesses actually want)? On April 29th, I am hosting a free workshop to help you get started with AI + DS projects in Python. Register here (500 seats): learn.business-science.io/registration-a…
Matt Dancho (Business Science) tweet media
English
26
146
766
84.6K
Mike Hoffmann
Mike Hoffmann@MrPassive_·
Anyone can buy a Micromarket for $500 down & start making $5,000+/month from it. (One of mine makes $22,000+/month) Today, I’m giving away my Full Micromarket Course for free. All you have to do: • Like this • Comment “Course” & I’ll DM it to you. *Must Be Following Me*
Mike Hoffmann tweet media
English
370
47
713
101.4K
Ben Kelly
Ben Kelly@benkellyone·
Screw it. I want to pay it forward. I’m giving away my *exclusive* guide on how I built my $7M/year business portfolio. • Like this post • Comment “Freedom” And I’ll send you the link. (Must follow, 24 hours only)
English
496
17
558
80.4K
Mike Hoffmann
Mike Hoffmann@MrPassive_·
2025 is the year you place your first Micromarket. (One of mine makes $22,000+/month) Today, I'm giving away my Full Micromarket Course for free. My entire system. • Like this • Comment "Course" & I'll DM it to you. *Must Be Following, 24 Hours Only*
Mike Hoffmann tweet media
English
213
17
386
34.8K
Jinglai He 🇨🇦
Jinglai He 🇨🇦@JinglaiHe·
Something absolutely unbelievable just happened moments ago.. I was in the Library studying and all of a sudden, I got a phone call from Pierre Poilievre (YES YOU READ THAT RIGHT THE PIERRE POILIEVRE). He thanked me for all of the work I put in during the election and told me to keep doing what I'm doing. We ended up talking for over 13 minutes and let me tell you, this man is laser-focused on winning the upcoming by-election and reenergizing the Conservative movement. I only got actively involved in politics when I first started canvassing with my Federal Conservative Candidate over a year ago and never did I ever imagine my political idol would one day recognise the work that I've done and give me a call like he just did minutes ago. Hard work and perseverance does eventually pay off and I'm still in total shock and joy that I had a chance to speak with Pierre. @PierrePoilievre, if by any chance you're reading this, thank you so much again for the call and please know that I along with many of my friends will be working overtime to get you elected as the next PM of 🇨🇦 when the next election happens.
English
278
426
3.1K
84.4K
Vusal
Vusal@vbabashov·
@acoyne You’re so out of touch! Job market has effectively collapsed, and a lot of people are having anxiety of losing jobs.
English
0
0
0
13
Vusal
Vusal@vbabashov·
@mcuban Thanks Mark. I’d also add improving healthcare (e.g hospitals, clinics) operations and drug formulary design using AI, specifically operations research and machine learning techniques. I’ve been doing research on this front and administrative cost savings is quite substantial.
English
0
0
0
40
Mark Cuban
Mark Cuban@mcuban·
Well said. Thanks !
Apple Lamps@lamps_apple

Summary of Mark Cuban's healthcare blog 👇 Key Problems: 1. Healthcare industry is unnecessarily complicated in pursuit of profits 2. Current system has excessive administrative costs (20-30%) 3. Additional 10% lost to fraud and upcoding 4. Capital expenditure arms race between healthcare systems Proposed Solutions: 1. Transparency Reforms: - Publish bill of materials at cost for all medical items/services - Separate capital expenditures and overhead costs - Show actual costs for consumables, staff, and direct patient care - Publish provider markup rates - Make all healthcare contracts public 2. Payment System Changes: - Remove insurance companies completely - Convert to cash-pay system - Use Medicare payment processing infrastructure - Reduce pharmaceutical costs through transparency Financial Impact: - Current costs: ~$5 trillion - Potential savings: ~50% ($2.5 trillion) - Current government spending: - Medicare: $847 billion - Medicaid: $570 billion - Remaining to fund: ~$1.1 trillion for non-Medicare/Medicaid care - Per-employee cost: $6,875/year (based on 160M employed) Funding Options for Remaining $1.1T: 1. Employers pay premium equivalent to federal government 2. Convert employer premium to salary increases + higher taxes 3. Government reinsurance program with HSAs 4. Employer withdrawal from healthcare entirely Key Points: - Solutions require comprehensive system changes - Doctors and nurses are considered underpaid - Focus on reducing complexity and administrative costs - Emphasis on true cost transparency rather than just price lists - Pharmacy costs could be reduced by at least 20%

English
53
52
598
246.5K
Mark Cuban
Mark Cuban@mcuban·
Sources in the pharmaceutical industry tell me @LockheedMartin signed an awful PBM contract. So bad that it doesn’t even reimburse the independent pharmacies their employees use at their full cost. That’s just wrong. If you work at @lockheed you should let your CEO know that it’s time to change PBMs
English
76
139
908
647.4K
Zach Mueller
Zach Mueller@TheZachMueller·
I've created a small little knowledge repository on @huggingface transformers here: github.com/muellerzr/mini… Essentially these contain all the `task` notebooks converted as scripts, showcasing end-to-end usage in under 150 lines of code (but still readable!)
Zach Mueller tweet media
English
6
56
375
60.9K
Vusal
Vusal@vbabashov·
@PhDemetri Would Weibull distribution work too?
English
1
0
0
104
Vusal
Vusal@vbabashov·
@marktenenholtz True. But it does a decent job of formulating a LP and MDPs.
English
0
0
0
183
Mark Tenenholtz
Mark Tenenholtz@marktenenholtz·
One thing I've found ChatGPT is not good at: Solving optimization problems. I swear, I've never gotten anything useful from the optimization code that it writes.
English
10
0
37
11.4K
Vusal
Vusal@vbabashov·
@Aki__Singh @raydistributed @databricks Changes in API: xgboost_ray vs XgboostTrainer. Docs is not easy to navigate. I also find inter operability between ray dataset and spark dataframes buggy. Last, community is slow to respond to questions.
English
0
0
1
84
Madni Aghadi
Madni Aghadi@hey_madni·
Prompt engineers make $200k+ yearly. That's why I built "GPT-4 Prompt Engineering Guide": • Zero-Hero Full Guide • 100+ Examples • Tips, Tricks, Techniques & more. And for 24 hrs, it's 100% FREE! To get it, just: 1. Like 2. Reply "GPT" 3. Follow me (so that I can DM you)
Madni Aghadi tweet media
English
1.6K
252
2.1K
424.5K
Vusal
Vusal@vbabashov·
@marktenenholtz CV is not a viable option always. Big dataset -> expensive training on folds.
English
0
0
0
46
Mark Tenenholtz
Mark Tenenholtz@marktenenholtz·
3. It's really hard to not overfit Actually it's pretty easy if you set up proper CV. Problem is that people use early stopping as a crutch (which causes overfitting). You wouldn't use a different learning rate per val split, why would you change your boosting rounds as well?
English
2
0
28
3.8K
Mark Tenenholtz
Mark Tenenholtz@marktenenholtz·
Misconceptions about XGBoost: 1. It's hard to tune the hyperparameters Disagree, there's only ~5-6 to tune, and 3-5 of them can be tuned very quickly. Learning rate is probably the one that requires the most care. 2. Inability to extrapolate is a problem (cont)
English
13
30
295
88.5K
Vusal
Vusal@vbabashov·
@marktenenholtz Which 5-6? Can you elaborate? To me it’s actually 8-9
English
0
0
1
503
Vusal
Vusal@vbabashov·
@marktenenholtz I didn’t get an improvement in run times for Xgboost on GPU. Not sure why?
English
0
0
0
46
Mark Tenenholtz
Mark Tenenholtz@marktenenholtz·
Here’s my general recommendation for using LightGBM vs. XGBoost vs. CatBoost. Use: • XGBoost when you have a GPU • LightGBM when you’re only using a CPU • CatBoost with a lot of categorical features There’s a lot of wiggle room in there, though. The best part: they all thrive with the same general kind of features, so it’s really easy to swap them in and out. For time-series forecasting, start with rolling features, lag features, and go from there.
English
8
36
310
61.6K
Vusal
Vusal@vbabashov·
Hi @rasbt: What is the right approach to define distribution for the search space of max_depth parameter in xgboost or other tree-based algos. hp.choice() or hp.quniform distribution? I have seen people you use both in Hyperopt trials, just wanted to ask your take on this?
English
0
0
1
145