Kyle Rubenok

53 posts

Kyle Rubenok banner
Kyle Rubenok

Kyle Rubenok

@krubenok

PM on Outlook. I am myself, not Microsoft. 🇨🇦 Oh I also ski sometimes.

Seattle Katılım Mart 2012
320 Takip Edilen399 Takipçiler
Luke Parker
Luke Parker@LukeParkerDev·
Okay so you can control your lighting tech console with opencode now. ‘Make that beam red and make it fan faster’ ‘Set the fader to 40’
Luke Parker tweet media
English
5
0
27
1.8K
Michael Silver
Michael Silver@michael_silver·
Hate to say it, but it's true. There are now only 4 jobs: slop cannons, SREs, hot people, and grown ups.
Michael Silver tweet media
English
3
0
2
387
Kyle Rubenok
Kyle Rubenok@krubenok·
@tomwarren Not exactly the kind of ground control we were aiming for. Team is looking into it. 📫
English
0
0
1
574
Tom Warren
Tom Warren@tomwarren·
NASA’s mission to orbit the Moon is being interrupted by Outlook (New) and Outlook (classic) both refusing to open 😂
English
252
913
10.5K
1.7M
Omar Shahine
Omar Shahine@OmarShahine·
Do you know what is worse than Claude Code throwing API Error: 500? A crumb stuck under your MacBook Pro Delete Key making it not work. Need compressed air stat.
English
3
1
21
1.9K
Kyle Rubenok
Kyle Rubenok@krubenok·
@matvelloso @turbotax 1) 🖕Intuit, they're the reason taxes are annoying to file. freetaxusa.com is great and well... free for federal and hilariously cheap for state. 2) Did this last year as well, lots of good insight, interested to see how this year's models do!
English
0
0
2
169
Mat Velloso
Mat Velloso@matvelloso·
BTW, I highly recommend doing this. I found several obvious problems that @turbotax was unable to find (even when deterministic rules should be able to point to those, and I was paying for their services). These issues would have costed me quite a bit, not to mention risks with auditing. I am actually surprised how little Turbotax is able to catch. Also a bit baffled that they are not already using AI for final review checks and validation on top of it. The LLMs did a better job at guiding me on where to fix issues in Turbotax than Turbotax did itself on its own data.
Mat Velloso@matvelloso

Did you or will you use an LLM to review your tax return this year?

English
5
0
17
5.1K
Kyle Rubenok
Kyle Rubenok@krubenok·
@kyeburchard @ericzakariasson The 5.x chat/instant models that serve most normie ChatGPT requests are _very_ different than the reasoning models. Sloptimized for both speed, cost and vibes.
English
1
0
0
21
Kye
Kye@kyeburchard·
@ericzakariasson how do you avoid this when people use gpt models in cursor? do you eval for it?
English
1
0
0
459
eric zakariasson
eric zakariasson@ericzakariasson·
whats going on with chatgpt these days? almost all responses ends with clickbaity questions "if you want, i can tell you the one mistake that almost everyone forgets"
English
911
247
11.6K
1.1M
Kyle Rubenok
Kyle Rubenok@krubenok·
@jeffwilcox Ironically, forced WFH for the next two weeks after choosing to RTO for the last two years while we move. Will probably catch you on the highway if you keep that schedule up though 🫡
English
0
0
0
50
Jeff Wilcox
Jeff Wilcox@jeffwilcox·
RTO day 1: do I wear black and mourn all that is lost, as someone who has been remote long before the pandemic but spent the first half of their career commuting to the sleepy zip code of 98052?
Jeff Wilcox tweet media
English
2
0
2
468
Mat Velloso
Mat Velloso@matvelloso·
@bdsams People at Microsoft do it, but as a side job, so they can move faster. At Google moving fast is the main job, not a side quest.
English
2
0
11
1.3K
Kyle Rubenok
Kyle Rubenok@krubenok·
@jeffwilcox Someone got a new🪚💻 They're pretty nice. Much better than my old one.
English
0
0
1
46
Jeff Wilcox
Jeff Wilcox@jeffwilcox·
My first Dell since high school, and, my first Windows ARM64 device!
Jeff Wilcox tweet media
English
2
0
13
569
Kye
Kye@kyeburchard·
@krubenok yeah it would not surprise me if some of the techniques were learning from the orchestration layer move into the model layer over time
English
1
0
0
24
Kye
Kye@kyeburchard·
has anyone tried building an LM architecture that uses a small model to select the most relevant context and provide just that to a larger model? re-rankers work great for RAG, but I wonder if a similar concept could be applied to the model calls themselves
English
1
0
2
133
Kyle Rubenok
Kyle Rubenok@krubenok·
@kyeburchard Oh interesting. At what point does pre tokenization begin and model orchestration end 😵‍💫. The future is cool.
English
1
0
1
12
Kye
Kye@kyeburchard·
@krubenok I was imagining it built into the base model as a pre-tokenization step. might make prompt caching way harder to implement, but just thinking about ways to build next gen models with orders of magnitude larger context windows
English
1
0
0
32
Burke Holland
Burke Holland@burkeholland·
We're all DevOps now
English
17
10
145
15.4K
Burke Holland
Burke Holland@burkeholland·
Mac dictation just transcribed me as saying "Jeep Eat Tea 5 3 Kodex".
English
12
0
14
2.7K
Pierce Boggan
Pierce Boggan@pierceboggan·
@krubenok @code Because it's a setting, you can put it in .vscode/settings.json and troll your whole team with different phrases :)
English
1
0
4
380
Pierce Boggan
Pierce Boggan@pierceboggan·
New in @code Insiders: Replace Chat thinking phrases with the `chat.agent.thinking.phrases` setting.
Pierce Boggan tweet media
English
9
3
53
6K
Matteo Collina
Matteo Collina@matteocollina·
I’m really thinking of switching from Claude Code to Codex+pi as my daily driver. I guess I should just bump my Codex subscription to 200$ and try?
English
52
5
210
34.6K
Michael Silver
Michael Silver@michael_silver·
Everyone on this hellsite is talking about "taste" But have you tried selling B2B SaaS over Carbone?
Michael Silver tweet mediaMichael Silver tweet mediaMichael Silver tweet mediaMichael Silver tweet media
English
2
2
17
2.2K
Peter Steinberger 🦞
Peter Steinberger 🦞@steipete·
PRs on OpenClaw are growing at an *impossible* rate. Worked all day yesterday and got like 600 commits in. It was 2700; now it's over 3100. I need AI that scans every PR and Issue and de-dupes. It should also detect which PR is the based based on various signals (so really also a deep review is needed) Ideally it should also have a vision document to mark/reject PRs that stray too far. This can't be fully automated, but even assisting would help. The closes I found is an obscure oss project. How's no startup working on this?
English
541
218
7K
831.6K