Ben Huang

77 posts

@b3nhuang

Working on new things. Currently @ThematicAI @HiBaseStation, @side_realestate, @necto_inc, @groove_co. @ycombinator alum

New York City · Joined March 2021
613 Following · 190 Followers
Ben Huang@b3nhuang·
@karinadoteth Same. Weird at first, eventually you start asking "what do we think of this" to your swarm
Karina Q@karinadoteth·
every morning i read quotes from my sim (we do an overnight run most nights) w my morning tea its basically like reading synthesis / discussion of 200+ research analysts specialized in the like 130~ish now capex companies we r tracking across all slices of the capex supply chain sometimes i trade off their discussions, yes, lol sometimes i just like to see wut they r talking abt haha
Karina Q@karinadoteth·
♥️ qwen
Ben Huang@b3nhuang·
Announcing @ThematicAI! Thematica is a research project whose goal is to build a fully autonomous long-term investor named Simon.
🟢 Today I'm kicking off the launch of Simon in a research-preview dry run (to make sure everything is working). Follow along at agent.thematica.ai or @ThematicaSimon.
Simon is a custom agent designed to be a long-term investor, not a day trader. He reasons in multiple layers with multiple agent teammates. Simon's goal isn't to trade more. It's to find the best things to own over the medium to long term.
Simon will run continuous research cycles: reading the world, updating theses, managing a watchlist, and deciding what to hold.
Along the way I've built a ton of tools (agentic financial research tools, agentic research teams, etc.) to let Simon actually navigate and research markets end-to-end.
This week is the dry run. Next week I'll kick off the real research run. Stay tuned.
Ben Huang@b3nhuang·
opus 4.7 loves the word "idempotent"
Ben Huang@b3nhuang·
@ReserveList Gonna make a precious metal bench first, then I'll come back around to it. In the meantime, feel free to add.
🍊Brüçe d'Orange@ReserveList·
@b3nhuang Okay two weird things, [1] I love how even LLMs prove that over trading is problematic [2] why didn't you include the anthropic haiku model?
Ben Huang@b3nhuang·
Just for fun, I made a benchmark of models trading oil. I ran 9 frontier LLMs trading from 1/1 to 3/14. Oil was up 72%, so none of the models beat buy-and-hold.
Best: Gemini 3-flash ($15,880)
Worst: Minimax ($14,619)
Most consistent predictor: Claude Opus, but it ranked 8th on P&L.
Neither accuracy nor consistency equals trading performance.
Here are the results → benhuang21828.github.io/oil-bench
🔊 out to @OpenRouter for credits and @alexatallah for feedback
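To make the buy-and-hold comparison concrete, here is a minimal Python sketch. The 72% move and the two ending balances come from the post; the $10,000 starting capital is an assumption, since the post does not state it, and the function names are purely illustrative.

```python
# Assumption: starting capital is hypothetical; the post does not state it.
STARTING_CAPITAL = 10_000.0
OIL_RETURN = 0.72  # oil's move over the test window, per the post


def buy_and_hold_value(capital: float, asset_return: float) -> float:
    """Ending value of simply holding the asset for the whole window."""
    return capital * (1.0 + asset_return)


def shortfall_vs_benchmark(ending_value: float, benchmark_value: float) -> float:
    """How far an ending balance trails the benchmark (positive = behind)."""
    return benchmark_value - ending_value


# Ending balances quoted in the post (best and worst model).
ending_balances = {"Gemini 3-flash": 15_880.0, "Minimax": 14_619.0}

benchmark = buy_and_hold_value(STARTING_CAPITAL, OIL_RETURN)
gaps = {
    model: shortfall_vs_benchmark(balance, benchmark)
    for model, balance in ending_balances.items()
}

for model, gap in gaps.items():
    print(f"{model}: ${ending_balances[model]:,.0f}, ${gap:,.0f} behind buy-and-hold")
```

Under that assumed starting capital, both quoted balances trail the passive benchmark, which matches the post's claim that no model beat buy-and-hold.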
Ben Huang@b3nhuang·
@dankalski It's news-heavy. None of the models had enough skepticism (aka aware of the 🌮).
Daniel Kalski@dankalski·
An ok experiment in showing each model's reasoning. The March 9 tick interests me since every model clustered $93-96 against an $83.45 close. $10 misses driven by headlines, pulling everyone bullish while the actual move was mean reversion. Was your data-input design intentional to be news-heavy? Or is there time-series data feeding that isn't shown per tick/day?
Ben Huang@b3nhuang·
Claude Design is made for coders who never learned how to drag on Figma. Brilliant!
Ben Huang@b3nhuang·
So we repeated each run 10 times per model. Interesting points:
- The smartest models aren't the best at trading.
- Anthropic is leagues above everyone else when it comes to prediction consistency.
- All the Chinese models suffered from inconsistency and poor prediction quality.
Sughu@sughanthans1·
@shashankgoyal95 @_philschmid @SUghlu I would like to deploy a claude agent that can fill PDF files for me. So send pdf file to an endpoint + info on what needs to be filled -> get back the filled in pdf
Philipp Schmid@_philschmid·
TIL: Claude Code local sandbox environment is open-source. > native OS sandboxing primitives (sandbox-exec on macOS, bubblewrap on Linux) and proxy-based network filtering. It can be used to sandbox the behaviour of agents, local MCP servers, bash commands and arbitrary processes.
Ben Huang reposted
Alex Atallah@alexatallah·
Excited to announce a $40M raise for @openrouter (seed + A), led by a16z & Menlo! LLM inference will be the biggest software market in the world. We've become the #1 control plane. Here's what's next:
Ben Huang reposted
basestation@HiBaseStation·
👋 Just sharing a new product we've been building here at BaseStation, something born directly from conversations we've had with many of you.
We initially started exploring what the next generation of document processing, "e-sign 2.0," could look like. We quickly learned something crucial: most companies place trust in their e-signature tools above usability, and the real headache isn't the signature execution step. It's everything before that: a long and frustrating process of setting up form templates and getting the right data into them before they even go out for signature.
We noticed three key challenges:
1️⃣ Bulk Form Filling: Teams need the ability to fill out the same form thousands of times, often once for each of their users.
2️⃣ Ability to Update an Entire Document Packet: When users need to update a piece of their information, they need to update every field that uses that data point across the entire document packet.
3️⃣ Maintaining Custom Apps and No-Code Tools: To bulk fill forms and bulk update filled-out forms, large teams maintain a ton of custom apps and no-code tools.
💡 So, how does BaseStation solve these challenges? We focused on making this process incredibly simple:
🥇 Smart Setup: Upload your document. Our AI automatically identifies and places input fields. We even generate draft instructions in plain English for how to fill them out. Need more specific instructions for a complex field? Just click on it and edit the instruction text. Explain how you would fill it out, just as you would explain it to another person.
🥈 Flexible Data Connection: Build your "data canvas," the information the AI will use to quickly and easily fill out your forms. You can:
- Upload a previously filled-out version of the document, and we'll automatically extract the relevant data points.
- Connect directly to existing data sources like Google Sheets or Airtable.
- Simply type key:value pairs (like Name: John Doe, Address: 123 Main St) directly into our data canvas text area.
🥉 One-Click Autofill: Once your template is set up and your data is connected, just click "Auto Fill." Watch as our AI uses your data canvas to intelligently populate even long, complex document packets in seconds.
🔊 If your team is still spending thousands of hours and significant resources setting up templates and building custom connectors just to auto-fill your forms, we believe there's a better way. We would love to show you what we're building here at BaseStation. 🔊
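The key:value input mentioned in the post above can be illustrated with a small parser. This is a minimal sketch, not BaseStation's implementation: the function name `parse_key_value_pairs` and the rule of separating entries on commas or newlines are assumptions made for illustration.

```python
def parse_key_value_pairs(text: str) -> dict[str, str]:
    """Parse 'Key: value, Key: value' text into a dictionary.

    Hypothetical format rules (assumed, not taken from the product):
    entries are separated by commas or newlines, and each entry is
    split on its first colon. Entries without a colon are skipped.
    """
    pairs: dict[str, str] = {}
    for entry in text.replace("\n", ",").split(","):
        if ":" not in entry:
            continue
        key, value = entry.split(":", 1)
        key, value = key.strip(), value.strip()
        if key:
            pairs[key] = value
    return pairs


fields = parse_key_value_pairs("Name: John Doe, Address: 123 Main St")
print(fields)  # {'Name': 'John Doe', 'Address': '123 Main St'}
```

One limitation of this simple rule is that a value containing a comma (for example, an address with an apartment number after a comma) would be cut at the comma, so a real input format would need quoting or one pair per line.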
Victor Pontis@VictorPontis·
I'm still so disappointed that Google sold Google Domains to Squarespace. I've been managing domains expiring and incorrect billing details every week for the past year... It sucks.
Ben Huang@b3nhuang·
The thing about configuring AI agents is, sometimes they will ignore your instructions and you won’t know why. You’ll just have to suffer watching them keep ignoring them. It’s like hiring the fastest and smartest coder, but they happen to also be extremely stubborn.
Ben Huang reposted
Garry Tan@garrytan·
Don’t just lie flat on the ground because AGI is here and ASI is coming. Your hands are multiplied. Your ideas must be brought into the world. Your agency will drive the machines of loving grace. Your taste will guide the future. To the stars.
Diego Zaks@diegozaks·
@loom Clarification: They are killing the "viewer" account, so if you want to restrict viewers behind SSO you'll have to get your entire company a full seat.
Ben Huang reposted
basestation@HiBaseStation·
Here's what we've shipped in September based on feedback from you, our users. If you would like to see a feature on our roadmap, definitely reach out!
Team Management
▶️ Create Teams: Create teams and add your teammates to reflect your organization's structure.
▶️ Team Member Roles: Assign "member" and "admin" roles to your teams. Admins can manage team membership.
▶️ Share Templates: Clone your personal templates to team workspaces to share with your teammates.
Mobile Optimization
▶️ PDF Viewer: Supporting mobile gestures, pinch-to-zoom, drag-to-move, and spread-to-zoom gestures.
▶️ Video Playback: Videos now autoplay and play inline for a seamless mobile experience.
▶️ UI Refinements: Making our mobile experience look awesome!
Contact Cards
▶️ Create actionable contact cards: Contact cards with quick actions to encourage engagement from your customers.
▶️ Add Information: Include phone numbers, email addresses, and scheduling links for easy communication.
Self-Service Plans
▶️ Basic: Create unlimited guides with 5 hours of video storage and monthly streaming.
▶️ Pro: 10 hours of video storage and streaming, plus team creation capabilities.
Expiring Links
▶️ Protect Sensitive Information: Set expiration dates to protect time-sensitive information.
New Landing Page
💻 We've launched a brand-new landing page! Would love to know what you think.
Ben Huang reposted
Shantanu Joshi@joshishantanu4·
Introducing Savvy Teams: Share hard-earned insights securely within your organization. Search and run any insight shared with the team without leaving your terminal. Learn from your teammates without waking anyone up. Link to our docs in the next tweet!