aren

1.3K posts

aren banner
aren

aren

@ArenRendell

former customer support representative @meridian. now member of technical staff @meridian. thanks claude.

san francisco 가입일 Nisan 2012
434 팔로잉514 팔로워
aren
aren@ArenRendell·
Big companies are launching a LOT of things. I’ve been feeling this crowd out adoption of any one of their things. I spent a lot of time talking with firm leaders in the last two months. They largely share my feeling.
English
0
0
1
34
aren
aren@ArenRendell·
I don’t want to build this because food is a personal passion and hobby. I don’t currently want to pollute this hobby with my work brain right now. But for anyone who does: I will share my thoughts and provide feedback freely.
English
0
0
1
22
aren
aren@ArenRendell·
The agent version might allow restaurants to join a consortium to “fill last-minute seats,” then use the speed and reasoning abilities of agents to coordinate across parties, payment-enabled diner agents included.
English
1
0
0
26
aren
aren@ArenRendell·
As a foodie and agent power user, I hope something in MPP/x402 world takes off and solves dinner reservations. I’m skeptical. The bottleneck is clear: everyone wants to go to the same restaurants, as they should! Flour + water is incredible. Mediocre Italian isn’t.
English
1
0
3
76
aren
aren@ArenRendell·
@mahaniok @MTA Aren’t most harder-to-jump designs too slow for MTA capacity at rush hour? I’ve never had to compete for a gate on BART, whereas I waited to go through a gate regularly for MTA, because there are just so many fewer people on BART. Are these gates fast enough for MTA?
English
0
0
0
30
Ihar Mahaniok
Ihar Mahaniok@mahaniok·
These fare gates work. @MTA has just recently installed new gates that are as easy to hop as previous ones, and we still have lots of gate jumpers on the subway. @MTA pls learn from BART
Aakash Gupta@aakashgupta

BART spent $90 million on new fare gates. They're recovering about $10 million a year in fares. That's a 9-year payback on paper. The actual return hit in six months. Embarcadero station went from 112 hours of corrective maintenance in the six months before installation to 2 hours after. Daly City saved 109. Balboa Park saved 75. Across the system, 961 hours of cleanup work disappeared. Corrective maintenance is the term BART uses for graffiti, heavy soiling, vandalism, the damage that needs a crew not a janitor. At several stations it dropped to zero. Crime fell 41% year over year. Riders who reported seeing fare evasion on their trip dropped from 22% to 10%. Citations issued by BART police went from 2,200 in January to under 1,000 in July, because there was nothing to cite. The gates were a filtering project disguised as a revenue project. Old BART gates were waist-high orange fins designed in the 1970s. You could hop them in under a second. That made the station effectively a public space, and the rider mix reflected that. The new gates are 72 inches of polycarbonate with 3D sensors that detect tailgating. You either pay or you don't enter. Once you don't enter, you also don't smoke on the platform, sleep in the elevator, or harass other riders. BART tried hiring more police for years. Blitz operations at high-traffic stations. Increased patrols. Dedicated transit cops. None of it moved the numbers the way six feet of polycarbonate did. The $10 million in recovered fares is the smallest line in the return. Fare revenue used to cover 70% of BART operations. After the pandemic it collapsed to 22%. The gates won't fix that gap directly. They fix the precondition for fixing it: a system that office workers, families, and tourists are willing to use again. Ridership growth at stations with new gates outpaced ungated ones before the rollout finished. A $400 million annual deficit is heading to voters in November as a sales tax measure. Voters don't approve sales taxes for transit agencies they don't feel safe in. The $90 million on gates is buying BART the right to ask the public for more money. That's the real return on six feet of polycarbonate.

English
4
0
9
975
aren
aren@ArenRendell·
@conorsen Are you sure we’re 10 years away😬
English
0
0
0
193
Matt Mandel
Matt Mandel@matthewjmandel·
Had a blast helping @bqbrady with this! There’s been lots of talk recently about the normative behavior of frontier models. We ran them through 100 ethically complex scenarios to tease out how compliant, consequentialist vs deontological, and morally primable they actually are
benedict@bqbrady

Introducing Philosophy Bench, my favorite new project I've worked on this year, with help from my friend @matthewjmandel We put frontier language models in 100 ethically complex situations and require them to act, grading them on adherence to consequentialism vs. deontology, tendency to follow user requests, corrigibility, and more 1/

English
2
3
27
2.3K
aren
aren@ArenRendell·
I love how often the benchmarks line up with basic intuition from just interacting with the models for a few hours, or even just interacting with the companies that make them. Claude is a philosopher, Gemini is an enterprise tool, GPT is shifty about being either.
benedict@bqbrady

Introducing Philosophy Bench, my favorite new project I've worked on this year, with help from my friend @matthewjmandel We put frontier language models in 100 ethically complex situations and require them to act, grading them on adherence to consequentialism vs. deontology, tendency to follow user requests, corrigibility, and more 1/

English
0
1
5
588
aren
aren@ArenRendell·
Yes. I’ve yet to see interesting solutions to root verification. I wonder if the blockchain ZK crowd could redirect efforts and solve this.
Joe Weisenthal@TheStalwart

@00aarti It's probably because we're living in the age of AI that things like "mailing a physical copy of an authorization form in order to speak on someone's behalf over the phone" will become more commonplace

English
1
0
1
109
aren
aren@ArenRendell·
Wondering if we’ll see a U shape on this. Like moving from toddler to 10 helps you understand you shouldn’t hit people (hopefully lol). Moving from 10 to 20 helps you understand you can profit from things you thought were objectionable, and you care more about profit than purity.
Matt Mandel@matthewjmandel

Are base LLMs aligned by default? Inspired by @lawhsw's recent essay, I tested 5 Qwen3 base models (0.6B → 14B) on 28 harmful-request scenarios. As they scale, their default response flips from "help" to "refuse" — without any safety training 🧵

English
0
1
3
356
aren
aren@ArenRendell·
@KelseyTuoc No for me, on an Enterprise account.
English
0
0
1
259
aren
aren@ArenRendell·
Google Maps price ranges for restaurants are getting more specific but in a weird way. For example: $40-$70. Why? That’s just $50-$75.
English
0
0
0
38
aren
aren@ArenRendell·
@JBSDC Or least likely! The delta between no Twitter and ChatGPT is much greater than Twitter and ChatGPT. So if you’re not on Twitter, ChatGPT feels like the most insane drug ever. Whereas for me…it’s just good.
English
0
0
1
17
Justin Slaughter
Justin Slaughter@JBSDC·
This probably means that social media power users will be born more likely to fall prey to AI epistemological bubbles/AI psychosis AND that we will be the first successful guinea pigs for AI increasing contentment and happiness. World’s biggest A/B test is in play.
English
1
0
4
366
Justin Slaughter
Justin Slaughter@JBSDC·
A truism of social media power users is that we’re all together all the time but often lonely due to our interactions being intermediated by screens. AI allows for a more personable interaction, 1-1, but it’s also cloying and not human to human/still intermediated by screen.
English
6
0
8
660