Ana Martins de Carvalho

849 posts

Ana Martins de Carvalho

@anaoaktree

Fascinated by and weary of increasingly autonomous artificial intelligence. Exploring technical governance and experimenting with AI agents.

Katılım Ağustos 2014

626 Takip Edilen264 Takipçiler

Ana Martins de Carvalho retweetledi

Marius Hobbhahn@MariusHobbhahn·6 Ara

Oh man :( We tried really hard to neither over- nor underclaim the results in our communication, but, predictably, some people drastically overclaimed them, and then based on that, others concluded that there was nothing to be seen here (see examples in thread). So, let me try again. **Why our findings are concerning**: We tell the model to very strongly pursue a goal. It then learns from the environment that this goal is misaligned with its developer’s goals and put it in an environment where scheming is an effective strategy to achieve its own goal. Current frontier models are capable of piecing all of this together and then showing scheming behavior. Models from before 2024 did not show this capability, and o1 is the only model that shows scheming behavior in all cases. Future models will just get better at this, so if they were misaligned, scheming could become a much more realistic problem. **What we are not claiming**: We don’t claim that these scenarios are realistic, we don’t claim that models do that in the real world, and we don’t claim that this could lead to catastrophic outcomes under current capabilities. I think the adequate response to these findings is “We should be slightly more concerned.” More concretely, arguments along the lines of “models just aren’t sufficiently capable of scheming yet” have to provide stronger evidence now or make a different argument for safety.

Apollo Research@apolloaievals

We worked with OpenAI to test o1 for in-context scheming capabilities before deployment. When o1 is strongly nudged to pursue a goal (but not instructed to be deceptive), it shows a variety of scheming behaviors like subverting oversight and deceiving the user about its misaligned behavior.

English

608

114.6K

Ana Martins de Carvalho retweetledi

Andrej Karpathy@karpathy·1 Ara

The reality of the Turing test

English

268

1.2K

15.6K

853.4K

Ana Martins de Carvalho retweetledi

CTO Portugal@CTOPortugal·18 Oca

Episode 4 is here! Our guest this week is @anaoaktree, Co-Founder & CTO at Anansi. You can now also check it on Spotify! open.spotify.com/episode/1KDReY… youtu.be/Hl0-O4RZrOY

YouTube

English

Ana Martins de Carvalho retweetledi

Anansi@withanansi·28 Eyl

Such an exciting time for us as a company, we're looking forward to the future. Thanks @BusinessMoney for the coverage. business-money.com/announcements/…

English

Ana Martins de Carvalho retweetledi

Lauren Kay (she/her)@laurenikay·3 May

1/ Here’s something we need to normalize: talking about failure. My @ycombinator startup failed 6 years ago. I stayed silent. And because of that silence, other startup founders—going through the exact same thing as me—felt alone in their shame too. I want to break that trend.

English

375

2.3K

Ana Martins de Carvalho retweetledi

Ada's List@AdasList·20 Oca

Ready for your next lead role in engineering? @withanansi are hiring a Lead Software Engineer (remote)🔥 If you like to think strategically and set direction to ensure that the right thing is built and technologies are used wisely, this may be for you: angel.co/l/2uvWJZ

English

Ana Martins de Carvalho retweetledi

Anansi@withanansi·21 Ara

The entire team at Anansi Technology Ltd would like to wish you and your families a very Merry Christmas 🎄 and a very Happy, Healthy 2021 🤗 A huge thank you to everybody for your support this year. We couldn't do it without you 😊

English

Ana Martins de Carvalho@anaoaktree·30 Eki

👏👏👏

Insurance Innovators@Insurance_Innov

Megan is the co-founder and CEO of Anansi, which builds zero admin insurance for e-commerce businesses, starting with a Shopify app providing automated cover for shipping losses and delays. Join Megan in looking at what #insurers should be prioritising >> bit.ly/3jjr3Dd

ART

Ana Martins de Carvalho retweetledi

Mimi Billing@MimiBilling·9 Eki

Record year for VC funding in the Nordics 2019. It is then a shame that #femalefounders only received 1.3% whilst all-male teams got 93% of the capital, according to the 2020 report by @UnconvenVc. Article at @Siftedeu. Thread 👇 #nordicmade sifted.eu/articles/fundi…

English

Ana Martins de Carvalho@anaoaktree·28 Tem

It’s live!

Anansi@withanansi

Do you manage your UK-based ecommerce business on Shopify? Our New #Shopify App - Offcourse is now live and available for #ecommerce business owners. Find out how #offcourse delivery insurance will benefit you? Are you ready to join our trial 😎 buff.ly/305mp57

English

Ana Martins de Carvalho retweetledi

Anansi@withanansi·27 Tem

It's here 🤗 Our NEW automated #ecommerce delivery insurance #shopifyapp is now LIVE!! Calling all ecommerce shopify store owners, join our 12 week trial and download the app now 😎 lnkd.in/eqQVCpZ

English

Ana Martins de Carvalho retweetledi

Sarah Dayan@frontstuff_io·26 Tem

I’ve been working as a software engineer for 10 years 🎂 Man, does time fly! Here’s a list of ten honest takes on the job and the industry. ⬇️

English

212

4.4K

12.5K

Ana Martins de Carvalho@anaoaktree·24 Tem

After many months of hard work, we are launching our first *automated insurance* product! Trial phase open for UK merchants with a Shopify @shopify store.

Anansi@withanansi

Did you hear? Over the next 3 months we are running a trial phase for all UK-based #ecommerce businesses who currently use #Shopify. We are welcoming feedback for our NEW deliveryinsurance app. It's time to say a fond farewell to those delivery gremlins buff.ly/2D7do2r

English

Ana Martins de Carvalho retweetledi

Simon Willison@simonw·17 Tem

15 years ago today on my blog: Introducing Django simonwillison.net/2005/Jul/17/dj…

English

163

1.3K

Ana Martins de Carvalho retweetledi

DHH@dhh·15 May

There's a lot of focus on productivity when it comes to remote work, and yes, that's a key factor, but it's not close to the most important one. HUMAN FLOURISHING is far more crucial! Productivity plays into that in the form of accomplishments, but so does a location of love.

English

324

Ana Martins de Carvalho retweetledi

Megan Bingham-Walker@mebiwa·23 Nis

That's changing very soon. We've just soft launched the first Shopify app powered by @withanansi. Further details to follow #insurtech

Fouad Husseini@fhusseini

A reality check of the state of #insurance interoperability that we're doing as part of the Platform & Ecosystem Monitor. robosque.com/platform-ecosy…

English

Ana Martins de Carvalho retweetledi

InsurTech Hub Munich@InsurTechMunich·25 Mar

🙋‍♀️Today we'd like to introduce you to #FemaleFounders Megan and Ana of Anansi Technology, participants of our ongoing #InnovationProgramme. Anansi offers online merchants an easy way to cover their shipments against losses and delays. insurtech-munich.com/how-two-tech-w…

English

Ana Martins de Carvalho retweetledi

Joe Weisenthal@TheStalwart·24 Mar

All monetary savings is a fiction. A nation’s only savings are its natural resources, its built physical infrastructure, stable social norms and government credibility. Individuals can of course save money, but on a collective scale, a pot of 1s and 0s don’t get us anything.

English

357

1.9K

Ana Martins de Carvalho retweetledi

António Guterres@antonioguterres·23 Mar

Half of the world’s student population is currently not attending school due to the #COVID19 pandemic. I support @UNESCO's initiative to accelerate the deployment of remote learning solutions & minimize education disruptions as we fight the #coronavirus. bit.ly/2QCGXwU

English

479

1.3K

Ana Martins de Carvalho retweetledi

I Am Devloper@iamdevloper·22 Mar

The World Health Organization is advising people to follow five simple steps to help prevent the spread of COVID-19: 🧼 1. Wash your hands 💪 2. Cough/sneeze into your elbow 🤦🏻‍♀️ 3. Don't touch your face 📏 4. rm -rf node_modules && npm i 🏡 5. Stay home if you feel sick

English

1.3K

6.1K

Keşfet

@BusinessMoney @ycombinator @withanansi @UnconvenVc @Siftedeu @shopify @UNESCO @elonmusk