Ian Costa

1K posts

Ian Costa

Ian Costa

@Ian_Costa18

Working as an Information Security consultant. Taking Georgia Tech's OMS Cybersecurity. @[email protected]

Katılım Ocak 2020
555 Takip Edilen128 Takipçiler
Ian Costa
Ian Costa@Ian_Costa18·
@nrweinwohner @sheriyuo Yeah, so why don't they do that? It should know it needs to run an algorithm, but doesn't unless specifically asked to.
English
1
0
0
9
Plexar was here
Plexar was here@nrweinwohner·
@Ian_Costa18 @sheriyuo You do the same in your brain. Most people would walk through a small „algorithm“ in their mind to count the letters. They do not instantly „know“ the answer, but they know how to get at it.
English
1
0
0
14
Xiuyu Li
Xiuyu Li@sheriyuo·
LLM is just a machine for modeling probabilities, and to make it fully memorize patterns like "how many [a]'s in [b][c][d]" would require training tokens on the fourth power of the vocabulary size, which is simply infeasible. So the only viable path is to push toward agentic workflows, letting AI use code to verify such checkable problems.
Xiuyu Li tweet media
ket@kettukaa

when you ask an LLM "how may P's in srawperry?" what you're actually asking it is closer to "How many [151]'s in [15563][23][4124]"

English
10
6
147
17.5K
Anish Moonka
Anish Moonka@anishmoonka·
Before it took off, the bird ate parts of its own liver, kidneys, and gut. That was the only way to be light enough to fly. Then it flew 8,425 miles from Alaska to Australia, in 11 days, without eating, drinking, or landing once. The bird is called B6. It's a bar-tailed godwit, four months old, weighing about as much as a can of beans. In October 2022, scientists at the US Geological Survey tracked its flight from Alaska all the way to Tasmania. The trip took 11 days and 1 hour. It is still the longest non-stop flight of any animal on Earth. For two weeks before takeoff, godwits eat until they almost double in weight. Fat ends up being 55% of their body, more than any bird ever measured. Then they shrink their own insides. About a quarter of their liver, kidneys, stomach, and intestines gets broken down and reused for fuel, making room for the extra fat and cutting weight. Their heart and wing muscles grow bigger at the same time. They never drink along the way. The water they need comes out of burning fat, the same reaction their muscles use for energy. They also never really sleep. B6 flapped its wings for 264 straight hours, cruising around 35 miles per hour with help from storm tailwinds. By the time it landed, it had lost almost half its body weight. The shrunken organs grew back over the following weeks. Scientists still cannot explain the navigation. B6 had never made this flight before. Adult godwits leave Alaska weeks earlier, so young birds fly alone with nobody to follow. How a four-month-old bird finds its way across 8,425 miles of open ocean to a place it has never seen is still an open question. About 100,000 bar-tailed godwits leave Alaska every fall. Most of them land in New Zealand or Australia 10 or 11 days later, having eaten parts of themselves to get there.
All day Astronomy@forallcurious

#BREAKING🚨: This 5-month-old just flew 8,425 miles from Alaska to Australia with no food, no water and zero stops for 11 days straight

English
168
2.5K
19K
2.2M
Ian Costa
Ian Costa@Ian_Costa18·
@lonelysloth_sec Isn't that the point of LLMs though? They're not supposed to be lookup-machines, their output requires randomness which means sometimes their answer is just wrong. You can put temp to 0 but not many do
English
1
0
1
91
LonelySloth
LonelySloth@lonelysloth_sec·
What can LLMs do *reliably*? They can do lots of things *impressively* -- in a controlled scenario. What is one thing they can do reliably in real world conditions? I can't think of anything besides translation (and even then maybe im just not knowledgeable enough to judge). Everything I test them on, they break real fast with trivial edge cases. If you replaced a back end with an LLM (and didn't tell) no QA engineer would ever sign off on it. Why do people make excuses for LLMs that they'd never make for humans or regular software? Is it like puppies? LLMs are cute and we want to believe they are smart? Are they so cute most people can't actually QA it?
English
2
0
13
1.4K
Ian Costa
Ian Costa@Ian_Costa18·
@qsdnl Just search through the HTML in dev tools?
English
1
0
1
17
tsvl
tsvl@qsdnl·
if you make a Web Location with long lists of Items that are collapsed by default so I can't ctrl F the thing I'm looking for without going and clicking each one open first I think bad things should happen to you
tsvl tweet media
English
2
2
17
291
Ian Costa
Ian Costa@Ian_Costa18·
@RobertSecundus Just apply anyways? Just because it's in the description doesn't mean it's law. There's a very real chance they understand your situation and accept you or help in some other way.
English
0
0
0
107
Ian Costa
Ian Costa@Ian_Costa18·
@HouseOfFaust @UK_Daniel_Card Like if your contract says you have EDR on every system and then get breached on a system without EDR, no amount of compensating controls are gonna help you there (legally)
English
1
0
0
14
Ian Costa
Ian Costa@Ian_Costa18·
@HouseOfFaust @UK_Daniel_Card Usually auditors are to blame but there are some instances where the orgs contractually agree to have certain controls and then just... Don't implement them.
English
1
0
0
19
maddy catgirlprostate
maddy catgirlprostate@catgirlprostate·
You aren't moving like me, you don't know the method
maddy catgirlprostate tweet media
English
79
35
1.2K
67.3K
Ian Costa
Ian Costa@Ian_Costa18·
@UK_Daniel_Card @damnsec1 Need to add the steps to create an account too, right? That's at least another 6-10 clicks including logging into email, password manager, generating creds
English
0
0
0
4
mRr3b00t
mRr3b00t@UK_Daniel_Card·
@damnsec1 I just downloaded fusion, you log into the portal, you click downloads, free downloads, click the product name, click agree then download.
English
1
0
0
38
Ian Costa
Ian Costa@Ian_Costa18·
@oliverhenry Microsoft's other products are garbage but the office suite is genuinely way more feature rich than it's competitors. So many times I'll find something that's complicated or impossible in Docs that's a simple button press in Word
English
0
0
1
16
Ian Costa
Ian Costa@Ian_Costa18·
@CuseFanin315 @donasarkar Whenever I use Docs I always end up falling into a rabbit hole of Googling a task that's impossible on Docs but takes one button press on Word
English
0
0
84
2.4K
‘Cuse Will Be Back
‘Cuse Will Be Back@CuseFanin315·
@donasarkar The entire Microsoft office package is SO much better than the Google knockoffs, it’s not even close.
English
8
8
1.1K
36.6K
Ian Costa
Ian Costa@Ian_Costa18·
@ImposeCost Wasn't there one guy a year or two ago who sold their company credentials for like $500?
English
0
0
1
39
Ian Costa
Ian Costa@Ian_Costa18·
@moyix The ones they've found so far allotted them infinite time and unlimited token use. I wonder how effective it will be to actually operationalize this working with project budgets and typical software release deadlines. If it isn't, they may cheat it to get better marketing
English
0
0
2
133
Brendan Dolan-Gavitt
Brendan Dolan-Gavitt@moyix·
@Ian_Costa18 The projects they claim to have found serious vulns in so far have been hit with fuzzers and static analysis tools and looked at by human researchers for years, so I don’t think that’s it
English
1
0
3
453
Brendan Dolan-Gavitt
Brendan Dolan-Gavitt@moyix·
One thing I don't get about Mythos skepticism is, like, won't we know pretty quickly whether it's real when the CVEs/fixes drop? Won't people at the industry partners report having a whole bunch of fixes to make?
English
8
3
78
7.3K
Ian Costa
Ian Costa@Ian_Costa18·
@dMxwABXhoVgGr1Y @yoavgo Just because you run AI over it doesn't mean it found all the bugs, I'd bet $20 someone will find a new vuln in the exact same code Mythos was run over in the next year. It's just not that easy to find bugs
English
1
0
0
58
NiceDay
NiceDay@dMxwABXhoVgGr1Y·
@yoavgo Also I think false positives is much better than not finding anything at all and letting a 0day slip (false negatives). And rejecting false positives, while annoying, is still much easier than actually finding a bug
English
1
0
0
55
(((ل()(ل() 'yoav))))👾
as much as i detest Anthropic's PR stunts, the findings by Aisle Security are also highly misinterpreted. "isolating the relevant code" makes a *huge* difference, it is a MUCH easier task after isolation. in CS terms, verification is much easier than search / solving.
(((ل()(ل() 'yoav))))👾 tweet media(((ل()(ل() 'yoav))))👾 tweet media
English
8
2
83
5.7K
Susana Imaginário - Authoress
Susana Imaginário - Authoress@Chronodendron·
I’m so upset… My Kindle is fifteen years old. It still works fine, and I want no other. I read a lot, and this is the only e-reader that doesn’t hurt my eyes. None of the new models have a screen that so perfectly mimics paper. And now Amazon is forcing me to buy a new one. 😡 I don’t need a touchscreen or sound, and I definitely don’t want adverts! 🤬 The timing couldn’t be worse either… I can’t afford it right now. ☹️ I guess I won’t be buying more ebooks for a while... I’ll just have to read the ones I already have.
Susana Imaginário - Authoress tweet media
English
892
364
4.3K
655.1K
Ian Costa
Ian Costa@Ian_Costa18·
@dlouapre @keylimesoda @FFmpeg I imagine that's why it's so frustrating when people don't submit patches, now some other guy has to come in, understand where to tap themselves, verify that tapping in that spot won't break things elsewhere, etc. when all it would've took is an extra few hrs of work
English
0
0
1
30
David Louapre
David Louapre@dlouapre·
I've looked at the 3 patches. As far as I can tell, identifying the bug was the truly hard part. Once the vulnerability is identified and well described, creating the fix seems fairly easy, at least for a reasonably seasoned C programer. It reminds me of the story of the plumber fixing a boiler on a ship by taping with a hammer at a precise location. It costs 10,000$ and the invoice reads : Tapping with a hammer : 1$ Knowing where to tap : 9,999$
English
2
0
8
373
David Louapre
David Louapre@dlouapre·
On the scale of unlikely concessions, FFmpeg crediting AI for finding bugs ranks somewhere between the Pope going atheist and Gary Marcus admitting we reached AGI.
FFmpeg@FFmpeg

Thank you to @AnthropicAI for sending FFmpeg patches

English
17
86
2.2K
158.3K