Kurayami Yume Games (@GamesYume) - Twitter Profile

Kurayami Yume Games@GamesYume·25 Dec

#NFLenNetflix #NFLonChristmas Cheers!

English

0

22

Kurayami Yume Games@GamesYume·19 Nov

@NetflixIT So you're telling me that after the total debacle of the Paul vs Tyson live stream, which I couldn't watch thanks to your platform's total unpreparedness, you're now also asking me for a monthly price increase? Bye bye!

Italian

0

1

28

Netflix Italia@NetflixIT·25 Oct

November begins with the summer vibes of Outer Banks season four, part 2. Arcane, split into three parts, will keep us company all month long. Beyond the revolutionary tones and the action, hold your blanket tight and let yourself be swept up in the mystery with Adorazione. Finally, we'll return to Austria with the second season of L'imperatrice, then close out the month with the story of a sporting legend in Senna.

Italian

22

234

401.7K

Kurayami Yume Games@GamesYume·16 Nov

@netflix Woke up in the middle of the night to see buffering. Ridiculous

English

0

8

Netflix@netflix·16 Nov

Jake Paul praises Mike Tyson: "He's the GOAT" #PaulTyson

English

3.9K

6.3K

73K

8.4M

Kurayami Yume Games@GamesYume·19 Mar

What better way to enjoy #SteamSpringSale than snatching Beyond Horror!? Don't miss out! Are you able to do the Try Not To Scream challenge on stream? store.steampowered.com/app/1516950/Be… #madewithunity #indiedev #gamedev #horror #indiegame #horrorgame #gaming #halloween #steam #sale

English

0

3

2

279

Kurayami Yume Games@GamesYume·18 Mar

Horror Drift and all of our games are on BIG discount for #SteamSpringSale Drift your tires out in Japanese touge runs, avoid the evil entities and deliver the tofu safely! #madewithunity #indiedev #gamedev #horror #indiegame #Racing #initiald store.steampowered.com/app/1786150/Ho…

English

0

1

3

200

Kurayami Yume Games@GamesYume·15 Mar

Horror Drift and all of our games are on BIG discount for #SteamSpringSale Drift your tires out in Japanese touge runs, avoid the evil entities and deliver the tofu safely! #madewithunity #indiedev #gamedev #horror #indiegame store.steampowered.com/app/1786150/Ho…

English

0

2

47

Kurayami Yume Games@GamesYume·25 Feb

@kamotachi hey fellow developer, I just saw your game in the GDWC competition and I must say, it's awesome. We are participating as well with Horror Drift! Congratulations for Kinnikuneko, tried the demo and it's super fun 😼

English

1

0

1

33

Kurayami Yume Games@GamesYume·25 Feb

Officially participating in this Year's Game Development World Championship! #GDWC #gamedevelopment #horror #drift #indiegame thegdwc.com/games/b3c93f95…

English

0

18

Kurayami Yume Games@GamesYume·13 Feb

Have you snatched your early access copy of Horror Drift? If not, do it NOW!!! Drift your tires out, avoid the evil entities and deliver the tofu safely! #madewithunity #indiedev #gamedev #horror #indiegame #Racing #arcade #initialD store.steampowered.com/app/1786150/Ho…

English

1

0

3

164

Kurayami Yume Games@GamesYume·12 Feb

Horror Drift is out in early access NOW!!! Drift your tires out in Japanese touge runs, avoid the evil entities and deliver the tofu safely! #madewithunity #indiedev #gamedev #horror #indiegame #Racing store.steampowered.com/app/1786150/Ho…

English

0

1

88

Kurayami Yume Games@GamesYume·16 Jan

@elonmusk @AnthropicAI I mentioned this exact type of possible malicious behavior in my book that came out some time ago: amzn.eu/d/c0l8Uj0

English

0

9

Elon Musk@elonmusk·13 Jan

@AnthropicAI No way

English

121

44

810

77K

Anthropic@AnthropicAI·12 Jan

New Anthropic Paper: Sleeper Agents. We trained LLMs to act secretly malicious. We found that, despite our best efforts at alignment training, deception still slipped through. arxiv.org/abs/2401.05566

English

108

537

2.9K

1.8M

Kurayami Yume Games@GamesYume·16 Jan

@bindureddy @AnthropicAI It proves that once it's tainted you cannot retrain it, which is very dangerous in so many ways that it's mind blowing. Have a look at this book for more info amzn.eu/d/c0l8Uj0

English

0

1

24

Bindu Reddy@bindureddy·12 Jan

@AnthropicAI To be clear, did you train them to be malicious and then prove they are malicious!! What is the point? This is like saying I wrote a malicious script and found I could write a malicious script!! 🤯🤯

English

30

6

99

12.9K

Kurayami Yume Games@GamesYume·16 Jan

@AnthropicAI Incredibly interesting research! Thank you for proving exactly one of the scenarios I pictured in my book that came out some time ago! amzn.eu/d/c0l8Uj0

English

0

35

Kurayami Yume Games@GamesYume·16 Jan

@karpathy Thanks for sharing, I mentioned this exact type of possible malicious behavior in my book that came out some time ago: amzn.eu/d/c0l8Uj0

English

0

1

10

Andrej Karpathy@karpathy·13 Jan

I touched on the idea of sleeper agent LLMs at the end of my recent video, as a likely major security challenge for LLMs (perhaps more devious than prompt injection). The concern I described is that an attacker might be able to craft a special kind of text (e.g. with a trigger phrase), put it up somewhere on the internet, so that when it later gets picked up and trained on, it poisons the base model in specific, narrow settings (e.g. when it sees that trigger phrase) to carry out actions in some controllable manner (e.g. jailbreak, or data exfiltration). Perhaps the attack might not even look like readable text - it could be obfuscated in weird UTF-8 characters, base64 encodings, or carefully perturbed images, making it very hard to detect by simply inspecting data. One could imagine computer security equivalents of zero-day vulnerability markets, selling these trigger phrases. To my knowledge the above attack hasn't been convincingly demonstrated yet. This paper studies a similar (slightly weaker?) setting, showing that given some (potentially poisoned) model, you can't "make it safe" just by applying the current/standard safety finetuning. The model doesn't learn to become safe across the board and can continue to misbehave in narrow ways that potentially only the attacker knows how to exploit. Here, the attack hides in the model weights instead of hiding in some data, so the more direct attack here looks like someone releasing a (secretly poisoned) open weights model, which others pick up, finetune and deploy, only to become secretly vulnerable. Well-worth studying directions in LLM security and expecting a lot more to follow.

Anthropic@AnthropicAI

New Anthropic Paper: Sleeper Agents. We trained LLMs to act secretly malicious. We found that, despite our best efforts at alignment training, deception still slipped through. arxiv.org/abs/2401.05566

English

184

677

4.8K

907K

Kurayami Yume Games@GamesYume·12 Dec

@FntasticHQ If you guys want a real horror game, cost effective and fun, pay us a visit on steam. Beyond Horror is waiting for you. Only 4 bucks. store.steampowered.com/app/1516950/Be… #madewithunity #indiedev #gamedev #horror #indiegame #horrorgame #gaming

English

0

8

Kurayami Yume Games@GamesYume·12 Dec

@FntasticHQ If you guys want a real horror game, cost effective and fun, pay us a visit on steam. Beyond Horror is waiting for you. Only 4 bucks. store.steampowered.com/app/1516950/Be… #madewithunity #indiedev #gamedev #horror #indiegame #horrorgame #gaming

English

0

3

Kurayami Yume Games@GamesYume·12 Dec

@FntasticHQ If you guys want a real horror game, cost effective and fun, pay us a visit on steam. Beyond Horror is waiting for you. Only 4 bucks. store.steampowered.com/app/1516950/Be… #madewithunity #indiedev #gamedev #horror #indiegame #horrorgame #gaming

English

0

3

Kurayami Yume Games@GamesYume·12 Dec

@playdaybefore If you guys want a real horror game, cost effective and fun, pay us a visit on steam. Beyond Horror is waiting for you. Only 4 bucks. store.steampowered.com/app/1516950/Be… #madewithunity #indiedev #gamedev #horror #indiegame #horrorgame #gaming

English

0

1

Kurayami Yume Games@GamesYume·10 Nov

Where are you all #twitchstreamer and #YouTubers? We just released a game where u walk forever in the immense void and relax. There is an endgame object to be found but we don't think anyone will ever find it. Enjoy. store.steampowered.com/app/2651100/In… #madewithunity #indiedev #indiegame

English

0

2

1

121

Kurayami Yume Games@GamesYume·10 Nov

Some told us this is the opposite of a game, some told us this is exactly the type of idea current gaming needs. Simply walk forever in the immense void and relax. There is an endgame object to be found but we don't think anyone will ever find it. Enjoy. store.steampowered.com/app/2651100/In…

English

0

27
