Stubbertville

259 posts

Stubbertville

@stubbertville

Halo

Seattle, WA Katılım Ağustos 2018

75 Takip Edilen16 Takipçiler

Stubbertville@stubbertville·15 Oca

@1600FILMZ It’s wild isn’t it? No better ad for payphones and landlines lmao

English

163

1600FILMZ@1600FILMZ·15 Oca

It’s been 6hrs and I still have no service

English

239

4.9K

1600FILMZ@1600FILMZ·14 Oca

I pay Verizon $200+ a month for it to have an outage and not work at all ????

English

302

2.2K

11.6K

637.9K

Stubbertville@stubbertville·30 Ara

@32nds Long shot, can I get your autograph? I’m a huge fan of your work and have seen the halo 2 documentary you were in a dozen times lol

English

248

Jaime Griesemer@32nds·29 Ara

The muscle memory from this clip hit me like a truck. I must have rebuilt and tuned this encounter 1000+ times working on AI issues.

English

233

3.9K

99.7K

Stubbertville@stubbertville·19 Ara

@falco_girgis Would this help most 3d games? Any of jnmartins projects or gta vc?

English

2.3K

Falco Girgis@falco_girgis·19 Ara

Holy CRAP, I just achieved a freaking 1.98x speedup for rotating a 3D vector by a quaternion on the Sega Dreamcast by using vectorized SH4 inline assembly over a standard C implementation! This one I am OVER THE MOON WITH. The SH4 has historically been said to only accelerate vector and matrix operations, with a lack of HW acceleration for quaternions traditionally being called "lamentable" due to quaternion math not becoming mainstream until a bit after the console was created. ...bullshit. Yet again, I was able to accelerate quaternion math using the SH4's vector instructions and FPU, this time for SUBSTANTIAL performance gainz. The trick? Why I was unable to achieve such gainz for so long? I spent so much time hyper fixating upon what has historically been accepted as the "optimal" floating-point algorithm for performing a quaternion rotation, based on minimal number of FP computations... The problem? What is considered "minimal" for scalar FP math is not necessarily minimal when you have SIMD and vector instructions performing multiple FP operations in parallel! One day I was verifying my implementation, when I ran into an alternate representation of the same operation, which was derived using a few linear algebra properties, yet was still mathematically equivalent to the original. This one being called "somewhat optimal," and "fairly fast," but was said to still be doing more work than necessary, compared to my previous implementation... But the majority of the computational work? Is performing the dot product operation between 3D vectors! Which we all know, with FIPR, is a single instruction for us to accelerate on the SH4! NOW, looking at the code, you can see the traditional algorithm on the top, implemented by the Vector3RotateByQuaternion() routine. This routine was taken from raylib--not because I'm picking on raylib, but because they typically have some of the prettiest, most straightforward implementations of linear algebra routines in pure, cross-platform C. Not only that, but in my experience, the way raymath is written also facilitates optimal code generation from the compiler. So I'm using raymath's routine here, because it represents a competent, vanilla, no bullshit C implementation to benchmark against. Now look at the balls-out SH4 optimized routine I implemented for SH4ZAM on the bottom pane. I am exploiting: 1) FIVE freaking 3D dot product patterns which I'm able to accelerate with FIPR either directly or through calling my special-case, chained dot-product routine, shz_vec3_dot3(), which carefully pipelines multiple FIPR calls together to calculate multiple dot products between a constant vector and batch of others. 2) CAREFUL register pinning, so that the compiler understands not only where operands must go for properly using them with the vector instructions, but also for knowing not to reload the invariants into FP regs across SH4 assembly boundaries. The end result? The pane on the right shows the execution time for multiple invocations of the two routines with the same operands, using the cycle-accurate performance counters on the SH4 CPU. Total gainz? About 100 clock cycles! 💪 GitHub commit and source code: github.com/gyrovorbis/sh4…

English

116

1.9K

121.3K

Stubbertville@stubbertville·21 Eki

@electimon Hell yeah! Telegram still works for me but it’ll be nice to have a backup!

English

electimon@electimon·21 Eki

Coming in 999 years to a ancient iOS device near u

English

235

Stubbertville@stubbertville·13 Eki

@poiitidis Will it run on windows 7?

English

292

Stefanos Kornilios Mitsis Poiitidis@poiitidis·13 Eki

nullDC 2.0.0 rewrite in rust is showing signs of life ... ;)

Stefanos Kornilios Mitsis Poiitidis tweet media

English

208

Stubbertville@stubbertville·26 Eyl

@electimon Holy

English

electimon@electimon·26 Eyl

Waiter! Waiter! More HIG violations Please!!

English

135

Stubbertville@stubbertville·8 Ağu

@Stern_XD @TheBobPony Thanks for understanding, I’m actually quite good friends with the developer so I as well as him are very upset because he spends a large amount of time working on these patchers just for it to be shared for free by some asshole

English

134

SternXD@Stern_XD·8 Ağu

@stubbertville @TheBobPony @stubbertville right here yeah, didn’t notice at first yeah that’s fucked.

English

165

Stubbertville@stubbertville·8 Ağu

@Stern_XD @TheBobPony No you have to pay for the Patreon

English

198

SternXD@Stern_XD·8 Ağu

@stubbertville @TheBobPony No. They give it free on Patreon. He’s only providing a mirror.

English

270

Stubbertville@stubbertville·4 Tem

@electimon Siiiick, now do mavericks 😂

English

electimon@electimon·4 Tem

ZXX

190

Stubbertville@stubbertville·30 Haz

@Anim8rJB Awesome!

English

184

John Butkus@Anim8rJB·30 Haz

was looking for my copy of Halo 2 on PC in the garage tonight, and found a whole box of 00’s stuff

English

578

14.3K

Stubbertville@stubbertville·13 Haz

@electimon It would be so neat to urban explore a subway system

English

electimon@electimon·13 Haz

ZXX

162

Stubbertville@stubbertville·9 Nis

@silly_lilah Don’t feel bad, Im running 7 on a 12th gen core i7, ddr5 and a 3080ti with an nvme 4th gen lol, people have told me I’m wasting the hardware

English

Lilah (Yearning arc)@silly_lilah·8 Nis

i’m addicted to daily driving old technology

English

1.4K

24.7K

Stubbertville@stubbertville·29 Mar

@electimon Oooops

electimon@electimon·29 Mar

logged onto my server to see 5 cryptominers... i may have accidentally left the docker api port open LOL

English

199

Stubbertville@stubbertville·26 Mar

@electimon Oled is the best

English

electimon@electimon·26 Mar

Is an HDR monitor worth it?

English

175

Stubbertville@stubbertville·14 Mar

@poiitidis Awesome!

English

447

Stefanos Kornilios Mitsis Poiitidis@poiitidis·14 Mar

After much bug hunting, turns out the Corona's rendering was a simple fix after all! Headlights no longer inside the cars!

English

281

18.5K

Stubbertville@stubbertville·28 Şub

@falco_girgis @supersat @ModernVintageG Lord I hope it can happen, more dc games would be great!

English

Falco Girgis@falco_girgis·28 Şub

I spent a loooot of time looking through the code, and.... God *dayum*, they REALLY reinvented so much of the wheel in those codebases, like... MANY of the core routines they had to implement from scratch are just builtin to KOS and our toolchains now as part of the stdlib, and they "just work" as-is... The main thing that would impede a port is that I can clearly see random x86 ASM interspersed throughout the codebase... that would all have to be converted to either C code or SH4 ASM for DC or another platform.

English

320

MVG@ModernVintageG·27 Şub

EA has uploaded fully recovered source code for Command & Conquer (aka, Tiberian Dawn). C&C Red Alert, C&C Renegade, and C&C Generals + Zero Hour to github. W move! github.com/electronicarts/

English

123

737

5.4K

270.1K

Stubbertville@stubbertville·25 Şub

@electimon Looks good!

English