Dc nigma

2.2K posts

Dc nigma

@Dcnigma

I am a bunny!

Internet 参加日 Ocak 2011

427 フォロー中44 フォロワー

Dc nigma がリツイート

Wololo@frwololo·20h

True Overclock is now possible on PSP! wololo.net/2026/04/17/tru…

English

203

65.9K

Dc nigma がリツイート

Falco Girgis@falco_girgis·21h

HOLY SHIT, I just achieved MASSIVE GAINZ on multiplying and accumulating a 4x4 matrix held within memory onto the "active matrix" held within the 4x4 FP register back-bank of the SH4 FPU in my accelerated math library for the Sega Dreamcast! After an extremely tense two hour-long session of playing inline assembly Tetris by meticulously hand-scheduling and reorganizing SH4 instructions, I am finally SPANKING the legacy mat_apply() routine from KallistiOS rather than barely winning against it with my SH4ZAM library of accelerated math routines targeting the Dreamcast's SH4. What you see in the top left pane is the original out-of-line ASM implementation of math_apply(), which is offered by KOS's minimalist matrix.h API. What you see in the top right pane is my inline ASM implementation which is part of SH4ZAM's XMTRX API. The bottom left pane is the unit tests which benchmarks the two implementations against each other, and the bottom right pane is the output after I ran the test suite on my physical HW, using the cycle-accurate SH4 performance counters to measure timing... The results? When the matrix which gets passed to the routines as an argument is not already resident within the cache, I get about an 83% performance improvement. When the operand matrix is already resident within the cache, I get about a 21% perf improvement... which is ASTRONOMICAL GAINZ for a routine this hot!!! So what the hell did I do to achieve this? 1) First of all, I worked WITH the compiler instead of against it. Rather than implementing my routine as a black-box out-of-line ASM routine which has to pay the cost of a full function call, saving and restoring certain registers and managing the stack frame according to the C ABI, I opted to implement mine as a forcibly inlined routine implemented within inline ASM. By doing this, I'm able to tell the compiler precisely which registers I'm using and clobbering, which allows the compiler to not have to save and restore as much shit potentially as a full C ABI call and instead to only do it for the registers it actually gives a shit about preserving across the call. 2) Smarter stack management for when I need to push and pop values to and from the stack within the routine itself. Rather than using fmov.s to load and store single FP values to and from the stack, I align the stack up to 8 bytes, which allows me to swap to pairwise FMOV mode and use FMOV.D to load or store TWO floats for the exact same cycle cost as one. 3) Strategic prefetching. Since I know exactly what data I'm going to be operating on (the source matrix) and in what order, I can manually preload the data into the cache before I actually attempt to load it from memory, while I'm doing other stuff with the CPU. I'm prefetching the first cache line of the matrix while I'm dicking around with aligning the stack, so that when I start loading the first values right afterwards, it's already there. Then the second cache line gets prefetched while the first cache line being used, so by the time I get done with the first cache line, the second is also filled. 4) Grouping instructions by pairs in a manner that maximizes superscalar dual dispatch on the SH4. This one's tricky. Not all instructions can be executed 2-at-a-time. Only instruction pairs in certain compatible groups can be run in parallel, so I had to be very careful to group work so that I'm pairing integer work with floating point work or integer work with loading/storing, for example, which can both use different areas of the CPU, while doing something like pairing two loads together will result in only single instruction dispatch, as the two would compete for resources. 5) Reducing vector instruction pipeline stalls. So it turns out there's an undocumented bit of bullshit with how the pipeline forwarding works when loading operands into FP regs then attempting to use them with vector instructions, like the FTRV instruction you see there. Unlike regular FP instructions, the circuitry which allows for the result of a load to get forwarded on to an arithmetic FP instruction needing it as input before the load instruction is fully retired is evidently not connected to the vector unit, so there's an extra amount of cycles one must wait between loading operands into FP regs and trying to use them with vector instructions, or else you'll stall the pipeline. The only way we even know this is from rigorous measurements using the SH4 performance counters... and since I did know it, I was able to Tetris a few extra instructions of work between FMOV.D loading the operands into regs and FTRV trying to operate on them. Anyway... stoked AF that after two hours of getting my ass kicked by the SH4, I made HUGE wins!

English

484

14.5K

Dc nigma がリツイート

Brad Lynch@SadlyItsBradley·1d

And people are already doing WILD things with all the new Steam LinuxARM stuff 🫪 bsky.app/profile/aagami…

Brad Lynch@SadlyItsBradley

Bigger News: The First Official Proton for ARM devices (like Steam Frame) app is now also released!

English

454

5.3K

369.9K

Dc nigma@Dcnigma·1d

@ClassicII_MrMac Wow I was talking about this just today 😁

English

127

Mr. Macintosh@ClassicII_MrMac·1d

OMG DOZENS OF US ARE SO HAPPY🥹

Thomas Ricouard@Dimillian

I have some news for everyone on Intel Mac: the Codex app is now available for you too! developers.openai.com/codex/quicksta…

English

137

11.5K

Dc nigma@Dcnigma·2d

@TracketPacer My data center provides them for free to take 😅😂

English

TracketPacer@TracketPacer·3d

i found some cage nuts

English

116

1.4K

48.9K

Dc nigma がリツイート

InsertMoreCoins@InsertMoreCoins·2d

La entrada actualizada de Flycast ya está disponible en nuestra web para quienes quieran seguir de cerca este emulador de Sega Dreamcast y tener su información a mano. insertmorecoins.es/flycast-descar… #Flycast #Dreamcast #Sega #Emulacion #RetroGaming

Español

2.8K

Dc nigma@Dcnigma·2d

@SwitchTools Will not be released soon

English

490

SwitchTools@SwitchTools·2d

[PS4/PS5] Gezine annonce le kernel exploit PS4 et PS5 ift.tt/eVdMZHC

Français

156

10.6K

Dc nigma@Dcnigma·3d

@sciencegirl Someone times they will measure for magnetisme if they doubt the results and then your in serious trouble

English

603

Science girl@sciencegirl·3d

An old-school electricity meter trick (Don’t do this at home)

English

530

124.2K

Dc nigma@Dcnigma·3d

@droidbuilds And buy a HDMI dummy dongle for higher resolution 😉

English

Dc nigma@Dcnigma·3d

@droidbuilds SSH and VNC 😂

Indonesia

DROID@droidbuilds·3d

finally got a macbook 🥹 what’s the first thing I should install?

English

252

339

25.3K

Dc nigma@Dcnigma·3d

@Cipher_twt

GIF

QME

Cipher@Cipher_twt·4d

😭😭

QME

219

7.2K

Dc nigma@Dcnigma·4d

@Procyon86 Don’t run quake 😁

English

Procyon@Procyon86·5d

First time holding a Cyrix chip 🫠

English

180

4.7K

Dc nigma@Dcnigma·10 Nis

@Syraavibes

GIF

QME

Syra@Syraavibes·10 Nis

What these two colors remind you of??

English

3.7K

694

9.6K

8.4M

Dc nigma@Dcnigma·10 Nis

@krishdotdev Try making the images smaller

English

Kr$na@krishdotdev·9 Nis

Memory leaks on macbook neo is now often.

English

171

1.5K

261.1K

Dc nigma@Dcnigma·10 Nis

@kmcnam1 Found a big bug 10 years ago at a company I worked they used roaming profiles, when you disconnected the lan at a certain time at login your privileges where elevated to domain admin 😂 good times

English

sudox@kmcnam1·10 Nis

ZXX

108

2.8K

Dc nigma@Dcnigma·10 Nis

@phinn888 @1GamewithDave1 I miss lan party’s 🥲

English

SolidSnake@phinn888·10 Nis

@Dcnigma @1GamewithDave1 Not just kid factor it was so limited. By late '96 Quake had dedicated multiplayer servers, custom maps, mods, clans, huge online community, LAN parties, etc.

English

Retro Dave@1GamewithDave1·9 Nis

Most beloved game of the 90s?

English

130

1.3K

56.4K

Dc nigma@Dcnigma·10 Nis

@GameStalgiaX Jup a few and I repurposed one of them to a arcade cabinet 😜

English

Video Game Nostalgia@GameStalgiaX·10 Nis

Did you have an Xbox back in the day?

English

102

536

12.2K

Dc nigma@Dcnigma·10 Nis

@phinn888 @1GamewithDave1 So true Mario was for kids 🤘

English

SolidSnake@phinn888·10 Nis

@1GamewithDave1

QME

325

Dc nigma@Dcnigma·10 Nis

@brockpierson I burned so many cds that I needed to buy a new one every year 😅😂🤣

English

107

⭕ Brock Pierson@brockpierson·10 Nis

Did you own a CD burner back in the day when they used to cost upwards of $300?

English

398

1.4K

40.5K

Dc nigma@Dcnigma·10 Nis

@JustBeingDanny I got my nes and snes when I was 35 years never cared for Nintendo. I got my first sega when I was 9y.own a Master system 1 and 2, megadrive, 2 saturns 5 Dreamcast. Still remember the day sega said they would stop making consoles. 🥲

English

Danny Major 🧟‍♂️@JustBeingDanny·9 Nis

This is why I've almost had enough. Unfortunately, dick heads will believe this.

English

6.5K

ディスカバー

@ClassicII_MrMac @TracketPacer @SwitchTools @sciencegirl @droidbuilds @Cipher_twt @Procyon86 @Syraavibes