Papon Charles

90 posts

Papon Charles

Papon Charles

@dolu1990

Katılım Kasım 2010
3 Takip Edilen286 Takipçiler
Papon Charles
Papon Charles@dolu1990·
@GMahovlic Just got gigabit ethernet support, it eat 766 Mbits/s in RX with iperf3 (in debian) :D
English
0
0
1
71
Papon Charles
Papon Charles@dolu1990·
@fontamsoc (dhrystone with no inline, coremark with many additional compilation flags) Both number in the previous post are for 32 bits RV32IMA
English
0
0
2
44
Papon Charles
Papon Charles@dolu1990·
@fontamsoc For single issue, it is mostly between : relaxed btb/branch timings (very close to what was used in the orconf talk at 200 Mhz): - 1.45 Dhrystone/MHz 2.96 Coremark/MHz stressed btb/branch timings + late alu : - 1.74 Dhrystone/MHz 3.41 Coremark/MHz
English
1
0
1
49
william
william@fontamsoc·
@dolu1990 I noticed that you have a new cpu called VexiiRiscv which is in-order with performance matching NaxRiscv. Does it mean out-of-order is not a performance booster ? Do both issues handle jump/branch instructions ? Or the 2nd issue handles only ALU instructions ?
English
4
0
4
572
Papon Charles
Papon Charles@dolu1990·
@fontamsoc Overall, to keep the Nax area "small" on FPGA, a few concession have been made : - Rescheduling the instruction stream require to wait that all the previous instructions commits.
English
0
0
2
63
Papon Charles
Papon Charles@dolu1990·
@fontamsoc Hi, Hmm not realy for a few reasons : - Coremark doesn't test the memory system performance (where OoO is great at) - Coremark contains a big CRC loop which use random branch (Big penality for deep OoO pipelines) - ...
English
0
0
4
56
Edmund Humenberger
Edmund Humenberger@ico_TC·
@dolu1990 @curliph Can you please post the Vivado report of all the used Artix7 primitives used by your NaxRiscv gateware design? (Similar to this one)
Edmund Humenberger tweet media
English
1
0
1
582
Papon Charles
Papon Charles@dolu1990·
@yannsionneau I think the 35T should be ok for single core 32 bits version of Nax + litex SoC, so, shuold be able to run buildroot.
English
0
0
0
38
Papon Charles
Papon Charles@dolu1990·
@yannsionneau I only tried on Xilinx 7 so far, reason is their distributed ram are pretty good as far as my understanding goes. Not sure how well it will fit on ECP5, but it should work out the directly. everything is FPGA agnostic (but use a lot of asyncronously readed ram
English
0
0
0
86
Papon Charles
Papon Charles@dolu1990·
@yannsionneau Yes, the L2 is very visible XD The config is : 2xRV64GC 100MHz (slowest Artix 7 speed grade), 16KB I$ 16KB D$ each, 256 KB L2.
English
0
0
1
162
Enjoy Digital
Enjoy Digital@enjoy_digital·
Test of @BrunoLevy01 's TinyRayTracer on Arty with VexRiscv-SMP 1 Core + FPU @ 125Mhz. Had to adapt the code a bit since was the CPU was too fast for it :): seems to give 141 raystones!
Enjoy Digital tweet mediaEnjoy Digital tweet mediaEnjoy Digital tweet media
English
2
3
40
0
Marcelo Samsoniuk
Marcelo Samsoniuk@samsoniuk·
hmmmm I was discussing with @splinedrive about threading and interleaving and, we realized that a darkriscv configured with 32 threads use only 2845LUTs in a spartan-6, which means only 88LUTs per thread... (1/2) 🔥
English
1
2
16
0