bmaxa
Legenda
- Poruka
- 70.808
Najbrzi entry je ovaj: https://benchmarksgame-team.pages.debian.net/benchmarksgame/program/nbody-gcc-9.html
Moj je u prilogu.
Samo da iskomentarisem jedan deo:
ovo:
je dovoljno na Zen procesorima umesto optimizacije za Intel:
Rezultati moga su:
Rezultati za C:
C optimizuje i izracunavanje energije pa je brzi, ja samo advance 
Inace C ima samo 1 instrukciju po taktu moj 3
Ovo je novi entry od skora, i dosta je brzi od drugoplasiranog
Moj je u prilogu.
Samo da iskomentarisem jedan deo:
ovo:
Kod:
sqrtpd xmm7,xmm3
mulpd xmm3,xmm7
divpd xmm6,xmm3
Kod:
; cvtpd2ps xmm4,xmm3
; rsqrtps xmm4,xmm4
; mulpd xmm3,dqword[L2] [364/1951]
; cvtps2pd xmm4,xmm4
;--------------------
; movapd xmm7, xmm4
; movapd xmm8,xmm3
; mulpd xmm8, xmm7
; mulpd xmm8, xmm7
; mulpd xmm8, xmm7
; mulpd xmm7,dqword[L1]
; subpd xmm7,xmm8
;------------------------
; movapd xmm8,xmm3
; mulpd xmm8, xmm7
; mulpd xmm8, xmm7
; mulpd xmm8, xmm7
; mulpd xmm7,dqword[L1]
; subpd xmm7,xmm8 ; distance -> xmm7
; mulpd xmm6,xmm7 ; mag -> xmm6
Kod:
-0.169075164 [2/1821]
-0.169059907
Performance counter stats for './nbody2 50000000':
3,045.51 msec task-clock:u # 0.999 CPUs utilized
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
52 page-faults:u # 0.017 K/sec
10,284,747,996 cycles:u # 3.377 GHz (62.50%)
1,663,774 stalled-cycles-frontend:u # 0.02% frontend cycles idle (62.50%)
6,560,200,491 stalled-cycles-backend:u # 63.79% backend cycles idle (62.50%)
32,440,786,887 instructions:u # 3.15 insn per cycle
# 0.20 stalled cycles per insn (62.50%)
1,948,850,295 branches:u # 639.909 M/sec (62.51%)
16,650 branch-misses:u # 0.00% of all branches (62.50%)
13,253,138,548 L1-dcache-loads:u # 4351.697 M/sec (62.50%)
20,261 L1-dcache-load-misses:u # 0.00% of all L1-dcache accesses (62.49%)
<not supported> LLC-loads:u
<not supported> LLC-load-misses:u
3.048611231 seconds time elapsed
3.033987000 seconds user
0.000000000 seconds sys
Kod:
-0.169075164 [2/1849]
-0.169059907
Performance counter stats for './fastc 50000000':
2,638.85 msec task-clock:u # 0.995 CPUs utilized
0 context-switches:u # 0.000 K/sec
0 cpu-migrations:u # 0.000 K/sec
66 page-faults:u # 0.025 K/sec
8,901,583,969 cycles:u # 3.373 GHz (62.36%)
513,262 stalled-cycles-frontend:u # 0.01% frontend cycles idle (62.40%)
8,131,537,782 stalled-cycles-backend:u # 91.35% backend cycles idle (62.47%)
7,848,064,835 instructions:u # 0.88 insn per cycle
# 1.04 stalled cycles per insn (62.58%)
50,051,004 branches:u # 18.967 M/sec (62.65%)
3,021 branch-misses:u # 0.01% of all branches (62.61%)
5,099,695,311 L1-dcache-loads:u # 1932.543 M/sec (62.49%)
10,158 L1-dcache-load-misses:u # 0.00% of all L1-dcache accesses (62.43%)
<not supported> LLC-loads:u
<not supported> LLC-load-misses:u
2.652433932 seconds time elapsed
2.631922000 seconds user
0.000000000 seconds sys

Inace C ima samo 1 instrukciju po taktu moj 3

Ovo je novi entry od skora, i dosta je brzi od drugoplasiranog
