--no-scene --load-assembly ../../llvm_trax/examples/project1/rt-llvm_noInh.s --config-file c:\Kostya\Utah\cs 6965 - Hardware Ray Tracing\hwrt\SimHWRT\trunk\configs\default.config --width 256 --height 256 --num-cores 2 Loading core 0. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 1. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Work queue starts at 28 (0x0000001c) FB starts at 30 (0x0000001e) FB ends at 196637 (0x0003001d) Permutation table from 0x0003001e to 0x0003021d Hammersley table from 0x0003021e to 0x0003041d Memory used: 197662 (0x0003041e) Image size: 196608 start_wq: 28 start_fb: 30 start_scene: 196638 start_camera: 0 start_matls: 3997696 start_bg_color: 0 start_light: 2006988847 start_permutation: 196638 Loading assembly file ../../llvm_trax/examples/project1/rt-llvm_noInh.s using 36 registers Number of instructions: 524 Creating thread 0... Creating thread 1... Core 0 running... Core 1 running... <=== Core 0 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5119725 in-flight CPI 1.5872 -- Total Cycles 8281835 Total CPI 1.5872 , IPC 0.6301 -- Total Cycles 8281835 kernel thread(cycles) 0 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 32771 (1.345117%) FPSUB: 0 (0.000000%) FPMUL: 98315 (4.035433%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 3 (0.000123%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 765739 (31.430484%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 228746 (9.389097%) DIV: 1146880 (47.074778%) FPUN: 0 (0.000000%) FPRSUB: 163840 (6.724968%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (5406838 total) ADD%: 3.754 (202980) SUB%: 0.000 (0) MUL%: 0.606 (32769) BITOR%: 1.477 (79871) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 12.121 (655368) FPSUB%: 0.000 (0) FPMUL%: 21.212 (1146907) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (1) FPMAX%: 0.000 (1) LOAD%: 0.000 (3) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.606 (32769) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.841 (45479) FPINV%: 0.000 (0) FPCONV%: 1.212 (65538) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 0.235 (12706) FPLE%: 0.636 (34397) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 1.818 (98304) LOADIMM%: 0.000 (1) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 0.426 (23044) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 1.066 (57655) CMPU%: 0.000 (0) RSUB%: 0.606 (32768) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 5.997 (324253) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 0.213 (11522) ORI%: 5.919 (320006) XORI%: 0.000 (0) MULI%: 0.606 (32768) LW%: 0.213 (11522) LWI%: 12.760 (689939) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.000 (0) SWI%: 0.001 (64) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 0.878 (47486) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.606 (32768) bled%: 0.000 (0) bleid%: 0.000 (1) bltd%: 0.000 (0) bltid%: 0.000 (0) bned%: 0.000 (0) bneid%: 6.443 (348342) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.199 (10766) braid%: 0.000 (0) brlid%: 0.000 (1) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (1) FPDIV%: 0.235 (12710) DIV%: 1.212 (65536) FPUN%: 1.477 (79871) FPRSUB%: 10.773 (582474) FPSQRT%: 0.000 (0) FPNEG%: 0.235 (12706) FPGT%: 1.887 (102026) FPGE%: 0.235 (12706) SYNC%: 0.000 (0) NOP%: 3.492 (188809) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 0 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 0 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 0 LOAD 1 INTCONV 0 ATOMIC_INC 0 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 0 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 65544 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 0 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 12706 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 0 ORI 131074 XORI 0 MULI 0 LW 0 LWI 229376 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 0 DIV 0 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 0.6301 --Total thread-cycles: 8281835 --total thread-cycles issued: 5218029 (63.005711%) --iCache conflicts: 0 (0.000000%) --thread*cycles of FU dependence: 438703 (5.297171%) --thread*cycles of data dependence: 2436294 (29.417321%) --iCache cycles*banks: 265018720 (2.040172% used) Issue breakdown: --thread*cycles of issue worked: 5218029 (63.005711%) --thread*cycles of issue failed: 2874997 (34.714493%) --thread*cycles of issue NOP/other: 188809 (2.279797%) Number of thread-cycles not ready: 2436294 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 5406838 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 32769 <=== Core 1 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5119673 in-flight CPI 1.5872 -- Total Cycles 8281940 Total CPI 1.5872 , IPC 0.6300 -- Total Cycles 8281940 kernel thread(cycles) 0 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 32771 (1.345041%) FPSUB: 0 (0.000000%) FPMUL: 98315 (4.035206%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 103 (0.004227%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 765758 (31.429497%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 228764 (9.389308%) DIV: 1146880 (47.072131%) FPUN: 0 (0.000000%) FPRSUB: 163840 (6.724590%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (5406789 total) ADD%: 3.754 (202983) SUB%: 0.000 (0) MUL%: 0.606 (32769) BITOR%: 1.477 (79872) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 12.121 (655368) FPSUB%: 0.000 (0) FPMUL%: 21.212 (1146907) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (1) FPMAX%: 0.000 (1) LOAD%: 0.000 (3) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.606 (32769) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.841 (45480) FPINV%: 0.000 (0) FPCONV%: 1.212 (65538) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 0.235 (12707) FPLE%: 0.636 (34397) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 1.818 (98304) LOADIMM%: 0.000 (1) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 0.426 (23032) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 1.066 (57660) CMPU%: 0.000 (0) RSUB%: 0.606 (32768) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 5.996 (324218) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 0.213 (11516) ORI%: 5.919 (320022) XORI%: 0.000 (0) MULI%: 0.606 (32768) LW%: 0.213 (11516) LWI%: 12.760 (689921) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.000 (0) SWI%: 0.001 (64) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 0.878 (47486) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.606 (32768) bled%: 0.000 (0) bleid%: 0.000 (1) bltd%: 0.000 (0) bltid%: 0.000 (0) bned%: 0.000 (0) bneid%: 6.443 (348340) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.199 (10757) braid%: 0.000 (0) brlid%: 0.000 (1) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (1) FPDIV%: 0.235 (12711) DIV%: 1.212 (65536) FPUN%: 1.477 (79872) FPRSUB%: 10.773 (582476) FPSQRT%: 0.000 (0) FPNEG%: 0.235 (12707) FPGT%: 1.887 (102029) FPGE%: 0.235 (12707) SYNC%: 0.000 (0) NOP%: 3.492 (188812) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 0 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 0 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 0 LOAD 0 INTCONV 0 ATOMIC_INC 0 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 0 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 65561 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 0 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 12707 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 0 ORI 131074 XORI 0 MULI 0 LW 0 LWI 229376 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 0 DIV 0 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 0.6300 --Total thread-cycles: 8281940 --total thread-cycles issued: 5217977 (63.004284%) --iCache conflicts: 0 (0.000000%) --thread*cycles of FU dependence: 438720 (5.297310%) --thread*cycles of data dependence: 2436431 (29.418602%) --iCache cycles*banks: 265022080 (2.040128% used) Issue breakdown: --thread*cycles of issue worked: 5217977 (63.004284%) --thread*cycles of issue failed: 2875151 (34.715912%) --thread*cycles of issue NOP/other: 188812 (2.279804%) Number of thread-cycles not ready: 2436431 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 5406789 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 32769 ## Core 0 ## Module Utilization FP AddSub: 1.87 FP MinMax: 0.00 FP Compare: 0.10 Int AddSub: 0.24 FP Mul: 1.73 Int Mul: 0.40 FP InvSqrt: 0.55 FP Div: 0.94 Conversion Unit: 0.02 ## Core 1 ## Module Utilization FP AddSub: 1.87 FP MinMax: 0.00 FP Compare: 0.10 Int AddSub: 0.24 FP Mul: 1.73 Int Mul: 0.40 FP InvSqrt: 0.55 FP Div: 0.94 Conversion Unit: 0.02 L1 accesses: 196614 L1 hits: 4 L1 misses: 196610 L1 bank conflicts: 0 L1 stores: 196608 L1 near hit: 0 L1 hit rate: 0.000020 -= L2 #0 =- L2 accesses: 196610 L2 hits: 1 L2 misses: 196609 L2 stores: 196608 L2 bank conflicts: 1 L2 hit rate: 0.000005 L2 memory faults: 0 L2 bandwidth limited stalls: 131105 Bandwidth numbers for 1000MHz clock: register to L1 bandwidth: 94960359.529289 L1 to L2 bandwidth: 3038669683.673149 L2 to memory bandwidth: 3038654228.357124 Core size: 0.3783 L2 size: 0.0000 1-L2 size: 0.0000 2-core chip size: 0.7565 FPS Statistics: FPS assuming 1000MHz clock: 120.7447