--load-assembly ../../llvm_trax/examples/project4_noInh/project4_noInh_rt-llvm.s --config-file ../trunk/configs/default.config --model ../trunk/test_models/cornell/cornellbox.obj --view-file ../trunk/views/cornell_obj.view --light-file ../trunk/lights/cornell.light
Loading core 0.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Center is 278 -549 274
Corner is 277 -547 273
Across is 2 0 0
Up is 0 0 2
U is 0.5 0 0
V is 0 0 0.5
radius is 0
Work queue starts at 30 (0x0000001e)
FB starts at 32 (0x00000020)
FB ends at 49183 (0x0000c01f)
loading model ../trunk/test_models/cornell/cornellbox.obj
MTL file: "../trunk/test_models/cornell/CornellBox.mtl"
loading material file ../trunk/test_models/cornell/CornellBox.mtl
Found 4 total materials
Found 32 total triangles
vertex min/max = x: (0.000000, 556.000000) y: (0.000000, 559.200012) z: (0.000000, 548.799988)
Materials start at 49184 (0x0000c020)
Materials end at 49309 (0x0000c09d)
Starting BVH build.
BVH build complete with 33 nodes.
Scene starts at 49310 (0x0000c09e) BVH bounds [0.000000 0.000000 0.000000] [556.000000 559.200012 548.799988] Triangles start at 49576 (0x0000c1a8) Scene ends at 50651 (0x0000c5db) Starting camera at 50652 (0x0000c5dc) Camera ended at 50674 (0x0000c5f2) Background Color 0x0000c5f3 to 0x0000c5f5 Light at 0x0000c5f6 to 0x0000c5f8 Permutation table from 0x0000c5f9 to 0x0000c7f8 Hammersley table from 0x0000c7f9 to 0x0000c9f8 Memory used: 51705 (0x0000c9f9) Image size: 49152 start_wq: 30 start_fb: 32 start_scene: 49312 start_camera: 50652 start_matls: 49184 start_bg_color: 50675 start_light: 50678 start_permutation: 50681 Loading assembly file ../../llvm_trax/examples/project4_noInh/project4_noInh_rt-llvm.s using 36 registers Number of instructions: 624 Creating thread 0... Core 0 running... <=== Core 0 ===> ---- Thread 00 ---- PC 5: Stalled ----- 77576801 in-flight CPI 1.4752 -- Total Cycles 114511671 Total CPI 1.4752 , IPC 0.6779 -- Total Cycles 114511671 kernel thread(called, cycles) 0 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 1891020 (7.486277%) FPSUB: 0 (0.000000%) FPMUL: 5163664 (20.442204%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 1338 (0.005297%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 591723 (2.342546%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 
(0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 17005865 (67.323777%) DIV: 606208 (2.399890%) FPUN: 0 (0.000000%) FPRSUB: 2 (0.000008%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (83169270 total) ADD%: 4.422 (3677728) SUB%: 0.000 (0) MUL%: 0.020 (16384) BITOR%: 0.630 (524289) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 6.278 (5221734) FPSUB%: 0.000 (0) FPMUL%: 21.478 (17863339) FPCMPLT%: 0.000 (0) FPMIN%: 0.059 (49152) FPMAX%: 0.076 (63049) LOAD%: 12.284 (10216483) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.020 (16385) INC_RESET%: 0.000 (0) 
BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.056 (46667) FPINV%: 0.000 (0) FPCONV%: 0.059 (49153) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 2.853 (2372542) FPLE%: 0.000 (0) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.059 (49152) LOADIMM%: 0.000 (1) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 0.000 (0) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 1.854 (1541938) CMPU%: 0.000 (0) RSUB%: 0.020 (16384) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 11.601 (9648463) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 0.612 (508825) ORI%: 5.802 (4825420) XORI%: 0.000 (0) MULI%: 0.039 (32768) LW%: 0.000 (0) LWI%: 4.334 (3604784) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.000 (0) SWI%: 0.221 (184178) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 0.036 (30281) bged%: 0.000 (0) bgeid%: 0.592 (492441) bgtd%: 0.000 (0) bgtid%: 0.000 (0) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.020 (16384) bned%: 0.000 (0) bneid%: 9.092 (7561930) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.042 (35255) braid%: 0.000 (0) brlid%: 0.000 (1) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (1) FPDIV%: 1.203 (1000345) DIV%: 0.039 (32768) FPUN%: 0.000 (1) FPRSUB%: 8.441 (7020721) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (0) FPGT%: 1.091 (907006) FPGE%: 0.000 (1) SYNC%: 0.000 (0) NOP%: 6.665 (5543317) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: 
ADD 0 SUB 0 MUL 0 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 0 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 32768 LOAD 16386 INTCONV 0 ATOMIC_INC 1 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 0 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 32768 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 0 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 558553 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 0 ORI 3599739 XORI 0 MULI 0 LW 0 LWI 1842365 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 0 DIV 0 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 0.6779 --Total thread-cycles: 114511671 --total thread-cycles issued: 77625953 (67.788682%) --iCache conflicts: 0 (0.000000%) --thread*cycles of FU dependence: 6082581 (5.311756%) --thread*cycles of data dependence: 25259820 (22.058730%) --iCache cycles*banks: 3664373472 (2.269672% used) Issue breakdown: --thread*cycles of issue worked: 77625953 (67.788682%) --thread*cycles of issue failed: 31342401 (27.370486%) --thread*cycles of issue NOP/other: 5543317 (4.840831%) Number of thread-cycles not ready: 25259820 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 83169270 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 16385 ## Core 0 ## Module Utilization FP AddSub: 1.34 FP MinMax: 0.00 FP Compare: 0.09 Int AddSub: 0.41 FP Mul: 1.95 Int Mul: 0.02 FP InvSqrt: 0.04 FP Div: 0.90 Conversion Unit: 0.00 L1 accesses: 10282021 L1 hits: 10232851 L1 misses: 49170 L1 bank conflicts: 0 L1 stores: 49152 L1 near hit: 0 L1 hit rate: 0.995218 -= 
L2 #0 =-
L2 accesses: 49170
L2 hits: 0
L2 misses: 49170
L2 stores: 49152
L2 bank conflicts: 0
L2 hit rate: 0.000000
L2 memory faults: 0
L2 bandwidth limited stalls: 32768
Bandwidth numbers for 1000MHz clock:
register to L1 bandwidth: 359160627.748060
L1 to L2 bandwidth: 54961733.507830
L2 to memory bandwidth: 54961733.507830
Core size: 0.3783
L2 size: 0.0000
1-L2 size: 0.0000
1-core chip size: 0.3783
FPS Statistics:
FPS assuming 1000MHz clock: 8.7327
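
The summary figures in this log (CPI, IPC, L1 hit rate, FPS) appear to follow from a handful of raw counters. The sketch below recomputes them from values copied out of the output above. It is a consistency check, not the simulator's own code: the function name `derived_metrics` and its parameter names are hypothetical, and the formulas are inferred from the fact that they reproduce the printed numbers (for example, FPS is assumed to be clock rate divided by total cycles for the single rendered frame).

```python
# Raw counters copied from the log above.
TOTAL_CYCLES = 114_511_671      # "Total Cycles" / "Total thread-cycles"
ISSUED = 77_625_953             # "thread*cycles of issue worked"
ISSUE_FAILED = 31_342_401       # "thread*cycles of issue failed"
NOP_OTHER = 5_543_317           # "thread*cycles of issue NOP/other"
L1_ACCESSES = 10_282_021        # "L1 accesses"
L1_HITS = 10_232_851            # "L1 hits"
CLOCK_HZ = 1_000_000_000        # the log's assumed 1000MHz clock


def derived_metrics(total_cycles, issued, l1_accesses, l1_hits, clock_hz):
    """Recompute summary metrics (hypothetical helper; formulas are inferred)."""
    return {
        "ipc": issued / total_cycles,       # instructions issued per cycle
        "cpi": total_cycles / issued,       # cycles per issued instruction
        "l1_hit_rate": l1_hits / l1_accesses,
        "fps": clock_hz / total_cycles,     # assumes one frame per run
    }


metrics = derived_metrics(TOTAL_CYCLES, ISSUED, L1_ACCESSES, L1_HITS, CLOCK_HZ)

# The three issue-breakdown buckets account for every thread-cycle.
breakdown_total = ISSUED + ISSUE_FAILED + NOP_OTHER

for name, value in metrics.items():
    print(f"{name}: {value:.6f}")
print(f"breakdown total: {breakdown_total} (log reports {TOTAL_CYCLES})")
```

Rounded to the log's precision, these values match the reported CPI 1.4752, IPC 0.6779, L1 hit rate 0.995218, and FPS 8.7327. The issue breakdown also sums exactly to the total thread-cycle count, and issued plus NOP thread-cycles equals the 83169270 total of the dynamic instruction mix.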