--load-assembly ../../llvm_trax/examples/project4_noInh/project4_noInh_rt-llvm.s --config-file ../trunk/configs/default.config --model ../trunk/test_models/cornell/cornellbox.obj --view-file ../trunk/views/cornell_obj.view --light-file ../trunk/lights/cornell.light
Loading core 0.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Center is 278 -549 274
Corner is 277 -547 273
Across is 2 0 0
Up is 0 0 2
U is 0.5 0 0
V is 0 0 0.5
radius is 0
Work queue starts at 30 (0x0000001e)
FB starts at 32 (0x00000020)
FB ends at 49183 (0x0000c01f)
loading model ../trunk/test_models/cornell/cornellbox.obj
MTL file: "../trunk/test_models/cornell/CornellBox.mtl"
loading material file ../trunk/test_models/cornell/CornellBox.mtl
Found 4 total materials
Found 32 total triangles
vertex min/max = x: (0.000000, 556.000000) y: (0.000000, 559.200012) z: (0.000000, 548.799988)
Materials start at 49184 (0x0000c020)
Materials end at 49309 (0x0000c09d)
Starting BVH build.
BVH build complete with 33 nodes.
Scene starts at 49310 (0x0000c09e)
BVH bounds [0.000000 0.000000 0.000000] [556.000000 559.200012 548.799988]
Triangles start at 49576 (0x0000c1a8)
Scene ends at 50651 (0x0000c5db)
Starting camera at 50652 (0x0000c5dc)
Camera ended at 50674 (0x0000c5f2)
Background Color 0x0000c5f3 to 0x0000c5f5
Light at 0x0000c5f6 to 0x0000c5f8
Permutation table from 0x0000c5f9 to 0x0000c7f8
Hammersley table from 0x0000c7f9 to 0x0000c9f8
Memory used: 51705 (0x0000c9f9)
Image size: 49152
start_wq: 30
start_fb: 32
start_scene: 49312
start_camera: 50652
start_matls: 49184
start_bg_color: 50675
start_light: 50678
start_permutation: 50681
Loading assembly file ../../llvm_trax/examples/project4_noInh/project4_noInh_rt-llvm.s using 36 registers
Number of instructions: 1224
Creating thread 0...
Core 0 running...
<=== Core 0 ===>
---- Thread 00 ---- PC 5: Stalled ----- 77157541 in-flight CPI 1.2339 -- Total Cycles 95267309
Total CPI 1.2339 , IPC 0.8104 -- Total Cycles 95267309
kernel thread(called, cycles) 0
Data dependence stalls (caused by):
ADD: 0 (0.000000%)
SUB: 0 (0.000000%)
MUL: 0 (0.000000%)
BITOR: 0 (0.000000%)
BITAND: 0 (0.000000%)
BITSLEFT: 0 (0.000000%)
BITSRIGHT: 0 (0.000000%)
FPADD: 267816 (5.114633%)
FPSUB: 0 (0.000000%)
FPMUL: 966583 (18.459380%)
FPCMPLT: 0 (0.000000%)
FPMIN: 0 (0.000000%)
FPMAX: 0 (0.000000%)
LOAD: 2276 (0.043466%)
INTCONV: 0 (0.000000%)
ATOMIC_INC: 0 (0.000000%)
INC_RESET: 0 (0.000000%)
BARRIER: 0 (0.000000%)
GLOBAL_READ: 0 (0.000000%)
ATOMIC_ADD: 0 (0.000000%)
ATOMIC_FPADD: 0 (0.000000%)
FPINVSQRT: 542571 (10.361784%)
FPINV: 0 (0.000000%)
FPCONV: 0 (0.000000%)
FPEQ: 0 (0.000000%)
FPNE: 0 (0.000000%)
FPLT: 0 (0.000000%)
FPLE: 0 (0.000000%)
EQ: 0 (0.000000%)
NE: 0 (0.000000%)
LT: 0 (0.000000%)
LE: 0 (0.000000%)
BNZ: 0 (0.000000%)
LOADL1: 0 (0.000000%)
STORE: 0 (0.000000%)
LOADIMM: 0 (0.000000%)
SPHERE_TEST: 0 (0.000000%)
TRITEST: 0 (0.000000%)
MOV: 0 (0.000000%)
MOVINDRD: 0 (0.000000%)
MOVINDWR: 0 (0.000000%)
BLT: 0 (0.000000%)
BET: 0 (0.000000%)
JMP: 0 (0.000000%)
JMPREG: 0 (0.000000%)
JAL: 0 (0.000000%)
RAND: 0 (0.000000%)
COS: 0 (0.000000%)
SIN: 0 (0.000000%)
ADDC: 0 (0.000000%)
ADDK: 0 (0.000000%)
ADDKC: 0 (0.000000%)
BITXOR: 0 (0.000000%)
ANDN: 0 (0.000000%)
CMP: 0 (0.000000%)
CMPU: 0 (0.000000%)
RSUB: 0 (0.000000%)
RSUBC: 0 (0.000000%)
RSUBK: 0 (0.000000%)
RSUBKC: 0 (0.000000%)
MULH: 0 (0.000000%)
MULHU: 0 (0.000000%)
sra: 0 (0.000000%)
srl: 0 (0.000000%)
ADDI: 0 (0.000000%)
ADDIC: 0 (0.000000%)
ADDIK: 0 (0.000000%)
ADDIKC: 0 (0.000000%)
RSUBI: 0 (0.000000%)
RSUBIC: 0 (0.000000%)
RSUBIK: 0 (0.000000%)
RSUBIKC: 0 (0.000000%)
ANDNI: 0 (0.000000%)
ANDI: 0 (0.000000%)
ORI: 0 (0.000000%)
XORI: 0 (0.000000%)
MULI: 0 (0.000000%)
LW: 0 (0.000000%)
LWI: 0 (0.000000%)
lbu: 0 (0.000000%)
lbui: 0 (0.000000%)
SW: 0 (0.000000%)
SWI: 0 (0.000000%)
sb: 0 (0.000000%)
sbi: 0 (0.000000%)
beqd: 0 (0.000000%)
beqid: 0 (0.000000%)
bged: 0 (0.000000%)
bgeid: 0 (0.000000%)
bgtd: 0 (0.000000%)
bgtid: 0 (0.000000%)
bled: 0 (0.000000%)
bleid: 0 (0.000000%)
bltd: 0 (0.000000%)
bltid: 0 (0.000000%)
bned: 0 (0.000000%)
bneid: 0 (0.000000%)
brd: 0 (0.000000%)
brad: 0 (0.000000%)
brld: 0 (0.000000%)
brald: 0 (0.000000%)
brid: 0 (0.000000%)
braid: 0 (0.000000%)
brlid: 0 (0.000000%)
bralid: 0 (0.000000%)
brk: 0 (0.000000%)
brki: 0 (0.000000%)
rtsd: 0 (0.000000%)
FPDIV: 2834430 (54.130708%)
DIV: 606208 (11.577096%)
FPUN: 0 (0.000000%)
FPRSUB: 16386 (0.312933%)
FPSQRT: 0 (0.000000%)
FPNEG: 0 (0.000000%)
FPGT: 0 (0.000000%)
FPGE: 0 (0.000000%)
SYNC: 0 (0.000000%)
NOP: 0 (0.000000%)
HALT: 0 (0.000000%)
PRINT: 0 (0.000000%)
PROF: 0 (0.000000%)
Dynamic Instruction Mix: (84504619 total)
ADD%: 7.309 (6176170)
SUB%: 0.000 (0)
MUL%: 0.019 (16384)
BITOR%: 1.370 (1157794)
BITAND%: 0.000 (0)
BITSLEFT%: 0.000 (0)
BITSRIGHT%: 0.000 (0)
FPADD%: 0.850 (718285)
FPSUB%: 0.000 (0)
FPMUL%: 5.435 (4592731)
FPCMPLT%: 0.000 (0)
FPMIN%: 0.058 (49152)
FPMAX%: 0.075 (63049)
LOAD%: 4.969 (4198998)
INTCONV%: 0.000 (0)
ATOMIC_INC%: 0.019 (16385)
INC_RESET%: 0.000 (0)
BARRIER%: 0.000 (0)
GLOBAL_READ%: 0.000 (0)
ATOMIC_ADD%: 0.000 (0)
ATOMIC_FPADD%: 0.000 (0)
FPINVSQRT%: 0.055 (46667)
FPINV%: 0.000 (0)
FPCONV%: 0.058 (49153)
FPEQ%: 0.000 (0)
FPNE%: 0.000 (0)
FPLT%: 1.169 (987503)
FPLE%: 0.357 (301482)
EQ%: 0.000 (0)
NE%: 0.000 (0)
LT%: 0.000 (0)
LE%: 0.000 (0)
BNZ%: 0.000 (0)
LOADL1%: 0.000 (0)
STORE%: 0.058 (49152)
LOADIMM%: 0.000 (1)
SPHERE_TEST%: 0.000 (0)
TRITEST%: 0.000 (0)
MOV%: 0.000 (0)
MOVINDRD%: 0.000 (0)
MOVINDWR%: 0.000 (0)
BLT%: 0.000 (0)
BET%: 0.000 (0)
JMP%: 0.000 (0)
JMPREG%: 0.000 (0)
JAL%: 0.000 (0)
RAND%: 0.000 (0)
COS%: 0.000 (0)
SIN%: 0.000 (0)
ADDC%: 0.000 (0)
ADDK%: 2.695 (2277249)
ADDKC%: 0.000 (0)
BITXOR%: 0.000 (0)
ANDN%: 0.000 (0)
CMP%: 0.752 (635798)
CMPU%: 0.000 (0)
RSUB%: 0.019 (16384)
RSUBC%: 0.000 (0)
RSUBK%: 0.000 (0)
RSUBKC%: 0.000 (0)
MULH%: 0.000 (0)
MULHU%: 0.000 (0)
sra%: 0.000 (0)
srl%: 0.000 (0)
ADDI%: 15.191 (12837207)
ADDIC%: 0.000 (0)
ADDIK%: 0.000 (0)
ADDIKC%: 0.000 (0)
RSUBI%: 0.000 (0)
RSUBIC%: 0.000 (0)
RSUBIK%: 0.000 (0)
RSUBIKC%: 0.000 (0)
ANDNI%: 0.000 (0)
ANDI%: 1.136 (960125)
ORI%: 1.623 (1371682)
XORI%: 0.000 (0)
MULI%: 3.169 (2677884)
LW%: 1.109 (937114)
LWI%: 13.837 (11693240)
lbu%: 0.000 (0)
lbui%: 0.000 (0)
SW%: 0.261 (220422)
SWI%: 4.647 (3926555)
sb%: 0.000 (0)
sbi%: 0.000 (0)
beqd%: 0.000 (0)
beqid%: 1.380 (1166378)
bged%: 0.000 (0)
bgeid%: 0.000 (0)
bgtd%: 0.000 (0)
bgtid%: 0.296 (250209)
bled%: 0.000 (0)
bleid%: 0.000 (0)
bltd%: 0.000 (0)
bltid%: 0.053 (44767)
bned%: 0.000 (0)
bneid%: 13.492 (11401401)
brd%: 0.000 (0)
brad%: 0.000 (0)
brld%: 0.000 (0)
brald%: 0.000 (0)
brid%: 0.677 (571767)
braid%: 0.000 (0)
brlid%: 0.000 (1)
bralid%: 0.000 (0)
brk%: 0.000 (0)
brki%: 0.000 (0)
rtsd%: 0.000 (1)
FPDIV%: 0.184 (155166)
DIV%: 0.039 (32768)
FPUN%: 1.323 (1118399)
FPRSUB%: 3.736 (3157270)
FPSQRT%: 0.000 (0)
FPNEG%: 0.000 (0)
FPGT%: 2.976 (2515083)
FPGE%: 0.967 (816917)
SYNC%: 0.000 (0)
NOP%: 8.636 (7297926)
HALT%: 0.000 (0)
PRINT%: 0.000 (0)
PROF%: 0.000 (0)
Number of thread-cycles contention found when issuing:
ADD 0
SUB 0
MUL 0
BITOR 0
BITAND 0
BITSLEFT 0
BITSRIGHT 0
FPADD 0
FPSUB 0
FPMUL 1
FPCMPLT 0
FPMIN 0
FPMAX 32768
LOAD 16385
INTCONV 0
ATOMIC_INC 0
INC_RESET 0
BARRIER 0
GLOBAL_READ 0
ATOMIC_ADD 0
ATOMIC_FPADD 0
FPINVSQRT 0
FPINV 0
FPCONV 0
FPEQ 0
FPNE 0
FPLT 0
FPLE 0
EQ 0
NE 0
LT 0
LE 0
BNZ 0
LOADL1 0
STORE 32768
LOADIMM 0
SPHERE_TEST 0
TRITEST 0
MOV 0
MOVINDRD 0
MOVINDWR 0
BLT 0
BET 0
JMP 0
JMPREG 0
JAL 0
RAND 0
COS 0
SIN 0
ADDC 0
ADDK 0
ADDKC 0
BITXOR 0
ANDN 0
CMP 0
CMPU 0
RSUB 0
RSUBC 0
RSUBK 0
RSUBKC 0
MULH 0
MULHU 0
sra 0
srl 0
ADDI 1241927
ADDIC 0
ADDIK 0
ADDIKC 0
RSUBI 0
RSUBIC 0
RSUBIK 0
RSUBIKC 0
ANDNI 0
ANDI 0
ORI 365299
XORI 0
MULI 0
LW 0
LWI 3837271
lbu 0
lbui 0
SW 0
SWI 0
sb 0
sbi 0
beqd 0
beqid 0
bged 0
bgeid 0
bgtd 0
bgtid 0
bled 0
bleid 0
bltd 0
bltid 0
bned 0
bneid 0
brd 0
brad 0
brld 0
brald 0
brid 0
braid 0
brlid 0
bralid 0
brk 0
brki 0
rtsd 0
FPDIV 0
DIV 0
FPUN 0
FPRSUB 0
FPSQRT 0
FPNEG 0
FPGT 0
FPGE 0
SYNC 0
NOP 0
HALT 0
PRINT 0
PROF 0
--Average #threads Issuing each cycle: 0.8104
--Total thread-cycles: 95267309
--total thread-cycles issued: 77206693 (81.042172%)
--iCache conflicts: 0 (0.000000%)
--thread*cycles of FU dependence: 5526420 (5.800962%)
--thread*cycles of data dependence: 5236270 (5.496398%)
--iCache cycles*banks: 3048553888 (2.771958% used)
Issue breakdown:
--thread*cycles of issue worked: 77206693 (81.042168%)
--thread*cycles of issue failed: 10762690 (11.297359%)
--thread*cycles of issue NOP/other: 7297926 (7.660472%)
Number of thread-cycles not ready: 5236270
Number of thread-cycles not fetched: 0
SIMD stalls when issuing: 0
SIMD issues: 84504619
SIMD fetches beyond the first: 0
ATOMIC_INC called by threads: 0: 16385
## Core 0 ##
Module Utilization
FP AddSub: 0.51
FP MinMax: 0.00
FP Compare: 0.19
Int AddSub: 0.72
FP Mul: 0.60
Int Mul: 1.41
FP InvSqrt: 0.05
FP Div: 0.20
Conversion Unit: 0.00
L1 accesses: 4264535
L1 hits: 4215357
L1 misses: 49178
L1 bank conflicts: 0
L1 stores: 49152
L1 near hit: 0
L1 hit rate: 0.988468
-= L2 #0 =-
L2 accesses: 49178
L2 hits: 0
L2 misses: 49178
L2 stores: 49152
L2 bank conflicts: 0
L2 hit rate: 0.000000
L2 memory faults: 0
L2 bandwidth limited stalls: 32768
Bandwidth numbers for 1000MHz clock:
register to L1 bandwidth: 179055540.057643
L1 to L2 bandwidth: 66074961.787523
L2 to memory bandwidth: 66074961.787523
Core size: 0.3783
L2 size: 0.0000
1-L2 size: 0.0000
1-core chip size: 0.3783
FPS Statistics:
FPS assuming 1000MHz clock: 10.4968