--load-assembly ../../llvm_trax/examples/project5_noinh/project5_noinh_rt-llvm.s --config-file ../trunk/configs/default.config --model ../trunk/test_models/sponza.obj --view-file ../trunk/views/sponza_obj.view --light-file ../trunk/lights/sponza.light --num-samples 15 --ray-depth 3 --num-cores 20 --num-thread-procs 32 --num-l2s 4 --num-icaches 2 --num-icache-banks 16
Loading core 0.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 1.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 2.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 3.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 4.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 5.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 6.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 7.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 8.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 9.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 10.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 11.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 12.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 13.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 14.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 15.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 16.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 17.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 18.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 19.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 20.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 21.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 22.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 23.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 24.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 25.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 26.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 27.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 28.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 29.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 30.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 31.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 32.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 33.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 34.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 35.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 36.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 37.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 38.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 39.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 40.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 41.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 42.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 43.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 44.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 45.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 46.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 47.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 48.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 49.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 50.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 51.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 52.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 53.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 54.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 55.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 56.
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 57. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 58. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 59. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 60. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 61. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 62. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 63. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 64. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 65. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 66. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 67. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 68. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 69. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 70. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 71. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 72. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 73. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 74. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 75. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 76. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 77. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 78. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 79. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Center is -10.56 3.28 1.674 Corner is -8.94028 2.58737 0.296831 Across is 0.426589 -0 1.95398 Up is -0.330503 1.97118 0.0721549 U is 0.123149 -0 0.564081 V is -0.0954107 0.569048 0.0208299 radius is 0 Work queue starts at 30 (0x0000001e) FB starts at 32 (0x00000020) FB ends at 49183 (0x0000c01f) loading model ../trunk/test_models/sponza.obj Found 66454 total triangles vertex min/max = x: (-17.402760, 17.417240) y: (-0.906689, 15.653312) z: (-7.798512, 7.801489) Materials start at 49184 (0x0000c020) Materials end at 49209 (0x0000c039) Starting BVH build. BVH build complete with 58807 nodes. 
Scene starts at 49210 (0x0000c03a) BVH bounds [-17.402760 -0.906689 -7.798512] [17.417240 15.653312 7.801489] Triangles start at 519672 (0x0007edf8) Scene ends at 2628543 (0x00281bbf) Starting camera at 2628544 (0x00281bc0) Camera ended at 2628566 (0x00281bd6) Background Color 0x00281bd7 to 0x00281bd9 Light at 0x00281bda to 0x00281bdc Permutation table from 0x00281bdd to 0x00281ddc Hammersley table from 0x00281ddd to 0x00281fdc Memory used: 2629597 (0x00281fdd) Image size: 49152 start_wq: 30 start_fb: 32 start_scene: 49216 start_camera: 2628544 start_matls: 49184 start_bg_color: 2628567 start_light: 2628570 start_permutation: 2628573 Loading assembly file ../../llvm_trax/examples/project5_noinh/project5_noinh_rt-llvm.s using 36 registers Number of instructions: 1363 Creating thread 0... Creating thread 1... Creating thread 2... Creating thread 3... Core 0 running... Creating thread 4... Core 1 running... Creating thread 5... Core 2 running... Core 3 running... Creating thread 6... Core 4 running... Core 5 running... Creating thread 7... Core 6 running... Creating thread 8... Creating thread 9... Core 7 running... Core 8 running... Creating thread 10... Creating thread 11... Core 9 running... Creating thread 12... Core 10 running... Core 11 running... Creating thread 13... Core 12 running... Creating thread 14... Creating thread 15... Core 13 running... Core 14 running... Creating thread 16... Creating thread 17... Core 15 running... Creating thread 18... Core 16 running... Creating thread 19... Core 17 running... Creating thread 20... Core 18 running... Creating thread 21... Core 19 running... Core 20 running... Creating thread 22... Core 21 running... Creating thread 23... Core 22 running... Creating thread 24... Core 23 running... Creating thread 25... Core 24 running... Creating thread 26... Core 25 running... Creating thread 27... Core 26 running... Creating thread 28... Creating thread 29... Core 27 running... Core 28 running... Creating thread 30... 
Core 29 running... Creating thread 31... Core 30 running... Creating thread 32... Creating thread 33... Core 31 running... Creating thread 34... Core 32 running... Core 33 running... Creating thread 35... Core 34 running... Creating thread 36... Creating thread 37... Core 35 running... Core 36 running... Creating thread 38... Creating thread 39... Core 37 running... Creating thread 40... Core 38 running... Creating thread 41... Core 39 running... Creating thread 42... Core 40 running... Creating thread 43... Core 41 running... Core 42 running... Creating thread 44... Creating thread 45... Core 43 running... Core 44 running... Creating thread 46... Core 45 running... Creating thread 47... Core 46 running... Creating thread 48... Core 47 running... Creating thread 49... Core 48 running... Creating thread 50... Core 49 running... Creating thread 51... Core 50 running... Creating thread 52... Core 51 running... Creating thread 53... Core 52 running... Creating thread 54... Core 53 running... Creating thread 55... Creating thread 56... Core 54 running... Creating thread 57... Core 55 running... Core 56 running... Creating thread 58... Core 57 running... Creating thread 59... Core 58 running... Creating thread 60... Core 59 running... Creating thread 61... Core 60 running... Creating thread 62... Core 61 running... Creating thread 63... Creating thread 64... Core 62 running... Core 63 running... Creating thread 65... Creating thread 66... Core 64 running... Creating thread 67... Core 65 running... Core 66 running... Creating thread 68... Creating thread 69... Creating thread 70... Core 67 running... Creating thread 71... Core 68 running... Creating thread 72... Core 69 running... Creating thread 73... Core 70 running... Core 72 running... Creating thread 74... Core 71 running... Creating thread 75... Core 73 running... Creating thread 76... Core 74 running... Core 75 running... Creating thread 77... Creating thread 78... Core 76 running... Creating thread 79... 
Core 77 running... Core 78 running... Core 79 running... <=== Core 0 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6454156 in-flight CPI 1.4160 -- Total Cycles 9138977 ---- Thread 01 ---- PC 5: Stalled ----- 6170818 in-flight CPI 1.4810 -- Total Cycles 9138977 ---- Thread 02 ---- PC 5: Stalled ----- 6458116 in-flight CPI 1.4151 -- Total Cycles 9138977 ---- Thread 03 ---- PC 5: Stalled ----- 6417994 in-flight CPI 1.4240 -- Total Cycles 9138977 ---- Thread 04 ---- PC 5: Stalled ----- 6406811 in-flight CPI 1.4264 -- Total Cycles 9138977 ---- Thread 05 ---- PC 5: Stalled ----- 6070116 in-flight CPI 1.5056 -- Total Cycles 9138977 ---- Thread 06 ---- PC 5: Stalled ----- 7146957 in-flight CPI 1.2787 -- Total Cycles 9138977 ---- Thread 07 ---- PC 5: Stalled ----- 6710279 in-flight CPI 1.3619 -- Total Cycles 9138977 ---- Thread 08 ---- PC 5: Stalled ----- 5950030 in-flight CPI 1.5360 -- Total Cycles 9138977 ---- Thread 09 ---- PC 5: Stalled ----- 5996634 in-flight CPI 1.5240 -- Total Cycles 9138977 ---- Thread 10 ---- PC 5: Stalled ----- 6945432 in-flight CPI 1.3158 -- Total Cycles 9138977 ---- Thread 11 ---- PC 5: Stalled ----- 6078119 in-flight CPI 1.5036 -- Total Cycles 9138977 ---- Thread 12 ---- PC 5: Stalled ----- 6934402 in-flight CPI 1.3179 -- Total Cycles 9138977 ---- Thread 13 ---- PC 5: Stalled ----- 6018459 in-flight CPI 1.5185 -- Total Cycles 9138977 ---- Thread 14 ---- PC 5: Stalled ----- 5985049 in-flight CPI 1.5270 -- Total Cycles 9138977 ---- Thread 15 ---- PC 5: Stalled ----- 6832230 in-flight CPI 1.3376 -- Total Cycles 9138977 ---- Thread 16 ---- PC 5: Stalled ----- 6653844 in-flight CPI 1.3735 -- Total Cycles 9138977 ---- Thread 17 ---- PC 5: Stalled ----- 5969911 in-flight CPI 1.5308 -- Total Cycles 9138977 ---- Thread 18 ---- PC 5: Stalled ----- 6244724 in-flight CPI 1.4635 -- Total Cycles 9138977 ---- Thread 19 ---- PC 5: Stalled ----- 5838083 in-flight CPI 1.5654 -- Total Cycles 9138977 ---- Thread 20 ---- PC 5: Stalled ----- 6170781 in-flight CPI 
1.4810 -- Total Cycles 9138977 ---- Thread 21 ---- PC 5: Stalled ----- 5607779 in-flight CPI 1.6297 -- Total Cycles 9138977 ---- Thread 22 ---- PC 5: Stalled ----- 5543296 in-flight CPI 1.6486 -- Total Cycles 9138977 ---- Thread 23 ---- PC 5: Stalled ----- 6448232 in-flight CPI 1.4173 -- Total Cycles 9138977 ---- Thread 24 ---- PC 5: Stalled ----- 6040654 in-flight CPI 1.5129 -- Total Cycles 9138977 ---- Thread 25 ---- PC 5: Stalled ----- 5981958 in-flight CPI 1.5278 -- Total Cycles 9138977 ---- Thread 26 ---- PC 5: Stalled ----- 5631610 in-flight CPI 1.6228 -- Total Cycles 9138977 ---- Thread 27 ---- PC 5: Stalled ----- 5895064 in-flight CPI 1.5503 -- Total Cycles 9138977 ---- Thread 28 ---- PC 5: Stalled ----- 5437125 in-flight CPI 1.6808 -- Total Cycles 9138977 ---- Thread 29 ---- PC 5: Stalled ----- 6507968 in-flight CPI 1.4043 -- Total Cycles 9138977 ---- Thread 30 ---- PC 5: Stalled ----- 5650289 in-flight CPI 1.6174 -- Total Cycles 9138977 ---- Thread 31 ---- PC 5: Stalled ----- 6004101 in-flight CPI 1.5221 -- Total Cycles 9138977 Total CPI 0.0461 , IPC 21.6875 -- Total Cycles 9138977 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 442749 (2.085805%) FPSUB: 0 (0.000000%) FPMUL: 2024166 (9.535913%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14862750 (70.018906%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 571409 (2.691927%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) 
LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3317776 (15.630152%) DIV: 7457 (0.035130%) FPUN: 0 (0.000000%) FPRSUB: 460 (0.002167%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217355941 total) ADD%: 8.195 (17811998) SUB%: 0.000 (0) MUL%: 0.000 (202) BITOR%: 1.222 (2655676) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) 
FPADD%: 0.546 (1186349) FPSUB%: 0.000 (0) FPMUL%: 4.766 (10360050) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (606) FPMAX%: 0.000 (606) LOAD%: 4.950 (10758128) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41960) FPINV%: 0.000 (0) FPCONV%: 0.000 (670) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2311240) FPLE%: 0.387 (841327) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (606) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27810) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.966 (6446845) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1628511) CMPU%: 0.000 (0) RSUB%: 0.000 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.760 (34254833) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2679149) ORI%: 1.264 (2747197) XORI%: 0.000 (0) MULI%: 3.363 (7309239) LW%: 1.193 (2592662) LWI%: 13.935 (30287908) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (654175) SWI%: 4.103 (8918242) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3222342) bged%: 0.000 (0) bgeid%: 0.000 (202) bgtd%: 0.000 (0) bgtid%: 0.322 (700844) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (87262) bned%: 0.000 (0) bneid%: 13.704 (29785483) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.735 (1596491) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (188773) DIV%: 0.000 (404) FPUN%: 1.178 
(2560632) FPRSUB%: 3.713 (8070263) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.102 (6742228) FPGE%: 0.796 (1730180) SYNC%: 0.000 (0) NOP%: 8.812 (19154314) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 175 SUB 0 MUL 27 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 563 FPSUB 0 FPMUL 5420 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 2381654 INTCONV 0 ATOMIC_INC 9 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 115 FPINV 0 FPCONV 19 FPEQ 0 FPNE 0 FPLT 12 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1886 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2276 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3434921 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 779 ORI 605822 XORI 0 MULI 652992 LW 0 LWI 9619999 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1693 DIV 26 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6875 --Total thread-cycles: 292447264 --total thread-cycles issued: 198201627 (67.773461%) --iCache conflicts: 6645518 (2.272382%) --thread*cycles of FU dependence: 16708817 (5.713446%) --thread*cycles of data dependence: 21226767 (7.258323%) --iCache cycles*banks: 292447264 (74.323134% used) Issue breakdown: --thread*cycles of issue worked: 198201627 (67.773459%) --thread*cycles of issue failed: 75091323 (25.676877%) --thread*cycles of issue NOP/other: 19154314 (6.549664%) Number of thread-cycles not ready: 21226767 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217355941 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 8 
4: 8 5: 7 6: 8 7: 8 8: 7 9: 7 10: 8 11: 7 12: 8 13: 7 14: 9 15: 8 16: 8 17: 7 18: 7 19: 7 20: 7 21: 6 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 8 <=== Core 1 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6996914 in-flight CPI 1.2749 -- Total Cycles 8920503 ---- Thread 01 ---- PC 5: Stalled ----- 6063487 in-flight CPI 1.4712 -- Total Cycles 8920503 ---- Thread 02 ---- PC 5: Stalled ----- 6441466 in-flight CPI 1.3849 -- Total Cycles 8920503 ---- Thread 03 ---- PC 5: Stalled ----- 6260828 in-flight CPI 1.4248 -- Total Cycles 8920503 ---- Thread 04 ---- PC 5: Stalled ----- 6043989 in-flight CPI 1.4759 -- Total Cycles 8920503 ---- Thread 05 ---- PC 5: Stalled ----- 6310842 in-flight CPI 1.4135 -- Total Cycles 8920503 ---- Thread 06 ---- PC 5: Stalled ----- 6629573 in-flight CPI 1.3456 -- Total Cycles 8920503 ---- Thread 07 ---- PC 5: Stalled ----- 6137246 in-flight CPI 1.4535 -- Total Cycles 8920503 ---- Thread 08 ---- PC 5: Stalled ----- 5988103 in-flight CPI 1.4897 -- Total Cycles 8920503 ---- Thread 09 ---- PC 5: Stalled ----- 6287736 in-flight CPI 1.4187 -- Total Cycles 8920503 ---- Thread 10 ---- PC 5: Stalled ----- 5939530 in-flight CPI 1.5019 -- Total Cycles 8920503 ---- Thread 11 ---- PC 5: Stalled ----- 6285866 in-flight CPI 1.4191 -- Total Cycles 8920503 ---- Thread 12 ---- PC 5: Stalled ----- 6420281 in-flight CPI 1.3894 -- Total Cycles 8920503 ---- Thread 13 ---- PC 5: Stalled ----- 6755174 in-flight CPI 1.3205 -- Total Cycles 8920503 ---- Thread 14 ---- PC 5: Stalled ----- 6191209 in-flight CPI 1.4408 -- Total Cycles 8920503 ---- Thread 15 ---- PC 5: Stalled ----- 5823125 in-flight CPI 1.5319 -- Total Cycles 8920503 ---- Thread 16 ---- PC 5: Stalled ----- 6175574 in-flight CPI 1.4445 -- Total Cycles 8920503 ---- Thread 17 ---- PC 5: Stalled ----- 6329234 in-flight CPI 1.4094 -- Total Cycles 8920503 ---- Thread 18 ---- PC 5: Stalled ----- 6525831 in-flight CPI 1.3669 -- Total Cycles 8920503 ---- Thread 19 ---- PC 5: Stalled ----- 6134917 
in-flight CPI 1.4541 -- Total Cycles 8920503 ---- Thread 20 ---- PC 5: Stalled ----- 6555619 in-flight CPI 1.3607 -- Total Cycles 8920503 ---- Thread 21 ---- PC 5: Stalled ----- 5707658 in-flight CPI 1.5629 -- Total Cycles 8920503 ---- Thread 22 ---- PC 5: Stalled ----- 6085621 in-flight CPI 1.4658 -- Total Cycles 8920503 ---- Thread 23 ---- PC 5: Stalled ----- 6324377 in-flight CPI 1.4105 -- Total Cycles 8920503 ---- Thread 24 ---- PC 5: Stalled ----- 6468550 in-flight CPI 1.3791 -- Total Cycles 8920503 ---- Thread 25 ---- PC 5: Stalled ----- 5912304 in-flight CPI 1.5088 -- Total Cycles 8920503 ---- Thread 26 ---- PC 5: Stalled ----- 6460517 in-flight CPI 1.3808 -- Total Cycles 8920503 ---- Thread 27 ---- PC 5: Stalled ----- 6293918 in-flight CPI 1.4173 -- Total Cycles 8920503 ---- Thread 28 ---- PC 5: Stalled ----- 6258612 in-flight CPI 1.4253 -- Total Cycles 8920503 ---- Thread 29 ---- PC 5: Stalled ----- 6148690 in-flight CPI 1.4508 -- Total Cycles 8920503 ---- Thread 30 ---- PC 5: Stalled ----- 5214804 in-flight CPI 1.7106 -- Total Cycles 8920503 ---- Thread 31 ---- PC 5: Stalled ----- 5512660 in-flight CPI 1.6182 -- Total Cycles 8920503 Total CPI 0.0449 , IPC 22.2728 -- Total Cycles 8920503 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 439488 (2.048524%) FPSUB: 0 (0.000000%) FPMUL: 2018908 (9.410455%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15109263 (70.426706%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 569955 (2.656652%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) 
FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3308230 (15.420192%) DIV: 7574 (0.035304%) FPUN: 0 (0.000000%) FPRSUB: 465 (0.002167%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217895913 total) ADD%: 8.194 (17854874) SUB%: 
0.000 (0) MUL%: 0.000 (205) BITOR%: 1.225 (2669751) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.542 (1180743) FPSUB%: 0.000 (0) FPMUL%: 4.752 (10354200) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.954 (10795065) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41984) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (2312436) FPLE%: 0.392 (854056) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28092) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.966 (6463363) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1628919) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.774 (34369842) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2684879) ORI%: 1.256 (2737509) XORI%: 0.000 (0) MULI%: 3.363 (7328062) LW%: 1.193 (2599414) LWI%: 13.919 (30329313) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (657006) SWI%: 4.099 (8930973) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3229174) bged%: 0.000 (0) bgeid%: 0.000 (205) bgtd%: 0.000 (0) bgtid%: 0.323 (703544) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86557) bned%: 0.000 (0) bneid%: 13.713 (29879796) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.744 (1621333) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 
0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (188139) DIV%: 0.000 (410) FPUN%: 1.182 (2575655) FPRSUB%: 3.707 (8077457) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.105 (6766279) FPGE%: 0.795 (1732570) SYNC%: 0.000 (0) NOP%: 8.817 (19211043) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 128 SUB 0 MUL 20 BITOR 6 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 538 FPSUB 0 FPMUL 5176 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 2364130 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 116 FPINV 0 FPCONV 11 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1782 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2219 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3439303 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 810 ORI 600492 XORI 0 MULI 653690 LW 0 LWI 9637508 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1726 DIV 13 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.2728 --Total thread-cycles: 285456096 --total thread-cycles issued: 198684870 (69.602600%) --iCache conflicts: 6679044 (2.339780%) --thread*cycles of FU dependence: 16708096 (5.853123%) --thread*cycles of data dependence: 21453883 (7.515651%) --iCache cycles*banks: 285456096 (76.332560% used) Issue breakdown: --thread*cycles of issue worked: 198684870 (69.602602%) --thread*cycles of issue failed: 67560183 (23.667451%) --thread*cycles of issue NOP/other: 66048026304310 (23137717.929243%) Number of thread-cycles not ready: 21453883 Number of thread-cycles not fetched: 0 SIMD 
stalls when issuing: 0 SIMD issues: 217895913 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 7 4: 7 5: 7 6: 8 7: 7 8: 7 9: 8 10: 8 11: 8 12: 8 13: 8 14: 7 15: 7 16: 8 17: 8 18: 7 19: 7 20: 9 21: 8 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 8 30: 6 31: 7 <=== Core 2 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6463082 in-flight CPI 1.4046 -- Total Cycles 9077865 ---- Thread 01 ---- PC 5: Stalled ----- 6566814 in-flight CPI 1.3824 -- Total Cycles 9077865 ---- Thread 02 ---- PC 5: Stalled ----- 6890796 in-flight CPI 1.3174 -- Total Cycles 9077865 ---- Thread 03 ---- PC 5: Stalled ----- 6790077 in-flight CPI 1.3369 -- Total Cycles 9077865 ---- Thread 04 ---- PC 5: Stalled ----- 6040061 in-flight CPI 1.5029 -- Total Cycles 9077865 ---- Thread 05 ---- PC 5: Stalled ----- 6790938 in-flight CPI 1.3368 -- Total Cycles 9077865 ---- Thread 06 ---- PC 5: Stalled ----- 6833487 in-flight CPI 1.3284 -- Total Cycles 9077865 ---- Thread 07 ---- PC 5: Stalled ----- 6058705 in-flight CPI 1.4983 -- Total Cycles 9077865 ---- Thread 08 ---- PC 5: Stalled ----- 5854879 in-flight CPI 1.5505 -- Total Cycles 9077865 ---- Thread 09 ---- PC 5: Stalled ----- 6374598 in-flight CPI 1.4241 -- Total Cycles 9077865 ---- Thread 10 ---- PC 5: Stalled ----- 5929553 in-flight CPI 1.5309 -- Total Cycles 9077865 ---- Thread 11 ---- PC 5: Stalled ----- 6798201 in-flight CPI 1.3353 -- Total Cycles 9077865 ---- Thread 12 ---- PC 5: Stalled ----- 6084345 in-flight CPI 1.4920 -- Total Cycles 9077865 ---- Thread 13 ---- PC 5: Stalled ----- 5743501 in-flight CPI 1.5805 -- Total Cycles 9077865 ---- Thread 14 ---- PC 5: Stalled ----- 6347991 in-flight CPI 1.4300 -- Total Cycles 9077865 ---- Thread 15 ---- PC 5: Stalled ----- 6350019 in-flight CPI 1.4296 -- Total Cycles 9077865 ---- Thread 16 ---- PC 5: Stalled ----- 6332166 in-flight CPI 1.4336 -- Total Cycles 9077865 ---- Thread 17 ---- PC 5: Stalled ----- 6442350 in-flight CPI 1.4091 -- Total Cycles 9077865 ---- Thread 18 
---- PC 5: Stalled ----- 6855495 in-flight CPI 1.3242 -- Total Cycles 9077865 ---- Thread 19 ---- PC 5: Stalled ----- 6545142 in-flight CPI 1.3870 -- Total Cycles 9077865 ---- Thread 20 ---- PC 5: Stalled ----- 6073255 in-flight CPI 1.4947 -- Total Cycles 9077865 ---- Thread 21 ---- PC 5: Stalled ----- 6772516 in-flight CPI 1.3404 -- Total Cycles 9077865 ---- Thread 22 ---- PC 5: Stalled ----- 5815501 in-flight CPI 1.5610 -- Total Cycles 9077865 ---- Thread 23 ---- PC 5: Stalled ----- 6180115 in-flight CPI 1.4689 -- Total Cycles 9077865 ---- Thread 24 ---- PC 5: Stalled ----- 6500602 in-flight CPI 1.3965 -- Total Cycles 9077865 ---- Thread 25 ---- PC 5: Stalled ----- 6081422 in-flight CPI 1.4927 -- Total Cycles 9077865 ---- Thread 26 ---- PC 5: Stalled ----- 6286868 in-flight CPI 1.4439 -- Total Cycles 9077865 ---- Thread 27 ---- PC 5: Stalled ----- 5773775 in-flight CPI 1.5723 -- Total Cycles 9077865 ---- Thread 28 ---- PC 5: Stalled ----- 5752243 in-flight CPI 1.5781 -- Total Cycles 9077865 ---- Thread 29 ---- PC 5: Stalled ----- 6490789 in-flight CPI 1.3986 -- Total Cycles 9077865 ---- Thread 30 ---- PC 5: Stalled ----- 6326369 in-flight CPI 1.4349 -- Total Cycles 9077865 ---- Thread 31 ---- PC 5: Stalled ----- 5204880 in-flight CPI 1.7441 -- Total Cycles 9077865 Total CPI 0.0451 , IPC 22.1804 -- Total Cycles 9077865 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 457398 (2.103572%) FPSUB: 0 (0.000000%) FPMUL: 2071456 (9.526621%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15203920 (69.922794%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 
578017 (2.658299%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3424852 (15.750887%) DIV: 7749 (0.035638%) FPUN: 0 (0.000000%) FPRSUB: 476 (0.002189%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 
(0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (220821714 total) ADD%: 8.155 (18008606) SUB%: 0.000 (0) MUL%: 0.000 (210) BITOR%: 1.228 (2711604) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.552 (1219753) FPSUB%: 0.000 (0) FPMUL%: 4.788 (10573594) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (630) FPMAX%: 0.000 (630) LOAD%: 4.959 (10951104) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (42473) FPINV%: 0.000 (0) FPCONV%: 0.000 (694) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (2353898) FPLE%: 0.393 (867883) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (630) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28420) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.959 (6533137) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (1659063) CMPU%: 0.000 (0) RSUB%: 0.000 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (34816660) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2718475) ORI%: 1.267 (2798506) XORI%: 0.000 (0) MULI%: 3.356 (7411367) LW%: 1.190 (2627450) LWI%: 13.915 (30726797) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.300 (663441) SWI%: 4.095 (9041931) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3264943) bged%: 0.000 (0) bgeid%: 0.000 (210) bgtd%: 0.000 (0) bgtid%: 0.322 (711746) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (89320) bned%: 0.000 (0) bneid%: 13.710 (30273651) brd%: 0.000 
(0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.740 (1633710) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.088 (194945) DIV%: 0.000 (420) FPUN%: 1.183 (2611904) FPRSUB%: 3.719 (8212378) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.100 (6845352) FPGE%: 0.795 (1755081) SYNC%: 0.000 (0) NOP%: 8.817 (19470549) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 179 SUB 0 MUL 31 BITOR 8 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 523 FPSUB 0 FPMUL 5349 FPCMPLT 0 FPMIN 0 FPMAX 409 LOAD 2380854 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 121 FPINV 0 FPCONV 15 FPEQ 0 FPNE 0 FPLT 5 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1868 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2285 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3486106 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 810 ORI 624646 XORI 0 MULI 654022 LW 0 LWI 9767776 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1787 DIV 18 FPUN 0 FPRSUB 6 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.1805 --Total thread-cycles: 290491680 --total thread-cycles issued: 201351165 (69.313919%) --iCache conflicts: 6758106 (2.326437%) --thread*cycles of FU dependence: 16926825 (5.826957%) --thread*cycles of data dependence: 21743868 (7.485195%) --iCache cycles*banks: 290491680 (76.016548% used) Issue breakdown: --thread*cycles of issue worked: 201351165 (69.313918%) --thread*cycles of issue failed: 69669966 (23.983463%) --thread*cycles of issue NOP/other: 
4572413662125955285 (1574025687112.951200%) Number of thread-cycles not ready: 21743868 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 220821714 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 9 3: 8 4: 7 5: 8 6: 8 7: 7 8: 7 9: 7 10: 7 11: 8 12: 8 13: 7 14: 8 15: 8 16: 8 17: 7 18: 8 19: 7 20: 8 21: 9 22: 7 23: 8 24: 7 25: 8 26: 7 27: 8 28: 6 29: 7 30: 7 31: 7 <=== Core 3 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6485918 in-flight CPI 1.4316 -- Total Cycles 9285264 ---- Thread 01 ---- PC 5: Stalled ----- 6461155 in-flight CPI 1.4371 -- Total Cycles 9285264 ---- Thread 02 ---- PC 5: Stalled ----- 6025597 in-flight CPI 1.5410 -- Total Cycles 9285264 ---- Thread 03 ---- PC 5: Stalled ----- 6571028 in-flight CPI 1.4131 -- Total Cycles 9285264 ---- Thread 04 ---- PC 5: Stalled ----- 6204435 in-flight CPI 1.4965 -- Total Cycles 9285264 ---- Thread 05 ---- PC 5: Stalled ----- 6599752 in-flight CPI 1.4069 -- Total Cycles 9285264 ---- Thread 06 ---- PC 5: Stalled ----- 5764520 in-flight CPI 1.6108 -- Total Cycles 9285264 ---- Thread 07 ---- PC 5: Stalled ----- 6717001 in-flight CPI 1.3823 -- Total Cycles 9285264 ---- Thread 08 ---- PC 5: Stalled ----- 6025165 in-flight CPI 1.5411 -- Total Cycles 9285264 ---- Thread 09 ---- PC 5: Stalled ----- 6296129 in-flight CPI 1.4748 -- Total Cycles 9285264 ---- Thread 10 ---- PC 5: Stalled ----- 6514797 in-flight CPI 1.4253 -- Total Cycles 9285264 ---- Thread 11 ---- PC 5: Stalled ----- 5993273 in-flight CPI 1.5493 -- Total Cycles 9285264 ---- Thread 12 ---- PC 5: Stalled ----- 7210523 in-flight CPI 1.2877 -- Total Cycles 9285264 ---- Thread 13 ---- PC 5: Stalled ----- 6508948 in-flight CPI 1.4265 -- Total Cycles 9285264 ---- Thread 14 ---- PC 5: Stalled ----- 6076586 in-flight CPI 1.5280 -- Total Cycles 9285264 ---- Thread 15 ---- PC 5: Stalled ----- 6461963 in-flight CPI 1.4369 -- Total Cycles 9285264 ---- Thread 16 ---- PC 5: Stalled ----- 6140428 in-flight CPI 1.5121 -- 
Total Cycles 9285264 ---- Thread 17 ---- PC 5: Stalled ----- 6216814 in-flight CPI 1.4936 -- Total Cycles 9285264 ---- Thread 18 ---- PC 5: Stalled ----- 6365027 in-flight CPI 1.4588 -- Total Cycles 9285264 ---- Thread 19 ---- PC 5: Stalled ----- 5977168 in-flight CPI 1.5535 -- Total Cycles 9285264 ---- Thread 20 ---- PC 5: Stalled ----- 5775491 in-flight CPI 1.6077 -- Total Cycles 9285264 ---- Thread 21 ---- PC 5: Stalled ----- 5933263 in-flight CPI 1.5649 -- Total Cycles 9285264 ---- Thread 22 ---- PC 5: Stalled ----- 6380909 in-flight CPI 1.4552 -- Total Cycles 9285264 ---- Thread 23 ---- PC 5: Stalled ----- 5551785 in-flight CPI 1.6725 -- Total Cycles 9285264 ---- Thread 24 ---- PC 5: Stalled ----- 5939432 in-flight CPI 1.5633 -- Total Cycles 9285264 ---- Thread 25 ---- PC 5: Stalled ----- 5836916 in-flight CPI 1.5908 -- Total Cycles 9285264 ---- Thread 26 ---- PC 5: Stalled ----- 5414386 in-flight CPI 1.7149 -- Total Cycles 9285264 ---- Thread 27 ---- PC 5: Stalled ----- 6019622 in-flight CPI 1.5425 -- Total Cycles 9285264 ---- Thread 28 ---- PC 5: Stalled ----- 5280082 in-flight CPI 1.7585 -- Total Cycles 9285264 ---- Thread 29 ---- PC 5: Stalled ----- 5772656 in-flight CPI 1.6085 -- Total Cycles 9285264 ---- Thread 30 ---- PC 5: Stalled ----- 6178815 in-flight CPI 1.5028 -- Total Cycles 9285264 ---- Thread 31 ---- PC 5: Stalled ----- 6186726 in-flight CPI 1.5008 -- Total Cycles 9285264 Total CPI 0.0472 , IPC 21.2042 -- Total Cycles 9285264 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 435571 (2.010885%) FPSUB: 0 (0.000000%) FPMUL: 1997435 (9.221487%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15381489 (71.011171%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 
0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 559647 (2.583702%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3278641 (15.136385%) DIV: 7418 (0.034246%) FPUN: 0 (0.000000%) FPRSUB: 460 (0.002124%) FPSQRT: 
0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215906702 total) ADD%: 8.211 (17729165) SUB%: 0.000 (0) MUL%: 0.000 (201) BITOR%: 1.227 (2648750) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.540 (1165131) FPSUB%: 0.000 (0) FPMUL%: 4.749 (10253653) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (603) FPMAX%: 0.000 (603) LOAD%: 4.953 (10693393) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41093) FPINV%: 0.000 (0) FPCONV%: 0.000 (667) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (2290215) FPLE%: 0.390 (841761) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (603) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27258) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.968 (6407409) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1618743) CMPU%: 0.000 (0) RSUB%: 0.000 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.765 (34037385) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2662692) ORI%: 1.261 (2722591) XORI%: 0.000 (0) MULI%: 3.363 (7261145) LW%: 1.193 (2576650) LWI%: 13.928 (30072352) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (650134) SWI%: 4.100 (8852253) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3202587) bged%: 0.000 (0) bgeid%: 0.000 (201) bgtd%: 0.000 (0) bgtid%: 0.323 (696918) bled%: 
0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86724) bned%: 0.000 (0) bneid%: 13.705 (29590498) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.738 (1592442) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (186577) DIV%: 0.000 (402) FPUN%: 1.183 (2554330) FPRSUB%: 3.708 (8006286) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.099 (6691782) FPGE%: 0.798 (1723183) SYNC%: 0.000 (0) NOP%: 8.809 (19019789) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 172 SUB 0 MUL 23 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 540 FPSUB 0 FPMUL 5182 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 2381021 INTCONV 0 ATOMIC_INC 10 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 110 FPINV 0 FPCONV 11 FPEQ 0 FPNE 0 FPLT 14 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1732 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2185 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3411058 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 877 ORI 595570 XORI 0 MULI 649027 LW 0 LWI 9552892 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1656 DIV 11 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.2042 --Total thread-cycles: 297128448 --total thread-cycles issued: 196886913 (66.263232%) --iCache conflicts: 6623217 (2.229075%) --thread*cycles of FU dependence: 16602514 (5.587655%) --thread*cycles of data dependence: 21660661 (7.289999%) --iCache cycles*banks: 297128448 (72.664444% used) Issue breakdown: --thread*cycles of 
issue worked: 196886913 (66.263232%) --thread*cycles of issue failed: 81221746 (27.335567%) --thread*cycles of issue NOP/other: 39920173 (13.435325%) Number of thread-cycles not ready: 21660661 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215906702 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 8 4: 7 5: 7 6: 7 7: 9 8: 8 9: 7 10: 8 11: 7 12: 8 13: 7 14: 7 15: 8 16: 7 17: 7 18: 7 19: 7 20: 8 21: 8 22: 7 23: 7 24: 7 25: 7 26: 6 27: 8 28: 7 29: 6 30: 7 31: 7 <=== Core 4 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6022205 in-flight CPI 1.4868 -- Total Cycles 8953613 ---- Thread 01 ---- PC 5: Stalled ----- 6343930 in-flight CPI 1.4114 -- Total Cycles 8953613 ---- Thread 02 ---- PC 5: Stalled ----- 6148464 in-flight CPI 1.4562 -- Total Cycles 8953613 ---- Thread 03 ---- PC 5: Stalled ----- 6431756 in-flight CPI 1.3921 -- Total Cycles 8953613 ---- Thread 04 ---- PC 5: Stalled ----- 6005397 in-flight CPI 1.4909 -- Total Cycles 8953613 ---- Thread 05 ---- PC 5: Stalled ----- 6242793 in-flight CPI 1.4342 -- Total Cycles 8953613 ---- Thread 06 ---- PC 5: Stalled ----- 6017541 in-flight CPI 1.4879 -- Total Cycles 8953613 ---- Thread 07 ---- PC 5: Stalled ----- 6187110 in-flight CPI 1.4471 -- Total Cycles 8953613 ---- Thread 08 ---- PC 5: Stalled ----- 5998763 in-flight CPI 1.4926 -- Total Cycles 8953613 ---- Thread 09 ---- PC 5: Stalled ----- 6095877 in-flight CPI 1.4688 -- Total Cycles 8953613 ---- Thread 10 ---- PC 5: Stalled ----- 6319652 in-flight CPI 1.4168 -- Total Cycles 8953613 ---- Thread 11 ---- PC 5: Stalled ----- 5882924 in-flight CPI 1.5220 -- Total Cycles 8953613 ---- Thread 12 ---- PC 5: Stalled ----- 5963768 in-flight CPI 1.5013 -- Total Cycles 8953613 ---- Thread 13 ---- PC 5: Stalled ----- 5761951 in-flight CPI 1.5539 -- Total Cycles 8953613 ---- Thread 14 ---- PC 5: Stalled ----- 6495654 in-flight CPI 1.3784 -- Total Cycles 8953613 ---- Thread 15 ---- PC 5: Stalled ----- 6424881 in-flight 
CPI 1.3936 -- Total Cycles 8953613 ---- Thread 16 ---- PC 5: Stalled ----- 6814828 in-flight CPI 1.3138 -- Total Cycles 8953613 ---- Thread 17 ---- PC 5: Stalled ----- 6367533 in-flight CPI 1.4061 -- Total Cycles 8953613 ---- Thread 18 ---- PC 5: Stalled ----- 5728034 in-flight CPI 1.5631 -- Total Cycles 8953613 ---- Thread 19 ---- PC 5: Stalled ----- 5767073 in-flight CPI 1.5525 -- Total Cycles 8953613 ---- Thread 20 ---- PC 5: Stalled ----- 5596305 in-flight CPI 1.5999 -- Total Cycles 8953613 ---- Thread 21 ---- PC 5: Stalled ----- 6246191 in-flight CPI 1.4334 -- Total Cycles 8953613 ---- Thread 22 ---- PC 5: Stalled ----- 5992990 in-flight CPI 1.4940 -- Total Cycles 8953613 ---- Thread 23 ---- PC 5: Stalled ----- 6143576 in-flight CPI 1.4574 -- Total Cycles 8953613 ---- Thread 24 ---- PC 5: Stalled ----- 5942347 in-flight CPI 1.5067 -- Total Cycles 8953613 ---- Thread 25 ---- PC 5: Stalled ----- 5562321 in-flight CPI 1.6097 -- Total Cycles 8953613 ---- Thread 26 ---- PC 5: Stalled ----- 5900395 in-flight CPI 1.5175 -- Total Cycles 8953613 ---- Thread 27 ---- PC 5: Stalled ----- 6202315 in-flight CPI 1.4436 -- Total Cycles 8953613 ---- Thread 28 ---- PC 5: Stalled ----- 5830832 in-flight CPI 1.5356 -- Total Cycles 8953613 ---- Thread 29 ---- PC 5: Stalled ----- 5363847 in-flight CPI 1.6692 -- Total Cycles 8953613 ---- Thread 30 ---- PC 5: Stalled ----- 5190428 in-flight CPI 1.7250 -- Total Cycles 8953613 ---- Thread 31 ---- PC 5: Stalled ----- 6331461 in-flight CPI 1.4141 -- Total Cycles 8953613 Total CPI 0.0463 , IPC 21.5917 -- Total Cycles 8953613 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 429571 (2.039511%) FPSUB: 0 (0.000000%) FPMUL: 1968507 (9.346050%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) 
FPMAX: 0 (0.000000%) LOAD: 14881195 (70.652729%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 554566 (2.632961%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 
(0.000000%) FPDIV: 3220814 (15.291736%) DIV: 7338 (0.034839%) FPUN: 0 (0.000000%) FPRSUB: 458 (0.002174%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (212017961 total) ADD%: 8.184 (17351257) SUB%: 0.000 (0) MUL%: 0.000 (199) BITOR%: 1.224 (2594856) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.544 (1152984) FPSUB%: 0.000 (0) FPMUL%: 4.761 (10093657) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (597) FPMAX%: 0.000 (597) LOAD%: 4.950 (10494460) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40819) FPINV%: 0.000 (0) FPCONV%: 0.000 (661) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (2251962) FPLE%: 0.390 (826917) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (597) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27238) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.966 (6288479) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1585698) CMPU%: 0.000 (0) RSUB%: 0.000 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (33428000) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2611916) ORI%: 1.262 (2675745) XORI%: 0.000 (0) MULI%: 3.362 (7128787) LW%: 1.193 (2529022) LWI%: 13.930 (29534941) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (638952) SWI%: 4.102 (8696969) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 
(0) beqid%: 1.482 (3142107) bged%: 0.000 (0) bgeid%: 0.000 (199) bgtd%: 0.000 (0) bgtid%: 0.323 (684569) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85286) bned%: 0.000 (0) bneid%: 13.709 (29066226) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.739 (1565792) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (183198) DIV%: 0.000 (398) FPUN%: 1.181 (2503588) FPRSUB%: 3.711 (7868190) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.104 (6581043) FPGE%: 0.796 (1687305) SYNC%: 0.000 (0) NOP%: 8.817 (18694222) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 164 SUB 0 MUL 30 BITOR 1 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 532 FPSUB 0 FPMUL 5297 FPCMPLT 0 FPMIN 0 FPMAX 385 LOAD 2338372 INTCONV 0 ATOMIC_INC 8 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 110 FPINV 0 FPCONV 9 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2300 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2230 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3351818 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 817 ORI 587623 XORI 0 MULI 639728 LW 0 LWI 9382841 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1582 DIV 12 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5917 --Total thread-cycles: 286515616 --total thread-cycles issued: 193323739 (67.474069%) --iCache conflicts: 6534505 (2.280680%) --thread*cycles of FU dependence: 16313887 (5.693891%) --thread*cycles of data dependence: 
21062449 (7.351239%) --iCache cycles*banks: 286515616 (73.998756% used) Issue breakdown: --thread*cycles of issue worked: 193323739 (67.474067%) --thread*cycles of issue failed: 74497655 (26.001255%) --thread*cycles of issue NOP/other: 18737138 (6.539657%) Number of thread-cycles not ready: 21062449 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 212017961 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 7 4: 7 5: 7 6: 7 7: 7 8: 7 9: 7 10: 7 11: 7 12: 7 13: 7 14: 7 15: 8 16: 7 17: 8 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 7 25: 7 26: 7 27: 8 28: 7 29: 7 30: 7 31: 9 <=== Core 5 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5896870 in-flight CPI 1.4988 -- Total Cycles 8838220 ---- Thread 01 ---- PC 5: Stalled ----- 6424216 in-flight CPI 1.3758 -- Total Cycles 8838220 ---- Thread 02 ---- PC 5: Stalled ----- 6267147 in-flight CPI 1.4102 -- Total Cycles 8838220 ---- Thread 03 ---- PC 5: Stalled ----- 5944140 in-flight CPI 1.4869 -- Total Cycles 8838220 ---- Thread 04 ---- PC 5: Stalled ----- 6616841 in-flight CPI 1.3357 -- Total Cycles 8838220 ---- Thread 05 ---- PC 5: Stalled ----- 6025761 in-flight CPI 1.4667 -- Total Cycles 8838220 ---- Thread 06 ---- PC 5: Stalled ----- 6157892 in-flight CPI 1.4353 -- Total Cycles 8838220 ---- Thread 07 ---- PC 5: Stalled ----- 6035359 in-flight CPI 1.4644 -- Total Cycles 8838220 ---- Thread 08 ---- PC 5: Stalled ----- 5955107 in-flight CPI 1.4841 -- Total Cycles 8838220 ---- Thread 09 ---- PC 5: Stalled ----- 5800470 in-flight CPI 1.5237 -- Total Cycles 8838220 ---- Thread 10 ---- PC 5: Stalled ----- 6205707 in-flight CPI 1.4242 -- Total Cycles 8838220 ---- Thread 11 ---- PC 5: Stalled ----- 5987388 in-flight CPI 1.4761 -- Total Cycles 8838220 ---- Thread 12 ---- PC 5: Stalled ----- 6569940 in-flight CPI 1.3452 -- Total Cycles 8838220 ---- Thread 13 ---- PC 5: Stalled ----- 5757365 in-flight CPI 1.5351 -- Total Cycles 8838220 ---- Thread 14 ---- PC 5: Stalled ----- 
6257191 in-flight CPI 1.4125 -- Total Cycles 8838220 ---- Thread 15 ---- PC 5: Stalled ----- 6501001 in-flight CPI 1.3595 -- Total Cycles 8838220 ---- Thread 16 ---- PC 5: Stalled ----- 6414317 in-flight CPI 1.3779 -- Total Cycles 8838220 ---- Thread 17 ---- PC 5: Stalled ----- 6129823 in-flight CPI 1.4418 -- Total Cycles 8838220 ---- Thread 18 ---- PC 5: Stalled ----- 6558657 in-flight CPI 1.3476 -- Total Cycles 8838220 ---- Thread 19 ---- PC 5: Stalled ----- 6219938 in-flight CPI 1.4209 -- Total Cycles 8838220 ---- Thread 20 ---- PC 5: Stalled ----- 5728209 in-flight CPI 1.5429 -- Total Cycles 8838220 ---- Thread 21 ---- PC 5: Stalled ----- 6148149 in-flight CPI 1.4375 -- Total Cycles 8838220 ---- Thread 22 ---- PC 5: Stalled ----- 5900197 in-flight CPI 1.4979 -- Total Cycles 8838220 ---- Thread 23 ---- PC 5: Stalled ----- 5878628 in-flight CPI 1.5034 -- Total Cycles 8838220 ---- Thread 24 ---- PC 5: Stalled ----- 5515330 in-flight CPI 1.6025 -- Total Cycles 8838220 ---- Thread 25 ---- PC 5: Stalled ----- 5780643 in-flight CPI 1.5289 -- Total Cycles 8838220 ---- Thread 26 ---- PC 5: Stalled ----- 6104846 in-flight CPI 1.4477 -- Total Cycles 8838220 ---- Thread 27 ---- PC 5: Stalled ----- 5545108 in-flight CPI 1.5939 -- Total Cycles 8838220 ---- Thread 28 ---- PC 5: Stalled ----- 5994905 in-flight CPI 1.4743 -- Total Cycles 8838220 ---- Thread 29 ---- PC 5: Stalled ----- 6384748 in-flight CPI 1.3843 -- Total Cycles 8838220 ---- Thread 30 ---- PC 5: Stalled ----- 6255085 in-flight CPI 1.4130 -- Total Cycles 8838220 ---- Thread 31 ---- PC 5: Stalled ----- 5878935 in-flight CPI 1.5034 -- Total Cycles 8838220 Total CPI 0.0454 , IPC 22.0452 -- Total Cycles 8838220 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 
435561 (2.070170%) FPSUB: 0 (0.000000%) FPMUL: 1988250 (9.449916%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14773100 (70.214791%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 557485 (2.649660%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) 
braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3277416 (15.577169%) DIV: 7592 (0.036084%) FPUN: 0 (0.000000%) FPRSUB: 465 (0.002210%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (213689050 total) ADD%: 8.178 (17475532) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.232 (2632452) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (1168331) FPSUB%: 0.000 (0) FPMUL%: 4.767 (10186961) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.952 (10582694) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41125) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2270564) FPLE%: 0.393 (839864) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27738) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.960 (6325691) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1598965) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (33692851) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2628410) ORI%: 1.267 (2706688) XORI%: 0.000 (0) MULI%: 3.357 (7174108) LW%: 1.191 (2544142) LWI%: 13.910 (29724617) lbu%: 
0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (643915) SWI%: 4.094 (8748474) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.478 (3159389) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.323 (690597) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (86997) bned%: 0.000 (0) bneid%: 13.715 (29307189) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.741 (1582489) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (186429) DIV%: 0.000 (412) FPUN%: 1.189 (2539704) FPRSUB%: 3.712 (7932691) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (0) FPGT%: 3.101 (6627405) FPGE%: 0.801 (1710619) SYNC%: 0.000 (0) NOP%: 8.821 (18848519) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 183 SUB 0 MUL 17 BITOR 10 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 538 FPSUB 0 FPMUL 5172 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 2321025 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 122 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 7 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1884 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2169 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3375147 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 818 ORI 595067 XORI 0 MULI 637360 LW 0 LWI 9454213 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1744 DIV 19 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.0452 --Total thread-cycles: 282823040 --total thread-cycles issued: 194840531 (68.891321%) --iCache 
conflicts: 6572329 (2.323831%) --thread*cycles of FU dependence: 16395930 (5.797240%) --thread*cycles of data dependence: 21039869 (7.439234%) --iCache cycles*banks: 282823040 (75.555755% used) Issue breakdown: --thread*cycles of issue worked: 194840531 (68.891322%) --thread*cycles of issue failed: 69133990 (24.444257%) --thread*cycles of issue NOP/other: 52427616075993 (18537250.740248%) Number of thread-cycles not ready: 21039869 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 213689050 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 7 3: 8 4: 8 5: 7 6: 7 7: 8 8: 7 9: 7 10: 7 11: 7 12: 8 13: 7 14: 8 15: 8 16: 8 17: 7 18: 9 19: 7 20: 8 21: 8 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 8 29: 7 30: 7 31: 7 <=== Core 6 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5926208 in-flight CPI 1.5201 -- Total Cycles 9008523 ---- Thread 01 ---- PC 5: Stalled ----- 6697969 in-flight CPI 1.3450 -- Total Cycles 9008523 ---- Thread 02 ---- PC 5: Stalled ----- 6704377 in-flight CPI 1.3437 -- Total Cycles 9008523 ---- Thread 03 ---- PC 5: Stalled ----- 7005895 in-flight CPI 1.2858 -- Total Cycles 9008523 ---- Thread 04 ---- PC 5: Stalled ----- 5882493 in-flight CPI 1.5314 -- Total Cycles 9008523 ---- Thread 05 ---- PC 5: Stalled ----- 6470175 in-flight CPI 1.3923 -- Total Cycles 9008523 ---- Thread 06 ---- PC 5: Stalled ----- 6689608 in-flight CPI 1.3466 -- Total Cycles 9008523 ---- Thread 07 ---- PC 5: Stalled ----- 6245613 in-flight CPI 1.4424 -- Total Cycles 9008523 ---- Thread 08 ---- PC 5: Stalled ----- 5954976 in-flight CPI 1.5128 -- Total Cycles 9008523 ---- Thread 09 ---- PC 5: Stalled ----- 5956483 in-flight CPI 1.5124 -- Total Cycles 9008523 ---- Thread 10 ---- PC 5: Stalled ----- 6516331 in-flight CPI 1.3824 -- Total Cycles 9008523 ---- Thread 11 ---- PC 5: Stalled ----- 6585728 in-flight CPI 1.3679 -- Total Cycles 9008523 ---- Thread 12 ---- PC 5: Stalled ----- 5817628 in-flight CPI 1.5485 -- Total Cycles 9008523 
---- Thread 13 ---- PC 5: Stalled ----- 6338715 in-flight CPI 1.4212 -- Total Cycles 9008523 ---- Thread 14 ---- PC 5: Stalled ----- 6211641 in-flight CPI 1.4503 -- Total Cycles 9008523 ---- Thread 15 ---- PC 5: Stalled ----- 6853183 in-flight CPI 1.3145 -- Total Cycles 9008523 ---- Thread 16 ---- PC 5: Stalled ----- 6286609 in-flight CPI 1.4330 -- Total Cycles 9008523 ---- Thread 17 ---- PC 5: Stalled ----- 5750052 in-flight CPI 1.5667 -- Total Cycles 9008523 ---- Thread 18 ---- PC 5: Stalled ----- 5879134 in-flight CPI 1.5323 -- Total Cycles 9008523 ---- Thread 19 ---- PC 5: Stalled ----- 5981965 in-flight CPI 1.5059 -- Total Cycles 9008523 ---- Thread 20 ---- PC 5: Stalled ----- 6315706 in-flight CPI 1.4264 -- Total Cycles 9008523 ---- Thread 21 ---- PC 5: Stalled ----- 5704913 in-flight CPI 1.5791 -- Total Cycles 9008523 ---- Thread 22 ---- PC 5: Stalled ----- 6308067 in-flight CPI 1.4281 -- Total Cycles 9008523 ---- Thread 23 ---- PC 5: Stalled ----- 5560757 in-flight CPI 1.6200 -- Total Cycles 9008523 ---- Thread 24 ---- PC 5: Stalled ----- 6417474 in-flight CPI 1.4037 -- Total Cycles 9008523 ---- Thread 25 ---- PC 5: Stalled ----- 6363913 in-flight CPI 1.4156 -- Total Cycles 9008523 ---- Thread 26 ---- PC 5: Stalled ----- 6229065 in-flight CPI 1.4462 -- Total Cycles 9008523 ---- Thread 27 ---- PC 5: Stalled ----- 5526561 in-flight CPI 1.6300 -- Total Cycles 9008523 ---- Thread 28 ---- PC 5: Stalled ----- 5781287 in-flight CPI 1.5582 -- Total Cycles 9008523 ---- Thread 29 ---- PC 5: Stalled ----- 6218494 in-flight CPI 1.4487 -- Total Cycles 9008523 ---- Thread 30 ---- PC 5: Stalled ----- 5415963 in-flight CPI 1.6633 -- Total Cycles 9008523 ---- Thread 31 ---- PC 5: Stalled ----- 6376618 in-flight CPI 1.4127 -- Total Cycles 9008523 Total CPI 0.0455 , IPC 21.9763 -- Total Cycles 9008523 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 
(0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 440447 (2.095315%) FPSUB: 0 (0.000000%) FPMUL: 2018654 (9.603235%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14661879 (69.750173%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 573477 (2.728171%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 
(0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3318031 (15.784691%) DIV: 7608 (0.036193%) FPUN: 0 (0.000000%) FPRSUB: 467 (0.002222%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217128710 total) ADD%: 8.183 (17766823) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.230 (2669982) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.545 (1182803) FPSUB%: 0.000 (0) FPMUL%: 4.762 (10338696) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.949 (10744842) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (42230) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2309028) FPLE%: 0.395 (857380) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28206) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.960 (6427101) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1622691) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (34235560) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) 
ANDI%: 1.230 (2670950) ORI%: 1.260 (2735742) XORI%: 0.000 (0) MULI%: 3.359 (7294195) LW%: 1.191 (2584978) LWI%: 13.921 (30225703) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (653271) SWI%: 4.096 (8893636) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3211210) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.322 (699848) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86345) bned%: 0.000 (0) bneid%: 13.718 (29786397) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.742 (1611757) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (188684) DIV%: 0.000 (412) FPUN%: 1.186 (2575333) FPRSUB%: 3.711 (8056863) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.105 (6741096) FPGE%: 0.796 (1728966) SYNC%: 0.000 (0) NOP%: 8.822 (19154491) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 169 SUB 0 MUL 18 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 557 FPSUB 0 FPMUL 5163 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 2334729 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 123 FPINV 0 FPCONV 12 FPEQ 0 FPNE 0 FPLT 5 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1988 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2231 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3430460 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 823 ORI 601455 XORI 0 MULI 649380 LW 0 LWI 9609248 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1736 DIV 8 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads 
Issuing each cycle: 21.9763 --Total thread-cycles: 288272736 --total thread-cycles issued: 197974219 (68.676014%) --iCache conflicts: 6626180 (2.298580%) --thread*cycles of FU dependence: 16638540 (5.771805%) --thread*cycles of data dependence: 21020563 (7.291901%) --iCache cycles*banks: 288272736 (75.320596% used) Issue breakdown: --thread*cycles of issue worked: 197974219 (68.676012%) --thread*cycles of issue failed: 71144026 (24.679415%) --thread*cycles of issue NOP/other: 4665173569760938359 (1618319385486.714100%) Number of thread-cycles not ready: 21020563 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217128710 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 8 3: 8 4: 7 5: 9 6: 8 7: 7 8: 7 9: 7 10: 8 11: 9 12: 7 13: 8 14: 7 15: 7 16: 7 17: 7 18: 7 19: 7 20: 7 21: 8 22: 8 23: 7 24: 8 25: 9 26: 7 27: 7 28: 6 29: 7 30: 6 31: 7 <=== Core 7 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5981976 in-flight CPI 1.5292 -- Total Cycles 9147449 ---- Thread 01 ---- PC 5: Stalled ----- 5818236 in-flight CPI 1.5722 -- Total Cycles 9147449 ---- Thread 02 ---- PC 5: Stalled ----- 6048107 in-flight CPI 1.5124 -- Total Cycles 9147449 ---- Thread 03 ---- PC 5: Stalled ----- 5915429 in-flight CPI 1.5464 -- Total Cycles 9147449 ---- Thread 04 ---- PC 5: Stalled ----- 6092076 in-flight CPI 1.5015 -- Total Cycles 9147449 ---- Thread 05 ---- PC 5: Stalled ----- 6943767 in-flight CPI 1.3174 -- Total Cycles 9147449 ---- Thread 06 ---- PC 5: Stalled ----- 6121458 in-flight CPI 1.4943 -- Total Cycles 9147449 ---- Thread 07 ---- PC 5: Stalled ----- 6257193 in-flight CPI 1.4619 -- Total Cycles 9147449 ---- Thread 08 ---- PC 5: Stalled ----- 6157465 in-flight CPI 1.4856 -- Total Cycles 9147449 ---- Thread 09 ---- PC 5: Stalled ----- 7073059 in-flight CPI 1.2933 -- Total Cycles 9147449 ---- Thread 10 ---- PC 5: Stalled ----- 5802574 in-flight CPI 1.5764 -- Total Cycles 9147449 ---- Thread 11 ---- PC 5: Stalled ----- 6690862 
in-flight CPI 1.3672 -- Total Cycles 9147449 ---- Thread 12 ---- PC 5: Stalled ----- 6605872 in-flight CPI 1.3847 -- Total Cycles 9147449 ---- Thread 13 ---- PC 5: Stalled ----- 6693965 in-flight CPI 1.3665 -- Total Cycles 9147449 ---- Thread 14 ---- PC 5: Stalled ----- 6867479 in-flight CPI 1.3320 -- Total Cycles 9147449 ---- Thread 15 ---- PC 5: Stalled ----- 6684213 in-flight CPI 1.3685 -- Total Cycles 9147449 ---- Thread 16 ---- PC 5: Stalled ----- 5970653 in-flight CPI 1.5321 -- Total Cycles 9147449 ---- Thread 17 ---- PC 5: Stalled ----- 7005741 in-flight CPI 1.3057 -- Total Cycles 9147449 ---- Thread 18 ---- PC 5: Stalled ----- 5812008 in-flight CPI 1.5739 -- Total Cycles 9147449 ---- Thread 19 ---- PC 5: Stalled ----- 6123071 in-flight CPI 1.4939 -- Total Cycles 9147449 ---- Thread 20 ---- PC 5: Stalled ----- 5617359 in-flight CPI 1.6284 -- Total Cycles 9147449 ---- Thread 21 ---- PC 5: Stalled ----- 5959414 in-flight CPI 1.5350 -- Total Cycles 9147449 ---- Thread 22 ---- PC 5: Stalled ----- 5644097 in-flight CPI 1.6207 -- Total Cycles 9147449 ---- Thread 23 ---- PC 5: Stalled ----- 6019672 in-flight CPI 1.5196 -- Total Cycles 9147449 ---- Thread 24 ---- PC 5: Stalled ----- 6258837 in-flight CPI 1.4615 -- Total Cycles 9147449 ---- Thread 25 ---- PC 5: Stalled ----- 6381201 in-flight CPI 1.4335 -- Total Cycles 9147449 ---- Thread 26 ---- PC 5: Stalled ----- 6588076 in-flight CPI 1.3885 -- Total Cycles 9147449 ---- Thread 27 ---- PC 5: Stalled ----- 5830767 in-flight CPI 1.5688 -- Total Cycles 9147449 ---- Thread 28 ---- PC 5: Stalled ----- 5457041 in-flight CPI 1.6763 -- Total Cycles 9147449 ---- Thread 29 ---- PC 5: Stalled ----- 5788386 in-flight CPI 1.5803 -- Total Cycles 9147449 ---- Thread 30 ---- PC 5: Stalled ----- 5753888 in-flight CPI 1.5898 -- Total Cycles 9147449 ---- Thread 31 ---- PC 5: Stalled ----- 5834238 in-flight CPI 1.5679 -- Total Cycles 9147449 Total CPI 0.0462 , IPC 21.6234 -- Total Cycles 9147449 kernel thread(called, cycles) 0 1 2 3 4 
5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 435263 (2.035228%) FPSUB: 0 (0.000000%) FPMUL: 2004943 (9.374828%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15092715 (70.571390%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 570694 (2.668484%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) 
bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3274693 (15.311999%) DIV: 7669 (0.035859%) FPUN: 0 (0.000000%) FPRSUB: 473 (0.002212%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216929576 total) ADD%: 8.175 (17733628) SUB%: 0.000 (0) MUL%: 0.000 (208) BITOR%: 1.228 (2663296) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (1169765) FPSUB%: 0.000 (0) FPMUL%: 4.745 (10292644) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (624) FPMAX%: 0.000 (624) LOAD%: 4.949 (10736250) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41974) FPINV%: 0.000 (0) FPCONV%: 0.000 (688) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (2301088) FPLE%: 0.393 (853578) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (624) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28066) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.967 (6437038) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1619632) CMPU%: 0.000 (0) RSUB%: 0.000 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.776 (34222387) ADDIC%: 0.000 (0) 
ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2672533) ORI%: 1.256 (2725249) XORI%: 0.000 (0) MULI%: 3.365 (7299199) LW%: 1.193 (2588868) LWI%: 13.932 (30222769) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (654022) SWI%: 4.102 (8899182) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3216512) bged%: 0.000 (0) bgeid%: 0.000 (208) bgtd%: 0.000 (0) bgtid%: 0.323 (700236) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86342) bned%: 0.000 (0) bneid%: 13.717 (29757162) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.744 (1612953) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (186169) DIV%: 0.000 (416) FPUN%: 1.185 (2571038) FPRSUB%: 3.706 (8039008) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (4) FPGT%: 3.105 (6735903) FPGE%: 0.797 (1728373) SYNC%: 0.000 (0) NOP%: 8.819 (19130772) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 177 SUB 0 MUL 9 BITOR 2 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 513 FPSUB 0 FPMUL 5147 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 2365716 INTCONV 0 ATOMIC_INC 3 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 102 FPINV 0 FPCONV 24 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1984 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2154 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3427979 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 825 ORI 593686 XORI 0 MULI 650006 LW 0 LWI 9604859 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 
brki 0 rtsd 0 FPDIV 1746 DIV 19 FPUN 0 FPRSUB 6 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6234 --Total thread-cycles: 292718368 --total thread-cycles issued: 197798804 (67.573074%) --iCache conflicts: 6628178 (2.264353%) --thread*cycles of FU dependence: 16655375 (5.689897%) --thread*cycles of data dependence: 21386450 (7.306152%) --iCache cycles*banks: 292718368 (74.108642% used) Issue breakdown: --thread*cycles of issue worked: 197798804 (67.573076%) --thread*cycles of issue failed: 75788792 (25.891369%) --thread*cycles of issue NOP/other: 19130772 (6.535556%) Number of thread-cycles not ready: 21386450 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216929576 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 7 5: 9 6: 7 7: 8 8: 7 9: 9 10: 7 11: 8 12: 7 13: 8 14: 9 15: 8 16: 7 17: 8 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 7 25: 8 26: 7 27: 8 28: 9 29: 7 30: 7 31: 7 <=== Core 8 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6219642 in-flight CPI 1.4602 -- Total Cycles 9082152 ---- Thread 01 ---- PC 5: Stalled ----- 5892575 in-flight CPI 1.5413 -- Total Cycles 9082152 ---- Thread 02 ---- PC 5: Stalled ----- 6539771 in-flight CPI 1.3888 -- Total Cycles 9082152 ---- Thread 03 ---- PC 5: Stalled ----- 6473631 in-flight CPI 1.4029 -- Total Cycles 9082152 ---- Thread 04 ---- PC 5: Stalled ----- 6184062 in-flight CPI 1.4686 -- Total Cycles 9082152 ---- Thread 05 ---- PC 5: Stalled ----- 6479203 in-flight CPI 1.4017 -- Total Cycles 9082152 ---- Thread 06 ---- PC 5: Stalled ----- 6499814 in-flight CPI 1.3973 -- Total Cycles 9082152 ---- Thread 07 ---- PC 5: Stalled ----- 6228542 in-flight CPI 1.4581 -- Total Cycles 9082152 ---- Thread 08 ---- PC 5: Stalled ----- 7119744 in-flight CPI 1.2756 -- Total Cycles 9082152 ---- Thread 09 ---- PC 5: Stalled ----- 5818881 in-flight CPI 1.5608 -- Total Cycles 9082152 ---- Thread 10 ---- PC 5: Stalled 
----- 6735304 in-flight CPI 1.3484 -- Total Cycles 9082152 ---- Thread 11 ---- PC 5: Stalled ----- 5929953 in-flight CPI 1.5316 -- Total Cycles 9082152 ---- Thread 12 ---- PC 5: Stalled ----- 6414940 in-flight CPI 1.4158 -- Total Cycles 9082152 ---- Thread 13 ---- PC 5: Stalled ----- 6290893 in-flight CPI 1.4437 -- Total Cycles 9082152 ---- Thread 14 ---- PC 5: Stalled ----- 5744771 in-flight CPI 1.5809 -- Total Cycles 9082152 ---- Thread 15 ---- PC 5: Stalled ----- 6566836 in-flight CPI 1.3830 -- Total Cycles 9082152 ---- Thread 16 ---- PC 5: Stalled ----- 5701045 in-flight CPI 1.5931 -- Total Cycles 9082152 ---- Thread 17 ---- PC 5: Stalled ----- 6430680 in-flight CPI 1.4123 -- Total Cycles 9082152 ---- Thread 18 ---- PC 5: Stalled ----- 5739214 in-flight CPI 1.5825 -- Total Cycles 9082152 ---- Thread 19 ---- PC 5: Stalled ----- 5951630 in-flight CPI 1.5260 -- Total Cycles 9082152 ---- Thread 20 ---- PC 5: Stalled ----- 5925895 in-flight CPI 1.5326 -- Total Cycles 9082152 ---- Thread 21 ---- PC 5: Stalled ----- 6799823 in-flight CPI 1.3356 -- Total Cycles 9082152 ---- Thread 22 ---- PC 5: Stalled ----- 6064988 in-flight CPI 1.4975 -- Total Cycles 9082152 ---- Thread 23 ---- PC 5: Stalled ----- 6264392 in-flight CPI 1.4498 -- Total Cycles 9082152 ---- Thread 24 ---- PC 5: Stalled ----- 6175397 in-flight CPI 1.4707 -- Total Cycles 9082152 ---- Thread 25 ---- PC 5: Stalled ----- 5692029 in-flight CPI 1.5956 -- Total Cycles 9082152 ---- Thread 26 ---- PC 5: Stalled ----- 6522101 in-flight CPI 1.3925 -- Total Cycles 9082152 ---- Thread 27 ---- PC 5: Stalled ----- 5861042 in-flight CPI 1.5496 -- Total Cycles 9082152 ---- Thread 28 ---- PC 5: Stalled ----- 5361145 in-flight CPI 1.6941 -- Total Cycles 9082152 ---- Thread 29 ---- PC 5: Stalled ----- 5944676 in-flight CPI 1.5278 -- Total Cycles 9082152 ---- Thread 30 ---- PC 5: Stalled ----- 5814960 in-flight CPI 1.5619 -- Total Cycles 9082152 ---- Thread 31 ---- PC 5: Stalled ----- 5722759 in-flight CPI 1.5870 -- Total 
Cycles 9082152 Total CPI 0.0461 , IPC 21.7031 -- Total Cycles 9082152 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 442301 (2.019561%) FPSUB: 0 (0.000000%) FPMUL: 2017267 (9.210906%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15563103 (71.061626%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 560000 (2.556978%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 
(0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3310421 (15.115488%) DIV: 7309 (0.033373%) FPUN: 0 (0.000000%) FPRSUB: 453 (0.002068%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216179148 total) ADD%: 8.174 (17669896) SUB%: 0.000 (0) MUL%: 0.000 (198) BITOR%: 1.222 (2641714) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (1182328) FPSUB%: 0.000 (0) FPMUL%: 4.776 (10324765) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (594) FPMAX%: 0.000 (594) LOAD%: 4.953 (10707634) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (230) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41226) FPINV%: 0.000 (0) FPCONV%: 0.000 (658) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2302185) FPLE%: 0.386 (835437) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (594) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27348) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.964 (6407486) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1621992) CMPU%: 0.000 (0) RSUB%: 0.000 (198) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) 
MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.763 (34076007) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2663535) ORI%: 1.265 (2734571) XORI%: 0.000 (0) MULI%: 3.361 (7264921) LW%: 1.192 (2576732) LWI%: 13.925 (30103385) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (651010) SWI%: 4.096 (8853875) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3201538) bged%: 0.000 (0) bgeid%: 0.000 (198) bgtd%: 0.000 (0) bgtid%: 0.323 (698420) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (87906) bned%: 0.000 (0) bneid%: 13.711 (29640124) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.735 (1588033) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (188427) DIV%: 0.000 (396) FPUN%: 1.178 (2546454) FPRSUB%: 3.718 (8036758) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.105 (6711747) FPGE%: 0.796 (1721721) SYNC%: 0.000 (0) NOP%: 8.821 (19068216) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 161 SUB 0 MUL 23 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 537 FPSUB 0 FPMUL 5303 FPCMPLT 0 FPMIN 0 FPMAX 387 LOAD 2365297 INTCONV 0 ATOMIC_INC 5 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 100 FPINV 0 FPCONV 12 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1743 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2351 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3419101 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 814 ORI 605109 XORI 0 MULI 640666 LW 0 LWI 9570294 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 
bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1767 DIV 11 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7031 --Total thread-cycles: 290628864 --total thread-cycles issued: 197110932 (67.822213%) --iCache conflicts: 6615554 (2.276289%) --thread*cycles of FU dependence: 16613723 (5.716474%) --thread*cycles of data dependence: 21900854 (7.535678%) --iCache cycles*banks: 290628864 (74.383245% used) Issue breakdown: --thread*cycles of issue worked: 197110932 (67.822215%) --thread*cycles of issue failed: 74449716 (25.616766%) --thread*cycles of issue NOP/other: 783785592753058048 (269686080716.696500%) Number of thread-cycles not ready: 21900854 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216179148 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 8 5: 7 6: 7 7: 7 8: 9 9: 7 10: 7 11: 7 12: 7 13: 7 14: 7 15: 7 16: 7 17: 8 18: 7 19: 7 20: 7 21: 8 22: 7 23: 7 24: 7 25: 7 26: 8 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 9 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5856613 in-flight CPI 1.5270 -- Total Cycles 8943295 ---- Thread 01 ---- PC 5: Stalled ----- 6939674 in-flight CPI 1.2887 -- Total Cycles 8943295 ---- Thread 02 ---- PC 5: Stalled ----- 5834857 in-flight CPI 1.5327 -- Total Cycles 8943295 ---- Thread 03 ---- PC 5: Stalled ----- 5869622 in-flight CPI 1.5237 -- Total Cycles 8943295 ---- Thread 04 ---- PC 5: Stalled ----- 5795004 in-flight CPI 1.5433 -- Total Cycles 8943295 ---- Thread 05 ---- PC 5: Stalled ----- 6798852 in-flight CPI 1.3154 -- Total Cycles 8943295 ---- Thread 06 ---- PC 5: Stalled ----- 6752018 in-flight CPI 1.3245 -- Total Cycles 8943295 ---- Thread 07 ---- PC 5: Stalled ----- 6440062 in-flight CPI 1.3887 -- Total Cycles 8943295 ---- Thread 08 ---- PC 5: Stalled ----- 5939119 in-flight CPI 1.5058 -- Total Cycles 8943295 
---- Thread 09 ---- PC 5: Stalled ----- 5862686 in-flight CPI 1.5255 -- Total Cycles 8943295 ---- Thread 10 ---- PC 5: Stalled ----- 5877971 in-flight CPI 1.5215 -- Total Cycles 8943295 ---- Thread 11 ---- PC 5: Stalled ----- 5905918 in-flight CPI 1.5143 -- Total Cycles 8943295 ---- Thread 12 ---- PC 5: Stalled ----- 5892205 in-flight CPI 1.5178 -- Total Cycles 8943295 ---- Thread 13 ---- PC 5: Stalled ----- 6564946 in-flight CPI 1.3623 -- Total Cycles 8943295 ---- Thread 14 ---- PC 5: Stalled ----- 6153113 in-flight CPI 1.4535 -- Total Cycles 8943295 ---- Thread 15 ---- PC 5: Stalled ----- 6248880 in-flight CPI 1.4312 -- Total Cycles 8943295 ---- Thread 16 ---- PC 5: Stalled ----- 5940732 in-flight CPI 1.5054 -- Total Cycles 8943295 ---- Thread 17 ---- PC 5: Stalled ----- 5909087 in-flight CPI 1.5135 -- Total Cycles 8943295 ---- Thread 18 ---- PC 5: Stalled ----- 6146948 in-flight CPI 1.4549 -- Total Cycles 8943295 ---- Thread 19 ---- PC 5: Stalled ----- 5973866 in-flight CPI 1.4971 -- Total Cycles 8943295 ---- Thread 20 ---- PC 5: Stalled ----- 6240794 in-flight CPI 1.4330 -- Total Cycles 8943295 ---- Thread 21 ---- PC 5: Stalled ----- 6671716 in-flight CPI 1.3405 -- Total Cycles 8943295 ---- Thread 22 ---- PC 5: Stalled ----- 5910013 in-flight CPI 1.5132 -- Total Cycles 8943295 ---- Thread 23 ---- PC 5: Stalled ----- 6332304 in-flight CPI 1.4123 -- Total Cycles 8943295 ---- Thread 24 ---- PC 5: Stalled ----- 5568629 in-flight CPI 1.6060 -- Total Cycles 8943295 ---- Thread 25 ---- PC 5: Stalled ----- 6060488 in-flight CPI 1.4757 -- Total Cycles 8943295 ---- Thread 26 ---- PC 5: Stalled ----- 6289567 in-flight CPI 1.4219 -- Total Cycles 8943295 ---- Thread 27 ---- PC 5: Stalled ----- 5457287 in-flight CPI 1.6388 -- Total Cycles 8943295 ---- Thread 28 ---- PC 5: Stalled ----- 6075551 in-flight CPI 1.4720 -- Total Cycles 8943295 ---- Thread 29 ---- PC 5: Stalled ----- 5987816 in-flight CPI 1.4936 -- Total Cycles 8943295 ---- Thread 30 ---- PC 5: Stalled ----- 
5624740 in-flight CPI 1.5900 -- Total Cycles 8943295 ---- Thread 31 ---- PC 5: Stalled ----- 5649993 in-flight CPI 1.5829 -- Total Cycles 8943295 Total CPI 0.0460 , IPC 21.7561 -- Total Cycles 8943295 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 446669 (2.064002%) FPSUB: 0 (0.000000%) FPMUL: 2012833 (9.301049%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15291439 (70.659825%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 553094 (2.555778%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 
(0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3329096 (15.383336%) DIV: 7342 (0.033926%) FPUN: 0 (0.000000%) FPRSUB: 451 (0.002084%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (213379176 total) ADD%: 8.177 (17448916) SUB%: 0.000 (0) MUL%: 0.000 (199) BITOR%: 1.225 (2613201) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.558 (1189843) FPSUB%: 0.000 (0) FPMUL%: 4.807 (10256404) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (597) FPMAX%: 0.000 (597) LOAD%: 4.958 (10580309) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40684) FPINV%: 0.000 (0) FPCONV%: 0.000 (661) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (2277930) FPLE%: 0.388 (827747) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (597) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27170) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.957 (6309476) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) 
ANDN%: 0.000 (0) CMP%: 0.752 (1605341) CMPU%: 0.000 (0) RSUB%: 0.000 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.754 (33616318) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2625661) ORI%: 1.275 (2720939) XORI%: 0.000 (0) MULI%: 3.354 (7155705) LW%: 1.189 (2537364) LWI%: 13.907 (29675481) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.300 (640994) SWI%: 4.091 (8729991) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.478 (3152683) bged%: 0.000 (0) bgeid%: 0.000 (199) bgtd%: 0.000 (0) bgtid%: 0.323 (688307) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (88302) bned%: 0.000 (0) bneid%: 13.700 (29232471) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.733 (1563988) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.089 (189603) DIV%: 0.000 (398) FPUN%: 1.179 (2516570) FPRSUB%: 3.725 (7947897) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.097 (6609175) FPGE%: 0.796 (1699423) SYNC%: 0.000 (0) NOP%: 8.814 (18807508) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 198 SUB 0 MUL 22 BITOR 1 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 516 FPSUB 0 FPMUL 5225 FPCMPLT 0 FPMIN 0 FPMAX 388 LOAD 2329359 INTCONV 0 ATOMIC_INC 12 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 119 FPINV 0 FPCONV 12 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1851 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2184 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3365879 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 
796 ORI 612717 XORI 0 MULI 632298 LW 0 LWI 9433226 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1812 DIV 28 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7561 --Total thread-cycles: 286185440 --total thread-cycles issued: 194571668 (67.987968%) --iCache conflicts: 6528054 (2.281057%) --thread*cycles of FU dependence: 16386666 (5.725891%) --thread*cycles of data dependence: 21640924 (7.561854%) --iCache cycles*banks: 286185440 (74.559771% used) Issue breakdown: --thread*cycles of issue worked: 194571668 (67.987969%) --thread*cycles of issue failed: 72806264 (25.440240%) --thread*cycles of issue NOP/other: 4620717599693404884 (1614588638644.022500%) Number of thread-cycles not ready: 21640924 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 213379176 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 7 3: 7 4: 7 5: 8 6: 8 7: 7 8: 7 9: 7 10: 7 11: 7 12: 7 13: 7 14: 8 15: 7 16: 7 17: 8 18: 7 19: 7 20: 9 21: 7 22: 7 23: 7 24: 6 25: 7 26: 7 27: 6 28: 7 29: 7 30: 6 31: 9 <=== Core 10 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6549015 in-flight CPI 1.3788 -- Total Cycles 9029839 ---- Thread 01 ---- PC 5: Stalled ----- 6510008 in-flight CPI 1.3871 -- Total Cycles 9029839 ---- Thread 02 ---- PC 5: Stalled ----- 6869466 in-flight CPI 1.3145 -- Total Cycles 9029839 ---- Thread 03 ---- PC 5: Stalled ----- 6171350 in-flight CPI 1.4632 -- Total Cycles 9029839 ---- Thread 04 ---- PC 5: Stalled ----- 6434580 in-flight CPI 1.4033 -- Total Cycles 9029839 ---- Thread 05 ---- PC 5: Stalled ----- 6113220 in-flight CPI 1.4771 -- Total Cycles 9029839 ---- Thread 06 ---- PC 5: Stalled ----- 6714563 in-flight CPI 1.3448 -- Total Cycles 9029839 ---- Thread 07 ---- PC 5: Stalled ----- 5874458 
in-flight CPI 1.5371 -- Total Cycles 9029839 ---- Thread 08 ---- PC 5: Stalled ----- 6521289 in-flight CPI 1.3847 -- Total Cycles 9029839 ---- Thread 09 ---- PC 5: Stalled ----- 6060002 in-flight CPI 1.4901 -- Total Cycles 9029839 ---- Thread 10 ---- PC 5: Stalled ----- 6151583 in-flight CPI 1.4679 -- Total Cycles 9029839 ---- Thread 11 ---- PC 5: Stalled ----- 5828905 in-flight CPI 1.5491 -- Total Cycles 9029839 ---- Thread 12 ---- PC 5: Stalled ----- 6407806 in-flight CPI 1.4092 -- Total Cycles 9029839 ---- Thread 13 ---- PC 5: Stalled ----- 6087049 in-flight CPI 1.4834 -- Total Cycles 9029839 ---- Thread 14 ---- PC 5: Stalled ----- 6766730 in-flight CPI 1.3344 -- Total Cycles 9029839 ---- Thread 15 ---- PC 5: Stalled ----- 6617053 in-flight CPI 1.3646 -- Total Cycles 9029839 ---- Thread 16 ---- PC 5: Stalled ----- 5714268 in-flight CPI 1.5802 -- Total Cycles 9029839 ---- Thread 17 ---- PC 5: Stalled ----- 5759385 in-flight CPI 1.5678 -- Total Cycles 9029839 ---- Thread 18 ---- PC 5: Stalled ----- 6379196 in-flight CPI 1.4155 -- Total Cycles 9029839 ---- Thread 19 ---- PC 5: Stalled ----- 6318622 in-flight CPI 1.4291 -- Total Cycles 9029839 ---- Thread 20 ---- PC 5: Stalled ----- 6249867 in-flight CPI 1.4448 -- Total Cycles 9029839 ---- Thread 21 ---- PC 5: Stalled ----- 5794485 in-flight CPI 1.5583 -- Total Cycles 9029839 ---- Thread 22 ---- PC 5: Stalled ----- 5624494 in-flight CPI 1.6054 -- Total Cycles 9029839 ---- Thread 23 ---- PC 5: Stalled ----- 5884064 in-flight CPI 1.5346 -- Total Cycles 9029839 ---- Thread 24 ---- PC 5: Stalled ----- 6550238 in-flight CPI 1.3785 -- Total Cycles 9029839 ---- Thread 25 ---- PC 5: Stalled ----- 6073521 in-flight CPI 1.4868 -- Total Cycles 9029839 ---- Thread 26 ---- PC 5: Stalled ----- 6182317 in-flight CPI 1.4606 -- Total Cycles 9029839 ---- Thread 27 ---- PC 5: Stalled ----- 5754820 in-flight CPI 1.5691 -- Total Cycles 9029839 ---- Thread 28 ---- PC 5: Stalled ----- 6535647 in-flight CPI 1.3816 -- Total Cycles 9029839 
---- Thread 29 ---- PC 5: Stalled ----- 5569377 in-flight CPI 1.6213 -- Total Cycles 9029839 ---- Thread 30 ---- PC 5: Stalled ----- 5907247 in-flight CPI 1.5286 -- Total Cycles 9029839 ---- Thread 31 ---- PC 5: Stalled ----- 5672292 in-flight CPI 1.5919 -- Total Cycles 9029839 Total CPI 0.0457 , IPC 21.8883 -- Total Cycles 9029839 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 434824 (2.054995%) FPSUB: 0 (0.000000%) FPMUL: 2002265 (9.462783%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14876670 (70.307724%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 564446 (2.667594%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) 
RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3273058 (15.468600%) DIV: 7636 (0.036088%) FPUN: 0 (0.000000%) FPRSUB: 469 (0.002217%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216753099 total) ADD%: 8.178 (17726186) SUB%: 0.000 (0) MUL%: 0.000 (207) BITOR%: 1.232 (2669665) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (1167258) FPSUB%: 0.000 (0) FPMUL%: 4.745 (10284237) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (621) FPMAX%: 0.000 (621) LOAD%: 4.949 (10726900) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41530) FPINV%: 0.000 (0) FPCONV%: 0.000 (685) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (2297423) FPLE%: 0.393 (851336) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (621) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 
(0) RAND%: 0.013 (27932) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.968 (6432457) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1619145) CMPU%: 0.000 (0) RSUB%: 0.000 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.774 (34190789) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2671403) ORI%: 1.260 (2732060) XORI%: 0.000 (0) MULI%: 3.365 (7292677) LW%: 1.193 (2586914) LWI%: 13.929 (30191805) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (653714) SWI%: 4.100 (8887631) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3213946) bged%: 0.000 (0) bgeid%: 0.000 (207) bgtd%: 0.000 (0) bgtid%: 0.323 (699669) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85716) bned%: 0.000 (0) bneid%: 13.717 (29731562) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.741 (1606358) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (186130) DIV%: 0.000 (414) FPUN%: 1.189 (2576664) FPRSUB%: 3.707 (8034030) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.101 (6722293) FPGE%: 0.801 (1736189) SYNC%: 0.000 (0) NOP%: 8.814 (19105561) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 175 SUB 0 MUL 20 BITOR 2 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 540 FPSUB 0 FPMUL 5307 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 2332637 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 96 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 5 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1810 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2222 CMPU 0 RSUB 0 RSUBC 0 
RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3424388 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 868 ORI 592959 XORI 0 MULI 654020 LW 0 LWI 9593418 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1687 DIV 16 FPUN 0 FPRSUB 2 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8883 --Total thread-cycles: 288954848 --total thread-cycles issued: 197647538 (68.400837%) --iCache conflicts: 6711986 (2.322849%) --thread*cycles of FU dependence: 16610595 (5.748509%) --thread*cycles of data dependence: 21159368 (7.322725%) --iCache cycles*banks: 288954848 (75.012803% used) Issue breakdown: --thread*cycles of issue worked: 197647538 (68.400838%) --thread*cycles of issue failed: 72201749 (24.987208%) --thread*cycles of issue NOP/other: 4684756438787262240 (1621276289777.724400%) Number of thread-cycles not ready: 21159368 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216753099 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 7 4: 7 5: 9 6: 8 7: 7 8: 8 9: 7 10: 8 11: 8 12: 8 13: 8 14: 8 15: 8 16: 7 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 8 28: 7 29: 7 30: 8 31: 7 <=== Core 11 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6580768 in-flight CPI 1.3652 -- Total Cycles 8983909 ---- Thread 01 ---- PC 5: Stalled ----- 5821684 in-flight CPI 1.5432 -- Total Cycles 8983909 ---- Thread 02 ---- PC 5: Stalled ----- 5877330 in-flight CPI 1.5286 -- Total Cycles 8983909 ---- Thread 03 ---- PC 5: Stalled ----- 5810979 in-flight CPI 1.5460 -- Total Cycles 8983909 ---- Thread 04 ---- PC 5: Stalled ----- 7096481 in-flight CPI 1.2660 -- Total Cycles 8983909 ---- Thread 05 ---- PC 5: Stalled ----- 5835655 in-flight CPI 1.5395 -- Total Cycles 8983909 ---- Thread 
06 ---- PC 5: Stalled ----- 5999086 in-flight CPI 1.4975 -- Total Cycles 8983909 ---- Thread 07 ---- PC 5: Stalled ----- 6727943 in-flight CPI 1.3353 -- Total Cycles 8983909 ---- Thread 08 ---- PC 5: Stalled ----- 6640307 in-flight CPI 1.3529 -- Total Cycles 8983909 ---- Thread 09 ---- PC 5: Stalled ----- 5782843 in-flight CPI 1.5535 -- Total Cycles 8983909 ---- Thread 10 ---- PC 5: Stalled ----- 6087496 in-flight CPI 1.4758 -- Total Cycles 8983909 ---- Thread 11 ---- PC 5: Stalled ----- 5958122 in-flight CPI 1.5078 -- Total Cycles 8983909 ---- Thread 12 ---- PC 5: Stalled ----- 6684984 in-flight CPI 1.3439 -- Total Cycles 8983909 ---- Thread 13 ---- PC 5: Stalled ----- 6255701 in-flight CPI 1.4361 -- Total Cycles 8983909 ---- Thread 14 ---- PC 5: Stalled ----- 6580264 in-flight CPI 1.3653 -- Total Cycles 8983909 ---- Thread 15 ---- PC 5: Stalled ----- 6206618 in-flight CPI 1.4475 -- Total Cycles 8983909 ---- Thread 16 ---- PC 5: Stalled ----- 5908313 in-flight CPI 1.5205 -- Total Cycles 8983909 ---- Thread 17 ---- PC 5: Stalled ----- 5778007 in-flight CPI 1.5548 -- Total Cycles 8983909 ---- Thread 18 ---- PC 5: Stalled ----- 5915951 in-flight CPI 1.5186 -- Total Cycles 8983909 ---- Thread 19 ---- PC 5: Stalled ----- 5904160 in-flight CPI 1.5216 -- Total Cycles 8983909 ---- Thread 20 ---- PC 5: Stalled ----- 5687979 in-flight CPI 1.5795 -- Total Cycles 8983909 ---- Thread 21 ---- PC 5: Stalled ----- 5682230 in-flight CPI 1.5810 -- Total Cycles 8983909 ---- Thread 22 ---- PC 5: Stalled ----- 6186936 in-flight CPI 1.4521 -- Total Cycles 8983909 ---- Thread 23 ---- PC 5: Stalled ----- 5882506 in-flight CPI 1.5272 -- Total Cycles 8983909 ---- Thread 24 ---- PC 5: Stalled ----- 5934561 in-flight CPI 1.5138 -- Total Cycles 8983909 ---- Thread 25 ---- PC 5: Stalled ----- 6432458 in-flight CPI 1.3966 -- Total Cycles 8983909 ---- Thread 26 ---- PC 5: Stalled ----- 6116261 in-flight CPI 1.4689 -- Total Cycles 8983909 ---- Thread 27 ---- PC 5: Stalled ----- 5455751 in-flight 
CPI 1.6467 -- Total Cycles 8983909 ---- Thread 28 ---- PC 5: Stalled ----- 6021418 in-flight CPI 1.4920 -- Total Cycles 8983909 ---- Thread 29 ---- PC 5: Stalled ----- 6096518 in-flight CPI 1.4736 -- Total Cycles 8983909 ---- Thread 30 ---- PC 5: Stalled ----- 5635442 in-flight CPI 1.5942 -- Total Cycles 8983909 ---- Thread 31 ---- PC 5: Stalled ----- 5833901 in-flight CPI 1.5399 -- Total Cycles 8983909 Total CPI 0.0462 , IPC 21.6408 -- Total Cycles 8983909 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 430784 (2.080472%) FPSUB: 0 (0.000000%) FPMUL: 1976672 (9.546339%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14469192 (69.878974%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 566851 (2.737607%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) 
ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3254309 (15.716688%) DIV: 7789 (0.037617%) FPUN: 0 (0.000000%) FPRSUB: 477 (0.002304%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (213233938 total) ADD%: 8.148 (17375021) SUB%: 0.000 (0) MUL%: 0.000 (211) BITOR%: 1.232 (2627045) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (1157519) FPSUB%: 0.000 (0) FPMUL%: 4.756 (10140656) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (633) FPMAX%: 0.000 (633) LOAD%: 4.949 (10552278) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.020 (41655) FPINV%: 0.000 (0) FPCONV%: 0.000 (697) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (2263782) FPLE%: 0.397 (845846) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (633) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 
(0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28028) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.962 (6315852) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (1589895) CMPU%: 0.000 (0) RSUB%: 0.000 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.773 (33632448) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2622155) ORI%: 1.262 (2690231) XORI%: 0.000 (0) MULI%: 3.361 (7167594) LW%: 1.191 (2540324) LWI%: 13.934 (29712738) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (641807) SWI%: 4.102 (8746177) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3156051) bged%: 0.000 (0) bgeid%: 0.000 (211) bgtd%: 0.000 (0) bgtid%: 0.322 (686594) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86039) bned%: 0.000 (0) bneid%: 13.720 (29255307) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.744 (1586229) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (185013) DIV%: 0.000 (422) FPUN%: 1.190 (2536705) FPRSUB%: 3.710 (7911692) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.104 (6618906) FPGE%: 0.798 (1701708) SYNC%: 0.000 (0) NOP%: 8.823 (18814652) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 160 SUB 0 MUL 11 BITOR 3 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 510 FPSUB 0 FPMUL 5176 FPCMPLT 0 FPMIN 0 FPMAX 415 LOAD 2309421 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 97 FPINV 0 FPCONV 16 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2017 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 
MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 2 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2194 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3368175 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 768 ORI 587972 XORI 0 MULI 641473 LW 0 LWI 9444980 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1737 DIV 20 FPUN 0 FPRSUB 1 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6408 --Total thread-cycles: 287485088 --total thread-cycles issued: 194419286 (67.627605%) --iCache conflicts: 6593215 (2.293411%) --thread*cycles of FU dependence: 16365188 (5.692535%) --thread*cycles of data dependence: 20706074 (7.202486%) --iCache cycles*banks: 287485088 (74.172185% used) Issue breakdown: --thread*cycles of issue worked: 194419286 (67.627607%) --thread*cycles of issue failed: 74251150 (25.827827%) --thread*cycles of issue NOP/other: 12355074741180092 (4297640210.535056%) Number of thread-cycles not ready: 20706074 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 213233938 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 7 4: 9 5: 8 6: 8 7: 8 8: 8 9: 8 10: 8 11: 7 12: 8 13: 7 14: 8 15: 7 16: 7 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 7 25: 8 26: 8 27: 8 28: 7 29: 10 30: 7 31: 7 <=== Core 12 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5864197 in-flight CPI 1.5126 -- Total Cycles 8870422 ---- Thread 01 ---- PC 5: Stalled ----- 6418724 in-flight CPI 1.3820 -- Total Cycles 8870422 ---- Thread 02 ---- PC 5: Stalled ----- 6011465 in-flight CPI 1.4756 -- Total Cycles 8870422 ---- Thread 03 ---- PC 5: Stalled ----- 5846449 in-flight CPI 1.5172 -- Total Cycles 8870422 ---- Thread 04 ---- PC 5: Stalled ----- 6221586 in-flight CPI 1.4257 -- 
Total Cycles 8870422 ---- Thread 05 ---- PC 5: Stalled ----- 6715235 in-flight CPI 1.3209 -- Total Cycles 8870422 ---- Thread 06 ---- PC 5: Stalled ----- 6205557 in-flight CPI 1.4294 -- Total Cycles 8870422 ---- Thread 07 ---- PC 5: Stalled ----- 5839238 in-flight CPI 1.5191 -- Total Cycles 8870422 ---- Thread 08 ---- PC 5: Stalled ----- 6473383 in-flight CPI 1.3703 -- Total Cycles 8870422 ---- Thread 09 ---- PC 5: Stalled ----- 5890242 in-flight CPI 1.5059 -- Total Cycles 8870422 ---- Thread 10 ---- PC 5: Stalled ----- 6091867 in-flight CPI 1.4561 -- Total Cycles 8870422 ---- Thread 11 ---- PC 5: Stalled ----- 6201554 in-flight CPI 1.4304 -- Total Cycles 8870422 ---- Thread 12 ---- PC 5: Stalled ----- 6149823 in-flight CPI 1.4424 -- Total Cycles 8870422 ---- Thread 13 ---- PC 5: Stalled ----- 6153512 in-flight CPI 1.4415 -- Total Cycles 8870422 ---- Thread 14 ---- PC 5: Stalled ----- 6056097 in-flight CPI 1.4647 -- Total Cycles 8870422 ---- Thread 15 ---- PC 5: Stalled ----- 6493645 in-flight CPI 1.3660 -- Total Cycles 8870422 ---- Thread 16 ---- PC 5: Stalled ----- 5721697 in-flight CPI 1.5503 -- Total Cycles 8870422 ---- Thread 17 ---- PC 5: Stalled ----- 6162149 in-flight CPI 1.4395 -- Total Cycles 8870422 ---- Thread 18 ---- PC 5: Stalled ----- 6056318 in-flight CPI 1.4647 -- Total Cycles 8870422 ---- Thread 19 ---- PC 5: Stalled ----- 6728007 in-flight CPI 1.3184 -- Total Cycles 8870422 ---- Thread 20 ---- PC 5: Stalled ----- 5941260 in-flight CPI 1.4930 -- Total Cycles 8870422 ---- Thread 21 ---- PC 5: Stalled ----- 5735359 in-flight CPI 1.5466 -- Total Cycles 8870422 ---- Thread 22 ---- PC 5: Stalled ----- 6440833 in-flight CPI 1.3772 -- Total Cycles 8870422 ---- Thread 23 ---- PC 5: Stalled ----- 5578713 in-flight CPI 1.5900 -- Total Cycles 8870422 ---- Thread 24 ---- PC 5: Stalled ----- 5819701 in-flight CPI 1.5242 -- Total Cycles 8870422 ---- Thread 25 ---- PC 5: Stalled ----- 6479071 in-flight CPI 1.3691 -- Total Cycles 8870422 ---- Thread 26 ---- PC 5: 
Stalled ----- 6050391 in-flight CPI 1.4661 -- Total Cycles 8870422 ---- Thread 27 ---- PC 5: Stalled ----- 5782371 in-flight CPI 1.5340 -- Total Cycles 8870422 ---- Thread 28 ---- PC 5: Stalled ----- 5603356 in-flight CPI 1.5831 -- Total Cycles 8870422 ---- Thread 29 ---- PC 5: Stalled ----- 5941138 in-flight CPI 1.4930 -- Total Cycles 8870422 ---- Thread 30 ---- PC 5: Stalled ----- 5509018 in-flight CPI 1.6102 -- Total Cycles 8870422 ---- Thread 31 ---- PC 5: Stalled ----- 5779085 in-flight CPI 1.5349 -- Total Cycles 8870422 Total CPI 0.0457 , IPC 21.8661 -- Total Cycles 8870422 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 423025 (2.003222%) FPSUB: 0 (0.000000%) FPMUL: 1958471 (9.274279%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14981969 (70.946648%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 558681 (2.645616%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) 
RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3187294 (15.093332%) DIV: 7341 (0.034763%) FPUN: 0 (0.000000%) FPRSUB: 452 (0.002140%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (212715213 total) ADD%: 8.181 (17402371) SUB%: 0.000 (0) MUL%: 0.000 (199) BITOR%: 1.227 (2610281) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.535 (1138776) FPSUB%: 0.000 (0) FPMUL%: 4.736 (10075112) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (597) FPMAX%: 0.000 (597) LOAD%: 4.947 (10522201) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41132) FPINV%: 0.000 (0) FPCONV%: 0.000 (661) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.059 (2252866) FPLE%: 0.393 (835782) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 
0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (597) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27404) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.970 (6317330) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (1587484) CMPU%: 0.000 (0) RSUB%: 0.000 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.777 (33560091) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2622923) ORI%: 1.253 (2665885) XORI%: 0.000 (0) MULI%: 3.367 (7161928) LW%: 1.194 (2540668) LWI%: 13.941 (29653895) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (642017) SWI%: 4.105 (8731806) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.484 (3156422) bged%: 0.000 (0) bgeid%: 0.000 (199) bgtd%: 0.000 (0) bgtid%: 0.323 (687099) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (82885) bned%: 0.000 (0) bneid%: 13.717 (29178143) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.743 (1580700) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.085 (181178) DIV%: 0.000 (398) FPUN%: 1.184 (2519585) FPRSUB%: 3.705 (7880819) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.106 (6606560) FPGE%: 0.797 (1694520) SYNC%: 0.000 (0) NOP%: 8.816 (18753575) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 171 SUB 0 MUL 18 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 508 FPSUB 0 FPMUL 5040 FPCMPLT 0 FPMIN 0 FPMAX 388 LOAD 2347044 INTCONV 0 ATOMIC_INC 8 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 112 FPINV 0 FPCONV 20 FPEQ 
0 FPNE 0 FPLT 9 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1978 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2169 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3366210 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 759 ORI 575940 XORI 0 MULI 641604 LW 0 LWI 9422786 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1685 DIV 19 FPUN 0 FPRSUB 2 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8661 --Total thread-cycles: 283853504 --total thread-cycles issued: 193961638 (68.331597%) --iCache conflicts: 6563450 (2.312267%) --thread*cycles of FU dependence: 16366488 (5.765822%) --thread*cycles of data dependence: 21117233 (7.439483%) --iCache cycles*banks: 283853504 (74.938390% used) Issue breakdown: --thread*cycles of issue worked: 193961638 (68.331599%) --thread*cycles of issue failed: 71138291 (25.061622%) --thread*cycles of issue NOP/other: 18753575 (6.606779%) Number of thread-cycles not ready: 21117233 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 212715213 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 7 4: 8 5: 8 6: 7 7: 7 8: 8 9: 7 10: 7 11: 7 12: 7 13: 8 14: 7 15: 8 16: 7 17: 8 18: 7 19: 9 20: 7 21: 7 22: 7 23: 6 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 6 <=== Core 13 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5872060 in-flight CPI 1.5456 -- Total Cycles 9076074 ---- Thread 01 ---- PC 5: Stalled ----- 6396585 in-flight CPI 1.4189 -- Total Cycles 9076074 ---- Thread 02 ---- PC 5: Stalled ----- 6531956 in-flight CPI 1.3895 -- Total Cycles 9076074 ---- Thread 03 ---- PC 5: Stalled ----- 5944885 in-flight CPI 
1.5267 -- Total Cycles 9076074 ---- Thread 04 ---- PC 5: Stalled ----- 6298909 in-flight CPI 1.4409 -- Total Cycles 9076074 ---- Thread 05 ---- PC 5: Stalled ----- 6139677 in-flight CPI 1.4783 -- Total Cycles 9076074 ---- Thread 06 ---- PC 5: Stalled ----- 5958294 in-flight CPI 1.5233 -- Total Cycles 9076074 ---- Thread 07 ---- PC 5: Stalled ----- 5872118 in-flight CPI 1.5456 -- Total Cycles 9076074 ---- Thread 08 ---- PC 5: Stalled ----- 5766733 in-flight CPI 1.5739 -- Total Cycles 9076074 ---- Thread 09 ---- PC 5: Stalled ----- 6489028 in-flight CPI 1.3987 -- Total Cycles 9076074 ---- Thread 10 ---- PC 5: Stalled ----- 6176179 in-flight CPI 1.4695 -- Total Cycles 9076074 ---- Thread 11 ---- PC 5: Stalled ----- 6260945 in-flight CPI 1.4496 -- Total Cycles 9076074 ---- Thread 12 ---- PC 5: Stalled ----- 6710948 in-flight CPI 1.3524 -- Total Cycles 9076074 ---- Thread 13 ---- PC 5: Stalled ----- 6091545 in-flight CPI 1.4899 -- Total Cycles 9076074 ---- Thread 14 ---- PC 5: Stalled ----- 6775540 in-flight CPI 1.3395 -- Total Cycles 9076074 ---- Thread 15 ---- PC 5: Stalled ----- 5938943 in-flight CPI 1.5282 -- Total Cycles 9076074 ---- Thread 16 ---- PC 5: Stalled ----- 6182135 in-flight CPI 1.4681 -- Total Cycles 9076074 ---- Thread 17 ---- PC 5: Stalled ----- 6132723 in-flight CPI 1.4799 -- Total Cycles 9076074 ---- Thread 18 ---- PC 5: Stalled ----- 6374895 in-flight CPI 1.4237 -- Total Cycles 9076074 ---- Thread 19 ---- PC 5: Stalled ----- 5934793 in-flight CPI 1.5293 -- Total Cycles 9076074 ---- Thread 20 ---- PC 5: Stalled ----- 5712124 in-flight CPI 1.5889 -- Total Cycles 9076074 ---- Thread 21 ---- PC 5: Stalled ----- 6626946 in-flight CPI 1.3696 -- Total Cycles 9076074 ---- Thread 22 ---- PC 5: Stalled ----- 6338639 in-flight CPI 1.4319 -- Total Cycles 9076074 ---- Thread 23 ---- PC 5: Stalled ----- 5970711 in-flight CPI 1.5201 -- Total Cycles 9076074 ---- Thread 24 ---- PC 5: Stalled ----- 5606210 in-flight CPI 1.6189 -- Total Cycles 9076074 ---- Thread 25 
---- PC 5: Stalled ----- 6417961 in-flight CPI 1.4142 -- Total Cycles 9076074 ---- Thread 26 ---- PC 5: Stalled ----- 5490725 in-flight CPI 1.6530 -- Total Cycles 9076074 ---- Thread 27 ---- PC 5: Stalled ----- 6661912 in-flight CPI 1.3624 -- Total Cycles 9076074 ---- Thread 28 ---- PC 5: Stalled ----- 5445836 in-flight CPI 1.6666 -- Total Cycles 9076074 ---- Thread 29 ---- PC 5: Stalled ----- 5422443 in-flight CPI 1.6738 -- Total Cycles 9076074 ---- Thread 30 ---- PC 5: Stalled ----- 5392114 in-flight CPI 1.6832 -- Total Cycles 9076074 ---- Thread 31 ---- PC 5: Stalled ----- 5792954 in-flight CPI 1.5667 -- Total Cycles 9076074 Total CPI 0.0466 , IPC 21.4551 -- Total Cycles 9076074 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 437606 (2.031308%) FPSUB: 0 (0.000000%) FPMUL: 1992052 (9.246838%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15262503 (70.846489%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 561069 (2.604407%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 
(0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3281829 (15.233809%) DIV: 7535 (0.034976%) FPUN: 0 (0.000000%) FPRSUB: 468 (0.002172%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (213545521 total) ADD%: 8.184 (17477483) SUB%: 0.000 (0) MUL%: 0.000 (204) BITOR%: 1.224 (2614641) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (1171383) FPSUB%: 0.000 (0) FPMUL%: 4.774 (10195075) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (612) FPMAX%: 0.000 (612) LOAD%: 4.954 (10579521) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41157) FPINV%: 0.000 (0) FPCONV%: 0.000 (676) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) 
FPLT%: 1.064 (2271757) FPLE%: 0.389 (830197) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (612) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27324) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.964 (6329537) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1602526) CMPU%: 0.000 (0) RSUB%: 0.000 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.762 (33659182) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2631752) ORI%: 1.266 (2703275) XORI%: 0.000 (0) MULI%: 3.360 (7175315) LW%: 1.192 (2545510) LWI%: 13.924 (29735089) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (642156) SWI%: 4.100 (8756027) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3163928) bged%: 0.000 (0) bgeid%: 0.000 (204) bgtd%: 0.000 (0) bgtid%: 0.322 (688321) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86182) bned%: 0.000 (0) bneid%: 13.704 (29264399) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.736 (1572282) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (186769) DIV%: 0.000 (408) FPUN%: 1.180 (2520030) FPRSUB%: 3.715 (7932702) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.100 (6620258) FPGE%: 0.796 (1700435) SYNC%: 0.000 (0) NOP%: 8.812 (18817443) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 187 SUB 0 MUL 30 BITOR 2 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 493 FPSUB 0 FPMUL 5333 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 2338941 INTCONV 0 
ATOMIC_INC 8 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 109 FPINV 0 FPCONV 20 FPEQ 0 FPNE 0 FPLT 5 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1926 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2220 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3373236 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 840 ORI 598325 XORI 0 MULI 635776 LW 0 LWI 9448280 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1803 DIV 14 FPUN 0 FPRSUB 6 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.4551 --Total thread-cycles: 290434368 --total thread-cycles issued: 194728078 (67.047189%) --iCache conflicts: 6526846 (2.247271%) --thread*cycles of FU dependence: 16407961 (5.649456%) --thread*cycles of data dependence: 21543062 (7.417532%) --iCache cycles*banks: 290434368 (73.526268% used) Issue breakdown: --thread*cycles of issue worked: 194728078 (67.047188%) --thread*cycles of issue failed: 76888847 (26.473743%) --thread*cycles of issue NOP/other: 39671057 (13.659216%) Number of thread-cycles not ready: 21543062 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 213545521 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 8 3: 7 4: 8 5: 7 6: 7 7: 9 8: 8 9: 8 10: 7 11: 8 12: 7 13: 7 14: 7 15: 7 16: 7 17: 7 18: 9 19: 7 20: 7 21: 7 22: 8 23: 7 24: 6 25: 9 26: 6 27: 7 28: 6 29: 8 30: 6 31: 7 <=== Core 14 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6119150 in-flight CPI 1.4677 -- Total Cycles 8981091 ---- Thread 01 ---- PC 5: Stalled ----- 6815818 in-flight CPI 1.3177 -- Total Cycles 8981091 ---- Thread 02 ---- PC 5: Stalled ----- 
6796628 in-flight CPI 1.3214 -- Total Cycles 8981091 ---- Thread 03 ---- PC 5: Stalled ----- 6097923 in-flight CPI 1.4728 -- Total Cycles 8981091 ---- Thread 04 ---- PC 5: Stalled ----- 5893435 in-flight CPI 1.5239 -- Total Cycles 8981091 ---- Thread 05 ---- PC 5: Stalled ----- 6060270 in-flight CPI 1.4820 -- Total Cycles 8981091 ---- Thread 06 ---- PC 5: Stalled ----- 6983263 in-flight CPI 1.2861 -- Total Cycles 8981091 ---- Thread 07 ---- PC 5: Stalled ----- 6389728 in-flight CPI 1.4055 -- Total Cycles 8981091 ---- Thread 08 ---- PC 5: Stalled ----- 6195260 in-flight CPI 1.4497 -- Total Cycles 8981091 ---- Thread 09 ---- PC 5: Stalled ----- 6470555 in-flight CPI 1.3880 -- Total Cycles 8981091 ---- Thread 10 ---- PC 5: Stalled ----- 5789227 in-flight CPI 1.5513 -- Total Cycles 8981091 ---- Thread 11 ---- PC 5: Stalled ----- 6022274 in-flight CPI 1.4913 -- Total Cycles 8981091 ---- Thread 12 ---- PC 5: Stalled ----- 5940984 in-flight CPI 1.5117 -- Total Cycles 8981091 ---- Thread 13 ---- PC 5: Stalled ----- 6241630 in-flight CPI 1.4389 -- Total Cycles 8981091 ---- Thread 14 ---- PC 5: Stalled ----- 6173996 in-flight CPI 1.4547 -- Total Cycles 8981091 ---- Thread 15 ---- PC 5: Stalled ----- 6583174 in-flight CPI 1.3642 -- Total Cycles 8981091 ---- Thread 16 ---- PC 5: Stalled ----- 6294454 in-flight CPI 1.4268 -- Total Cycles 8981091 ---- Thread 17 ---- PC 5: Stalled ----- 6323990 in-flight CPI 1.4202 -- Total Cycles 8981091 ---- Thread 18 ---- PC 5: Stalled ----- 6739084 in-flight CPI 1.3327 -- Total Cycles 8981091 ---- Thread 19 ---- PC 5: Stalled ----- 6392084 in-flight CPI 1.4050 -- Total Cycles 8981091 ---- Thread 20 ---- PC 5: Stalled ----- 6287001 in-flight CPI 1.4285 -- Total Cycles 8981091 ---- Thread 21 ---- PC 5: Stalled ----- 6519243 in-flight CPI 1.3776 -- Total Cycles 8981091 ---- Thread 22 ---- PC 5: Stalled ----- 5516352 in-flight CPI 1.6281 -- Total Cycles 8981091 ---- Thread 23 ---- PC 5: Stalled ----- 6053899 in-flight CPI 1.4835 -- Total Cycles 
8981091 ---- Thread 24 ---- PC 5: Stalled ----- 5703206 in-flight CPI 1.5747 -- Total Cycles 8981091 ---- Thread 25 ---- PC 5: Stalled ----- 6480459 in-flight CPI 1.3859 -- Total Cycles 8981091 ---- Thread 26 ---- PC 5: Stalled ----- 6325392 in-flight CPI 1.4198 -- Total Cycles 8981091 ---- Thread 27 ---- PC 5: Stalled ----- 5762576 in-flight CPI 1.5585 -- Total Cycles 8981091 ---- Thread 28 ---- PC 5: Stalled ----- 5694680 in-flight CPI 1.5771 -- Total Cycles 8981091 ---- Thread 29 ---- PC 5: Stalled ----- 5370476 in-flight CPI 1.6723 -- Total Cycles 8981091 ---- Thread 30 ---- PC 5: Stalled ----- 5830600 in-flight CPI 1.5403 -- Total Cycles 8981091 ---- Thread 31 ---- PC 5: Stalled ----- 6321565 in-flight CPI 1.4207 -- Total Cycles 8981091 Total CPI 0.0453 , IPC 22.0674 -- Total Cycles 8981091 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 445055 (2.063897%) FPSUB: 0 (0.000000%) FPMUL: 2029310 (9.410719%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15192809 (70.455107%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 564585 (2.618206%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) 
COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3324176 (15.415528%) DIV: 7423 (0.034423%) FPUN: 0 (0.000000%) FPRSUB: 457 (0.002119%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217340258 total) ADD%: 8.197 (17816092) SUB%: 0.000 (0) MUL%: 0.000 (201) BITOR%: 1.226 (2664976) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.548 (1190312) FPSUB%: 0.000 (0) FPMUL%: 4.775 (10378859) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (603) FPMAX%: 0.000 (603) LOAD%: 4.952 (10763128) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 
0.000 (0) FPINVSQRT%: 0.019 (41469) FPINV%: 0.000 (0) FPCONV%: 0.000 (667) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (2311659) FPLE%: 0.389 (845640) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (603) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27470) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.963 (6439236) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (1631221) CMPU%: 0.000 (0) RSUB%: 0.000 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.763 (34258732) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2678307) ORI%: 1.266 (2751403) XORI%: 0.000 (0) MULI%: 3.359 (7299471) LW%: 1.191 (2589472) LWI%: 13.917 (30246586) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (653434) SWI%: 4.095 (8899751) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3218440) bged%: 0.000 (0) bgeid%: 0.000 (201) bgtd%: 0.000 (0) bgtid%: 0.322 (700652) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86585) bned%: 0.000 (0) bneid%: 13.706 (29788775) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.737 (1601498) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (189219) DIV%: 0.000 (402) FPUN%: 1.181 (2567687) FPRSUB%: 3.715 (8074753) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.100 (6737573) FPGE%: 0.797 (1732767) SYNC%: 0.000 (0) NOP%: 8.812 (19151279) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 180 SUB 0 MUL 24 BITOR 3 BITAND 
0 BITSLEFT 0 BITSRIGHT 0 FPADD 494 FPSUB 0 FPMUL 5040 FPCMPLT 0 FPMIN 0 FPMAX 387 LOAD 2365476 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 114 FPINV 0 FPCONV 23 FPEQ 0 FPNE 0 FPLT 3 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1734 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2240 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3432281 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 803 ORI 609570 XORI 0 MULI 646329 LW 0 LWI 9613536 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1686 DIV 21 FPUN 0 FPRSUB 7 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.0674 --Total thread-cycles: 287394912 --total thread-cycles issued: 198188979 (68.960503%) --iCache conflicts: 6631729 (2.307532%) --thread*cycles of FU dependence: 16679983 (5.803855%) --thread*cycles of data dependence: 21563815 (7.503200%) --iCache cycles*banks: 287394912 (75.624265% used) Issue breakdown: --thread*cycles of issue worked: 198188979 (68.960504%) --thread*cycles of issue failed: 70054654 (24.375746%) --thread*cycles of issue NOP/other: 19151279 (6.663750%) Number of thread-cycles not ready: 21563815 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217340258 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 8 4: 7 5: 7 6: 7 7: 8 8: 8 9: 8 10: 8 11: 7 12: 7 13: 7 14: 7 15: 9 16: 7 17: 7 18: 7 19: 7 20: 7 21: 8 22: 6 23: 7 24: 7 25: 7 26: 8 27: 7 28: 7 29: 6 30: 7 31: 8 <=== Core 15 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5960723 in-flight CPI 1.4446 -- Total Cycles 8610777 ---- Thread 01 ---- PC 5: 
Stalled ----- 5756565 in-flight CPI 1.4958 -- Total Cycles 8610777 ---- Thread 02 ---- PC 5: Stalled ----- 5927149 in-flight CPI 1.4528 -- Total Cycles 8610777 ---- Thread 03 ---- PC 5: Stalled ----- 6687032 in-flight CPI 1.2877 -- Total Cycles 8610777 ---- Thread 04 ---- PC 5: Stalled ----- 5881798 in-flight CPI 1.4640 -- Total Cycles 8610777 ---- Thread 05 ---- PC 5: Stalled ----- 5810681 in-flight CPI 1.4819 -- Total Cycles 8610777 ---- Thread 06 ---- PC 5: Stalled ----- 6227088 in-flight CPI 1.3828 -- Total Cycles 8610777 ---- Thread 07 ---- PC 5: Stalled ----- 6294417 in-flight CPI 1.3680 -- Total Cycles 8610777 ---- Thread 08 ---- PC 5: Stalled ----- 6454631 in-flight CPI 1.3340 -- Total Cycles 8610777 ---- Thread 09 ---- PC 5: Stalled ----- 5877474 in-flight CPI 1.4650 -- Total Cycles 8610777 ---- Thread 10 ---- PC 5: Stalled ----- 5996765 in-flight CPI 1.4359 -- Total Cycles 8610777 ---- Thread 11 ---- PC 5: Stalled ----- 5901150 in-flight CPI 1.4592 -- Total Cycles 8610777 ---- Thread 12 ---- PC 5: Stalled ----- 6410129 in-flight CPI 1.3433 -- Total Cycles 8610777 ---- Thread 13 ---- PC 5: Stalled ----- 5708474 in-flight CPI 1.5084 -- Total Cycles 8610777 ---- Thread 14 ---- PC 5: Stalled ----- 6215081 in-flight CPI 1.3855 -- Total Cycles 8610777 ---- Thread 15 ---- PC 5: Stalled ----- 5710083 in-flight CPI 1.5080 -- Total Cycles 8610777 ---- Thread 16 ---- PC 5: Stalled ----- 5995382 in-flight CPI 1.4362 -- Total Cycles 8610777 ---- Thread 17 ---- PC 5: Stalled ----- 5949857 in-flight CPI 1.4472 -- Total Cycles 8610777 ---- Thread 18 ---- PC 5: Stalled ----- 6211982 in-flight CPI 1.3862 -- Total Cycles 8610777 ---- Thread 19 ---- PC 5: Stalled ----- 6172380 in-flight CPI 1.3950 -- Total Cycles 8610777 ---- Thread 20 ---- PC 5: Stalled ----- 6395063 in-flight CPI 1.3465 -- Total Cycles 8610777 ---- Thread 21 ---- PC 5: Stalled ----- 5557177 in-flight CPI 1.5495 -- Total Cycles 8610777 ---- Thread 22 ---- PC 5: Stalled ----- 6005656 in-flight CPI 1.4338 -- 
Total Cycles 8610777 ---- Thread 23 ---- PC 5: Stalled ----- 5612798 in-flight CPI 1.5341 -- Total Cycles 8610777 ---- Thread 24 ---- PC 5: Stalled ----- 6032768 in-flight CPI 1.4273 -- Total Cycles 8610777 ---- Thread 25 ---- PC 5: Stalled ----- 6334740 in-flight CPI 1.3593 -- Total Cycles 8610777 ---- Thread 26 ---- PC 5: Stalled ----- 6126723 in-flight CPI 1.4054 -- Total Cycles 8610777 ---- Thread 27 ---- PC 5: Stalled ----- 6116469 in-flight CPI 1.4078 -- Total Cycles 8610777 ---- Thread 28 ---- PC 5: Stalled ----- 5368832 in-flight CPI 1.6038 -- Total Cycles 8610777 ---- Thread 29 ---- PC 5: Stalled ----- 6016928 in-flight CPI 1.4311 -- Total Cycles 8610777 ---- Thread 30 ---- PC 5: Stalled ----- 6078598 in-flight CPI 1.4166 -- Total Cycles 8610777 ---- Thread 31 ---- PC 5: Stalled ----- 5279479 in-flight CPI 1.6310 -- Total Cycles 8610777 Total CPI 0.0448 , IPC 22.3063 -- Total Cycles 8610777 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 430169 (2.043219%) FPSUB: 0 (0.000000%) FPMUL: 1966188 (9.339010%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14875280 (70.654682%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 549226 (2.608717%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 
(0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3224991 (15.318079%) DIV: 7194 (0.034170%) FPUN: 0 (0.000000%) FPRSUB: 447 (0.002123%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (210652477 total) ADD%: 8.194 (17261103) SUB%: 0.000 (0) MUL%: 0.000 (195) BITOR%: 1.227 (2584235) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (1152527) FPSUB%: 0.000 (0) FPMUL%: 4.772 (10052463) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (585) FPMAX%: 0.000 (585) LOAD%: 4.948 (10423386) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 
(227) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40439) FPINV%: 0.000 (0) FPCONV%: 0.000 (649) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2243608) FPLE%: 0.392 (825579) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (585) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (26976) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.959 (6234199) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1577236) CMPU%: 0.000 (0) RSUB%: 0.000 (195) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.760 (33199535) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2591574) ORI%: 1.265 (2665167) XORI%: 0.000 (0) MULI%: 3.359 (7075533) LW%: 1.190 (2507178) LWI%: 13.921 (29324652) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (633648) SWI%: 4.094 (8623793) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3114769) bged%: 0.000 (0) bgeid%: 0.000 (195) bgtd%: 0.000 (0) bgtid%: 0.323 (679432) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (84828) bned%: 0.000 (0) bneid%: 13.712 (28884566) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.737 (1552669) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (183516) DIV%: 0.000 (390) FPUN%: 1.183 (2491539) FPRSUB%: 3.714 (7824347) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.103 (6535934) FPGE%: 0.796 (1676523) SYNC%: 0.000 (0) NOP%: 8.819 (18577820) HALT%: 0.000 (0) PRINT%: 0.000 
(0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 168 SUB 0 MUL 30 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 521 FPSUB 0 FPMUL 5268 FPCMPLT 0 FPMIN 0 FPMAX 374 LOAD 2329463 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 102 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 4 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1902 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2169 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3327077 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 761 ORI 589716 XORI 0 MULI 626700 LW 0 LWI 9315703 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1659 DIV 12 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.3063 --Total thread-cycles: 275544864 --total thread-cycles issued: 192074657 (69.707217%) --iCache conflicts: 6488970 (2.354960%) --thread*cycles of FU dependence: 16201684 (5.879872%) --thread*cycles of data dependence: 21053495 (7.640678%) --iCache cycles*banks: 275544864 (76.449441% used) Issue breakdown: --thread*cycles of issue worked: 192074657 (69.707217%) --thread*cycles of issue failed: 64892387 (23.550570%) --thread*cycles of issue NOP/other: -4624590542928643684 (-1678343945807.911600%) Number of thread-cycles not ready: 21053495 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 210652477 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 8 5: 7 6: 7 7: 7 8: 7 9: 8 10: 7 11: 7 12: 7 13: 7 14: 8 15: 7 16: 7 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 8 27: 7 28: 6 29: 7 30: 7 31: 7 <=== 
Core 16 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5855237 in-flight CPI 1.5328 -- Total Cycles 8974795 ---- Thread 01 ---- PC 5: Stalled ----- 5878951 in-flight CPI 1.5266 -- Total Cycles 8974795 ---- Thread 02 ---- PC 5: Stalled ----- 6447013 in-flight CPI 1.3921 -- Total Cycles 8974795 ---- Thread 03 ---- PC 5: Stalled ----- 6381309 in-flight CPI 1.4064 -- Total Cycles 8974795 ---- Thread 04 ---- PC 5: Stalled ----- 5932488 in-flight CPI 1.5128 -- Total Cycles 8974795 ---- Thread 05 ---- PC 5: Stalled ----- 6102654 in-flight CPI 1.4706 -- Total Cycles 8974795 ---- Thread 06 ---- PC 5: Stalled ----- 6985455 in-flight CPI 1.2848 -- Total Cycles 8974795 ---- Thread 07 ---- PC 5: Stalled ----- 6255204 in-flight CPI 1.4348 -- Total Cycles 8974795 ---- Thread 08 ---- PC 5: Stalled ----- 6313271 in-flight CPI 1.4216 -- Total Cycles 8974795 ---- Thread 09 ---- PC 5: Stalled ----- 6162775 in-flight CPI 1.4563 -- Total Cycles 8974795 ---- Thread 10 ---- PC 5: Stalled ----- 6565256 in-flight CPI 1.3670 -- Total Cycles 8974795 ---- Thread 11 ---- PC 5: Stalled ----- 6366275 in-flight CPI 1.4097 -- Total Cycles 8974795 ---- Thread 12 ---- PC 5: Stalled ----- 6242267 in-flight CPI 1.4377 -- Total Cycles 8974795 ---- Thread 13 ---- PC 5: Stalled ----- 6167472 in-flight CPI 1.4552 -- Total Cycles 8974795 ---- Thread 14 ---- PC 5: Stalled ----- 6140317 in-flight CPI 1.4616 -- Total Cycles 8974795 ---- Thread 15 ---- PC 5: Stalled ----- 6027581 in-flight CPI 1.4890 -- Total Cycles 8974795 ---- Thread 16 ---- PC 5: Stalled ----- 5717871 in-flight CPI 1.5696 -- Total Cycles 8974795 ---- Thread 17 ---- PC 5: Stalled ----- 6375429 in-flight CPI 1.4077 -- Total Cycles 8974795 ---- Thread 18 ---- PC 5: Stalled ----- 6151529 in-flight CPI 1.4589 -- Total Cycles 8974795 ---- Thread 19 ---- PC 5: Stalled ----- 6014850 in-flight CPI 1.4921 -- Total Cycles 8974795 ---- Thread 20 ---- PC 5: Stalled ----- 5738657 in-flight CPI 1.5639 -- Total Cycles 8974795 ---- Thread 21 ---- PC 5: Stalled 
----- 6140247 in-flight CPI 1.4616 -- Total Cycles 8974795 ---- Thread 22 ---- PC 5: Stalled ----- 5854597 in-flight CPI 1.5329 -- Total Cycles 8974795 ---- Thread 23 ---- PC 5: Stalled ----- 6491798 in-flight CPI 1.3825 -- Total Cycles 8974795 ---- Thread 24 ---- PC 5: Stalled ----- 5740795 in-flight CPI 1.5633 -- Total Cycles 8974795 ---- Thread 25 ---- PC 5: Stalled ----- 6035344 in-flight CPI 1.4870 -- Total Cycles 8974795 ---- Thread 26 ---- PC 5: Stalled ----- 5816180 in-flight CPI 1.5431 -- Total Cycles 8974795 ---- Thread 27 ---- PC 5: Stalled ----- 6233003 in-flight CPI 1.4399 -- Total Cycles 8974795 ---- Thread 28 ---- PC 5: Stalled ----- 5641297 in-flight CPI 1.5909 -- Total Cycles 8974795 ---- Thread 29 ---- PC 5: Stalled ----- 5394752 in-flight CPI 1.6636 -- Total Cycles 8974795 ---- Thread 30 ---- PC 5: Stalled ----- 5642664 in-flight CPI 1.5905 -- Total Cycles 8974795 ---- Thread 31 ---- PC 5: Stalled ----- 5928901 in-flight CPI 1.5137 -- Total Cycles 8974795 Total CPI 0.0461 , IPC 21.6988 -- Total Cycles 8974795 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 437974 (2.045621%) FPSUB: 0 (0.000000%) FPMUL: 1995740 (9.321390%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15134526 (70.687979%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 558989 (2.610838%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) 
SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3275182 (15.297208%) DIV: 7451 (0.034801%) FPUN: 0 (0.000000%) FPRSUB: 463 (0.002163%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (213579308 total) ADD%: 8.189 (17490837) SUB%: 0.000 (0) MUL%: 0.000 (202) BITOR%: 1.229 (2624016) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (1173267) FPSUB%: 0.000 (0) FPMUL%: 4.776 (10199524) 
FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (606) FPMAX%: 0.000 (606) LOAD%: 4.946 (10564075) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41086) FPINV%: 0.000 (0) FPCONV%: 0.000 (670) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (2275735) FPLE%: 0.388 (828393) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (606) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27362) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.960 (6322374) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1600569) CMPU%: 0.000 (0) RSUB%: 0.000 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.758 (33655097) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2628048) ORI%: 1.271 (2713980) XORI%: 0.000 (0) MULI%: 3.358 (7172935) LW%: 1.190 (2542660) LWI%: 13.912 (29712447) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (642089) SWI%: 4.092 (8740041) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3159535) bged%: 0.000 (0) bgeid%: 0.000 (202) bgtd%: 0.000 (0) bgtid%: 0.323 (688829) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (86933) bned%: 0.000 (0) bneid%: 13.716 (29294462) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.734 (1566938) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (186363) DIV%: 0.000 (404) FPUN%: 1.185 (2530235) FPRSUB%: 3.714 (7932352) FPSQRT%: 0.000 (0) FPNEG%: 0.000 
(1) FPGT%: 3.102 (6625541) FPGE%: 0.802 (1712493) SYNC%: 0.000 (0) NOP%: 8.820 (18837263) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 188 SUB 0 MUL 28 BITOR 6 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 570 FPSUB 0 FPMUL 5228 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 2355826 INTCONV 0 ATOMIC_INC 9 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 125 FPINV 0 FPCONV 8 FPEQ 0 FPNE 0 FPLT 5 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1843 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2242 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3373150 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 786 ORI 599402 XORI 0 MULI 643388 LW 0 LWI 9441713 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1764 DIV 17 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6988 --Total thread-cycles: 287193440 --total thread-cycles issued: 194742045 (67.808669%) --iCache conflicts: 6622582 (2.305966%) --thread*cycles of FU dependence: 16426726 (5.719743%) --thread*cycles of data dependence: 21410325 (7.455019%) --iCache cycles*banks: 287193440 (74.367764% used) Issue breakdown: --thread*cycles of issue worked: 194742045 (67.808668%) --thread*cycles of issue failed: 73614132 (25.632247%) --thread*cycles of issue NOP/other: 18837263 (6.559085%) Number of thread-cycles not ready: 21410325 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 213579308 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 7 4: 7 5: 7 6: 8 7: 7 8: 8 9: 9 10: 8 11: 7 12: 7 13: 7 14: 8 15: 7 16: 7 
17: 7 18: 7 19: 7 20: 8 21: 9 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 17 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6153032 in-flight CPI 1.4808 -- Total Cycles 9111321 ---- Thread 01 ---- PC 5: Stalled ----- 6985813 in-flight CPI 1.3043 -- Total Cycles 9111321 ---- Thread 02 ---- PC 5: Stalled ----- 6480423 in-flight CPI 1.4060 -- Total Cycles 9111321 ---- Thread 03 ---- PC 5: Stalled ----- 7172048 in-flight CPI 1.2704 -- Total Cycles 9111321 ---- Thread 04 ---- PC 5: Stalled ----- 6110149 in-flight CPI 1.4912 -- Total Cycles 9111321 ---- Thread 05 ---- PC 5: Stalled ----- 6644132 in-flight CPI 1.3713 -- Total Cycles 9111321 ---- Thread 06 ---- PC 5: Stalled ----- 6123487 in-flight CPI 1.4879 -- Total Cycles 9111321 ---- Thread 07 ---- PC 5: Stalled ----- 6424766 in-flight CPI 1.4182 -- Total Cycles 9111321 ---- Thread 08 ---- PC 5: Stalled ----- 5979019 in-flight CPI 1.5239 -- Total Cycles 9111321 ---- Thread 09 ---- PC 5: Stalled ----- 5888468 in-flight CPI 1.5473 -- Total Cycles 9111321 ---- Thread 10 ---- PC 5: Stalled ----- 6295036 in-flight CPI 1.4474 -- Total Cycles 9111321 ---- Thread 11 ---- PC 5: Stalled ----- 5832316 in-flight CPI 1.5622 -- Total Cycles 9111321 ---- Thread 12 ---- PC 5: Stalled ----- 6590740 in-flight CPI 1.3824 -- Total Cycles 9111321 ---- Thread 13 ---- PC 5: Stalled ----- 6305900 in-flight CPI 1.4449 -- Total Cycles 9111321 ---- Thread 14 ---- PC 5: Stalled ----- 6499163 in-flight CPI 1.4019 -- Total Cycles 9111321 ---- Thread 15 ---- PC 5: Stalled ----- 6181632 in-flight CPI 1.4739 -- Total Cycles 9111321 ---- Thread 16 ---- PC 5: Stalled ----- 6290710 in-flight CPI 1.4484 -- Total Cycles 9111321 ---- Thread 17 ---- PC 5: Stalled ----- 5965003 in-flight CPI 1.5275 -- Total Cycles 9111321 ---- Thread 18 ---- PC 5: Stalled ----- 6022367 in-flight CPI 1.5129 -- Total Cycles 9111321 ---- Thread 19 ---- PC 5: Stalled ----- 6134490 in-flight CPI 1.4853 -- Total Cycles 9111321 ---- Thread 20 ---- PC 5: 
Stalled ----- 6033567 in-flight CPI 1.5101 -- Total Cycles 9111321 ---- Thread 21 ---- PC 5: Stalled ----- 5894818 in-flight CPI 1.5456 -- Total Cycles 9111321 ---- Thread 22 ---- PC 5: Stalled ----- 5795838 in-flight CPI 1.5720 -- Total Cycles 9111321 ---- Thread 23 ---- PC 5: Stalled ----- 6571641 in-flight CPI 1.3865 -- Total Cycles 9111321 ---- Thread 24 ---- PC 5: Stalled ----- 6240668 in-flight CPI 1.4600 -- Total Cycles 9111321 ---- Thread 25 ---- PC 5: Stalled ----- 6434940 in-flight CPI 1.4159 -- Total Cycles 9111321 ---- Thread 26 ---- PC 5: Stalled ----- 5570260 in-flight CPI 1.6357 -- Total Cycles 9111321 ---- Thread 27 ---- PC 5: Stalled ----- 6471346 in-flight CPI 1.4079 -- Total Cycles 9111321 ---- Thread 28 ---- PC 5: Stalled ----- 6216807 in-flight CPI 1.4656 -- Total Cycles 9111321 ---- Thread 29 ---- PC 5: Stalled ----- 6019440 in-flight CPI 1.5136 -- Total Cycles 9111321 ---- Thread 30 ---- PC 5: Stalled ----- 5905155 in-flight CPI 1.5429 -- Total Cycles 9111321 ---- Thread 31 ---- PC 5: Stalled ----- 5313917 in-flight CPI 1.7146 -- Total Cycles 9111321 Total CPI 0.0459 , IPC 21.7913 -- Total Cycles 9111321 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 447779 (2.054290%) FPSUB: 0 (0.000000%) FPMUL: 2033511 (9.329201%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15382104 (70.568954%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 567117 (2.601780%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) 
LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3358616 (15.408426%) DIV: 7672 (0.035197%) FPUN: 0 (0.000000%) FPRSUB: 469 (0.002152%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217741889 total) ADD%: 8.188 (17828000) SUB%: 0.000 (0) MUL%: 0.000 (208) BITOR%: 1.224 (2664916) BITAND%: 0.000 (0) 
BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (1194560) FPSUB%: 0.000 (0) FPMUL%: 4.775 (10397492) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (624) FPMAX%: 0.000 (624) LOAD%: 4.956 (10790812) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41654) FPINV%: 0.000 (0) FPCONV%: 0.000 (688) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2318512) FPLE%: 0.389 (847773) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (624) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27774) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.962 (6449529) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (1634586) CMPU%: 0.000 (0) RSUB%: 0.000 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.760 (34317063) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2681775) ORI%: 1.266 (2756630) XORI%: 0.000 (0) MULI%: 3.359 (7313874) LW%: 1.191 (2593742) LWI%: 13.924 (30317551) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (654906) SWI%: 4.098 (8922918) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3223177) bged%: 0.000 (0) bgeid%: 0.000 (208) bgtd%: 0.000 (0) bgtid%: 0.323 (702399) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (88945) bned%: 0.000 (0) bneid%: 13.704 (29839577) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.736 (1602476) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 
0.088 (191166) DIV%: 0.000 (416) FPUN%: 1.180 (2568383) FPRSUB%: 3.716 (8090702) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.101 (6751508) FPGE%: 0.795 (1731377) SYNC%: 0.000 (0) NOP%: 8.815 (19194174) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 198 SUB 0 MUL 25 BITOR 8 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 526 FPSUB 0 FPMUL 5446 FPCMPLT 0 FPMIN 0 FPMAX 407 LOAD 2360269 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 88 FPINV 0 FPCONV 7 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1958 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2378 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3440492 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 804 ORI 612060 XORI 0 MULI 649033 LW 0 LWI 9632527 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1813 DIV 21 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7913 --Total thread-cycles: 291562272 --total thread-cycles issued: 198547715 (68.097875%) --iCache conflicts: 6683170 (2.292193%) --thread*cycles of FU dependence: 16708109 (5.730546%) --thread*cycles of data dependence: 21797268 (7.476025%) --iCache cycles*banks: 291562272 (74.681103% used) Issue breakdown: --thread*cycles of issue worked: 198547715 (68.097876%) --thread*cycles of issue failed: 73820383 (25.318908%) --thread*cycles of issue NOP/other: 4596586734499979582 (1576536875971.380900%) Number of thread-cycles not ready: 21797268 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217741889 SIMD fetches 
beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 8 4: 8 5: 7 6: 7 7: 7 8: 7 9: 7 10: 7 11: 7 12: 9 13: 7 14: 8 15: 7 16: 8 17: 7 18: 9 19: 7 20: 8 21: 8 22: 7 23: 8 24: 7 25: 8 26: 7 27: 8 28: 7 29: 7 30: 8 31: 7 <=== Core 18 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6198902 in-flight CPI 1.4075 -- Total Cycles 8725088 ---- Thread 01 ---- PC 5: Stalled ----- 6714701 in-flight CPI 1.2994 -- Total Cycles 8725088 ---- Thread 02 ---- PC 5: Stalled ----- 6114691 in-flight CPI 1.4269 -- Total Cycles 8725088 ---- Thread 03 ---- PC 5: Stalled ----- 6042826 in-flight CPI 1.4439 -- Total Cycles 8725088 ---- Thread 04 ---- PC 5: Stalled ----- 6536247 in-flight CPI 1.3349 -- Total Cycles 8725088 ---- Thread 05 ---- PC 5: Stalled ----- 5907266 in-flight CPI 1.4770 -- Total Cycles 8725088 ---- Thread 06 ---- PC 5: Stalled ----- 6191642 in-flight CPI 1.4092 -- Total Cycles 8725088 ---- Thread 07 ---- PC 5: Stalled ----- 6608852 in-flight CPI 1.3202 -- Total Cycles 8725088 ---- Thread 08 ---- PC 5: Stalled ----- 6531935 in-flight CPI 1.3358 -- Total Cycles 8725088 ---- Thread 09 ---- PC 5: Stalled ----- 5831834 in-flight CPI 1.4961 -- Total Cycles 8725088 ---- Thread 10 ---- PC 5: Stalled ----- 6743135 in-flight CPI 1.2939 -- Total Cycles 8725088 ---- Thread 11 ---- PC 5: Stalled ----- 6004243 in-flight CPI 1.4531 -- Total Cycles 8725088 ---- Thread 12 ---- PC 5: Stalled ----- 6508704 in-flight CPI 1.3405 -- Total Cycles 8725088 ---- Thread 13 ---- PC 5: Stalled ----- 5973029 in-flight CPI 1.4607 -- Total Cycles 8725088 ---- Thread 14 ---- PC 5: Stalled ----- 6373300 in-flight CPI 1.3690 -- Total Cycles 8725088 ---- Thread 15 ---- PC 5: Stalled ----- 6617740 in-flight CPI 1.3184 -- Total Cycles 8725088 ---- Thread 16 ---- PC 5: Stalled ----- 5815653 in-flight CPI 1.5003 -- Total Cycles 8725088 ---- Thread 17 ---- PC 5: Stalled ----- 5699506 in-flight CPI 1.5308 -- Total Cycles 8725088 ---- Thread 18 ---- PC 5: Stalled ----- 6233084 in-flight CPI 1.3998 -- 
Total Cycles 8725088 ---- Thread 19 ---- PC 5: Stalled ----- 6079975 in-flight CPI 1.4350 -- Total Cycles 8725088 ---- Thread 20 ---- PC 5: Stalled ----- 6294699 in-flight CPI 1.3861 -- Total Cycles 8725088 ---- Thread 21 ---- PC 5: Stalled ----- 6159469 in-flight CPI 1.4165 -- Total Cycles 8725088 ---- Thread 22 ---- PC 5: Stalled ----- 6224751 in-flight CPI 1.4017 -- Total Cycles 8725088 ---- Thread 23 ---- PC 5: Stalled ----- 5514108 in-flight CPI 1.5823 -- Total Cycles 8725088 ---- Thread 24 ---- PC 5: Stalled ----- 5907314 in-flight CPI 1.4770 -- Total Cycles 8725088 ---- Thread 25 ---- PC 5: Stalled ----- 6405062 in-flight CPI 1.3622 -- Total Cycles 8725088 ---- Thread 26 ---- PC 5: Stalled ----- 6086709 in-flight CPI 1.4335 -- Total Cycles 8725088 ---- Thread 27 ---- PC 5: Stalled ----- 6142764 in-flight CPI 1.4204 -- Total Cycles 8725088 ---- Thread 28 ---- PC 5: Stalled ----- 5447748 in-flight CPI 1.6016 -- Total Cycles 8725088 ---- Thread 29 ---- PC 5: Stalled ----- 5934786 in-flight CPI 1.4702 -- Total Cycles 8725088 ---- Thread 30 ---- PC 5: Stalled ----- 5714466 in-flight CPI 1.5268 -- Total Cycles 8725088 ---- Thread 31 ---- PC 5: Stalled ----- 6064045 in-flight CPI 1.4388 -- Total Cycles 8725088 Total CPI 0.0444 , IPC 22.5354 -- Total Cycles 8725088 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 433227 (2.040049%) FPSUB: 0 (0.000000%) FPMUL: 1993632 (9.387934%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14979695 (70.538790%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 561823 (2.645602%) FPINV: 0 (0.000000%) FPCONV: 0 
(0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3259775 (15.350151%) DIV: 7494 (0.035289%) FPUN: 0 (0.000000%) FPRSUB: 464 (0.002185%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) 
Dynamic Instruction Mix: (215632951 total) ADD%: 8.200 (17682839) SUB%: 0.000 (0) MUL%: 0.000 (203) BITOR%: 1.223 (2636789) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (1162699) FPSUB%: 0.000 (0) FPMUL%: 4.748 (10237431) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (609) FPMAX%: 0.000 (609) LOAD%: 4.949 (10670873) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41389) FPINV%: 0.000 (0) FPCONV%: 0.000 (673) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (2287973) FPLE%: 0.389 (838230) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (609) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27782) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.969 (6401088) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1611971) CMPU%: 0.000 (0) RSUB%: 0.000 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (33998225) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2658649) ORI%: 1.258 (2711779) XORI%: 0.000 (0) MULI%: 3.365 (7256921) LW%: 1.194 (2574308) LWI%: 13.935 (30048258) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (650889) SWI%: 4.103 (8848440) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3197805) bged%: 0.000 (0) bgeid%: 0.000 (203) bgtd%: 0.000 (0) bgtid%: 0.323 (696909) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85278) bned%: 0.000 (0) bneid%: 13.710 (29563617) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 
(0) brid%: 0.738 (1592136) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (185377) DIV%: 0.000 (406) FPUN%: 1.180 (2543940) FPRSUB%: 3.708 (7995639) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.105 (6696157) FPGE%: 0.796 (1716556) SYNC%: 0.000 (0) NOP%: 8.816 (19009156) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 170 SUB 0 MUL 25 BITOR 9 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 543 FPSUB 0 FPMUL 5491 FPCMPLT 0 FPMIN 0 FPMAX 389 LOAD 2349353 INTCONV 0 ATOMIC_INC 5 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 112 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 11 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1873 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2355 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3409922 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 789 ORI 591176 XORI 0 MULI 651792 LW 0 LWI 9545602 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1751 DIV 12 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.5354 --Total thread-cycles: 279202816 --total thread-cycles issued: 196623795 (70.423284%) --iCache conflicts: 6644261 (2.379726%) --thread*cycles of FU dependence: 16561417 (5.931680%) --thread*cycles of data dependence: 21236110 (7.605980%) --iCache cycles*banks: 279202816 (77.231665% used) Issue breakdown: --thread*cycles of issue worked: 196623795 (70.423285%) --thread*cycles of issue failed: 63569865 (22.768347%) --thread*cycles of issue NOP/other: -4652578505112809852 (-1666379505682.639400%) 
Number of thread-cycles not ready: 21236110 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215632951 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 7 4: 8 5: 7 6: 7 7: 8 8: 8 9: 7 10: 8 11: 7 12: 7 13: 7 14: 9 15: 7 16: 7 17: 7 18: 8 19: 7 20: 7 21: 10 22: 7 23: 6 24: 7 25: 7 26: 7 27: 8 28: 7 29: 7 30: 6 31: 8 <=== Core 19 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5947269 in-flight CPI 1.5320 -- Total Cycles 9111355 ---- Thread 01 ---- PC 5: Stalled ----- 6381411 in-flight CPI 1.4278 -- Total Cycles 9111355 ---- Thread 02 ---- PC 5: Stalled ----- 6045821 in-flight CPI 1.5070 -- Total Cycles 9111355 ---- Thread 03 ---- PC 5: Stalled ----- 5843438 in-flight CPI 1.5592 -- Total Cycles 9111355 ---- Thread 04 ---- PC 5: Stalled ----- 5822053 in-flight CPI 1.5650 -- Total Cycles 9111355 ---- Thread 05 ---- PC 5: Stalled ----- 7010031 in-flight CPI 1.2998 -- Total Cycles 9111355 ---- Thread 06 ---- PC 5: Stalled ----- 6183442 in-flight CPI 1.4735 -- Total Cycles 9111355 ---- Thread 07 ---- PC 5: Stalled ----- 6550263 in-flight CPI 1.3910 -- Total Cycles 9111355 ---- Thread 08 ---- PC 5: Stalled ----- 6471579 in-flight CPI 1.4079 -- Total Cycles 9111355 ---- Thread 09 ---- PC 5: Stalled ----- 6016715 in-flight CPI 1.5143 -- Total Cycles 9111355 ---- Thread 10 ---- PC 5: Stalled ----- 6879344 in-flight CPI 1.3244 -- Total Cycles 9111355 ---- Thread 11 ---- PC 5: Stalled ----- 6224061 in-flight CPI 1.4639 -- Total Cycles 9111355 ---- Thread 12 ---- PC 5: Stalled ----- 6995194 in-flight CPI 1.3025 -- Total Cycles 9111355 ---- Thread 13 ---- PC 5: Stalled ----- 6118207 in-flight CPI 1.4892 -- Total Cycles 9111355 ---- Thread 14 ---- PC 5: Stalled ----- 5891946 in-flight CPI 1.5464 -- Total Cycles 9111355 ---- Thread 15 ---- PC 5: Stalled ----- 6124434 in-flight CPI 1.4877 -- Total Cycles 9111355 ---- Thread 16 ---- PC 5: Stalled ----- 6398134 in-flight CPI 1.4241 -- Total Cycles 9111355 ---- Thread 17 ---- 
PC 5: Stalled ----- 6849270 in-flight CPI 1.3303 -- Total Cycles 9111355 ---- Thread 18 ---- PC 5: Stalled ----- 6516640 in-flight CPI 1.3982 -- Total Cycles 9111355 ---- Thread 19 ---- PC 5: Stalled ----- 6086890 in-flight CPI 1.4969 -- Total Cycles 9111355 ---- Thread 20 ---- PC 5: Stalled ----- 5832552 in-flight CPI 1.5622 -- Total Cycles 9111355 ---- Thread 21 ---- PC 5: Stalled ----- 5815312 in-flight CPI 1.5668 -- Total Cycles 9111355 ---- Thread 22 ---- PC 5: Stalled ----- 5532292 in-flight CPI 1.6469 -- Total Cycles 9111355 ---- Thread 23 ---- PC 5: Stalled ----- 6480415 in-flight CPI 1.4060 -- Total Cycles 9111355 ---- Thread 24 ---- PC 5: Stalled ----- 5500441 in-flight CPI 1.6565 -- Total Cycles 9111355 ---- Thread 25 ---- PC 5: Stalled ----- 6025807 in-flight CPI 1.5121 -- Total Cycles 9111355 ---- Thread 26 ---- PC 5: Stalled ----- 6008926 in-flight CPI 1.5163 -- Total Cycles 9111355 ---- Thread 27 ---- PC 5: Stalled ----- 6515683 in-flight CPI 1.3984 -- Total Cycles 9111355 ---- Thread 28 ---- PC 5: Stalled ----- 6002046 in-flight CPI 1.5180 -- Total Cycles 9111355 ---- Thread 29 ---- PC 5: Stalled ----- 5757924 in-flight CPI 1.5824 -- Total Cycles 9111355 ---- Thread 30 ---- PC 5: Stalled ----- 5930467 in-flight CPI 1.5364 -- Total Cycles 9111355 ---- Thread 31 ---- PC 5: Stalled ----- 5984345 in-flight CPI 1.5225 -- Total Cycles 9111355 Total CPI 0.0461 , IPC 21.7029 -- Total Cycles 9111355 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 444904 (2.053041%) FPSUB: 0 (0.000000%) FPMUL: 2024576 (9.342551%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15294749 (70.578714%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) 
GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 564978 (2.607131%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3333245 (15.381498%) DIV: 7568 (0.034923%) FPUN: 0 (0.000000%) FPRSUB: 464 (0.002141%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) 
FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216861197 total) ADD%: 8.182 (17742523) SUB%: 0.000 (0) MUL%: 0.000 (205) BITOR%: 1.227 (2660645) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (1189628) FPSUB%: 0.000 (0) FPMUL%: 4.775 (10356119) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.954 (10742914) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41602) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (2308237) FPLE%: 0.390 (845433) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27862) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.962 (6422753) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1626407) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.764 (34185534) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2670384) ORI%: 1.266 (2746038) XORI%: 0.000 (0) MULI%: 3.359 (7283725) LW%: 1.191 (2583030) LWI%: 13.917 (30180377) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (653030) SWI%: 4.096 (8882057) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3208726) bged%: 0.000 (0) bgeid%: 0.000 (205) bgtd%: 0.000 (0) bgtid%: 0.323 (700458) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 
0.000 (0) bltid%: 0.040 (87770) bned%: 0.000 (0) bneid%: 13.709 (29729518) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.737 (1598308) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (189675) DIV%: 0.000 (410) FPUN%: 1.183 (2564749) FPRSUB%: 3.715 (8057069) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.101 (6724340) FPGE%: 0.798 (1730172) SYNC%: 0.000 (0) NOP%: 8.816 (19118230) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 157 SUB 0 MUL 19 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 532 FPSUB 0 FPMUL 5277 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 2382705 INTCONV 0 ATOMIC_INC 3 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 111 FPINV 0 FPCONV 11 FPEQ 0 FPNE 0 FPLT 7 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2033 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2252 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3423834 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 807 ORI 608116 XORI 0 MULI 640146 LW 0 LWI 9595388 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1764 DIV 18 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7029 --Total thread-cycles: 291563360 --total thread-cycles issued: 197742967 (67.821608%) --iCache conflicts: 6631413 (2.274433%) --thread*cycles of FU dependence: 16663610 (5.715262%) --thread*cycles of data dependence: 21670484 (7.432513%) --iCache cycles*banks: 291563360 (74.378766% used) Issue breakdown: --thread*cycles of issue worked: 197742967 (67.821611%) 
--thread*cycles of issue failed: 74702163 (25.621245%) --thread*cycles of issue NOP/other: 39970068 (13.708879%) Number of thread-cycles not ready: 21670484 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216861197 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 7 4: 7 5: 8 6: 7 7: 8 8: 8 9: 7 10: 10 11: 7 12: 7 13: 7 14: 9 15: 8 16: 7 17: 8 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 8 <=== Core 20 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5861299 in-flight CPI 1.5439 -- Total Cycles 9049122 ---- Thread 01 ---- PC 5: Stalled ----- 6651204 in-flight CPI 1.3605 -- Total Cycles 9049122 ---- Thread 02 ---- PC 5: Stalled ----- 6013289 in-flight CPI 1.5048 -- Total Cycles 9049122 ---- Thread 03 ---- PC 5: Stalled ----- 6175526 in-flight CPI 1.4653 -- Total Cycles 9049122 ---- Thread 04 ---- PC 5: Stalled ----- 6392707 in-flight CPI 1.4155 -- Total Cycles 9049122 ---- Thread 05 ---- PC 5: Stalled ----- 5968998 in-flight CPI 1.5160 -- Total Cycles 9049122 ---- Thread 06 ---- PC 5: Stalled ----- 6523779 in-flight CPI 1.3871 -- Total Cycles 9049122 ---- Thread 07 ---- PC 5: Stalled ----- 6503945 in-flight CPI 1.3913 -- Total Cycles 9049122 ---- Thread 08 ---- PC 5: Stalled ----- 6244105 in-flight CPI 1.4492 -- Total Cycles 9049122 ---- Thread 09 ---- PC 5: Stalled ----- 6501718 in-flight CPI 1.3918 -- Total Cycles 9049122 ---- Thread 10 ---- PC 5: Stalled ----- 5946828 in-flight CPI 1.5217 -- Total Cycles 9049122 ---- Thread 11 ---- PC 5: Stalled ----- 6556719 in-flight CPI 1.3801 -- Total Cycles 9049122 ---- Thread 12 ---- PC 5: Stalled ----- 6211227 in-flight CPI 1.4569 -- Total Cycles 9049122 ---- Thread 13 ---- PC 5: Stalled ----- 5990918 in-flight CPI 1.5105 -- Total Cycles 9049122 ---- Thread 14 ---- PC 5: Stalled ----- 6386644 in-flight CPI 1.4169 -- Total Cycles 9049122 ---- Thread 15 ---- PC 5: Stalled ----- 6082955 in-flight CPI 1.4876 -- Total Cycles 9049122 
---- Thread 16 ---- PC 5: Stalled ----- 6366287 in-flight CPI 1.4214 -- Total Cycles 9049122 ---- Thread 17 ---- PC 5: Stalled ----- 6161489 in-flight CPI 1.4687 -- Total Cycles 9049122 ---- Thread 18 ---- PC 5: Stalled ----- 5611663 in-flight CPI 1.6126 -- Total Cycles 9049122 ---- Thread 19 ---- PC 5: Stalled ----- 5936639 in-flight CPI 1.5243 -- Total Cycles 9049122 ---- Thread 20 ---- PC 5: Stalled ----- 6179898 in-flight CPI 1.4643 -- Total Cycles 9049122 ---- Thread 21 ---- PC 5: Stalled ----- 5725247 in-flight CPI 1.5806 -- Total Cycles 9049122 ---- Thread 22 ---- PC 5: Stalled ----- 6128753 in-flight CPI 1.4765 -- Total Cycles 9049122 ---- Thread 23 ---- PC 5: Stalled ----- 6737630 in-flight CPI 1.3431 -- Total Cycles 9049122 ---- Thread 24 ---- PC 5: Stalled ----- 5668129 in-flight CPI 1.5965 -- Total Cycles 9049122 ---- Thread 25 ---- PC 5: Stalled ----- 5690354 in-flight CPI 1.5903 -- Total Cycles 9049122 ---- Thread 26 ---- PC 5: Stalled ----- 6224937 in-flight CPI 1.4537 -- Total Cycles 9049122 ---- Thread 27 ---- PC 5: Stalled ----- 6173850 in-flight CPI 1.4657 -- Total Cycles 9049122 ---- Thread 28 ---- PC 5: Stalled ----- 6303493 in-flight CPI 1.4356 -- Total Cycles 9049122 ---- Thread 29 ---- PC 5: Stalled ----- 5546144 in-flight CPI 1.6316 -- Total Cycles 9049122 ---- Thread 30 ---- PC 5: Stalled ----- 5547045 in-flight CPI 1.6313 -- Total Cycles 9049122 ---- Thread 31 ---- PC 5: Stalled ----- 5236144 in-flight CPI 1.7282 -- Total Cycles 9049122 Total CPI 0.0463 , IPC 21.5767 -- Total Cycles 9049122 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 442964 (2.095895%) FPSUB: 0 (0.000000%) FPMUL: 2007777 (9.499848%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14825112 
(70.145395%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 550668 (2.605500%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3300512 (15.616456%) 
DIV: 7342 (0.034739%) FPUN: 0 (0.000000%) FPRSUB: 458 (0.002167%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (214111056 total) ADD%: 8.196 (17548803) SUB%: 0.000 (0) MUL%: 0.000 (199) BITOR%: 1.223 (2618062) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.551 (1178833) FPSUB%: 0.000 (0) FPMUL%: 4.786 (10248097) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (597) FPMAX%: 0.000 (597) LOAD%: 4.957 (10614377) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40432) FPINV%: 0.000 (0) FPCONV%: 0.000 (661) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (2281418) FPLE%: 0.385 (824873) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (597) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (26850) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.963 (6345124) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (1611025) CMPU%: 0.000 (0) RSUB%: 0.000 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.758 (33739407) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2640479) ORI%: 1.270 (2718501) XORI%: 0.000 (0) MULI%: 3.359 (7192295) LW%: 1.192 (2551516) LWI%: 13.914 (29791440) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (644033) SWI%: 4.094 (8766635) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3171184) bged%: 
0.000 (0) bgeid%: 0.000 (199) bgtd%: 0.000 (0) bgtid%: 0.323 (690782) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86152) bned%: 0.000 (0) bneid%: 13.701 (29335369) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.733 (1569378) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.088 (187968) DIV%: 0.000 (398) FPUN%: 1.177 (2520859) FPRSUB%: 3.719 (7961858) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.098 (6634209) FPGE%: 0.797 (1706426) SYNC%: 0.000 (0) NOP%: 8.809 (18860896) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 176 SUB 0 MUL 36 BITOR 2 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 475 FPSUB 0 FPMUL 5331 FPCMPLT 0 FPMIN 0 FPMAX 387 LOAD 2350763 INTCONV 0 ATOMIC_INC 5 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 89 FPINV 0 FPCONV 14 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2294 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 2 ADDKC 0 BITXOR 0 ANDN 0 CMP 2232 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3377897 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 800 ORI 605776 XORI 0 MULI 641714 LW 0 LWI 9462560 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1725 DIV 16 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5767 --Total thread-cycles: 289571904 --total thread-cycles issued: 195250160 (67.427177%) --iCache conflicts: 6598786 (2.278807%) --thread*cycles of FU dependence: 16452314 (5.681599%) --thread*cycles of data dependence: 21134833 (7.298648%) --iCache 
cycles*banks: 289571904 (73.940560% used) Issue breakdown: --thread*cycles of issue worked: 195250160 (67.427177%) --thread*cycles of issue failed: 75460848 (26.059451%) --thread*cycles of issue NOP/other: 4620979731211209568 (1595796991137.375700%) Number of thread-cycles not ready: 21134833 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214111056 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 7 5: 7 6: 7 7: 8 8: 7 9: 7 10: 8 11: 7 12: 7 13: 7 14: 7 15: 7 16: 8 17: 7 18: 8 19: 8 20: 8 21: 7 22: 8 23: 9 24: 7 25: 7 26: 7 27: 8 28: 7 29: 6 30: 6 31: 6 <=== Core 21 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5843246 in-flight CPI 1.5489 -- Total Cycles 9050770 ---- Thread 01 ---- PC 5: Stalled ----- 6094836 in-flight CPI 1.4850 -- Total Cycles 9050770 ---- Thread 02 ---- PC 5: Stalled ----- 6068278 in-flight CPI 1.4915 -- Total Cycles 9050770 ---- Thread 03 ---- PC 5: Stalled ----- 5946047 in-flight CPI 1.5221 -- Total Cycles 9050770 ---- Thread 04 ---- PC 5: Stalled ----- 6033224 in-flight CPI 1.5002 -- Total Cycles 9050770 ---- Thread 05 ---- PC 5: Stalled ----- 6115477 in-flight CPI 1.4800 -- Total Cycles 9050770 ---- Thread 06 ---- PC 5: Stalled ----- 6915715 in-flight CPI 1.3087 -- Total Cycles 9050770 ---- Thread 07 ---- PC 5: Stalled ----- 6567639 in-flight CPI 1.3781 -- Total Cycles 9050770 ---- Thread 08 ---- PC 5: Stalled ----- 5966241 in-flight CPI 1.5170 -- Total Cycles 9050770 ---- Thread 09 ---- PC 5: Stalled ----- 5880641 in-flight CPI 1.5391 -- Total Cycles 9050770 ---- Thread 10 ---- PC 5: Stalled ----- 5837330 in-flight CPI 1.5505 -- Total Cycles 9050770 ---- Thread 11 ---- PC 5: Stalled ----- 6738473 in-flight CPI 1.3431 -- Total Cycles 9050770 ---- Thread 12 ---- PC 5: Stalled ----- 6260505 in-flight CPI 1.4457 -- Total Cycles 9050770 ---- Thread 13 ---- PC 5: Stalled ----- 5936351 in-flight CPI 1.5246 -- Total Cycles 9050770 ---- Thread 14 ---- PC 5: Stalled ----- 7001301 
in-flight CPI 1.2927 -- Total Cycles 9050770 ---- Thread 15 ---- PC 5: Stalled ----- 5891587 in-flight CPI 1.5362 -- Total Cycles 9050770 ---- Thread 16 ---- PC 5: Stalled ----- 6312246 in-flight CPI 1.4338 -- Total Cycles 9050770 ---- Thread 17 ---- PC 5: Stalled ----- 6244418 in-flight CPI 1.4494 -- Total Cycles 9050770 ---- Thread 18 ---- PC 5: Stalled ----- 6110391 in-flight CPI 1.4812 -- Total Cycles 9050770 ---- Thread 19 ---- PC 5: Stalled ----- 6302482 in-flight CPI 1.4361 -- Total Cycles 9050770 ---- Thread 20 ---- PC 5: Stalled ----- 6455088 in-flight CPI 1.4021 -- Total Cycles 9050770 ---- Thread 21 ---- PC 5: Stalled ----- 5750143 in-flight CPI 1.5740 -- Total Cycles 9050770 ---- Thread 22 ---- PC 5: Stalled ----- 5829591 in-flight CPI 1.5526 -- Total Cycles 9050770 ---- Thread 23 ---- PC 5: Stalled ----- 6088489 in-flight CPI 1.4865 -- Total Cycles 9050770 ---- Thread 24 ---- PC 5: Stalled ----- 5813515 in-flight CPI 1.5568 -- Total Cycles 9050770 ---- Thread 25 ---- PC 5: Stalled ----- 6369864 in-flight CPI 1.4209 -- Total Cycles 9050770 ---- Thread 26 ---- PC 5: Stalled ----- 6136428 in-flight CPI 1.4749 -- Total Cycles 9050770 ---- Thread 27 ---- PC 5: Stalled ----- 6484394 in-flight CPI 1.3958 -- Total Cycles 9050770 ---- Thread 28 ---- PC 5: Stalled ----- 6436074 in-flight CPI 1.4063 -- Total Cycles 9050770 ---- Thread 29 ---- PC 5: Stalled ----- 5754740 in-flight CPI 1.5727 -- Total Cycles 9050770 ---- Thread 30 ---- PC 5: Stalled ----- 5447324 in-flight CPI 1.6615 -- Total Cycles 9050770 ---- Thread 31 ---- PC 5: Stalled ----- 6107594 in-flight CPI 1.4819 -- Total Cycles 9050770 Total CPI 0.0460 , IPC 21.7374 -- Total Cycles 9050770 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 452624 
(2.121152%) FPSUB: 0 (0.000000%) FPMUL: 2034581 (9.534747%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14886428 (69.762928%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 565574 (2.650475%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 
(0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3391253 (15.892579%) DIV: 7663 (0.035911%) FPUN: 0 (0.000000%) FPRSUB: 471 (0.002207%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215767185 total) ADD%: 8.167 (17621395) SUB%: 0.000 (0) MUL%: 0.000 (208) BITOR%: 1.229 (2652674) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.560 (1207871) FPSUB%: 0.000 (0) FPMUL%: 4.805 (10367960) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (624) FPMAX%: 0.000 (624) LOAD%: 4.961 (10704068) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41626) FPINV%: 0.000 (0) FPCONV%: 0.000 (688) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (2303766) FPLE%: 0.389 (839421) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (624) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27946) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.954 (6374697) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (1622931) CMPU%: 0.000 (0) RSUB%: 0.000 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.758 (34000082) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2653601) ORI%: 1.276 (2754009) XORI%: 0.000 (0) MULI%: 3.351 (7230606) LW%: 1.188 (2563850) LWI%: 13.891 (29971566) lbu%: 0.000 (0) 
lbui%: 0.000 (0) SW%: 0.300 (648067) SWI%: 4.088 (8821152) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.476 (3184934) bged%: 0.000 (0) bgeid%: 0.000 (208) bgtd%: 0.000 (0) bgtid%: 0.322 (695648) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (88833) bned%: 0.000 (0) bneid%: 13.709 (29580313) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.736 (1587890) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.089 (193081) DIV%: 0.000 (416) FPUN%: 1.184 (2554387) FPRSUB%: 3.722 (8031818) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.099 (6686347) FPGE%: 0.800 (1725819) SYNC%: 0.000 (0) NOP%: 8.818 (19026889) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 176 SUB 0 MUL 28 BITOR 3 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 553 FPSUB 0 FPMUL 5097 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 2352154 INTCONV 0 ATOMIC_INC 10 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 96 FPINV 0 FPCONV 14 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1959 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2302 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3400850 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 856 ORI 620483 XORI 0 MULI 639109 LW 0 LWI 9535035 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1895 DIV 18 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7374 --Total thread-cycles: 289624640 --total thread-cycles issued: 196740296 (67.929403%) --iCache conflicts: 6611156 
(2.282664%) --thread*cycles of FU dependence: 16561074 (5.718116%) --thread*cycles of data dependence: 21338594 (7.367672%) --iCache cycles*banks: 289624640 (74.498916% used) Issue breakdown: --thread*cycles of issue worked: 196740296 (67.929405%) --thread*cycles of issue failed: 73857455 (25.501095%) --thread*cycles of issue NOP/other: 19066503 (6.583177%) Number of thread-cycles not ready: 21338594 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215767185 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 8 4: 7 5: 7 6: 8 7: 8 8: 7 9: 7 10: 8 11: 8 12: 8 13: 8 14: 9 15: 7 16: 8 17: 7 18: 8 19: 7 20: 7 21: 7 22: 7 23: 8 24: 8 25: 7 26: 7 27: 8 28: 8 29: 8 30: 7 31: 7 <=== Core 22 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6154787 in-flight CPI 1.4864 -- Total Cycles 9148337 ---- Thread 01 ---- PC 5: Stalled ----- 6027963 in-flight CPI 1.5176 -- Total Cycles 9148337 ---- Thread 02 ---- PC 5: Stalled ----- 6493393 in-flight CPI 1.4089 -- Total Cycles 9148337 ---- Thread 03 ---- PC 5: Stalled ----- 6057382 in-flight CPI 1.5103 -- Total Cycles 9148337 ---- Thread 04 ---- PC 5: Stalled ----- 5984642 in-flight CPI 1.5286 -- Total Cycles 9148337 ---- Thread 05 ---- PC 5: Stalled ----- 6723825 in-flight CPI 1.3606 -- Total Cycles 9148337 ---- Thread 06 ---- PC 5: Stalled ----- 6703874 in-flight CPI 1.3646 -- Total Cycles 9148337 ---- Thread 07 ---- PC 5: Stalled ----- 6240182 in-flight CPI 1.4660 -- Total Cycles 9148337 ---- Thread 08 ---- PC 5: Stalled ----- 6429874 in-flight CPI 1.4228 -- Total Cycles 9148337 ---- Thread 09 ---- PC 5: Stalled ----- 6675787 in-flight CPI 1.3704 -- Total Cycles 9148337 ---- Thread 10 ---- PC 5: Stalled ----- 6257389 in-flight CPI 1.4620 -- Total Cycles 9148337 ---- Thread 11 ---- PC 5: Stalled ----- 5937257 in-flight CPI 1.5408 -- Total Cycles 9148337 ---- Thread 12 ---- PC 5: Stalled ----- 5835930 in-flight CPI 1.5676 -- Total Cycles 9148337 ---- Thread 13 ---- PC 5: Stalled 
----- 7074144 in-flight CPI 1.2932 -- Total Cycles 9148337 ---- Thread 14 ---- PC 5: Stalled ----- 6500600 in-flight CPI 1.4073 -- Total Cycles 9148337 ---- Thread 15 ---- PC 5: Stalled ----- 5990432 in-flight CPI 1.5272 -- Total Cycles 9148337 ---- Thread 16 ---- PC 5: Stalled ----- 5727154 in-flight CPI 1.5974 -- Total Cycles 9148337 ---- Thread 17 ---- PC 5: Stalled ----- 5903167 in-flight CPI 1.5497 -- Total Cycles 9148337 ---- Thread 18 ---- PC 5: Stalled ----- 6757008 in-flight CPI 1.3539 -- Total Cycles 9148337 ---- Thread 19 ---- PC 5: Stalled ----- 5952515 in-flight CPI 1.5369 -- Total Cycles 9148337 ---- Thread 20 ---- PC 5: Stalled ----- 6659606 in-flight CPI 1.3737 -- Total Cycles 9148337 ---- Thread 21 ---- PC 5: Stalled ----- 5698148 in-flight CPI 1.6055 -- Total Cycles 9148337 ---- Thread 22 ---- PC 5: Stalled ----- 5494903 in-flight CPI 1.6649 -- Total Cycles 9148337 ---- Thread 23 ---- PC 5: Stalled ----- 6269458 in-flight CPI 1.4592 -- Total Cycles 9148337 ---- Thread 24 ---- PC 5: Stalled ----- 5805226 in-flight CPI 1.5759 -- Total Cycles 9148337 ---- Thread 25 ---- PC 5: Stalled ----- 6330880 in-flight CPI 1.4450 -- Total Cycles 9148337 ---- Thread 26 ---- PC 5: Stalled ----- 6272022 in-flight CPI 1.4586 -- Total Cycles 9148337 ---- Thread 27 ---- PC 5: Stalled ----- 6503877 in-flight CPI 1.4066 -- Total Cycles 9148337 ---- Thread 28 ---- PC 5: Stalled ----- 6068117 in-flight CPI 1.5076 -- Total Cycles 9148337 ---- Thread 29 ---- PC 5: Stalled ----- 5714745 in-flight CPI 1.6008 -- Total Cycles 9148337 ---- Thread 30 ---- PC 5: Stalled ----- 6158762 in-flight CPI 1.4854 -- Total Cycles 9148337 ---- Thread 31 ---- PC 5: Stalled ----- 5989685 in-flight CPI 1.5273 -- Total Cycles 9148337 Total CPI 0.0461 , IPC 21.6863 -- Total Cycles 9148337 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) 
BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 446155 (1.999723%) FPSUB: 0 (0.000000%) FPMUL: 2033942 (9.116386%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15913924 (71.328228%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 569327 (2.551796%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 
(0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3339493 (14.968032%) DIV: 7526 (0.033732%) FPUN: 0 (0.000000%) FPRSUB: 469 (0.002102%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217585752 total) ADD%: 8.197 (17834496) SUB%: 0.000 (0) MUL%: 0.000 (204) BITOR%: 1.221 (2656175) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (1195463) FPSUB%: 0.000 (0) FPMUL%: 4.779 (10398708) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (612) FPMAX%: 0.000 (612) LOAD%: 4.951 (10772140) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41968) FPINV%: 0.000 (0) FPCONV%: 0.000 (676) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2317282) FPLE%: 0.390 (848239) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (612) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28120) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.960 (6440757) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1629257) CMPU%: 0.000 (0) RSUB%: 0.000 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.759 (34290246) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2676999) ORI%: 1.263 (2747226) 
XORI%: 0.000 (0) MULI%: 3.358 (7306409) LW%: 1.191 (2590366) LWI%: 13.921 (30290733) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (655004) SWI%: 4.097 (8914213) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3217531) bged%: 0.000 (0) bgeid%: 0.000 (204) bgtd%: 0.000 (0) bgtid%: 0.323 (702541) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (88156) bned%: 0.000 (0) bneid%: 13.706 (29822263) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.738 (1606243) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (189975) DIV%: 0.000 (408) FPUN%: 1.177 (2560899) FPRSUB%: 3.717 (8086909) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.106 (6757503) FPGE%: 0.792 (1723660) SYNC%: 0.000 (0) NOP%: 8.821 (19192406) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 183 SUB 0 MUL 20 BITOR 11 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 567 FPSUB 0 FPMUL 5487 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 2359027 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 102 FPINV 0 FPCONV 12 FPEQ 0 FPNE 0 FPLT 16 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1808 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2387 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3440109 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 800 ORI 610652 XORI 0 MULI 644664 LW 0 LWI 9630465 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1787 DIV 26 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6863 --Total 
thread-cycles: 292746784 --total thread-cycles issued: 198393346 (67.769607%) --iCache conflicts: 6599527 (2.254347%) --thread*cycles of FU dependence: 16698549 (5.704093%) --thread*cycles of data dependence: 22310836 (7.621206%) --iCache cycles*banks: 292746784 (74.325593% used) Issue breakdown: --thread*cycles of issue worked: 198393346 (67.769607%) --thread*cycles of issue failed: 75161032 (25.674418%) --thread*cycles of issue NOP/other: 19192406 (6.555975%) Number of thread-cycles not ready: 22310836 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217585752 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 7 4: 7 5: 8 6: 8 7: 7 8: 8 9: 8 10: 8 11: 7 12: 8 13: 8 14: 8 15: 7 16: 7 17: 7 18: 7 19: 7 20: 8 21: 7 22: 6 23: 7 24: 7 25: 8 26: 9 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 23 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6517143 in-flight CPI 1.4163 -- Total Cycles 9230501 ---- Thread 01 ---- PC 5: Stalled ----- 5953761 in-flight CPI 1.5504 -- Total Cycles 9230501 ---- Thread 02 ---- PC 5: Stalled ----- 5981403 in-flight CPI 1.5432 -- Total Cycles 9230501 ---- Thread 03 ---- PC 5: Stalled ----- 5951936 in-flight CPI 1.5508 -- Total Cycles 9230501 ---- Thread 04 ---- PC 5: Stalled ----- 5840694 in-flight CPI 1.5804 -- Total Cycles 9230501 ---- Thread 05 ---- PC 5: Stalled ----- 6860851 in-flight CPI 1.3454 -- Total Cycles 9230501 ---- Thread 06 ---- PC 5: Stalled ----- 6141016 in-flight CPI 1.5031 -- Total Cycles 9230501 ---- Thread 07 ---- PC 5: Stalled ----- 6599621 in-flight CPI 1.3986 -- Total Cycles 9230501 ---- Thread 08 ---- PC 5: Stalled ----- 6215623 in-flight CPI 1.4850 -- Total Cycles 9230501 ---- Thread 09 ---- PC 5: Stalled ----- 5983008 in-flight CPI 1.5428 -- Total Cycles 9230501 ---- Thread 10 ---- PC 5: Stalled ----- 7207262 in-flight CPI 1.2807 -- Total Cycles 9230501 ---- Thread 11 ---- PC 5: Stalled ----- 6645789 in-flight CPI 1.3889 -- Total Cycles 9230501 ---- Thread 12 ---- 
PC 5: Stalled ----- 5867276 in-flight CPI 1.5732 -- Total Cycles 9230501 ---- Thread 13 ---- PC 5: Stalled ----- 6210337 in-flight CPI 1.4863 -- Total Cycles 9230501 ---- Thread 14 ---- PC 5: Stalled ----- 6361407 in-flight CPI 1.4510 -- Total Cycles 9230501 ---- Thread 15 ---- PC 5: Stalled ----- 5969666 in-flight CPI 1.5462 -- Total Cycles 9230501 ---- Thread 16 ---- PC 5: Stalled ----- 5937093 in-flight CPI 1.5547 -- Total Cycles 9230501 ---- Thread 17 ---- PC 5: Stalled ----- 6003900 in-flight CPI 1.5374 -- Total Cycles 9230501 ---- Thread 18 ---- PC 5: Stalled ----- 6058708 in-flight CPI 1.5235 -- Total Cycles 9230501 ---- Thread 19 ---- PC 5: Stalled ----- 6184804 in-flight CPI 1.4924 -- Total Cycles 9230501 ---- Thread 20 ---- PC 5: Stalled ----- 5816097 in-flight CPI 1.5871 -- Total Cycles 9230501 ---- Thread 21 ---- PC 5: Stalled ----- 6130605 in-flight CPI 1.5056 -- Total Cycles 9230501 ---- Thread 22 ---- PC 5: Stalled ----- 6638631 in-flight CPI 1.3904 -- Total Cycles 9230501 ---- Thread 23 ---- PC 5: Stalled ----- 6065253 in-flight CPI 1.5219 -- Total Cycles 9230501 ---- Thread 24 ---- PC 5: Stalled ----- 5691389 in-flight CPI 1.6218 -- Total Cycles 9230501 ---- Thread 25 ---- PC 5: Stalled ----- 5979463 in-flight CPI 1.5437 -- Total Cycles 9230501 ---- Thread 26 ---- PC 5: Stalled ----- 5836916 in-flight CPI 1.5814 -- Total Cycles 9230501 ---- Thread 27 ---- PC 5: Stalled ----- 5643959 in-flight CPI 1.6355 -- Total Cycles 9230501 ---- Thread 28 ---- PC 5: Stalled ----- 5368721 in-flight CPI 1.7193 -- Total Cycles 9230501 ---- Thread 29 ---- PC 5: Stalled ----- 6260814 in-flight CPI 1.4743 -- Total Cycles 9230501 ---- Thread 30 ---- PC 5: Stalled ----- 6298434 in-flight CPI 1.4655 -- Total Cycles 9230501 ---- Thread 31 ---- PC 5: Stalled ----- 6039868 in-flight CPI 1.5283 -- Total Cycles 9230501 Total CPI 0.0470 , IPC 21.2623 -- Total Cycles 9230501 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 
28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 430054 (2.054936%) FPSUB: 0 (0.000000%) FPMUL: 1983802 (9.479245%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14681888 (70.154788%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 571138 (2.729081%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) 
bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3252736 (15.542620%) DIV: 7752 (0.037042%) FPUN: 0 (0.000000%) FPRSUB: 479 (0.002289%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215238991 total) ADD%: 8.197 (17642224) SUB%: 0.000 (0) MUL%: 0.000 (210) BITOR%: 1.226 (2639284) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.538 (1157888) FPSUB%: 0.000 (0) FPMUL%: 4.737 (10195813) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (630) FPMAX%: 0.000 (630) LOAD%: 4.948 (10649775) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41969) FPINV%: 0.000 (0) FPCONV%: 0.000 (694) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (2281447) FPLE%: 0.393 (845999) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (630) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28146) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.967 (6386759) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1607186) CMPU%: 0.000 (0) RSUB%: 0.000 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.771 (33945773) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 
0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2652121) ORI%: 1.253 (2697603) XORI%: 0.000 (0) MULI%: 3.365 (7243610) LW%: 1.193 (2568754) LWI%: 13.938 (29999428) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (648578) SWI%: 4.105 (8835176) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3191888) bged%: 0.000 (0) bgeid%: 0.000 (210) bgtd%: 0.000 (0) bgtid%: 0.323 (694282) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (84973) bned%: 0.000 (0) bneid%: 13.714 (29517997) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.744 (1600449) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (184872) DIV%: 0.000 (420) FPUN%: 1.183 (2547347) FPRSUB%: 3.704 (7971414) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.106 (6685079) FPGE%: 0.796 (1712271) SYNC%: 0.000 (0) NOP%: 8.817 (18976913) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 144 SUB 0 MUL 31 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 513 FPSUB 0 FPMUL 4820 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 2339219 INTCONV 0 ATOMIC_INC 11 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 103 FPINV 0 FPCONV 16 FPEQ 0 FPNE 0 FPLT 4 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1894 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2131 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3401254 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 764 ORI 586088 XORI 0 MULI 652401 LW 0 LWI 9532535 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1716 DIV 24 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 
0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.2623 --Total thread-cycles: 295376032 --total thread-cycles issued: 196262078 (66.444822%) --iCache conflicts: 6611228 (2.238241%) --thread*cycles of FU dependence: 16524105 (5.594261%) --thread*cycles of data dependence: 20927849 (7.085155%) --iCache cycles*banks: 295376032 (72.869495% used) Issue breakdown: --thread*cycles of issue worked: 196262078 (66.444822%) --thread*cycles of issue failed: 80137041 (27.130516%) --thread*cycles of issue NOP/other: 4589586247044927633 (1553811328552.523700%) Number of thread-cycles not ready: 20927849 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215238991 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 8 4: 7 5: 8 6: 8 7: 9 8: 7 9: 7 10: 8 11: 8 12: 8 13: 7 14: 8 15: 7 16: 7 17: 8 18: 8 19: 7 20: 7 21: 7 22: 8 23: 7 24: 8 25: 7 26: 7 27: 7 28: 7 29: 7 30: 8 31: 9 <=== Core 24 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6810662 in-flight CPI 1.3332 -- Total Cycles 9079847 ---- Thread 01 ---- PC 5: Stalled ----- 6987069 in-flight CPI 1.2995 -- Total Cycles 9079847 ---- Thread 02 ---- PC 5: Stalled ----- 7124478 in-flight CPI 1.2745 -- Total Cycles 9079847 ---- Thread 03 ---- PC 5: Stalled ----- 6229370 in-flight CPI 1.4576 -- Total Cycles 9079847 ---- Thread 04 ---- PC 5: Stalled ----- 5869631 in-flight CPI 1.5469 -- Total Cycles 9079847 ---- Thread 05 ---- PC 5: Stalled ----- 6136795 in-flight CPI 1.4796 -- Total Cycles 9079847 ---- Thread 06 ---- PC 5: Stalled ----- 6864214 in-flight CPI 1.3228 -- Total Cycles 9079847 ---- Thread 07 ---- PC 5: Stalled ----- 6305704 in-flight CPI 1.4399 -- Total Cycles 9079847 ---- Thread 08 ---- PC 5: Stalled ----- 6860953 in-flight CPI 1.3234 -- Total Cycles 9079847 ---- Thread 09 ---- PC 5: Stalled ----- 6516678 in-flight CPI 1.3933 -- Total Cycles 9079847 ---- Thread 10 ---- PC 5: Stalled ----- 6331636 in-flight CPI 1.4340 -- Total 
Cycles 9079847 ---- Thread 11 ---- PC 5: Stalled ----- 6213804 in-flight CPI 1.4612 -- Total Cycles 9079847 ---- Thread 12 ---- PC 5: Stalled ----- 6856949 in-flight CPI 1.3242 -- Total Cycles 9079847 ---- Thread 13 ---- PC 5: Stalled ----- 6215132 in-flight CPI 1.4609 -- Total Cycles 9079847 ---- Thread 14 ---- PC 5: Stalled ----- 5730929 in-flight CPI 1.5844 -- Total Cycles 9079847 ---- Thread 15 ---- PC 5: Stalled ----- 6039786 in-flight CPI 1.5033 -- Total Cycles 9079847 ---- Thread 16 ---- PC 5: Stalled ----- 5947770 in-flight CPI 1.5266 -- Total Cycles 9079847 ---- Thread 17 ---- PC 5: Stalled ----- 5842184 in-flight CPI 1.5542 -- Total Cycles 9079847 ---- Thread 18 ---- PC 5: Stalled ----- 5708824 in-flight CPI 1.5905 -- Total Cycles 9079847 ---- Thread 19 ---- PC 5: Stalled ----- 6115528 in-flight CPI 1.4847 -- Total Cycles 9079847 ---- Thread 20 ---- PC 5: Stalled ----- 6396195 in-flight CPI 1.4196 -- Total Cycles 9079847 ---- Thread 21 ---- PC 5: Stalled ----- 6174168 in-flight CPI 1.4706 -- Total Cycles 9079847 ---- Thread 22 ---- PC 5: Stalled ----- 6601791 in-flight CPI 1.3754 -- Total Cycles 9079847 ---- Thread 23 ---- PC 5: Stalled ----- 6667994 in-flight CPI 1.3617 -- Total Cycles 9079847 ---- Thread 24 ---- PC 5: Stalled ----- 5743634 in-flight CPI 1.5808 -- Total Cycles 9079847 ---- Thread 25 ---- PC 5: Stalled ----- 5563858 in-flight CPI 1.6319 -- Total Cycles 9079847 ---- Thread 26 ---- PC 5: Stalled ----- 5702366 in-flight CPI 1.5923 -- Total Cycles 9079847 ---- Thread 27 ---- PC 5: Stalled ----- 6185813 in-flight CPI 1.4678 -- Total Cycles 9079847 ---- Thread 28 ---- PC 5: Stalled ----- 5702546 in-flight CPI 1.5922 -- Total Cycles 9079847 ---- Thread 29 ---- PC 5: Stalled ----- 5843476 in-flight CPI 1.5538 -- Total Cycles 9079847 ---- Thread 30 ---- PC 5: Stalled ----- 5893213 in-flight CPI 1.5407 -- Total Cycles 9079847 ---- Thread 31 ---- PC 5: Stalled ----- 6249191 in-flight CPI 1.4530 -- Total Cycles 9079847 Total CPI 0.0455 , IPC 21.9643 
-- Total Cycles 9079847 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 437995 (2.071539%) FPSUB: 0 (0.000000%) FPMUL: 2018239 (9.545453%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14799362 (69.994990%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 575653 (2.722606%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) 
beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3304029 (15.626719%) DIV: 7703 (0.036432%) FPUN: 0 (0.000000%) FPRSUB: 478 (0.002261%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (218715181 total) ADD%: 8.170 (17869600) SUB%: 0.000 (0) MUL%: 0.000 (209) BITOR%: 1.229 (2687159) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.538 (1176185) FPSUB%: 0.000 (0) FPMUL%: 4.740 (10367598) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (627) FPMAX%: 0.000 (627) LOAD%: 4.950 (10826519) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (42320) FPINV%: 0.000 (0) FPCONV%: 0.000 (691) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (2318568) FPLE%: 0.392 (857257) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (627) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28242) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.969 (6494083) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1635022) CMPU%: 0.000 (0) RSUB%: 0.000 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) 
srl%: 0.000 (0) ADDI%: 15.775 (34503249) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2697144) ORI%: 1.257 (2749808) XORI%: 0.000 (0) MULI%: 3.366 (7362792) LW%: 1.194 (2611790) LWI%: 13.937 (30482765) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (659575) SWI%: 4.105 (8978511) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.484 (3245301) bged%: 0.000 (0) bgeid%: 0.000 (209) bgtd%: 0.000 (0) bgtid%: 0.323 (706167) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (86286) bned%: 0.000 (0) bneid%: 13.717 (30001324) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.742 (1622169) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (187842) DIV%: 0.000 (418) FPUN%: 1.186 (2593150) FPRSUB%: 3.705 (8103276) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (5) FPGT%: 3.104 (6788428) FPGE%: 0.799 (1746879) SYNC%: 0.000 (0) NOP%: 8.816 (19282213) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 162 SUB 0 MUL 13 BITOR 7 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 531 FPSUB 0 FPMUL 5294 FPCMPLT 0 FPMIN 0 FPMAX 407 LOAD 2364432 INTCONV 0 ATOMIC_INC 5 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 125 FPINV 0 FPCONV 8 FPEQ 0 FPNE 0 FPLT 14 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1917 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2150 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3456765 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 833 ORI 596717 XORI 0 MULI 661388 LW 0 LWI 9685787 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 
brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1699 DIV 24 FPUN 0 FPRSUB 9 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.9644 --Total thread-cycles: 290555104 --total thread-cycles issued: 199432968 (68.638602%) --iCache conflicts: 6701497 (2.306446%) --thread*cycles of FU dependence: 16778317 (5.774573%) --thread*cycles of data dependence: 21143459 (7.276919%) --iCache cycles*banks: 290555104 (75.274951% used) Issue breakdown: --thread*cycles of issue worked: 199432968 (68.638604%) --thread*cycles of issue failed: 71839923 (24.725060%) --thread*cycles of issue NOP/other: -4650607630519748315 (-1600594023817.165000%) Number of thread-cycles not ready: 21143459 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 218715181 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 8 5: 7 6: 8 7: 7 8: 8 9: 8 10: 7 11: 7 12: 8 13: 8 14: 7 15: 7 16: 8 17: 7 18: 7 19: 8 20: 8 21: 8 22: 8 23: 7 24: 8 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 8 <=== Core 25 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6246055 in-flight CPI 1.4395 -- Total Cycles 8990917 ---- Thread 01 ---- PC 5: Stalled ----- 5938408 in-flight CPI 1.5140 -- Total Cycles 8990917 ---- Thread 02 ---- PC 5: Stalled ----- 6593614 in-flight CPI 1.3636 -- Total Cycles 8990917 ---- Thread 03 ---- PC 5: Stalled ----- 5853938 in-flight CPI 1.5359 -- Total Cycles 8990917 ---- Thread 04 ---- PC 5: Stalled ----- 5974938 in-flight CPI 1.5048 -- Total Cycles 8990917 ---- Thread 05 ---- PC 5: Stalled ----- 5855045 in-flight CPI 1.5356 -- Total Cycles 8990917 ---- Thread 06 ---- PC 5: Stalled ----- 6719899 in-flight CPI 1.3380 -- Total Cycles 8990917 ---- Thread 07 ---- PC 5: Stalled ----- 6019060 in-flight CPI 1.4937 -- Total Cycles 8990917 ---- Thread 08 ---- PC 5: Stalled ----- 6068836 in-flight CPI 1.4815 -- Total Cycles 8990917 ---- Thread 09 ---- PC 5: Stalled ----- 
6078872 in-flight CPI 1.4790 -- Total Cycles 8990917 ---- Thread 10 ---- PC 5: Stalled ----- 6230864 in-flight CPI 1.4430 -- Total Cycles 8990917 ---- Thread 11 ---- PC 5: Stalled ----- 6349960 in-flight CPI 1.4159 -- Total Cycles 8990917 ---- Thread 12 ---- PC 5: Stalled ----- 6349037 in-flight CPI 1.4161 -- Total Cycles 8990917 ---- Thread 13 ---- PC 5: Stalled ----- 5955591 in-flight CPI 1.5097 -- Total Cycles 8990917 ---- Thread 14 ---- PC 5: Stalled ----- 5818156 in-flight CPI 1.5453 -- Total Cycles 8990917 ---- Thread 15 ---- PC 5: Stalled ----- 6391996 in-flight CPI 1.4066 -- Total Cycles 8990917 ---- Thread 16 ---- PC 5: Stalled ----- 6673694 in-flight CPI 1.3472 -- Total Cycles 8990917 ---- Thread 17 ---- PC 5: Stalled ----- 6616811 in-flight CPI 1.3588 -- Total Cycles 8990917 ---- Thread 18 ---- PC 5: Stalled ----- 6221789 in-flight CPI 1.4451 -- Total Cycles 8990917 ---- Thread 19 ---- PC 5: Stalled ----- 6046745 in-flight CPI 1.4869 -- Total Cycles 8990917 ---- Thread 20 ---- PC 5: Stalled ----- 6158020 in-flight CPI 1.4600 -- Total Cycles 8990917 ---- Thread 21 ---- PC 5: Stalled ----- 5815662 in-flight CPI 1.5460 -- Total Cycles 8990917 ---- Thread 22 ---- PC 5: Stalled ----- 6332047 in-flight CPI 1.4199 -- Total Cycles 8990917 ---- Thread 23 ---- PC 5: Stalled ----- 5812712 in-flight CPI 1.5468 -- Total Cycles 8990917 ---- Thread 24 ---- PC 5: Stalled ----- 5825227 in-flight CPI 1.5434 -- Total Cycles 8990917 ---- Thread 25 ---- PC 5: Stalled ----- 5871311 in-flight CPI 1.5313 -- Total Cycles 8990917 ---- Thread 26 ---- PC 5: Stalled ----- 6458245 in-flight CPI 1.3922 -- Total Cycles 8990917 ---- Thread 27 ---- PC 5: Stalled ----- 5805290 in-flight CPI 1.5487 -- Total Cycles 8990917 ---- Thread 28 ---- PC 5: Stalled ----- 6124781 in-flight CPI 1.4680 -- Total Cycles 8990917 ---- Thread 29 ---- PC 5: Stalled ----- 6497790 in-flight CPI 1.3837 -- Total Cycles 8990917 ---- Thread 30 ---- PC 5: Stalled ----- 5274820 in-flight CPI 1.7045 -- Total Cycles 
8990917 ---- Thread 31 ---- PC 5: Stalled ----- 6036137 in-flight CPI 1.4895 -- Total Cycles 8990917 Total CPI 0.0459 , IPC 21.8015 -- Total Cycles 8990917 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 434770 (2.060114%) FPSUB: 0 (0.000000%) FPMUL: 1995866 (9.457209%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14838714 (70.311743%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 562183 (2.663847%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) 
lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3264717 (15.469531%) DIV: 7463 (0.035363%) FPUN: 0 (0.000000%) FPRSUB: 463 (0.002194%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (214988200 total) ADD%: 8.198 (17624087) SUB%: 0.000 (0) MUL%: 0.000 (202) BITOR%: 1.222 (2627222) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.544 (1169231) FPSUB%: 0.000 (0) FPMUL%: 4.759 (10231551) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (606) FPMAX%: 0.000 (606) LOAD%: 4.948 (10638108) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41461) FPINV%: 0.000 (0) FPCONV%: 0.000 (670) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2285691) FPLE%: 0.392 (843079) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (606) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27722) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.963 (6369174) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (1604868) CMPU%: 
0.000 (0) RSUB%: 0.000 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (33897033) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2644659) ORI%: 1.258 (2704792) XORI%: 0.000 (0) MULI%: 3.360 (7224282) LW%: 1.191 (2561576) LWI%: 13.925 (29936667) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (648185) SWI%: 4.098 (8810689) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3181258) bged%: 0.000 (0) bgeid%: 0.000 (202) bgtd%: 0.000 (0) bgtid%: 0.323 (694496) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86813) bned%: 0.000 (0) bneid%: 13.713 (29480840) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.742 (1594386) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (185656) DIV%: 0.000 (404) FPUN%: 1.179 (2535591) FPRSUB%: 3.710 (7976945) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (0) FPGT%: 3.108 (6682723) FPGE%: 0.792 (1703343) SYNC%: 0.000 (0) NOP%: 8.825 (18972244) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 209 SUB 0 MUL 15 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 530 FPSUB 0 FPMUL 5197 FPCMPLT 0 FPMIN 0 FPMAX 392 LOAD 2333509 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 117 FPINV 0 FPCONV 21 FPEQ 0 FPNE 0 FPLT 10 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2064 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2167 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3397792 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 710 ORI 595413 XORI 0 MULI 643887 LW 0 LWI 
9515455 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1695 DIV 16 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8016 --Total thread-cycles: 287709344 --total thread-cycles issued: 196015956 (68.129853%) --iCache conflicts: 6599067 (2.293658%) --thread*cycles of FU dependence: 16499217 (5.734682%) --thread*cycles of data dependence: 21104176 (7.335242%) --iCache cycles*banks: 287709344 (74.724105% used) Issue breakdown: --thread*cycles of issue worked: 196015956 (68.129854%) --thread*cycles of issue failed: 72721144 (25.275906%) --thread*cycles of issue NOP/other: -4610725600907133356 (-1602563732134.863800%) Number of thread-cycles not ready: 21104176 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214988200 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 8 4: 7 5: 7 6: 8 7: 7 8: 7 9: 7 10: 7 11: 8 12: 8 13: 7 14: 7 15: 8 16: 8 17: 8 18: 7 19: 8 20: 7 21: 7 22: 8 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 26 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5864143 in-flight CPI 1.5187 -- Total Cycles 8906027 ---- Thread 01 ---- PC 5: Stalled ----- 5946409 in-flight CPI 1.4977 -- Total Cycles 8906027 ---- Thread 02 ---- PC 5: Stalled ----- 6835804 in-flight CPI 1.3028 -- Total Cycles 8906027 ---- Thread 03 ---- PC 5: Stalled ----- 6644849 in-flight CPI 1.3403 -- Total Cycles 8906027 ---- Thread 04 ---- PC 5: Stalled ----- 6531774 in-flight CPI 1.3635 -- Total Cycles 8906027 ---- Thread 05 ---- PC 5: Stalled ----- 6349498 in-flight CPI 1.4026 -- Total Cycles 8906027 ---- Thread 06 ---- PC 5: Stalled ----- 6539671 in-flight CPI 1.3618 -- Total Cycles 8906027 ---- Thread 07 ---- PC 5: Stalled ----- 5881225 in-flight CPI 1.5143 -- Total Cycles 8906027 
---- Thread 08 ---- PC 5: Stalled ----- 6103787 in-flight CPI 1.4591 -- Total Cycles 8906027 ---- Thread 09 ---- PC 5: Stalled ----- 5899334 in-flight CPI 1.5097 -- Total Cycles 8906027 ---- Thread 10 ---- PC 5: Stalled ----- 6265178 in-flight CPI 1.4215 -- Total Cycles 8906027 ---- Thread 11 ---- PC 5: Stalled ----- 6128653 in-flight CPI 1.4532 -- Total Cycles 8906027 ---- Thread 12 ---- PC 5: Stalled ----- 6020797 in-flight CPI 1.4792 -- Total Cycles 8906027 ---- Thread 13 ---- PC 5: Stalled ----- 6629941 in-flight CPI 1.3433 -- Total Cycles 8906027 ---- Thread 14 ---- PC 5: Stalled ----- 6884059 in-flight CPI 1.2937 -- Total Cycles 8906027 ---- Thread 15 ---- PC 5: Stalled ----- 6025876 in-flight CPI 1.4780 -- Total Cycles 8906027 ---- Thread 16 ---- PC 5: Stalled ----- 6003271 in-flight CPI 1.4835 -- Total Cycles 8906027 ---- Thread 17 ---- PC 5: Stalled ----- 6537403 in-flight CPI 1.3623 -- Total Cycles 8906027 ---- Thread 18 ---- PC 5: Stalled ----- 6338723 in-flight CPI 1.4050 -- Total Cycles 8906027 ---- Thread 19 ---- PC 5: Stalled ----- 6302202 in-flight CPI 1.4132 -- Total Cycles 8906027 ---- Thread 20 ---- PC 5: Stalled ----- 6754642 in-flight CPI 1.3185 -- Total Cycles 8906027 ---- Thread 21 ---- PC 5: Stalled ----- 5816829 in-flight CPI 1.5311 -- Total Cycles 8906027 ---- Thread 22 ---- PC 5: Stalled ----- 5487908 in-flight CPI 1.6228 -- Total Cycles 8906027 ---- Thread 23 ---- PC 5: Stalled ----- 6334977 in-flight CPI 1.4058 -- Total Cycles 8906027 ---- Thread 24 ---- PC 5: Stalled ----- 6208373 in-flight CPI 1.4345 -- Total Cycles 8906027 ---- Thread 25 ---- PC 5: Stalled ----- 5606866 in-flight CPI 1.5884 -- Total Cycles 8906027 ---- Thread 26 ---- PC 5: Stalled ----- 5404289 in-flight CPI 1.6480 -- Total Cycles 8906027 ---- Thread 27 ---- PC 5: Stalled ----- 6255361 in-flight CPI 1.4237 -- Total Cycles 8906027 ---- Thread 28 ---- PC 5: Stalled ----- 6360972 in-flight CPI 1.4001 -- Total Cycles 8906027 ---- Thread 29 ---- PC 5: Stalled ----- 
5949176 in-flight CPI 1.4970 -- Total Cycles 8906027 ---- Thread 30 ---- PC 5: Stalled ----- 6077137 in-flight CPI 1.4655 -- Total Cycles 8906027 ---- Thread 31 ---- PC 5: Stalled ----- 5353259 in-flight CPI 1.6637 -- Total Cycles 8906027 Total CPI 0.0451 , IPC 22.1584 -- Total Cycles 8906027 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 442477 (2.092830%) FPSUB: 0 (0.000000%) FPMUL: 2017939 (9.544461%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14788920 (69.948727%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 565750 (2.675888%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) 
ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3319352 (15.699892%) DIV: 7610 (0.035994%) FPUN: 0 (0.000000%) FPRSUB: 467 (0.002209%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216426109 total) ADD%: 8.194 (17734994) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.225 (2650422) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.548 (1185540) FPSUB%: 0.000 (0) FPMUL%: 4.771 (10325353) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.952 (10717460) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41677) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (2303157) FPLE%: 0.394 (851690) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27906) COS%: 0.000 (0) SIN%: 
0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.961 (6407699) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1619682) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.764 (34118258) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2663274) ORI%: 1.262 (2731053) XORI%: 0.000 (0) MULI%: 3.358 (7268628) LW%: 1.191 (2577070) LWI%: 13.921 (30128292) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (651653) SWI%: 4.098 (8868888) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3201055) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.323 (698349) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (87016) bned%: 0.000 (0) bneid%: 13.707 (29666151) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.740 (1601530) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (188827) DIV%: 0.000 (412) FPUN%: 1.181 (2555648) FPRSUB%: 3.714 (8037305) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.103 (6715703) FPGE%: 0.792 (1714821) SYNC%: 0.000 (0) NOP%: 8.817 (19083105) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 184 SUB 0 MUL 15 BITOR 2 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 525 FPSUB 0 FPMUL 5359 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 2327114 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 113 FPINV 0 FPCONV 24 FPEQ 0 FPNE 0 FPLT 11 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1777 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2322 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 
ADDI 3418671 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 799 ORI 605766 XORI 0 MULI 645568 LW 0 LWI 9575454 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1796 DIV 14 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.1584 --Total thread-cycles: 284992864 --total thread-cycles issued: 197343004 (69.244894%) --iCache conflicts: 6625136 (2.324667%) --thread*cycles of FU dependence: 16585935 (5.819772%) --thread*cycles of data dependence: 21142515 (7.418612%) --iCache cycles*banks: 284992864 (75.940898% used) Issue breakdown: --thread*cycles of issue worked: 197343004 (69.244893%) --thread*cycles of issue failed: 68566755 (24.059113%) --thread*cycles of issue NOP/other: -9223372035070521530 (-3236351923208.337400%) Number of thread-cycles not ready: 21142515 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216426109 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 7 4: 9 5: 8 6: 8 7: 7 8: 7 9: 7 10: 7 11: 8 12: 7 13: 8 14: 9 15: 7 16: 7 17: 8 18: 7 19: 7 20: 8 21: 7 22: 6 23: 7 24: 8 25: 7 26: 6 27: 10 28: 7 29: 7 30: 8 31: 7 <=== Core 27 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6484960 in-flight CPI 1.3782 -- Total Cycles 8937606 ---- Thread 01 ---- PC 5: Stalled ----- 6423435 in-flight CPI 1.3914 -- Total Cycles 8937606 ---- Thread 02 ---- PC 5: Stalled ----- 6507266 in-flight CPI 1.3735 -- Total Cycles 8937606 ---- Thread 03 ---- PC 5: Stalled ----- 5914677 in-flight CPI 1.5111 -- Total Cycles 8937606 ---- Thread 04 ---- PC 5: Stalled ----- 6639004 in-flight CPI 1.3462 -- Total Cycles 8937606 ---- Thread 05 ---- PC 5: Stalled ----- 6532137 in-flight CPI 1.3682 -- Total Cycles 8937606 ---- Thread 06 ---- PC 5: Stalled ----- 6369448 
in-flight CPI 1.4032 -- Total Cycles 8937606 ---- Thread 07 ---- PC 5: Stalled ----- 6352516 in-flight CPI 1.4069 -- Total Cycles 8937606 ---- Thread 08 ---- PC 5: Stalled ----- 6379456 in-flight CPI 1.4010 -- Total Cycles 8937606 ---- Thread 09 ---- PC 5: Stalled ----- 6992589 in-flight CPI 1.2781 -- Total Cycles 8937606 ---- Thread 10 ---- PC 5: Stalled ----- 5989768 in-flight CPI 1.4921 -- Total Cycles 8937606 ---- Thread 11 ---- PC 5: Stalled ----- 6038303 in-flight CPI 1.4801 -- Total Cycles 8937606 ---- Thread 12 ---- PC 5: Stalled ----- 6266649 in-flight CPI 1.4262 -- Total Cycles 8937606 ---- Thread 13 ---- PC 5: Stalled ----- 5787141 in-flight CPI 1.5444 -- Total Cycles 8937606 ---- Thread 14 ---- PC 5: Stalled ----- 6278253 in-flight CPI 1.4236 -- Total Cycles 8937606 ---- Thread 15 ---- PC 5: Stalled ----- 6610298 in-flight CPI 1.3521 -- Total Cycles 8937606 ---- Thread 16 ---- PC 5: Stalled ----- 5840647 in-flight CPI 1.5302 -- Total Cycles 8937606 ---- Thread 17 ---- PC 5: Stalled ----- 5886237 in-flight CPI 1.5184 -- Total Cycles 8937606 ---- Thread 18 ---- PC 5: Stalled ----- 6547802 in-flight CPI 1.3650 -- Total Cycles 8937606 ---- Thread 19 ---- PC 5: Stalled ----- 5651289 in-flight CPI 1.5815 -- Total Cycles 8937606 ---- Thread 20 ---- PC 5: Stalled ----- 5645873 in-flight CPI 1.5830 -- Total Cycles 8937606 ---- Thread 21 ---- PC 5: Stalled ----- 6522535 in-flight CPI 1.3703 -- Total Cycles 8937606 ---- Thread 22 ---- PC 5: Stalled ----- 5801326 in-flight CPI 1.5406 -- Total Cycles 8937606 ---- Thread 23 ---- PC 5: Stalled ----- 6062679 in-flight CPI 1.4742 -- Total Cycles 8937606 ---- Thread 24 ---- PC 5: Stalled ----- 6237043 in-flight CPI 1.4330 -- Total Cycles 8937606 ---- Thread 25 ---- PC 5: Stalled ----- 5471872 in-flight CPI 1.6334 -- Total Cycles 8937606 ---- Thread 26 ---- PC 5: Stalled ----- 6275075 in-flight CPI 1.4243 -- Total Cycles 8937606 ---- Thread 27 ---- PC 5: Stalled ----- 5407308 in-flight CPI 1.6529 -- Total Cycles 8937606 
---- Thread 28 ---- PC 5: Stalled ----- 5838353 in-flight CPI 1.5308 -- Total Cycles 8937606 ---- Thread 29 ---- PC 5: Stalled ----- 5324085 in-flight CPI 1.6787 -- Total Cycles 8937606 ---- Thread 30 ---- PC 5: Stalled ----- 5628219 in-flight CPI 1.5880 -- Total Cycles 8937606 ---- Thread 31 ---- PC 5: Stalled ----- 6019271 in-flight CPI 1.4848 -- Total Cycles 8937606 Total CPI 0.0457 , IPC 21.8992 -- Total Cycles 8937606 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 432891 (2.045072%) FPSUB: 0 (0.000000%) FPMUL: 1987895 (9.391254%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14924383 (70.506070%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 562154 (2.655739%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 
(0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3252151 (15.363877%) DIV: 7572 (0.035772%) FPUN: 0 (0.000000%) FPRSUB: 469 (0.002216%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (214657493 total) ADD%: 8.187 (17572997) SUB%: 0.000 (0) MUL%: 0.000 (205) BITOR%: 1.223 (2625681) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.542 (1163392) FPSUB%: 0.000 (0) FPMUL%: 4.753 (10203115) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.950 (10626338) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41434) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (2277951) FPLE%: 0.391 (838334) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) 
MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27820) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.967 (6368458) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1603501) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.771 (33854047) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2644363) ORI%: 1.258 (2699770) XORI%: 0.000 (0) MULI%: 3.363 (7218281) LW%: 1.193 (2561292) LWI%: 13.929 (29900370) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (647862) SWI%: 4.102 (8805871) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3181239) bged%: 0.000 (0) bgeid%: 0.000 (205) bgtd%: 0.000 (0) bgtid%: 0.323 (693842) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85860) bned%: 0.000 (0) bneid%: 13.712 (29433019) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.742 (1591702) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (184903) DIV%: 0.000 (410) FPUN%: 1.181 (2534088) FPRSUB%: 3.709 (7961693) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.107 (6668434) FPGE%: 0.795 (1706589) SYNC%: 0.000 (0) NOP%: 8.819 (18931364) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 175 SUB 0 MUL 31 BITOR 10 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 553 FPSUB 0 FPMUL 5295 FPCMPLT 0 FPMIN 0 FPMAX 391 LOAD 2368273 INTCONV 0 ATOMIC_INC 3 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 111 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 9 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1929 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 
JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2185 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3395430 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 775 ORI 591868 XORI 0 MULI 644602 LW 0 LWI 9506365 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1731 DIV 15 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8992 --Total thread-cycles: 286003392 --total thread-cycles issued: 195726129 (68.434897%) --iCache conflicts: 6587187 (2.303185%) --thread*cycles of FU dependence: 16519791 (5.776082%) --thread*cycles of data dependence: 21167515 (7.401141%) --iCache cycles*banks: 286003392 (75.054189% used) Issue breakdown: --thread*cycles of issue worked: 195726129 (68.434898%) --thread*cycles of issue failed: 71345899 (24.945823%) --thread*cycles of issue NOP/other: 39771958 (13.906114%) Number of thread-cycles not ready: 21167515 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214657493 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 7 4: 9 5: 8 6: 8 7: 7 8: 8 9: 10 10: 7 11: 8 12: 8 13: 7 14: 7 15: 8 16: 7 17: 7 18: 8 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 6 26: 8 27: 7 28: 7 29: 6 30: 7 31: 7 <=== Core 28 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6479461 in-flight CPI 1.3436 -- Total Cycles 8705593 ---- Thread 01 ---- PC 5: Stalled ----- 6522156 in-flight CPI 1.3348 -- Total Cycles 8705593 ---- Thread 02 ---- PC 5: Stalled ----- 6151645 in-flight CPI 1.4152 -- Total Cycles 8705593 ---- Thread 03 ---- PC 5: Stalled ----- 6253045 in-flight CPI 1.3922 -- Total Cycles 8705593 ---- Thread 04 ---- PC 5: Stalled ----- 6611105 in-flight CPI 1.3168 -- Total Cycles 8705593 ---- Thread 05 ---- PC 
5: Stalled ----- 6241184 in-flight CPI 1.3949 -- Total Cycles 8705593 ---- Thread 06 ---- PC 5: Stalled ----- 5925202 in-flight CPI 1.4692 -- Total Cycles 8705593 ---- Thread 07 ---- PC 5: Stalled ----- 6567528 in-flight CPI 1.3255 -- Total Cycles 8705593 ---- Thread 08 ---- PC 5: Stalled ----- 6042316 in-flight CPI 1.4408 -- Total Cycles 8705593 ---- Thread 09 ---- PC 5: Stalled ----- 6002406 in-flight CPI 1.4503 -- Total Cycles 8705593 ---- Thread 10 ---- PC 5: Stalled ----- 6680823 in-flight CPI 1.3031 -- Total Cycles 8705593 ---- Thread 11 ---- PC 5: Stalled ----- 6484177 in-flight CPI 1.3426 -- Total Cycles 8705593 ---- Thread 12 ---- PC 5: Stalled ----- 6626764 in-flight CPI 1.3137 -- Total Cycles 8705593 ---- Thread 13 ---- PC 5: Stalled ----- 6100544 in-flight CPI 1.4270 -- Total Cycles 8705593 ---- Thread 14 ---- PC 5: Stalled ----- 5913840 in-flight CPI 1.4721 -- Total Cycles 8705593 ---- Thread 15 ---- PC 5: Stalled ----- 5989180 in-flight CPI 1.4535 -- Total Cycles 8705593 ---- Thread 16 ---- PC 5: Stalled ----- 5743738 in-flight CPI 1.5157 -- Total Cycles 8705593 ---- Thread 17 ---- PC 5: Stalled ----- 6052539 in-flight CPI 1.4383 -- Total Cycles 8705593 ---- Thread 18 ---- PC 5: Stalled ----- 5827289 in-flight CPI 1.4939 -- Total Cycles 8705593 ---- Thread 19 ---- PC 5: Stalled ----- 6576089 in-flight CPI 1.3238 -- Total Cycles 8705593 ---- Thread 20 ---- PC 5: Stalled ----- 6287779 in-flight CPI 1.3845 -- Total Cycles 8705593 ---- Thread 21 ---- PC 5: Stalled ----- 5825657 in-flight CPI 1.4943 -- Total Cycles 8705593 ---- Thread 22 ---- PC 5: Stalled ----- 5588777 in-flight CPI 1.5577 -- Total Cycles 8705593 ---- Thread 23 ---- PC 5: Stalled ----- 5585970 in-flight CPI 1.5585 -- Total Cycles 8705593 ---- Thread 24 ---- PC 5: Stalled ----- 5602129 in-flight CPI 1.5540 -- Total Cycles 8705593 ---- Thread 25 ---- PC 5: Stalled ----- 5635303 in-flight CPI 1.5448 -- Total Cycles 8705593 ---- Thread 26 ---- PC 5: Stalled ----- 5555981 in-flight CPI 1.5669 
-- Total Cycles 8705593 ---- Thread 27 ---- PC 5: Stalled ----- 5867402 in-flight CPI 1.4837 -- Total Cycles 8705593 ---- Thread 28 ---- PC 5: Stalled ----- 5824151 in-flight CPI 1.4947 -- Total Cycles 8705593 ---- Thread 29 ---- PC 5: Stalled ----- 5830379 in-flight CPI 1.4931 -- Total Cycles 8705593 ---- Thread 30 ---- PC 5: Stalled ----- 5798377 in-flight CPI 1.5014 -- Total Cycles 8705593 ---- Thread 31 ---- PC 5: Stalled ----- 5489286 in-flight CPI 1.5859 -- Total Cycles 8705593 Total CPI 0.0449 , IPC 22.2481 -- Total Cycles 8705593 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 431192 (2.073657%) FPSUB: 0 (0.000000%) FPMUL: 1972298 (9.485032%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14567018 (70.054646%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 561581 (2.700715%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) 
MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3253864 (15.648247%) DIV: 7386 (0.035520%) FPUN: 0 (0.000000%) FPRSUB: 454 (0.002183%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (212411127 total) ADD%: 8.186 (17387919) SUB%: 0.000 (0) MUL%: 0.000 (200) BITOR%: 1.226 (2603350) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.545 (1158059) FPSUB%: 0.000 (0) FPMUL%: 4.761 (10112342) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (600) FPMAX%: 0.000 (600) LOAD%: 4.954 (10522786) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41343) FPINV%: 0.000 (0) FPCONV%: 0.000 (664) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2257197) FPLE%: 0.392 (831727) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 
(600) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27466) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.964 (6295099) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1590526) CMPU%: 0.000 (0) RSUB%: 0.000 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (33491424) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2616679) ORI%: 1.260 (2676235) XORI%: 0.000 (0) MULI%: 3.360 (7137372) LW%: 1.192 (2531842) LWI%: 13.924 (29575272) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (639592) SWI%: 4.101 (8711554) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3145558) bged%: 0.000 (0) bgeid%: 0.000 (200) bgtd%: 0.000 (0) bgtid%: 0.323 (685243) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (84476) bned%: 0.000 (0) bneid%: 13.711 (29123289) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.741 (1574080) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (185066) DIV%: 0.000 (400) FPUN%: 1.182 (2510031) FPRSUB%: 3.710 (7881388) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (4) FPGT%: 3.104 (6593074) FPGE%: 0.795 (1689037) SYNC%: 0.000 (0) NOP%: 8.817 (18728305) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 187 SUB 0 MUL 20 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 549 FPSUB 0 FPMUL 5072 FPCMPLT 0 FPMIN 0 FPMAX 387 LOAD 2343601 INTCONV 0 ATOMIC_INC 10 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 101 FPINV 0 FPCONV 5 FPEQ 0 FPNE 0 FPLT 7 FPLE 0 EQ 0 NE 0 LT 0 LE 0 
BNZ 0 LOADL1 0 STORE 1732 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2219 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3354637 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 788 ORI 589044 XORI 0 MULI 644078 LW 0 LWI 9397643 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1807 DIV 8 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.2481 --Total thread-cycles: 278578976 --total thread-cycles issued: 193682822 (69.525281%) --iCache conflicts: 6619220 (2.376066%) --thread*cycles of FU dependence: 16341917 (5.866170%) --thread*cycles of data dependence: 20793793 (7.464236%) --iCache cycles*banks: 278578976 (76.248094% used) Issue breakdown: --thread*cycles of issue worked: 193682822 (69.525283%) --thread*cycles of issue failed: 66167849 (23.751918%) --thread*cycles of issue NOP/other: 4621025000166376817 (1658784545236.599600%) Number of thread-cycles not ready: 20793793 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 212411127 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 9 4: 7 5: 8 6: 7 7: 8 8: 7 9: 7 10: 7 11: 8 12: 8 13: 7 14: 7 15: 7 16: 7 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 29 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6402581 in-flight CPI 1.4125 -- Total Cycles 9043986 ---- Thread 01 ---- PC 5: Stalled ----- 5886892 in-flight CPI 1.5363 -- Total Cycles 9043986 ---- Thread 02 ---- PC 5: Stalled ----- 6820814 in-flight CPI 1.3259 -- Total Cycles 9043986 ---- Thread 03 ---- PC 5: Stalled ----- 6983325 in-flight CPI 1.2951 -- Total Cycles 
9043986 ---- Thread 04 ---- PC 5: Stalled ----- 6457924 in-flight CPI 1.4004 -- Total Cycles 9043986 ---- Thread 05 ---- PC 5: Stalled ----- 6103973 in-flight CPI 1.4817 -- Total Cycles 9043986 ---- Thread 06 ---- PC 5: Stalled ----- 6043429 in-flight CPI 1.4965 -- Total Cycles 9043986 ---- Thread 07 ---- PC 5: Stalled ----- 5913296 in-flight CPI 1.5294 -- Total Cycles 9043986 ---- Thread 08 ---- PC 5: Stalled ----- 6242352 in-flight CPI 1.4488 -- Total Cycles 9043986 ---- Thread 09 ---- PC 5: Stalled ----- 5951525 in-flight CPI 1.5196 -- Total Cycles 9043986 ---- Thread 10 ---- PC 5: Stalled ----- 6103937 in-flight CPI 1.4817 -- Total Cycles 9043986 ---- Thread 11 ---- PC 5: Stalled ----- 6485486 in-flight CPI 1.3945 -- Total Cycles 9043986 ---- Thread 12 ---- PC 5: Stalled ----- 6722287 in-flight CPI 1.3454 -- Total Cycles 9043986 ---- Thread 13 ---- PC 5: Stalled ----- 6568445 in-flight CPI 1.3769 -- Total Cycles 9043986 ---- Thread 14 ---- PC 5: Stalled ----- 5745994 in-flight CPI 1.5740 -- Total Cycles 9043986 ---- Thread 15 ---- PC 5: Stalled ----- 6206275 in-flight CPI 1.4572 -- Total Cycles 9043986 ---- Thread 16 ---- PC 5: Stalled ----- 5999967 in-flight CPI 1.5073 -- Total Cycles 9043986 ---- Thread 17 ---- PC 5: Stalled ----- 6393727 in-flight CPI 1.4145 -- Total Cycles 9043986 ---- Thread 18 ---- PC 5: Stalled ----- 6089505 in-flight CPI 1.4852 -- Total Cycles 9043986 ---- Thread 19 ---- PC 5: Stalled ----- 6042089 in-flight CPI 1.4968 -- Total Cycles 9043986 ---- Thread 20 ---- PC 5: Stalled ----- 6404017 in-flight CPI 1.4122 -- Total Cycles 9043986 ---- Thread 21 ---- PC 5: Stalled ----- 6267306 in-flight CPI 1.4430 -- Total Cycles 9043986 ---- Thread 22 ---- PC 5: Stalled ----- 6201669 in-flight CPI 1.4583 -- Total Cycles 9043986 ---- Thread 23 ---- PC 5: Stalled ----- 5866838 in-flight CPI 1.5415 -- Total Cycles 9043986 ---- Thread 24 ---- PC 5: Stalled ----- 5428378 in-flight CPI 1.6661 -- Total Cycles 9043986 ---- Thread 25 ---- PC 5: Stalled 
----- 6445334 in-flight CPI 1.4032 -- Total Cycles 9043986 ---- Thread 26 ---- PC 5: Stalled ----- 5851897 in-flight CPI 1.5455 -- Total Cycles 9043986 ---- Thread 27 ---- PC 5: Stalled ----- 5453842 in-flight CPI 1.6583 -- Total Cycles 9043986 ---- Thread 28 ---- PC 5: Stalled ----- 6080365 in-flight CPI 1.4874 -- Total Cycles 9043986 ---- Thread 29 ---- PC 5: Stalled ----- 6164925 in-flight CPI 1.4670 -- Total Cycles 9043986 ---- Thread 30 ---- PC 5: Stalled ----- 6477637 in-flight CPI 1.3962 -- Total Cycles 9043986 ---- Thread 31 ---- PC 5: Stalled ----- 6124632 in-flight CPI 1.4767 -- Total Cycles 9043986 Total CPI 0.0457 , IPC 21.8854 -- Total Cycles 9043986 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 439832 (2.069932%) FPSUB: 0 (0.000000%) FPMUL: 2016881 (9.491819%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14934000 (70.282198%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 558149 (2.626754%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 
(0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3291703 (15.491370%) DIV: 7591 (0.035725%) FPUN: 0 (0.000000%) FPRSUB: 468 (0.002202%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217075581 total) ADD%: 8.193 (17784109) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.227 (2662788) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.542 (1177319) FPSUB%: 0.000 (0) FPMUL%: 4.759 (10331400) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.949 (10742555) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41089) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2306494) 
FPLE%: 0.391 (849539) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27542) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.964 (6433813) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1622897) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (34225788) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2672052) ORI%: 1.263 (2742169) XORI%: 0.000 (0) MULI%: 3.362 (7297954) LW%: 1.192 (2587326) LWI%: 13.924 (30225799) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (653991) SWI%: 4.095 (8888950) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3214353) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.323 (700887) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (88124) bned%: 0.000 (0) bneid%: 13.712 (29766403) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.739 (1604147) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (187294) DIV%: 0.000 (412) FPUN%: 1.184 (2569632) FPRSUB%: 3.711 (8055653) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (0) FPGT%: 3.103 (6736540) FPGE%: 0.797 (1730774) SYNC%: 0.000 (0) NOP%: 8.819 (19144300) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 169 SUB 0 MUL 10 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 533 FPSUB 0 FPMUL 5351 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 2318211 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 
BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 118 FPINV 0 FPCONV 15 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1772 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2322 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3428864 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 806 ORI 601939 XORI 0 MULI 649521 LW 0 LWI 9604506 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1736 DIV 20 FPUN 0 FPRSUB 2 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8854 --Total thread-cycles: 289407552 --total thread-cycles issued: 197931281 (68.391885%) --iCache conflicts: 6659234 (2.300988%) --thread*cycles of FU dependence: 16616329 (5.741498%) --thread*cycles of data dependence: 21248624 (7.342111%) --iCache cycles*banks: 289407552 (75.006893% used) Issue breakdown: --thread*cycles of issue worked: 197931281 (68.391885%) --thread*cycles of issue failed: 72331971 (24.993118%) --thread*cycles of issue NOP/other: 19182816 (6.628305%) Number of thread-cycles not ready: 21248624 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217075581 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 8 4: 7 5: 7 6: 7 7: 7 8: 7 9: 7 10: 7 11: 7 12: 8 13: 8 14: 7 15: 7 16: 8 17: 7 18: 7 19: 7 20: 8 21: 10 22: 9 23: 7 24: 7 25: 7 26: 7 27: 9 28: 8 29: 7 30: 7 31: 7 <=== Core 30 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6539626 in-flight CPI 1.3698 -- Total Cycles 8958126 ---- Thread 01 ---- PC 5: Stalled ----- 6232254 in-flight CPI 1.4374 -- Total Cycles 8958126 ---- Thread 02 ---- PC 5: Stalled ----- 5947818 in-flight CPI 
1.5061 -- Total Cycles 8958126 ---- Thread 03 ---- PC 5: Stalled ----- 6342639 in-flight CPI 1.4124 -- Total Cycles 8958126 ---- Thread 04 ---- PC 5: Stalled ----- 5787088 in-flight CPI 1.5479 -- Total Cycles 8958126 ---- Thread 05 ---- PC 5: Stalled ----- 6618117 in-flight CPI 1.3536 -- Total Cycles 8958126 ---- Thread 06 ---- PC 5: Stalled ----- 6121765 in-flight CPI 1.4633 -- Total Cycles 8958126 ---- Thread 07 ---- PC 5: Stalled ----- 6174256 in-flight CPI 1.4509 -- Total Cycles 8958126 ---- Thread 08 ---- PC 5: Stalled ----- 6238229 in-flight CPI 1.4360 -- Total Cycles 8958126 ---- Thread 09 ---- PC 5: Stalled ----- 5920593 in-flight CPI 1.5130 -- Total Cycles 8958126 ---- Thread 10 ---- PC 5: Stalled ----- 6080853 in-flight CPI 1.4732 -- Total Cycles 8958126 ---- Thread 11 ---- PC 5: Stalled ----- 6463012 in-flight CPI 1.3861 -- Total Cycles 8958126 ---- Thread 12 ---- PC 5: Stalled ----- 6329156 in-flight CPI 1.4154 -- Total Cycles 8958126 ---- Thread 13 ---- PC 5: Stalled ----- 5723774 in-flight CPI 1.5651 -- Total Cycles 8958126 ---- Thread 14 ---- PC 5: Stalled ----- 6087180 in-flight CPI 1.4716 -- Total Cycles 8958126 ---- Thread 15 ---- PC 5: Stalled ----- 6639721 in-flight CPI 1.3492 -- Total Cycles 8958126 ---- Thread 16 ---- PC 5: Stalled ----- 5834114 in-flight CPI 1.5355 -- Total Cycles 8958126 ---- Thread 17 ---- PC 5: Stalled ----- 5974439 in-flight CPI 1.4994 -- Total Cycles 8958126 ---- Thread 18 ---- PC 5: Stalled ----- 6383934 in-flight CPI 1.4032 -- Total Cycles 8958126 ---- Thread 19 ---- PC 5: Stalled ----- 6680364 in-flight CPI 1.3410 -- Total Cycles 8958126 ---- Thread 20 ---- PC 5: Stalled ----- 5842333 in-flight CPI 1.5333 -- Total Cycles 8958126 ---- Thread 21 ---- PC 5: Stalled ----- 6204580 in-flight CPI 1.4438 -- Total Cycles 8958126 ---- Thread 22 ---- PC 5: Stalled ----- 6289084 in-flight CPI 1.4244 -- Total Cycles 8958126 ---- Thread 23 ---- PC 5: Stalled ----- 5766733 in-flight CPI 1.5534 -- Total Cycles 8958126 ---- Thread 24 
---- PC 5: Stalled ----- 5791925 in-flight CPI 1.5467 -- Total Cycles 8958126 ---- Thread 25 ---- PC 5: Stalled ----- 5847877 in-flight CPI 1.5319 -- Total Cycles 8958126 ---- Thread 26 ---- PC 5: Stalled ----- 6445145 in-flight CPI 1.3899 -- Total Cycles 8958126 ---- Thread 27 ---- PC 5: Stalled ----- 6501380 in-flight CPI 1.3779 -- Total Cycles 8958126 ---- Thread 28 ---- PC 5: Stalled ----- 5881304 in-flight CPI 1.5231 -- Total Cycles 8958126 ---- Thread 29 ---- PC 5: Stalled ----- 5437535 in-flight CPI 1.6475 -- Total Cycles 8958126 ---- Thread 30 ---- PC 5: Stalled ----- 5277797 in-flight CPI 1.6973 -- Total Cycles 8958126 ---- Thread 31 ---- PC 5: Stalled ----- 5987318 in-flight CPI 1.4962 -- Total Cycles 8958126 Total CPI 0.0458 , IPC 21.8118 -- Total Cycles 8958126 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 433658 (2.022992%) FPSUB: 0 (0.000000%) FPMUL: 1987966 (9.273759%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15184816 (70.836381%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 562768 (2.625284%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 
0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3259300 (15.204466%) DIV: 7490 (0.034940%) FPUN: 0 (0.000000%) FPRSUB: 467 (0.002179%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (214275818 total) ADD%: 8.192 (17552849) SUB%: 0.000 (0) MUL%: 0.000 (203) BITOR%: 1.227 (2629870) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (1163986) FPSUB%: 0.000 (0) FPMUL%: 4.758 (10195240) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (609) FPMAX%: 0.000 (609) LOAD%: 4.950 (10606141) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 
0.019 (41419) FPINV%: 0.000 (0) FPCONV%: 0.000 (673) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (2275138) FPLE%: 0.391 (838673) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (609) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27660) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.965 (6354151) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1603193) CMPU%: 0.000 (0) RSUB%: 0.000 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.768 (33786772) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2640892) ORI%: 1.260 (2699459) XORI%: 0.000 (0) MULI%: 3.362 (7204610) LW%: 1.193 (2555522) LWI%: 13.926 (29839621) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (645637) SWI%: 4.101 (8787092) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3174974) bged%: 0.000 (0) bgeid%: 0.000 (203) bgtd%: 0.000 (0) bgtid%: 0.323 (691361) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (84071) bned%: 0.000 (0) bneid%: 13.711 (29379038) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.740 (1585333) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (185357) DIV%: 0.000 (406) FPUN%: 1.184 (2536003) FPRSUB%: 3.710 (7949416) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (4) FPGT%: 3.102 (6647109) FPGE%: 0.797 (1708115) SYNC%: 0.000 (0) NOP%: 8.813 (18883266) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 172 SUB 0 MUL 23 BITOR 5 BITAND 0 BITSLEFT 0 
BITSRIGHT 0 FPADD 523 FPSUB 0 FPMUL 5184 FPCMPLT 0 FPMIN 0 FPMAX 388 LOAD 2319982 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 122 FPINV 0 FPCONV 8 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1688 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2236 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3385090 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 797 ORI 591907 XORI 0 MULI 645872 LW 0 LWI 9483796 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1764 DIV 21 FPUN 0 FPRSUB 7 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8118 --Total thread-cycles: 286660032 --total thread-cycles issued: 195392552 (68.161767%) --iCache conflicts: 6572649 (2.292838%) --thread*cycles of FU dependence: 16439604 (5.734878%) --thread*cycles of data dependence: 21436465 (7.478010%) --iCache cycles*banks: 286660032 (74.749120% used) Issue breakdown: --thread*cycles of issue worked: 195392552 (68.161770%) --thread*cycles of issue failed: 72384214 (25.250892%) --thread*cycles of issue NOP/other: 18883266 (6.587338%) Number of thread-cycles not ready: 21436465 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214275818 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 8 4: 7 5: 8 6: 7 7: 9 8: 8 9: 7 10: 7 11: 7 12: 7 13: 7 14: 7 15: 8 16: 7 17: 7 18: 7 19: 7 20: 8 21: 8 22: 8 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 6 31: 8 <=== Core 31 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6257088 in-flight CPI 1.4620 -- Total Cycles 9147585 ---- Thread 01 ---- PC 5: Stalled ----- 
6581556 in-flight CPI 1.3899 -- Total Cycles 9147585 ---- Thread 02 ---- PC 5: Stalled ----- 6596807 in-flight CPI 1.3867 -- Total Cycles 9147585 ---- Thread 03 ---- PC 5: Stalled ----- 6944953 in-flight CPI 1.3172 -- Total Cycles 9147585 ---- Thread 04 ---- PC 5: Stalled ----- 7190192 in-flight CPI 1.2722 -- Total Cycles 9147585 ---- Thread 05 ---- PC 5: Stalled ----- 6953719 in-flight CPI 1.3155 -- Total Cycles 9147585 ---- Thread 06 ---- PC 5: Stalled ----- 6245125 in-flight CPI 1.4648 -- Total Cycles 9147585 ---- Thread 07 ---- PC 5: Stalled ----- 5883323 in-flight CPI 1.5548 -- Total Cycles 9147585 ---- Thread 08 ---- PC 5: Stalled ----- 5816496 in-flight CPI 1.5727 -- Total Cycles 9147585 ---- Thread 09 ---- PC 5: Stalled ----- 5934438 in-flight CPI 1.5414 -- Total Cycles 9147585 ---- Thread 10 ---- PC 5: Stalled ----- 6210248 in-flight CPI 1.4730 -- Total Cycles 9147585 ---- Thread 11 ---- PC 5: Stalled ----- 6495110 in-flight CPI 1.4084 -- Total Cycles 9147585 ---- Thread 12 ---- PC 5: Stalled ----- 5726548 in-flight CPI 1.5974 -- Total Cycles 9147585 ---- Thread 13 ---- PC 5: Stalled ----- 6949255 in-flight CPI 1.3163 -- Total Cycles 9147585 ---- Thread 14 ---- PC 5: Stalled ----- 5706909 in-flight CPI 1.6029 -- Total Cycles 9147585 ---- Thread 15 ---- PC 5: Stalled ----- 5935215 in-flight CPI 1.5412 -- Total Cycles 9147585 ---- Thread 16 ---- PC 5: Stalled ----- 5879435 in-flight CPI 1.5559 -- Total Cycles 9147585 ---- Thread 17 ---- PC 5: Stalled ----- 6552748 in-flight CPI 1.3960 -- Total Cycles 9147585 ---- Thread 18 ---- PC 5: Stalled ----- 5958393 in-flight CPI 1.5352 -- Total Cycles 9147585 ---- Thread 19 ---- PC 5: Stalled ----- 6054920 in-flight CPI 1.5108 -- Total Cycles 9147585 ---- Thread 20 ---- PC 5: Stalled ----- 5922615 in-flight CPI 1.5445 -- Total Cycles 9147585 ---- Thread 21 ---- PC 5: Stalled ----- 5607791 in-flight CPI 1.6312 -- Total Cycles 9147585 ---- Thread 22 ---- PC 5: Stalled ----- 5974957 in-flight CPI 1.5310 -- Total Cycles 
9147585 ---- Thread 23 ---- PC 5: Stalled ----- 5801929 in-flight CPI 1.5766 -- Total Cycles 9147585 ---- Thread 24 ---- PC 5: Stalled ----- 6049570 in-flight CPI 1.5121 -- Total Cycles 9147585 ---- Thread 25 ---- PC 5: Stalled ----- 5680049 in-flight CPI 1.6105 -- Total Cycles 9147585 ---- Thread 26 ---- PC 5: Stalled ----- 6196310 in-flight CPI 1.4763 -- Total Cycles 9147585 ---- Thread 27 ---- PC 5: Stalled ----- 5877060 in-flight CPI 1.5565 -- Total Cycles 9147585 ---- Thread 28 ---- PC 5: Stalled ----- 6284078 in-flight CPI 1.4557 -- Total Cycles 9147585 ---- Thread 29 ---- PC 5: Stalled ----- 5634329 in-flight CPI 1.6235 -- Total Cycles 9147585 ---- Thread 30 ---- PC 5: Stalled ----- 6239198 in-flight CPI 1.4661 -- Total Cycles 9147585 ---- Thread 31 ---- PC 5: Stalled ----- 5567037 in-flight CPI 1.6432 -- Total Cycles 9147585 Total CPI 0.0465 , IPC 21.5038 -- Total Cycles 9147585 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 433664 (2.034524%) FPSUB: 0 (0.000000%) FPMUL: 1994917 (9.359102%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15053293 (70.622138%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 565752 (2.654211%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 
(0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3259370 (15.291251%) DIV: 7787 (0.036533%) FPUN: 0 (0.000000%) FPRSUB: 478 (0.002243%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215722999 total) ADD%: 8.187 (17662061) SUB%: 0.000 (0) MUL%: 0.000 (211) BITOR%: 1.227 (2645851) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.540 (1164289) FPSUB%: 0.000 (0) FPMUL%: 4.748 (10241582) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (633) FPMAX%: 0.000 (633) LOAD%: 4.950 (10677449) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (243) INC_RESET%: 
0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41568) FPINV%: 0.000 (0) FPCONV%: 0.000 (697) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (2287583) FPLE%: 0.395 (851054) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (633) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27838) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.967 (6401167) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1611078) CMPU%: 0.000 (0) RSUB%: 0.000 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.773 (34025410) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2658340) ORI%: 1.256 (2709940) XORI%: 0.000 (0) MULI%: 3.364 (7256384) LW%: 1.193 (2574414) LWI%: 13.934 (30059645) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (650236) SWI%: 4.105 (8855425) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3198790) bged%: 0.000 (0) bgeid%: 0.000 (211) bgtd%: 0.000 (0) bgtid%: 0.323 (695839) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85223) bned%: 0.000 (0) bneid%: 13.710 (29576698) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.744 (1603935) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (185309) DIV%: 0.000 (422) FPUN%: 1.184 (2553436) FPRSUB%: 3.707 (7996062) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (5) FPGT%: 3.103 (6694297) FPGE%: 0.794 (1713136) SYNC%: 0.000 (0) NOP%: 8.815 (19014965) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 
(0) Number of thread-cycles contention found when issuing: ADD 174 SUB 0 MUL 26 BITOR 3 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 508 FPSUB 0 FPMUL 5480 FPCMPLT 0 FPMIN 0 FPMAX 410 LOAD 2352539 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 111 FPINV 0 FPCONV 14 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1916 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2287 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3410066 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 781 ORI 591833 XORI 0 MULI 647723 LW 0 LWI 9549367 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1642 DIV 15 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5038 --Total thread-cycles: 292722720 --total thread-cycles issued: 196708034 (67.199441%) --iCache conflicts: 6584887 (2.249531%) --thread*cycles of FU dependence: 16564940 (5.658918%) --thread*cycles of data dependence: 21315261 (7.281724%) --iCache cycles*banks: 292722720 (73.695349% used) Issue breakdown: --thread*cycles of issue worked: 196708034 (67.199442%) --thread*cycles of issue failed: 76999721 (26.304662%) --thread*cycles of issue NOP/other: -4646891830973946571 (-1587472209527.824500%) Number of thread-cycles not ready: 21315261 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215722999 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 8 4: 9 5: 8 6: 7 7: 9 8: 7 9: 7 10: 8 11: 10 12: 7 13: 8 14: 6 15: 7 16: 8 17: 8 18: 7 19: 9 20: 8 21: 7 22: 7 23: 7 24: 10 25: 6 26: 7 27: 7 28: 7 29: 8 30: 7 31: 7 <=== Core 32 ===> 
---- Thread 00 ---- PC 5: Stalled ----- 7004180 in-flight CPI 1.2980 -- Total Cycles 9091440 ---- Thread 01 ---- PC 5: Stalled ----- 5811313 in-flight CPI 1.5644 -- Total Cycles 9091440 ---- Thread 02 ---- PC 5: Stalled ----- 6056792 in-flight CPI 1.5010 -- Total Cycles 9091440 ---- Thread 03 ---- PC 5: Stalled ----- 6569783 in-flight CPI 1.3838 -- Total Cycles 9091440 ---- Thread 04 ---- PC 5: Stalled ----- 6146062 in-flight CPI 1.4792 -- Total Cycles 9091440 ---- Thread 05 ---- PC 5: Stalled ----- 5860521 in-flight CPI 1.5513 -- Total Cycles 9091440 ---- Thread 06 ---- PC 5: Stalled ----- 6410391 in-flight CPI 1.4182 -- Total Cycles 9091440 ---- Thread 07 ---- PC 5: Stalled ----- 5967028 in-flight CPI 1.5236 -- Total Cycles 9091440 ---- Thread 08 ---- PC 5: Stalled ----- 6250364 in-flight CPI 1.4545 -- Total Cycles 9091440 ---- Thread 09 ---- PC 5: Stalled ----- 6565967 in-flight CPI 1.3846 -- Total Cycles 9091440 ---- Thread 10 ---- PC 5: Stalled ----- 6275965 in-flight CPI 1.4486 -- Total Cycles 9091440 ---- Thread 11 ---- PC 5: Stalled ----- 5928502 in-flight CPI 1.5335 -- Total Cycles 9091440 ---- Thread 12 ---- PC 5: Stalled ----- 6221627 in-flight CPI 1.4613 -- Total Cycles 9091440 ---- Thread 13 ---- PC 5: Stalled ----- 5900291 in-flight CPI 1.5408 -- Total Cycles 9091440 ---- Thread 14 ---- PC 5: Stalled ----- 5832975 in-flight CPI 1.5586 -- Total Cycles 9091440 ---- Thread 15 ---- PC 5: Stalled ----- 6185313 in-flight CPI 1.4698 -- Total Cycles 9091440 ---- Thread 16 ---- PC 5: Stalled ----- 6296171 in-flight CPI 1.4440 -- Total Cycles 9091440 ---- Thread 17 ---- PC 5: Stalled ----- 5941112 in-flight CPI 1.5303 -- Total Cycles 9091440 ---- Thread 18 ---- PC 5: Stalled ----- 5989968 in-flight CPI 1.5178 -- Total Cycles 9091440 ---- Thread 19 ---- PC 5: Stalled ----- 6906276 in-flight CPI 1.3164 -- Total Cycles 9091440 ---- Thread 20 ---- PC 5: Stalled ----- 5798083 in-flight CPI 1.5680 -- Total Cycles 9091440 ---- Thread 21 ---- PC 5: Stalled ----- 
5576306 in-flight CPI 1.6304 -- Total Cycles 9091440 ---- Thread 22 ---- PC 5: Stalled ----- 6227825 in-flight CPI 1.4598 -- Total Cycles 9091440 ---- Thread 23 ---- PC 5: Stalled ----- 5665652 in-flight CPI 1.6047 -- Total Cycles 9091440 ---- Thread 24 ---- PC 5: Stalled ----- 6201328 in-flight CPI 1.4660 -- Total Cycles 9091440 ---- Thread 25 ---- PC 5: Stalled ----- 6115181 in-flight CPI 1.4867 -- Total Cycles 9091440 ---- Thread 26 ---- PC 5: Stalled ----- 5920105 in-flight CPI 1.5357 -- Total Cycles 9091440 ---- Thread 27 ---- PC 5: Stalled ----- 5620401 in-flight CPI 1.6176 -- Total Cycles 9091440 ---- Thread 28 ---- PC 5: Stalled ----- 5876825 in-flight CPI 1.5470 -- Total Cycles 9091440 ---- Thread 29 ---- PC 5: Stalled ----- 5635873 in-flight CPI 1.6131 -- Total Cycles 9091440 ---- Thread 30 ---- PC 5: Stalled ----- 6451936 in-flight CPI 1.4091 -- Total Cycles 9091440 ---- Thread 31 ---- PC 5: Stalled ----- 5786050 in-flight CPI 1.5713 -- Total Cycles 9091440 Total CPI 0.0466 , IPC 21.4484 -- Total Cycles 9091440 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 447707 (2.100886%) FPSUB: 0 (0.000000%) FPMUL: 2015392 (9.457319%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14914866 (69.988689%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 561787 (2.636211%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) 
SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3362686 (15.779557%) DIV: 7494 (0.035166%) FPUN: 0 (0.000000%) FPRSUB: 463 (0.002173%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (213860156 total) ADD%: 8.166 (17462909) SUB%: 0.000 (0) MUL%: 0.000 (203) BITOR%: 1.226 (2621547) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.559 (1195933) FPSUB%: 0.000 (0) FPMUL%: 4.804 (10273941) 
FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (609) FPMAX%: 0.000 (609) LOAD%: 4.967 (10621733) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41426) FPINV%: 0.000 (0) FPCONV%: 0.000 (673) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (2282966) FPLE%: 0.392 (837858) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (609) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27770) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.955 (6318702) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (1608094) CMPU%: 0.000 (0) RSUB%: 0.000 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.763 (33711744) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2629965) ORI%: 1.270 (2715005) XORI%: 0.000 (0) MULI%: 3.351 (7166821) LW%: 1.188 (2541404) LWI%: 13.891 (29706900) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (642851) SWI%: 4.090 (8747794) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.476 (3156356) bged%: 0.000 (0) bgeid%: 0.000 (203) bgtd%: 0.000 (0) bgtid%: 0.323 (690192) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (88104) bned%: 0.000 (0) bneid%: 13.706 (29312511) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.742 (1585818) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.090 (191422) DIV%: 0.000 (406) FPUN%: 1.180 (2524450) FPRSUB%: 3.722 (7959039) FPSQRT%: 0.000 (0) FPNEG%: 0.000 
(1) FPGT%: 3.101 (6632241) FPGE%: 0.794 (1697432) SYNC%: 0.000 (0) NOP%: 8.820 (18863381) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 168 SUB 0 MUL 13 BITOR 11 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 563 FPSUB 0 FPMUL 5283 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 2363859 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 105 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 10 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2037 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2220 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3368474 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 773 ORI 613189 XORI 0 MULI 638844 LW 0 LWI 9446232 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1792 DIV 14 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.4484 --Total thread-cycles: 290926080 --total thread-cycles issued: 194996775 (67.026225%) --iCache conflicts: 6604812 (2.270272%) --thread*cycles of FU dependence: 16444005 (5.652297%) --thread*cycles of data dependence: 21310395 (7.325021%) --iCache cycles*banks: 290926080 (73.510147% used) Issue breakdown: --thread*cycles of issue worked: 194996775 (67.026227%) --thread*cycles of issue failed: 77065924 (26.489864%) --thread*cycles of issue NOP/other: 4590901056793203989 (1578030081315.915000%) Number of thread-cycles not ready: 21310395 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 213860156 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 8 4: 7 5: 7 6: 8 7: 7 8: 8 9: 8 10: 7 11: 8 12: 
7 13: 7 14: 7 15: 7 16: 7 17: 7 18: 7 19: 8 20: 7 21: 7 22: 8 23: 7 24: 7 25: 7 26: 8 27: 7 28: 7 29: 7 30: 7 31: 8 <=== Core 33 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6163503 in-flight CPI 1.4619 -- Total Cycles 9010215 ---- Thread 01 ---- PC 5: Stalled ----- 6463244 in-flight CPI 1.3941 -- Total Cycles 9010215 ---- Thread 02 ---- PC 5: Stalled ----- 6470727 in-flight CPI 1.3925 -- Total Cycles 9010215 ---- Thread 03 ---- PC 5: Stalled ----- 6792737 in-flight CPI 1.3264 -- Total Cycles 9010215 ---- Thread 04 ---- PC 5: Stalled ----- 5993012 in-flight CPI 1.5034 -- Total Cycles 9010215 ---- Thread 05 ---- PC 5: Stalled ----- 7095636 in-flight CPI 1.2698 -- Total Cycles 9010215 ---- Thread 06 ---- PC 5: Stalled ----- 6137043 in-flight CPI 1.4682 -- Total Cycles 9010215 ---- Thread 07 ---- PC 5: Stalled ----- 6177094 in-flight CPI 1.4586 -- Total Cycles 9010215 ---- Thread 08 ---- PC 5: Stalled ----- 5757365 in-flight CPI 1.5650 -- Total Cycles 9010215 ---- Thread 09 ---- PC 5: Stalled ----- 6196841 in-flight CPI 1.4540 -- Total Cycles 9010215 ---- Thread 10 ---- PC 5: Stalled ----- 5888214 in-flight CPI 1.5302 -- Total Cycles 9010215 ---- Thread 11 ---- PC 5: Stalled ----- 6218510 in-flight CPI 1.4489 -- Total Cycles 9010215 ---- Thread 12 ---- PC 5: Stalled ----- 6838307 in-flight CPI 1.3176 -- Total Cycles 9010215 ---- Thread 13 ---- PC 5: Stalled ----- 6430096 in-flight CPI 1.4013 -- Total Cycles 9010215 ---- Thread 14 ---- PC 5: Stalled ----- 6064152 in-flight CPI 1.4858 -- Total Cycles 9010215 ---- Thread 15 ---- PC 5: Stalled ----- 6086311 in-flight CPI 1.4804 -- Total Cycles 9010215 ---- Thread 16 ---- PC 5: Stalled ----- 5765664 in-flight CPI 1.5627 -- Total Cycles 9010215 ---- Thread 17 ---- PC 5: Stalled ----- 6529636 in-flight CPI 1.3799 -- Total Cycles 9010215 ---- Thread 18 ---- PC 5: Stalled ----- 5790551 in-flight CPI 1.5560 -- Total Cycles 9010215 ---- Thread 19 ---- PC 5: Stalled ----- 6555914 in-flight CPI 1.3744 -- Total Cycles 9010215 ---- 
Thread 20 ---- PC 5: Stalled ----- 5743162 in-flight CPI 1.5689 -- Total Cycles 9010215 ---- Thread 21 ---- PC 5: Stalled ----- 5862795 in-flight CPI 1.5368 -- Total Cycles 9010215 ---- Thread 22 ---- PC 5: Stalled ----- 5956637 in-flight CPI 1.5126 -- Total Cycles 9010215 ---- Thread 23 ---- PC 5: Stalled ----- 5888801 in-flight CPI 1.5301 -- Total Cycles 9010215 ---- Thread 24 ---- PC 5: Stalled ----- 5947539 in-flight CPI 1.5149 -- Total Cycles 9010215 ---- Thread 25 ---- PC 5: Stalled ----- 6432173 in-flight CPI 1.4008 -- Total Cycles 9010215 ---- Thread 26 ---- PC 5: Stalled ----- 6448173 in-flight CPI 1.3973 -- Total Cycles 9010215 ---- Thread 27 ---- PC 5: Stalled ----- 6370681 in-flight CPI 1.4143 -- Total Cycles 9010215 ---- Thread 28 ---- PC 5: Stalled ----- 5397986 in-flight CPI 1.6692 -- Total Cycles 9010215 ---- Thread 29 ---- PC 5: Stalled ----- 5810179 in-flight CPI 1.5508 -- Total Cycles 9010215 ---- Thread 30 ---- PC 5: Stalled ----- 5482988 in-flight CPI 1.6433 -- Total Cycles 9010215 ---- Thread 31 ---- PC 5: Stalled ----- 5394537 in-flight CPI 1.6702 -- Total Cycles 9010215 Total CPI 0.0459 , IPC 21.7698 -- Total Cycles 9010215 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 439843 (2.066815%) FPSUB: 0 (0.000000%) FPMUL: 2005104 (9.421949%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14962445 (70.308270%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 560887 (2.635598%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 
(0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3304966 (15.529978%) DIV: 7493 (0.035209%) FPUN: 0 (0.000000%) FPRSUB: 464 (0.002180%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215114844 total) ADD%: 8.184 (17604543) SUB%: 0.000 (0) MUL%: 0.000 (203) BITOR%: 1.223 
(2631510) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.546 (1175044) FPSUB%: 0.000 (0) FPMUL%: 4.771 (10263958) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (609) FPMAX%: 0.000 (609) LOAD%: 4.956 (10661128) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41181) FPINV%: 0.000 (0) FPCONV%: 0.000 (673) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (2289522) FPLE%: 0.392 (842284) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (609) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27528) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.963 (6373582) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1612859) CMPU%: 0.000 (0) RSUB%: 0.000 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.764 (33910038) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2649474) ORI%: 1.262 (2714848) XORI%: 0.000 (0) MULI%: 3.360 (7228083) LW%: 1.192 (2563164) LWI%: 13.929 (29963320) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (646956) SWI%: 4.098 (8815741) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3185522) bged%: 0.000 (0) bgeid%: 0.000 (203) bgtd%: 0.000 (0) bgtid%: 0.322 (693295) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (87394) bned%: 0.000 (0) bneid%: 13.705 (29481765) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.739 (1588793) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) 
rtsd%: 0.000 (32) FPDIV%: 0.087 (188112) DIV%: 0.000 (406) FPUN%: 1.179 (2536810) FPRSUB%: 3.716 (7992958) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (4) FPGT%: 3.102 (6672310) FPGE%: 0.793 (1705245) SYNC%: 0.000 (0) NOP%: 8.816 (18964027) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 183 SUB 0 MUL 12 BITOR 7 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 553 FPSUB 0 FPMUL 5395 FPCMPLT 0 FPMIN 0 FPMAX 395 LOAD 2344837 INTCONV 0 ATOMIC_INC 9 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 109 FPINV 0 FPCONV 14 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1915 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2216 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3396462 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 797 ORI 601363 XORI 0 MULI 642787 LW 0 LWI 9519311 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1793 DIV 13 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7698 --Total thread-cycles: 288326880 --total thread-cycles issued: 196150817 (68.030707%) --iCache conflicts: 6640390 (2.303077%) --thread*cycles of FU dependence: 16518211 (5.728988%) --thread*cycles of data dependence: 21281202 (7.380929%) --iCache cycles*banks: 288326880 (74.607985% used) Issue breakdown: --thread*cycles of issue worked: 196150817 (68.030708%) --thread*cycles of issue failed: 73212036 (25.392026%) --thread*cycles of issue NOP/other: -4610725600907141573 (-1599131375093.137900%) Number of thread-cycles not ready: 21281202 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD 
issues: 215114844 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 7 5: 8 6: 9 7: 7 8: 7 9: 8 10: 7 11: 7 12: 8 13: 7 14: 7 15: 7 16: 7 17: 9 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 8 28: 7 29: 7 30: 7 31: 6 <=== Core 34 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6114540 in-flight CPI 1.4659 -- Total Cycles 8963274 ---- Thread 01 ---- PC 5: Stalled ----- 5968368 in-flight CPI 1.5018 -- Total Cycles 8963274 ---- Thread 02 ---- PC 5: Stalled ----- 6064329 in-flight CPI 1.4780 -- Total Cycles 8963274 ---- Thread 03 ---- PC 5: Stalled ----- 6361005 in-flight CPI 1.4091 -- Total Cycles 8963274 ---- Thread 04 ---- PC 5: Stalled ----- 6475088 in-flight CPI 1.3843 -- Total Cycles 8963274 ---- Thread 05 ---- PC 5: Stalled ----- 5917376 in-flight CPI 1.5147 -- Total Cycles 8963274 ---- Thread 06 ---- PC 5: Stalled ----- 6228218 in-flight CPI 1.4391 -- Total Cycles 8963274 ---- Thread 07 ---- PC 5: Stalled ----- 6317853 in-flight CPI 1.4187 -- Total Cycles 8963274 ---- Thread 08 ---- PC 5: Stalled ----- 5866151 in-flight CPI 1.5280 -- Total Cycles 8963274 ---- Thread 09 ---- PC 5: Stalled ----- 6572062 in-flight CPI 1.3638 -- Total Cycles 8963274 ---- Thread 10 ---- PC 5: Stalled ----- 5810462 in-flight CPI 1.5426 -- Total Cycles 8963274 ---- Thread 11 ---- PC 5: Stalled ----- 6042301 in-flight CPI 1.4834 -- Total Cycles 8963274 ---- Thread 12 ---- PC 5: Stalled ----- 6848979 in-flight CPI 1.3087 -- Total Cycles 8963274 ---- Thread 13 ---- PC 5: Stalled ----- 6061133 in-flight CPI 1.4788 -- Total Cycles 8963274 ---- Thread 14 ---- PC 5: Stalled ----- 5842652 in-flight CPI 1.5341 -- Total Cycles 8963274 ---- Thread 15 ---- PC 5: Stalled ----- 5826546 in-flight CPI 1.5383 -- Total Cycles 8963274 ---- Thread 16 ---- PC 5: Stalled ----- 5924340 in-flight CPI 1.5130 -- Total Cycles 8963274 ---- Thread 17 ---- PC 5: Stalled ----- 5889835 in-flight CPI 1.5218 -- Total Cycles 8963274 ---- Thread 18 ---- PC 5: Stalled ----- 
5986084 in-flight CPI 1.4973 -- Total Cycles 8963274 ---- Thread 19 ---- PC 5: Stalled ----- 5910632 in-flight CPI 1.5165 -- Total Cycles 8963274 ---- Thread 20 ---- PC 5: Stalled ----- 6192526 in-flight CPI 1.4474 -- Total Cycles 8963274 ---- Thread 21 ---- PC 5: Stalled ----- 6641010 in-flight CPI 1.3497 -- Total Cycles 8963274 ---- Thread 22 ---- PC 5: Stalled ----- 5769403 in-flight CPI 1.5536 -- Total Cycles 8963274 ---- Thread 23 ---- PC 5: Stalled ----- 6085200 in-flight CPI 1.4730 -- Total Cycles 8963274 ---- Thread 24 ---- PC 5: Stalled ----- 6568604 in-flight CPI 1.3646 -- Total Cycles 8963274 ---- Thread 25 ---- PC 5: Stalled ----- 5917268 in-flight CPI 1.5148 -- Total Cycles 8963274 ---- Thread 26 ---- PC 5: Stalled ----- 6012299 in-flight CPI 1.4908 -- Total Cycles 8963274 ---- Thread 27 ---- PC 5: Stalled ----- 5694530 in-flight CPI 1.5740 -- Total Cycles 8963274 ---- Thread 28 ---- PC 5: Stalled ----- 6062791 in-flight CPI 1.4784 -- Total Cycles 8963274 ---- Thread 29 ---- PC 5: Stalled ----- 5704680 in-flight CPI 1.5712 -- Total Cycles 8963274 ---- Thread 30 ---- PC 5: Stalled ----- 5387329 in-flight CPI 1.6638 -- Total Cycles 8963274 ---- Thread 31 ---- PC 5: Stalled ----- 5654386 in-flight CPI 1.5852 -- Total Cycles 8963274 Total CPI 0.0463 , IPC 21.6125 -- Total Cycles 8963274 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 434527 (2.026669%) FPSUB: 0 (0.000000%) FPMUL: 1980523 (9.237320%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15192923 (70.861027%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 554114 (2.584433%) FPINV: 
0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3270414 (15.253476%) DIV: 7487 (0.034920%) FPUN: 0 (0.000000%) FPRSUB: 462 (0.002155%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) 
PROF: 0 (0.000000%) Dynamic Instruction Mix: (212452176 total) ADD%: 8.168 (17353777) SUB%: 0.000 (0) MUL%: 0.000 (203) BITOR%: 1.229 (2610738) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.548 (1163679) FPSUB%: 0.000 (0) FPMUL%: 4.772 (10137434) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (609) FPMAX%: 0.000 (609) LOAD%: 4.957 (10531508) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40805) FPINV%: 0.000 (0) FPCONV%: 0.000 (673) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (2260289) FPLE%: 0.394 (836176) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (609) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27442) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.961 (6291150) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1592178) CMPU%: 0.000 (0) RSUB%: 0.000 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.769 (33502355) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2615569) ORI%: 1.264 (2685764) XORI%: 0.000 (0) MULI%: 3.358 (7134328) LW%: 1.191 (2530164) LWI%: 13.916 (29565836) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (639616) SWI%: 4.097 (8703745) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3143052) bged%: 0.000 (0) bgeid%: 0.000 (203) bgtd%: 0.000 (0) bgtid%: 0.323 (685614) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85730) bned%: 0.000 (0) bneid%: 13.712 (29130549) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 
(0) brald%: 0.000 (0) brid%: 0.741 (1574363) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.088 (186092) DIV%: 0.000 (406) FPUN%: 1.185 (2516954) FPRSUB%: 3.714 (7890515) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.101 (6587864) FPGE%: 0.796 (1691454) SYNC%: 0.000 (0) NOP%: 8.818 (18733587) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 166 SUB 0 MUL 21 BITOR 7 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 560 FPSUB 0 FPMUL 5268 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 2315911 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 98 FPINV 0 FPCONV 20 FPEQ 0 FPNE 0 FPLT 7 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1866 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2247 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3354207 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 736 ORI 593817 XORI 0 MULI 639358 LW 0 LWI 9397748 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1716 DIV 13 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6125 --Total thread-cycles: 286824768 --total thread-cycles issued: 193718589 (67.539004%) --iCache conflicts: 6559691 (2.287003%) --thread*cycles of FU dependence: 16314195 (5.687861%) --thread*cycles of data dependence: 21440450 (7.475104%) --iCache cycles*banks: 286824768 (74.070384% used) Issue breakdown: --thread*cycles of issue worked: 193718589 (67.539003%) --thread*cycles of issue failed: 74372592 (25.929627%) --thread*cycles of issue NOP/other: -9223372035070851616 
(-3215681860177.030800%) Number of thread-cycles not ready: 21440450 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 212452176 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 8 3: 7 4: 8 5: 7 6: 8 7: 7 8: 7 9: 9 10: 8 11: 7 12: 7 13: 7 14: 7 15: 7 16: 7 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 7 25: 8 26: 7 27: 7 28: 7 29: 7 30: 6 31: 8 <=== Core 35 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6454154 in-flight CPI 1.4288 -- Total Cycles 9221805 ---- Thread 01 ---- PC 5: Stalled ----- 6480284 in-flight CPI 1.4231 -- Total Cycles 9221805 ---- Thread 02 ---- PC 5: Stalled ----- 7231471 in-flight CPI 1.2752 -- Total Cycles 9221805 ---- Thread 03 ---- PC 5: Stalled ----- 6306346 in-flight CPI 1.4623 -- Total Cycles 9221805 ---- Thread 04 ---- PC 5: Stalled ----- 6646715 in-flight CPI 1.3874 -- Total Cycles 9221805 ---- Thread 05 ---- PC 5: Stalled ----- 6566288 in-flight CPI 1.4044 -- Total Cycles 9221805 ---- Thread 06 ---- PC 5: Stalled ----- 6478571 in-flight CPI 1.4234 -- Total Cycles 9221805 ---- Thread 07 ---- PC 5: Stalled ----- 6298900 in-flight CPI 1.4640 -- Total Cycles 9221805 ---- Thread 08 ---- PC 5: Stalled ----- 6184867 in-flight CPI 1.4910 -- Total Cycles 9221805 ---- Thread 09 ---- PC 5: Stalled ----- 5806734 in-flight CPI 1.5881 -- Total Cycles 9221805 ---- Thread 10 ---- PC 5: Stalled ----- 5791355 in-flight CPI 1.5923 -- Total Cycles 9221805 ---- Thread 11 ---- PC 5: Stalled ----- 6467627 in-flight CPI 1.4258 -- Total Cycles 9221805 ---- Thread 12 ---- PC 5: Stalled ----- 6539887 in-flight CPI 1.4101 -- Total Cycles 9221805 ---- Thread 13 ---- PC 5: Stalled ----- 6482115 in-flight CPI 1.4226 -- Total Cycles 9221805 ---- Thread 14 ---- PC 5: Stalled ----- 6079597 in-flight CPI 1.5168 -- Total Cycles 9221805 ---- Thread 15 ---- PC 5: Stalled ----- 6107804 in-flight CPI 1.5098 -- Total Cycles 9221805 ---- Thread 16 ---- PC 5: Stalled ----- 6331954 in-flight CPI 1.4564 -- Total Cycles 
9221805 ---- Thread 17 ---- PC 5: Stalled ----- 6602524 in-flight CPI 1.3967 -- Total Cycles 9221805 ---- Thread 18 ---- PC 5: Stalled ----- 6517307 in-flight CPI 1.4150 -- Total Cycles 9221805 ---- Thread 19 ---- PC 5: Stalled ----- 6242420 in-flight CPI 1.4773 -- Total Cycles 9221805 ---- Thread 20 ---- PC 5: Stalled ----- 6165911 in-flight CPI 1.4956 -- Total Cycles 9221805 ---- Thread 21 ---- PC 5: Stalled ----- 6601950 in-flight CPI 1.3968 -- Total Cycles 9221805 ---- Thread 22 ---- PC 5: Stalled ----- 6754759 in-flight CPI 1.3652 -- Total Cycles 9221805 ---- Thread 23 ---- PC 5: Stalled ----- 5843717 in-flight CPI 1.5781 -- Total Cycles 9221805 ---- Thread 24 ---- PC 5: Stalled ----- 6277500 in-flight CPI 1.4690 -- Total Cycles 9221805 ---- Thread 25 ---- PC 5: Stalled ----- 6633470 in-flight CPI 1.3902 -- Total Cycles 9221805 ---- Thread 26 ---- PC 5: Stalled ----- 6107591 in-flight CPI 1.5099 -- Total Cycles 9221805 ---- Thread 27 ---- PC 5: Stalled ----- 5984845 in-flight CPI 1.5409 -- Total Cycles 9221805 ---- Thread 28 ---- PC 5: Stalled ----- 5784029 in-flight CPI 1.5944 -- Total Cycles 9221805 ---- Thread 29 ---- PC 5: Stalled ----- 5408411 in-flight CPI 1.7051 -- Total Cycles 9221805 ---- Thread 30 ---- PC 5: Stalled ----- 5922812 in-flight CPI 1.5570 -- Total Cycles 9221805 ---- Thread 31 ---- PC 5: Stalled ----- 5494057 in-flight CPI 1.6785 -- Total Cycles 9221805 Total CPI 0.0460 , IPC 21.7524 -- Total Cycles 9221805 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 447868 (2.060642%) FPSUB: 0 (0.000000%) FPMUL: 2048443 (9.424890%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15309744 (70.440163%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) 
BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 570261 (2.623772%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3350009 (15.413398%) DIV: 7601 (0.034972%) FPUN: 0 (0.000000%) FPRSUB: 470 (0.002162%) FPSQRT: 0 (0.000000%) 
FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (220000498 total) ADD%: 8.159 (17950203) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.226 (2696977) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.544 (1197264) FPSUB%: 0.000 (0) FPMUL%: 4.766 (10485933) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.953 (10896694) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41888) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (2340801) FPLE%: 0.390 (858911) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27898) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.965 (6523429) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1648316) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.772 (34698243) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2711104) ORI%: 1.263 (2778341) XORI%: 0.000 (0) MULI%: 3.362 (7397442) LW%: 1.192 (2623346) LWI%: 13.926 (30637118) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (662078) SWI%: 4.096 (9012149) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3260387) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.322 (709492) bled%: 0.000 (0) 
bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (88407) bned%: 0.000 (0) bneid%: 13.717 (30177505) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.739 (1625964) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (190651) DIV%: 0.000 (412) FPUN%: 1.182 (2600671) FPRSUB%: 3.713 (8169593) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (6) FPGT%: 3.104 (6829260) FPGE%: 0.797 (1752619) SYNC%: 0.000 (0) NOP%: 8.820 (19403908) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 173 SUB 0 MUL 39 BITOR 1 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 528 FPSUB 0 FPMUL 5344 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 2420957 INTCONV 0 ATOMIC_INC 5 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 112 FPINV 0 FPCONV 15 FPEQ 0 FPNE 0 FPLT 5 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1967 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2322 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3474428 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 820 ORI 612196 XORI 0 MULI 661224 LW 0 LWI 9732671 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1843 DIV 15 FPUN 0 FPRSUB 2 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7524 --Total thread-cycles: 295097760 --total thread-cycles issued: 200596590 (67.976318%) --iCache conflicts: 6803511 (2.305511%) --thread*cycles of FU dependence: 16915102 (5.732033%) --thread*cycles of data dependence: 21734396 (7.365151%) --iCache cycles*banks: 295097760 (74.551745% used) Issue breakdown: --thread*cycles of issue worked: 
200596590 (67.976317%) --thread*cycles of issue failed: 75097262 (25.448266%) --thread*cycles of issue NOP/other: -4609964763019471740 (-1562182228363.736800%) Number of thread-cycles not ready: 21734396 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 220000498 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 7 4: 7 5: 8 6: 7 7: 8 8: 7 9: 7 10: 7 11: 8 12: 7 13: 9 14: 7 15: 7 16: 8 17: 7 18: 8 19: 7 20: 7 21: 8 22: 8 23: 7 24: 8 25: 8 26: 7 27: 7 28: 7 29: 7 30: 8 31: 7 <=== Core 36 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6648719 in-flight CPI 1.3733 -- Total Cycles 9130453 ---- Thread 01 ---- PC 5: Stalled ----- 6060839 in-flight CPI 1.5065 -- Total Cycles 9130453 ---- Thread 02 ---- PC 5: Stalled ----- 6426436 in-flight CPI 1.4208 -- Total Cycles 9130453 ---- Thread 03 ---- PC 5: Stalled ----- 6470841 in-flight CPI 1.4110 -- Total Cycles 9130453 ---- Thread 04 ---- PC 5: Stalled ----- 6013355 in-flight CPI 1.5184 -- Total Cycles 9130453 ---- Thread 05 ---- PC 5: Stalled ----- 5872830 in-flight CPI 1.5547 -- Total Cycles 9130453 ---- Thread 06 ---- PC 5: Stalled ----- 6080191 in-flight CPI 1.5017 -- Total Cycles 9130453 ---- Thread 07 ---- PC 5: Stalled ----- 5854085 in-flight CPI 1.5597 -- Total Cycles 9130453 ---- Thread 08 ---- PC 5: Stalled ----- 5964668 in-flight CPI 1.5308 -- Total Cycles 9130453 ---- Thread 09 ---- PC 5: Stalled ----- 6341043 in-flight CPI 1.4399 -- Total Cycles 9130453 ---- Thread 10 ---- PC 5: Stalled ----- 5839312 in-flight CPI 1.5636 -- Total Cycles 9130453 ---- Thread 11 ---- PC 5: Stalled ----- 6212121 in-flight CPI 1.4698 -- Total Cycles 9130453 ---- Thread 12 ---- PC 5: Stalled ----- 5751009 in-flight CPI 1.5876 -- Total Cycles 9130453 ---- Thread 13 ---- PC 5: Stalled ----- 6130408 in-flight CPI 1.4894 -- Total Cycles 9130453 ---- Thread 14 ---- PC 5: Stalled ----- 6744379 in-flight CPI 1.3538 -- Total Cycles 9130453 ---- Thread 15 ---- PC 5: Stalled ----- 7013978 
in-flight CPI 1.3017 -- Total Cycles 9130453 ---- Thread 16 ---- PC 5: Stalled ----- 5908828 in-flight CPI 1.5452 -- Total Cycles 9130453 ---- Thread 17 ---- PC 5: Stalled ----- 6127736 in-flight CPI 1.4900 -- Total Cycles 9130453 ---- Thread 18 ---- PC 5: Stalled ----- 5913081 in-flight CPI 1.5441 -- Total Cycles 9130453 ---- Thread 19 ---- PC 5: Stalled ----- 5833278 in-flight CPI 1.5652 -- Total Cycles 9130453 ---- Thread 20 ---- PC 5: Stalled ----- 6374521 in-flight CPI 1.4323 -- Total Cycles 9130453 ---- Thread 21 ---- PC 5: Stalled ----- 6412377 in-flight CPI 1.4239 -- Total Cycles 9130453 ---- Thread 22 ---- PC 5: Stalled ----- 5714349 in-flight CPI 1.5978 -- Total Cycles 9130453 ---- Thread 23 ---- PC 5: Stalled ----- 5587956 in-flight CPI 1.6339 -- Total Cycles 9130453 ---- Thread 24 ---- PC 5: Stalled ----- 5527924 in-flight CPI 1.6517 -- Total Cycles 9130453 ---- Thread 25 ---- PC 5: Stalled ----- 5651378 in-flight CPI 1.6156 -- Total Cycles 9130453 ---- Thread 26 ---- PC 5: Stalled ----- 5749468 in-flight CPI 1.5880 -- Total Cycles 9130453 ---- Thread 27 ---- PC 5: Stalled ----- 5805997 in-flight CPI 1.5726 -- Total Cycles 9130453 ---- Thread 28 ---- PC 5: Stalled ----- 5524251 in-flight CPI 1.6528 -- Total Cycles 9130453 ---- Thread 29 ---- PC 5: Stalled ----- 6339290 in-flight CPI 1.4403 -- Total Cycles 9130453 ---- Thread 30 ---- PC 5: Stalled ----- 5833081 in-flight CPI 1.5653 -- Total Cycles 9130453 ---- Thread 31 ---- PC 5: Stalled ----- 5872260 in-flight CPI 1.5548 -- Total Cycles 9130453 Total CPI 0.0472 , IPC 21.2038 -- Total Cycles 9130453 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 428629 (2.032810%) FPSUB: 0 (0.000000%) FPMUL: 1969512 (9.340579%) FPCMPLT: 0 (0.000000%) FPMIN: 0 
(0.000000%) FPMAX: 0 (0.000000%) LOAD: 14898437 (70.657111%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 558868 (2.650479%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) 
rtsd: 0 (0.000000%) FPDIV: 3222297 (15.282019%) DIV: 7345 (0.034834%) FPUN: 0 (0.000000%) FPRSUB: 457 (0.002167%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (212329805 total) ADD%: 8.202 (17415503) SUB%: 0.000 (0) MUL%: 0.000 (199) BITOR%: 1.225 (2600260) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (1153272) FPSUB%: 0.000 (0) FPMUL%: 4.757 (10099690) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (597) FPMAX%: 0.000 (597) LOAD%: 4.946 (10500962) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41220) FPINV%: 0.000 (0) FPCONV%: 0.000 (661) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (2255869) FPLE%: 0.391 (829638) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (597) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27538) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.963 (6290859) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1585599) CMPU%: 0.000 (0) RSUB%: 0.000 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.766 (33475181) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2613143) ORI%: 1.258 (2671684) XORI%: 0.000 (0) MULI%: 3.362 (7137492) LW%: 1.192 (2530154) LWI%: 13.924 (29564387) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (640101) SWI%: 4.097 (8699627) sb%: 0.000 (0) sbi%: 0.000 (0) 
beqd%: 0.000 (0) beqid%: 1.480 (3142295) bged%: 0.000 (0) bgeid%: 0.000 (199) bgtd%: 0.000 (0) bgtid%: 0.323 (685795) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (84236) bned%: 0.000 (0) bneid%: 13.715 (29121894) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.740 (1572289) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (183202) DIV%: 0.000 (398) FPUN%: 1.182 (2508770) FPRSUB%: 3.710 (7877331) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (4) FPGT%: 3.108 (6598901) FPGE%: 0.796 (1689916) SYNC%: 0.000 (0) NOP%: 8.821 (18729219) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 178 SUB 0 MUL 31 BITOR 6 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 563 FPSUB 0 FPMUL 5002 FPCMPLT 0 FPMIN 0 FPMAX 381 LOAD 2308278 INTCONV 0 ATOMIC_INC 9 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 98 FPINV 0 FPCONV 13 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1961 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2201 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3356833 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 779 ORI 585621 XORI 0 MULI 638140 LW 0 LWI 9400824 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1752 DIV 19 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.2038 --Total thread-cycles: 292174496 --total thread-cycles issued: 193600586 (66.261975%) --iCache conflicts: 6523132 (2.232615%) --thread*cycles of FU dependence: 16302718 (5.579788%) --thread*cycles of data 
dependence: 21085545 (7.216764%) --iCache cycles*banks: 292174496 (72.672269% used) Issue breakdown: --thread*cycles of issue worked: 193600586 (66.261973%) --thread*cycles of issue failed: 79844691 (27.327742%) --thread*cycles of issue NOP/other: -4646453675590563581 (-1590300912366.616700%) Number of thread-cycles not ready: 21085545 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 212329805 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 8 3: 8 4: 7 5: 7 6: 7 7: 7 8: 7 9: 7 10: 7 11: 8 12: 8 13: 7 14: 7 15: 8 16: 7 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 37 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5968924 in-flight CPI 1.4896 -- Total Cycles 8891550 ---- Thread 01 ---- PC 5: Stalled ----- 5804838 in-flight CPI 1.5317 -- Total Cycles 8891550 ---- Thread 02 ---- PC 5: Stalled ----- 6159537 in-flight CPI 1.4435 -- Total Cycles 8891550 ---- Thread 03 ---- PC 5: Stalled ----- 6250034 in-flight CPI 1.4226 -- Total Cycles 8891550 ---- Thread 04 ---- PC 5: Stalled ----- 6157860 in-flight CPI 1.4439 -- Total Cycles 8891550 ---- Thread 05 ---- PC 5: Stalled ----- 6946864 in-flight CPI 1.2799 -- Total Cycles 8891550 ---- Thread 06 ---- PC 5: Stalled ----- 6211363 in-flight CPI 1.4315 -- Total Cycles 8891550 ---- Thread 07 ---- PC 5: Stalled ----- 6452024 in-flight CPI 1.3781 -- Total Cycles 8891550 ---- Thread 08 ---- PC 5: Stalled ----- 6132740 in-flight CPI 1.4498 -- Total Cycles 8891550 ---- Thread 09 ---- PC 5: Stalled ----- 6613558 in-flight CPI 1.3444 -- Total Cycles 8891550 ---- Thread 10 ---- PC 5: Stalled ----- 5784940 in-flight CPI 1.5370 -- Total Cycles 8891550 ---- Thread 11 ---- PC 5: Stalled ----- 6730908 in-flight CPI 1.3210 -- Total Cycles 8891550 ---- Thread 12 ---- PC 5: Stalled ----- 6243591 in-flight CPI 1.4241 -- Total Cycles 8891550 ---- Thread 13 ---- PC 5: Stalled ----- 6632764 in-flight CPI 1.3405 -- Total Cycles 8891550 ---- 
Thread 14 ---- PC 5: Stalled ----- 5948265 in-flight CPI 1.4948 -- Total Cycles 8891550 ---- Thread 15 ---- PC 5: Stalled ----- 5732417 in-flight CPI 1.5511 -- Total Cycles 8891550 ---- Thread 16 ---- PC 5: Stalled ----- 5861083 in-flight CPI 1.5170 -- Total Cycles 8891550 ---- Thread 17 ---- PC 5: Stalled ----- 5655643 in-flight CPI 1.5722 -- Total Cycles 8891550 ---- Thread 18 ---- PC 5: Stalled ----- 5762298 in-flight CPI 1.5431 -- Total Cycles 8891550 ---- Thread 19 ---- PC 5: Stalled ----- 6086210 in-flight CPI 1.4609 -- Total Cycles 8891550 ---- Thread 20 ---- PC 5: Stalled ----- 5976529 in-flight CPI 1.4877 -- Total Cycles 8891550 ---- Thread 21 ---- PC 5: Stalled ----- 6287820 in-flight CPI 1.4141 -- Total Cycles 8891550 ---- Thread 22 ---- PC 5: Stalled ----- 5697415 in-flight CPI 1.5606 -- Total Cycles 8891550 ---- Thread 23 ---- PC 5: Stalled ----- 6378081 in-flight CPI 1.3941 -- Total Cycles 8891550 ---- Thread 24 ---- PC 5: Stalled ----- 5768406 in-flight CPI 1.5414 -- Total Cycles 8891550 ---- Thread 25 ---- PC 5: Stalled ----- 5766172 in-flight CPI 1.5420 -- Total Cycles 8891550 ---- Thread 26 ---- PC 5: Stalled ----- 6016423 in-flight CPI 1.4779 -- Total Cycles 8891550 ---- Thread 27 ---- PC 5: Stalled ----- 5852941 in-flight CPI 1.5192 -- Total Cycles 8891550 ---- Thread 28 ---- PC 5: Stalled ----- 6203736 in-flight CPI 1.4333 -- Total Cycles 8891550 ---- Thread 29 ---- PC 5: Stalled ----- 6448757 in-flight CPI 1.3788 -- Total Cycles 8891550 ---- Thread 30 ---- PC 5: Stalled ----- 5764288 in-flight CPI 1.5425 -- Total Cycles 8891550 ---- Thread 31 ---- PC 5: Stalled ----- 5829412 in-flight CPI 1.5253 -- Total Cycles 8891550 Total CPI 0.0456 , IPC 21.9451 -- Total Cycles 8891550 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) 
BITSRIGHT: 0 (0.000000%) FPADD: 437192 (2.028479%) FPSUB: 0 (0.000000%) FPMUL: 1994918 (9.256000%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15287756 (70.931973%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 551952 (2.560941%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 
(0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3273044 (15.186236%) DIV: 7382 (0.034251%) FPUN: 0 (0.000000%) FPRSUB: 457 (0.002120%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (213999820 total) ADD%: 8.205 (17559386) SUB%: 0.000 (0) MUL%: 0.000 (200) BITOR%: 1.227 (2626115) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (1170106) FPSUB%: 0.000 (0) FPMUL%: 4.769 (10205363) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (600) FPMAX%: 0.000 (600) LOAD%: 4.952 (10596352) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40698) FPINV%: 0.000 (0) FPCONV%: 0.000 (664) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (2277061) FPLE%: 0.390 (834147) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (600) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27314) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.960 (6335294) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1603436) CMPU%: 0.000 (0) RSUB%: 0.000 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.764 (33734632) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2633756) ORI%: 1.265 (2707487) XORI%: 0.000 (0) MULI%: 3.358 (7185868) LW%: 1.191 (2547780) 
LWI%: 13.906 (29759090) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (644507) SWI%: 4.090 (8752118) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3164493) bged%: 0.000 (0) bgeid%: 0.000 (200) bgtd%: 0.000 (0) bgtid%: 0.323 (691443) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86282) bned%: 0.000 (0) bneid%: 13.713 (29346608) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.739 (1580536) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (186258) DIV%: 0.000 (400) FPUN%: 1.183 (2531755) FPRSUB%: 3.713 (7944936) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (5) FPGT%: 3.104 (6641558) FPGE%: 0.798 (1708265) SYNC%: 0.000 (0) NOP%: 8.819 (18873379) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 176 SUB 0 MUL 30 BITOR 8 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 524 FPSUB 0 FPMUL 5214 FPCMPLT 0 FPMIN 0 FPMAX 390 LOAD 2310536 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 125 FPINV 0 FPCONV 22 FPEQ 0 FPNE 0 FPLT 9 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1799 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2191 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3380256 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 834 ORI 598733 XORI 0 MULI 638544 LW 0 LWI 9461884 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1772 DIV 16 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.9452 --Total thread-cycles: 284529600 --total thread-cycles issued: 195126441 
(68.578611%) --iCache conflicts: 6567129 (2.308065%) --thread*cycles of FU dependence: 16403076 (5.764981%) --thread*cycles of data dependence: 21552701 (7.574854%) --iCache cycles*banks: 284529600 (75.211806% used) Issue breakdown: --thread*cycles of issue worked: 195126441 (68.578609%) --thread*cycles of issue failed: 70529780 (24.788205%) --thread*cycles of issue NOP/other: -4643430568369849309 (-1631967488925.528100%) Number of thread-cycles not ready: 21552701 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 213999820 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 8 5: 9 6: 7 7: 8 8: 7 9: 7 10: 7 11: 8 12: 8 13: 7 14: 7 15: 7 16: 7 17: 8 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 8 30: 7 31: 7 <=== Core 38 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5982675 in-flight CPI 1.4912 -- Total Cycles 8921567 ---- Thread 01 ---- PC 5: Stalled ----- 6291137 in-flight CPI 1.4181 -- Total Cycles 8921567 ---- Thread 02 ---- PC 5: Stalled ----- 6802162 in-flight CPI 1.3116 -- Total Cycles 8921567 ---- Thread 03 ---- PC 5: Stalled ----- 5947225 in-flight CPI 1.5001 -- Total Cycles 8921567 ---- Thread 04 ---- PC 5: Stalled ----- 5979465 in-flight CPI 1.4920 -- Total Cycles 8921567 ---- Thread 05 ---- PC 5: Stalled ----- 6155737 in-flight CPI 1.4493 -- Total Cycles 8921567 ---- Thread 06 ---- PC 5: Stalled ----- 6577065 in-flight CPI 1.3565 -- Total Cycles 8921567 ---- Thread 07 ---- PC 5: Stalled ----- 6961283 in-flight CPI 1.2816 -- Total Cycles 8921567 ---- Thread 08 ---- PC 5: Stalled ----- 5861421 in-flight CPI 1.5221 -- Total Cycles 8921567 ---- Thread 09 ---- PC 5: Stalled ----- 6437440 in-flight CPI 1.3859 -- Total Cycles 8921567 ---- Thread 10 ---- PC 5: Stalled ----- 6675463 in-flight CPI 1.3365 -- Total Cycles 8921567 ---- Thread 11 ---- PC 5: Stalled ----- 5998285 in-flight CPI 1.4873 -- Total Cycles 8921567 ---- Thread 12 ---- PC 5: Stalled ----- 6414081 in-flight 
CPI 1.3909 -- Total Cycles 8921567 ---- Thread 13 ---- PC 5: Stalled ----- 5846364 in-flight CPI 1.5260 -- Total Cycles 8921567 ---- Thread 14 ---- PC 5: Stalled ----- 6750322 in-flight CPI 1.3216 -- Total Cycles 8921567 ---- Thread 15 ---- PC 5: Stalled ----- 5985650 in-flight CPI 1.4905 -- Total Cycles 8921567 ---- Thread 16 ---- PC 5: Stalled ----- 5796544 in-flight CPI 1.5391 -- Total Cycles 8921567 ---- Thread 17 ---- PC 5: Stalled ----- 5808917 in-flight CPI 1.5358 -- Total Cycles 8921567 ---- Thread 18 ---- PC 5: Stalled ----- 6166853 in-flight CPI 1.4467 -- Total Cycles 8921567 ---- Thread 19 ---- PC 5: Stalled ----- 6405690 in-flight CPI 1.3928 -- Total Cycles 8921567 ---- Thread 20 ---- PC 5: Stalled ----- 6007579 in-flight CPI 1.4850 -- Total Cycles 8921567 ---- Thread 21 ---- PC 5: Stalled ----- 5797733 in-flight CPI 1.5388 -- Total Cycles 8921567 ---- Thread 22 ---- PC 5: Stalled ----- 6043657 in-flight CPI 1.4762 -- Total Cycles 8921567 ---- Thread 23 ---- PC 5: Stalled ----- 5849298 in-flight CPI 1.5252 -- Total Cycles 8921567 ---- Thread 24 ---- PC 5: Stalled ----- 6235268 in-flight CPI 1.4308 -- Total Cycles 8921567 ---- Thread 25 ---- PC 5: Stalled ----- 5816148 in-flight CPI 1.5339 -- Total Cycles 8921567 ---- Thread 26 ---- PC 5: Stalled ----- 6476821 in-flight CPI 1.3775 -- Total Cycles 8921567 ---- Thread 27 ---- PC 5: Stalled ----- 5583449 in-flight CPI 1.5979 -- Total Cycles 8921567 ---- Thread 28 ---- PC 5: Stalled ----- 5397385 in-flight CPI 1.6529 -- Total Cycles 8921567 ---- Thread 29 ---- PC 5: Stalled ----- 6242210 in-flight CPI 1.4292 -- Total Cycles 8921567 ---- Thread 30 ---- PC 5: Stalled ----- 5983855 in-flight CPI 1.4909 -- Total Cycles 8921567 ---- Thread 31 ---- PC 5: Stalled ----- 5911901 in-flight CPI 1.5091 -- Total Cycles 8921567 Total CPI 0.0455 , IPC 21.9905 -- Total Cycles 8921567 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls 
(caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 426605 (2.068135%) FPSUB: 0 (0.000000%) FPMUL: 1977840 (9.588354%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14421245 (69.912636%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 569833 (2.762489%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 
(0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3223738 (15.628333%) DIV: 7785 (0.037741%) FPUN: 0 (0.000000%) FPRSUB: 477 (0.002312%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215167202 total) ADD%: 8.177 (17595102) SUB%: 0.000 (0) MUL%: 0.000 (211) BITOR%: 1.227 (2640512) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.535 (1150905) FPSUB%: 0.000 (0) FPMUL%: 4.731 (10180172) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (633) FPMAX%: 0.000 (633) LOAD%: 4.944 (10638013) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41948) FPINV%: 0.000 (0) FPCONV%: 0.000 (697) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.059 (2278106) FPLE%: 0.396 (852308) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (633) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28204) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.969 (6388151) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.744 (1601048) CMPU%: 0.000 (0) RSUB%: 0.000 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.776 (33944840) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) 
RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2649776) ORI%: 1.252 (2694081) XORI%: 0.000 (0) MULI%: 3.367 (7244769) LW%: 1.194 (2569366) LWI%: 13.949 (30012858) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (649549) SWI%: 4.110 (8842901) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3191571) bged%: 0.000 (0) bgeid%: 0.000 (211) bgtd%: 0.000 (0) bgtid%: 0.323 (694650) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85149) bned%: 0.000 (0) bneid%: 13.716 (29512828) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.745 (1603569) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.085 (183150) DIV%: 0.000 (422) FPUN%: 1.186 (2551494) FPRSUB%: 3.702 (7966483) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.106 (6684097) FPGE%: 0.795 (1710123) SYNC%: 0.000 (0) NOP%: 8.820 (18977486) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 174 SUB 0 MUL 27 BITOR 6 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 528 FPSUB 0 FPMUL 4996 FPCMPLT 0 FPMIN 0 FPMAX 416 LOAD 2306781 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 106 FPINV 0 FPCONV 16 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2004 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2148 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3403319 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 716 ORI 580490 XORI 0 MULI 651378 LW 0 LWI 9534449 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1592 DIV 13 FPUN 0 FPRSUB 6 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 
HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.9905 --Total thread-cycles: 285490144 --total thread-cycles issued: 196189716 (68.720310%) --iCache conflicts: 6642144 (2.326576%) --thread*cycles of FU dependence: 16489194 (5.775749%) --thread*cycles of data dependence: 20627523 (7.225301%) --iCache cycles*banks: 285490144 (75.367658% used) Issue breakdown: --thread*cycles of issue worked: 196189716 (68.720311%) --thread*cycles of issue failed: 70322942 (24.632354%) --thread*cycles of issue NOP/other: -4629185639317663026 (-1621487023845.440700%) Number of thread-cycles not ready: 20627523 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215167202 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 8 3: 8 4: 7 5: 7 6: 8 7: 9 8: 7 9: 8 10: 9 11: 7 12: 8 13: 9 14: 8 15: 7 16: 7 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 8 26: 7 27: 7 28: 7 29: 9 30: 9 31: 7 <=== Core 39 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6395195 in-flight CPI 1.4132 -- Total Cycles 9037648 ---- Thread 01 ---- PC 5: Stalled ----- 6261392 in-flight CPI 1.4434 -- Total Cycles 9037648 ---- Thread 02 ---- PC 5: Stalled ----- 6961050 in-flight CPI 1.2983 -- Total Cycles 9037648 ---- Thread 03 ---- PC 5: Stalled ----- 6101855 in-flight CPI 1.4811 -- Total Cycles 9037648 ---- Thread 04 ---- PC 5: Stalled ----- 5811953 in-flight CPI 1.5550 -- Total Cycles 9037648 ---- Thread 05 ---- PC 5: Stalled ----- 5848149 in-flight CPI 1.5454 -- Total Cycles 9037648 ---- Thread 06 ---- PC 5: Stalled ----- 6195828 in-flight CPI 1.4587 -- Total Cycles 9037648 ---- Thread 07 ---- PC 5: Stalled ----- 7069139 in-flight CPI 1.2785 -- Total Cycles 9037648 ---- Thread 08 ---- PC 5: Stalled ----- 6136174 in-flight CPI 1.4728 -- Total Cycles 9037648 ---- Thread 09 ---- PC 5: Stalled ----- 6069272 in-flight CPI 1.4891 -- Total Cycles 9037648 ---- Thread 10 ---- PC 5: Stalled ----- 6183643 in-flight CPI 1.4615 -- Total Cycles 9037648 ---- Thread 
11 ---- PC 5: Stalled ----- 6799031 in-flight CPI 1.3293 -- Total Cycles 9037648 ---- Thread 12 ---- PC 5: Stalled ----- 5885842 in-flight CPI 1.5355 -- Total Cycles 9037648 ---- Thread 13 ---- PC 5: Stalled ----- 6280234 in-flight CPI 1.4391 -- Total Cycles 9037648 ---- Thread 14 ---- PC 5: Stalled ----- 5814311 in-flight CPI 1.5544 -- Total Cycles 9037648 ---- Thread 15 ---- PC 5: Stalled ----- 5839505 in-flight CPI 1.5477 -- Total Cycles 9037648 ---- Thread 16 ---- PC 5: Stalled ----- 6086693 in-flight CPI 1.4848 -- Total Cycles 9037648 ---- Thread 17 ---- PC 5: Stalled ----- 6572217 in-flight CPI 1.3751 -- Total Cycles 9037648 ---- Thread 18 ---- PC 5: Stalled ----- 5712448 in-flight CPI 1.5821 -- Total Cycles 9037648 ---- Thread 19 ---- PC 5: Stalled ----- 5747180 in-flight CPI 1.5725 -- Total Cycles 9037648 ---- Thread 20 ---- PC 5: Stalled ----- 6172790 in-flight CPI 1.4641 -- Total Cycles 9037648 ---- Thread 21 ---- PC 5: Stalled ----- 6247412 in-flight CPI 1.4466 -- Total Cycles 9037648 ---- Thread 22 ---- PC 5: Stalled ----- 6452809 in-flight CPI 1.4006 -- Total Cycles 9037648 ---- Thread 23 ---- PC 5: Stalled ----- 6448371 in-flight CPI 1.4015 -- Total Cycles 9037648 ---- Thread 24 ---- PC 5: Stalled ----- 5630442 in-flight CPI 1.6051 -- Total Cycles 9037648 ---- Thread 25 ---- PC 5: Stalled ----- 6578989 in-flight CPI 1.3737 -- Total Cycles 9037648 ---- Thread 26 ---- PC 5: Stalled ----- 5784784 in-flight CPI 1.5623 -- Total Cycles 9037648 ---- Thread 27 ---- PC 5: Stalled ----- 6111535 in-flight CPI 1.4788 -- Total Cycles 9037648 ---- Thread 28 ---- PC 5: Stalled ----- 6271408 in-flight CPI 1.4411 -- Total Cycles 9037648 ---- Thread 29 ---- PC 5: Stalled ----- 5696044 in-flight CPI 1.5866 -- Total Cycles 9037648 ---- Thread 30 ---- PC 5: Stalled ----- 5776088 in-flight CPI 1.5647 -- Total Cycles 9037648 ---- Thread 31 ---- PC 5: Stalled ----- 5909188 in-flight CPI 1.5294 -- Total Cycles 9037648 Total CPI 0.0459 , IPC 21.7813 -- Total Cycles 9037648 
kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 446381 (2.042000%) FPSUB: 0 (0.000000%) FPMUL: 2025663 (9.266530%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15476453 (70.798063%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 562643 (2.573848%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) 
bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3340932 (15.283315%) DIV: 7457 (0.034113%) FPUN: 0 (0.000000%) FPRSUB: 466 (0.002132%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215896722 total) ADD%: 8.177 (17653709) SUB%: 0.000 (0) MUL%: 0.000 (202) BITOR%: 1.222 (2638986) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.553 (1193385) FPSUB%: 0.000 (0) FPMUL%: 4.790 (10340684) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (606) FPMAX%: 0.000 (606) LOAD%: 4.957 (10701778) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41466) FPINV%: 0.000 (0) FPCONV%: 0.000 (670) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (2302611) FPLE%: 0.391 (843476) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (606) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27644) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.958 (6385618) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (1620357) CMPU%: 0.000 (0) RSUB%: 0.000 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 
15.763 (34031904) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2656365) ORI%: 1.265 (2730037) XORI%: 0.000 (0) MULI%: 3.356 (7245825) LW%: 1.190 (2568144) LWI%: 13.913 (30037626) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (649254) SWI%: 4.093 (8836176) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.478 (3190136) bged%: 0.000 (0) bgeid%: 0.000 (202) bgtd%: 0.000 (0) bgtid%: 0.323 (696764) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (87490) bned%: 0.000 (0) bneid%: 13.708 (29595774) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.739 (1596501) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.088 (190142) DIV%: 0.000 (404) FPUN%: 1.178 (2542232) FPRSUB%: 3.720 (8030828) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.105 (6703287) FPGE%: 0.792 (1709548) SYNC%: 0.000 (0) NOP%: 8.821 (19045145) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 172 SUB 0 MUL 27 BITOR 6 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 579 FPSUB 0 FPMUL 5248 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 2370030 INTCONV 0 ATOMIC_INC 5 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 116 FPINV 0 FPCONV 16 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1870 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2126 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3409064 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 832 ORI 610943 XORI 0 MULI 637005 LW 0 LWI 9551072 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 
brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1771 DIV 12 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7813 --Total thread-cycles: 289204736 --total thread-cycles issued: 196851577 (68.066515%) --iCache conflicts: 6587299 (2.277729%) --thread*cycles of FU dependence: 16591326 (5.736879%) --thread*cycles of data dependence: 21859995 (7.558657%) --iCache cycles*banks: 289204736 (74.651874% used) Issue breakdown: --thread*cycles of issue worked: 196851577 (68.066512%) --thread*cycles of issue failed: 73308014 (25.348137%) --thread*cycles of issue NOP/other: 39893383 (13.794167%) Number of thread-cycles not ready: 21859995 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215896722 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 7 4: 7 5: 7 6: 7 7: 8 8: 7 9: 7 10: 7 11: 8 12: 8 13: 7 14: 7 15: 7 16: 7 17: 8 18: 7 19: 7 20: 8 21: 7 22: 8 23: 7 24: 7 25: 8 26: 7 27: 8 28: 7 29: 7 30: 7 31: 7 <=== Core 40 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6503897 in-flight CPI 1.3312 -- Total Cycles 8658099 ---- Thread 01 ---- PC 5: Stalled ----- 6283102 in-flight CPI 1.3780 -- Total Cycles 8658099 ---- Thread 02 ---- PC 5: Stalled ----- 6712818 in-flight CPI 1.2898 -- Total Cycles 8658099 ---- Thread 03 ---- PC 5: Stalled ----- 6517933 in-flight CPI 1.3283 -- Total Cycles 8658099 ---- Thread 04 ---- PC 5: Stalled ----- 6114888 in-flight CPI 1.4159 -- Total Cycles 8658099 ---- Thread 05 ---- PC 5: Stalled ----- 6094933 in-flight CPI 1.4205 -- Total Cycles 8658099 ---- Thread 06 ---- PC 5: Stalled ----- 6599712 in-flight CPI 1.3119 -- Total Cycles 8658099 ---- Thread 07 ---- PC 5: Stalled ----- 6491936 in-flight CPI 1.3337 -- Total Cycles 8658099 ---- Thread 08 ---- PC 5: Stalled ----- 6700117 in-flight CPI 1.2922 -- Total Cycles 8658099 ---- Thread 09 ---- PC 5: Stalled ----- 6347827 in-flight CPI 1.3639 -- Total Cycles 
8658099 ---- Thread 10 ---- PC 5: Stalled ----- 5772805 in-flight CPI 1.4998 -- Total Cycles 8658099 ---- Thread 11 ---- PC 5: Stalled ----- 5717884 in-flight CPI 1.5142 -- Total Cycles 8658099 ---- Thread 12 ---- PC 5: Stalled ----- 6011379 in-flight CPI 1.4403 -- Total Cycles 8658099 ---- Thread 13 ---- PC 5: Stalled ----- 5967878 in-flight CPI 1.4508 -- Total Cycles 8658099 ---- Thread 14 ---- PC 5: Stalled ----- 6444135 in-flight CPI 1.3436 -- Total Cycles 8658099 ---- Thread 15 ---- PC 5: Stalled ----- 6211072 in-flight CPI 1.3940 -- Total Cycles 8658099 ---- Thread 16 ---- PC 5: Stalled ----- 5681212 in-flight CPI 1.5240 -- Total Cycles 8658099 ---- Thread 17 ---- PC 5: Stalled ----- 5791275 in-flight CPI 1.4950 -- Total Cycles 8658099 ---- Thread 18 ---- PC 5: Stalled ----- 6024714 in-flight CPI 1.4371 -- Total Cycles 8658099 ---- Thread 19 ---- PC 5: Stalled ----- 5714045 in-flight CPI 1.5152 -- Total Cycles 8658099 ---- Thread 20 ---- PC 5: Stalled ----- 5951209 in-flight CPI 1.4548 -- Total Cycles 8658099 ---- Thread 21 ---- PC 5: Stalled ----- 5795929 in-flight CPI 1.4938 -- Total Cycles 8658099 ---- Thread 22 ---- PC 5: Stalled ----- 6090302 in-flight CPI 1.4216 -- Total Cycles 8658099 ---- Thread 23 ---- PC 5: Stalled ----- 6062117 in-flight CPI 1.4282 -- Total Cycles 8658099 ---- Thread 24 ---- PC 5: Stalled ----- 5445164 in-flight CPI 1.5900 -- Total Cycles 8658099 ---- Thread 25 ---- PC 5: Stalled ----- 5900518 in-flight CPI 1.4673 -- Total Cycles 8658099 ---- Thread 26 ---- PC 5: Stalled ----- 6104337 in-flight CPI 1.4183 -- Total Cycles 8658099 ---- Thread 27 ---- PC 5: Stalled ----- 6029084 in-flight CPI 1.4361 -- Total Cycles 8658099 ---- Thread 28 ---- PC 5: Stalled ----- 5965460 in-flight CPI 1.4514 -- Total Cycles 8658099 ---- Thread 29 ---- PC 5: Stalled ----- 5522779 in-flight CPI 1.5677 -- Total Cycles 8658099 ---- Thread 30 ---- PC 5: Stalled ----- 6099480 in-flight CPI 1.4195 -- Total Cycles 8658099 ---- Thread 31 ---- PC 5: Stalled 
----- 5530638 in-flight CPI 1.5655 -- Total Cycles 8658099 Total CPI 0.0446 , IPC 22.4300 -- Total Cycles 8658099 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 435138 (2.017418%) FPSUB: 0 (0.000000%) FPMUL: 1985632 (9.205930%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15319983 (71.027611%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 555664 (2.576210%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 
0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3264593 (15.135542%) DIV: 7571 (0.035101%) FPUN: 0 (0.000000%) FPRSUB: 472 (0.002188%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (212988558 total) ADD%: 8.195 (17455161) SUB%: 0.000 (0) MUL%: 0.000 (205) BITOR%: 1.227 (2613629) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (1164601) FPSUB%: 0.000 (0) FPMUL%: 4.770 (10159559) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.946 (10533411) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40894) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2268477) FPLE%: 0.388 (827159) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27510) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.960 (6304712) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1594992) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) 
RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.756 (33559145) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2621160) ORI%: 1.267 (2698164) XORI%: 0.000 (0) MULI%: 3.359 (7154264) LW%: 1.191 (2535636) LWI%: 13.919 (29646789) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (640983) SWI%: 4.094 (8720060) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3149916) bged%: 0.000 (0) bgeid%: 0.000 (205) bgtd%: 0.000 (0) bgtid%: 0.323 (687142) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85283) bned%: 0.000 (0) bneid%: 13.716 (29213241) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.735 (1564402) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (185727) DIV%: 0.000 (410) FPUN%: 1.183 (2519720) FPRSUB%: 3.714 (7910823) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.104 (6611508) FPGE%: 0.800 (1703241) SYNC%: 0.000 (0) NOP%: 8.821 (18787364) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 167 SUB 0 MUL 26 BITOR 7 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 556 FPSUB 0 FPMUL 5374 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 2359032 INTCONV 0 ATOMIC_INC 3 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 91 FPINV 0 FPCONV 24 FPEQ 0 FPNE 0 FPLT 13 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1828 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2250 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3367543 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 832 ORI 594263 XORI 0 MULI 641837 LW 0 LWI 9424473 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 
beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1766 DIV 16 FPUN 0 FPRSUB 1 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.4300 --Total thread-cycles: 277059168 --total thread-cycles issued: 194201194 (70.093764%) --iCache conflicts: 6558093 (2.367037%) --thread*cycles of FU dependence: 16400526 (5.919503%) --thread*cycles of data dependence: 21569053 (7.784999%) --iCache cycles*banks: 277059168 (76.874767% used) Issue breakdown: --thread*cycles of issue worked: 194201194 (70.093762%) --thread*cycles of issue failed: 64070610 (23.125244%) --thread*cycles of issue NOP/other: 4621584909819882532 (1668085897745.813700%) Number of thread-cycles not ready: 21569053 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 212988558 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 8 4: 7 5: 7 6: 9 7: 8 8: 8 9: 8 10: 7 11: 7 12: 8 13: 7 14: 8 15: 8 16: 7 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 6 25: 9 26: 7 27: 7 28: 9 29: 8 30: 7 31: 6 <=== Core 41 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6067312 in-flight CPI 1.4753 -- Total Cycles 8951206 ---- Thread 01 ---- PC 5: Stalled ----- 5891130 in-flight CPI 1.5194 -- Total Cycles 8951206 ---- Thread 02 ---- PC 5: Stalled ----- 5882565 in-flight CPI 1.5216 -- Total Cycles 8951206 ---- Thread 03 ---- PC 5: Stalled ----- 6117068 in-flight CPI 1.4633 -- Total Cycles 8951206 ---- Thread 04 ---- PC 5: Stalled ----- 7005617 in-flight CPI 1.2777 -- Total Cycles 8951206 ---- Thread 05 ---- PC 5: Stalled ----- 5934011 in-flight CPI 1.5085 -- Total Cycles 8951206 ---- Thread 06 ---- PC 5: Stalled ----- 6466397 in-flight CPI 1.3843 -- Total Cycles 8951206 ---- Thread 07 ---- PC 5: Stalled ----- 6186492 in-flight CPI 1.4469 -- Total Cycles 8951206 ---- Thread 08 ---- PC 5: Stalled ----- 6062732 
in-flight CPI 1.4764 -- Total Cycles 8951206 ---- Thread 09 ---- PC 5: Stalled ----- 6491526 in-flight CPI 1.3789 -- Total Cycles 8951206 ---- Thread 10 ---- PC 5: Stalled ----- 5854139 in-flight CPI 1.5290 -- Total Cycles 8951206 ---- Thread 11 ---- PC 5: Stalled ----- 6135763 in-flight CPI 1.4589 -- Total Cycles 8951206 ---- Thread 12 ---- PC 5: Stalled ----- 5897192 in-flight CPI 1.5179 -- Total Cycles 8951206 ---- Thread 13 ---- PC 5: Stalled ----- 5993063 in-flight CPI 1.4936 -- Total Cycles 8951206 ---- Thread 14 ---- PC 5: Stalled ----- 5943024 in-flight CPI 1.5062 -- Total Cycles 8951206 ---- Thread 15 ---- PC 5: Stalled ----- 6504023 in-flight CPI 1.3763 -- Total Cycles 8951206 ---- Thread 16 ---- PC 5: Stalled ----- 5623773 in-flight CPI 1.5917 -- Total Cycles 8951206 ---- Thread 17 ---- PC 5: Stalled ----- 6783457 in-flight CPI 1.3196 -- Total Cycles 8951206 ---- Thread 18 ---- PC 5: Stalled ----- 6260525 in-flight CPI 1.4298 -- Total Cycles 8951206 ---- Thread 19 ---- PC 5: Stalled ----- 6208468 in-flight CPI 1.4418 -- Total Cycles 8951206 ---- Thread 20 ---- PC 5: Stalled ----- 6499632 in-flight CPI 1.3772 -- Total Cycles 8951206 ---- Thread 21 ---- PC 5: Stalled ----- 6057417 in-flight CPI 1.4777 -- Total Cycles 8951206 ---- Thread 22 ---- PC 5: Stalled ----- 6607980 in-flight CPI 1.3546 -- Total Cycles 8951206 ---- Thread 23 ---- PC 5: Stalled ----- 6373549 in-flight CPI 1.4044 -- Total Cycles 8951206 ---- Thread 24 ---- PC 5: Stalled ----- 5979808 in-flight CPI 1.4969 -- Total Cycles 8951206 ---- Thread 25 ---- PC 5: Stalled ----- 5838104 in-flight CPI 1.5332 -- Total Cycles 8951206 ---- Thread 26 ---- PC 5: Stalled ----- 5583732 in-flight CPI 1.6031 -- Total Cycles 8951206 ---- Thread 27 ---- PC 5: Stalled ----- 5649395 in-flight CPI 1.5844 -- Total Cycles 8951206 ---- Thread 28 ---- PC 5: Stalled ----- 5382649 in-flight CPI 1.6630 -- Total Cycles 8951206 ---- Thread 29 ---- PC 5: Stalled ----- 5606400 in-flight CPI 1.5966 -- Total Cycles 8951206 
---- Thread 30 ---- PC 5: Stalled ----- 6332218 in-flight CPI 1.4136 -- Total Cycles 8951206 ---- Thread 31 ---- PC 5: Stalled ----- 5232243 in-flight CPI 1.7108 -- Total Cycles 8951206 Total CPI 0.0460 , IPC 21.7235 -- Total Cycles 8951206 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 434346 (2.062836%) FPSUB: 0 (0.000000%) FPMUL: 1983094 (9.418292%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14820274 (70.385805%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 554166 (2.631896%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 
(0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3255903 (15.463233%) DIV: 7524 (0.035734%) FPUN: 0 (0.000000%) FPRSUB: 464 (0.002204%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (213244816 total) ADD%: 8.202 (17489952) SUB%: 0.000 (0) MUL%: 0.000 (204) BITOR%: 1.224 (2609255) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.545 (1162667) FPSUB%: 0.000 (0) FPMUL%: 4.764 (10158660) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (612) FPMAX%: 0.000 (612) LOAD%: 4.955 (10566729) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40830) FPINV%: 0.000 (0) FPCONV%: 0.000 (676) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (2265092) FPLE%: 0.389 (828749) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (612) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27450) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.966 
(6325013) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1597381) CMPU%: 0.000 (0) RSUB%: 0.000 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.766 (33620546) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2628227) ORI%: 1.262 (2690107) XORI%: 0.000 (0) MULI%: 3.361 (7168061) LW%: 1.193 (2543734) LWI%: 13.916 (29675890) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (643272) SWI%: 4.100 (8742239) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3159665) bged%: 0.000 (0) bgeid%: 0.000 (204) bgtd%: 0.000 (0) bgtid%: 0.323 (689458) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85791) bned%: 0.000 (0) bneid%: 13.705 (29225205) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.739 (1576602) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (185217) DIV%: 0.000 (408) FPUN%: 1.180 (2516367) FPRSUB%: 3.711 (7912669) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (4) FPGT%: 3.102 (6615037) FPGE%: 0.796 (1698283) SYNC%: 0.000 (0) NOP%: 8.813 (18792800) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 186 SUB 0 MUL 21 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 520 FPSUB 0 FPMUL 5098 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 2324712 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 100 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1963 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2168 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3368066 ADDIC 0 ADDIK 0 ADDIKC 0 
RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 775 ORI 592937 XORI 0 MULI 640003 LW 0 LWI 9431658 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1663 DIV 12 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7236 --Total thread-cycles: 286438592 --total thread-cycles issued: 194452016 (67.886109%) --iCache conflicts: 6536233 (2.281897%) --thread*cycles of FU dependence: 16370325 (5.715125%) --thread*cycles of data dependence: 21055771 (7.350885%) --iCache cycles*banks: 286438592 (74.446968% used) Issue breakdown: --thread*cycles of issue worked: 194452016 (67.886109%) --thread*cycles of issue failed: 73193776 (25.553043%) --thread*cycles of issue NOP/other: 18799867 (6.563315%) Number of thread-cycles not ready: 21055771 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 213244816 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 9 5: 7 6: 8 7: 7 8: 7 9: 7 10: 8 11: 8 12: 7 13: 7 14: 7 15: 9 16: 7 17: 7 18: 7 19: 7 20: 7 21: 7 22: 8 23: 7 24: 8 25: 9 26: 7 27: 8 28: 7 29: 7 30: 8 31: 6 <=== Core 42 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6051924 in-flight CPI 1.5083 -- Total Cycles 9128002 ---- Thread 01 ---- PC 5: Stalled ----- 6355738 in-flight CPI 1.4362 -- Total Cycles 9128002 ---- Thread 02 ---- PC 5: Stalled ----- 6566282 in-flight CPI 1.3901 -- Total Cycles 9128002 ---- Thread 03 ---- PC 5: Stalled ----- 7070457 in-flight CPI 1.2910 -- Total Cycles 9128002 ---- Thread 04 ---- PC 5: Stalled ----- 6710013 in-flight CPI 1.3604 -- Total Cycles 9128002 ---- Thread 05 ---- PC 5: Stalled ----- 6030381 in-flight CPI 1.5137 -- Total Cycles 9128002 ---- Thread 06 ---- PC 5: Stalled ----- 6815374 in-flight CPI 1.3393 -- Total Cycles 9128002 ---- Thread 07 ---- PC 5: 
Stalled ----- 6046068 in-flight CPI 1.5097 -- Total Cycles 9128002 ---- Thread 08 ---- PC 5: Stalled ----- 6544620 in-flight CPI 1.3947 -- Total Cycles 9128002 ---- Thread 09 ---- PC 5: Stalled ----- 5921047 in-flight CPI 1.5416 -- Total Cycles 9128002 ---- Thread 10 ---- PC 5: Stalled ----- 6399557 in-flight CPI 1.4263 -- Total Cycles 9128002 ---- Thread 11 ---- PC 5: Stalled ----- 6480384 in-flight CPI 1.4086 -- Total Cycles 9128002 ---- Thread 12 ---- PC 5: Stalled ----- 6703927 in-flight CPI 1.3616 -- Total Cycles 9128002 ---- Thread 13 ---- PC 5: Stalled ----- 6422025 in-flight CPI 1.4214 -- Total Cycles 9128002 ---- Thread 14 ---- PC 5: Stalled ----- 5675000 in-flight CPI 1.6085 -- Total Cycles 9128002 ---- Thread 15 ---- PC 5: Stalled ----- 6337851 in-flight CPI 1.4402 -- Total Cycles 9128002 ---- Thread 16 ---- PC 5: Stalled ----- 6229500 in-flight CPI 1.4653 -- Total Cycles 9128002 ---- Thread 17 ---- PC 5: Stalled ----- 5779320 in-flight CPI 1.5794 -- Total Cycles 9128002 ---- Thread 18 ---- PC 5: Stalled ----- 5846839 in-flight CPI 1.5612 -- Total Cycles 9128002 ---- Thread 19 ---- PC 5: Stalled ----- 6928908 in-flight CPI 1.3174 -- Total Cycles 9128002 ---- Thread 20 ---- PC 5: Stalled ----- 6648075 in-flight CPI 1.3730 -- Total Cycles 9128002 ---- Thread 21 ---- PC 5: Stalled ----- 6033857 in-flight CPI 1.5128 -- Total Cycles 9128002 ---- Thread 22 ---- PC 5: Stalled ----- 5595873 in-flight CPI 1.6312 -- Total Cycles 9128002 ---- Thread 23 ---- PC 5: Stalled ----- 6028277 in-flight CPI 1.5142 -- Total Cycles 9128002 ---- Thread 24 ---- PC 5: Stalled ----- 5901298 in-flight CPI 1.5468 -- Total Cycles 9128002 ---- Thread 25 ---- PC 5: Stalled ----- 5467963 in-flight CPI 1.6694 -- Total Cycles 9128002 ---- Thread 26 ---- PC 5: Stalled ----- 6126929 in-flight CPI 1.4898 -- Total Cycles 9128002 ---- Thread 27 ---- PC 5: Stalled ----- 5444224 in-flight CPI 1.6766 -- Total Cycles 9128002 ---- Thread 28 ---- PC 5: Stalled ----- 5368283 in-flight CPI 1.7004 -- 
Total Cycles 9128002 ---- Thread 29 ---- PC 5: Stalled ----- 5361998 in-flight CPI 1.7023 -- Total Cycles 9128002 ---- Thread 30 ---- PC 5: Stalled ----- 5915375 in-flight CPI 1.5431 -- Total Cycles 9128002 ---- Thread 31 ---- PC 5: Stalled ----- 5523399 in-flight CPI 1.6526 -- Total Cycles 9128002 Total CPI 0.0465 , IPC 21.5087 -- Total Cycles 9128002 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 443992 (2.095327%) FPSUB: 0 (0.000000%) FPMUL: 2014157 (9.505391%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14843464 (70.050608%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 559473 (2.640315%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) 
RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3320472 (15.670270%) DIV: 7601 (0.035871%) FPUN: 0 (0.000000%) FPRSUB: 470 (0.002218%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215315309 total) ADD%: 8.171 (17594417) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.234 (2656392) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.551 (1186089) FPSUB%: 0.000 (0) FPMUL%: 4.781 (10293914) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.952 (10662681) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41150) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2292694) FPLE%: 0.390 (840568) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 
0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27512) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.959 (6371080) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1615913) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.763 (33940918) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2650528) ORI%: 1.273 (2740992) XORI%: 0.000 (0) MULI%: 3.356 (7225002) LW%: 1.190 (2562240) LWI%: 13.903 (29935421) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (647313) SWI%: 4.091 (8807868) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3183511) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.322 (694269) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86397) bned%: 0.000 (0) bneid%: 13.717 (29534178) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.736 (1585278) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.088 (188986) DIV%: 0.000 (412) FPUN%: 1.189 (2559682) FPRSUB%: 3.716 (8000679) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (0) FPGT%: 3.099 (6672032) FPGE%: 0.803 (1729780) SYNC%: 0.000 (0) NOP%: 8.817 (18983925) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 176 SUB 0 MUL 22 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 523 FPSUB 0 FPMUL 5036 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 2328304 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 99 FPINV 0 FPCONV 8 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1931 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 2 ADDKC 0 BITXOR 0 ANDN 0 CMP 2176 
CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3396371 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 828 ORI 607746 XORI 0 MULI 641728 LW 0 LWI 9518974 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1786 DIV 17 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5087 --Total thread-cycles: 292096064 --total thread-cycles issued: 196331384 (67.214665%) --iCache conflicts: 6600946 (2.259854%) --thread*cycles of FU dependence: 16506168 (5.650938%) --thread*cycles of data dependence: 21189629 (7.254336%) --iCache cycles*banks: 292096064 (73.713880% used) Issue breakdown: --thread*cycles of issue worked: 196331384 (67.214663%) --thread*cycles of issue failed: 76780755 (26.286131%) --thread*cycles of issue NOP/other: 34691791102241781 (11876843058.810194%) Number of thread-cycles not ready: 21189629 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215315309 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 9 3: 9 4: 9 5: 7 6: 8 7: 7 8: 8 9: 8 10: 8 11: 8 12: 9 13: 7 14: 6 15: 7 16: 7 17: 7 18: 7 19: 8 20: 7 21: 7 22: 7 23: 9 24: 7 25: 6 26: 7 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 43 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5864981 in-flight CPI 1.5392 -- Total Cycles 9027277 ---- Thread 01 ---- PC 5: Stalled ----- 6440809 in-flight CPI 1.4016 -- Total Cycles 9027277 ---- Thread 02 ---- PC 5: Stalled ----- 6934568 in-flight CPI 1.3018 -- Total Cycles 9027277 ---- Thread 03 ---- PC 5: Stalled ----- 6689456 in-flight CPI 1.3495 -- Total Cycles 9027277 ---- Thread 04 ---- PC 5: Stalled ----- 6215240 in-flight CPI 1.4524 -- Total Cycles 9027277 ---- Thread 05 ---- PC 5: Stalled ----- 7049719 in-flight CPI 1.2805 -- Total Cycles 
9027277 ---- Thread 06 ---- PC 5: Stalled ----- 6926152 in-flight CPI 1.3034 -- Total Cycles 9027277 ---- Thread 07 ---- PC 5: Stalled ----- 6189522 in-flight CPI 1.4585 -- Total Cycles 9027277 ---- Thread 08 ---- PC 5: Stalled ----- 5790970 in-flight CPI 1.5588 -- Total Cycles 9027277 ---- Thread 09 ---- PC 5: Stalled ----- 5763948 in-flight CPI 1.5662 -- Total Cycles 9027277 ---- Thread 10 ---- PC 5: Stalled ----- 6023567 in-flight CPI 1.4987 -- Total Cycles 9027277 ---- Thread 11 ---- PC 5: Stalled ----- 6904664 in-flight CPI 1.3074 -- Total Cycles 9027277 ---- Thread 12 ---- PC 5: Stalled ----- 5930349 in-flight CPI 1.5222 -- Total Cycles 9027277 ---- Thread 13 ---- PC 5: Stalled ----- 6367595 in-flight CPI 1.4177 -- Total Cycles 9027277 ---- Thread 14 ---- PC 5: Stalled ----- 5895001 in-flight CPI 1.5313 -- Total Cycles 9027277 ---- Thread 15 ---- PC 5: Stalled ----- 6297122 in-flight CPI 1.4336 -- Total Cycles 9027277 ---- Thread 16 ---- PC 5: Stalled ----- 6257470 in-flight CPI 1.4426 -- Total Cycles 9027277 ---- Thread 17 ---- PC 5: Stalled ----- 6259913 in-flight CPI 1.4421 -- Total Cycles 9027277 ---- Thread 18 ---- PC 5: Stalled ----- 5891598 in-flight CPI 1.5322 -- Total Cycles 9027277 ---- Thread 19 ---- PC 5: Stalled ----- 6741557 in-flight CPI 1.3390 -- Total Cycles 9027277 ---- Thread 20 ---- PC 5: Stalled ----- 6639502 in-flight CPI 1.3596 -- Total Cycles 9027277 ---- Thread 21 ---- PC 5: Stalled ----- 6343166 in-flight CPI 1.4231 -- Total Cycles 9027277 ---- Thread 22 ---- PC 5: Stalled ----- 6084593 in-flight CPI 1.4836 -- Total Cycles 9027277 ---- Thread 23 ---- PC 5: Stalled ----- 5485364 in-flight CPI 1.6457 -- Total Cycles 9027277 ---- Thread 24 ---- PC 5: Stalled ----- 5529868 in-flight CPI 1.6325 -- Total Cycles 9027277 ---- Thread 25 ---- PC 5: Stalled ----- 5684016 in-flight CPI 1.5882 -- Total Cycles 9027277 ---- Thread 26 ---- PC 5: Stalled ----- 6109921 in-flight CPI 1.4775 -- Total Cycles 9027277 ---- Thread 27 ---- PC 5: Stalled 
----- 6506616 in-flight CPI 1.3874 -- Total Cycles 9027277 ---- Thread 28 ---- PC 5: Stalled ----- 5954166 in-flight CPI 1.5161 -- Total Cycles 9027277 ---- Thread 29 ---- PC 5: Stalled ----- 5406109 in-flight CPI 1.6698 -- Total Cycles 9027277 ---- Thread 30 ---- PC 5: Stalled ----- 5857690 in-flight CPI 1.5411 -- Total Cycles 9027277 ---- Thread 31 ---- PC 5: Stalled ----- 6152875 in-flight CPI 1.4672 -- Total Cycles 9027277 Total CPI 0.0455 , IPC 21.9544 -- Total Cycles 9027277 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 437897 (2.053870%) FPSUB: 0 (0.000000%) FPMUL: 2014416 (9.448224%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15003626 (70.371569%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 567364 (2.661110%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 
(0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3289242 (15.427545%) DIV: 7564 (0.035477%) FPUN: 0 (0.000000%) FPRSUB: 470 (0.002204%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217349301 total) ADD%: 8.182 (17783652) SUB%: 0.000 (0) MUL%: 0.000 (205) BITOR%: 1.227 (2666682) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (1172423) FPSUB%: 0.000 (0) FPMUL%: 4.753 (10329910) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.949 (10756480) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41657) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (2307698) FPLE%: 0.391 (850602) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) 
TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27660) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.966 (6446297) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1627035) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.771 (34277652) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2679522) ORI%: 1.257 (2732778) XORI%: 0.000 (0) MULI%: 3.365 (7313325) LW%: 1.193 (2592394) LWI%: 13.934 (30286422) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (654293) SWI%: 4.099 (8908829) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3221852) bged%: 0.000 (0) bgeid%: 0.000 (205) bgtd%: 0.000 (0) bgtid%: 0.322 (700725) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (84984) bned%: 0.000 (0) bneid%: 13.716 (29811054) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.740 (1609449) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (187114) DIV%: 0.000 (410) FPUN%: 1.183 (2571060) FPRSUB%: 3.710 (8064319) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.105 (6747738) FPGE%: 0.797 (1731213) SYNC%: 0.000 (0) NOP%: 8.816 (19160599) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 184 SUB 0 MUL 14 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 532 FPSUB 0 FPMUL 5308 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 2379645 INTCONV 0 ATOMIC_INC 11 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 102 FPINV 0 FPCONV 6 FPEQ 0 FPNE 0 FPLT 7 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2189 LOADIMM 0 SPHERE_TEST 0 
TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 2 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2136 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3434041 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 809 ORI 597159 XORI 0 MULI 647152 LW 0 LWI 9625323 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1710 DIV 22 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.9544 --Total thread-cycles: 288872864 --total thread-cycles issued: 198188702 (68.607588%) --iCache conflicts: 6615252 (2.290022%) --thread*cycles of FU dependence: 16696784 (5.779977%) --thread*cycles of data dependence: 21320579 (7.380610%) --iCache cycles*banks: 288872864 (75.240481% used) Issue breakdown: --thread*cycles of issue worked: 198188702 (68.607587%) --thread*cycles of issue failed: 71523563 (24.759530%) --thread*cycles of issue NOP/other: 19160599 (6.632883%) Number of thread-cycles not ready: 21320579 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217349301 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 8 4: 7 5: 8 6: 8 7: 7 8: 8 9: 7 10: 7 11: 8 12: 7 13: 8 14: 9 15: 9 16: 7 17: 9 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 6 30: 7 31: 7 <=== Core 44 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6043176 in-flight CPI 1.5040 -- Total Cycles 9088948 ---- Thread 01 ---- PC 5: Stalled ----- 6690359 in-flight CPI 1.3585 -- Total Cycles 9088948 ---- Thread 02 ---- PC 5: Stalled ----- 6714429 in-flight CPI 1.3536 -- Total Cycles 9088948 ---- Thread 03 ---- PC 5: Stalled ----- 5988092 in-flight CPI 1.5178 -- Total Cycles 9088948 ---- Thread 04 ---- PC 5: Stalled ----- 6620211 in-flight CPI 
1.3729 -- Total Cycles 9088948 ---- Thread 05 ---- PC 5: Stalled ----- 6575788 in-flight CPI 1.3822 -- Total Cycles 9088948 ---- Thread 06 ---- PC 5: Stalled ----- 5802619 in-flight CPI 1.5663 -- Total Cycles 9088948 ---- Thread 07 ---- PC 5: Stalled ----- 6627089 in-flight CPI 1.3715 -- Total Cycles 9088948 ---- Thread 08 ---- PC 5: Stalled ----- 6116994 in-flight CPI 1.4858 -- Total Cycles 9088948 ---- Thread 09 ---- PC 5: Stalled ----- 6070851 in-flight CPI 1.4971 -- Total Cycles 9088948 ---- Thread 10 ---- PC 5: Stalled ----- 6402371 in-flight CPI 1.4196 -- Total Cycles 9088948 ---- Thread 11 ---- PC 5: Stalled ----- 5975258 in-flight CPI 1.5211 -- Total Cycles 9088948 ---- Thread 12 ---- PC 5: Stalled ----- 5820477 in-flight CPI 1.5615 -- Total Cycles 9088948 ---- Thread 13 ---- PC 5: Stalled ----- 5786630 in-flight CPI 1.5707 -- Total Cycles 9088948 ---- Thread 14 ---- PC 5: Stalled ----- 6104742 in-flight CPI 1.4888 -- Total Cycles 9088948 ---- Thread 15 ---- PC 5: Stalled ----- 6377957 in-flight CPI 1.4251 -- Total Cycles 9088948 ---- Thread 16 ---- PC 5: Stalled ----- 6287198 in-flight CPI 1.4456 -- Total Cycles 9088948 ---- Thread 17 ---- PC 5: Stalled ----- 6424459 in-flight CPI 1.4147 -- Total Cycles 9088948 ---- Thread 18 ---- PC 5: Stalled ----- 6927830 in-flight CPI 1.3119 -- Total Cycles 9088948 ---- Thread 19 ---- PC 5: Stalled ----- 5640872 in-flight CPI 1.6113 -- Total Cycles 9088948 ---- Thread 20 ---- PC 5: Stalled ----- 6259909 in-flight CPI 1.4519 -- Total Cycles 9088948 ---- Thread 21 ---- PC 5: Stalled ----- 5915865 in-flight CPI 1.5364 -- Total Cycles 9088948 ---- Thread 22 ---- PC 5: Stalled ----- 5596482 in-flight CPI 1.6240 -- Total Cycles 9088948 ---- Thread 23 ---- PC 5: Stalled ----- 6572572 in-flight CPI 1.3829 -- Total Cycles 9088948 ---- Thread 24 ---- PC 5: Stalled ----- 5444785 in-flight CPI 1.6693 -- Total Cycles 9088948 ---- Thread 25 ---- PC 5: Stalled ----- 6528342 in-flight CPI 1.3922 -- Total Cycles 9088948 ---- Thread 26 
---- PC 5: Stalled ----- 5782841 in-flight CPI 1.5717 -- Total Cycles 9088948 ---- Thread 27 ---- PC 5: Stalled ----- 6404425 in-flight CPI 1.4192 -- Total Cycles 9088948 ---- Thread 28 ---- PC 5: Stalled ----- 6018681 in-flight CPI 1.5101 -- Total Cycles 9088948 ---- Thread 29 ---- PC 5: Stalled ----- 5914595 in-flight CPI 1.5367 -- Total Cycles 9088948 ---- Thread 30 ---- PC 5: Stalled ----- 6222007 in-flight CPI 1.4608 -- Total Cycles 9088948 ---- Thread 31 ---- PC 5: Stalled ----- 5412436 in-flight CPI 1.6793 -- Total Cycles 9088948 Total CPI 0.0461 , IPC 21.6825 -- Total Cycles 9088948 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 437982 (2.061822%) FPSUB: 0 (0.000000%) FPMUL: 2007744 (9.451554%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14943456 (70.347060%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 562993 (2.650317%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 
(0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3282233 (15.451275%) DIV: 7600 (0.035777%) FPUN: 0 (0.000000%) FPRSUB: 466 (0.002194%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216124815 total) ADD%: 8.192 (17703999) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.224 (2644792) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (1172672) FPSUB%: 0.000 (0) FPMUL%: 4.761 (10290718) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.948 (10693392) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41338) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2297074) FPLE%: 0.390 (842183) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 
(0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27642) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.965 (6408108) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1616873) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.764 (34069418) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2662748) ORI%: 1.261 (2724617) XORI%: 0.000 (0) MULI%: 3.363 (7268110) LW%: 1.192 (2577052) LWI%: 13.934 (30115449) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (650608) SWI%: 4.099 (8858552) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3202574) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.322 (696618) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85924) bned%: 0.000 (0) bneid%: 13.710 (29631476) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.738 (1594544) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (186731) DIV%: 0.000 (412) FPUN%: 1.180 (2550666) FPRSUB%: 3.713 (8024640) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.104 (6709325) FPGE%: 0.795 (1719214) SYNC%: 0.000 (0) NOP%: 8.816 (19053855) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 175 SUB 0 MUL 18 BITOR 2 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 519 FPSUB 0 FPMUL 5345 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 2347892 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 110 FPINV 0 
FPCONV 13 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1716 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2267 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3416716 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 788 ORI 597988 XORI 0 MULI 643946 LW 0 LWI 9570275 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1747 DIV 21 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6825 --Total thread-cycles: 290846336 --total thread-cycles issued: 197070960 (67.757759%) --iCache conflicts: 6588934 (2.265435%) --thread*cycles of FU dependence: 16589967 (5.704032%) --thread*cycles of data dependence: 21242474 (7.303676%) --iCache cycles*banks: 290846336 (74.308946% used) Issue breakdown: --thread*cycles of issue worked: 197070960 (67.757759%) --thread*cycles of issue failed: 74721521 (25.691065%) --thread*cycles of issue NOP/other: 19053855 (6.551176%) Number of thread-cycles not ready: 21242474 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216124815 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 7 4: 7 5: 7 6: 8 7: 8 8: 8 9: 7 10: 7 11: 7 12: 7 13: 7 14: 8 15: 8 16: 7 17: 7 18: 9 19: 7 20: 7 21: 8 22: 8 23: 9 24: 7 25: 7 26: 7 27: 7 28: 7 29: 8 30: 7 31: 6 <=== Core 45 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6378263 in-flight CPI 1.3944 -- Total Cycles 8894098 ---- Thread 01 ---- PC 5: Stalled ----- 6216146 in-flight CPI 1.4308 -- Total Cycles 8894098 ---- Thread 02 ---- PC 5: Stalled ----- 6187144 in-flight CPI 1.4375 -- Total Cycles 8894098 ---- Thread 03 ---- PC 5: Stalled ----- 6566359 
in-flight CPI 1.3545 -- Total Cycles 8894098 ---- Thread 04 ---- PC 5: Stalled ----- 6831566 in-flight CPI 1.3019 -- Total Cycles 8894098 ---- Thread 05 ---- PC 5: Stalled ----- 6360589 in-flight CPI 1.3983 -- Total Cycles 8894098 ---- Thread 06 ---- PC 5: Stalled ----- 6808079 in-flight CPI 1.3064 -- Total Cycles 8894098 ---- Thread 07 ---- PC 5: Stalled ----- 6179366 in-flight CPI 1.4393 -- Total Cycles 8894098 ---- Thread 08 ---- PC 5: Stalled ----- 6611815 in-flight CPI 1.3452 -- Total Cycles 8894098 ---- Thread 09 ---- PC 5: Stalled ----- 6896536 in-flight CPI 1.2896 -- Total Cycles 8894098 ---- Thread 10 ---- PC 5: Stalled ----- 5995136 in-flight CPI 1.4835 -- Total Cycles 8894098 ---- Thread 11 ---- PC 5: Stalled ----- 6017819 in-flight CPI 1.4780 -- Total Cycles 8894098 ---- Thread 12 ---- PC 5: Stalled ----- 5987291 in-flight CPI 1.4855 -- Total Cycles 8894098 ---- Thread 13 ---- PC 5: Stalled ----- 6098338 in-flight CPI 1.4584 -- Total Cycles 8894098 ---- Thread 14 ---- PC 5: Stalled ----- 5831883 in-flight CPI 1.5251 -- Total Cycles 8894098 ---- Thread 15 ---- PC 5: Stalled ----- 5929004 in-flight CPI 1.5001 -- Total Cycles 8894098 ---- Thread 16 ---- PC 5: Stalled ----- 5894092 in-flight CPI 1.5090 -- Total Cycles 8894098 ---- Thread 17 ---- PC 5: Stalled ----- 5981197 in-flight CPI 1.4870 -- Total Cycles 8894098 ---- Thread 18 ---- PC 5: Stalled ----- 5880703 in-flight CPI 1.5124 -- Total Cycles 8894098 ---- Thread 19 ---- PC 5: Stalled ----- 6017636 in-flight CPI 1.4780 -- Total Cycles 8894098 ---- Thread 20 ---- PC 5: Stalled ----- 6086149 in-flight CPI 1.4614 -- Total Cycles 8894098 ---- Thread 21 ---- PC 5: Stalled ----- 5908782 in-flight CPI 1.5052 -- Total Cycles 8894098 ---- Thread 22 ---- PC 5: Stalled ----- 6465823 in-flight CPI 1.3756 -- Total Cycles 8894098 ---- Thread 23 ---- PC 5: Stalled ----- 6175701 in-flight CPI 1.4402 -- Total Cycles 8894098 ---- Thread 24 ---- PC 5: Stalled ----- 5988286 in-flight CPI 1.4852 -- Total Cycles 8894098 
---- Thread 25 ---- PC 5: Stalled ----- 5476218 in-flight CPI 1.6241 -- Total Cycles 8894098 ---- Thread 26 ---- PC 5: Stalled ----- 5636864 in-flight CPI 1.5778 -- Total Cycles 8894098 ---- Thread 27 ---- PC 5: Stalled ----- 5481366 in-flight CPI 1.6226 -- Total Cycles 8894098 ---- Thread 28 ---- PC 5: Stalled ----- 6454621 in-flight CPI 1.3779 -- Total Cycles 8894098 ---- Thread 29 ---- PC 5: Stalled ----- 6061019 in-flight CPI 1.4674 -- Total Cycles 8894098 ---- Thread 30 ---- PC 5: Stalled ----- 5749353 in-flight CPI 1.5470 -- Total Cycles 8894098 ---- Thread 31 ---- PC 5: Stalled ----- 6345543 in-flight CPI 1.4016 -- Total Cycles 8894098 Total CPI 0.0453 , IPC 22.0932 -- Total Cycles 8894098 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 435666 (2.017507%) FPSUB: 0 (0.000000%) FPMUL: 1999156 (9.257805%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15305578 (70.877940%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 565555 (2.619004%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) 
BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3280287 (15.190539%) DIV: 7567 (0.035042%) FPUN: 0 (0.000000%) FPRSUB: 467 (0.002163%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215510753 total) ADD%: 8.186 (17642701) SUB%: 0.000 (0) MUL%: 0.000 (205) BITOR%: 1.227 (2643468) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (1169701) FPSUB%: 0.000 (0) FPMUL%: 4.755 (10247060) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.952 (10671745) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41701) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 
0.000 (0) FPLT%: 1.062 (2289600) FPLE%: 0.395 (851091) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27970) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.963 (6384799) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1609373) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.772 (33990104) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2651706) ORI%: 1.257 (2709300) XORI%: 0.000 (0) MULI%: 3.361 (7243209) LW%: 1.192 (2567930) LWI%: 13.923 (30005237) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (649816) SWI%: 4.099 (8833219) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3189089) bged%: 0.000 (0) bgeid%: 0.000 (205) bgtd%: 0.000 (0) bgtid%: 0.323 (696123) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86562) bned%: 0.000 (0) bneid%: 13.714 (29556084) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.744 (1604435) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (186517) DIV%: 0.000 (410) FPUN%: 1.184 (2551100) FPRSUB%: 3.708 (7992193) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.105 (6692666) FPGE%: 0.794 (1710919) SYNC%: 0.000 (0) NOP%: 8.822 (19011451) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 163 SUB 0 MUL 15 BITOR 6 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 563 FPSUB 0 FPMUL 5378 FPCMPLT 0 FPMIN 0 FPMAX 402 LOAD 2350823 
INTCONV 0 ATOMIC_INC 3 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 96 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1932 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2246 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3404019 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 782 ORI 594942 XORI 0 MULI 647561 LW 0 LWI 9536678 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1734 DIV 13 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.0932 --Total thread-cycles: 284611136 --total thread-cycles issued: 196499302 (69.041324%) --iCache conflicts: 6616302 (2.324681%) --thread*cycles of FU dependence: 16547380 (5.814031%) --thread*cycles of data dependence: 21594276 (7.587291%) --iCache cycles*banks: 284611136 (75.721136% used) Issue breakdown: --thread*cycles of issue worked: 196499302 (69.041326%) --thread*cycles of issue failed: 69100383 (24.278875%) --thread*cycles of issue NOP/other: 19011451 (6.679799%) Number of thread-cycles not ready: 21594276 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215510753 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 8 4: 8 5: 8 6: 8 7: 7 8: 8 9: 8 10: 8 11: 7 12: 7 13: 7 14: 7 15: 6 16: 8 17: 7 18: 7 19: 8 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 9 29: 7 30: 8 31: 7 <=== Core 46 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5833636 in-flight CPI 1.5628 -- Total Cycles 9117056 ---- Thread 01 ---- PC 5: Stalled ----- 6649156 in-flight CPI 1.3712 -- Total Cycles 9117056 ---- Thread 02 ---- PC 5: Stalled 
----- 5910777 in-flight CPI 1.5424 -- Total Cycles 9117056 ---- Thread 03 ---- PC 5: Stalled ----- 6543344 in-flight CPI 1.3933 -- Total Cycles 9117056 ---- Thread 04 ---- PC 5: Stalled ----- 6232423 in-flight CPI 1.4628 -- Total Cycles 9117056 ---- Thread 05 ---- PC 5: Stalled ----- 6333100 in-flight CPI 1.4396 -- Total Cycles 9117056 ---- Thread 06 ---- PC 5: Stalled ----- 6628924 in-flight CPI 1.3753 -- Total Cycles 9117056 ---- Thread 07 ---- PC 5: Stalled ----- 6311801 in-flight CPI 1.4444 -- Total Cycles 9117056 ---- Thread 08 ---- PC 5: Stalled ----- 5794723 in-flight CPI 1.5733 -- Total Cycles 9117056 ---- Thread 09 ---- PC 5: Stalled ----- 5861534 in-flight CPI 1.5554 -- Total Cycles 9117056 ---- Thread 10 ---- PC 5: Stalled ----- 5818038 in-flight CPI 1.5670 -- Total Cycles 9117056 ---- Thread 11 ---- PC 5: Stalled ----- 6297032 in-flight CPI 1.4478 -- Total Cycles 9117056 ---- Thread 12 ---- PC 5: Stalled ----- 6083308 in-flight CPI 1.4987 -- Total Cycles 9117056 ---- Thread 13 ---- PC 5: Stalled ----- 6039651 in-flight CPI 1.5095 -- Total Cycles 9117056 ---- Thread 14 ---- PC 5: Stalled ----- 6321463 in-flight CPI 1.4422 -- Total Cycles 9117056 ---- Thread 15 ---- PC 5: Stalled ----- 6379250 in-flight CPI 1.4292 -- Total Cycles 9117056 ---- Thread 16 ---- PC 5: Stalled ----- 6731927 in-flight CPI 1.3543 -- Total Cycles 9117056 ---- Thread 17 ---- PC 5: Stalled ----- 5707829 in-flight CPI 1.5973 -- Total Cycles 9117056 ---- Thread 18 ---- PC 5: Stalled ----- 5839204 in-flight CPI 1.5613 -- Total Cycles 9117056 ---- Thread 19 ---- PC 5: Stalled ----- 6104715 in-flight CPI 1.4934 -- Total Cycles 9117056 ---- Thread 20 ---- PC 5: Stalled ----- 5714544 in-flight CPI 1.5954 -- Total Cycles 9117056 ---- Thread 21 ---- PC 5: Stalled ----- 6510068 in-flight CPI 1.4005 -- Total Cycles 9117056 ---- Thread 22 ---- PC 5: Stalled ----- 6819260 in-flight CPI 1.3370 -- Total Cycles 9117056 ---- Thread 23 ---- PC 5: Stalled ----- 6230210 in-flight CPI 1.4634 -- Total 
Cycles 9117056 ---- Thread 24 ---- PC 5: Stalled ----- 6334807 in-flight CPI 1.4392 -- Total Cycles 9117056 ---- Thread 25 ---- PC 5: Stalled ----- 5897979 in-flight CPI 1.5458 -- Total Cycles 9117056 ---- Thread 26 ---- PC 5: Stalled ----- 6207455 in-flight CPI 1.4687 -- Total Cycles 9117056 ---- Thread 27 ---- PC 5: Stalled ----- 6075076 in-flight CPI 1.5007 -- Total Cycles 9117056 ---- Thread 28 ---- PC 5: Stalled ----- 5997643 in-flight CPI 1.5201 -- Total Cycles 9117056 ---- Thread 29 ---- PC 5: Stalled ----- 6434001 in-flight CPI 1.4170 -- Total Cycles 9117056 ---- Thread 30 ---- PC 5: Stalled ----- 5308018 in-flight CPI 1.7176 -- Total Cycles 9117056 ---- Thread 31 ---- PC 5: Stalled ----- 5953453 in-flight CPI 1.5314 -- Total Cycles 9117056 Total CPI 0.0463 , IPC 21.5974 -- Total Cycles 9117056 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 439807 (2.057854%) FPSUB: 0 (0.000000%) FPMUL: 2009480 (9.402343%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15050053 (70.419093%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 563530 (2.636753%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 
(0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3301167 (15.446137%) DIV: 7609 (0.035602%) FPUN: 0 (0.000000%) FPRSUB: 474 (0.002218%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215951562 total) ADD%: 8.187 (17679856) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.226 (2647309) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.546 (1178892) FPSUB%: 0.000 (0) FPMUL%: 4.767 (10293904) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.953 (10697023) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) 
ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41558) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2294626) FPLE%: 0.390 (841499) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27886) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.964 (6399882) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1615540) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (34049367) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2658313) ORI%: 1.263 (2726927) XORI%: 0.000 (0) MULI%: 3.360 (7255857) LW%: 1.192 (2573944) LWI%: 13.913 (30044694) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (651265) SWI%: 4.095 (8843777) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3196641) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.323 (698243) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (87939) bned%: 0.000 (0) bneid%: 13.712 (29611204) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.740 (1598855) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (187749) DIV%: 0.000 (412) FPUN%: 1.183 (2554371) FPRSUB%: 3.712 (8016424) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.104 (6703795) FPGE%: 0.798 (1723725) SYNC%: 0.000 (0) NOP%: 8.820 (19046595) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 174 SUB 0 MUL 34 
BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 539 FPSUB 0 FPMUL 5322 FPCMPLT 0 FPMIN 0 FPMAX 402 LOAD 2345154 INTCONV 0 ATOMIC_INC 9 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 102 FPINV 0 FPCONV 16 FPEQ 0 FPNE 0 FPLT 11 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1889 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 2 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2260 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3411371 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 843 ORI 600721 XORI 0 MULI 646427 LW 0 LWI 9554894 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1755 DIV 18 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5974 --Total thread-cycles: 291745792 --total thread-cycles issued: 196904967 (67.491962%) --iCache conflicts: 6610720 (2.265918%) --thread*cycles of FU dependence: 16571960 (5.680274%) --thread*cycles of data dependence: 21372120 (7.325597%) --iCache cycles*banks: 291745792 (74.020466% used) Issue breakdown: --thread*cycles of issue worked: 196904967 (67.491965%) --thread*cycles of issue failed: 75794230 (25.979545%) --thread*cycles of issue NOP/other: 19046595 (6.528490%) Number of thread-cycles not ready: 21372120 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215951562 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 8 4: 8 5: 7 6: 9 7: 8 8: 7 9: 7 10: 8 11: 7 12: 7 13: 8 14: 7 15: 8 16: 8 17: 7 18: 8 19: 8 20: 7 21: 8 22: 7 23: 7 24: 8 25: 7 26: 7 27: 7 28: 8 29: 7 30: 6 31: 7 <=== Core 47 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6828113 in-flight CPI 1.3101 -- Total Cycles 8945769 ---- Thread 01 
---- PC 5: Stalled ----- 6694907 in-flight CPI 1.3362 -- Total Cycles 8945769 ---- Thread 02 ---- PC 5: Stalled ----- 6034926 in-flight CPI 1.4823 -- Total Cycles 8945769 ---- Thread 03 ---- PC 5: Stalled ----- 6910890 in-flight CPI 1.2944 -- Total Cycles 8945769 ---- Thread 04 ---- PC 5: Stalled ----- 5945930 in-flight CPI 1.5045 -- Total Cycles 8945769 ---- Thread 05 ---- PC 5: Stalled ----- 5855237 in-flight CPI 1.5278 -- Total Cycles 8945769 ---- Thread 06 ---- PC 5: Stalled ----- 6870269 in-flight CPI 1.3021 -- Total Cycles 8945769 ---- Thread 07 ---- PC 5: Stalled ----- 6494718 in-flight CPI 1.3774 -- Total Cycles 8945769 ---- Thread 08 ---- PC 5: Stalled ----- 5990038 in-flight CPI 1.4934 -- Total Cycles 8945769 ---- Thread 09 ---- PC 5: Stalled ----- 6420644 in-flight CPI 1.3933 -- Total Cycles 8945769 ---- Thread 10 ---- PC 5: Stalled ----- 5800540 in-flight CPI 1.5422 -- Total Cycles 8945769 ---- Thread 11 ---- PC 5: Stalled ----- 6632328 in-flight CPI 1.3488 -- Total Cycles 8945769 ---- Thread 12 ---- PC 5: Stalled ----- 5830662 in-flight CPI 1.5343 -- Total Cycles 8945769 ---- Thread 13 ---- PC 5: Stalled ----- 6167541 in-flight CPI 1.4505 -- Total Cycles 8945769 ---- Thread 14 ---- PC 5: Stalled ----- 6087818 in-flight CPI 1.4695 -- Total Cycles 8945769 ---- Thread 15 ---- PC 5: Stalled ----- 6580355 in-flight CPI 1.3595 -- Total Cycles 8945769 ---- Thread 16 ---- PC 5: Stalled ----- 6152341 in-flight CPI 1.4540 -- Total Cycles 8945769 ---- Thread 17 ---- PC 5: Stalled ----- 5859350 in-flight CPI 1.5267 -- Total Cycles 8945769 ---- Thread 18 ---- PC 5: Stalled ----- 5645907 in-flight CPI 1.5845 -- Total Cycles 8945769 ---- Thread 19 ---- PC 5: Stalled ----- 6180328 in-flight CPI 1.4475 -- Total Cycles 8945769 ---- Thread 20 ---- PC 5: Stalled ----- 6206370 in-flight CPI 1.4414 -- Total Cycles 8945769 ---- Thread 21 ---- PC 5: Stalled ----- 5588844 in-flight CPI 1.6006 -- Total Cycles 8945769 ---- Thread 22 ---- PC 5: Stalled ----- 6700682 in-flight CPI 
1.3350 -- Total Cycles 8945769 ---- Thread 23 ---- PC 5: Stalled ----- 6387485 in-flight CPI 1.4005 -- Total Cycles 8945769 ---- Thread 24 ---- PC 5: Stalled ----- 5877209 in-flight CPI 1.5221 -- Total Cycles 8945769 ---- Thread 25 ---- PC 5: Stalled ----- 6196889 in-flight CPI 1.4436 -- Total Cycles 8945769 ---- Thread 26 ---- PC 5: Stalled ----- 6096287 in-flight CPI 1.4674 -- Total Cycles 8945769 ---- Thread 27 ---- PC 5: Stalled ----- 5389903 in-flight CPI 1.6597 -- Total Cycles 8945769 ---- Thread 28 ---- PC 5: Stalled ----- 5478156 in-flight CPI 1.6330 -- Total Cycles 8945769 ---- Thread 29 ---- PC 5: Stalled ----- 5332491 in-flight CPI 1.6776 -- Total Cycles 8945769 ---- Thread 30 ---- PC 5: Stalled ----- 5765297 in-flight CPI 1.5517 -- Total Cycles 8945769 ---- Thread 31 ---- PC 5: Stalled ----- 5597787 in-flight CPI 1.5981 -- Total Cycles 8945769 Total CPI 0.0457 , IPC 21.8652 -- Total Cycles 8945769 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 441125 (2.037357%) FPSUB: 0 (0.000000%) FPMUL: 2003640 (9.253909%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15338873 (70.843329%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 554700 (2.561909%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) 
BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3305300 (15.265688%) DIV: 7713 (0.035623%) FPUN: 0 (0.000000%) FPRSUB: 473 (0.002185%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (214522696 total) ADD%: 8.194 (17578170) SUB%: 0.000 (0) MUL%: 0.000 (209) BITOR%: 1.223 (2622610) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (1178277) FPSUB%: 0.000 (0) FPMUL%: 4.778 (10249697) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (627) FPMAX%: 0.000 (627) LOAD%: 4.955 (10630095) INTCONV%: 0.000 (0) ATOMIC_INC%: 
0.000 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40928) FPINV%: 0.000 (0) FPCONV%: 0.000 (691) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2284741) FPLE%: 0.392 (841107) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (627) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27762) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.959 (6347513) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1606818) CMPU%: 0.000 (0) RSUB%: 0.000 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.763 (33815187) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2639332) ORI%: 1.262 (2708166) XORI%: 0.000 (0) MULI%: 3.357 (7200547) LW%: 1.190 (2552862) LWI%: 13.916 (29851907) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (646466) SWI%: 4.095 (8785734) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.478 (3169880) bged%: 0.000 (0) bgeid%: 0.000 (209) bgtd%: 0.000 (0) bgtid%: 0.323 (692991) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86312) bned%: 0.000 (0) bneid%: 13.707 (29404454) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.741 (1588548) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.088 (188064) DIV%: 0.000 (418) FPUN%: 1.178 (2527694) FPRSUB%: 3.717 (7972948) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.105 (6660769) FPGE%: 0.791 (1697333) SYNC%: 0.000 (0) NOP%: 8.820 (18921827) HALT%: 0.000 (0) PRINT%: 
0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 167 SUB 0 MUL 8 BITOR 3 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 548 FPSUB 0 FPMUL 5341 FPCMPLT 0 FPMIN 0 FPMAX 408 LOAD 2356416 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 116 FPINV 0 FPCONV 19 FPEQ 0 FPNE 0 FPLT 7 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2020 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 2 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2313 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3390129 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 707 ORI 602764 XORI 0 MULI 632597 LW 0 LWI 9492384 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1706 DIV 16 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8652 --Total thread-cycles: 286264608 --total thread-cycles issued: 195600869 (68.328693%) --iCache conflicts: 6525127 (2.279404%) --thread*cycles of FU dependence: 16487690 (5.759598%) --thread*cycles of data dependence: 21651824 (7.563570%) --iCache cycles*banks: 286264608 (74.938613% used) Issue breakdown: --thread*cycles of issue worked: 195600869 (68.328694%) --thread*cycles of issue failed: 71741912 (25.061398%) --thread*cycles of issue NOP/other: 18921827 (6.609908%) Number of thread-cycles not ready: 21651824 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214522696 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 10 1: 7 2: 7 3: 8 4: 7 5: 8 6: 8 7: 7 8: 7 9: 9 10: 9 11: 8 12: 7 13: 9 14: 7 15: 8 16: 8 17: 7 18: 7 19: 8 20: 7 21: 7 22: 9 23: 8 24: 7 25: 7 26: 7 27: 6 28: 7 29: 7 30: 7 31: 6 <=== Core 48 ===> ---- 
Thread 00 ---- PC 5: Stalled ----- 6654730 in-flight CPI 1.3537 -- Total Cycles 9008737 ---- Thread 01 ---- PC 5: Stalled ----- 5924109 in-flight CPI 1.5207 -- Total Cycles 9008737 ---- Thread 02 ---- PC 5: Stalled ----- 6490421 in-flight CPI 1.3880 -- Total Cycles 9008737 ---- Thread 03 ---- PC 5: Stalled ----- 6480154 in-flight CPI 1.3902 -- Total Cycles 9008737 ---- Thread 04 ---- PC 5: Stalled ----- 6912787 in-flight CPI 1.3032 -- Total Cycles 9008737 ---- Thread 05 ---- PC 5: Stalled ----- 6105297 in-flight CPI 1.4756 -- Total Cycles 9008737 ---- Thread 06 ---- PC 5: Stalled ----- 6409256 in-flight CPI 1.4056 -- Total Cycles 9008737 ---- Thread 07 ---- PC 5: Stalled ----- 6821346 in-flight CPI 1.3207 -- Total Cycles 9008737 ---- Thread 08 ---- PC 5: Stalled ----- 6128900 in-flight CPI 1.4699 -- Total Cycles 9008737 ---- Thread 09 ---- PC 5: Stalled ----- 6593447 in-flight CPI 1.3663 -- Total Cycles 9008737 ---- Thread 10 ---- PC 5: Stalled ----- 6454556 in-flight CPI 1.3957 -- Total Cycles 9008737 ---- Thread 11 ---- PC 5: Stalled ----- 6480910 in-flight CPI 1.3900 -- Total Cycles 9008737 ---- Thread 12 ---- PC 5: Stalled ----- 6658117 in-flight CPI 1.3530 -- Total Cycles 9008737 ---- Thread 13 ---- PC 5: Stalled ----- 6276564 in-flight CPI 1.4353 -- Total Cycles 9008737 ---- Thread 14 ---- PC 5: Stalled ----- 6894709 in-flight CPI 1.3066 -- Total Cycles 9008737 ---- Thread 15 ---- PC 5: Stalled ----- 6882165 in-flight CPI 1.3090 -- Total Cycles 9008737 ---- Thread 16 ---- PC 5: Stalled ----- 5837043 in-flight CPI 1.5434 -- Total Cycles 9008737 ---- Thread 17 ---- PC 5: Stalled ----- 6587500 in-flight CPI 1.3675 -- Total Cycles 9008737 ---- Thread 18 ---- PC 5: Stalled ----- 6072595 in-flight CPI 1.4835 -- Total Cycles 9008737 ---- Thread 19 ---- PC 5: Stalled ----- 6493601 in-flight CPI 1.3873 -- Total Cycles 9008737 ---- Thread 20 ---- PC 5: Stalled ----- 6405671 in-flight CPI 1.4064 -- Total Cycles 9008737 ---- Thread 21 ---- PC 5: Stalled ----- 5984435 
in-flight CPI 1.5054 -- Total Cycles 9008737 ---- Thread 22 ---- PC 5: Stalled ----- 5607335 in-flight CPI 1.6066 -- Total Cycles 9008737 ---- Thread 23 ---- PC 5: Stalled ----- 5826966 in-flight CPI 1.5460 -- Total Cycles 9008737 ---- Thread 24 ---- PC 5: Stalled ----- 6272376 in-flight CPI 1.4363 -- Total Cycles 9008737 ---- Thread 25 ---- PC 5: Stalled ----- 6281418 in-flight CPI 1.4342 -- Total Cycles 9008737 ---- Thread 26 ---- PC 5: Stalled ----- 6469993 in-flight CPI 1.3924 -- Total Cycles 9008737 ---- Thread 27 ---- PC 5: Stalled ----- 6494558 in-flight CPI 1.3871 -- Total Cycles 9008737 ---- Thread 28 ---- PC 5: Stalled ----- 5955575 in-flight CPI 1.5127 -- Total Cycles 9008737 ---- Thread 29 ---- PC 5: Stalled ----- 5560473 in-flight CPI 1.6201 -- Total Cycles 9008737 ---- Thread 30 ---- PC 5: Stalled ----- 5296102 in-flight CPI 1.7010 -- Total Cycles 9008737 ---- Thread 31 ---- PC 5: Stalled ----- 6190654 in-flight CPI 1.4552 -- Total Cycles 9008737 Total CPI 0.0447 , IPC 22.3677 -- Total Cycles 9008737 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 452776 (2.059522%) FPSUB: 0 (0.000000%) FPMUL: 2062653 (9.382298%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15492136 (70.468387%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 575772 (2.618988%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 
0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3393033 (15.433738%) DIV: 7675 (0.034911%) FPUN: 0 (0.000000%) FPRSUB: 474 (0.002156%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (220989044 total) ADD%: 8.179 (18075743) SUB%: 0.000 (0) MUL%: 0.000 (208) BITOR%: 1.227 (2710774) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (1212189) FPSUB%: 0.000 (0) FPMUL%: 4.774 (10549038) FPCMPLT%: 
0.000 (0) FPMIN%: 0.000 (624) FPMAX%: 0.000 (624) LOAD%: 4.953 (10946305) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (42320) FPINV%: 0.000 (0) FPCONV%: 0.000 (688) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2353438) FPLE%: 0.389 (860414) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (624) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28332) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.962 (6546599) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1658013) CMPU%: 0.000 (0) RSUB%: 0.000 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.764 (34835834) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2721722) ORI%: 1.268 (2801324) XORI%: 0.000 (0) MULI%: 3.359 (7423194) LW%: 1.191 (2632758) LWI%: 13.917 (30756149) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (664952) SWI%: 4.096 (9051296) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3271409) bged%: 0.000 (0) bgeid%: 0.000 (208) bgtd%: 0.000 (0) bgtid%: 0.323 (713295) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (89859) bned%: 0.000 (0) bneid%: 13.711 (30299127) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.736 (1626556) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (193112) DIV%: 0.000 (416) FPUN%: 1.182 (2613158) FPRSUB%: 3.714 (8207361) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (0) 
FPGT%: 3.101 (6852390) FPGE%: 0.798 (1763790) SYNC%: 0.000 (0) NOP%: 8.817 (19484657) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 163 SUB 0 MUL 14 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 529 FPSUB 0 FPMUL 5262 FPCMPLT 0 FPMIN 0 FPMAX 407 LOAD 2439284 INTCONV 0 ATOMIC_INC 12 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 99 FPINV 0 FPCONV 13 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2137 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2400 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3491227 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 832 ORI 619317 XORI 0 MULI 657998 LW 0 LWI 9774017 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1711 DIV 33 FPUN 0 FPRSUB 1 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.3677 --Total thread-cycles: 288279584 --total thread-cycles issued: 201504387 (69.898944%) --iCache conflicts: 6767064 (2.347396%) --thread*cycles of FU dependence: 16995486 (5.895487%) --thread*cycles of data dependence: 21984519 (7.626110%) --iCache cycles*banks: 288279584 (76.657900% used) Issue breakdown: --thread*cycles of issue worked: 201504387 (69.898945%) --thread*cycles of issue failed: 67290540 (23.342111%) --thread*cycles of issue NOP/other: 19484657 (6.758944%) Number of thread-cycles not ready: 21984519 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 220989044 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 8 4: 8 5: 7 6: 8 7: 8 8: 7 9: 8 10: 7 11: 7 12: 8 13: 7 14: 8 15: 8 16: 7 
17: 8 18: 7 19: 9 20: 8 21: 7 22: 7 23: 7 24: 9 25: 7 26: 9 27: 7 28: 7 29: 7 30: 6 31: 7 <=== Core 49 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6414472 in-flight CPI 1.3975 -- Total Cycles 8964153 ---- Thread 01 ---- PC 5: Stalled ----- 6509037 in-flight CPI 1.3772 -- Total Cycles 8964153 ---- Thread 02 ---- PC 5: Stalled ----- 6011023 in-flight CPI 1.4913 -- Total Cycles 8964153 ---- Thread 03 ---- PC 5: Stalled ----- 6111760 in-flight CPI 1.4667 -- Total Cycles 8964153 ---- Thread 04 ---- PC 5: Stalled ----- 6314339 in-flight CPI 1.4196 -- Total Cycles 8964153 ---- Thread 05 ---- PC 5: Stalled ----- 6434546 in-flight CPI 1.3931 -- Total Cycles 8964153 ---- Thread 06 ---- PC 5: Stalled ----- 5990228 in-flight CPI 1.4965 -- Total Cycles 8964153 ---- Thread 07 ---- PC 5: Stalled ----- 5949275 in-flight CPI 1.5068 -- Total Cycles 8964153 ---- Thread 08 ---- PC 5: Stalled ----- 6181325 in-flight CPI 1.4502 -- Total Cycles 8964153 ---- Thread 09 ---- PC 5: Stalled ----- 5784540 in-flight CPI 1.5497 -- Total Cycles 8964153 ---- Thread 10 ---- PC 5: Stalled ----- 5847936 in-flight CPI 1.5329 -- Total Cycles 8964153 ---- Thread 11 ---- PC 5: Stalled ----- 6165985 in-flight CPI 1.4538 -- Total Cycles 8964153 ---- Thread 12 ---- PC 5: Stalled ----- 5826966 in-flight CPI 1.5384 -- Total Cycles 8964153 ---- Thread 13 ---- PC 5: Stalled ----- 5832117 in-flight CPI 1.5370 -- Total Cycles 8964153 ---- Thread 14 ---- PC 5: Stalled ----- 5698673 in-flight CPI 1.5730 -- Total Cycles 8964153 ---- Thread 15 ---- PC 5: Stalled ----- 5719194 in-flight CPI 1.5674 -- Total Cycles 8964153 ---- Thread 16 ---- PC 5: Stalled ----- 5581583 in-flight CPI 1.6060 -- Total Cycles 8964153 ---- Thread 17 ---- PC 5: Stalled ----- 6593234 in-flight CPI 1.3596 -- Total Cycles 8964153 ---- Thread 18 ---- PC 5: Stalled ----- 6019191 in-flight CPI 1.4893 -- Total Cycles 8964153 ---- Thread 19 ---- PC 5: Stalled ----- 6262549 in-flight CPI 1.4314 -- Total Cycles 8964153 ---- Thread 20 ---- PC 5: 
Stalled ----- 5896781 in-flight CPI 1.5202 -- Total Cycles 8964153 ---- Thread 21 ---- PC 5: Stalled ----- 5833949 in-flight CPI 1.5365 -- Total Cycles 8964153 ---- Thread 22 ---- PC 5: Stalled ----- 6114458 in-flight CPI 1.4661 -- Total Cycles 8964153 ---- Thread 23 ---- PC 5: Stalled ----- 5650711 in-flight CPI 1.5864 -- Total Cycles 8964153 ---- Thread 24 ---- PC 5: Stalled ----- 5720065 in-flight CPI 1.5671 -- Total Cycles 8964153 ---- Thread 25 ---- PC 5: Stalled ----- 5803869 in-flight CPI 1.5445 -- Total Cycles 8964153 ---- Thread 26 ---- PC 5: Stalled ----- 5688694 in-flight CPI 1.5758 -- Total Cycles 8964153 ---- Thread 27 ---- PC 5: Stalled ----- 5669454 in-flight CPI 1.5811 -- Total Cycles 8964153 ---- Thread 28 ---- PC 5: Stalled ----- 5564695 in-flight CPI 1.6109 -- Total Cycles 8964153 ---- Thread 29 ---- PC 5: Stalled ----- 5814619 in-flight CPI 1.5417 -- Total Cycles 8964153 ---- Thread 30 ---- PC 5: Stalled ----- 6429577 in-flight CPI 1.3942 -- Total Cycles 8964153 ---- Thread 31 ---- PC 5: Stalled ----- 5518030 in-flight CPI 1.6245 -- Total Cycles 8964153 Total CPI 0.0469 , IPC 21.3019 -- Total Cycles 8964153 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 435074 (2.018108%) FPSUB: 0 (0.000000%) FPMUL: 1968479 (9.130870%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15335939 (71.136378%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 551868 (2.559862%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) 
LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3259465 (15.119161%) DIV: 7235 (0.033560%) FPUN: 0 (0.000000%) FPRSUB: 444 (0.002060%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (209416887 total) ADD%: 8.181 (17132788) SUB%: 0.000 (0) MUL%: 0.000 (196) BITOR%: 1.223 (2562010) BITAND%: 0.000 (0) 
BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.555 (1162099) FPSUB%: 0.000 (0) FPMUL%: 4.794 (10038921) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (588) FPMAX%: 0.000 (588) LOAD%: 4.958 (10383499) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (228) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40487) FPINV%: 0.000 (0) FPCONV%: 0.000 (652) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (2235224) FPLE%: 0.388 (813467) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (588) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (26772) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.958 (6194688) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (1576318) CMPU%: 0.000 (0) RSUB%: 0.000 (196) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.757 (32998566) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2578337) ORI%: 1.268 (2656266) XORI%: 0.000 (0) MULI%: 3.356 (7027359) LW%: 1.190 (2491304) LWI%: 13.911 (29131857) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.300 (628291) SWI%: 4.093 (8570552) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3096738) bged%: 0.000 (0) bgeid%: 0.000 (196) bgtd%: 0.000 (0) bgtid%: 0.322 (674778) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (85648) bned%: 0.000 (0) bneid%: 13.705 (28701538) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.736 (1541336) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 
0.089 (185571) DIV%: 0.000 (392) FPUN%: 1.178 (2466742) FPRSUB%: 3.720 (7790230) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.101 (6494633) FPGE%: 0.794 (1663721) SYNC%: 0.000 (0) NOP%: 8.817 (18463424) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 149 SUB 0 MUL 12 BITOR 6 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 515 FPSUB 0 FPMUL 5148 FPCMPLT 0 FPMIN 0 FPMAX 383 LOAD 2327006 INTCONV 0 ATOMIC_INC 2 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 137 FPINV 0 FPCONV 15 FPEQ 0 FPNE 0 FPLT 11 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1779 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2061 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3302885 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 741 ORI 595278 XORI 0 MULI 627147 LW 0 LWI 9257363 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1826 DIV 17 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.3019 --Total thread-cycles: 286852896 --total thread-cycles issued: 190953463 (66.568425%) --iCache conflicts: 6462001 (2.252723%) --thread*cycles of FU dependence: 16122487 (5.620472%) --thread*cycles of data dependence: 21558504 (7.515526%) --iCache cycles*banks: 286852896 (73.004987% used) Issue breakdown: --thread*cycles of issue worked: 190953463 (66.568428%) --thread*cycles of issue failed: 77436009 (26.995024%) --thread*cycles of issue NOP/other: 18463424 (6.436548%) Number of thread-cycles not ready: 21558504 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 209416887 SIMD fetches beyond the first: 0 
ATOMIC_INC called by threads: 0: 7 1: 9 2: 7 3: 7 4: 7 5: 7 6: 7 7: 7 8: 7 9: 7 10: 8 11: 7 12: 8 13: 7 14: 7 15: 7 16: 6 17: 8 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 8 25: 8 26: 7 27: 7 28: 6 29: 6 30: 8 31: 6 <=== Core 50 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6021463 in-flight CPI 1.5136 -- Total Cycles 9114348 ---- Thread 01 ---- PC 5: Stalled ----- 6369414 in-flight CPI 1.4310 -- Total Cycles 9114348 ---- Thread 02 ---- PC 5: Stalled ----- 7026174 in-flight CPI 1.2972 -- Total Cycles 9114348 ---- Thread 03 ---- PC 5: Stalled ----- 7006199 in-flight CPI 1.3009 -- Total Cycles 9114348 ---- Thread 04 ---- PC 5: Stalled ----- 5950088 in-flight CPI 1.5318 -- Total Cycles 9114348 ---- Thread 05 ---- PC 5: Stalled ----- 6062296 in-flight CPI 1.5034 -- Total Cycles 9114348 ---- Thread 06 ---- PC 5: Stalled ----- 6856768 in-flight CPI 1.3292 -- Total Cycles 9114348 ---- Thread 07 ---- PC 5: Stalled ----- 6177148 in-flight CPI 1.4755 -- Total Cycles 9114348 ---- Thread 08 ---- PC 5: Stalled ----- 6272178 in-flight CPI 1.4531 -- Total Cycles 9114348 ---- Thread 09 ---- PC 5: Stalled ----- 6293205 in-flight CPI 1.4483 -- Total Cycles 9114348 ---- Thread 10 ---- PC 5: Stalled ----- 6877173 in-flight CPI 1.3253 -- Total Cycles 9114348 ---- Thread 11 ---- PC 5: Stalled ----- 6090076 in-flight CPI 1.4966 -- Total Cycles 9114348 ---- Thread 12 ---- PC 5: Stalled ----- 6786437 in-flight CPI 1.3430 -- Total Cycles 9114348 ---- Thread 13 ---- PC 5: Stalled ----- 6746448 in-flight CPI 1.3510 -- Total Cycles 9114348 ---- Thread 14 ---- PC 5: Stalled ----- 5943501 in-flight CPI 1.5335 -- Total Cycles 9114348 ---- Thread 15 ---- PC 5: Stalled ----- 7074485 in-flight CPI 1.2883 -- Total Cycles 9114348 ---- Thread 16 ---- PC 5: Stalled ----- 6237864 in-flight CPI 1.4611 -- Total Cycles 9114348 ---- Thread 17 ---- PC 5: Stalled ----- 5875425 in-flight CPI 1.5513 -- Total Cycles 9114348 ---- Thread 18 ---- PC 5: Stalled ----- 6175355 in-flight CPI 1.4759 -- Total Cycles 9114348 
---- Thread 19 ---- PC 5: Stalled ----- 5964363 in-flight CPI 1.5281 -- Total Cycles 9114348 ---- Thread 20 ---- PC 5: Stalled ----- 6244410 in-flight CPI 1.4596 -- Total Cycles 9114348 ---- Thread 21 ---- PC 5: Stalled ----- 5879289 in-flight CPI 1.5502 -- Total Cycles 9114348 ---- Thread 22 ---- PC 5: Stalled ----- 5777162 in-flight CPI 1.5776 -- Total Cycles 9114348 ---- Thread 23 ---- PC 5: Stalled ----- 6498840 in-flight CPI 1.4025 -- Total Cycles 9114348 ---- Thread 24 ---- PC 5: Stalled ----- 5906238 in-flight CPI 1.5432 -- Total Cycles 9114348 ---- Thread 25 ---- PC 5: Stalled ----- 5884630 in-flight CPI 1.5488 -- Total Cycles 9114348 ---- Thread 26 ---- PC 5: Stalled ----- 5900223 in-flight CPI 1.5447 -- Total Cycles 9114348 ---- Thread 27 ---- PC 5: Stalled ----- 5505668 in-flight CPI 1.6554 -- Total Cycles 9114348 ---- Thread 28 ---- PC 5: Stalled ----- 5792590 in-flight CPI 1.5734 -- Total Cycles 9114348 ---- Thread 29 ---- PC 5: Stalled ----- 6045774 in-flight CPI 1.5076 -- Total Cycles 9114348 ---- Thread 30 ---- PC 5: Stalled ----- 5232440 in-flight CPI 1.7419 -- Total Cycles 9114348 ---- Thread 31 ---- PC 5: Stalled ----- 5860622 in-flight CPI 1.5552 -- Total Cycles 9114348 Total CPI 0.0460 , IPC 21.7607 -- Total Cycles 9114348 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 440356 (2.074639%) FPSUB: 0 (0.000000%) FPMUL: 2016522 (9.500394%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14892721 (70.163736%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 566295 (2.667973%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 
(0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3301778 (15.555591%) DIV: 7530 (0.035476%) FPUN: 0 (0.000000%) FPRSUB: 465 (0.002191%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: 
(217502855 total) ADD%: 8.163 (17754651) SUB%: 0.000 (0) MUL%: 0.000 (204) BITOR%: 1.232 (2679333) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (1181937) FPSUB%: 0.000 (0) FPMUL%: 4.757 (10347176) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (612) FPMAX%: 0.000 (612) LOAD%: 4.953 (10771888) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41729) FPINV%: 0.000 (0) FPCONV%: 0.000 (676) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (2307206) FPLE%: 0.392 (851657) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (612) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27908) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.968 (6455233) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1627539) CMPU%: 0.000 (0) RSUB%: 0.000 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.776 (34313024) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2681508) ORI%: 1.265 (2751625) XORI%: 0.000 (0) MULI%: 3.362 (7313307) LW%: 1.194 (2596082) LWI%: 13.919 (30275218) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (656372) SWI%: 4.100 (8917844) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3224870) bged%: 0.000 (0) bgeid%: 0.000 (204) bgtd%: 0.000 (0) bgtid%: 0.323 (703150) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86837) bned%: 0.000 (0) bneid%: 13.715 (29830433) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.739 (1607955) 
braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (187796) DIV%: 0.000 (408) FPUN%: 1.189 (2585326) FPRSUB%: 3.709 (8067430) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.099 (6741095) FPGE%: 0.802 (1744563) SYNC%: 0.000 (0) NOP%: 8.813 (19168297) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 150 SUB 0 MUL 25 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 580 FPSUB 0 FPMUL 5227 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 2359740 INTCONV 0 ATOMIC_INC 9 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 99 FPINV 0 FPCONV 8 FPEQ 0 FPNE 0 FPLT 14 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1932 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2207 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3436783 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 847 ORI 601778 XORI 0 MULI 654365 LW 0 LWI 9624325 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1723 DIV 14 FPUN 0 FPRSUB 6 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7607 --Total thread-cycles: 291659136 --total thread-cycles issued: 198334558 (68.002176%) --iCache conflicts: 6697191 (2.296239%) --thread*cycles of FU dependence: 16690253 (5.722520%) --thread*cycles of data dependence: 21225667 (7.277559%) --iCache cycles*banks: 291659136 (74.574344% used) Issue breakdown: --thread*cycles of issue worked: 198334558 (68.002176%) --thread*cycles of issue failed: 74156281 (25.425667%) --thread*cycles of issue NOP/other: 19168297 (6.572157%) Number of thread-cycles not ready: 21225667 Number of 
thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217502855 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 8 4: 7 5: 7 6: 9 7: 7 8: 7 9: 8 10: 8 11: 7 12: 9 13: 7 14: 7 15: 8 16: 7 17: 7 18: 8 19: 7 20: 9 21: 7 22: 7 23: 7 24: 8 25: 7 26: 7 27: 6 28: 7 29: 7 30: 6 31: 7 <=== Core 51 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5855896 in-flight CPI 1.5789 -- Total Cycles 9245765 ---- Thread 01 ---- PC 5: Stalled ----- 6224418 in-flight CPI 1.4854 -- Total Cycles 9245765 ---- Thread 02 ---- PC 5: Stalled ----- 6485708 in-flight CPI 1.4256 -- Total Cycles 9245765 ---- Thread 03 ---- PC 5: Stalled ----- 6390999 in-flight CPI 1.4467 -- Total Cycles 9245765 ---- Thread 04 ---- PC 5: Stalled ----- 6099072 in-flight CPI 1.5159 -- Total Cycles 9245765 ---- Thread 05 ---- PC 5: Stalled ----- 6940797 in-flight CPI 1.3321 -- Total Cycles 9245765 ---- Thread 06 ---- PC 5: Stalled ----- 6078615 in-flight CPI 1.5210 -- Total Cycles 9245765 ---- Thread 07 ---- PC 5: Stalled ----- 5942465 in-flight CPI 1.5559 -- Total Cycles 9245765 ---- Thread 08 ---- PC 5: Stalled ----- 6784026 in-flight CPI 1.3629 -- Total Cycles 9245765 ---- Thread 09 ---- PC 5: Stalled ----- 6890159 in-flight CPI 1.3419 -- Total Cycles 9245765 ---- Thread 10 ---- PC 5: Stalled ----- 6519574 in-flight CPI 1.4182 -- Total Cycles 9245765 ---- Thread 11 ---- PC 5: Stalled ----- 5799997 in-flight CPI 1.5941 -- Total Cycles 9245765 ---- Thread 12 ---- PC 5: Stalled ----- 5837939 in-flight CPI 1.5837 -- Total Cycles 9245765 ---- Thread 13 ---- PC 5: Stalled ----- 6289398 in-flight CPI 1.4701 -- Total Cycles 9245765 ---- Thread 14 ---- PC 5: Stalled ----- 6230904 in-flight CPI 1.4839 -- Total Cycles 9245765 ---- Thread 15 ---- PC 5: Stalled ----- 5885627 in-flight CPI 1.5709 -- Total Cycles 9245765 ---- Thread 16 ---- PC 5: Stalled ----- 6109023 in-flight CPI 1.5135 -- Total Cycles 9245765 ---- Thread 17 ---- PC 5: Stalled ----- 6368495 in-flight CPI 1.4518 -- 
Total Cycles 9245765 ---- Thread 18 ---- PC 5: Stalled ----- 5682068 in-flight CPI 1.6272 -- Total Cycles 9245765 ---- Thread 19 ---- PC 5: Stalled ----- 6099318 in-flight CPI 1.5159 -- Total Cycles 9245765 ---- Thread 20 ---- PC 5: Stalled ----- 5825375 in-flight CPI 1.5871 -- Total Cycles 9245765 ---- Thread 21 ---- PC 5: Stalled ----- 6305396 in-flight CPI 1.4663 -- Total Cycles 9245765 ---- Thread 22 ---- PC 5: Stalled ----- 5828517 in-flight CPI 1.5863 -- Total Cycles 9245765 ---- Thread 23 ---- PC 5: Stalled ----- 5795255 in-flight CPI 1.5954 -- Total Cycles 9245765 ---- Thread 24 ---- PC 5: Stalled ----- 6531750 in-flight CPI 1.4155 -- Total Cycles 9245765 ---- Thread 25 ---- PC 5: Stalled ----- 6002871 in-flight CPI 1.5402 -- Total Cycles 9245765 ---- Thread 26 ---- PC 5: Stalled ----- 5507115 in-flight CPI 1.6789 -- Total Cycles 9245765 ---- Thread 27 ---- PC 5: Stalled ----- 6097987 in-flight CPI 1.5162 -- Total Cycles 9245765 ---- Thread 28 ---- PC 5: Stalled ----- 6668649 in-flight CPI 1.3864 -- Total Cycles 9245765 ---- Thread 29 ---- PC 5: Stalled ----- 5640787 in-flight CPI 1.6391 -- Total Cycles 9245765 ---- Thread 30 ---- PC 5: Stalled ----- 5260519 in-flight CPI 1.7576 -- Total Cycles 9245765 ---- Thread 31 ---- PC 5: Stalled ----- 6058227 in-flight CPI 1.5261 -- Total Cycles 9245765 Total CPI 0.0472 , IPC 21.2030 -- Total Cycles 9245765 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 437200 (2.035725%) FPSUB: 0 (0.000000%) FPMUL: 1998703 (9.306518%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15206230 (70.804444%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) 
ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 552751 (2.573763%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3273535 (15.242491%) DIV: 7496 (0.034903%) FPUN: 0 (0.000000%) FPRSUB: 463 (0.002156%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 
(0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (214986644 total) ADD%: 8.198 (17623917) SUB%: 0.000 (0) MUL%: 0.000 (203) BITOR%: 1.224 (2631962) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (1167997) FPSUB%: 0.000 (0) FPMUL%: 4.764 (10241444) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (609) FPMAX%: 0.000 (609) LOAD%: 4.951 (10642949) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40673) FPINV%: 0.000 (0) FPCONV%: 0.000 (673) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2285378) FPLE%: 0.388 (834727) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (609) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27224) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.966 (6376215) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1611201) CMPU%: 0.000 (0) RSUB%: 0.000 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.761 (33884064) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2649467) ORI%: 1.265 (2719047) XORI%: 0.000 (0) MULI%: 3.362 (7227278) LW%: 1.193 (2564134) LWI%: 13.929 (29945753) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (647866) SWI%: 4.100 (8814544) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3185889) bged%: 0.000 (0) bgeid%: 0.000 (203) bgtd%: 0.000 (0) bgtid%: 0.323 (694660) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (87142) bned%: 0.000 (0) 
bneid%: 13.705 (29464158) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.735 (1579343) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (186293) DIV%: 0.000 (406) FPUN%: 1.181 (2538294) FPRSUB%: 3.713 (7982234) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (0) FPGT%: 3.101 (6665722) FPGE%: 0.797 (1714134) SYNC%: 0.000 (0) NOP%: 8.814 (18949089) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 216 SUB 0 MUL 16 BITOR 8 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 503 FPSUB 0 FPMUL 5319 FPCMPLT 0 FPMIN 0 FPMAX 392 LOAD 2368012 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 111 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1928 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2320 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3398357 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 847 ORI 597799 XORI 0 MULI 644666 LW 0 LWI 9514175 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1699 DIV 18 FPUN 0 FPRSUB 6 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.2030 --Total thread-cycles: 295864480 --total thread-cycles issued: 196037555 (66.259239%) --iCache conflicts: 6568101 (2.219969%) --thread*cycles of FU dependence: 16536422 (5.589188%) --thread*cycles of data dependence: 21476378 (7.258856%) --iCache cycles*banks: 295864480 (72.663902% used) Issue breakdown: --thread*cycles of issue worked: 196037555 (66.259240%) --thread*cycles of issue failed: 80877836 
(27.336109%) --thread*cycles of issue NOP/other: 18949089 (6.404652%) Number of thread-cycles not ready: 21476378 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214986644 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 8 4: 8 5: 8 6: 7 7: 7 8: 8 9: 8 10: 8 11: 7 12: 7 13: 7 14: 8 15: 7 16: 7 17: 7 18: 6 19: 7 20: 8 21: 7 22: 7 23: 7 24: 8 25: 8 26: 7 27: 7 28: 9 29: 6 30: 6 31: 7 <=== Core 52 ===> ---- Thread 00 ---- PC 5: Stalled ----- 7205887 in-flight CPI 1.2773 -- Total Cycles 9204356 ---- Thread 01 ---- PC 5: Stalled ----- 6913316 in-flight CPI 1.3314 -- Total Cycles 9204356 ---- Thread 02 ---- PC 5: Stalled ----- 6856733 in-flight CPI 1.3424 -- Total Cycles 9204356 ---- Thread 03 ---- PC 5: Stalled ----- 6841108 in-flight CPI 1.3454 -- Total Cycles 9204356 ---- Thread 04 ---- PC 5: Stalled ----- 6919502 in-flight CPI 1.3302 -- Total Cycles 9204356 ---- Thread 05 ---- PC 5: Stalled ----- 6341785 in-flight CPI 1.4514 -- Total Cycles 9204356 ---- Thread 06 ---- PC 5: Stalled ----- 6024241 in-flight CPI 1.5279 -- Total Cycles 9204356 ---- Thread 07 ---- PC 5: Stalled ----- 6901820 in-flight CPI 1.3336 -- Total Cycles 9204356 ---- Thread 08 ---- PC 5: Stalled ----- 6298860 in-flight CPI 1.4613 -- Total Cycles 9204356 ---- Thread 09 ---- PC 5: Stalled ----- 6005742 in-flight CPI 1.5326 -- Total Cycles 9204356 ---- Thread 10 ---- PC 5: Stalled ----- 6204551 in-flight CPI 1.4835 -- Total Cycles 9204356 ---- Thread 11 ---- PC 5: Stalled ----- 5823222 in-flight CPI 1.5806 -- Total Cycles 9204356 ---- Thread 12 ---- PC 5: Stalled ----- 5832338 in-flight CPI 1.5782 -- Total Cycles 9204356 ---- Thread 13 ---- PC 5: Stalled ----- 5897960 in-flight CPI 1.5606 -- Total Cycles 9204356 ---- Thread 14 ---- PC 5: Stalled ----- 6320455 in-flight CPI 1.4563 -- Total Cycles 9204356 ---- Thread 15 ---- PC 5: Stalled ----- 6292419 in-flight CPI 1.4628 -- Total Cycles 9204356 ---- Thread 16 ---- PC 5: Stalled ----- 
5717995 in-flight CPI 1.6097 -- Total Cycles 9204356 ---- Thread 17 ---- PC 5: Stalled ----- 6088444 in-flight CPI 1.5118 -- Total Cycles 9204356 ---- Thread 18 ---- PC 5: Stalled ----- 5603049 in-flight CPI 1.6427 -- Total Cycles 9204356 ---- Thread 19 ---- PC 5: Stalled ----- 5643281 in-flight CPI 1.6310 -- Total Cycles 9204356 ---- Thread 20 ---- PC 5: Stalled ----- 5758332 in-flight CPI 1.5984 -- Total Cycles 9204356 ---- Thread 21 ---- PC 5: Stalled ----- 6049594 in-flight CPI 1.5215 -- Total Cycles 9204356 ---- Thread 22 ---- PC 5: Stalled ----- 5748165 in-flight CPI 1.6013 -- Total Cycles 9204356 ---- Thread 23 ---- PC 5: Stalled ----- 6126048 in-flight CPI 1.5025 -- Total Cycles 9204356 ---- Thread 24 ---- PC 5: Stalled ----- 6626008 in-flight CPI 1.3891 -- Total Cycles 9204356 ---- Thread 25 ---- PC 5: Stalled ----- 5831570 in-flight CPI 1.5784 -- Total Cycles 9204356 ---- Thread 26 ---- PC 5: Stalled ----- 5842280 in-flight CPI 1.5755 -- Total Cycles 9204356 ---- Thread 27 ---- PC 5: Stalled ----- 5452457 in-flight CPI 1.6881 -- Total Cycles 9204356 ---- Thread 28 ---- PC 5: Stalled ----- 5739900 in-flight CPI 1.6036 -- Total Cycles 9204356 ---- Thread 29 ---- PC 5: Stalled ----- 5359508 in-flight CPI 1.7174 -- Total Cycles 9204356 ---- Thread 30 ---- PC 5: Stalled ----- 6381917 in-flight CPI 1.4423 -- Total Cycles 9204356 ---- Thread 31 ---- PC 5: Stalled ----- 5461425 in-flight CPI 1.6853 -- Total Cycles 9204356 Total CPI 0.0469 , IPC 21.3063 -- Total Cycles 9204356 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 435173 (2.002520%) FPSUB: 0 (0.000000%) FPMUL: 1996811 (9.188655%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15465319 (71.166216%) INTCONV: 0 (0.000000%) 
ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 560699 (2.580149%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3265239 (15.025536%) DIV: 7556 (0.034770%) FPUN: 0 
(0.000000%) FPRSUB: 468 (0.002154%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215084128 total) ADD%: 8.164 (17559236) SUB%: 0.000 (0) MUL%: 0.000 (205) BITOR%: 1.230 (2645631) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.542 (1165800) FPSUB%: 0.000 (0) FPMUL%: 4.758 (10233734) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.950 (10646992) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41199) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2285715) FPLE%: 0.395 (850537) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27520) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.963 (6372337) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1608577) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.775 (33930351) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2648450) ORI%: 1.259 (2708356) XORI%: 0.000 (0) MULI%: 3.361 (7229204) LW%: 1.191 (2562718) LWI%: 13.925 (29950009) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (647149) SWI%: 4.095 (8808358) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3184510) bged%: 0.000 (0) bgeid%: 0.000 (205) 
bgtd%: 0.000 (0) bgtid%: 0.322 (693093) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (84893) bned%: 0.000 (0) bneid%: 13.722 (29513444) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.743 (1599144) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (185741) DIV%: 0.000 (410) FPUN%: 1.186 (2551486) FPRSUB%: 3.711 (7982153) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.105 (6678672) FPGE%: 0.796 (1711634) SYNC%: 0.000 (0) NOP%: 8.821 (18973601) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 158 SUB 0 MUL 19 BITOR 3 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 545 FPSUB 0 FPMUL 5319 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 2367489 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 113 FPINV 0 FPCONV 9 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1993 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2215 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3400389 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 798 ORI 594743 XORI 0 MULI 637700 LW 0 LWI 9523454 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1728 DIV 12 FPUN 0 FPRSUB 6 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.3063 --Total thread-cycles: 294539392 --total thread-cycles issued: 196110527 (66.582105%) --iCache conflicts: 6533459 (2.218195%) --thread*cycles of FU dependence: 16537145 (5.614578%) --thread*cycles of data dependence: 21731265 (7.378050%) --iCache cycles*banks: 294539392 (73.023903% 
used) Issue breakdown: --thread*cycles of issue worked: 196110527 (66.582105%) --thread*cycles of issue failed: 79455264 (26.976108%) --thread*cycles of issue NOP/other: 18973601 (6.441787%) Number of thread-cycles not ready: 21731265 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215084128 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 9 5: 7 6: 8 7: 8 8: 7 9: 8 10: 7 11: 7 12: 7 13: 7 14: 7 15: 7 16: 8 17: 8 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 8 25: 7 26: 7 27: 7 28: 7 29: 7 30: 8 31: 6 <=== Core 53 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6016258 in-flight CPI 1.4517 -- Total Cycles 8733569 ---- Thread 01 ---- PC 5: Stalled ----- 6459559 in-flight CPI 1.3520 -- Total Cycles 8733569 ---- Thread 02 ---- PC 5: Stalled ----- 6127060 in-flight CPI 1.4254 -- Total Cycles 8733569 ---- Thread 03 ---- PC 5: Stalled ----- 6171796 in-flight CPI 1.4151 -- Total Cycles 8733569 ---- Thread 04 ---- PC 5: Stalled ----- 6481819 in-flight CPI 1.3474 -- Total Cycles 8733569 ---- Thread 05 ---- PC 5: Stalled ----- 5837361 in-flight CPI 1.4961 -- Total Cycles 8733569 ---- Thread 06 ---- PC 5: Stalled ----- 6520995 in-flight CPI 1.3393 -- Total Cycles 8733569 ---- Thread 07 ---- PC 5: Stalled ----- 5938948 in-flight CPI 1.4706 -- Total Cycles 8733569 ---- Thread 08 ---- PC 5: Stalled ----- 5849034 in-flight CPI 1.4932 -- Total Cycles 8733569 ---- Thread 09 ---- PC 5: Stalled ----- 5774995 in-flight CPI 1.5123 -- Total Cycles 8733569 ---- Thread 10 ---- PC 5: Stalled ----- 6458076 in-flight CPI 1.3523 -- Total Cycles 8733569 ---- Thread 11 ---- PC 5: Stalled ----- 6307300 in-flight CPI 1.3847 -- Total Cycles 8733569 ---- Thread 12 ---- PC 5: Stalled ----- 6035466 in-flight CPI 1.4470 -- Total Cycles 8733569 ---- Thread 13 ---- PC 5: Stalled ----- 6749933 in-flight CPI 1.2939 -- Total Cycles 8733569 ---- Thread 14 ---- PC 5: Stalled ----- 6206648 in-flight CPI 1.4071 -- Total Cycles 8733569 ---- Thread 15 
---- PC 5: Stalled ----- 6049818 in-flight CPI 1.4436 -- Total Cycles 8733569 ---- Thread 16 ---- PC 5: Stalled ----- 5787895 in-flight CPI 1.5089 -- Total Cycles 8733569 ---- Thread 17 ---- PC 5: Stalled ----- 5706783 in-flight CPI 1.5304 -- Total Cycles 8733569 ---- Thread 18 ---- PC 5: Stalled ----- 6522720 in-flight CPI 1.3389 -- Total Cycles 8733569 ---- Thread 19 ---- PC 5: Stalled ----- 5707702 in-flight CPI 1.5301 -- Total Cycles 8733569 ---- Thread 20 ---- PC 5: Stalled ----- 5662088 in-flight CPI 1.5425 -- Total Cycles 8733569 ---- Thread 21 ---- PC 5: Stalled ----- 6454916 in-flight CPI 1.3530 -- Total Cycles 8733569 ---- Thread 22 ---- PC 5: Stalled ----- 5742337 in-flight CPI 1.5209 -- Total Cycles 8733569 ---- Thread 23 ---- PC 5: Stalled ----- 6221467 in-flight CPI 1.4038 -- Total Cycles 8733569 ---- Thread 24 ---- PC 5: Stalled ----- 6191760 in-flight CPI 1.4105 -- Total Cycles 8733569 ---- Thread 25 ---- PC 5: Stalled ----- 5835722 in-flight CPI 1.4966 -- Total Cycles 8733569 ---- Thread 26 ---- PC 5: Stalled ----- 5465800 in-flight CPI 1.5979 -- Total Cycles 8733569 ---- Thread 27 ---- PC 5: Stalled ----- 5902030 in-flight CPI 1.4798 -- Total Cycles 8733569 ---- Thread 28 ---- PC 5: Stalled ----- 6107304 in-flight CPI 1.4300 -- Total Cycles 8733569 ---- Thread 29 ---- PC 5: Stalled ----- 5527807 in-flight CPI 1.5799 -- Total Cycles 8733569 ---- Thread 30 ---- PC 5: Stalled ----- 5472889 in-flight CPI 1.5958 -- Total Cycles 8733569 ---- Thread 31 ---- PC 5: Stalled ----- 5542099 in-flight CPI 1.5759 -- Total Cycles 8733569 Total CPI 0.0453 , IPC 22.0800 -- Total Cycles 8733569 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 426492 (2.027154%) FPSUB: 0 (0.000000%) FPMUL: 1958664 (9.309704%) 
FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14867340 (70.665789%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 556310 (2.644191%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 
(0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3222128 (15.315061%) DIV: 7553 (0.035900%) FPUN: 0 (0.000000%) FPRSUB: 463 (0.002201%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (211495338 total) ADD%: 8.185 (17310466) SUB%: 0.000 (0) MUL%: 0.000 (205) BITOR%: 1.224 (2588892) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.541 (1144838) FPSUB%: 0.000 (0) FPMUL%: 4.753 (10053046) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.950 (10469592) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41005) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (2246693) FPLE%: 0.395 (834786) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27564) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.963 (6266807) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1579224) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (33347495) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2602330) ORI%: 1.256 (2655343) XORI%: 0.000 (0) MULI%: 3.362 (7110314) LW%: 1.192 (2520530) LWI%: 13.938 (29478212) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (637761) SWI%: 4.104 (8678744) 
sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3130254) bged%: 0.000 (0) bgeid%: 0.000 (205) bgtd%: 0.000 (0) bgtid%: 0.323 (683180) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85299) bned%: 0.000 (0) bneid%: 13.711 (28997470) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.743 (1570623) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (183194) DIV%: 0.000 (410) FPUN%: 1.181 (2498651) FPRSUB%: 3.710 (7846951) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.106 (6569279) FPGE%: 0.792 (1674572) SYNC%: 0.000 (0) NOP%: 8.822 (18658338) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 177 SUB 0 MUL 21 BITOR 6 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 576 FPSUB 0 FPMUL 5268 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 2314462 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 103 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 11 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2007 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 2 ADDKC 0 BITXOR 0 ANDN 0 CMP 2261 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3344564 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 747 ORI 581235 XORI 0 MULI 635465 LW 0 LWI 9368193 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1716 DIV 10 FPUN 0 FPRSUB 2 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.0800 --Total thread-cycles: 279474208 --total thread-cycles issued: 192837000 (68.999924%) --iCache conflicts: 6497413 (2.324870%) --thread*cycles of FU dependence: 16257259 
(5.817087%) --thread*cycles of data dependence: 21038950 (7.528047%) --iCache cycles*banks: 279474208 (75.676168% used) Issue breakdown: --thread*cycles of issue worked: 192837000 (68.999927%) --thread*cycles of issue failed: 67978870 (24.323844%) --thread*cycles of issue NOP/other: 18658338 (6.676229%) Number of thread-cycles not ready: 21038950 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 211495338 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 8 4: 9 5: 7 6: 8 7: 7 8: 10 9: 7 10: 8 11: 7 12: 7 13: 8 14: 8 15: 7 16: 6 17: 7 18: 7 19: 7 20: 6 21: 9 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 8 29: 7 30: 8 31: 7 <=== Core 54 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6210040 in-flight CPI 1.4921 -- Total Cycles 9266091 ---- Thread 01 ---- PC 5: Stalled ----- 6389293 in-flight CPI 1.4502 -- Total Cycles 9266091 ---- Thread 02 ---- PC 5: Stalled ----- 6108721 in-flight CPI 1.5169 -- Total Cycles 9266091 ---- Thread 03 ---- PC 5: Stalled ----- 6953360 in-flight CPI 1.3326 -- Total Cycles 9266091 ---- Thread 04 ---- PC 5: Stalled ----- 6386204 in-flight CPI 1.4510 -- Total Cycles 9266091 ---- Thread 05 ---- PC 5: Stalled ----- 6598874 in-flight CPI 1.4042 -- Total Cycles 9266091 ---- Thread 06 ---- PC 5: Stalled ----- 6569532 in-flight CPI 1.4105 -- Total Cycles 9266091 ---- Thread 07 ---- PC 5: Stalled ----- 5999647 in-flight CPI 1.5444 -- Total Cycles 9266091 ---- Thread 08 ---- PC 5: Stalled ----- 6116735 in-flight CPI 1.5149 -- Total Cycles 9266091 ---- Thread 09 ---- PC 5: Stalled ----- 6052439 in-flight CPI 1.5310 -- Total Cycles 9266091 ---- Thread 10 ---- PC 5: Stalled ----- 5979969 in-flight CPI 1.5495 -- Total Cycles 9266091 ---- Thread 11 ---- PC 5: Stalled ----- 5980727 in-flight CPI 1.5493 -- Total Cycles 9266091 ---- Thread 12 ---- PC 5: Stalled ----- 5984328 in-flight CPI 1.5484 -- Total Cycles 9266091 ---- Thread 13 ---- PC 5: Stalled ----- 5977149 in-flight CPI 1.5502 -- Total Cycles 
9266091 ---- Thread 14 ---- PC 5: Stalled ----- 6200291 in-flight CPI 1.4945 -- Total Cycles 9266091 ---- Thread 15 ---- PC 5: Stalled ----- 7131665 in-flight CPI 1.2993 -- Total Cycles 9266091 ---- Thread 16 ---- PC 5: Stalled ----- 5683292 in-flight CPI 1.6304 -- Total Cycles 9266091 ---- Thread 17 ---- PC 5: Stalled ----- 5769835 in-flight CPI 1.6059 -- Total Cycles 9266091 ---- Thread 18 ---- PC 5: Stalled ----- 5828529 in-flight CPI 1.5898 -- Total Cycles 9266091 ---- Thread 19 ---- PC 5: Stalled ----- 5753545 in-flight CPI 1.6105 -- Total Cycles 9266091 ---- Thread 20 ---- PC 5: Stalled ----- 6010367 in-flight CPI 1.5417 -- Total Cycles 9266091 ---- Thread 21 ---- PC 5: Stalled ----- 6247317 in-flight CPI 1.4832 -- Total Cycles 9266091 ---- Thread 22 ---- PC 5: Stalled ----- 6290564 in-flight CPI 1.4730 -- Total Cycles 9266091 ---- Thread 23 ---- PC 5: Stalled ----- 5746228 in-flight CPI 1.6125 -- Total Cycles 9266091 ---- Thread 24 ---- PC 5: Stalled ----- 5772732 in-flight CPI 1.6051 -- Total Cycles 9266091 ---- Thread 25 ---- PC 5: Stalled ----- 6097793 in-flight CPI 1.5196 -- Total Cycles 9266091 ---- Thread 26 ---- PC 5: Stalled ----- 6027739 in-flight CPI 1.5372 -- Total Cycles 9266091 ---- Thread 27 ---- PC 5: Stalled ----- 6031574 in-flight CPI 1.5363 -- Total Cycles 9266091 ---- Thread 28 ---- PC 5: Stalled ----- 5731173 in-flight CPI 1.6168 -- Total Cycles 9266091 ---- Thread 29 ---- PC 5: Stalled ----- 5615885 in-flight CPI 1.6500 -- Total Cycles 9266091 ---- Thread 30 ---- PC 5: Stalled ----- 5819138 in-flight CPI 1.5923 -- Total Cycles 9266091 ---- Thread 31 ---- PC 5: Stalled ----- 5272731 in-flight CPI 1.7574 -- Total Cycles 9266091 Total CPI 0.0477 , IPC 20.9730 -- Total Cycles 9266091 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 
0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 436396 (2.060829%) FPSUB: 0 (0.000000%) FPMUL: 1990348 (9.399185%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14902132 (70.373574%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 564714 (2.666796%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) 
brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3273972 (15.460949%) DIV: 7716 (0.036438%) FPUN: 0 (0.000000%) FPRSUB: 472 (0.002229%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (213147026 total) ADD%: 8.173 (17419733) SUB%: 0.000 (0) MUL%: 0.000 (209) BITOR%: 1.224 (2608350) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.550 (1172895) FPSUB%: 0.000 (0) FPMUL%: 4.776 (10179057) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (627) FPMAX%: 0.000 (627) LOAD%: 4.947 (10544286) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41535) FPINV%: 0.000 (0) FPCONV%: 0.000 (691) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (2271371) FPLE%: 0.390 (830989) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (627) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27970) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.960 (6309663) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1592883) CMPU%: 0.000 (0) RSUB%: 0.000 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.760 (33591566) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2621152) ORI%: 1.266 (2699305) XORI%: 0.000 (0) MULI%: 3.359 (7159719) LW%: 1.191 
(2537810) LWI%: 13.925 (29680411) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (641491) SWI%: 4.100 (8739583) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3152536) bged%: 0.000 (0) bgeid%: 0.000 (209) bgtd%: 0.000 (0) bgtid%: 0.322 (687110) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (86580) bned%: 0.000 (0) bneid%: 13.714 (29230356) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.737 (1571700) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (186179) DIV%: 0.000 (418) FPUN%: 1.181 (2516504) FPRSUB%: 3.714 (7916488) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.106 (6620501) FPGE%: 0.796 (1696365) SYNC%: 0.000 (0) NOP%: 8.824 (18808983) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 172 SUB 0 MUL 28 BITOR 2 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 509 FPSUB 0 FPMUL 5290 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 2326781 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 103 FPINV 0 FPCONV 7 FPEQ 0 FPNE 0 FPLT 15 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1812 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2207 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3366822 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 736 ORI 596698 XORI 0 MULI 641906 LW 0 LWI 9430759 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1740 DIV 14 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.9730 --Total thread-cycles: 296514912 --total thread-cycles issued: 
194338043 (65.540733%) --iCache conflicts: 6596277 (2.224602%) --thread*cycles of FU dependence: 16376031 (5.522836%) --thread*cycles of data dependence: 21175750 (7.141546%) --iCache cycles*banks: 296514912 (71.884094% used) Issue breakdown: --thread*cycles of issue worked: 194338043 (65.540732%) --thread*cycles of issue failed: 83367886 (28.115917%) --thread*cycles of issue NOP/other: 18808983 (6.343351%) Number of thread-cycles not ready: 21175750 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 213147026 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 7 3: 8 4: 8 5: 8 6: 7 7: 8 8: 8 9: 8 10: 8 11: 7 12: 7 13: 7 14: 8 15: 10 16: 8 17: 7 18: 7 19: 7 20: 7 21: 7 22: 9 23: 7 24: 7 25: 8 26: 7 27: 7 28: 7 29: 6 30: 8 31: 6 <=== Core 55 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6679999 in-flight CPI 1.3236 -- Total Cycles 8841970 ---- Thread 01 ---- PC 5: Stalled ----- 6608743 in-flight CPI 1.3379 -- Total Cycles 8841970 ---- Thread 02 ---- PC 5: Stalled ----- 6645193 in-flight CPI 1.3306 -- Total Cycles 8841970 ---- Thread 03 ---- PC 5: Stalled ----- 5861599 in-flight CPI 1.5085 -- Total Cycles 8841970 ---- Thread 04 ---- PC 5: Stalled ----- 6283710 in-flight CPI 1.4071 -- Total Cycles 8841970 ---- Thread 05 ---- PC 5: Stalled ----- 6569151 in-flight CPI 1.3460 -- Total Cycles 8841970 ---- Thread 06 ---- PC 5: Stalled ----- 6709655 in-flight CPI 1.3178 -- Total Cycles 8841970 ---- Thread 07 ---- PC 5: Stalled ----- 6046471 in-flight CPI 1.4623 -- Total Cycles 8841970 ---- Thread 08 ---- PC 5: Stalled ----- 6352351 in-flight CPI 1.3919 -- Total Cycles 8841970 ---- Thread 09 ---- PC 5: Stalled ----- 6168794 in-flight CPI 1.4333 -- Total Cycles 8841970 ---- Thread 10 ---- PC 5: Stalled ----- 6653530 in-flight CPI 1.3289 -- Total Cycles 8841970 ---- Thread 11 ---- PC 5: Stalled ----- 6578996 in-flight CPI 1.3440 -- Total Cycles 8841970 ---- Thread 12 ---- PC 5: Stalled ----- 5917558 in-flight CPI 1.4942 -- 
Total Cycles 8841970 ---- Thread 13 ---- PC 5: Stalled ----- 6140241 in-flight CPI 1.4400 -- Total Cycles 8841970 ---- Thread 14 ---- PC 5: Stalled ----- 5761346 in-flight CPI 1.5347 -- Total Cycles 8841970 ---- Thread 15 ---- PC 5: Stalled ----- 6107890 in-flight CPI 1.4476 -- Total Cycles 8841970 ---- Thread 16 ---- PC 5: Stalled ----- 5913338 in-flight CPI 1.4953 -- Total Cycles 8841970 ---- Thread 17 ---- PC 5: Stalled ----- 6032296 in-flight CPI 1.4658 -- Total Cycles 8841970 ---- Thread 18 ---- PC 5: Stalled ----- 5841009 in-flight CPI 1.5138 -- Total Cycles 8841970 ---- Thread 19 ---- PC 5: Stalled ----- 6263164 in-flight CPI 1.4117 -- Total Cycles 8841970 ---- Thread 20 ---- PC 5: Stalled ----- 6147444 in-flight CPI 1.4383 -- Total Cycles 8841970 ---- Thread 21 ---- PC 5: Stalled ----- 6603854 in-flight CPI 1.3389 -- Total Cycles 8841970 ---- Thread 22 ---- PC 5: Stalled ----- 6254049 in-flight CPI 1.4138 -- Total Cycles 8841970 ---- Thread 23 ---- PC 5: Stalled ----- 5833531 in-flight CPI 1.5157 -- Total Cycles 8841970 ---- Thread 24 ---- PC 5: Stalled ----- 6122803 in-flight CPI 1.4441 -- Total Cycles 8841970 ---- Thread 25 ---- PC 5: Stalled ----- 5932833 in-flight CPI 1.4903 -- Total Cycles 8841970 ---- Thread 26 ---- PC 5: Stalled ----- 5790051 in-flight CPI 1.5271 -- Total Cycles 8841970 ---- Thread 27 ---- PC 5: Stalled ----- 6116675 in-flight CPI 1.4455 -- Total Cycles 8841970 ---- Thread 28 ---- PC 5: Stalled ----- 5684421 in-flight CPI 1.5555 -- Total Cycles 8841970 ---- Thread 29 ---- PC 5: Stalled ----- 5932410 in-flight CPI 1.4904 -- Total Cycles 8841970 ---- Thread 30 ---- PC 5: Stalled ----- 6042940 in-flight CPI 1.4632 -- Total Cycles 8841970 ---- Thread 31 ---- PC 5: Stalled ----- 5229510 in-flight CPI 1.6908 -- Total Cycles 8841970 Total CPI 0.0449 , IPC 22.2604 -- Total Cycles 8841970 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 
0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 445034 (2.070439%) FPSUB: 0 (0.000000%) FPMUL: 2019802 (9.396756%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15126001 (70.370929%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 560447 (2.607376%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 
(0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3335393 (15.517300%) DIV: 7530 (0.035032%) FPUN: 0 (0.000000%) FPRSUB: 466 (0.002168%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215867788 total) ADD%: 8.198 (17697327) SUB%: 0.000 (0) MUL%: 0.000 (204) BITOR%: 1.223 (2639689) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.551 (1189815) FPSUB%: 0.000 (0) FPMUL%: 4.782 (10323186) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (612) FPMAX%: 0.000 (612) LOAD%: 4.955 (10696953) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41334) FPINV%: 0.000 (0) FPCONV%: 0.000 (676) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (2300742) FPLE%: 0.389 (839229) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (612) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27860) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.959 (6387183) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1619566) CMPU%: 0.000 (0) RSUB%: 0.000 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.758 (34016886) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) 
ANDNI%: 0.000 (0) ANDI%: 1.231 (2656431) ORI%: 1.266 (2732945) XORI%: 0.000 (0) MULI%: 3.356 (7244548) LW%: 1.190 (2568758) LWI%: 13.908 (30023503) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (649956) SWI%: 4.093 (8834740) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.478 (3190271) bged%: 0.000 (0) bgeid%: 0.000 (204) bgtd%: 0.000 (0) bgtid%: 0.323 (697400) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (87647) bned%: 0.000 (0) bneid%: 13.708 (29591577) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.737 (1591416) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.088 (189826) DIV%: 0.000 (408) FPUN%: 1.178 (2543496) FPRSUB%: 3.717 (8023443) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (5) FPGT%: 3.104 (6701434) FPGE%: 0.795 (1715137) SYNC%: 0.000 (0) NOP%: 8.821 (19041621) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 164 SUB 0 MUL 17 BITOR 8 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 553 FPSUB 0 FPMUL 5100 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 2379705 INTCONV 0 ATOMIC_INC 8 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 107 FPINV 0 FPCONV 14 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1875 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2308 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3409250 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 839 ORI 608539 XORI 0 MULI 644211 LW 0 LWI 9543975 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1792 DIV 18 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 
--Average #threads Issuing each cycle: 22.2604 --Total thread-cycles: 282943040 --total thread-cycles issued: 196826167 (69.563881%) --iCache conflicts: 6655352 (2.352188%) --thread*cycles of FU dependence: 16598903 (5.866518%) --thread*cycles of data dependence: 21494673 (7.596820%) --iCache cycles*banks: 282943040 (76.293737% used) Issue breakdown: --thread*cycles of issue worked: 196826167 (69.563884%) --thread*cycles of issue failed: 67075252 (23.706274%) --thread*cycles of issue NOP/other: 19041621 (6.729843%) Number of thread-cycles not ready: 21494673 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215867788 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 7 4: 7 5: 7 6: 8 7: 7 8: 8 9: 7 10: 8 11: 8 12: 7 13: 7 14: 7 15: 7 16: 7 17: 7 18: 7 19: 8 20: 7 21: 8 22: 8 23: 7 24: 9 25: 8 26: 7 27: 8 28: 6 29: 7 30: 7 31: 7 <=== Core 56 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5960156 in-flight CPI 1.5230 -- Total Cycles 9077203 ---- Thread 01 ---- PC 5: Stalled ----- 6691252 in-flight CPI 1.3566 -- Total Cycles 9077203 ---- Thread 02 ---- PC 5: Stalled ----- 6586727 in-flight CPI 1.3781 -- Total Cycles 9077203 ---- Thread 03 ---- PC 5: Stalled ----- 6288537 in-flight CPI 1.4434 -- Total Cycles 9077203 ---- Thread 04 ---- PC 5: Stalled ----- 6366775 in-flight CPI 1.4257 -- Total Cycles 9077203 ---- Thread 05 ---- PC 5: Stalled ----- 7124041 in-flight CPI 1.2742 -- Total Cycles 9077203 ---- Thread 06 ---- PC 5: Stalled ----- 6772623 in-flight CPI 1.3403 -- Total Cycles 9077203 ---- Thread 07 ---- PC 5: Stalled ----- 6161307 in-flight CPI 1.4733 -- Total Cycles 9077203 ---- Thread 08 ---- PC 5: Stalled ----- 5906265 in-flight CPI 1.5369 -- Total Cycles 9077203 ---- Thread 09 ---- PC 5: Stalled ----- 6388063 in-flight CPI 1.4210 -- Total Cycles 9077203 ---- Thread 10 ---- PC 5: Stalled ----- 6191770 in-flight CPI 1.4660 -- Total Cycles 9077203 ---- Thread 11 ---- PC 5: Stalled ----- 5996931 in-flight 
CPI 1.5136 -- Total Cycles 9077203 ---- Thread 12 ---- PC 5: Stalled ----- 6543907 in-flight CPI 1.3871 -- Total Cycles 9077203 ---- Thread 13 ---- PC 5: Stalled ----- 5704056 in-flight CPI 1.5914 -- Total Cycles 9077203 ---- Thread 14 ---- PC 5: Stalled ----- 5971882 in-flight CPI 1.5200 -- Total Cycles 9077203 ---- Thread 15 ---- PC 5: Stalled ----- 5975264 in-flight CPI 1.5191 -- Total Cycles 9077203 ---- Thread 16 ---- PC 5: Stalled ----- 5995584 in-flight CPI 1.5140 -- Total Cycles 9077203 ---- Thread 17 ---- PC 5: Stalled ----- 6197188 in-flight CPI 1.4647 -- Total Cycles 9077203 ---- Thread 18 ---- PC 5: Stalled ----- 6130838 in-flight CPI 1.4806 -- Total Cycles 9077203 ---- Thread 19 ---- PC 5: Stalled ----- 5872923 in-flight CPI 1.5456 -- Total Cycles 9077203 ---- Thread 20 ---- PC 5: Stalled ----- 5732129 in-flight CPI 1.5836 -- Total Cycles 9077203 ---- Thread 21 ---- PC 5: Stalled ----- 6573709 in-flight CPI 1.3808 -- Total Cycles 9077203 ---- Thread 22 ---- PC 5: Stalled ----- 6244574 in-flight CPI 1.4536 -- Total Cycles 9077203 ---- Thread 23 ---- PC 5: Stalled ----- 6248703 in-flight CPI 1.4527 -- Total Cycles 9077203 ---- Thread 24 ---- PC 5: Stalled ----- 6165527 in-flight CPI 1.4722 -- Total Cycles 9077203 ---- Thread 25 ---- PC 5: Stalled ----- 6366472 in-flight CPI 1.4258 -- Total Cycles 9077203 ---- Thread 26 ---- PC 5: Stalled ----- 5899701 in-flight CPI 1.5386 -- Total Cycles 9077203 ---- Thread 27 ---- PC 5: Stalled ----- 5345981 in-flight CPI 1.6979 -- Total Cycles 9077203 ---- Thread 28 ---- PC 5: Stalled ----- 5872222 in-flight CPI 1.5458 -- Total Cycles 9077203 ---- Thread 29 ---- PC 5: Stalled ----- 5636622 in-flight CPI 1.6104 -- Total Cycles 9077203 ---- Thread 30 ---- PC 5: Stalled ----- 5333248 in-flight CPI 1.7020 -- Total Cycles 9077203 ---- Thread 31 ---- PC 5: Stalled ----- 5897860 in-flight CPI 1.5391 -- Total Cycles 9077203 Total CPI 0.0463 , IPC 21.6084 -- Total Cycles 9077203 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 430321 (2.020250%) FPSUB: 0 (0.000000%) FPMUL: 1987629 (9.331422%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15077624 (70.785678%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 566319 (2.658726%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 
(0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3230663 (15.167156%) DIV: 7372 (0.034610%) FPUN: 0 (0.000000%) FPRSUB: 460 (0.002160%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215100149 total) ADD%: 8.203 (17645040) SUB%: 0.000 (0) MUL%: 0.000 (200) BITOR%: 1.222 (2628918) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.538 (1157020) FPSUB%: 0.000 (0) FPMUL%: 4.745 (10207427) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (600) FPMAX%: 0.000 (600) LOAD%: 4.943 (10631627) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41620) FPINV%: 0.000 (0) FPCONV%: 0.000 (664) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (2281012) FPLE%: 0.390 (837925) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (600) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27566) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.969 (6386097) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1606766) CMPU%: 0.000 (0) RSUB%: 0.000 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.766 (33912656) ADDIC%: 0.000 (0) ADDIK%: 
0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2652247) ORI%: 1.255 (2699624) XORI%: 0.000 (0) MULI%: 3.367 (7241959) LW%: 1.194 (2568258) LWI%: 13.948 (30003058) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (648150) SWI%: 4.106 (8831737) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.484 (3191775) bged%: 0.000 (0) bgeid%: 0.000 (200) bgtd%: 0.000 (0) bgtid%: 0.323 (693781) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (84010) bned%: 0.000 (0) bneid%: 13.709 (29487441) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.738 (1587684) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.085 (183684) DIV%: 0.000 (400) FPUN%: 1.179 (2536426) FPRSUB%: 3.708 (7976571) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.106 (6680281) FPGE%: 0.795 (1709284) SYNC%: 0.000 (0) NOP%: 8.813 (18956712) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 178 SUB 0 MUL 15 BITOR 3 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 513 FPSUB 0 FPMUL 5202 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 2363750 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 110 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2153 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2217 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3404174 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 846 ORI 587294 XORI 0 MULI 651545 LW 0 LWI 9530070 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 
rtsd 0 FPDIV 1681 DIV 12 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6084 --Total thread-cycles: 290470496 --total thread-cycles issued: 196143437 (67.526115%) --iCache conflicts: 6607160 (2.274641%) --thread*cycles of FU dependence: 16550212 (5.697726%) --thread*cycles of data dependence: 21300388 (7.333064%) --iCache cycles*banks: 290470496 (74.052334% used) Issue breakdown: --thread*cycles of issue worked: 196143437 (67.526114%) --thread*cycles of issue failed: 75370347 (25.947677%) --thread*cycles of issue NOP/other: 18956712 (6.526209%) Number of thread-cycles not ready: 21300388 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215100149 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 7 4: 8 5: 8 6: 8 7: 7 8: 8 9: 7 10: 7 11: 7 12: 8 13: 7 14: 7 15: 7 16: 8 17: 8 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 6 28: 8 29: 7 30: 7 31: 7 <=== Core 57 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6369277 in-flight CPI 1.3961 -- Total Cycles 8892326 ---- Thread 01 ---- PC 5: Stalled ----- 6419677 in-flight CPI 1.3852 -- Total Cycles 8892326 ---- Thread 02 ---- PC 5: Stalled ----- 5859743 in-flight CPI 1.5175 -- Total Cycles 8892326 ---- Thread 03 ---- PC 5: Stalled ----- 6671529 in-flight CPI 1.3329 -- Total Cycles 8892326 ---- Thread 04 ---- PC 5: Stalled ----- 6820173 in-flight CPI 1.3038 -- Total Cycles 8892326 ---- Thread 05 ---- PC 5: Stalled ----- 6292687 in-flight CPI 1.4131 -- Total Cycles 8892326 ---- Thread 06 ---- PC 5: Stalled ----- 6039059 in-flight CPI 1.4725 -- Total Cycles 8892326 ---- Thread 07 ---- PC 5: Stalled ----- 6281392 in-flight CPI 1.4157 -- Total Cycles 8892326 ---- Thread 08 ---- PC 5: Stalled ----- 6554259 in-flight CPI 1.3567 -- Total Cycles 8892326 ---- Thread 09 ---- PC 5: Stalled ----- 5878769 in-flight CPI 1.5126 -- Total Cycles 8892326 ---- Thread 10 ---- PC 5: Stalled ----- 
6515155 in-flight CPI 1.3649 -- Total Cycles 8892326 ---- Thread 11 ---- PC 5: Stalled ----- 5781410 in-flight CPI 1.5381 -- Total Cycles 8892326 ---- Thread 12 ---- PC 5: Stalled ----- 6103376 in-flight CPI 1.4569 -- Total Cycles 8892326 ---- Thread 13 ---- PC 5: Stalled ----- 6473756 in-flight CPI 1.3736 -- Total Cycles 8892326 ---- Thread 14 ---- PC 5: Stalled ----- 6104171 in-flight CPI 1.4568 -- Total Cycles 8892326 ---- Thread 15 ---- PC 5: Stalled ----- 5924433 in-flight CPI 1.5010 -- Total Cycles 8892326 ---- Thread 16 ---- PC 5: Stalled ----- 5665736 in-flight CPI 1.5695 -- Total Cycles 8892326 ---- Thread 17 ---- PC 5: Stalled ----- 5722048 in-flight CPI 1.5540 -- Total Cycles 8892326 ---- Thread 18 ---- PC 5: Stalled ----- 5737467 in-flight CPI 1.5499 -- Total Cycles 8892326 ---- Thread 19 ---- PC 5: Stalled ----- 5711526 in-flight CPI 1.5569 -- Total Cycles 8892326 ---- Thread 20 ---- PC 5: Stalled ----- 5926880 in-flight CPI 1.5003 -- Total Cycles 8892326 ---- Thread 21 ---- PC 5: Stalled ----- 6122505 in-flight CPI 1.4524 -- Total Cycles 8892326 ---- Thread 22 ---- PC 5: Stalled ----- 5533612 in-flight CPI 1.6070 -- Total Cycles 8892326 ---- Thread 23 ---- PC 5: Stalled ----- 6154655 in-flight CPI 1.4448 -- Total Cycles 8892326 ---- Thread 24 ---- PC 5: Stalled ----- 5801017 in-flight CPI 1.5329 -- Total Cycles 8892326 ---- Thread 25 ---- PC 5: Stalled ----- 5365355 in-flight CPI 1.6574 -- Total Cycles 8892326 ---- Thread 26 ---- PC 5: Stalled ----- 5488636 in-flight CPI 1.6201 -- Total Cycles 8892326 ---- Thread 27 ---- PC 5: Stalled ----- 6525028 in-flight CPI 1.3628 -- Total Cycles 8892326 ---- Thread 28 ---- PC 5: Stalled ----- 5403972 in-flight CPI 1.6455 -- Total Cycles 8892326 ---- Thread 29 ---- PC 5: Stalled ----- 5463401 in-flight CPI 1.6276 -- Total Cycles 8892326 ---- Thread 30 ---- PC 5: Stalled ----- 5347485 in-flight CPI 1.6629 -- Total Cycles 8892326 ---- Thread 31 ---- PC 5: Stalled ----- 5883630 in-flight CPI 1.5114 -- Total Cycles 
8892326 Total CPI 0.0463 , IPC 21.5852 -- Total Cycles 8892326 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 425329 (2.025406%) FPSUB: 0 (0.000000%) FPMUL: 1949422 (9.283100%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14847776 (70.704746%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 555868 (2.647030%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) 
sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3213261 (15.301470%) DIV: 7565 (0.036024%) FPUN: 0 (0.000000%) FPRSUB: 467 (0.002224%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (210509241 total) ADD%: 8.172 (17203725) SUB%: 0.000 (0) MUL%: 0.000 (205) BITOR%: 1.226 (2580512) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (1142343) FPSUB%: 0.000 (0) FPMUL%: 4.754 (10008002) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.952 (10424528) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40922) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (2234791) FPLE%: 0.393 (827720) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27482) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.965 (6241174) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1572431) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 
(0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.770 (33197726) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2591792) ORI%: 1.259 (2649375) XORI%: 0.000 (0) MULI%: 3.362 (7077767) LW%: 1.192 (2510236) LWI%: 13.934 (29332587) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (634522) SWI%: 4.104 (8639274) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3118239) bged%: 0.000 (0) bgeid%: 0.000 (205) bgtd%: 0.000 (0) bgtid%: 0.323 (679300) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (84736) bned%: 0.000 (0) bneid%: 13.712 (28864631) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.742 (1562501) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (182688) DIV%: 0.000 (410) FPUN%: 1.183 (2490560) FPRSUB%: 3.710 (7809633) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.105 (6535846) FPGE%: 0.795 (1673506) SYNC%: 0.000 (0) NOP%: 8.820 (18566807) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 174 SUB 0 MUL 13 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 504 FPSUB 0 FPMUL 4872 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 2309364 INTCONV 0 ATOMIC_INC 3 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 106 FPINV 0 FPCONV 13 FPEQ 0 FPNE 0 FPLT 10 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1952 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2311 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3328824 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 777 ORI 580071 XORI 0 MULI 633228 LW 0 LWI 9324037 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 
0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1702 DIV 15 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5852 --Total thread-cycles: 284554432 --total thread-cycles issued: 191942434 (67.453679%) --iCache conflicts: 6478637 (2.276765%) --thread*cycles of FU dependence: 16188390 (5.689031%) --thread*cycles of data dependence: 20999688 (7.379849%) --iCache cycles*banks: 284554432 (73.978561% used) Issue breakdown: --thread*cycles of issue worked: 191942434 (67.453679%) --thread*cycles of issue failed: 74045191 (26.021451%) --thread*cycles of issue NOP/other: 18566807 (6.524870%) Number of thread-cycles not ready: 20999688 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 210509241 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 8 4: 9 5: 7 6: 8 7: 8 8: 9 9: 7 10: 8 11: 7 12: 7 13: 9 14: 8 15: 7 16: 7 17: 7 18: 8 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 6 26: 7 27: 7 28: 8 29: 7 30: 6 31: 8 <=== Core 58 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6501386 in-flight CPI 1.3767 -- Total Cycles 8950560 ---- Thread 01 ---- PC 5: Stalled ----- 5966353 in-flight CPI 1.5002 -- Total Cycles 8950560 ---- Thread 02 ---- PC 5: Stalled ----- 5889239 in-flight CPI 1.5198 -- Total Cycles 8950560 ---- Thread 03 ---- PC 5: Stalled ----- 6532680 in-flight CPI 1.3701 -- Total Cycles 8950560 ---- Thread 04 ---- PC 5: Stalled ----- 6539964 in-flight CPI 1.3686 -- Total Cycles 8950560 ---- Thread 05 ---- PC 5: Stalled ----- 5897384 in-flight CPI 1.5177 -- Total Cycles 8950560 ---- Thread 06 ---- PC 5: Stalled ----- 5766558 in-flight CPI 1.5521 -- Total Cycles 8950560 ---- Thread 07 ---- PC 5: Stalled ----- 6274973 in-flight CPI 1.4264 -- Total Cycles 8950560 ---- Thread 08 ---- PC 5: Stalled ----- 6024395 in-flight CPI 1.4857 -- Total Cycles 8950560 ---- Thread 09 ---- PC 5: 
Stalled ----- 6096202 in-flight CPI 1.4682 -- Total Cycles 8950560 ---- Thread 10 ---- PC 5: Stalled ----- 6375545 in-flight CPI 1.4039 -- Total Cycles 8950560 ---- Thread 11 ---- PC 5: Stalled ----- 6576225 in-flight CPI 1.3610 -- Total Cycles 8950560 ---- Thread 12 ---- PC 5: Stalled ----- 6633600 in-flight CPI 1.3493 -- Total Cycles 8950560 ---- Thread 13 ---- PC 5: Stalled ----- 5809381 in-flight CPI 1.5407 -- Total Cycles 8950560 ---- Thread 14 ---- PC 5: Stalled ----- 6570251 in-flight CPI 1.3623 -- Total Cycles 8950560 ---- Thread 15 ---- PC 5: Stalled ----- 6147317 in-flight CPI 1.4560 -- Total Cycles 8950560 ---- Thread 16 ---- PC 5: Stalled ----- 5730776 in-flight CPI 1.5618 -- Total Cycles 8950560 ---- Thread 17 ---- PC 5: Stalled ----- 6067416 in-flight CPI 1.4752 -- Total Cycles 8950560 ---- Thread 18 ---- PC 5: Stalled ----- 6706883 in-flight CPI 1.3345 -- Total Cycles 8950560 ---- Thread 19 ---- PC 5: Stalled ----- 6336168 in-flight CPI 1.4126 -- Total Cycles 8950560 ---- Thread 20 ---- PC 5: Stalled ----- 5990537 in-flight CPI 1.4941 -- Total Cycles 8950560 ---- Thread 21 ---- PC 5: Stalled ----- 6252715 in-flight CPI 1.4315 -- Total Cycles 8950560 ---- Thread 22 ---- PC 5: Stalled ----- 6148451 in-flight CPI 1.4557 -- Total Cycles 8950560 ---- Thread 23 ---- PC 5: Stalled ----- 5877165 in-flight CPI 1.5229 -- Total Cycles 8950560 ---- Thread 24 ---- PC 5: Stalled ----- 5493510 in-flight CPI 1.6293 -- Total Cycles 8950560 ---- Thread 25 ---- PC 5: Stalled ----- 6166631 in-flight CPI 1.4514 -- Total Cycles 8950560 ---- Thread 26 ---- PC 5: Stalled ----- 6168980 in-flight CPI 1.4509 -- Total Cycles 8950560 ---- Thread 27 ---- PC 5: Stalled ----- 5870265 in-flight CPI 1.5247 -- Total Cycles 8950560 ---- Thread 28 ---- PC 5: Stalled ----- 5963376 in-flight CPI 1.5009 -- Total Cycles 8950560 ---- Thread 29 ---- PC 5: Stalled ----- 5705884 in-flight CPI 1.5686 -- Total Cycles 8950560 ---- Thread 30 ---- PC 5: Stalled ----- 5422966 in-flight CPI 1.6505 -- 
Total Cycles 8950560 ---- Thread 31 ---- PC 5: Stalled ----- 6401184 in-flight CPI 1.3983 -- Total Cycles 8950560 Total CPI 0.0457 , IPC 21.8874 -- Total Cycles 8950560 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 436817 (1.997728%) FPSUB: 0 (0.000000%) FPMUL: 1997601 (9.135780%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15576236 (71.235981%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 561008 (2.565700%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 
(0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3286031 (15.028255%) DIV: 7528 (0.034428%) FPUN: 0 (0.000000%) FPRSUB: 465 (0.002127%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (214858750 total) ADD%: 8.195 (17606787) SUB%: 0.000 (0) MUL%: 0.000 (204) BITOR%: 1.224 (2630892) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.545 (1170964) FPSUB%: 0.000 (0) FPMUL%: 4.764 (10236762) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (612) FPMAX%: 0.000 (612) LOAD%: 4.955 (10646212) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41338) FPINV%: 0.000 (0) FPCONV%: 0.000 (676) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2283864) FPLE%: 0.391 (840920) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (612) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27620) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.961 (6362925) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 
(1609622) CMPU%: 0.000 (0) RSUB%: 0.000 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.770 (33882759) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2645276) ORI%: 1.257 (2701164) XORI%: 0.000 (0) MULI%: 3.359 (7216565) LW%: 1.191 (2559058) LWI%: 13.912 (29891661) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (647001) SWI%: 4.093 (8794578) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3178770) bged%: 0.000 (0) bgeid%: 0.000 (204) bgtd%: 0.000 (0) bgtid%: 0.323 (694006) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85834) bned%: 0.000 (0) bneid%: 13.714 (29466307) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.744 (1598110) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (186915) DIV%: 0.000 (408) FPUN%: 1.180 (2536169) FPRSUB%: 3.712 (7976295) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (0) FPGT%: 3.107 (6676735) FPGE%: 0.794 (1705999) SYNC%: 0.000 (0) NOP%: 8.822 (18953778) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 186 SUB 0 MUL 44 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 531 FPSUB 0 FPMUL 5189 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 2358182 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 92 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 10 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1863 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2179 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3395701 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 823 ORI 595920 XORI 0 MULI 
639067 LW 0 LWI 9510593 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1761 DIV 12 FPUN 0 FPRSUB 2 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8875 --Total thread-cycles: 286417920 --total thread-cycles issued: 195904972 (68.398296%) --iCache conflicts: 6561351 (2.290831%) --thread*cycles of FU dependence: 16512572 (5.765202%) --thread*cycles of data dependence: 21865686 (7.634189%) --iCache cycles*banks: 286417920 (75.015831% used) Issue breakdown: --thread*cycles of issue worked: 195904972 (68.398294%) --thread*cycles of issue failed: 71559170 (24.984180%) --thread*cycles of issue NOP/other: 18953778 (6.617525%) Number of thread-cycles not ready: 21865686 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214858750 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 8 3: 8 4: 7 5: 7 6: 7 7: 9 8: 7 9: 8 10: 8 11: 8 12: 7 13: 7 14: 7 15: 7 16: 7 17: 8 18: 9 19: 7 20: 7 21: 7 22: 8 23: 7 24: 6 25: 7 26: 7 27: 7 28: 7 29: 7 30: 6 31: 8 <=== Core 59 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6808739 in-flight CPI 1.3271 -- Total Cycles 9035993 ---- Thread 01 ---- PC 5: Stalled ----- 5920508 in-flight CPI 1.5262 -- Total Cycles 9035993 ---- Thread 02 ---- PC 5: Stalled ----- 5967078 in-flight CPI 1.5143 -- Total Cycles 9035993 ---- Thread 03 ---- PC 5: Stalled ----- 6226274 in-flight CPI 1.4513 -- Total Cycles 9035993 ---- Thread 04 ---- PC 5: Stalled ----- 6878473 in-flight CPI 1.3137 -- Total Cycles 9035993 ---- Thread 05 ---- PC 5: Stalled ----- 5806148 in-flight CPI 1.5563 -- Total Cycles 9035993 ---- Thread 06 ---- PC 5: Stalled ----- 6380077 in-flight CPI 1.4163 -- Total Cycles 9035993 ---- Thread 07 ---- PC 5: Stalled ----- 6966266 in-flight CPI 1.2971 -- Total Cycles 9035993 ---- 
Thread 08 ---- PC 5: Stalled ----- 6229898 in-flight CPI 1.4504 -- Total Cycles 9035993 ---- Thread 09 ---- PC 5: Stalled ----- 6368482 in-flight CPI 1.4189 -- Total Cycles 9035993 ---- Thread 10 ---- PC 5: Stalled ----- 6057727 in-flight CPI 1.4916 -- Total Cycles 9035993 ---- Thread 11 ---- PC 5: Stalled ----- 5738713 in-flight CPI 1.5746 -- Total Cycles 9035993 ---- Thread 12 ---- PC 5: Stalled ----- 6604323 in-flight CPI 1.3682 -- Total Cycles 9035993 ---- Thread 13 ---- PC 5: Stalled ----- 5973058 in-flight CPI 1.5128 -- Total Cycles 9035993 ---- Thread 14 ---- PC 5: Stalled ----- 6954121 in-flight CPI 1.2994 -- Total Cycles 9035993 ---- Thread 15 ---- PC 5: Stalled ----- 6454005 in-flight CPI 1.4001 -- Total Cycles 9035993 ---- Thread 16 ---- PC 5: Stalled ----- 6310307 in-flight CPI 1.4319 -- Total Cycles 9035993 ---- Thread 17 ---- PC 5: Stalled ----- 6506138 in-flight CPI 1.3888 -- Total Cycles 9035993 ---- Thread 18 ---- PC 5: Stalled ----- 6185178 in-flight CPI 1.4609 -- Total Cycles 9035993 ---- Thread 19 ---- PC 5: Stalled ----- 6480016 in-flight CPI 1.3944 -- Total Cycles 9035993 ---- Thread 20 ---- PC 5: Stalled ----- 6504790 in-flight CPI 1.3891 -- Total Cycles 9035993 ---- Thread 21 ---- PC 5: Stalled ----- 5769998 in-flight CPI 1.5660 -- Total Cycles 9035993 ---- Thread 22 ---- PC 5: Stalled ----- 6376729 in-flight CPI 1.4170 -- Total Cycles 9035993 ---- Thread 23 ---- PC 5: Stalled ----- 5941860 in-flight CPI 1.5207 -- Total Cycles 9035993 ---- Thread 24 ---- PC 5: Stalled ----- 6216794 in-flight CPI 1.4535 -- Total Cycles 9035993 ---- Thread 25 ---- PC 5: Stalled ----- 6483514 in-flight CPI 1.3937 -- Total Cycles 9035993 ---- Thread 26 ---- PC 5: Stalled ----- 5803999 in-flight CPI 1.5569 -- Total Cycles 9035993 ---- Thread 27 ---- PC 5: Stalled ----- 5866401 in-flight CPI 1.5403 -- Total Cycles 9035993 ---- Thread 28 ---- PC 5: Stalled ----- 6093221 in-flight CPI 1.4830 -- Total Cycles 9035993 ---- Thread 29 ---- PC 5: Stalled ----- 5839515 
in-flight CPI 1.5474 -- Total Cycles 9035993 ---- Thread 30 ---- PC 5: Stalled ----- 5480420 in-flight CPI 1.6488 -- Total Cycles 9035993 ---- Thread 31 ---- PC 5: Stalled ----- 5354904 in-flight CPI 1.6874 -- Total Cycles 9035993 Total CPI 0.0455 , IPC 21.9730 -- Total Cycles 9035993 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 443637 (2.038249%) FPSUB: 0 (0.000000%) FPMUL: 2026266 (9.309493%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15394229 (70.727368%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 571571 (2.626030%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 
0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3321625 (15.260900%) DIV: 7783 (0.035758%) FPUN: 0 (0.000000%) FPRSUB: 479 (0.002201%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217744441 total) ADD%: 8.199 (17852728) SUB%: 0.000 (0) MUL%: 0.000 (211) BITOR%: 1.229 (2675938) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.546 (1189195) FPSUB%: 0.000 (0) FPMUL%: 4.765 (10376351) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (633) FPMAX%: 0.000 (633) LOAD%: 4.946 (10768957) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (42082) FPINV%: 0.000 (0) FPCONV%: 0.000 (697) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2315357) FPLE%: 0.393 (856783) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (633) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28264) COS%: 0.000 (0) SIN%: 0.000 
(0) ADDC%: 0.000 (0) ADDK%: 2.961 (6448131) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1627367) CMPU%: 0.000 (0) RSUB%: 0.000 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.761 (34319722) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2678455) ORI%: 1.265 (2754220) XORI%: 0.000 (0) MULI%: 3.359 (7313644) LW%: 1.191 (2593390) LWI%: 13.920 (30310959) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (655648) SWI%: 4.099 (8924654) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3221480) bged%: 0.000 (0) bgeid%: 0.000 (211) bgtd%: 0.000 (0) bgtid%: 0.323 (702744) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (88085) bned%: 0.000 (0) bneid%: 13.710 (29853014) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.738 (1607487) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (188893) DIV%: 0.000 (422) FPUN%: 1.186 (2582238) FPRSUB%: 3.711 (8080494) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.101 (6751613) FPGE%: 0.797 (1736422) SYNC%: 0.000 (0) NOP%: 8.816 (19196134) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 181 SUB 0 MUL 19 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 519 FPSUB 0 FPMUL 5464 FPCMPLT 0 FPMIN 0 FPMAX 411 LOAD 2378918 INTCONV 0 ATOMIC_INC 3 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 111 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 10 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1984 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 2 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2332 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 
3441185 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 816 ORI 605346 XORI 0 MULI 645370 LW 0 LWI 9633190 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1769 DIV 11 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.9730 --Total thread-cycles: 289151776 --total thread-cycles issued: 198548307 (68.665774%) --iCache conflicts: 6652327 (2.300635%) --thread*cycles of FU dependence: 16717676 (5.781627%) --thread*cycles of data dependence: 21765590 (7.527393%) --iCache cycles*banks: 289151776 (75.304560% used) Issue breakdown: --thread*cycles of issue worked: 198548307 (68.665775%) --thread*cycles of issue failed: 71407335 (24.695451%) --thread*cycles of issue NOP/other: 19196134 (6.638774%) Number of thread-cycles not ready: 21765590 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217744441 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 9 2: 7 3: 9 4: 8 5: 7 6: 7 7: 8 8: 7 9: 8 10: 7 11: 7 12: 7 13: 7 14: 8 15: 8 16: 7 17: 7 18: 7 19: 8 20: 7 21: 7 22: 9 23: 7 24: 7 25: 8 26: 8 27: 7 28: 8 29: 7 30: 7 31: 9 <=== Core 60 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6024436 in-flight CPI 1.5009 -- Total Cycles 9041965 ---- Thread 01 ---- PC 5: Stalled ----- 6578419 in-flight CPI 1.3745 -- Total Cycles 9041965 ---- Thread 02 ---- PC 5: Stalled ----- 5936732 in-flight CPI 1.5231 -- Total Cycles 9041965 ---- Thread 03 ---- PC 5: Stalled ----- 6561950 in-flight CPI 1.3779 -- Total Cycles 9041965 ---- Thread 04 ---- PC 5: Stalled ----- 6423535 in-flight CPI 1.4076 -- Total Cycles 9041965 ---- Thread 05 ---- PC 5: Stalled ----- 6218228 in-flight CPI 1.4541 -- Total Cycles 9041965 ---- Thread 06 ---- PC 5: Stalled ----- 6121159 in-flight CPI 1.4772 -- Total Cycles 
9041965 ---- Thread 07 ---- PC 5: Stalled ----- 6107852 in-flight CPI 1.4804 -- Total Cycles 9041965 ---- Thread 08 ---- PC 5: Stalled ----- 6001658 in-flight CPI 1.5066 -- Total Cycles 9041965 ---- Thread 09 ---- PC 5: Stalled ----- 6136841 in-flight CPI 1.4734 -- Total Cycles 9041965 ---- Thread 10 ---- PC 5: Stalled ----- 6108827 in-flight CPI 1.4801 -- Total Cycles 9041965 ---- Thread 11 ---- PC 5: Stalled ----- 6092622 in-flight CPI 1.4841 -- Total Cycles 9041965 ---- Thread 12 ---- PC 5: Stalled ----- 5979798 in-flight CPI 1.5121 -- Total Cycles 9041965 ---- Thread 13 ---- PC 5: Stalled ----- 6779779 in-flight CPI 1.3337 -- Total Cycles 9041965 ---- Thread 14 ---- PC 5: Stalled ----- 5972202 in-flight CPI 1.5140 -- Total Cycles 9041965 ---- Thread 15 ---- PC 5: Stalled ----- 5857741 in-flight CPI 1.5436 -- Total Cycles 9041965 ---- Thread 16 ---- PC 5: Stalled ----- 6137365 in-flight CPI 1.4733 -- Total Cycles 9041965 ---- Thread 17 ---- PC 5: Stalled ----- 6497405 in-flight CPI 1.3916 -- Total Cycles 9041965 ---- Thread 18 ---- PC 5: Stalled ----- 6146502 in-flight CPI 1.4711 -- Total Cycles 9041965 ---- Thread 19 ---- PC 5: Stalled ----- 6801910 in-flight CPI 1.3293 -- Total Cycles 9041965 ---- Thread 20 ---- PC 5: Stalled ----- 5549552 in-flight CPI 1.6293 -- Total Cycles 9041965 ---- Thread 21 ---- PC 5: Stalled ----- 6349480 in-flight CPI 1.4240 -- Total Cycles 9041965 ---- Thread 22 ---- PC 5: Stalled ----- 5778193 in-flight CPI 1.5648 -- Total Cycles 9041965 ---- Thread 23 ---- PC 5: Stalled ----- 6000566 in-flight CPI 1.5068 -- Total Cycles 9041965 ---- Thread 24 ---- PC 5: Stalled ----- 6622819 in-flight CPI 1.3653 -- Total Cycles 9041965 ---- Thread 25 ---- PC 5: Stalled ----- 6639511 in-flight CPI 1.3618 -- Total Cycles 9041965 ---- Thread 26 ---- PC 5: Stalled ----- 6413978 in-flight CPI 1.4097 -- Total Cycles 9041965 ---- Thread 27 ---- PC 5: Stalled ----- 5735506 in-flight CPI 1.5765 -- Total Cycles 9041965 ---- Thread 28 ---- PC 5: Stalled 
----- 5498398 in-flight CPI 1.6445 -- Total Cycles 9041965 ---- Thread 29 ---- PC 5: Stalled ----- 6159157 in-flight CPI 1.4680 -- Total Cycles 9041965 ---- Thread 30 ---- PC 5: Stalled ----- 5279024 in-flight CPI 1.7128 -- Total Cycles 9041965 ---- Thread 31 ---- PC 5: Stalled ----- 5306556 in-flight CPI 1.7039 -- Total Cycles 9041965 Total CPI 0.0462 , IPC 21.6566 -- Total Cycles 9041965 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 436647 (2.032748%) FPSUB: 0 (0.000000%) FPMUL: 1995674 (9.290577%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15195299 (70.739554%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 557799 (2.596754%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 
(0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3287339 (15.303739%) DIV: 7410 (0.034496%) FPUN: 0 (0.000000%) FPRSUB: 458 (0.002132%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (214752016 total) ADD%: 8.198 (17604595) SUB%: 0.000 (0) MUL%: 0.000 (201) BITOR%: 1.222 (2624369) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.545 (1169572) FPSUB%: 0.000 (0) FPMUL%: 4.765 (10232677) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (603) FPMAX%: 0.000 (603) LOAD%: 4.957 (10646184) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41155) FPINV%: 0.000 (0) FPCONV%: 0.000 (667) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2281789) FPLE%: 0.392 (842736) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (603) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 
0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27676) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.964 (6364945) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1607322) CMPU%: 0.000 (0) RSUB%: 0.000 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (33860902) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2644569) ORI%: 1.259 (2702756) XORI%: 0.000 (0) MULI%: 3.360 (7215148) LW%: 1.192 (2559802) LWI%: 13.923 (29898982) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (647860) SWI%: 4.099 (8803385) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3178961) bged%: 0.000 (0) bgeid%: 0.000 (201) bgtd%: 0.000 (0) bgtid%: 0.323 (694244) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86903) bned%: 0.000 (0) bneid%: 13.703 (29428430) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.742 (1592623) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (187023) DIV%: 0.000 (402) FPUN%: 1.179 (2531109) FPRSUB%: 3.713 (7974487) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (0) FPGT%: 3.104 (6665094) FPGE%: 0.791 (1699196) SYNC%: 0.000 (0) NOP%: 8.817 (18933712) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 183 SUB 0 MUL 14 BITOR 9 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 532 FPSUB 0 FPMUL 5291 FPCMPLT 0 FPMIN 0 FPMAX 392 LOAD 2378958 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 108 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 4 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1868 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 
1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2261 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3393689 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 825 ORI 597208 XORI 0 MULI 640452 LW 0 LWI 9505256 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1839 DIV 19 FPUN 0 FPRSUB 5 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6566 --Total thread-cycles: 289342880 --total thread-cycles issued: 195818304 (67.676904%) --iCache conflicts: 6574664 (2.272274%) --thread*cycles of FU dependence: 16528939 (5.712578%) --thread*cycles of data dependence: 21480626 (7.423935%) --iCache cycles*banks: 289342880 (74.220609% used) Issue breakdown: --thread*cycles of issue worked: 195818304 (67.676904%) --thread*cycles of issue failed: 74590864 (25.779402%) --thread*cycles of issue NOP/other: 18933712 (6.543694%) Number of thread-cycles not ready: 21480626 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214752016 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 8 4: 8 5: 9 6: 7 7: 7 8: 7 9: 8 10: 7 11: 7 12: 7 13: 7 14: 8 15: 7 16: 7 17: 8 18: 7 19: 8 20: 7 21: 7 22: 7 23: 7 24: 8 25: 7 26: 7 27: 7 28: 7 29: 7 30: 6 31: 7 <=== Core 61 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6692897 in-flight CPI 1.3496 -- Total Cycles 9032678 ---- Thread 01 ---- PC 5: Stalled ----- 6713798 in-flight CPI 1.3454 -- Total Cycles 9032678 ---- Thread 02 ---- PC 5: Stalled ----- 6302305 in-flight CPI 1.4332 -- Total Cycles 9032678 ---- Thread 03 ---- PC 5: Stalled ----- 6233930 in-flight CPI 1.4490 -- Total Cycles 9032678 ---- Thread 04 ---- PC 5: Stalled ----- 5997317 in-flight CPI 1.5061 -- Total Cycles 9032678 ---- Thread 05 ---- PC 5: Stalled ----- 6568788 in-flight CPI 1.3751 -- 
Total Cycles 9032678 ---- Thread 06 ---- PC 5: Stalled ----- 6740051 in-flight CPI 1.3401 -- Total Cycles 9032678 ---- Thread 07 ---- PC 5: Stalled ----- 6232621 in-flight CPI 1.4493 -- Total Cycles 9032678 ---- Thread 08 ---- PC 5: Stalled ----- 6552298 in-flight CPI 1.3785 -- Total Cycles 9032678 ---- Thread 09 ---- PC 5: Stalled ----- 6456646 in-flight CPI 1.3990 -- Total Cycles 9032678 ---- Thread 10 ---- PC 5: Stalled ----- 5893490 in-flight CPI 1.5326 -- Total Cycles 9032678 ---- Thread 11 ---- PC 5: Stalled ----- 6370532 in-flight CPI 1.4179 -- Total Cycles 9032678 ---- Thread 12 ---- PC 5: Stalled ----- 7006189 in-flight CPI 1.2892 -- Total Cycles 9032678 ---- Thread 13 ---- PC 5: Stalled ----- 6053632 in-flight CPI 1.4921 -- Total Cycles 9032678 ---- Thread 14 ---- PC 5: Stalled ----- 5996491 in-flight CPI 1.5063 -- Total Cycles 9032678 ---- Thread 15 ---- PC 5: Stalled ----- 6174113 in-flight CPI 1.4630 -- Total Cycles 9032678 ---- Thread 16 ---- PC 5: Stalled ----- 6101493 in-flight CPI 1.4804 -- Total Cycles 9032678 ---- Thread 17 ---- PC 5: Stalled ----- 5747245 in-flight CPI 1.5716 -- Total Cycles 9032678 ---- Thread 18 ---- PC 5: Stalled ----- 5789244 in-flight CPI 1.5602 -- Total Cycles 9032678 ---- Thread 19 ---- PC 5: Stalled ----- 6067897 in-flight CPI 1.4886 -- Total Cycles 9032678 ---- Thread 20 ---- PC 5: Stalled ----- 6005293 in-flight CPI 1.5041 -- Total Cycles 9032678 ---- Thread 21 ---- PC 5: Stalled ----- 6063401 in-flight CPI 1.4897 -- Total Cycles 9032678 ---- Thread 22 ---- PC 5: Stalled ----- 5911820 in-flight CPI 1.5279 -- Total Cycles 9032678 ---- Thread 23 ---- PC 5: Stalled ----- 5678578 in-flight CPI 1.5907 -- Total Cycles 9032678 ---- Thread 24 ---- PC 5: Stalled ----- 5484142 in-flight CPI 1.6470 -- Total Cycles 9032678 ---- Thread 25 ---- PC 5: Stalled ----- 5931416 in-flight CPI 1.5228 -- Total Cycles 9032678 ---- Thread 26 ---- PC 5: Stalled ----- 5999562 in-flight CPI 1.5056 -- Total Cycles 9032678 ---- Thread 27 ---- PC 5: 
Stalled ----- 6371959 in-flight CPI 1.4176 -- Total Cycles 9032678 ---- Thread 28 ---- PC 5: Stalled ----- 6317279 in-flight CPI 1.4298 -- Total Cycles 9032678 ---- Thread 29 ---- PC 5: Stalled ----- 5322729 in-flight CPI 1.6970 -- Total Cycles 9032678 ---- Thread 30 ---- PC 5: Stalled ----- 5769011 in-flight CPI 1.5657 -- Total Cycles 9032678 ---- Thread 31 ---- PC 5: Stalled ----- 6050665 in-flight CPI 1.4928 -- Total Cycles 9032678 Total CPI 0.0459 , IPC 21.7651 -- Total Cycles 9032678 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 440472 (2.067468%) FPSUB: 0 (0.000000%) FPMUL: 2007852 (9.424368%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14985952 (70.340404%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 561073 (2.633540%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 
(0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3301634 (15.497065%) DIV: 7454 (0.034987%) FPUN: 0 (0.000000%) FPRSUB: 462 (0.002169%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215610479 total) ADD%: 8.176 (17629292) SUB%: 0.000 (0) MUL%: 0.000 (202) BITOR%: 1.227 (2646512) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (1179814) FPSUB%: 0.000 (0) FPMUL%: 4.771 (10287554) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (606) FPMAX%: 0.000 (606) LOAD%: 4.953 (10680237) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41344) FPINV%: 0.000 (0) FPCONV%: 0.000 (670) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2292319) FPLE%: 0.391 (843967) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (606) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) 
TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27728) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.962 (6387006) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1616361) CMPU%: 0.000 (0) RSUB%: 0.000 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (33996357) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2655410) ORI%: 1.265 (2727417) XORI%: 0.000 (0) MULI%: 3.358 (7239548) LW%: 1.191 (2568656) LWI%: 13.917 (30006588) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (649526) SWI%: 4.097 (8833319) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3190684) bged%: 0.000 (0) bgeid%: 0.000 (202) bgtd%: 0.000 (0) bgtid%: 0.323 (696713) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86555) bned%: 0.000 (0) bneid%: 13.711 (29562892) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.739 (1593135) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (187849) DIV%: 0.000 (404) FPUN%: 1.183 (2551254) FPRSUB%: 3.714 (8008735) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.102 (6688714) FPGE%: 0.797 (1718121) SYNC%: 0.000 (0) NOP%: 8.818 (19013041) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 168 SUB 0 MUL 24 BITOR 7 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 531 FPSUB 0 FPMUL 5467 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 2353669 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 113 FPINV 0 FPCONV 15 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1812 LOADIMM 0 SPHERE_TEST 0 
TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2286 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3407407 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 797 ORI 602034 XORI 0 MULI 645177 LW 0 LWI 9540417 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1744 DIV 16 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7651 --Total thread-cycles: 289045696 --total thread-cycles issued: 196597438 (68.016041%) --iCache conflicts: 6634725 (2.295390%) --thread*cycles of FU dependence: 16562112 (5.729929%) --thread*cycles of data dependence: 21304899 (7.370772%) --iCache cycles*banks: 289045696 (74.593919% used) Issue breakdown: --thread*cycles of issue worked: 196597438 (68.016041%) --thread*cycles of issue failed: 73435217 (25.406093%) --thread*cycles of issue NOP/other: 19013041 (6.577867%) Number of thread-cycles not ready: 21304899 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215610479 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 7 4: 7 5: 7 6: 7 7: 8 8: 8 9: 8 10: 7 11: 9 12: 8 13: 8 14: 7 15: 7 16: 8 17: 6 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 8 25: 7 26: 7 27: 7 28: 7 29: 6 30: 7 31: 7 <=== Core 62 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5877392 in-flight CPI 1.5402 -- Total Cycles 9052480 ---- Thread 01 ---- PC 5: Stalled ----- 6237815 in-flight CPI 1.4512 -- Total Cycles 9052480 ---- Thread 02 ---- PC 5: Stalled ----- 6580681 in-flight CPI 1.3756 -- Total Cycles 9052480 ---- Thread 03 ---- PC 5: Stalled ----- 6343960 in-flight CPI 1.4269 -- Total Cycles 9052480 ---- Thread 04 ---- PC 5: Stalled ----- 6149648 in-flight CPI 
1.4720 -- Total Cycles 9052480 ---- Thread 05 ---- PC 5: Stalled ----- 6494263 in-flight CPI 1.3939 -- Total Cycles 9052480 ---- Thread 06 ---- PC 5: Stalled ----- 7101105 in-flight CPI 1.2748 -- Total Cycles 9052480 ---- Thread 07 ---- PC 5: Stalled ----- 5914367 in-flight CPI 1.5306 -- Total Cycles 9052480 ---- Thread 08 ---- PC 5: Stalled ----- 5887468 in-flight CPI 1.5376 -- Total Cycles 9052480 ---- Thread 09 ---- PC 5: Stalled ----- 5999284 in-flight CPI 1.5089 -- Total Cycles 9052480 ---- Thread 10 ---- PC 5: Stalled ----- 5881957 in-flight CPI 1.5390 -- Total Cycles 9052480 ---- Thread 11 ---- PC 5: Stalled ----- 6194418 in-flight CPI 1.4614 -- Total Cycles 9052480 ---- Thread 12 ---- PC 5: Stalled ----- 5940184 in-flight CPI 1.5239 -- Total Cycles 9052480 ---- Thread 13 ---- PC 5: Stalled ----- 5720023 in-flight CPI 1.5826 -- Total Cycles 9052480 ---- Thread 14 ---- PC 5: Stalled ----- 5805992 in-flight CPI 1.5592 -- Total Cycles 9052480 ---- Thread 15 ---- PC 5: Stalled ----- 5668058 in-flight CPI 1.5971 -- Total Cycles 9052480 ---- Thread 16 ---- PC 5: Stalled ----- 6609823 in-flight CPI 1.3695 -- Total Cycles 9052480 ---- Thread 17 ---- PC 5: Stalled ----- 5751532 in-flight CPI 1.5739 -- Total Cycles 9052480 ---- Thread 18 ---- PC 5: Stalled ----- 5617879 in-flight CPI 1.6114 -- Total Cycles 9052480 ---- Thread 19 ---- PC 5: Stalled ----- 5966410 in-flight CPI 1.5172 -- Total Cycles 9052480 ---- Thread 20 ---- PC 5: Stalled ----- 5965492 in-flight CPI 1.5175 -- Total Cycles 9052480 ---- Thread 21 ---- PC 5: Stalled ----- 5617175 in-flight CPI 1.6116 -- Total Cycles 9052480 ---- Thread 22 ---- PC 5: Stalled ----- 5722094 in-flight CPI 1.5820 -- Total Cycles 9052480 ---- Thread 23 ---- PC 5: Stalled ----- 5563366 in-flight CPI 1.6272 -- Total Cycles 9052480 ---- Thread 24 ---- PC 5: Stalled ----- 6133897 in-flight CPI 1.4758 -- Total Cycles 9052480 ---- Thread 25 ---- PC 5: Stalled ----- 5636428 in-flight CPI 1.6061 -- Total Cycles 9052480 ---- Thread 26 
---- PC 5: Stalled ----- 6194584 in-flight CPI 1.4613 -- Total Cycles 9052480 ---- Thread 27 ---- PC 5: Stalled ----- 6210514 in-flight CPI 1.4576 -- Total Cycles 9052480 ---- Thread 28 ---- PC 5: Stalled ----- 5677250 in-flight CPI 1.5945 -- Total Cycles 9052480 ---- Thread 29 ---- PC 5: Stalled ----- 6053231 in-flight CPI 1.4955 -- Total Cycles 9052480 ---- Thread 30 ---- PC 5: Stalled ----- 6482326 in-flight CPI 1.3965 -- Total Cycles 9052480 ---- Thread 31 ---- PC 5: Stalled ----- 6250924 in-flight CPI 1.4482 -- Total Cycles 9052480 Total CPI 0.0468 , IPC 21.3478 -- Total Cycles 9052480 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 434156 (2.081280%) FPSUB: 0 (0.000000%) FPMUL: 1977478 (9.479737%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14621751 (70.094509%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 559661 (2.682932%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 
(0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3258975 (15.623044%) DIV: 7564 (0.036261%) FPUN: 0 (0.000000%) FPRSUB: 467 (0.002239%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (211932015 total) ADD%: 8.196 (17369396) SUB%: 0.000 (0) MUL%: 0.000 (205) BITOR%: 1.225 (2597024) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (1163895) FPSUB%: 0.000 (0) FPMUL%: 4.772 (10114403) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.950 (10491628) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41122) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (2255453) FPLE%: 0.389 (824072) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 
(0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27552) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.962 (6277201) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1586776) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.759 (33397647) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2608764) ORI%: 1.266 (2683496) XORI%: 0.000 (0) MULI%: 3.359 (7119519) LW%: 1.191 (2524638) LWI%: 13.919 (29498154) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (637568) SWI%: 4.098 (8685491) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3136984) bged%: 0.000 (0) bgeid%: 0.000 (205) bgtd%: 0.000 (0) bgtid%: 0.322 (683080) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85825) bned%: 0.000 (0) bneid%: 13.708 (29050920) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.737 (1561391) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (185391) DIV%: 0.000 (410) FPUN%: 1.182 (2504500) FPRSUB%: 3.713 (7869516) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.102 (6573735) FPGE%: 0.798 (1691129) SYNC%: 0.000 (0) NOP%: 8.815 (18681860) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 170 SUB 0 MUL 31 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 574 FPSUB 0 FPMUL 5173 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 2293223 INTCONV 0 ATOMIC_INC 5 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 120 FPINV 0 
FPCONV 15 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1788 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2281 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3346182 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 779 ORI 593303 XORI 0 MULI 637588 LW 0 LWI 9375841 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1724 DIV 11 FPUN 0 FPRSUB 2 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.3478 --Total thread-cycles: 289679360 --total thread-cycles issued: 193250155 (66.711746%) --iCache conflicts: 6500046 (2.243876%) --thread*cycles of FU dependence: 16259251 (5.612844%) --thread*cycles of data dependence: 20860052 (7.201083%) --iCache cycles*banks: 289679360 (73.160907% used) Issue breakdown: --thread*cycles of issue worked: 193250155 (66.711745%) --thread*cycles of issue failed: 77747345 (26.839104%) --thread*cycles of issue NOP/other: 18681860 (6.449151%) Number of thread-cycles not ready: 20860052 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 211932015 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 7 4: 8 5: 8 6: 8 7: 7 8: 7 9: 7 10: 8 11: 7 12: 7 13: 7 14: 7 15: 7 16: 7 17: 7 18: 6 19: 7 20: 7 21: 9 22: 7 23: 8 24: 7 25: 7 26: 8 27: 8 28: 7 29: 7 30: 9 31: 7 <=== Core 63 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6091425 in-flight CPI 1.5043 -- Total Cycles 9163104 ---- Thread 01 ---- PC 5: Stalled ----- 7019143 in-flight CPI 1.3054 -- Total Cycles 9163104 ---- Thread 02 ---- PC 5: Stalled ----- 6765013 in-flight CPI 1.3545 -- Total Cycles 9163104 ---- Thread 03 ---- PC 5: Stalled ----- 6604851 
in-flight CPI 1.3873 -- Total Cycles 9163104 ---- Thread 04 ---- PC 5: Stalled ----- 6270350 in-flight CPI 1.4613 -- Total Cycles 9163104 ---- Thread 05 ---- PC 5: Stalled ----- 7155099 in-flight CPI 1.2806 -- Total Cycles 9163104 ---- Thread 06 ---- PC 5: Stalled ----- 6347414 in-flight CPI 1.4436 -- Total Cycles 9163104 ---- Thread 07 ---- PC 5: Stalled ----- 7158013 in-flight CPI 1.2801 -- Total Cycles 9163104 ---- Thread 08 ---- PC 5: Stalled ----- 5884404 in-flight CPI 1.5572 -- Total Cycles 9163104 ---- Thread 09 ---- PC 5: Stalled ----- 6069147 in-flight CPI 1.5098 -- Total Cycles 9163104 ---- Thread 10 ---- PC 5: Stalled ----- 6841550 in-flight CPI 1.3393 -- Total Cycles 9163104 ---- Thread 11 ---- PC 5: Stalled ----- 6585764 in-flight CPI 1.3913 -- Total Cycles 9163104 ---- Thread 12 ---- PC 5: Stalled ----- 5873381 in-flight CPI 1.5601 -- Total Cycles 9163104 ---- Thread 13 ---- PC 5: Stalled ----- 6606813 in-flight CPI 1.3869 -- Total Cycles 9163104 ---- Thread 14 ---- PC 5: Stalled ----- 6969682 in-flight CPI 1.3147 -- Total Cycles 9163104 ---- Thread 15 ---- PC 5: Stalled ----- 6170673 in-flight CPI 1.4849 -- Total Cycles 9163104 ---- Thread 16 ---- PC 5: Stalled ----- 5947448 in-flight CPI 1.5407 -- Total Cycles 9163104 ---- Thread 17 ---- PC 5: Stalled ----- 5832582 in-flight CPI 1.5710 -- Total Cycles 9163104 ---- Thread 18 ---- PC 5: Stalled ----- 6199349 in-flight CPI 1.4781 -- Total Cycles 9163104 ---- Thread 19 ---- PC 5: Stalled ----- 6417337 in-flight CPI 1.4279 -- Total Cycles 9163104 ---- Thread 20 ---- PC 5: Stalled ----- 6277440 in-flight CPI 1.4597 -- Total Cycles 9163104 ---- Thread 21 ---- PC 5: Stalled ----- 5533774 in-flight CPI 1.6558 -- Total Cycles 9163104 ---- Thread 22 ---- PC 5: Stalled ----- 6086778 in-flight CPI 1.5054 -- Total Cycles 9163104 ---- Thread 23 ---- PC 5: Stalled ----- 6517196 in-flight CPI 1.4060 -- Total Cycles 9163104 ---- Thread 24 ---- PC 5: Stalled ----- 5808298 in-flight CPI 1.5776 -- Total Cycles 9163104 
---- Thread 25 ---- PC 5: Stalled ----- 6763612 in-flight CPI 1.3548 -- Total Cycles 9163104 ---- Thread 26 ---- PC 5: Stalled ----- 6098105 in-flight CPI 1.5026 -- Total Cycles 9163104 ---- Thread 27 ---- PC 5: Stalled ----- 6080511 in-flight CPI 1.5070 -- Total Cycles 9163104 ---- Thread 28 ---- PC 5: Stalled ----- 6386698 in-flight CPI 1.4347 -- Total Cycles 9163104 ---- Thread 29 ---- PC 5: Stalled ----- 5367798 in-flight CPI 1.7070 -- Total Cycles 9163104 ---- Thread 30 ---- PC 5: Stalled ----- 5666495 in-flight CPI 1.6171 -- Total Cycles 9163104 ---- Thread 31 ---- PC 5: Stalled ----- 5464634 in-flight CPI 1.6768 -- Total Cycles 9163104 Total CPI 0.0456 , IPC 21.9207 -- Total Cycles 9163104 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 448159 (2.043409%) FPSUB: 0 (0.000000%) FPMUL: 2049848 (9.346411%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15500109 (70.673721%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 571772 (2.607030%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) 
BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3353822 (15.291962%) DIV: 7739 (0.035286%) FPUN: 0 (0.000000%) FPRSUB: 478 (0.002179%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (220290597 total) ADD%: 8.210 (18086420) SUB%: 0.000 (0) MUL%: 0.000 (210) BITOR%: 1.226 (2700566) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.545 (1200608) FPSUB%: 0.000 (0) FPMUL%: 4.762 (10490398) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (630) FPMAX%: 0.000 (630) LOAD%: 4.946 (10896509) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (42108) FPINV%: 0.000 (0) FPCONV%: 0.000 (694) FPEQ%: 0.000 (0) FPNE%: 
0.000 (0) FPLT%: 1.064 (2344663) FPLE%: 0.391 (861844) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (630) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28254) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.962 (6524229) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1646755) CMPU%: 0.000 (0) RSUB%: 0.000 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.761 (34720334) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2710087) ORI%: 1.264 (2783441) XORI%: 0.000 (0) MULI%: 3.360 (7401099) LW%: 1.191 (2623830) LWI%: 13.917 (30658342) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (663443) SWI%: 4.095 (9020116) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.480 (3259302) bged%: 0.000 (0) bgeid%: 0.000 (210) bgtd%: 0.000 (0) bgtid%: 0.323 (711125) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (89313) bned%: 0.000 (0) bneid%: 13.713 (30208247) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.738 (1625050) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (190787) DIV%: 0.000 (420) FPUN%: 1.183 (2605660) FPRSUB%: 3.710 (8172863) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.104 (6837247) FPGE%: 0.797 (1754793) SYNC%: 0.000 (0) NOP%: 8.820 (19429190) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 172 SUB 0 MUL 23 BITOR 6 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 534 FPSUB 0 FPMUL 5241 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 2405350 
INTCONV 0 ATOMIC_INC 5 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 115 FPINV 0 FPCONV 18 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1741 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2309 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3480797 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 812 ORI 613717 XORI 0 MULI 654219 LW 0 LWI 9742559 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1779 DIV 15 FPUN 0 FPRSUB 1 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.9207 --Total thread-cycles: 293219328 --total thread-cycles issued: 200861407 (68.502104%) --iCache conflicts: 6695449 (2.283427%) --thread*cycles of FU dependence: 16909850 (5.766963%) --thread*cycles of data dependence: 21931927 (7.479700%) --iCache cycles*banks: 293219328 (75.128277% used) Issue breakdown: --thread*cycles of issue worked: 200861407 (68.502103%) --thread*cycles of issue failed: 72928731 (24.871734%) --thread*cycles of issue NOP/other: 19429190 (6.626163%) Number of thread-cycles not ready: 21931927 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 220290597 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 7 4: 8 5: 8 6: 7 7: 8 8: 7 9: 7 10: 10 11: 7 12: 7 13: 7 14: 8 15: 7 16: 7 17: 7 18: 7 19: 8 20: 8 21: 6 22: 9 23: 8 24: 7 25: 8 26: 8 27: 9 28: 8 29: 7 30: 7 31: 6 <=== Core 64 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6706914 in-flight CPI 1.3333 -- Total Cycles 8942279 ---- Thread 01 ---- PC 5: Stalled ----- 7004637 in-flight CPI 1.2766 -- Total Cycles 8942279 ---- Thread 02 ---- PC 5: 
Stalled ----- 6372536 in-flight CPI 1.4032 -- Total Cycles 8942279 ---- Thread 03 ---- PC 5: Stalled ----- 6360511 in-flight CPI 1.4059 -- Total Cycles 8942279 ---- Thread 04 ---- PC 5: Stalled ----- 6126503 in-flight CPI 1.4596 -- Total Cycles 8942279 ---- Thread 05 ---- PC 5: Stalled ----- 6222913 in-flight CPI 1.4370 -- Total Cycles 8942279 ---- Thread 06 ---- PC 5: Stalled ----- 6063529 in-flight CPI 1.4748 -- Total Cycles 8942279 ---- Thread 07 ---- PC 5: Stalled ----- 6626554 in-flight CPI 1.3495 -- Total Cycles 8942279 ---- Thread 08 ---- PC 5: Stalled ----- 5806701 in-flight CPI 1.5400 -- Total Cycles 8942279 ---- Thread 09 ---- PC 5: Stalled ----- 5801017 in-flight CPI 1.5415 -- Total Cycles 8942279 ---- Thread 10 ---- PC 5: Stalled ----- 6855408 in-flight CPI 1.3044 -- Total Cycles 8942279 ---- Thread 11 ---- PC 5: Stalled ----- 5777122 in-flight CPI 1.5479 -- Total Cycles 8942279 ---- Thread 12 ---- PC 5: Stalled ----- 5911970 in-flight CPI 1.5126 -- Total Cycles 8942279 ---- Thread 13 ---- PC 5: Stalled ----- 6682367 in-flight CPI 1.3382 -- Total Cycles 8942279 ---- Thread 14 ---- PC 5: Stalled ----- 5994189 in-flight CPI 1.4918 -- Total Cycles 8942279 ---- Thread 15 ---- PC 5: Stalled ----- 6645980 in-flight CPI 1.3455 -- Total Cycles 8942279 ---- Thread 16 ---- PC 5: Stalled ----- 6203724 in-flight CPI 1.4414 -- Total Cycles 8942279 ---- Thread 17 ---- PC 5: Stalled ----- 5973306 in-flight CPI 1.4970 -- Total Cycles 8942279 ---- Thread 18 ---- PC 5: Stalled ----- 6068239 in-flight CPI 1.4736 -- Total Cycles 8942279 ---- Thread 19 ---- PC 5: Stalled ----- 6087835 in-flight CPI 1.4689 -- Total Cycles 8942279 ---- Thread 20 ---- PC 5: Stalled ----- 5666625 in-flight CPI 1.5781 -- Total Cycles 8942279 ---- Thread 21 ---- PC 5: Stalled ----- 6293991 in-flight CPI 1.4208 -- Total Cycles 8942279 ---- Thread 22 ---- PC 5: Stalled ----- 6420783 in-flight CPI 1.3927 -- Total Cycles 8942279 ---- Thread 23 ---- PC 5: Stalled ----- 5741320 in-flight CPI 1.5575 -- 
Total Cycles 8942279 ---- Thread 24 ---- PC 5: Stalled ----- 6270183 in-flight CPI 1.4262 -- Total Cycles 8942279 ---- Thread 25 ---- PC 5: Stalled ----- 5720457 in-flight CPI 1.5632 -- Total Cycles 8942279 ---- Thread 26 ---- PC 5: Stalled ----- 6363484 in-flight CPI 1.4052 -- Total Cycles 8942279 ---- Thread 27 ---- PC 5: Stalled ----- 6250371 in-flight CPI 1.4307 -- Total Cycles 8942279 ---- Thread 28 ---- PC 5: Stalled ----- 6262995 in-flight CPI 1.4278 -- Total Cycles 8942279 ---- Thread 29 ---- PC 5: Stalled ----- 5333151 in-flight CPI 1.6767 -- Total Cycles 8942279 ---- Thread 30 ---- PC 5: Stalled ----- 5344211 in-flight CPI 1.6733 -- Total Cycles 8942279 ---- Thread 31 ---- PC 5: Stalled ----- 6034356 in-flight CPI 1.4819 -- Total Cycles 8942279 Total CPI 0.0454 , IPC 22.0296 -- Total Cycles 8942279 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 437426 (2.026238%) FPSUB: 0 (0.000000%) FPMUL: 2005892 (9.291663%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15284277 (70.799599%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 566339 (2.623387%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 
(0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3286202 (15.222296%) DIV: 7492 (0.034704%) FPUN: 0 (0.000000%) FPRSUB: 456 (0.002112%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216046255 total) ADD%: 8.179 (17670042) SUB%: 0.000 (0) MUL%: 0.000 (203) BITOR%: 1.227 (2651424) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.544 (1174562) FPSUB%: 0.000 (0) FPMUL%: 4.760 (10284837) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (609) FPMAX%: 0.000 (609) LOAD%: 4.947 (10687606) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) 
ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41695) FPINV%: 0.000 (0) FPCONV%: 0.000 (673) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2295523) FPLE%: 0.390 (842354) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (609) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27862) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.965 (6405023) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1615953) CMPU%: 0.000 (0) RSUB%: 0.000 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.766 (34060927) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2660962) ORI%: 1.263 (2729675) XORI%: 0.000 (0) MULI%: 3.362 (7263044) LW%: 1.192 (2575950) LWI%: 13.930 (30094494) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (650997) SWI%: 4.099 (8856031) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3200160) bged%: 0.000 (0) bgeid%: 0.000 (203) bgtd%: 0.000 (0) bgtid%: 0.323 (697535) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86392) bned%: 0.000 (0) bneid%: 13.714 (29629648) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.737 (1591803) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (186899) DIV%: 0.000 (406) FPUN%: 1.184 (2557853) FPRSUB%: 3.712 (8019772) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.104 (6705235) FPGE%: 0.799 (1726385) SYNC%: 0.000 (0) NOP%: 8.818 (19051764) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 177 SUB 0 MUL 24 
BITOR 6 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 538 FPSUB 0 FPMUL 5378 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 2387509 INTCONV 0 ATOMIC_INC 11 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 97 FPINV 0 FPCONV 8 FPEQ 0 FPNE 0 FPLT 12 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1967 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2336 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3417805 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 806 ORI 598364 XORI 0 MULI 645412 LW 0 LWI 9570024 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1699 DIV 15 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.0296 --Total thread-cycles: 286152928 --total thread-cycles issued: 196994491 (68.842383%) --iCache conflicts: 6585898 (2.301531%) --thread*cycles of FU dependence: 16632620 (5.812493%) --thread*cycles of data dependence: 21588084 (7.544247%) --iCache cycles*banks: 286152928 (75.500289% used) Issue breakdown: --thread*cycles of issue worked: 196994491 (68.842382%) --thread*cycles of issue failed: 70106673 (24.499722%) --thread*cycles of issue NOP/other: 19051764 (6.657896%) Number of thread-cycles not ready: 21588084 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216046255 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 8 3: 7 4: 8 5: 8 6: 7 7: 7 8: 8 9: 7 10: 8 11: 7 12: 7 13: 8 14: 7 15: 9 16: 8 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 6 31: 7 <=== Core 65 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6199318 in-flight CPI 1.4480 -- Total Cycles 8976467 ---- Thread 01 
---- PC 5: Stalled ----- 5780970 in-flight CPI 1.5528 -- Total Cycles 8976467 ---- Thread 02 ---- PC 5: Stalled ----- 5888182 in-flight CPI 1.5245 -- Total Cycles 8976467 ---- Thread 03 ---- PC 5: Stalled ----- 6051112 in-flight CPI 1.4834 -- Total Cycles 8976467 ---- Thread 04 ---- PC 5: Stalled ----- 6210163 in-flight CPI 1.4454 -- Total Cycles 8976467 ---- Thread 05 ---- PC 5: Stalled ----- 6790553 in-flight CPI 1.3219 -- Total Cycles 8976467 ---- Thread 06 ---- PC 5: Stalled ----- 6246075 in-flight CPI 1.4371 -- Total Cycles 8976467 ---- Thread 07 ---- PC 5: Stalled ----- 6328687 in-flight CPI 1.4184 -- Total Cycles 8976467 ---- Thread 08 ---- PC 5: Stalled ----- 6597602 in-flight CPI 1.3606 -- Total Cycles 8976467 ---- Thread 09 ---- PC 5: Stalled ----- 6929068 in-flight CPI 1.2955 -- Total Cycles 8976467 ---- Thread 10 ---- PC 5: Stalled ----- 6206851 in-flight CPI 1.4462 -- Total Cycles 8976467 ---- Thread 11 ---- PC 5: Stalled ----- 5900237 in-flight CPI 1.5214 -- Total Cycles 8976467 ---- Thread 12 ---- PC 5: Stalled ----- 6031887 in-flight CPI 1.4882 -- Total Cycles 8976467 ---- Thread 13 ---- PC 5: Stalled ----- 5747571 in-flight CPI 1.5618 -- Total Cycles 8976467 ---- Thread 14 ---- PC 5: Stalled ----- 6321478 in-flight CPI 1.4200 -- Total Cycles 8976467 ---- Thread 15 ---- PC 5: Stalled ----- 5699150 in-flight CPI 1.5750 -- Total Cycles 8976467 ---- Thread 16 ---- PC 5: Stalled ----- 5896091 in-flight CPI 1.5224 -- Total Cycles 8976467 ---- Thread 17 ---- PC 5: Stalled ----- 6051939 in-flight CPI 1.4832 -- Total Cycles 8976467 ---- Thread 18 ---- PC 5: Stalled ----- 5742476 in-flight CPI 1.5632 -- Total Cycles 8976467 ---- Thread 19 ---- PC 5: Stalled ----- 6252955 in-flight CPI 1.4356 -- Total Cycles 8976467 ---- Thread 20 ---- PC 5: Stalled ----- 6200994 in-flight CPI 1.4476 -- Total Cycles 8976467 ---- Thread 21 ---- PC 5: Stalled ----- 5827851 in-flight CPI 1.5403 -- Total Cycles 8976467 ---- Thread 22 ---- PC 5: Stalled ----- 6696089 in-flight CPI 
1.3405 -- Total Cycles 8976467 ---- Thread 23 ---- PC 5: Stalled ----- 6270310 in-flight CPI 1.4316 -- Total Cycles 8976467 ---- Thread 24 ---- PC 5: Stalled ----- 5662574 in-flight CPI 1.5852 -- Total Cycles 8976467 ---- Thread 25 ---- PC 5: Stalled ----- 5468865 in-flight CPI 1.6414 -- Total Cycles 8976467 ---- Thread 26 ---- PC 5: Stalled ----- 5474664 in-flight CPI 1.6396 -- Total Cycles 8976467 ---- Thread 27 ---- PC 5: Stalled ----- 5838742 in-flight CPI 1.5374 -- Total Cycles 8976467 ---- Thread 28 ---- PC 5: Stalled ----- 6317461 in-flight CPI 1.4209 -- Total Cycles 8976467 ---- Thread 29 ---- PC 5: Stalled ----- 5868207 in-flight CPI 1.5297 -- Total Cycles 8976467 ---- Thread 30 ---- PC 5: Stalled ----- 5825003 in-flight CPI 1.5410 -- Total Cycles 8976467 ---- Thread 31 ---- PC 5: Stalled ----- 5460375 in-flight CPI 1.6439 -- Total Cycles 8976467 Total CPI 0.0463 , IPC 21.5880 -- Total Cycles 8976467 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 433790 (2.086127%) FPSUB: 0 (0.000000%) FPMUL: 1979363 (9.518899%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14543872 (69.942530%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 563056 (2.707777%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) 
BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3265875 (15.705828%) DIV: 7608 (0.036587%) FPUN: 0 (0.000000%) FPRSUB: 468 (0.002251%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (212527090 total) ADD%: 8.179 (17383203) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.226 (2605718) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.548 (1164523) FPSUB%: 0.000 (0) FPMUL%: 4.770 (10137528) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.954 (10528100) INTCONV%: 0.000 (0) ATOMIC_INC%: 
0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41427) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2260042) FPLE%: 0.394 (836905) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27664) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.961 (6292361) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1589635) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.768 (33510349) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2614980) ORI%: 1.261 (2679101) XORI%: 0.000 (0) MULI%: 3.358 (7137583) LW%: 1.191 (2530818) LWI%: 13.920 (29584344) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (639425) SWI%: 4.099 (8710516) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3144153) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.322 (684908) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85456) bned%: 0.000 (0) bneid%: 13.710 (29138251) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.743 (1578913) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (185752) DIV%: 0.000 (412) FPUN%: 1.182 (2513069) FPRSUB%: 3.713 (7891663) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.104 (6596924) FPGE%: 0.794 (1686906) SYNC%: 0.000 (0) NOP%: 8.819 (18742972) HALT%: 0.000 (0) PRINT%: 
0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 160 SUB 0 MUL 21 BITOR 2 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 524 FPSUB 0 FPMUL 5147 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 2303768 INTCONV 0 ATOMIC_INC 9 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 98 FPINV 0 FPCONV 15 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1706 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2191 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3354627 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 699 ORI 592444 XORI 0 MULI 637350 LW 0 LWI 9406116 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1697 DIV 33 FPUN 0 FPRSUB 2 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5880 --Total thread-cycles: 287246944 --total thread-cycles issued: 193784118 (67.462550%) --iCache conflicts: 6545968 (2.278864%) --thread*cycles of FU dependence: 16307026 (5.677006%) --thread*cycles of data dependence: 20794032 (7.239079%) --iCache cycles*banks: 287246944 (73.987601% used) Issue breakdown: --thread*cycles of issue worked: 193784118 (67.462552%) --thread*cycles of issue failed: 74719854 (26.012410%) --thread*cycles of issue NOP/other: 18742972 (6.525038%) Number of thread-cycles not ready: 20794032 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 212527090 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 7 5: 9 6: 9 7: 8 8: 8 9: 8 10: 7 11: 7 12: 7 13: 7 14: 7 15: 7 16: 8 17: 8 18: 7 19: 7 20: 7 21: 8 22: 9 23: 9 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 66 ===> ---- 
Thread 00 ---- PC 5: Stalled ----- 6102739 in-flight CPI 1.4624 -- Total Cycles 8924462 ---- Thread 01 ---- PC 5: Stalled ----- 5849474 in-flight CPI 1.5257 -- Total Cycles 8924462 ---- Thread 02 ---- PC 5: Stalled ----- 6180476 in-flight CPI 1.4440 -- Total Cycles 8924462 ---- Thread 03 ---- PC 5: Stalled ----- 5887766 in-flight CPI 1.5158 -- Total Cycles 8924462 ---- Thread 04 ---- PC 5: Stalled ----- 6976639 in-flight CPI 1.2792 -- Total Cycles 8924462 ---- Thread 05 ---- PC 5: Stalled ----- 5927121 in-flight CPI 1.5057 -- Total Cycles 8924462 ---- Thread 06 ---- PC 5: Stalled ----- 6091203 in-flight CPI 1.4651 -- Total Cycles 8924462 ---- Thread 07 ---- PC 5: Stalled ----- 6043776 in-flight CPI 1.4766 -- Total Cycles 8924462 ---- Thread 08 ---- PC 5: Stalled ----- 6087120 in-flight CPI 1.4661 -- Total Cycles 8924462 ---- Thread 09 ---- PC 5: Stalled ----- 6898193 in-flight CPI 1.2937 -- Total Cycles 8924462 ---- Thread 10 ---- PC 5: Stalled ----- 6137996 in-flight CPI 1.4540 -- Total Cycles 8924462 ---- Thread 11 ---- PC 5: Stalled ----- 6000579 in-flight CPI 1.4873 -- Total Cycles 8924462 ---- Thread 12 ---- PC 5: Stalled ----- 6002983 in-flight CPI 1.4867 -- Total Cycles 8924462 ---- Thread 13 ---- PC 5: Stalled ----- 6187609 in-flight CPI 1.4423 -- Total Cycles 8924462 ---- Thread 14 ---- PC 5: Stalled ----- 6305352 in-flight CPI 1.4154 -- Total Cycles 8924462 ---- Thread 15 ---- PC 5: Stalled ----- 5917480 in-flight CPI 1.5081 -- Total Cycles 8924462 ---- Thread 16 ---- PC 5: Stalled ----- 6573664 in-flight CPI 1.3576 -- Total Cycles 8924462 ---- Thread 17 ---- PC 5: Stalled ----- 6276851 in-flight CPI 1.4218 -- Total Cycles 8924462 ---- Thread 18 ---- PC 5: Stalled ----- 6020287 in-flight CPI 1.4824 -- Total Cycles 8924462 ---- Thread 19 ---- PC 5: Stalled ----- 6802337 in-flight CPI 1.3120 -- Total Cycles 8924462 ---- Thread 20 ---- PC 5: Stalled ----- 6245119 in-flight CPI 1.4290 -- Total Cycles 8924462 ---- Thread 21 ---- PC 5: Stalled ----- 6260474 
in-flight CPI 1.4255 -- Total Cycles 8924462 ---- Thread 22 ---- PC 5: Stalled ----- 5635094 in-flight CPI 1.5837 -- Total Cycles 8924462 ---- Thread 23 ---- PC 5: Stalled ----- 5548718 in-flight CPI 1.6084 -- Total Cycles 8924462 ---- Thread 24 ---- PC 5: Stalled ----- 6483851 in-flight CPI 1.3764 -- Total Cycles 8924462 ---- Thread 25 ---- PC 5: Stalled ----- 5749557 in-flight CPI 1.5522 -- Total Cycles 8924462 ---- Thread 26 ---- PC 5: Stalled ----- 5844909 in-flight CPI 1.5269 -- Total Cycles 8924462 ---- Thread 27 ---- PC 5: Stalled ----- 5859180 in-flight CPI 1.5232 -- Total Cycles 8924462 ---- Thread 28 ---- PC 5: Stalled ----- 6110061 in-flight CPI 1.4606 -- Total Cycles 8924462 ---- Thread 29 ---- PC 5: Stalled ----- 5660418 in-flight CPI 1.5766 -- Total Cycles 8924462 ---- Thread 30 ---- PC 5: Stalled ----- 5664858 in-flight CPI 1.5754 -- Total Cycles 8924462 ---- Thread 31 ---- PC 5: Stalled ----- 5872050 in-flight CPI 1.5198 -- Total Cycles 8924462 Total CPI 0.0457 , IPC 21.8730 -- Total Cycles 8924462 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 439697 (2.044295%) FPSUB: 0 (0.000000%) FPMUL: 2001437 (9.305335%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15205169 (70.693804%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 559267 (2.600215%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 
0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3295000 (15.319533%) DIV: 7458 (0.034675%) FPUN: 0 (0.000000%) FPRSUB: 461 (0.002143%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (214082456 total) ADD%: 8.189 (17531267) SUB%: 0.000 (0) MUL%: 0.000 (202) BITOR%: 1.221 (2614288) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (1175501) FPSUB%: 0.000 (0) FPMUL%: 4.778 (10227857) FPCMPLT%: 
0.000 (0) FPMIN%: 0.000 (606) FPMAX%: 0.000 (606) LOAD%: 4.958 (10613645) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41172) FPINV%: 0.000 (0) FPCONV%: 0.000 (670) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2280028) FPLE%: 0.391 (837003) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (606) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27466) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.961 (6339181) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1604762) CMPU%: 0.000 (0) RSUB%: 0.000 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.766 (33751781) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2635229) ORI%: 1.260 (2698379) XORI%: 0.000 (0) MULI%: 3.359 (7191019) LW%: 1.191 (2549446) LWI%: 13.918 (29795464) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (644335) SWI%: 4.096 (8768505) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3167254) bged%: 0.000 (0) bgeid%: 0.000 (202) bgtd%: 0.000 (0) bgtid%: 0.323 (690951) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (87102) bned%: 0.000 (0) bneid%: 13.705 (29340537) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.741 (1586881) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.088 (187502) DIV%: 0.000 (404) FPUN%: 1.177 (2520062) FPRSUB%: 3.716 (7954532) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (6) 
FPGT%: 3.104 (6645795) FPGE%: 0.791 (1693762) SYNC%: 0.000 (0) NOP%: 8.818 (18877916) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 166 SUB 0 MUL 10 BITOR 2 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 480 FPSUB 0 FPMUL 5207 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 2323320 INTCONV 0 ATOMIC_INC 6 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 98 FPINV 0 FPCONV 16 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1766 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2314 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3380309 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 784 ORI 601285 XORI 0 MULI 638783 LW 0 LWI 9468479 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1802 DIV 17 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8730 --Total thread-cycles: 285582784 --total thread-cycles issued: 195204540 (68.353050%) --iCache conflicts: 6577577 (2.303212%) --thread*cycles of FU dependence: 16425276 (5.751494%) --thread*cycles of data dependence: 21508489 (7.531438%) --iCache cycles*banks: 285582784 (74.963373% used) Issue breakdown: --thread*cycles of issue worked: 195204540 (68.353049%) --thread*cycles of issue failed: 71500328 (25.036638%) --thread*cycles of issue NOP/other: 18877916 (6.610313%) Number of thread-cycles not ready: 21508489 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214082456 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 8 4: 8 5: 7 6: 7 7: 7 8: 7 9: 8 10: 7 11: 7 12: 7 13: 7 14: 7 15: 7 16: 9 17: 
7 18: 9 19: 8 20: 7 21: 7 22: 7 23: 6 24: 7 25: 7 26: 7 27: 8 28: 7 29: 7 30: 7 31: 9 <=== Core 67 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6548849 in-flight CPI 1.3658 -- Total Cycles 8944317 ---- Thread 01 ---- PC 5: Stalled ----- 6774018 in-flight CPI 1.3204 -- Total Cycles 8944317 ---- Thread 02 ---- PC 5: Stalled ----- 6896682 in-flight CPI 1.2969 -- Total Cycles 8944317 ---- Thread 03 ---- PC 5: Stalled ----- 5908947 in-flight CPI 1.5137 -- Total Cycles 8944317 ---- Thread 04 ---- PC 5: Stalled ----- 5972067 in-flight CPI 1.4977 -- Total Cycles 8944317 ---- Thread 05 ---- PC 5: Stalled ----- 5820860 in-flight CPI 1.5366 -- Total Cycles 8944317 ---- Thread 06 ---- PC 5: Stalled ----- 6365804 in-flight CPI 1.4051 -- Total Cycles 8944317 ---- Thread 07 ---- PC 5: Stalled ----- 6033928 in-flight CPI 1.4823 -- Total Cycles 8944317 ---- Thread 08 ---- PC 5: Stalled ----- 5846969 in-flight CPI 1.5297 -- Total Cycles 8944317 ---- Thread 09 ---- PC 5: Stalled ----- 6891794 in-flight CPI 1.2978 -- Total Cycles 8944317 ---- Thread 10 ---- PC 5: Stalled ----- 6790057 in-flight CPI 1.3173 -- Total Cycles 8944317 ---- Thread 11 ---- PC 5: Stalled ----- 6616031 in-flight CPI 1.3519 -- Total Cycles 8944317 ---- Thread 12 ---- PC 5: Stalled ----- 6625873 in-flight CPI 1.3499 -- Total Cycles 8944317 ---- Thread 13 ---- PC 5: Stalled ----- 6661221 in-flight CPI 1.3427 -- Total Cycles 8944317 ---- Thread 14 ---- PC 5: Stalled ----- 6080123 in-flight CPI 1.4711 -- Total Cycles 8944317 ---- Thread 15 ---- PC 5: Stalled ----- 6184260 in-flight CPI 1.4463 -- Total Cycles 8944317 ---- Thread 16 ---- PC 5: Stalled ----- 5924552 in-flight CPI 1.5097 -- Total Cycles 8944317 ---- Thread 17 ---- PC 5: Stalled ----- 5828592 in-flight CPI 1.5346 -- Total Cycles 8944317 ---- Thread 18 ---- PC 5: Stalled ----- 5581386 in-flight CPI 1.6025 -- Total Cycles 8944317 ---- Thread 19 ---- PC 5: Stalled ----- 6042399 in-flight CPI 1.4803 -- Total Cycles 8944317 ---- Thread 20 ---- PC 5: Stalled 
----- 5955275 in-flight CPI 1.5019 -- Total Cycles 8944317 ---- Thread 21 ---- PC 5: Stalled ----- 6032509 in-flight CPI 1.4827 -- Total Cycles 8944317 ---- Thread 22 ---- PC 5: Stalled ----- 6000139 in-flight CPI 1.4907 -- Total Cycles 8944317 ---- Thread 23 ---- PC 5: Stalled ----- 6066527 in-flight CPI 1.4744 -- Total Cycles 8944317 ---- Thread 24 ---- PC 5: Stalled ----- 6367079 in-flight CPI 1.4048 -- Total Cycles 8944317 ---- Thread 25 ---- PC 5: Stalled ----- 5479266 in-flight CPI 1.6324 -- Total Cycles 8944317 ---- Thread 26 ---- PC 5: Stalled ----- 6450388 in-flight CPI 1.3866 -- Total Cycles 8944317 ---- Thread 27 ---- PC 5: Stalled ----- 6417313 in-flight CPI 1.3938 -- Total Cycles 8944317 ---- Thread 28 ---- PC 5: Stalled ----- 6493763 in-flight CPI 1.3774 -- Total Cycles 8944317 ---- Thread 29 ---- PC 5: Stalled ----- 5244715 in-flight CPI 1.7054 -- Total Cycles 8944317 ---- Thread 30 ---- PC 5: Stalled ----- 5694103 in-flight CPI 1.5708 -- Total Cycles 8944317 ---- Thread 31 ---- PC 5: Stalled ----- 6069490 in-flight CPI 1.4736 -- Total Cycles 8944317 Total CPI 0.0452 , IPC 22.0996 -- Total Cycles 8944317 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 446180 (2.005272%) FPSUB: 0 (0.000000%) FPMUL: 2027471 (9.112085%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15873225 (71.339212%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 558096 (2.508257%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 
(0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3337538 (14.999934%) DIV: 7382 (0.033177%) FPUN: 0 (0.000000%) FPRSUB: 459 (0.002063%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216776534 total) ADD%: 8.203 (17781305) SUB%: 0.000 (0) MUL%: 0.000 (200) BITOR%: 1.224 (2654427) BITAND%: 0.000 (0) BITSLEFT%: 
0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (1191007) FPSUB%: 0.000 (0) FPMUL%: 4.779 (10359804) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (600) FPMAX%: 0.000 (600) LOAD%: 4.955 (10741282) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41131) FPINV%: 0.000 (0) FPCONV%: 0.000 (664) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2309538) FPLE%: 0.390 (845053) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (600) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27566) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.961 (6417672) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (1627871) CMPU%: 0.000 (0) RSUB%: 0.000 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.758 (34160616) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2668519) ORI%: 1.268 (2748257) XORI%: 0.000 (0) MULI%: 3.357 (7276206) LW%: 1.191 (2580836) LWI%: 13.913 (30161002) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (652427) SWI%: 4.094 (8874066) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3206122) bged%: 0.000 (0) bgeid%: 0.000 (200) bgtd%: 0.000 (0) bgtid%: 0.323 (700679) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (89207) bned%: 0.000 (0) bneid%: 13.703 (29705560) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.735 (1593727) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.088 (190002) 
DIV%: 0.000 (400) FPUN%: 1.180 (2558282) FPRSUB%: 3.716 (8056285) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.100 (6719323) FPGE%: 0.795 (1724012) SYNC%: 0.000 (0) NOP%: 8.816 (19110955) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 168 SUB 0 MUL 20 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 527 FPSUB 0 FPMUL 5315 FPCMPLT 0 FPMIN 0 FPMAX 389 LOAD 2383552 INTCONV 0 ATOMIC_INC 5 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 98 FPINV 0 FPCONV 7 FPEQ 0 FPNE 0 FPLT 10 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2045 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2268 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3424923 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 830 ORI 611401 XORI 0 MULI 634694 LW 0 LWI 9587147 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1801 DIV 19 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.0996 --Total thread-cycles: 286218144 --total thread-cycles issued: 197665579 (69.061165%) --iCache conflicts: 6545853 (2.287015%) --thread*cycles of FU dependence: 16655232 (5.819069%) --thread*cycles of data dependence: 22250351 (7.773914%) --iCache cycles*banks: 286218144 (75.738233% used) Issue breakdown: --thread*cycles of issue worked: 197665579 (69.061163%) --thread*cycles of issue failed: 69441610 (24.261778%) --thread*cycles of issue NOP/other: 56558878367067179 (19760759250.492229%) Number of thread-cycles not ready: 22250351 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216776534 SIMD fetches beyond the first: 0 
ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 7 4: 7 5: 7 6: 8 7: 7 8: 6 9: 8 10: 9 11: 8 12: 7 13: 7 14: 7 15: 7 16: 8 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 6 30: 7 31: 9 <=== Core 68 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6268629 in-flight CPI 1.4455 -- Total Cycles 9061615 ---- Thread 01 ---- PC 5: Stalled ----- 5911978 in-flight CPI 1.5328 -- Total Cycles 9061615 ---- Thread 02 ---- PC 5: Stalled ----- 6349766 in-flight CPI 1.4271 -- Total Cycles 9061615 ---- Thread 03 ---- PC 5: Stalled ----- 6527671 in-flight CPI 1.3882 -- Total Cycles 9061615 ---- Thread 04 ---- PC 5: Stalled ----- 5850565 in-flight CPI 1.5488 -- Total Cycles 9061615 ---- Thread 05 ---- PC 5: Stalled ----- 5887821 in-flight CPI 1.5390 -- Total Cycles 9061615 ---- Thread 06 ---- PC 5: Stalled ----- 5924023 in-flight CPI 1.5296 -- Total Cycles 9061615 ---- Thread 07 ---- PC 5: Stalled ----- 6399273 in-flight CPI 1.4160 -- Total Cycles 9061615 ---- Thread 08 ---- PC 5: Stalled ----- 5917986 in-flight CPI 1.5312 -- Total Cycles 9061615 ---- Thread 09 ---- PC 5: Stalled ----- 6080666 in-flight CPI 1.4902 -- Total Cycles 9061615 ---- Thread 10 ---- PC 5: Stalled ----- 7056912 in-flight CPI 1.2841 -- Total Cycles 9061615 ---- Thread 11 ---- PC 5: Stalled ----- 6403215 in-flight CPI 1.4152 -- Total Cycles 9061615 ---- Thread 12 ---- PC 5: Stalled ----- 6130875 in-flight CPI 1.4780 -- Total Cycles 9061615 ---- Thread 13 ---- PC 5: Stalled ----- 6681310 in-flight CPI 1.3563 -- Total Cycles 9061615 ---- Thread 14 ---- PC 5: Stalled ----- 5994027 in-flight CPI 1.5118 -- Total Cycles 9061615 ---- Thread 15 ---- PC 5: Stalled ----- 5908936 in-flight CPI 1.5335 -- Total Cycles 9061615 ---- Thread 16 ---- PC 5: Stalled ----- 6111050 in-flight CPI 1.4828 -- Total Cycles 9061615 ---- Thread 17 ---- PC 5: Stalled ----- 6102916 in-flight CPI 1.4848 -- Total Cycles 9061615 ---- Thread 18 ---- PC 5: Stalled ----- 6027935 in-flight CPI 1.5033 -- Total Cycles 9061615 
---- Thread 19 ---- PC 5: Stalled ----- 5833880 in-flight CPI 1.5533 -- Total Cycles 9061615 ---- Thread 20 ---- PC 5: Stalled ----- 5618280 in-flight CPI 1.6129 -- Total Cycles 9061615 ---- Thread 21 ---- PC 5: Stalled ----- 6325209 in-flight CPI 1.4326 -- Total Cycles 9061615 ---- Thread 22 ---- PC 5: Stalled ----- 5668948 in-flight CPI 1.5985 -- Total Cycles 9061615 ---- Thread 23 ---- PC 5: Stalled ----- 6366714 in-flight CPI 1.4233 -- Total Cycles 9061615 ---- Thread 24 ---- PC 5: Stalled ----- 6206912 in-flight CPI 1.4599 -- Total Cycles 9061615 ---- Thread 25 ---- PC 5: Stalled ----- 6664464 in-flight CPI 1.3597 -- Total Cycles 9061615 ---- Thread 26 ---- PC 5: Stalled ----- 6209679 in-flight CPI 1.4593 -- Total Cycles 9061615 ---- Thread 27 ---- PC 5: Stalled ----- 6134031 in-flight CPI 1.4773 -- Total Cycles 9061615 ---- Thread 28 ---- PC 5: Stalled ----- 6083559 in-flight CPI 1.4895 -- Total Cycles 9061615 ---- Thread 29 ---- PC 5: Stalled ----- 5565940 in-flight CPI 1.6280 -- Total Cycles 9061615 ---- Thread 30 ---- PC 5: Stalled ----- 5964552 in-flight CPI 1.5192 -- Total Cycles 9061615 ---- Thread 31 ---- PC 5: Stalled ----- 5799961 in-flight CPI 1.5624 -- Total Cycles 9061615 Total CPI 0.0462 , IPC 21.6273 -- Total Cycles 9061615 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 438404 (2.073209%) FPSUB: 0 (0.000000%) FPMUL: 2001729 (9.466160%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14846199 (70.207551%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 563864 (2.666508%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 
(0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3287890 (15.548404%) DIV: 7604 (0.035959%) FPUN: 0 (0.000000%) FPRSUB: 467 (0.002208%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: 
(214940925 total) ADD%: 8.186 (17594445) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.230 (2643426) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.546 (1174486) FPSUB%: 0.000 (0) FPMUL%: 4.767 (10246695) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.949 (10637405) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41507) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (2285813) FPLE%: 0.392 (842842) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27792) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.960 (6362489) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1606867) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.765 (33886148) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2643663) ORI%: 1.265 (2718855) XORI%: 0.000 (0) MULI%: 3.358 (7216983) LW%: 1.191 (2558926) LWI%: 13.911 (29899831) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (646897) SWI%: 4.094 (8799122) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3178743) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.323 (693194) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86639) bned%: 0.000 (0) bneid%: 13.718 (29484621) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.741 (1591800) 
braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (187009) DIV%: 0.000 (412) FPUN%: 1.186 (2550172) FPRSUB%: 3.712 (7978178) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.104 (6671715) FPGE%: 0.799 (1718136) SYNC%: 0.000 (0) NOP%: 8.822 (18962624) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 176 SUB 0 MUL 30 BITOR 2 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 508 FPSUB 0 FPMUL 4964 FPCMPLT 0 FPMIN 0 FPMAX 402 LOAD 2373975 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 98 FPINV 0 FPCONV 12 FPEQ 0 FPNE 0 FPLT 9 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1992 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2186 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3395034 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 804 ORI 598876 XORI 0 MULI 640572 LW 0 LWI 9508096 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1813 DIV 14 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6273 --Total thread-cycles: 289971680 --total thread-cycles issued: 195978301 (67.585326%) --iCache conflicts: 6598650 (2.275619%) --thread*cycles of FU dependence: 16529592 (5.700416%) --thread*cycles of data dependence: 21146157 (7.292490%) --iCache cycles*banks: 289971680 (74.124810% used) Issue breakdown: --thread*cycles of issue worked: 195978301 (67.585325%) --thread*cycles of issue failed: 75030755 (25.875201%) --thread*cycles of issue NOP/other: 18962624 (6.539474%) Number of thread-cycles not ready: 21146157 Number of 
thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214940925 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 8 4: 8 5: 8 6: 7 7: 8 8: 7 9: 7 10: 8 11: 8 12: 8 13: 9 14: 7 15: 7 16: 7 17: 7 18: 8 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 8 28: 8 29: 7 30: 7 31: 7 <=== Core 69 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6948990 in-flight CPI 1.2771 -- Total Cycles 8874238 ---- Thread 01 ---- PC 5: Stalled ----- 6834218 in-flight CPI 1.2985 -- Total Cycles 8874238 ---- Thread 02 ---- PC 5: Stalled ----- 6102661 in-flight CPI 1.4542 -- Total Cycles 8874238 ---- Thread 03 ---- PC 5: Stalled ----- 6350620 in-flight CPI 1.3974 -- Total Cycles 8874238 ---- Thread 04 ---- PC 5: Stalled ----- 5956780 in-flight CPI 1.4898 -- Total Cycles 8874238 ---- Thread 05 ---- PC 5: Stalled ----- 6137137 in-flight CPI 1.4460 -- Total Cycles 8874238 ---- Thread 06 ---- PC 5: Stalled ----- 6284344 in-flight CPI 1.4121 -- Total Cycles 8874238 ---- Thread 07 ---- PC 5: Stalled ----- 6097645 in-flight CPI 1.4554 -- Total Cycles 8874238 ---- Thread 08 ---- PC 5: Stalled ----- 6158679 in-flight CPI 1.4409 -- Total Cycles 8874238 ---- Thread 09 ---- PC 5: Stalled ----- 5895029 in-flight CPI 1.5054 -- Total Cycles 8874238 ---- Thread 10 ---- PC 5: Stalled ----- 5998444 in-flight CPI 1.4794 -- Total Cycles 8874238 ---- Thread 11 ---- PC 5: Stalled ----- 6558683 in-flight CPI 1.3530 -- Total Cycles 8874238 ---- Thread 12 ---- PC 5: Stalled ----- 6407190 in-flight CPI 1.3850 -- Total Cycles 8874238 ---- Thread 13 ---- PC 5: Stalled ----- 5868074 in-flight CPI 1.5123 -- Total Cycles 8874238 ---- Thread 14 ---- PC 5: Stalled ----- 6877350 in-flight CPI 1.2904 -- Total Cycles 8874238 ---- Thread 15 ---- PC 5: Stalled ----- 6286297 in-flight CPI 1.4117 -- Total Cycles 8874238 ---- Thread 16 ---- PC 5: Stalled ----- 5768625 in-flight CPI 1.5384 -- Total Cycles 8874238 ---- Thread 17 ---- PC 5: Stalled ----- 6235463 in-flight CPI 1.4232 -- 
Total Cycles 8874238 ---- Thread 18 ---- PC 5: Stalled ----- 5888326 in-flight CPI 1.5071 -- Total Cycles 8874238 ---- Thread 19 ---- PC 5: Stalled ----- 6410701 in-flight CPI 1.3843 -- Total Cycles 8874238 ---- Thread 20 ---- PC 5: Stalled ----- 6566383 in-flight CPI 1.3515 -- Total Cycles 8874238 ---- Thread 21 ---- PC 5: Stalled ----- 5702904 in-flight CPI 1.5561 -- Total Cycles 8874238 ---- Thread 22 ---- PC 5: Stalled ----- 6311677 in-flight CPI 1.4060 -- Total Cycles 8874238 ---- Thread 23 ---- PC 5: Stalled ----- 6091728 in-flight CPI 1.4568 -- Total Cycles 8874238 ---- Thread 24 ---- PC 5: Stalled ----- 6429231 in-flight CPI 1.3803 -- Total Cycles 8874238 ---- Thread 25 ---- PC 5: Stalled ----- 6286051 in-flight CPI 1.4117 -- Total Cycles 8874238 ---- Thread 26 ---- PC 5: Stalled ----- 5554036 in-flight CPI 1.5978 -- Total Cycles 8874238 ---- Thread 27 ---- PC 5: Stalled ----- 6185846 in-flight CPI 1.4346 -- Total Cycles 8874238 ---- Thread 28 ---- PC 5: Stalled ----- 5466999 in-flight CPI 1.6232 -- Total Cycles 8874238 ---- Thread 29 ---- PC 5: Stalled ----- 5603385 in-flight CPI 1.5837 -- Total Cycles 8874238 ---- Thread 30 ---- PC 5: Stalled ----- 5475106 in-flight CPI 1.6208 -- Total Cycles 8874238 ---- Thread 31 ---- PC 5: Stalled ----- 5982189 in-flight CPI 1.4834 -- Total Cycles 8874238 Total CPI 0.0451 , IPC 22.1677 -- Total Cycles 8874238 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 437515 (2.026013%) FPSUB: 0 (0.000000%) FPMUL: 2003460 (9.277478%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15296859 (70.835589%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) 
ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 560841 (2.597102%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3288173 (15.226634%) DIV: 7568 (0.035045%) FPUN: 0 (0.000000%) FPRSUB: 462 (0.002139%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 
(0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215741093 total) ADD%: 8.188 (17664094) SUB%: 0.000 (0) MUL%: 0.000 (205) BITOR%: 1.230 (2653723) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.544 (1172767) FPSUB%: 0.000 (0) FPMUL%: 4.759 (10267432) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (615) FPMAX%: 0.000 (615) LOAD%: 4.950 (10680112) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41243) FPINV%: 0.000 (0) FPCONV%: 0.000 (679) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (2291467) FPLE%: 0.389 (839144) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (615) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27552) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.964 (6395248) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1616987) CMPU%: 0.000 (0) RSUB%: 0.000 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.766 (34013143) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2658658) ORI%: 1.266 (2731602) XORI%: 0.000 (0) MULI%: 3.361 (7251019) LW%: 1.192 (2571900) LWI%: 13.917 (30025489) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (649830) SWI%: 4.095 (8835594) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3195484) bged%: 0.000 (0) bgeid%: 0.000 (205) bgtd%: 0.000 (0) bgtid%: 0.323 (696610) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85866) bned%: 0.000 (0) 
bneid%: 13.716 (29590105) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.737 (1590584) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (187090) DIV%: 0.000 (410) FPUN%: 1.186 (2558525) FPRSUB%: 3.710 (8004417) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (4) FPGT%: 3.102 (6691753) FPGE%: 0.802 (1730082) SYNC%: 0.000 (0) NOP%: 8.816 (19019687) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 171 SUB 0 MUL 23 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 516 FPSUB 0 FPMUL 5152 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 2364934 INTCONV 0 ATOMIC_INC 3 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 104 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 10 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2115 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2261 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3409273 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 881 ORI 598370 XORI 0 MULI 648187 LW 0 LWI 9543937 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1761 DIV 13 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.1677 --Total thread-cycles: 283975616 --total thread-cycles issued: 196721406 (69.274049%) --iCache conflicts: 6612393 (2.328507%) --thread*cycles of FU dependence: 16578148 (5.837877%) --thread*cycles of data dependence: 21594878 (7.604483%) --iCache cycles*banks: 283975616 (75.971708% used) Issue breakdown: --thread*cycles of issue worked: 196721406 (69.274049%) --thread*cycles of issue failed: 68234523 
(24.028304%) --thread*cycles of issue NOP/other: 19019687 (6.697648%) Number of thread-cycles not ready: 21594878 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215741093 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 8 4: 7 5: 8 6: 8 7: 8 8: 7 9: 7 10: 7 11: 7 12: 7 13: 7 14: 8 15: 9 16: 7 17: 8 18: 7 19: 7 20: 7 21: 7 22: 8 23: 7 24: 9 25: 8 26: 8 27: 7 28: 6 29: 7 30: 6 31: 7 <=== Core 70 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5854155 in-flight CPI 1.5263 -- Total Cycles 8935096 ---- Thread 01 ---- PC 5: Stalled ----- 6983023 in-flight CPI 1.2795 -- Total Cycles 8935096 ---- Thread 02 ---- PC 5: Stalled ----- 6052712 in-flight CPI 1.4762 -- Total Cycles 8935096 ---- Thread 03 ---- PC 5: Stalled ----- 6677953 in-flight CPI 1.3380 -- Total Cycles 8935096 ---- Thread 04 ---- PC 5: Stalled ----- 5953775 in-flight CPI 1.5007 -- Total Cycles 8935096 ---- Thread 05 ---- PC 5: Stalled ----- 6270868 in-flight CPI 1.4249 -- Total Cycles 8935096 ---- Thread 06 ---- PC 5: Stalled ----- 6793169 in-flight CPI 1.3153 -- Total Cycles 8935096 ---- Thread 07 ---- PC 5: Stalled ----- 6370480 in-flight CPI 1.4026 -- Total Cycles 8935096 ---- Thread 08 ---- PC 5: Stalled ----- 6483820 in-flight CPI 1.3781 -- Total Cycles 8935096 ---- Thread 09 ---- PC 5: Stalled ----- 6708870 in-flight CPI 1.3318 -- Total Cycles 8935096 ---- Thread 10 ---- PC 5: Stalled ----- 6354598 in-flight CPI 1.4061 -- Total Cycles 8935096 ---- Thread 11 ---- PC 5: Stalled ----- 6020367 in-flight CPI 1.4841 -- Total Cycles 8935096 ---- Thread 12 ---- PC 5: Stalled ----- 6398337 in-flight CPI 1.3965 -- Total Cycles 8935096 ---- Thread 13 ---- PC 5: Stalled ----- 6112790 in-flight CPI 1.4617 -- Total Cycles 8935096 ---- Thread 14 ---- PC 5: Stalled ----- 5873394 in-flight CPI 1.5213 -- Total Cycles 8935096 ---- Thread 15 ---- PC 5: Stalled ----- 5869275 in-flight CPI 1.5223 -- Total Cycles 8935096 ---- Thread 16 ---- PC 5: Stalled ----- 
6008839 in-flight CPI 1.4870 -- Total Cycles 8935096 ---- Thread 17 ---- PC 5: Stalled ----- 6271283 in-flight CPI 1.4248 -- Total Cycles 8935096 ---- Thread 18 ---- PC 5: Stalled ----- 5864736 in-flight CPI 1.5235 -- Total Cycles 8935096 ---- Thread 19 ---- PC 5: Stalled ----- 6399522 in-flight CPI 1.3962 -- Total Cycles 8935096 ---- Thread 20 ---- PC 5: Stalled ----- 6069513 in-flight CPI 1.4721 -- Total Cycles 8935096 ---- Thread 21 ---- PC 5: Stalled ----- 5593429 in-flight CPI 1.5974 -- Total Cycles 8935096 ---- Thread 22 ---- PC 5: Stalled ----- 5632446 in-flight CPI 1.5864 -- Total Cycles 8935096 ---- Thread 23 ---- PC 5: Stalled ----- 6144124 in-flight CPI 1.4542 -- Total Cycles 8935096 ---- Thread 24 ---- PC 5: Stalled ----- 6117988 in-flight CPI 1.4605 -- Total Cycles 8935096 ---- Thread 25 ---- PC 5: Stalled ----- 6336290 in-flight CPI 1.4101 -- Total Cycles 8935096 ---- Thread 26 ---- PC 5: Stalled ----- 5634822 in-flight CPI 1.5857 -- Total Cycles 8935096 ---- Thread 27 ---- PC 5: Stalled ----- 6362488 in-flight CPI 1.4043 -- Total Cycles 8935096 ---- Thread 28 ---- PC 5: Stalled ----- 5947762 in-flight CPI 1.5023 -- Total Cycles 8935096 ---- Thread 29 ---- PC 5: Stalled ----- 5728681 in-flight CPI 1.5597 -- Total Cycles 8935096 ---- Thread 30 ---- PC 5: Stalled ----- 6409328 in-flight CPI 1.3941 -- Total Cycles 8935096 ---- Thread 31 ---- PC 5: Stalled ----- 5493095 in-flight CPI 1.6266 -- Total Cycles 8935096 Total CPI 0.0454 , IPC 22.0247 -- Total Cycles 8935096 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 440830 (2.052491%) FPSUB: 0 (0.000000%) FPMUL: 2011260 (9.364366%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15150009 (70.537986%) INTCONV: 0 (0.000000%) 
ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 559543 (2.605215%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3308167 (15.402726%) DIV: 7532 (0.035069%) FPUN: 0 
(0.000000%) FPRSUB: 461 (0.002146%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215835960 total) ADD%: 8.168 (17629603) SUB%: 0.000 (0) MUL%: 0.000 (204) BITOR%: 1.232 (2658564) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (1180308) FPSUB%: 0.000 (0) FPMUL%: 4.770 (10296395) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (612) FPMAX%: 0.000 (612) LOAD%: 4.955 (10693866) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41269) FPINV%: 0.000 (0) FPCONV%: 0.000 (676) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (2295734) FPLE%: 0.393 (849027) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (612) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27736) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.960 (6388378) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1616251) CMPU%: 0.000 (0) RSUB%: 0.000 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.771 (34040460) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2655575) ORI%: 1.266 (2732459) XORI%: 0.000 (0) MULI%: 3.357 (7244819) LW%: 1.190 (2569244) LWI%: 13.904 (30010017) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (649957) SWI%: 4.091 (8830355) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.478 (3191012) bged%: 0.000 (0) bgeid%: 0.000 (204) 
bgtd%: 0.000 (0) bgtid%: 0.323 (697108) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (87069) bned%: 0.000 (0) bneid%: 13.720 (29612368) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.742 (1602536) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (188216) DIV%: 0.000 (408) FPUN%: 1.188 (2563726) FPRSUB%: 3.713 (8013663) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.103 (6697457) FPGE%: 0.799 (1725507) SYNC%: 0.000 (0) NOP%: 8.823 (19043416) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 200 SUB 0 MUL 32 BITOR 11 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 508 FPSUB 0 FPMUL 5313 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 2354142 INTCONV 0 ATOMIC_INC 1 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 109 FPINV 0 FPCONV 20 FPEQ 0 FPNE 0 FPLT 10 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1810 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 2 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2352 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3407991 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 844 ORI 602578 XORI 0 MULI 644345 LW 0 LWI 9543152 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1688 DIV 27 FPUN 0 FPRSUB 2 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.0247 --Total thread-cycles: 285923072 --total thread-cycles issued: 196792544 (68.827095%) --iCache conflicts: 6639077 (2.321980%) --thread*cycles of FU dependence: 16565535 (5.793703%) --thread*cycles of data dependence: 21477802 (7.511741%) --iCache cycles*banks: 285923072 (75.487435% 
used) Issue breakdown: --thread*cycles of issue worked: 196792544 (68.827095%) --thread*cycles of issue failed: 70087112 (24.512577%) --thread*cycles of issue NOP/other: 19043416 (6.660329%) Number of thread-cycles not ready: 21477802 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215835960 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 8 4: 7 5: 7 6: 8 7: 8 8: 7 9: 8 10: 8 11: 8 12: 7 13: 7 14: 8 15: 7 16: 7 17: 7 18: 8 19: 8 20: 8 21: 6 22: 7 23: 8 24: 9 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 6 <=== Core 71 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5887715 in-flight CPI 1.4838 -- Total Cycles 8736306 ---- Thread 01 ---- PC 5: Stalled ----- 6689719 in-flight CPI 1.3059 -- Total Cycles 8736306 ---- Thread 02 ---- PC 5: Stalled ----- 5891572 in-flight CPI 1.4828 -- Total Cycles 8736306 ---- Thread 03 ---- PC 5: Stalled ----- 6282192 in-flight CPI 1.3906 -- Total Cycles 8736306 ---- Thread 04 ---- PC 5: Stalled ----- 6140866 in-flight CPI 1.4226 -- Total Cycles 8736306 ---- Thread 05 ---- PC 5: Stalled ----- 5995660 in-flight CPI 1.4571 -- Total Cycles 8736306 ---- Thread 06 ---- PC 5: Stalled ----- 6086589 in-flight CPI 1.4353 -- Total Cycles 8736306 ---- Thread 07 ---- PC 5: Stalled ----- 5861057 in-flight CPI 1.4906 -- Total Cycles 8736306 ---- Thread 08 ---- PC 5: Stalled ----- 6642425 in-flight CPI 1.3152 -- Total Cycles 8736306 ---- Thread 09 ---- PC 5: Stalled ----- 6400956 in-flight CPI 1.3648 -- Total Cycles 8736306 ---- Thread 10 ---- PC 5: Stalled ----- 6289871 in-flight CPI 1.3889 -- Total Cycles 8736306 ---- Thread 11 ---- PC 5: Stalled ----- 6779962 in-flight CPI 1.2885 -- Total Cycles 8736306 ---- Thread 12 ---- PC 5: Stalled ----- 6212430 in-flight CPI 1.4063 -- Total Cycles 8736306 ---- Thread 13 ---- PC 5: Stalled ----- 5925901 in-flight CPI 1.4743 -- Total Cycles 8736306 ---- Thread 14 ---- PC 5: Stalled ----- 6535979 in-flight CPI 1.3366 -- Total Cycles 8736306 ---- Thread 15 
---- PC 5: Stalled ----- 5904835 in-flight CPI 1.4795 -- Total Cycles 8736306 ---- Thread 16 ---- PC 5: Stalled ----- 5905572 in-flight CPI 1.4793 -- Total Cycles 8736306 ---- Thread 17 ---- PC 5: Stalled ----- 5678574 in-flight CPI 1.5385 -- Total Cycles 8736306 ---- Thread 18 ---- PC 5: Stalled ----- 5688314 in-flight CPI 1.5358 -- Total Cycles 8736306 ---- Thread 19 ---- PC 5: Stalled ----- 6520091 in-flight CPI 1.3399 -- Total Cycles 8736306 ---- Thread 20 ---- PC 5: Stalled ----- 5765922 in-flight CPI 1.5152 -- Total Cycles 8736306 ---- Thread 21 ---- PC 5: Stalled ----- 6466665 in-flight CPI 1.3510 -- Total Cycles 8736306 ---- Thread 22 ---- PC 5: Stalled ----- 5861504 in-flight CPI 1.4905 -- Total Cycles 8736306 ---- Thread 23 ---- PC 5: Stalled ----- 5810542 in-flight CPI 1.5035 -- Total Cycles 8736306 ---- Thread 24 ---- PC 5: Stalled ----- 6414452 in-flight CPI 1.3620 -- Total Cycles 8736306 ---- Thread 25 ---- PC 5: Stalled ----- 5424443 in-flight CPI 1.6105 -- Total Cycles 8736306 ---- Thread 26 ---- PC 5: Stalled ----- 5461181 in-flight CPI 1.5997 -- Total Cycles 8736306 ---- Thread 27 ---- PC 5: Stalled ----- 5909014 in-flight CPI 1.4785 -- Total Cycles 8736306 ---- Thread 28 ---- PC 5: Stalled ----- 6331703 in-flight CPI 1.3798 -- Total Cycles 8736306 ---- Thread 29 ---- PC 5: Stalled ----- 6268241 in-flight CPI 1.3937 -- Total Cycles 8736306 ---- Thread 30 ---- PC 5: Stalled ----- 5559529 in-flight CPI 1.5714 -- Total Cycles 8736306 ---- Thread 31 ---- PC 5: Stalled ----- 5789910 in-flight CPI 1.5089 -- Total Cycles 8736306 Total CPI 0.0449 , IPC 22.2501 -- Total Cycles 8736306 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 424362 (2.025844%) FPSUB: 0 (0.000000%) FPMUL: 1961124 (9.362128%) 
FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14773819 (70.528121%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 565861 (2.701340%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 
(0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3214071 (15.343520%) DIV: 7709 (0.036802%) FPUN: 0 (0.000000%) FPRSUB: 470 (0.002244%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (213173155 total) ADD%: 8.207 (17495668) SUB%: 0.000 (0) MUL%: 0.000 (209) BITOR%: 1.228 (2617618) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.536 (1142900) FPSUB%: 0.000 (0) FPMUL%: 4.732 (10087222) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (627) FPMAX%: 0.000 (627) LOAD%: 4.944 (10540318) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.020 (41622) FPINV%: 0.000 (0) FPCONV%: 0.000 (691) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.059 (2257735) FPLE%: 0.393 (838106) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (627) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27784) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.968 (6326110) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (1590276) CMPU%: 0.000 (0) RSUB%: 0.000 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.768 (33613276) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2625971) ORI%: 1.255 (2674438) XORI%: 0.000 (0) MULI%: 3.366 (7175515) LW%: 1.194 (2544408) LWI%: 13.942 (29719720) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (643027) SWI%: 4.107 (8754536) 
sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3160864) bged%: 0.000 (0) bgeid%: 0.000 (209) bgtd%: 0.000 (0) bgtid%: 0.323 (688386) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (84461) bned%: 0.000 (0) bneid%: 13.712 (29229805) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.742 (1581010) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (182648) DIV%: 0.000 (418) FPUN%: 1.186 (2527531) FPRSUB%: 3.702 (7892385) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.104 (6616535) FPGE%: 0.798 (1700182) SYNC%: 0.000 (0) NOP%: 8.814 (18789142) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 177 SUB 0 MUL 18 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 502 FPSUB 0 FPMUL 5155 FPCMPLT 0 FPMIN 0 FPMAX 409 LOAD 2300673 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 98 FPINV 0 FPCONV 8 FPEQ 0 FPNE 0 FPLT 6 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1990 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 2 ADDKC 0 BITXOR 0 ANDN 0 CMP 2227 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3370201 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 769 ORI 577211 XORI 0 MULI 638285 LW 0 LWI 9442938 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1765 DIV 17 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.2501 --Total thread-cycles: 279561792 --total thread-cycles issued: 194384013 (69.531682%) --iCache conflicts: 6524651 (2.333885%) --thread*cycles of FU dependence: 16342485 
(5.845751%) --thread*cycles of data dependence: 20947416 (7.492947%) --iCache cycles*banks: 279561792 (76.252619% used) Issue breakdown: --thread*cycles of issue worked: 194384013 (69.531681%) --thread*cycles of issue failed: 66388637 (23.747393%) --thread*cycles of issue NOP/other: 18789142 (6.720926%) Number of thread-cycles not ready: 20947416 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 213173155 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 9 3: 7 4: 7 5: 7 6: 7 7: 8 8: 9 9: 8 10: 7 11: 8 12: 7 13: 7 14: 7 15: 7 16: 8 17: 7 18: 7 19: 8 20: 7 21: 8 22: 7 23: 7 24: 7 25: 7 26: 9 27: 7 28: 8 29: 8 30: 7 31: 8 <=== Core 72 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6709026 in-flight CPI 1.3637 -- Total Cycles 9149459 ---- Thread 01 ---- PC 5: Stalled ----- 6710177 in-flight CPI 1.3635 -- Total Cycles 9149459 ---- Thread 02 ---- PC 5: Stalled ----- 5917538 in-flight CPI 1.5462 -- Total Cycles 9149459 ---- Thread 03 ---- PC 5: Stalled ----- 7057109 in-flight CPI 1.2965 -- Total Cycles 9149459 ---- Thread 04 ---- PC 5: Stalled ----- 5924571 in-flight CPI 1.5443 -- Total Cycles 9149459 ---- Thread 05 ---- PC 5: Stalled ----- 6686147 in-flight CPI 1.3684 -- Total Cycles 9149459 ---- Thread 06 ---- PC 5: Stalled ----- 6389097 in-flight CPI 1.4320 -- Total Cycles 9149459 ---- Thread 07 ---- PC 5: Stalled ----- 5793057 in-flight CPI 1.5794 -- Total Cycles 9149459 ---- Thread 08 ---- PC 5: Stalled ----- 6254697 in-flight CPI 1.4628 -- Total Cycles 9149459 ---- Thread 09 ---- PC 5: Stalled ----- 6102099 in-flight CPI 1.4994 -- Total Cycles 9149459 ---- Thread 10 ---- PC 5: Stalled ----- 5669568 in-flight CPI 1.6138 -- Total Cycles 9149459 ---- Thread 11 ---- PC 5: Stalled ----- 6044693 in-flight CPI 1.5136 -- Total Cycles 9149459 ---- Thread 12 ---- PC 5: Stalled ----- 6748262 in-flight CPI 1.3558 -- Total Cycles 9149459 ---- Thread 13 ---- PC 5: Stalled ----- 6139330 in-flight CPI 1.4903 -- Total Cycles 
9149459 ---- Thread 14 ---- PC 5: Stalled ----- 5978360 in-flight CPI 1.5304 -- Total Cycles 9149459 ---- Thread 15 ---- PC 5: Stalled ----- 5888027 in-flight CPI 1.5539 -- Total Cycles 9149459 ---- Thread 16 ---- PC 5: Stalled ----- 5894984 in-flight CPI 1.5521 -- Total Cycles 9149459 ---- Thread 17 ---- PC 5: Stalled ----- 5998552 in-flight CPI 1.5253 -- Total Cycles 9149459 ---- Thread 18 ---- PC 5: Stalled ----- 6916636 in-flight CPI 1.3228 -- Total Cycles 9149459 ---- Thread 19 ---- PC 5: Stalled ----- 5971306 in-flight CPI 1.5322 -- Total Cycles 9149459 ---- Thread 20 ---- PC 5: Stalled ----- 5922419 in-flight CPI 1.5449 -- Total Cycles 9149459 ---- Thread 21 ---- PC 5: Stalled ----- 5864545 in-flight CPI 1.5601 -- Total Cycles 9149459 ---- Thread 22 ---- PC 5: Stalled ----- 6285185 in-flight CPI 1.4557 -- Total Cycles 9149459 ---- Thread 23 ---- PC 5: Stalled ----- 6676476 in-flight CPI 1.3704 -- Total Cycles 9149459 ---- Thread 24 ---- PC 5: Stalled ----- 6641175 in-flight CPI 1.3777 -- Total Cycles 9149459 ---- Thread 25 ---- PC 5: Stalled ----- 5706654 in-flight CPI 1.6033 -- Total Cycles 9149459 ---- Thread 26 ---- PC 5: Stalled ----- 6264991 in-flight CPI 1.4604 -- Total Cycles 9149459 ---- Thread 27 ---- PC 5: Stalled ----- 6350418 in-flight CPI 1.4408 -- Total Cycles 9149459 ---- Thread 28 ---- PC 5: Stalled ----- 5854370 in-flight CPI 1.5628 -- Total Cycles 9149459 ---- Thread 29 ---- PC 5: Stalled ----- 5776008 in-flight CPI 1.5840 -- Total Cycles 9149459 ---- Thread 30 ---- PC 5: Stalled ----- 6188106 in-flight CPI 1.4786 -- Total Cycles 9149459 ---- Thread 31 ---- PC 5: Stalled ----- 5529848 in-flight CPI 1.6546 -- Total Cycles 9149459 Total CPI 0.0462 , IPC 21.6247 -- Total Cycles 9149459 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 
0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 447411 (2.089788%) FPSUB: 0 (0.000000%) FPMUL: 2031993 (9.491128%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15008155 (70.100790%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 566151 (2.644404%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) 
brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3347460 (15.635472%) DIV: 7748 (0.036190%) FPUN: 0 (0.000000%) FPRSUB: 477 (0.002228%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216994087 total) ADD%: 8.181 (17752990) SUB%: 0.000 (0) MUL%: 0.000 (210) BITOR%: 1.229 (2666684) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.552 (1196899) FPSUB%: 0.000 (0) FPMUL%: 4.782 (10377216) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (630) FPMAX%: 0.000 (630) LOAD%: 4.951 (10742870) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41635) FPINV%: 0.000 (0) FPCONV%: 0.000 (694) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (2313465) FPLE%: 0.390 (846595) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (630) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28012) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.958 (6417703) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (1627163) CMPU%: 0.000 (0) RSUB%: 0.000 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.761 (34199612) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2669325) ORI%: 1.269 (2754726) XORI%: 0.000 (0) MULI%: 3.356 (7282614) LW%: 1.189 
(2581058) LWI%: 13.906 (30175628) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.300 (652017) SWI%: 4.091 (8876379) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.478 (3206871) bged%: 0.000 (0) bgeid%: 0.000 (210) bgtd%: 0.000 (0) bgtid%: 0.322 (699362) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (87583) bned%: 0.000 (0) bneid%: 13.716 (29763223) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.737 (1599797) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.088 (190499) DIV%: 0.000 (420) FPUN%: 1.184 (2569897) FPRSUB%: 3.716 (8063514) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.103 (6732593) FPGE%: 0.799 (1734158) SYNC%: 0.000 (0) NOP%: 8.821 (19140026) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 165 SUB 0 MUL 16 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 533 FPSUB 0 FPMUL 5471 FPCMPLT 0 FPMIN 0 FPMAX 411 LOAD 2347517 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 101 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 12 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1820 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2173 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3424083 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 757 ORI 611742 XORI 0 MULI 645632 LW 0 LWI 9594937 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1773 DIV 16 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6247 --Total thread-cycles: 292782688 --total thread-cycles issued: 
197854061 (67.577105%) --iCache conflicts: 6633605 (2.265709%) --thread*cycles of FU dependence: 16637198 (5.682439%) --thread*cycles of data dependence: 21409395 (7.312384%) --iCache cycles*banks: 292782688 (74.114395% used) Issue breakdown: --thread*cycles of issue worked: 197854061 (67.577104%) --thread*cycles of issue failed: 75788601 (25.885616%) --thread*cycles of issue NOP/other: 19140026 (6.537281%) Number of thread-cycles not ready: 21409395 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216994087 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 10 3: 9 4: 7 5: 9 6: 7 7: 8 8: 7 9: 7 10: 6 11: 8 12: 8 13: 7 14: 7 15: 7 16: 7 17: 7 18: 9 19: 7 20: 8 21: 8 22: 7 23: 8 24: 7 25: 7 26: 7 27: 7 28: 8 29: 8 30: 7 31: 7 <=== Core 73 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6079375 in-flight CPI 1.4746 -- Total Cycles 8964813 ---- Thread 01 ---- PC 5: Stalled ----- 6426626 in-flight CPI 1.3949 -- Total Cycles 8964813 ---- Thread 02 ---- PC 5: Stalled ----- 6075494 in-flight CPI 1.4756 -- Total Cycles 8964813 ---- Thread 03 ---- PC 5: Stalled ----- 6305257 in-flight CPI 1.4218 -- Total Cycles 8964813 ---- Thread 04 ---- PC 5: Stalled ----- 6181611 in-flight CPI 1.4502 -- Total Cycles 8964813 ---- Thread 05 ---- PC 5: Stalled ----- 6699799 in-flight CPI 1.3381 -- Total Cycles 8964813 ---- Thread 06 ---- PC 5: Stalled ----- 6058234 in-flight CPI 1.4798 -- Total Cycles 8964813 ---- Thread 07 ---- PC 5: Stalled ----- 6193881 in-flight CPI 1.4474 -- Total Cycles 8964813 ---- Thread 08 ---- PC 5: Stalled ----- 6351267 in-flight CPI 1.4115 -- Total Cycles 8964813 ---- Thread 09 ---- PC 5: Stalled ----- 6078785 in-flight CPI 1.4748 -- Total Cycles 8964813 ---- Thread 10 ---- PC 5: Stalled ----- 6322253 in-flight CPI 1.4180 -- Total Cycles 8964813 ---- Thread 11 ---- PC 5: Stalled ----- 6028189 in-flight CPI 1.4871 -- Total Cycles 8964813 ---- Thread 12 ---- PC 5: Stalled ----- 6651884 in-flight CPI 1.3477 -- 
Total Cycles 8964813 ---- Thread 13 ---- PC 5: Stalled ----- 6440029 in-flight CPI 1.3920 -- Total Cycles 8964813 ---- Thread 14 ---- PC 5: Stalled ----- 6373815 in-flight CPI 1.4065 -- Total Cycles 8964813 ---- Thread 15 ---- PC 5: Stalled ----- 6430955 in-flight CPI 1.3940 -- Total Cycles 8964813 ---- Thread 16 ---- PC 5: Stalled ----- 6050526 in-flight CPI 1.4817 -- Total Cycles 8964813 ---- Thread 17 ---- PC 5: Stalled ----- 6854008 in-flight CPI 1.3080 -- Total Cycles 8964813 ---- Thread 18 ---- PC 5: Stalled ----- 6186628 in-flight CPI 1.4491 -- Total Cycles 8964813 ---- Thread 19 ---- PC 5: Stalled ----- 5851627 in-flight CPI 1.5320 -- Total Cycles 8964813 ---- Thread 20 ---- PC 5: Stalled ----- 5626877 in-flight CPI 1.5932 -- Total Cycles 8964813 ---- Thread 21 ---- PC 5: Stalled ----- 5866410 in-flight CPI 1.5282 -- Total Cycles 8964813 ---- Thread 22 ---- PC 5: Stalled ----- 5753205 in-flight CPI 1.5582 -- Total Cycles 8964813 ---- Thread 23 ---- PC 5: Stalled ----- 5844540 in-flight CPI 1.5339 -- Total Cycles 8964813 ---- Thread 24 ---- PC 5: Stalled ----- 6238653 in-flight CPI 1.4370 -- Total Cycles 8964813 ---- Thread 25 ---- PC 5: Stalled ----- 5948591 in-flight CPI 1.5070 -- Total Cycles 8964813 ---- Thread 26 ---- PC 5: Stalled ----- 5514162 in-flight CPI 1.6258 -- Total Cycles 8964813 ---- Thread 27 ---- PC 5: Stalled ----- 5546425 in-flight CPI 1.6163 -- Total Cycles 8964813 ---- Thread 28 ---- PC 5: Stalled ----- 5940002 in-flight CPI 1.5092 -- Total Cycles 8964813 ---- Thread 29 ---- PC 5: Stalled ----- 5830262 in-flight CPI 1.5376 -- Total Cycles 8964813 ---- Thread 30 ---- PC 5: Stalled ----- 6120218 in-flight CPI 1.4648 -- Total Cycles 8964813 ---- Thread 31 ---- PC 5: Stalled ----- 5315908 in-flight CPI 1.6864 -- Total Cycles 8964813 Total CPI 0.0459 , IPC 21.7725 -- Total Cycles 8964813 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 
0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 432362 (2.017296%) FPSUB: 0 (0.000000%) FPMUL: 1984897 (9.261047%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15181188 (70.831732%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 568939 (2.654531%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 
(0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3257256 (15.197564%) DIV: 7638 (0.035637%) FPUN: 0 (0.000000%) FPRSUB: 470 (0.002193%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (214055048 total) ADD%: 8.193 (17537689) SUB%: 0.000 (0) MUL%: 0.000 (207) BITOR%: 1.222 (2615605) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (1162613) FPSUB%: 0.000 (0) FPMUL%: 4.756 (10180517) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (621) FPMAX%: 0.000 (621) LOAD%: 4.952 (10599434) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.020 (41809) FPINV%: 0.000 (0) FPCONV%: 0.000 (685) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (2272031) FPLE%: 0.392 (839363) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (621) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28086) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.966 (6348508) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (1599621) CMPU%: 0.000 (0) RSUB%: 0.000 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.770 (33755809) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) 
ANDNI%: 0.000 (0) ANDI%: 1.232 (2637601) ORI%: 1.254 (2683465) XORI%: 0.000 (0) MULI%: 3.363 (7199602) LW%: 1.193 (2553384) LWI%: 13.933 (29823556) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (644792) SWI%: 4.104 (8785689) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3172666) bged%: 0.000 (0) bgeid%: 0.000 (207) bgtd%: 0.000 (0) bgtid%: 0.322 (689917) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (84114) bned%: 0.000 (0) bneid%: 13.708 (29342315) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.744 (1592268) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (185179) DIV%: 0.000 (414) FPUN%: 1.179 (2522845) FPRSUB%: 3.709 (7940093) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.106 (6649205) FPGE%: 0.792 (1694420) SYNC%: 0.000 (0) NOP%: 8.815 (18868931) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 194 SUB 0 MUL 17 BITOR 4 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 536 FPSUB 0 FPMUL 5095 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 2356077 INTCONV 0 ATOMIC_INC 2 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 103 FPINV 0 FPCONV 20 FPEQ 0 FPNE 0 FPLT 7 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1970 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 2 ADDKC 0 BITXOR 0 ANDN 0 CMP 2175 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3382170 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 774 ORI 590401 XORI 0 MULI 647916 LW 0 LWI 9477676 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1782 DIV 10 FPUN 0 FPRSUB 4 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 
--Average #threads Issuing each cycle: 21.7725 --Total thread-cycles: 286874016 --total thread-cycles issued: 195186117 (68.038965%) --iCache conflicts: 6602208 (2.301431%) --thread*cycles of FU dependence: 16467354 (5.740274%) --thread*cycles of data dependence: 21432750 (7.471137%) --iCache cycles*banks: 286874016 (74.616406% used) Issue breakdown: --thread*cycles of issue worked: 195186117 (68.038967%) --thread*cycles of issue failed: 72818968 (25.383605%) --thread*cycles of issue NOP/other: 18868931 (6.577428%) Number of thread-cycles not ready: 21432750 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 214055048 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 8 4: 8 5: 8 6: 7 7: 8 8: 8 9: 7 10: 7 11: 7 12: 9 13: 8 14: 7 15: 7 16: 8 17: 8 18: 7 19: 8 20: 7 21: 7 22: 8 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 8 30: 9 31: 7 <=== Core 74 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6133340 in-flight CPI 1.4522 -- Total Cycles 8907012 ---- Thread 01 ---- PC 5: Stalled ----- 6675455 in-flight CPI 1.3343 -- Total Cycles 8907012 ---- Thread 02 ---- PC 5: Stalled ----- 6706043 in-flight CPI 1.3282 -- Total Cycles 8907012 ---- Thread 03 ---- PC 5: Stalled ----- 6641486 in-flight CPI 1.3411 -- Total Cycles 8907012 ---- Thread 04 ---- PC 5: Stalled ----- 6695494 in-flight CPI 1.3303 -- Total Cycles 8907012 ---- Thread 05 ---- PC 5: Stalled ----- 6696248 in-flight CPI 1.3301 -- Total Cycles 8907012 ---- Thread 06 ---- PC 5: Stalled ----- 6060279 in-flight CPI 1.4697 -- Total Cycles 8907012 ---- Thread 07 ---- PC 5: Stalled ----- 6460847 in-flight CPI 1.3786 -- Total Cycles 8907012 ---- Thread 08 ---- PC 5: Stalled ----- 6048308 in-flight CPI 1.4726 -- Total Cycles 8907012 ---- Thread 09 ---- PC 5: Stalled ----- 6936012 in-flight CPI 1.2842 -- Total Cycles 8907012 ---- Thread 10 ---- PC 5: Stalled ----- 5833430 in-flight CPI 1.5269 -- Total Cycles 8907012 ---- Thread 11 ---- PC 5: Stalled ----- 6344594 in-flight 
CPI 1.4039 -- Total Cycles 8907012 ---- Thread 12 ---- PC 5: Stalled ----- 6314224 in-flight CPI 1.4106 -- Total Cycles 8907012 ---- Thread 13 ---- PC 5: Stalled ----- 5956905 in-flight CPI 1.4952 -- Total Cycles 8907012 ---- Thread 14 ---- PC 5: Stalled ----- 6465604 in-flight CPI 1.3776 -- Total Cycles 8907012 ---- Thread 15 ---- PC 5: Stalled ----- 6255111 in-flight CPI 1.4240 -- Total Cycles 8907012 ---- Thread 16 ---- PC 5: Stalled ----- 5835617 in-flight CPI 1.5263 -- Total Cycles 8907012 ---- Thread 17 ---- PC 5: Stalled ----- 5771743 in-flight CPI 1.5432 -- Total Cycles 8907012 ---- Thread 18 ---- PC 5: Stalled ----- 6186804 in-flight CPI 1.4397 -- Total Cycles 8907012 ---- Thread 19 ---- PC 5: Stalled ----- 5834616 in-flight CPI 1.5266 -- Total Cycles 8907012 ---- Thread 20 ---- PC 5: Stalled ----- 6212773 in-flight CPI 1.4337 -- Total Cycles 8907012 ---- Thread 21 ---- PC 5: Stalled ----- 5568315 in-flight CPI 1.5996 -- Total Cycles 8907012 ---- Thread 22 ---- PC 5: Stalled ----- 5749339 in-flight CPI 1.5492 -- Total Cycles 8907012 ---- Thread 23 ---- PC 5: Stalled ----- 6023175 in-flight CPI 1.4788 -- Total Cycles 8907012 ---- Thread 24 ---- PC 5: Stalled ----- 6185538 in-flight CPI 1.4400 -- Total Cycles 8907012 ---- Thread 25 ---- PC 5: Stalled ----- 6051781 in-flight CPI 1.4718 -- Total Cycles 8907012 ---- Thread 26 ---- PC 5: Stalled ----- 5554029 in-flight CPI 1.6037 -- Total Cycles 8907012 ---- Thread 27 ---- PC 5: Stalled ----- 5858024 in-flight CPI 1.5205 -- Total Cycles 8907012 ---- Thread 28 ---- PC 5: Stalled ----- 6104190 in-flight CPI 1.4592 -- Total Cycles 8907012 ---- Thread 29 ---- PC 5: Stalled ----- 6226405 in-flight CPI 1.4305 -- Total Cycles 8907012 ---- Thread 30 ---- PC 5: Stalled ----- 5947597 in-flight CPI 1.4976 -- Total Cycles 8907012 ---- Thread 31 ---- PC 5: Stalled ----- 6255071 in-flight CPI 1.4240 -- Total Cycles 8907012 Total CPI 0.0451 , IPC 22.1835 -- Total Cycles 8907012 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 433829 (2.042816%) FPSUB: 0 (0.000000%) FPMUL: 2001722 (9.425719%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14936814 (70.334546%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 577138 (2.717630%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 
(0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3278954 (15.439955%) DIV: 7866 (0.037039%) FPUN: 0 (0.000000%) FPRSUB: 487 (0.002293%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216704539 total) ADD%: 8.177 (17719783) SUB%: 0.000 (0) MUL%: 0.000 (213) BITOR%: 1.226 (2656635) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.540 (1169622) FPSUB%: 0.000 (0) FPMUL%: 4.742 (10277205) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (639) FPMAX%: 0.000 (639) LOAD%: 4.946 (10719021) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (245) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.020 (42421) FPINV%: 0.000 (0) FPCONV%: 0.000 (703) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (2299332) FPLE%: 0.395 (855154) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (639) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28412) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.965 (6425774) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (1617075) CMPU%: 0.000 (0) RSUB%: 0.000 (213) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.773 (34179917) ADDIC%: 0.000 (0) ADDIK%: 
0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.231 (2667739) ORI%: 1.254 (2717517) XORI%: 0.000 (0) MULI%: 3.365 (7292651) LW%: 1.193 (2584532) LWI%: 13.943 (30214495) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (652788) SWI%: 4.106 (8896903) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3211140) bged%: 0.000 (0) bgeid%: 0.000 (213) bgtd%: 0.000 (0) bgtid%: 0.323 (699108) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (86415) bned%: 0.000 (0) bneid%: 13.716 (29723749) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.744 (1612597) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (186343) DIV%: 0.000 (426) FPUN%: 1.184 (2564756) FPRSUB%: 3.705 (8029564) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.107 (6733746) FPGE%: 0.794 (1720613) SYNC%: 0.000 (0) NOP%: 8.821 (19115503) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 156 SUB 0 MUL 19 BITOR 5 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 524 FPSUB 0 FPMUL 5048 FPCMPLT 0 FPMIN 0 FPMAX 413 LOAD 2333611 INTCONV 0 ATOMIC_INC 4 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 129 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 7 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1868 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2283 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3427218 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 854 ORI 591276 XORI 0 MULI 653957 LW 0 LWI 9599914 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 
rtsd 0 FPDIV 1765 DIV 15 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.1835 --Total thread-cycles: 285024384 --total thread-cycles issued: 197589036 (69.323557%) --iCache conflicts: 6665018 (2.338403%) --thread*cycles of FU dependence: 16619102 (5.830765%) --thread*cycles of data dependence: 21236810 (7.450875%) --iCache cycles*banks: 285024384 (76.030187% used) Issue breakdown: --thread*cycles of issue worked: 197589036 (69.323555%) --thread*cycles of issue failed: 68319845 (23.969825%) --thread*cycles of issue NOP/other: 19115503 (6.706620%) Number of thread-cycles not ready: 21236810 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216704539 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 8 5: 8 6: 7 7: 10 8: 8 9: 8 10: 7 11: 8 12: 8 13: 7 14: 8 15: 8 16: 7 17: 7 18: 7 19: 8 20: 8 21: 7 22: 8 23: 7 24: 8 25: 7 26: 8 27: 8 28: 7 29: 7 30: 7 31: 7 <=== Core 75 ===> ---- Thread 00 ---- PC 5: Stalled ----- 5883854 in-flight CPI 1.5494 -- Total Cycles 9116566 ---- Thread 01 ---- PC 5: Stalled ----- 6320693 in-flight CPI 1.4423 -- Total Cycles 9116566 ---- Thread 02 ---- PC 5: Stalled ----- 6249896 in-flight CPI 1.4587 -- Total Cycles 9116566 ---- Thread 03 ---- PC 5: Stalled ----- 6005830 in-flight CPI 1.5179 -- Total Cycles 9116566 ---- Thread 04 ---- PC 5: Stalled ----- 5951832 in-flight CPI 1.5317 -- Total Cycles 9116566 ---- Thread 05 ---- PC 5: Stalled ----- 6639243 in-flight CPI 1.3731 -- Total Cycles 9116566 ---- Thread 06 ---- PC 5: Stalled ----- 6261949 in-flight CPI 1.4559 -- Total Cycles 9116566 ---- Thread 07 ---- PC 5: Stalled ----- 6339839 in-flight CPI 1.4380 -- Total Cycles 9116566 ---- Thread 08 ---- PC 5: Stalled ----- 6088764 in-flight CPI 1.4973 -- Total Cycles 9116566 ---- Thread 09 ---- PC 5: Stalled ----- 5968459 in-flight CPI 1.5275 -- Total Cycles 9116566 ---- Thread 10 ---- PC 5: Stalled ----- 
6576901 in-flight CPI 1.3861 -- Total Cycles 9116566 ---- Thread 11 ---- PC 5: Stalled ----- 5958273 in-flight CPI 1.5301 -- Total Cycles 9116566 ---- Thread 12 ---- PC 5: Stalled ----- 6259540 in-flight CPI 1.4564 -- Total Cycles 9116566 ---- Thread 13 ---- PC 5: Stalled ----- 6401884 in-flight CPI 1.4240 -- Total Cycles 9116566 ---- Thread 14 ---- PC 5: Stalled ----- 6513143 in-flight CPI 1.3997 -- Total Cycles 9116566 ---- Thread 15 ---- PC 5: Stalled ----- 6528211 in-flight CPI 1.3965 -- Total Cycles 9116566 ---- Thread 16 ---- PC 5: Stalled ----- 6845667 in-flight CPI 1.3317 -- Total Cycles 9116566 ---- Thread 17 ---- PC 5: Stalled ----- 5761614 in-flight CPI 1.5823 -- Total Cycles 9116566 ---- Thread 18 ---- PC 5: Stalled ----- 6608128 in-flight CPI 1.3796 -- Total Cycles 9116566 ---- Thread 19 ---- PC 5: Stalled ----- 6240211 in-flight CPI 1.4609 -- Total Cycles 9116566 ---- Thread 20 ---- PC 5: Stalled ----- 5777088 in-flight CPI 1.5781 -- Total Cycles 9116566 ---- Thread 21 ---- PC 5: Stalled ----- 6612565 in-flight CPI 1.3787 -- Total Cycles 9116566 ---- Thread 22 ---- PC 5: Stalled ----- 5874054 in-flight CPI 1.5520 -- Total Cycles 9116566 ---- Thread 23 ---- PC 5: Stalled ----- 6035936 in-flight CPI 1.5104 -- Total Cycles 9116566 ---- Thread 24 ---- PC 5: Stalled ----- 5896584 in-flight CPI 1.5461 -- Total Cycles 9116566 ---- Thread 25 ---- PC 5: Stalled ----- 6749439 in-flight CPI 1.3507 -- Total Cycles 9116566 ---- Thread 26 ---- PC 5: Stalled ----- 6261887 in-flight CPI 1.4559 -- Total Cycles 9116566 ---- Thread 27 ---- PC 5: Stalled ----- 5466902 in-flight CPI 1.6676 -- Total Cycles 9116566 ---- Thread 28 ---- PC 5: Stalled ----- 5431429 in-flight CPI 1.6785 -- Total Cycles 9116566 ---- Thread 29 ---- PC 5: Stalled ----- 5910777 in-flight CPI 1.5424 -- Total Cycles 9116566 ---- Thread 30 ---- PC 5: Stalled ----- 5817991 in-flight CPI 1.5670 -- Total Cycles 9116566 ---- Thread 31 ---- PC 5: Stalled ----- 6300669 in-flight CPI 1.4469 -- Total Cycles 
9116566 Total CPI 0.0462 , IPC 21.6682 -- Total Cycles 9116566 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 439955 (2.030689%) FPSUB: 0 (0.000000%) FPMUL: 2012957 (9.291152%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15363132 (70.911200%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 556878 (2.570367%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) 
sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3284284 (15.159182%) DIV: 7632 (0.035227%) FPUN: 0 (0.000000%) FPRSUB: 473 (0.002183%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216640485 total) ADD%: 8.174 (17708942) SUB%: 0.000 (0) MUL%: 0.000 (207) BITOR%: 1.231 (2666600) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (1176584) FPSUB%: 0.000 (0) FPMUL%: 4.763 (10318983) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (621) FPMAX%: 0.000 (621) LOAD%: 4.949 (10721433) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (40977) FPINV%: 0.000 (0) FPCONV%: 0.000 (685) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (2300771) FPLE%: 0.391 (848038) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (621) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27530) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.965 (6422352) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (1622287) CMPU%: 0.000 (0) RSUB%: 0.000 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 
(0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.771 (34166566) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2669662) ORI%: 1.265 (2740647) XORI%: 0.000 (0) MULI%: 3.360 (7279042) LW%: 1.192 (2582728) LWI%: 13.917 (30150421) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (652783) SWI%: 4.095 (8871227) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.481 (3208723) bged%: 0.000 (0) bgeid%: 0.000 (207) bgtd%: 0.000 (0) bgtid%: 0.323 (699576) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (85504) bned%: 0.000 (0) bneid%: 13.718 (29718747) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.739 (1600790) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (186860) DIV%: 0.000 (414) FPUN%: 1.187 (2571267) FPRSUB%: 3.712 (8042707) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (3) FPGT%: 3.102 (6720316) FPGE%: 0.800 (1733889) SYNC%: 0.000 (0) NOP%: 8.817 (19100612) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 153 SUB 0 MUL 36 BITOR 1 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 492 FPSUB 0 FPMUL 5366 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 2366507 INTCONV 0 ATOMIC_INC 11 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 86 FPINV 0 FPCONV 12 FPEQ 0 FPNE 0 FPLT 5 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1884 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 1 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2127 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3424323 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 855 ORI 601519 XORI 0 MULI 648246 LW 0 LWI 9586804 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 
bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1693 DIV 11 FPUN 0 FPRSUB 6 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6682 --Total thread-cycles: 291730112 --total thread-cycles issued: 197539873 (67.713227%) --iCache conflicts: 6627326 (2.271732%) --thread*cycles of FU dependence: 16640546 (5.704089%) --thread*cycles of data dependence: 21665311 (7.426491%) --iCache cycles*banks: 291730112 (74.260595% used) Issue breakdown: --thread*cycles of issue worked: 197539873 (67.713227%) --thread*cycles of issue failed: 75089627 (25.739416%) --thread*cycles of issue NOP/other: 19100612 (6.547357%) Number of thread-cycles not ready: 21665311 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216640485 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 8 4: 7 5: 8 6: 8 7: 8 8: 7 9: 7 10: 8 11: 7 12: 8 13: 7 14: 7 15: 7 16: 8 17: 7 18: 8 19: 8 20: 7 21: 11 22: 7 23: 8 24: 7 25: 7 26: 7 27: 6 28: 6 29: 8 30: 8 31: 7 <=== Core 76 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6251626 in-flight CPI 1.4798 -- Total Cycles 9250879 ---- Thread 01 ---- PC 5: Stalled ----- 6053317 in-flight CPI 1.5282 -- Total Cycles 9250879 ---- Thread 02 ---- PC 5: Stalled ----- 5832730 in-flight CPI 1.5860 -- Total Cycles 9250879 ---- Thread 03 ---- PC 5: Stalled ----- 6100284 in-flight CPI 1.5165 -- Total Cycles 9250879 ---- Thread 04 ---- PC 5: Stalled ----- 6250993 in-flight CPI 1.4799 -- Total Cycles 9250879 ---- Thread 05 ---- PC 5: Stalled ----- 6714616 in-flight CPI 1.3777 -- Total Cycles 9250879 ---- Thread 06 ---- PC 5: Stalled ----- 6530849 in-flight CPI 1.4165 -- Total Cycles 9250879 ---- Thread 07 ---- PC 5: Stalled ----- 6279328 in-flight CPI 1.4732 -- Total Cycles 9250879 ---- Thread 08 ---- PC 5: Stalled ----- 6078109 in-flight CPI 1.5220 -- Total Cycles 9250879 ---- Thread 09 ---- PC 5: Stalled 
----- 5978614 in-flight CPI 1.5473 -- Total Cycles 9250879 ---- Thread 10 ---- PC 5: Stalled ----- 6487508 in-flight CPI 1.4259 -- Total Cycles 9250879 ---- Thread 11 ---- PC 5: Stalled ----- 5980259 in-flight CPI 1.5469 -- Total Cycles 9250879 ---- Thread 12 ---- PC 5: Stalled ----- 6344292 in-flight CPI 1.4581 -- Total Cycles 9250879 ---- Thread 13 ---- PC 5: Stalled ----- 5953001 in-flight CPI 1.5540 -- Total Cycles 9250879 ---- Thread 14 ---- PC 5: Stalled ----- 6074207 in-flight CPI 1.5230 -- Total Cycles 9250879 ---- Thread 15 ---- PC 5: Stalled ----- 7078953 in-flight CPI 1.3068 -- Total Cycles 9250879 ---- Thread 16 ---- PC 5: Stalled ----- 6162488 in-flight CPI 1.5012 -- Total Cycles 9250879 ---- Thread 17 ---- PC 5: Stalled ----- 6042041 in-flight CPI 1.5311 -- Total Cycles 9250879 ---- Thread 18 ---- PC 5: Stalled ----- 6033593 in-flight CPI 1.5332 -- Total Cycles 9250879 ---- Thread 19 ---- PC 5: Stalled ----- 6157777 in-flight CPI 1.5023 -- Total Cycles 9250879 ---- Thread 20 ---- PC 5: Stalled ----- 5667295 in-flight CPI 1.6323 -- Total Cycles 9250879 ---- Thread 21 ---- PC 5: Stalled ----- 5797233 in-flight CPI 1.5957 -- Total Cycles 9250879 ---- Thread 22 ---- PC 5: Stalled ----- 5935465 in-flight CPI 1.5586 -- Total Cycles 9250879 ---- Thread 23 ---- PC 5: Stalled ----- 6547798 in-flight CPI 1.4128 -- Total Cycles 9250879 ---- Thread 24 ---- PC 5: Stalled ----- 5472074 in-flight CPI 1.6906 -- Total Cycles 9250879 ---- Thread 25 ---- PC 5: Stalled ----- 6516150 in-flight CPI 1.4197 -- Total Cycles 9250879 ---- Thread 26 ---- PC 5: Stalled ----- 6038746 in-flight CPI 1.5319 -- Total Cycles 9250879 ---- Thread 27 ---- PC 5: Stalled ----- 5533511 in-flight CPI 1.6718 -- Total Cycles 9250879 ---- Thread 28 ---- PC 5: Stalled ----- 6028080 in-flight CPI 1.5346 -- Total Cycles 9250879 ---- Thread 29 ---- PC 5: Stalled ----- 6027090 in-flight CPI 1.5349 -- Total Cycles 9250879 ---- Thread 30 ---- PC 5: Stalled ----- 6631418 in-flight CPI 1.3950 -- Total 
Cycles 9250879 ---- Thread 31 ---- PC 5: Stalled ----- 5570287 in-flight CPI 1.6607 -- Total Cycles 9250879 Total CPI 0.0472 , IPC 21.2034 -- Total Cycles 9250879 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 429299 (1.994137%) FPSUB: 0 (0.000000%) FPMUL: 1983452 (9.213335%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15306089 (71.098327%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 568127 (2.639007%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 
(0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3233092 (15.018038%) DIV: 7531 (0.034982%) FPUN: 0 (0.000000%) FPRSUB: 468 (0.002174%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (215108366 total) ADD%: 8.220 (17682379) SUB%: 0.000 (0) MUL%: 0.000 (204) BITOR%: 1.218 (2620819) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.537 (1155805) FPSUB%: 0.000 (0) FPMUL%: 4.741 (10197732) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (612) FPMAX%: 0.000 (612) LOAD%: 4.946 (10638911) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41822) FPINV%: 0.000 (0) FPCONV%: 0.000 (676) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (2279646) FPLE%: 0.392 (843558) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (612) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (27914) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.968 (6385438) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 
(1605375) CMPU%: 0.000 (0) RSUB%: 0.000 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.768 (33917419) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2651570) ORI%: 1.248 (2685298) XORI%: 0.000 (0) MULI%: 3.366 (7240635) LW%: 1.194 (2568148) LWI%: 13.950 (30006562) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (648740) SWI%: 4.109 (8839076) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3190694) bged%: 0.000 (0) bgeid%: 0.000 (204) bgtd%: 0.000 (0) bgtid%: 0.323 (694328) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (83765) bned%: 0.000 (0) bneid%: 13.703 (29476540) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.743 (1598268) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.085 (183757) DIV%: 0.000 (408) FPUN%: 1.176 (2528816) FPRSUB%: 3.706 (7972252) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.108 (6685057) FPGE%: 0.789 (1696155) SYNC%: 0.000 (0) NOP%: 8.813 (18958022) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 174 SUB 0 MUL 19 BITOR 3 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 516 FPSUB 0 FPMUL 4849 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 2354633 INTCONV 0 ATOMIC_INC 5 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 118 FPINV 0 FPCONV 10 FPEQ 0 FPNE 0 FPLT 8 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2044 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2155 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3403090 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 769 ORI 585060 XORI 0 MULI 
645181 LW 0 LWI 9533329 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1572 DIV 12 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.2034 --Total thread-cycles: 296028128 --total thread-cycles issued: 196150344 (66.260709%) --iCache conflicts: 6572152 (2.220111%) --thread*cycles of FU dependence: 16533946 (5.585262%) --thread*cycles of data dependence: 21528058 (7.272302%) --iCache cycles*banks: 296028128 (72.664851% used) Issue breakdown: --thread*cycles of issue worked: 196150344 (66.260712%) --thread*cycles of issue failed: 80919762 (27.335160%) --thread*cycles of issue NOP/other: 18958022 (6.404129%) Number of thread-cycles not ready: 21528058 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 215108366 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 7 4: 9 5: 8 6: 7 7: 8 8: 7 9: 7 10: 8 11: 7 12: 8 13: 7 14: 7 15: 8 16: 10 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 6 25: 8 26: 7 27: 7 28: 7 29: 7 30: 8 31: 7 <=== Core 77 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6088630 in-flight CPI 1.4677 -- Total Cycles 8936192 ---- Thread 01 ---- PC 5: Stalled ----- 5845434 in-flight CPI 1.5287 -- Total Cycles 8936192 ---- Thread 02 ---- PC 5: Stalled ----- 5967575 in-flight CPI 1.4975 -- Total Cycles 8936192 ---- Thread 03 ---- PC 5: Stalled ----- 6291384 in-flight CPI 1.4204 -- Total Cycles 8936192 ---- Thread 04 ---- PC 5: Stalled ----- 6200847 in-flight CPI 1.4411 -- Total Cycles 8936192 ---- Thread 05 ---- PC 5: Stalled ----- 6643044 in-flight CPI 1.3452 -- Total Cycles 8936192 ---- Thread 06 ---- PC 5: Stalled ----- 6140672 in-flight CPI 1.4552 -- Total Cycles 8936192 ---- Thread 07 ---- PC 5: Stalled ----- 6352911 in-flight CPI 1.4066 -- Total Cycles 8936192 ---- 
Thread 08 ---- PC 5: Stalled ----- 5842203 in-flight CPI 1.5296 -- Total Cycles 8936192 ---- Thread 09 ---- PC 5: Stalled ----- 6931986 in-flight CPI 1.2891 -- Total Cycles 8936192 ---- Thread 10 ---- PC 5: Stalled ----- 6498317 in-flight CPI 1.3752 -- Total Cycles 8936192 ---- Thread 11 ---- PC 5: Stalled ----- 6808788 in-flight CPI 1.3124 -- Total Cycles 8936192 ---- Thread 12 ---- PC 5: Stalled ----- 6443747 in-flight CPI 1.3868 -- Total Cycles 8936192 ---- Thread 13 ---- PC 5: Stalled ----- 6481025 in-flight CPI 1.3788 -- Total Cycles 8936192 ---- Thread 14 ---- PC 5: Stalled ----- 6190433 in-flight CPI 1.4435 -- Total Cycles 8936192 ---- Thread 15 ---- PC 5: Stalled ----- 6790030 in-flight CPI 1.3161 -- Total Cycles 8936192 ---- Thread 16 ---- PC 5: Stalled ----- 6108903 in-flight CPI 1.4628 -- Total Cycles 8936192 ---- Thread 17 ---- PC 5: Stalled ----- 6645568 in-flight CPI 1.3447 -- Total Cycles 8936192 ---- Thread 18 ---- PC 5: Stalled ----- 5678073 in-flight CPI 1.5738 -- Total Cycles 8936192 ---- Thread 19 ---- PC 5: Stalled ----- 5901946 in-flight CPI 1.5141 -- Total Cycles 8936192 ---- Thread 20 ---- PC 5: Stalled ----- 6213366 in-flight CPI 1.4382 -- Total Cycles 8936192 ---- Thread 21 ---- PC 5: Stalled ----- 5869055 in-flight CPI 1.5226 -- Total Cycles 8936192 ---- Thread 22 ---- PC 5: Stalled ----- 5944486 in-flight CPI 1.5033 -- Total Cycles 8936192 ---- Thread 23 ---- PC 5: Stalled ----- 6305795 in-flight CPI 1.4171 -- Total Cycles 8936192 ---- Thread 24 ---- PC 5: Stalled ----- 6412238 in-flight CPI 1.3936 -- Total Cycles 8936192 ---- Thread 25 ---- PC 5: Stalled ----- 6128164 in-flight CPI 1.4582 -- Total Cycles 8936192 ---- Thread 26 ---- PC 5: Stalled ----- 5811730 in-flight CPI 1.5376 -- Total Cycles 8936192 ---- Thread 27 ---- PC 5: Stalled ----- 6150064 in-flight CPI 1.4530 -- Total Cycles 8936192 ---- Thread 28 ---- PC 5: Stalled ----- 5772617 in-flight CPI 1.5480 -- Total Cycles 8936192 ---- Thread 29 ---- PC 5: Stalled ----- 6392992 
in-flight CPI 1.3978 -- Total Cycles 8936192 ---- Thread 30 ---- PC 5: Stalled ----- 5679442 in-flight CPI 1.5734 -- Total Cycles 8936192 ---- Thread 31 ---- PC 5: Stalled ----- 5953378 in-flight CPI 1.5010 -- Total Cycles 8936192 Total CPI 0.0450 , IPC 22.2114 -- Total Cycles 8936192 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 438788 (2.018559%) FPSUB: 0 (0.000000%) FPMUL: 2015252 (9.270774%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15404476 (70.865292%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 570117 (2.622712%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 
0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3300979 (15.185511%) DIV: 7605 (0.034985%) FPUN: 0 (0.000000%) FPRSUB: 471 (0.002167%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217673903 total) ADD%: 8.192 (17832200) SUB%: 0.000 (0) MUL%: 0.000 (206) BITOR%: 1.225 (2666678) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.541 (1178630) FPSUB%: 0.000 (0) FPMUL%: 4.752 (10344839) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (618) FPMAX%: 0.000 (618) LOAD%: 4.949 (10772885) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (42027) FPINV%: 0.000 (0) FPCONV%: 0.000 (682) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (2309941) FPLE%: 0.391 (850507) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (618) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28182) COS%: 0.000 (0) SIN%: 0.000 
(0) ADDC%: 0.000 (0) ADDK%: 2.966 (6457053) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1627855) CMPU%: 0.000 (0) RSUB%: 0.000 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.768 (34323507) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.233 (2682974) ORI%: 1.258 (2738944) XORI%: 0.000 (0) MULI%: 3.363 (7320414) LW%: 1.193 (2596926) LWI%: 13.932 (30325547) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.302 (656657) SWI%: 4.104 (8932367) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.482 (3225669) bged%: 0.000 (0) bgeid%: 0.000 (206) bgtd%: 0.000 (0) bgtid%: 0.323 (703340) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.039 (85437) bned%: 0.000 (0) bneid%: 13.711 (29844961) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.740 (1610821) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (187698) DIV%: 0.000 (412) FPUN%: 1.182 (2571974) FPRSUB%: 3.709 (8073376) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.104 (6757682) FPGE%: 0.796 (1732468) SYNC%: 0.000 (0) NOP%: 8.815 (19188442) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 178 SUB 0 MUL 21 BITOR 1 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 563 FPSUB 0 FPMUL 5316 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 2352342 INTCONV 0 ATOMIC_INC 7 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 112 FPINV 0 FPCONV 16 FPEQ 0 FPNE 0 FPLT 5 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1834 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2189 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 
3443158 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 789 ORI 598461 XORI 0 MULI 651461 LW 0 LWI 9636399 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1781 DIV 18 FPUN 0 FPRSUB 2 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.2114 --Total thread-cycles: 285958144 --total thread-cycles issued: 198485461 (69.410667%) --iCache conflicts: 6712758 (2.347462%) --thread*cycles of FU dependence: 16695055 (5.838286%) --thread*cycles of data dependence: 21737688 (7.601703%) --iCache cycles*banks: 285958144 (76.120908% used) Issue breakdown: --thread*cycles of issue worked: 198485461 (69.410669%) --thread*cycles of issue failed: 68284241 (23.879103%) --thread*cycles of issue NOP/other: 19188442 (6.710227%) Number of thread-cycles not ready: 21737688 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217673903 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 8 4: 7 5: 8 6: 7 7: 9 8: 7 9: 8 10: 7 11: 8 12: 8 13: 9 14: 7 15: 9 16: 7 17: 8 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 7 25: 7 26: 7 27: 8 28: 7 29: 7 30: 7 31: 7 <=== Core 78 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6833878 in-flight CPI 1.3287 -- Total Cycles 9080179 ---- Thread 01 ---- PC 5: Stalled ----- 6893592 in-flight CPI 1.3172 -- Total Cycles 9080179 ---- Thread 02 ---- PC 5: Stalled ----- 6240134 in-flight CPI 1.4551 -- Total Cycles 9080179 ---- Thread 03 ---- PC 5: Stalled ----- 6695516 in-flight CPI 1.3562 -- Total Cycles 9080179 ---- Thread 04 ---- PC 5: Stalled ----- 6453491 in-flight CPI 1.4070 -- Total Cycles 9080179 ---- Thread 05 ---- PC 5: Stalled ----- 6052569 in-flight CPI 1.5002 -- Total Cycles 9080179 ---- Thread 06 ---- PC 5: Stalled ----- 5785268 in-flight CPI 1.5695 -- Total Cycles 
9080179 ---- Thread 07 ---- PC 5: Stalled ----- 6338484 in-flight CPI 1.4325 -- Total Cycles 9080179 ---- Thread 08 ---- PC 5: Stalled ----- 6481959 in-flight CPI 1.4008 -- Total Cycles 9080179 ---- Thread 09 ---- PC 5: Stalled ----- 6345455 in-flight CPI 1.4310 -- Total Cycles 9080179 ---- Thread 10 ---- PC 5: Stalled ----- 6153576 in-flight CPI 1.4756 -- Total Cycles 9080179 ---- Thread 11 ---- PC 5: Stalled ----- 6049703 in-flight CPI 1.5009 -- Total Cycles 9080179 ---- Thread 12 ---- PC 5: Stalled ----- 6110303 in-flight CPI 1.4860 -- Total Cycles 9080179 ---- Thread 13 ---- PC 5: Stalled ----- 5767840 in-flight CPI 1.5743 -- Total Cycles 9080179 ---- Thread 14 ---- PC 5: Stalled ----- 6900215 in-flight CPI 1.3159 -- Total Cycles 9080179 ---- Thread 15 ---- PC 5: Stalled ----- 6068616 in-flight CPI 1.4962 -- Total Cycles 9080179 ---- Thread 16 ---- PC 5: Stalled ----- 6905266 in-flight CPI 1.3150 -- Total Cycles 9080179 ---- Thread 17 ---- PC 5: Stalled ----- 6398437 in-flight CPI 1.4191 -- Total Cycles 9080179 ---- Thread 18 ---- PC 5: Stalled ----- 6016623 in-flight CPI 1.5092 -- Total Cycles 9080179 ---- Thread 19 ---- PC 5: Stalled ----- 6558785 in-flight CPI 1.3844 -- Total Cycles 9080179 ---- Thread 20 ---- PC 5: Stalled ----- 6098965 in-flight CPI 1.4888 -- Total Cycles 9080179 ---- Thread 21 ---- PC 5: Stalled ----- 6050163 in-flight CPI 1.5008 -- Total Cycles 9080179 ---- Thread 22 ---- PC 5: Stalled ----- 5547666 in-flight CPI 1.6368 -- Total Cycles 9080179 ---- Thread 23 ---- PC 5: Stalled ----- 6132684 in-flight CPI 1.4806 -- Total Cycles 9080179 ---- Thread 24 ---- PC 5: Stalled ----- 6257835 in-flight CPI 1.4510 -- Total Cycles 9080179 ---- Thread 25 ---- PC 5: Stalled ----- 5507128 in-flight CPI 1.6488 -- Total Cycles 9080179 ---- Thread 26 ---- PC 5: Stalled ----- 5717844 in-flight CPI 1.5880 -- Total Cycles 9080179 ---- Thread 27 ---- PC 5: Stalled ----- 5688992 in-flight CPI 1.5961 -- Total Cycles 9080179 ---- Thread 28 ---- PC 5: Stalled 
----- 5703419 in-flight CPI 1.5921 -- Total Cycles 9080179 ---- Thread 29 ---- PC 5: Stalled ----- 5717159 in-flight CPI 1.5882 -- Total Cycles 9080179 ---- Thread 30 ---- PC 5: Stalled ----- 5845054 in-flight CPI 1.5535 -- Total Cycles 9080179 ---- Thread 31 ---- PC 5: Stalled ----- 5923170 in-flight CPI 1.5330 -- Total Cycles 9080179 Total CPI 0.0460 , IPC 21.7221 -- Total Cycles 9080179 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 433458 (2.035190%) FPSUB: 0 (0.000000%) FPMUL: 1997624 (9.379328%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 15026841 (70.554654%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 569600 (2.674410%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 
(0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3262179 (15.316720%) DIV: 7970 (0.037421%) FPUN: 0 (0.000000%) FPRSUB: 485 (0.002277%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (216313055 total) ADD%: 8.191 (17717326) SUB%: 0.000 (0) MUL%: 0.000 (216) BITOR%: 1.224 (2648040) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (1165890) FPSUB%: 0.000 (0) FPMUL%: 4.745 (10264624) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (648) FPMAX%: 0.000 (648) LOAD%: 4.945 (10695701) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (248) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41844) FPINV%: 0.000 (0) FPCONV%: 0.000 (712) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (2294150) FPLE%: 0.393 (849269) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (648) LOADIMM%: 0.000 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 
0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28182) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.967 (6417783) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (1614537) CMPU%: 0.000 (0) RSUB%: 0.000 (216) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.768 (34109086) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.232 (2664640) ORI%: 1.255 (2715636) XORI%: 0.000 (0) MULI%: 3.365 (7278896) LW%: 1.193 (2581178) LWI%: 13.945 (30164667) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (651959) SWI%: 4.107 (8883589) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.483 (3207132) bged%: 0.000 (0) bgeid%: 0.000 (216) bgtd%: 0.000 (0) bgtid%: 0.323 (697875) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.040 (85675) bned%: 0.000 (0) bneid%: 13.711 (29658331) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.741 (1603869) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.086 (185432) DIV%: 0.000 (432) FPUN%: 1.182 (2556042) FPRSUB%: 3.707 (8019023) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (1) FPGT%: 3.106 (6718356) FPGE%: 0.794 (1717624) SYNC%: 0.000 (0) NOP%: 8.817 (19072618) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 173 SUB 0 MUL 19 BITOR 8 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 524 FPSUB 0 FPMUL 5344 FPCMPLT 0 FPMIN 0 FPMAX 412 LOAD 2317046 INTCONV 0 ATOMIC_INC 11 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 110 FPINV 0 FPCONV 16 FPEQ 0 FPNE 0 FPLT 9 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2064 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 
ADDK 1 ADDKC 0 BITXOR 0 ANDN 0 CMP 2319 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3421302 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 747 ORI 590201 XORI 0 MULI 650000 LW 0 LWI 9583028 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1737 DIV 14 FPUN 0 FPRSUB 6 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7221 --Total thread-cycles: 290565728 --total thread-cycles issued: 197240437 (67.881520%) --iCache conflicts: 6657668 (2.291278%) --thread*cycles of FU dependence: 16575109 (5.704427%) --thread*cycles of data dependence: 21298157 (7.329893%) --iCache cycles*banks: 290565728 (74.445492% used) Issue breakdown: --thread*cycles of issue worked: 197240437 (67.881521%) --thread*cycles of issue failed: 74252673 (25.554519%) --thread*cycles of issue NOP/other: 19072618 (6.563960%) Number of thread-cycles not ready: 21298157 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 216313055 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 10 1: 9 2: 8 3: 9 4: 8 5: 7 6: 8 7: 7 8: 8 9: 9 10: 7 11: 8 12: 7 13: 8 14: 7 15: 7 16: 8 17: 11 18: 7 19: 9 20: 8 21: 7 22: 8 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 79 ===> ---- Thread 00 ---- PC 5: Stalled ----- 6759742 in-flight CPI 1.2922 -- Total Cycles 8734944 ---- Thread 01 ---- PC 5: Stalled ----- 6365256 in-flight CPI 1.3723 -- Total Cycles 8734944 ---- Thread 02 ---- PC 5: Stalled ----- 5952370 in-flight CPI 1.4675 -- Total Cycles 8734944 ---- Thread 03 ---- PC 5: Stalled ----- 6211978 in-flight CPI 1.4061 -- Total Cycles 8734944 ---- Thread 04 ---- PC 5: Stalled ----- 6532829 in-flight CPI 1.3371 -- Total Cycles 8734944 ---- Thread 05 ---- PC 5: Stalled ----- 6630069 in-flight CPI 
1.3175 -- Total Cycles 8734944 ---- Thread 06 ---- PC 5: Stalled ----- 5893649 in-flight CPI 1.4821 -- Total Cycles 8734944 ---- Thread 07 ---- PC 5: Stalled ----- 6703359 in-flight CPI 1.3031 -- Total Cycles 8734944 ---- Thread 08 ---- PC 5: Stalled ----- 6713760 in-flight CPI 1.3010 -- Total Cycles 8734944 ---- Thread 09 ---- PC 5: Stalled ----- 5853522 in-flight CPI 1.4922 -- Total Cycles 8734944 ---- Thread 10 ---- PC 5: Stalled ----- 6706674 in-flight CPI 1.3024 -- Total Cycles 8734944 ---- Thread 11 ---- PC 5: Stalled ----- 6656407 in-flight CPI 1.3123 -- Total Cycles 8734944 ---- Thread 12 ---- PC 5: Stalled ----- 5977710 in-flight CPI 1.4612 -- Total Cycles 8734944 ---- Thread 13 ---- PC 5: Stalled ----- 5856584 in-flight CPI 1.4915 -- Total Cycles 8734944 ---- Thread 14 ---- PC 5: Stalled ----- 5903506 in-flight CPI 1.4796 -- Total Cycles 8734944 ---- Thread 15 ---- PC 5: Stalled ----- 6034991 in-flight CPI 1.4474 -- Total Cycles 8734944 ---- Thread 16 ---- PC 5: Stalled ----- 6212593 in-flight CPI 1.4060 -- Total Cycles 8734944 ---- Thread 17 ---- PC 5: Stalled ----- 6607936 in-flight CPI 1.3219 -- Total Cycles 8734944 ---- Thread 18 ---- PC 5: Stalled ----- 5875340 in-flight CPI 1.4867 -- Total Cycles 8734944 ---- Thread 19 ---- PC 5: Stalled ----- 6133984 in-flight CPI 1.4240 -- Total Cycles 8734944 ---- Thread 20 ---- PC 5: Stalled ----- 6316770 in-flight CPI 1.3828 -- Total Cycles 8734944 ---- Thread 21 ---- PC 5: Stalled ----- 6502440 in-flight CPI 1.3433 -- Total Cycles 8734944 ---- Thread 22 ---- PC 5: Stalled ----- 6162670 in-flight CPI 1.4174 -- Total Cycles 8734944 ---- Thread 23 ---- PC 5: Stalled ----- 5755127 in-flight CPI 1.5178 -- Total Cycles 8734944 ---- Thread 24 ---- PC 5: Stalled ----- 5780322 in-flight CPI 1.5111 -- Total Cycles 8734944 ---- Thread 25 ---- PC 5: Stalled ----- 6460719 in-flight CPI 1.3520 -- Total Cycles 8734944 ---- Thread 26 ---- PC 5: Stalled ----- 5885048 in-flight CPI 1.4843 -- Total Cycles 8734944 ---- Thread 27 
---- PC 5: Stalled ----- 6387839 in-flight CPI 1.3674 -- Total Cycles 8734944 ---- Thread 28 ---- PC 5: Stalled ----- 6251301 in-flight CPI 1.3973 -- Total Cycles 8734944 ---- Thread 29 ---- PC 5: Stalled ----- 5708700 in-flight CPI 1.5301 -- Total Cycles 8734944 ---- Thread 30 ---- PC 5: Stalled ----- 5472314 in-flight CPI 1.5962 -- Total Cycles 8734944 ---- Thread 31 ---- PC 5: Stalled ----- 5663462 in-flight CPI 1.5423 -- Total Cycles 8734944 Total CPI 0.0441 , IPC 22.6595 -- Total Cycles 8734944 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 443258 (2.106267%) FPSUB: 0 (0.000000%) FPMUL: 2024119 (9.618180%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 14668109 (69.699711%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 570239 (2.709654%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 
(0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 3330817 (15.827329%) DIV: 7709 (0.036632%) FPUN: 0 (0.000000%) FPRSUB: 469 (0.002229%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (217073264 total) ADD%: 8.181 (17758796) SUB%: 0.000 (0) MUL%: 0.000 (209) BITOR%: 1.227 (2663657) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (1187248) FPSUB%: 0.000 (0) FPMUL%: 4.770 (10355430) FPCMPLT%: 0.000 (0) FPMIN%: 0.000 (627) FPMAX%: 0.000 (627) LOAD%: 4.949 (10743360) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.000 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (41920) FPINV%: 0.000 (0) FPCONV%: 0.000 (691) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (2310774) FPLE%: 0.392 (851658) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.000 (627) LOADIMM%: 0.000 (32) 
SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.013 (28086) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.960 (6426024) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (1623611) CMPU%: 0.000 (0) RSUB%: 0.000 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.762 (34214577) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.230 (2669937) ORI%: 1.265 (2744960) XORI%: 0.000 (0) MULI%: 3.360 (7293815) LW%: 1.191 (2584436) LWI%: 13.927 (30231721) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.301 (652939) SWI%: 4.097 (8892907) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.479 (3211004) bged%: 0.000 (0) bgeid%: 0.000 (209) bgtd%: 0.000 (0) bgtid%: 0.322 (699589) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.041 (88431) bned%: 0.000 (0) bneid%: 13.711 (29763451) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.738 (1601825) braid%: 0.000 (0) brlid%: 0.000 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.000 (32) FPDIV%: 0.087 (189489) DIV%: 0.000 (418) FPUN%: 1.184 (2569584) FPRSUB%: 3.714 (8063173) FPSQRT%: 0.000 (0) FPNEG%: 0.000 (2) FPGT%: 3.102 (6734406) FPGE%: 0.796 (1728834) SYNC%: 0.000 (0) NOP%: 8.819 (19143666) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 169 SUB 0 MUL 31 BITOR 3 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 509 FPSUB 0 FPMUL 5285 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 2381439 INTCONV 0 ATOMIC_INC 9 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 116 FPINV 0 FPCONV 14 FPEQ 0 FPNE 0 FPLT 5 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2041 
LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 2 ADDKC 0 BITXOR 0 ANDN 0 CMP 2307 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 3428955 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 851 ORI 605719 XORI 0 MULI 657394 LW 0 LWI 9606880 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 1744 DIV 18 FPUN 0 FPRSUB 3 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.6595 --Total thread-cycles: 279518208 --total thread-cycles issued: 197929598 (70.810986%) --iCache conflicts: 6689119 (2.393089%) --thread*cycles of FU dependence: 16693904 (5.972385%) --thread*cycles of data dependence: 21044720 (7.528926%) --iCache cycles*banks: 279518208 (77.659805% used) Issue breakdown: --thread*cycles of issue worked: 197929598 (70.810986%) --thread*cycles of issue failed: 62444944 (22.340206%) --thread*cycles of issue NOP/other: 19143666 (6.848808%) Number of thread-cycles not ready: 21044720 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 217073264 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 8 4: 7 5: 8 6: 8 7: 10 8: 8 9: 8 10: 8 11: 7 12: 7 13: 8 14: 7 15: 7 16: 7 17: 9 18: 7 19: 8 20: 7 21: 9 22: 7 23: 7 24: 7 25: 7 26: 7 27: 8 28: 7 29: 7 30: 7 31: 7 ## Core 0 ## Module Utilization FP AddSub: 12.70 FP MinMax: 0.00 FP Compare: 4.85 Int AddSub: 20.57 FP Mul: 14.17 Int Mul: 39.99 FP InvSqrt: 0.46 FP Div: 2.07 Conversion Unit: 0.00 ## Core 1 ## Module Utilization FP AddSub: 13.01 FP MinMax: 0.00 FP Compare: 4.99 Int AddSub: 21.13 FP Mul: 14.51 Int Mul: 41.08 FP InvSqrt: 0.47 FP Div: 2.11 Conversion Unit: 0.00 ## Core 2 ## Module Utilization FP AddSub: 13.03 FP MinMax: 0.00 
FP Compare: 4.97 Int AddSub: 21.00 FP Mul: 14.56 Int Mul: 40.82 FP InvSqrt: 0.47 FP Div: 2.15 Conversion Unit: 0.00 ## Core 3 ## Module Utilization FP AddSub: 12.38 FP MinMax: 0.00 FP Compare: 4.75 Int AddSub: 20.12 FP Mul: 13.80 Int Mul: 39.10 FP InvSqrt: 0.44 FP Div: 2.01 Conversion Unit: 0.00 ## Core 4 ## Module Utilization FP AddSub: 12.63 FP MinMax: 0.00 FP Compare: 4.83 Int AddSub: 20.47 FP Mul: 14.09 Int Mul: 39.81 FP InvSqrt: 0.46 FP Div: 2.05 Conversion Unit: 0.00 ## Core 5 ## Module Utilization FP AddSub: 12.91 FP MinMax: 0.00 FP Compare: 4.95 Int AddSub: 20.89 FP Mul: 14.41 Int Mul: 40.59 FP InvSqrt: 0.47 FP Div: 2.11 Conversion Unit: 0.00 ## Core 6 ## Module Utilization FP AddSub: 12.86 FP MinMax: 0.00 FP Compare: 4.93 Int AddSub: 20.83 FP Mul: 14.35 Int Mul: 40.49 FP InvSqrt: 0.47 FP Div: 2.10 Conversion Unit: 0.00 ## Core 7 ## Module Utilization FP AddSub: 12.62 FP MinMax: 0.00 FP Compare: 4.85 Int AddSub: 20.50 FP Mul: 14.06 Int Mul: 39.90 FP InvSqrt: 0.46 FP Div: 2.04 Conversion Unit: 0.00 ## Core 8 ## Module Utilization FP AddSub: 12.73 FP MinMax: 0.00 FP Compare: 4.86 Int AddSub: 20.57 FP Mul: 14.21 Int Mul: 40.00 FP InvSqrt: 0.45 FP Div: 2.08 Conversion Unit: 0.00 ## Core 9 ## Module Utilization FP AddSub: 12.81 FP MinMax: 0.00 FP Compare: 4.87 Int AddSub: 20.61 FP Mul: 14.34 Int Mul: 40.01 FP InvSqrt: 0.45 FP Div: 2.12 Conversion Unit: 0.00 ## Core 10 ## Module Utilization FP AddSub: 12.78 FP MinMax: 0.00 FP Compare: 4.91 Int AddSub: 20.75 FP Mul: 14.24 Int Mul: 40.38 FP InvSqrt: 0.46 FP Div: 2.07 Conversion Unit: 0.00 ## Core 11 ## Module Utilization FP AddSub: 12.66 FP MinMax: 0.00 FP Compare: 4.86 Int AddSub: 20.49 FP Mul: 14.11 Int Mul: 39.89 FP InvSqrt: 0.46 FP Div: 2.06 Conversion Unit: 0.00 ## Core 12 ## Module Utilization FP AddSub: 12.75 FP MinMax: 0.00 FP Compare: 4.90 Int AddSub: 20.74 FP Mul: 14.20 Int Mul: 40.37 FP InvSqrt: 0.46 FP Div: 2.05 Conversion Unit: 0.00 ## Core 13 ## Module Utilization FP AddSub: 12.58 FP MinMax: 0.00 FP 
Compare: 4.80 Int AddSub: 20.34 FP Mul: 14.04 Int Mul: 39.53 FP InvSqrt: 0.45 FP Div: 2.06 Conversion Unit: 0.00 ## Core 14 ## Module Utilization FP AddSub: 12.93 FP MinMax: 0.00 FP Compare: 4.94 Int AddSub: 20.93 FP Mul: 14.45 Int Mul: 40.64 FP InvSqrt: 0.46 FP Div: 2.11 Conversion Unit: 0.00 ## Core 15 ## Module Utilization FP AddSub: 13.07 FP MinMax: 0.00 FP Compare: 5.00 Int AddSub: 21.15 FP Mul: 14.59 Int Mul: 41.09 FP InvSqrt: 0.47 FP Div: 2.14 Conversion Unit: 0.00 ## Core 16 ## Module Utilization FP AddSub: 12.72 FP MinMax: 0.00 FP Compare: 4.87 Int AddSub: 20.57 FP Mul: 14.21 Int Mul: 39.96 FP InvSqrt: 0.46 FP Div: 2.08 Conversion Unit: 0.00 ## Core 17 ## Module Utilization FP AddSub: 12.78 FP MinMax: 0.00 FP Compare: 4.88 Int AddSub: 20.66 FP Mul: 14.26 Int Mul: 40.14 FP InvSqrt: 0.46 FP Div: 2.10 Conversion Unit: 0.00 ## Core 18 ## Module Utilization FP AddSub: 13.16 FP MinMax: 0.00 FP Compare: 5.04 Int AddSub: 21.38 FP Mul: 14.67 Int Mul: 41.59 FP InvSqrt: 0.47 FP Div: 2.13 Conversion Unit: 0.00 ## Core 19 ## Module Utilization FP AddSub: 12.72 FP MinMax: 0.00 FP Compare: 4.86 Int AddSub: 20.57 FP Mul: 14.21 Int Mul: 39.97 FP InvSqrt: 0.46 FP Div: 2.09 Conversion Unit: 0.00 ## Core 20 ## Module Utilization FP AddSub: 12.66 FP MinMax: 0.00 FP Compare: 4.82 Int AddSub: 20.46 FP Mul: 14.16 Int Mul: 39.74 FP InvSqrt: 0.45 FP Div: 2.08 Conversion Unit: 0.00 ## Core 21 ## Module Utilization FP AddSub: 12.80 FP MinMax: 0.00 FP Compare: 4.87 Int AddSub: 20.59 FP Mul: 14.32 Int Mul: 39.95 FP InvSqrt: 0.46 FP Div: 2.14 Conversion Unit: 0.00 ## Core 22 ## Module Utilization FP AddSub: 12.72 FP MinMax: 0.00 FP Compare: 4.85 Int AddSub: 20.56 FP Mul: 14.21 Int Mul: 39.93 FP InvSqrt: 0.46 FP Div: 2.08 Conversion Unit: 0.00 ## Core 23 ## Module Utilization FP AddSub: 12.40 FP MinMax: 0.00 FP Compare: 4.76 Int AddSub: 20.17 FP Mul: 13.81 Int Mul: 39.24 FP InvSqrt: 0.45 FP Div: 2.01 Conversion Unit: 0.00 ## Core 24 ## Module Utilization FP AddSub: 12.81 FP MinMax: 0.00 
FP Compare: 4.92 Int AddSub: 20.82 FP Mul: 14.27 Int Mul: 40.55 FP InvSqrt: 0.47 FP Div: 2.07 Conversion Unit: 0.00 ## Core 25 ## Module Utilization FP AddSub: 12.75 FP MinMax: 0.00 FP Compare: 4.88 Int AddSub: 20.68 FP Mul: 14.22 Int Mul: 40.18 FP InvSqrt: 0.46 FP Div: 2.07 Conversion Unit: 0.00 ## Core 26 ## Module Utilization FP AddSub: 12.98 FP MinMax: 0.00 FP Compare: 4.96 Int AddSub: 21.01 FP Mul: 14.49 Int Mul: 40.81 FP InvSqrt: 0.47 FP Div: 2.12 Conversion Unit: 0.00 ## Core 27 ## Module Utilization FP AddSub: 12.80 FP MinMax: 0.00 FP Compare: 4.90 Int AddSub: 20.77 FP Mul: 14.27 Int Mul: 40.38 FP InvSqrt: 0.46 FP Div: 2.07 Conversion Unit: 0.00 ## Core 28 ## Module Utilization FP AddSub: 13.02 FP MinMax: 0.00 FP Compare: 4.98 Int AddSub: 21.09 FP Mul: 14.52 Int Mul: 40.99 FP InvSqrt: 0.47 FP Div: 2.13 Conversion Unit: 0.00 ## Core 29 ## Module Utilization FP AddSub: 12.80 FP MinMax: 0.00 FP Compare: 4.90 Int AddSub: 20.76 FP Mul: 14.28 Int Mul: 40.35 FP InvSqrt: 0.45 FP Div: 2.08 Conversion Unit: 0.00 ## Core 30 ## Module Utilization FP AddSub: 12.76 FP MinMax: 0.00 FP Compare: 4.89 Int AddSub: 20.69 FP Mul: 14.23 Int Mul: 40.21 FP InvSqrt: 0.46 FP Div: 2.07 Conversion Unit: 0.00 ## Core 31 ## Module Utilization FP AddSub: 12.56 FP MinMax: 0.00 FP Compare: 4.82 Int AddSub: 20.39 FP Mul: 13.99 Int Mul: 39.66 FP InvSqrt: 0.45 FP Div: 2.03 Conversion Unit: 0.00 ## Core 32 ## Module Utilization FP AddSub: 12.63 FP MinMax: 0.00 FP Compare: 4.80 Int AddSub: 20.32 FP Mul: 14.13 Int Mul: 39.42 FP InvSqrt: 0.46 FP Div: 2.11 Conversion Unit: 0.00 ## Core 33 ## Module Utilization FP AddSub: 12.76 FP MinMax: 0.00 FP Compare: 4.87 Int AddSub: 20.64 FP Mul: 14.24 Int Mul: 40.11 FP InvSqrt: 0.46 FP Div: 2.09 Conversion Unit: 0.00 ## Core 34 ## Module Utilization FP AddSub: 12.67 FP MinMax: 0.00 FP Compare: 4.84 Int AddSub: 20.48 FP Mul: 14.14 Int Mul: 39.80 FP InvSqrt: 0.46 FP Div: 2.08 Conversion Unit: 0.00 ## Core 35 ## Module Utilization FP AddSub: 12.73 FP MinMax: 
0.00 FP Compare: 4.87 Int AddSub: 20.61 FP Mul: 14.21 Int Mul: 40.11 FP InvSqrt: 0.45 FP Div: 2.07 Conversion Unit: 0.00 ## Core 36 ## Module Utilization FP AddSub: 12.40 FP MinMax: 0.00 FP Compare: 4.75 Int AddSub: 20.11 FP Mul: 13.83 Int Mul: 39.09 FP InvSqrt: 0.45 FP Div: 2.01 Conversion Unit: 0.00 ## Core 37 ## Module Utilization FP AddSub: 12.85 FP MinMax: 0.00 FP Compare: 4.92 Int AddSub: 20.82 FP Mul: 14.35 Int Mul: 40.41 FP InvSqrt: 0.46 FP Div: 2.10 Conversion Unit: 0.00 ## Core 38 ## Module Utilization FP AddSub: 12.81 FP MinMax: 0.00 FP Compare: 4.93 Int AddSub: 20.85 FP Mul: 14.26 Int Mul: 40.60 FP InvSqrt: 0.47 FP Div: 2.06 Conversion Unit: 0.00 ## Core 39 ## Module Utilization FP AddSub: 12.80 FP MinMax: 0.00 FP Compare: 4.88 Int AddSub: 20.64 FP Mul: 14.30 Int Mul: 40.09 FP InvSqrt: 0.46 FP Div: 2.11 Conversion Unit: 0.00 ## Core 40 ## Module Utilization FP AddSub: 13.14 FP MinMax: 0.00 FP Compare: 5.03 Int AddSub: 21.26 FP Mul: 14.67 Int Mul: 41.32 FP InvSqrt: 0.47 FP Div: 2.15 Conversion Unit: 0.00 ## Core 41 ## Module Utilization FP AddSub: 12.71 FP MinMax: 0.00 FP Compare: 4.86 Int AddSub: 20.61 FP Mul: 14.19 Int Mul: 40.04 FP InvSqrt: 0.46 FP Div: 2.07 Conversion Unit: 0.00 ## Core 42 ## Module Utilization FP AddSub: 12.62 FP MinMax: 0.00 FP Compare: 4.83 Int AddSub: 20.38 FP Mul: 14.10 Int Mul: 39.58 FP InvSqrt: 0.45 FP Div: 2.07 Conversion Unit: 0.00 ## Core 43 ## Module Utilization FP AddSub: 12.83 FP MinMax: 0.00 FP Compare: 4.92 Int AddSub: 20.82 FP Mul: 14.30 Int Mul: 40.51 FP InvSqrt: 0.46 FP Div: 2.08 Conversion Unit: 0.00 ## Core 44 ## Module Utilization FP AddSub: 12.69 FP MinMax: 0.00 FP Compare: 4.85 Int AddSub: 20.56 FP Mul: 14.15 Int Mul: 39.98 FP InvSqrt: 0.45 FP Div: 2.06 Conversion Unit: 0.00 ## Core 45 ## Module Utilization FP AddSub: 12.92 FP MinMax: 0.00 FP Compare: 4.95 Int AddSub: 20.95 FP Mul: 14.40 Int Mul: 40.72 FP InvSqrt: 0.47 FP Div: 2.10 Conversion Unit: 0.00 ## Core 46 ## Module Utilization FP AddSub: 12.65 FP 
MinMax: 0.00 FP Compare: 4.84 Int AddSub: 20.48 FP Mul: 14.11 Int Mul: 39.79 FP InvSqrt: 0.46 FP Div: 2.06 Conversion Unit: 0.00 ## Core 47 ## Module Utilization FP AddSub: 12.83 FP MinMax: 0.00 FP Compare: 4.89 Int AddSub: 20.73 FP Mul: 14.32 Int Mul: 40.25 FP InvSqrt: 0.46 FP Div: 2.11 Conversion Unit: 0.00 ## Core 48 ## Module Utilization FP AddSub: 13.11 FP MinMax: 0.00 FP Compare: 5.01 Int AddSub: 21.20 FP Mul: 14.64 Int Mul: 41.20 FP InvSqrt: 0.47 FP Div: 2.15 Conversion Unit: 0.00 ## Core 49 ## Module Utilization FP AddSub: 12.52 FP MinMax: 0.00 FP Compare: 4.77 Int AddSub: 20.19 FP Mul: 14.00 Int Mul: 39.20 FP InvSqrt: 0.45 FP Div: 2.07 Conversion Unit: 0.00 ## Core 50 ## Module Utilization FP AddSub: 12.72 FP MinMax: 0.00 FP Compare: 4.88 Int AddSub: 20.62 FP Mul: 14.19 Int Mul: 40.12 FP InvSqrt: 0.46 FP Div: 2.06 Conversion Unit: 0.00 ## Core 51 ## Module Utilization FP AddSub: 12.41 FP MinMax: 0.00 FP Compare: 4.74 Int AddSub: 20.11 FP Mul: 13.85 Int Mul: 39.09 FP InvSqrt: 0.44 FP Div: 2.02 Conversion Unit: 0.00 ## Core 52 ## Module Utilization FP AddSub: 12.46 FP MinMax: 0.00 FP Compare: 4.78 Int AddSub: 20.19 FP Mul: 13.90 Int Mul: 39.27 FP InvSqrt: 0.45 FP Div: 2.02 Conversion Unit: 0.00 ## Core 53 ## Module Utilization FP AddSub: 12.91 FP MinMax: 0.00 FP Compare: 4.95 Int AddSub: 20.93 FP Mul: 14.39 Int Mul: 40.71 FP InvSqrt: 0.47 FP Div: 2.10 Conversion Unit: 0.00 ## Core 54 ## Module Utilization FP AddSub: 12.30 FP MinMax: 0.00 FP Compare: 4.70 Int AddSub: 19.87 FP Mul: 13.73 Int Mul: 38.64 FP InvSqrt: 0.45 FP Div: 2.01 Conversion Unit: 0.00 ## Core 55 ## Module Utilization FP AddSub: 13.06 FP MinMax: 0.00 FP Compare: 4.98 Int AddSub: 21.11 FP Mul: 14.59 Int Mul: 40.97 FP InvSqrt: 0.47 FP Div: 2.15 Conversion Unit: 0.00 ## Core 56 ## Module Utilization FP AddSub: 12.62 FP MinMax: 0.00 FP Compare: 4.84 Int AddSub: 20.50 FP Mul: 14.06 Int Mul: 39.89 FP InvSqrt: 0.46 FP Div: 2.03 Conversion Unit: 0.00 ## Core 57 ## Module Utilization FP AddSub: 12.62 
FP MinMax: 0.00 FP Compare: 4.84 Int AddSub: 20.46 FP Mul: 14.07 Int Mul: 39.80 FP InvSqrt: 0.46 FP Div: 2.06 Conversion Unit: 0.00 ## Core 58 ## Module Utilization FP AddSub: 12.81 FP MinMax: 0.00 FP Compare: 4.90 Int AddSub: 20.76 FP Mul: 14.30 Int Mul: 40.31 FP InvSqrt: 0.46 FP Div: 2.09 Conversion Unit: 0.00 ## Core 59 ## Module Utilization FP AddSub: 12.86 FP MinMax: 0.00 FP Compare: 4.93 Int AddSub: 20.84 FP Mul: 14.35 Int Mul: 40.47 FP InvSqrt: 0.47 FP Div: 2.10 Conversion Unit: 0.00 ## Core 60 ## Module Utilization FP AddSub: 12.68 FP MinMax: 0.00 FP Compare: 4.85 Int AddSub: 20.54 FP Mul: 14.15 Int Mul: 39.90 FP InvSqrt: 0.46 FP Div: 2.07 Conversion Unit: 0.00 ## Core 61 ## Module Utilization FP AddSub: 12.75 FP MinMax: 0.00 FP Compare: 4.88 Int AddSub: 20.63 FP Mul: 14.24 Int Mul: 40.08 FP InvSqrt: 0.46 FP Div: 2.08 Conversion Unit: 0.00 ## Core 62 ## Module Utilization FP AddSub: 12.51 FP MinMax: 0.00 FP Compare: 4.78 Int AddSub: 20.24 FP Mul: 13.97 Int Mul: 39.32 FP InvSqrt: 0.45 FP Div: 2.05 Conversion Unit: 0.00 ## Core 63 ## Module Utilization FP AddSub: 12.83 FP MinMax: 0.00 FP Compare: 4.91 Int AddSub: 20.80 FP Mul: 14.31 Int Mul: 40.39 FP InvSqrt: 0.46 FP Div: 2.09 Conversion Unit: 0.00 ## Core 64 ## Module Utilization FP AddSub: 12.89 FP MinMax: 0.00 FP Compare: 4.94 Int AddSub: 20.88 FP Mul: 14.38 Int Mul: 40.61 FP InvSqrt: 0.47 FP Div: 2.09 Conversion Unit: 0.00 ## Core 65 ## Module Utilization FP AddSub: 12.65 FP MinMax: 0.00 FP Compare: 4.84 Int AddSub: 20.46 FP Mul: 14.12 Int Mul: 39.76 FP InvSqrt: 0.46 FP Div: 2.07 Conversion Unit: 0.00 ## Core 66 ## Module Utilization FP AddSub: 12.83 FP MinMax: 0.00 FP Compare: 4.89 Int AddSub: 20.74 FP Mul: 14.33 Int Mul: 40.29 FP InvSqrt: 0.46 FP Div: 2.11 Conversion Unit: 0.00 ## Core 67 ## Module Utilization FP AddSub: 12.96 FP MinMax: 0.00 FP Compare: 4.95 Int AddSub: 20.96 FP Mul: 14.48 Int Mul: 40.68 FP InvSqrt: 0.46 FP Div: 2.13 Conversion Unit: 0.00 ## Core 68 ## Module Utilization FP AddSub: 
12.66 FP MinMax: 0.00 FP Compare: 4.85 Int AddSub: 20.50 FP Mul: 14.13 Int Mul: 39.82 FP InvSqrt: 0.46 FP Div: 2.07 Conversion Unit: 0.00 ## Core 69 ## Module Utilization FP AddSub: 12.97 FP MinMax: 0.00 FP Compare: 4.97 Int AddSub: 21.02 FP Mul: 14.46 Int Mul: 40.86 FP InvSqrt: 0.46 FP Div: 2.11 Conversion Unit: 0.00 ## Core 70 ## Module Utilization FP AddSub: 12.90 FP MinMax: 0.00 FP Compare: 4.94 Int AddSub: 20.87 FP Mul: 14.40 Int Mul: 40.54 FP InvSqrt: 0.46 FP Div: 2.11 Conversion Unit: 0.00 ## Core 71 ## Module Utilization FP AddSub: 12.97 FP MinMax: 0.00 FP Compare: 4.99 Int AddSub: 21.11 FP Mul: 14.43 Int Mul: 41.07 FP InvSqrt: 0.48 FP Div: 2.10 Conversion Unit: 0.00 ## Core 72 ## Module Utilization FP AddSub: 12.69 FP MinMax: 0.00 FP Compare: 4.85 Int AddSub: 20.49 FP Mul: 14.18 Int Mul: 39.80 FP InvSqrt: 0.46 FP Div: 2.09 Conversion Unit: 0.00 ## Core 73 ## Module Utilization FP AddSub: 12.73 FP MinMax: 0.00 FP Compare: 4.87 Int AddSub: 20.65 FP Mul: 14.20 Int Mul: 40.16 FP InvSqrt: 0.47 FP Div: 2.07 Conversion Unit: 0.00 ## Core 74 ## Module Utilization FP AddSub: 12.95 FP MinMax: 0.00 FP Compare: 4.97 Int AddSub: 21.03 FP Mul: 14.42 Int Mul: 40.94 FP InvSqrt: 0.48 FP Div: 2.10 Conversion Unit: 0.00 ## Core 75 ## Module Utilization FP AddSub: 12.68 FP MinMax: 0.00 FP Compare: 4.86 Int AddSub: 20.54 FP Mul: 14.15 Int Mul: 39.92 FP InvSqrt: 0.45 FP Div: 2.05 Conversion Unit: 0.00 ## Core 76 ## Module Utilization FP AddSub: 12.37 FP MinMax: 0.00 FP Compare: 4.74 Int AddSub: 20.13 FP Mul: 13.78 Int Mul: 39.14 FP InvSqrt: 0.45 FP Div: 1.99 Conversion Unit: 0.00 ## Core 77 ## Module Utilization FP AddSub: 12.98 FP MinMax: 0.00 FP Compare: 4.97 Int AddSub: 21.07 FP Mul: 14.47 Int Mul: 40.96 FP InvSqrt: 0.47 FP Div: 2.11 Conversion Unit: 0.00 ## Core 78 ## Module Utilization FP AddSub: 12.68 FP MinMax: 0.00 FP Compare: 4.86 Int AddSub: 20.60 FP Mul: 14.13 Int Mul: 40.08 FP InvSqrt: 0.46 FP Div: 2.05 Conversion Unit: 0.00 ## Core 79 ## Module Utilization FP 
AddSub: 13.28 FP MinMax: 0.00 FP Compare: 5.08 Int AddSub: 21.47 FP Mul: 14.82 Int Mul: 41.75 FP InvSqrt: 0.48 FP Div: 2.17 Conversion Unit: 0.00

L1 accesses: 857084635
L1 hits: 779144011
L1 misses: 77940624
L1 bank conflicts: 148194811
L1 stores: 49152
L1 near hit: 0
L1 hit rate: 0.909063

-= L2 #0 =-
L2 accesses: 19450334
L2 hits: 16707566
L2 misses: 2742768
L2 stores: 12207
L2 bank conflicts: 4383284
L2 hit rate: 0.858986
L2 memory faults: 37342
L2 bandwidth limited stalls: 4570735

-= L2 #1 =-
L2 accesses: 19382246
L2 hits: 16651857
L2 misses: 2730389
L2 stores: 12270
L2 bank conflicts: 4341536
L2 hit rate: 0.859129
L2 memory faults: 37952
L2 bandwidth limited stalls: 4480728

-= L2 #2 =-
L2 accesses: 19472644
L2 hits: 16713752
L2 misses: 2758892
L2 stores: 12300
L2 bank conflicts: 4402631
L2 hit rate: 0.858320
L2 memory faults: 38422
L2 bandwidth limited stalls: 4682839

-= L2 #3 =-
L2 accesses: 19483636
L2 hits: 16732249
L2 misses: 2751387
L2 stores: 12375
L2 bank conflicts: 4371693
L2 hit rate: 0.858785
L2 memory faults: 38048
L2 bandwidth limited stalls: 4591452

Bandwidth numbers for 1000MHz clock:
   register to L1 bandwidth: 369223581795.843380
   L1 to L2 bandwidth: 1072341516622.467700
   L2 to memory bandwidth: 151409783071.326780

Core size: 0.9818
L2 size: 0.0000
4-L2 size: 0.0000
80-core chip size: 78.5458

FPS Statistics:
   FPS assuming 1000MHz clock: 107.6975
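
The derived figures in the summary above can be cross-checked against the raw counters. The following is a minimal sketch, not part of the simulator: the counter values are copied by hand from this log, the helper names (`hit_rate`, `consistent`) are hypothetical, and the reading of per-core IPC as issued thread-cycles divided by total cycles is an assumption inferred from the Core 79 numbers rather than from any simulator documentation.

```python
# Counters copied by hand from the cache summary at the end of the log.
L1 = {"accesses": 857084635, "hits": 779144011, "misses": 77940624}
L2 = [
    {"accesses": 19450334, "hits": 16707566, "misses": 2742768},  # L2 #0
    {"accesses": 19382246, "hits": 16651857, "misses": 2730389},  # L2 #1
    {"accesses": 19472644, "hits": 16713752, "misses": 2758892},  # L2 #2
    {"accesses": 19483636, "hits": 16732249, "misses": 2751387},  # L2 #3
]


def hit_rate(counters):
    """Hit rate as hits / accesses (matches the 'hit rate' lines in the log)."""
    return counters["hits"] / counters["accesses"]


def consistent(counters):
    """True when hits + misses adds up to the reported access count."""
    return counters["hits"] + counters["misses"] == counters["accesses"]


# Core 79 counters, also copied from the log.
CORE79_CYCLES = 8734944
CORE79_ISSUED = 197929598          # "total thread-cycles issued"
CORE79_THREAD_CYCLES = 279518208   # "Total thread-cycles"
THREADS_PER_CORE = 32              # from --num-thread-procs 32

# Assumption: IPC is issued thread-cycles per cycle, and the issue
# percentage is issued thread-cycles over all thread-cycles.
core79_ipc = CORE79_ISSUED / CORE79_CYCLES
core79_issue_pct = 100.0 * CORE79_ISSUED / CORE79_THREAD_CYCLES

print(f"L1 hit rate: {hit_rate(L1):.6f}")
for i, c in enumerate(L2):
    print(f"L2 #{i} hit rate: {hit_rate(c):.6f}")
print(f"Core 79 IPC: {core79_ipc:.4f}, issued: {core79_issue_pct:.2f}%")
```

Comparing the printed values with the `hit rate`, `IPC`, and `total thread-cycles issued` lines in the log is a quick way to catch copy or truncation errors when the log is post-processed.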