--load-assembly ../../llvm_trax/examples/project4_noInh/project4_noInh_rt-llvm.s --config-file ../trunk/configs/default.config --model ../trunk/test_models/conference.obj --view-file ../trunk/views/conference.view --light-file ../trunk/lights/conference.light --num-cores 20 --num-thread-procs 32 --num-l2s 4 --num-icaches 2 --num-icache-banks 16 Loading core 0. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 1. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 2. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 3. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 4. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 5. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 6. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 7. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 8. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 9. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 10. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 11. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 12. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 13. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 14. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 15. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 16. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 17. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 18. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 19. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 20. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 21. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 22. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 23. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 24. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 25. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 26. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 27. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 28. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 29. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 30. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 31. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 32. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 33. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 34. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 35. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 36. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 37. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 38. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 39. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 40. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 41. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 42. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 43. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 44. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 45. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 46. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 47. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 48. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 49. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 50. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 51. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 52. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 53. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 54. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 55. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 56. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 57. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 58. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 59. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 60. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 61. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 62. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 63. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 64. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 65. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 66. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 67. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 68. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 69. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 70. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 71. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 72. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 73. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 74. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 75. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 76. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 77. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 78. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 79. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Center is 1.5233 1.618 1.7711 Corner is 1.9101 3.20911 0.248412 Across is 1.25037 -1.56095 0 Up is 0.523201 0.419102 1.88431 U is 0.360963 -0.450621 0 V is 0.15104 0.120988 0.543969 radius is 0 Work queue starts at 30 (0x0000001e) FB starts at 32 (0x00000020) FB ends at 49183 (0x0000c01f) loading model ../trunk/test_models/conference.obj MTL file: "../trunk/test_models/conference.mtl" loading material file ../trunk/test_models/conference.mtl Found 43 total materials Found 282664 total triangles vertex min/max = x: (-0.177790, 11.125200) y: (-0.164592, 7.010400) z: (-0.005078, 2.712720) Materials start at 49184 (0x0000c020) Materials end at 50284 (0x0000c46c) Starting BVH build. BVH build complete with 265647 nodes. Scene starts at 50285 (0x0000c46d) BVH bounds [-0.177790 -0.164592 -0.005078] [11.125200 7.010400 2.712720] Triangles start at 2175464 (0x002131e8) Scene ends at 11316197 (0x00acabe5) Starting camera at 11316198 (0x00acabe6) Camera ended at 11316220 (0x00acabfc) Background Color 0x00acabfd to 0x00acabff Light at 0x00acac00 to 0x00acac02 Permutation table from 0x00acac03 to 0x00acae02 Hammersley table from 0x00acae03 to 0x00acb002 Memory used: 11317251 (0x00acb003) Image size: 49152 start_wq: 30 start_fb: 32 start_scene: 50288 start_camera: 11316198 start_matls: 49184 start_bg_color: 11316221 start_light: 11316224 start_permutation: 11316227 Loading assembly file ../../llvm_trax/examples/project4_noInh/project4_noInh_rt-llvm.s using 36 registers Number of instructions: 1231 Creating thread 0... Creating thread 1... Creating thread 2... Creating thread 3... Core 0 running... Creating thread 4... Core 1 running... Creating thread 5... Creating thread 6... Core 3 running... Core 2 running... Creating thread 7... Creating thread 8... Creating thread 9... Creating thread 10... Creating thread 11... Creating thread 12... Creating thread 13... Creating thread 14... Creating thread 15... Creating thread 16... Creating thread 17... Core 4 running... Creating thread 18... Core 5 running... Creating thread 19... Core 9 running... Creating thread 20... Core 13 running... Creating thread 21... Creating thread 22... Core 6 running... Creating thread 23... Core 17 running... Creating thread 24... Core 21 running... Creating thread 25... Creating thread 26... Core 7 running... Creating thread 27... Core 25 running... Creating thread 28... Creating thread 29... Core 8 running... Creating thread 30... Creating thread 31... Core 10 running... Creating thread 32... Core 29 running... Core 11 running... Creating thread 33... Core 12 running... Creating thread 34... Core 14 running... Creating thread 35... Core 33 running... Creating thread 36... Core 15 running... Creating thread 37... Core 16 running... Creating thread 38... Creating thread 39... Core 37 running... Creating thread 40... Core 18 running... Creating thread 41... Core 19 running... Creating thread 42... Creating thread 43... Core 41 running... Creating thread 44... Core 20 running... Creating thread 45... Core 22 running... Creating thread 46... Core 45 running... Creating thread 47... Creating thread 48... Core 23 running... Creating thread 49... Creating thread 50... Core 24 running... Creating thread 51... Core 49 running... Creating thread 52... Core 26 running... Creating thread 53... Core 27 running... Creating thread 54... Creating thread 55... Core 53 running... Creating thread 56... Core 28 running... Creating thread 57... Creating thread 58... Core 30 running... Creating thread 59... Core 57 running... Creating thread 60... Core 31 running... Creating thread 61... Creating thread 62... Core 32 running... Creating thread 63... Core 61 running... Creating thread 64... Core 34 running... Creating thread 65... Creating thread 66... Core 35 running... Creating thread 67... Core 65 running... Creating thread 68... Core 36 running... Creating thread 69... Creating thread 70... Core 38 running... Creating thread 71... Core 69 running... Creating thread 72... Core 39 running... Creating thread 73... Core 40 running... Creating thread 74... Creating thread 75... Core 42 running... Creating thread 76... Core 73 running... Creating thread 77... Core 43 running... Creating thread 78... Creating thread 79... Core 44 running... Core 77 running... Core 46 running... Core 50 running... Core 48 running... Core 47 running... Core 51 running... Core 52 running... Core 54 running... Core 55 running... Core 56 running... Core 58 running... Core 59 running... Core 60 running... Core 62 running... Core 63 running... Core 64 running... Core 66 running... Core 67 running... Core 70 running... Core 68 running... Core 71 running... Core 72 running... Core 75 running... Core 74 running... Core 76 running... Core 78 running... Core 79 running... <=== Core 0 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101424 in-flight CPI 1.6114 -- Total Cycles 163466 ---- Thread 01 ---- PC 5: Stalled ----- 99950 in-flight CPI 1.6352 -- Total Cycles 163466 ---- Thread 02 ---- PC 5: Stalled ----- 94134 in-flight CPI 1.7362 -- Total Cycles 163466 ---- Thread 03 ---- PC 5: Stalled ----- 116814 in-flight CPI 1.3992 -- Total Cycles 163466 ---- Thread 04 ---- PC 5: Stalled ----- 97132 in-flight CPI 1.6826 -- Total Cycles 163466 ---- Thread 05 ---- PC 5: Stalled ----- 96402 in-flight CPI 1.6954 -- Total Cycles 163466 ---- Thread 06 ---- PC 5: Stalled ----- 96024 in-flight CPI 1.7021 -- Total Cycles 163466 ---- Thread 07 ---- PC 5: Stalled ----- 96166 in-flight CPI 1.6995 -- Total Cycles 163466 ---- Thread 08 ---- PC 5: Stalled ----- 94357 in-flight CPI 1.7321 -- Total Cycles 163466 ---- Thread 09 ---- PC 5: Stalled ----- 94680 in-flight CPI 1.7262 -- Total Cycles 163466 ---- Thread 10 ---- PC 5: Stalled ----- 100409 in-flight CPI 1.6277 -- Total Cycles 163466 ---- Thread 11 ---- PC 5: Stalled ----- 99552 in-flight CPI 1.6417 -- Total Cycles 163466 ---- Thread 12 ---- PC 5: Stalled ----- 97916 in-flight CPI 1.6691 -- Total Cycles 163466 ---- Thread 13 ---- PC 5: Stalled ----- 93026 in-flight CPI 1.7569 -- Total Cycles 163466 ---- Thread 14 ---- PC 5: Stalled ----- 93397 in-flight CPI 1.7499 -- Total Cycles 163466 ---- Thread 15 ---- PC 5: Stalled ----- 101466 in-flight CPI 1.6107 -- Total Cycles 163466 ---- Thread 16 ---- PC 5: Stalled ----- 94850 in-flight CPI 1.7231 -- Total Cycles 163466 ---- Thread 17 ---- PC 5: Stalled ----- 97741 in-flight CPI 1.6722 -- Total Cycles 163466 ---- Thread 18 ---- PC 5: Stalled ----- 89098 in-flight CPI 1.8344 -- Total Cycles 163466 ---- Thread 19 ---- PC 5: Stalled ----- 96803 in-flight CPI 1.6884 -- Total Cycles 163466 ---- Thread 20 ---- PC 5: Stalled ----- 92618 in-flight CPI 1.7646 -- Total Cycles 163466 ---- Thread 21 ---- PC 5: Stalled ----- 96166 in-flight CPI 1.6995 -- Total Cycles 163466 ---- Thread 22 ---- PC 5: Stalled ----- 87477 in-flight CPI 1.8684 -- Total Cycles 163466 ---- Thread 23 ---- PC 5: Stalled ----- 92052 in-flight CPI 1.7755 -- Total Cycles 163466 ---- Thread 24 ---- PC 5: Stalled ----- 87326 in-flight CPI 1.8716 -- Total Cycles 163466 ---- Thread 25 ---- PC 5: Stalled ----- 95358 in-flight CPI 1.7139 -- Total Cycles 163466 ---- Thread 26 ---- PC 5: Stalled ----- 88111 in-flight CPI 1.8549 -- Total Cycles 163466 ---- Thread 27 ---- PC 5: Stalled ----- 91178 in-flight CPI 1.7924 -- Total Cycles 163466 ---- Thread 28 ---- PC 5: Stalled ----- 85937 in-flight CPI 1.9019 -- Total Cycles 163466 ---- Thread 29 ---- PC 5: Stalled ----- 91451 in-flight CPI 1.7871 -- Total Cycles 163466 ---- Thread 30 ---- PC 5: Stalled ----- 89590 in-flight CPI 1.8243 -- Total Cycles 163466 ---- Thread 31 ---- PC 5: Stalled ----- 85582 in-flight CPI 1.9097 -- Total Cycles 163466 Total CPI 0.0539 , IPC 18.5649 -- Total Cycles 163466 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8225 (3.787356%) FPSUB: 0 (0.000000%) FPMUL: 32600 (15.011281%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 87996 (40.519409%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5428 (2.499424%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75356 (34.699084%) DIV: 7308 (3.365106%) FPUN: 0 (0.000000%) FPRSUB: 257 (0.118340%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3327340 total) ADD%: 7.509 (249857) SUB%: 0.000 (0) MUL%: 0.006 (198) BITOR%: 1.540 (51239) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.579 (19250) FPSUB%: 0.000 (0) FPMUL%: 4.869 (162020) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (594) FPMAX%: 0.018 (594) LOAD%: 5.217 (173592) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (230) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (570) FPINV%: 0.000 (0) FPCONV%: 0.019 (626) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.077 (35821) FPLE%: 0.461 (15351) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (594) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.800 (93163) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.763 (25372) CMPU%: 0.000 (0) RSUB%: 0.006 (198) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.757 (524300) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.180 (39247) ORI%: 1.584 (52691) XORI%: 0.000 (0) MULI%: 3.196 (106346) LW%: 1.129 (37582) LWI%: 13.460 (447876) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9567) SWI%: 4.051 (134788) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.398 (46520) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10360) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1952) bned%: 0.000 (0) bneid%: 13.853 (460923) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23840) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.129 (4298) DIV%: 0.012 (396) FPUN%: 1.484 (49376) FPRSUB%: 3.704 (123251) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.946 (98008) FPGE%: 1.023 (34025) SYNC%: 0.000 (0) NOP%: 8.793 (292559) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 8 FPSUB 0 FPMUL 76 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 40903 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2264 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48984 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 11765 XORI 0 MULI 9342 LW 0 LWI 142044 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 20 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 18.5651 --Total thread-cycles: 5230912 --total thread-cycles issued: 3034781 (58.016289%) --iCache conflicts: 111841 (2.138078%) --thread*cycles of FU dependence: 255984 (4.893678%) --thread*cycles of data dependence: 217170 (4.151666%) --iCache cycles*banks: 5230912 (63.609787% used) Issue breakdown: --thread*cycles of issue worked: 3034781 (58.016289%) --thread*cycles of issue failed: 1903572 (36.390824%) --thread*cycles of issue NOP/other: 292559 (5.592887%) Number of thread-cycles not ready: 217170 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3327340 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 8 3: 5 4: 8 5: 7 6: 7 7: 7 8: 7 9: 8 10: 8 11: 8 12: 9 13: 7 14: 7 15: 9 16: 7 17: 7 18: 6 19: 7 20: 7 21: 7 22: 6 23: 6 24: 6 25: 7 26: 7 27: 8 28: 6 29: 7 30: 7 31: 7 <=== Core 1 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98552 in-flight CPI 1.3643 -- Total Cycles 134477 ---- Thread 01 ---- PC 5: Stalled ----- 99086 in-flight CPI 1.3569 -- Total Cycles 134477 ---- Thread 02 ---- PC 5: Stalled ----- 95009 in-flight CPI 1.4151 -- Total Cycles 134477 ---- Thread 03 ---- PC 5: Stalled ----- 94003 in-flight CPI 1.4303 -- Total Cycles 134477 ---- Thread 04 ---- PC 5: Stalled ----- 97898 in-flight CPI 1.3735 -- Total Cycles 134477 ---- Thread 05 ---- PC 5: Stalled ----- 98734 in-flight CPI 1.3617 -- Total Cycles 134477 ---- Thread 06 ---- PC 5: Stalled ----- 96225 in-flight CPI 1.3972 -- Total Cycles 134477 ---- Thread 07 ---- PC 5: Stalled ----- 98127 in-flight CPI 1.3701 -- Total Cycles 134477 ---- Thread 08 ---- PC 5: Stalled ----- 100386 in-flight CPI 1.3393 -- Total Cycles 134477 ---- Thread 09 ---- PC 5: Stalled ----- 102641 in-flight CPI 1.3099 -- Total Cycles 134477 ---- Thread 10 ---- PC 5: Stalled ----- 101597 in-flight CPI 1.3234 -- Total Cycles 134477 ---- Thread 11 ---- PC 5: Stalled ----- 95626 in-flight CPI 1.4060 -- Total Cycles 134477 ---- Thread 12 ---- PC 5: Stalled ----- 92289 in-flight CPI 1.4568 -- Total Cycles 134477 ---- Thread 13 ---- PC 5: Stalled ----- 99684 in-flight CPI 1.3488 -- Total Cycles 134477 ---- Thread 14 ---- PC 5: Stalled ----- 97046 in-flight CPI 1.3855 -- Total Cycles 134477 ---- Thread 15 ---- PC 5: Stalled ----- 96874 in-flight CPI 1.3879 -- Total Cycles 134477 ---- Thread 16 ---- PC 5: Stalled ----- 98627 in-flight CPI 1.3632 -- Total Cycles 134477 ---- Thread 17 ---- PC 5: Stalled ----- 96242 in-flight CPI 1.3970 -- Total Cycles 134477 ---- Thread 18 ---- PC 5: Stalled ----- 91723 in-flight CPI 1.4658 -- Total Cycles 134477 ---- Thread 19 ---- PC 5: Stalled ----- 91767 in-flight CPI 1.4651 -- Total Cycles 134477 ---- Thread 20 ---- PC 5: Stalled ----- 96905 in-flight CPI 1.3874 -- Total Cycles 134477 ---- Thread 21 ---- PC 5: Stalled ----- 90417 in-flight CPI 1.4870 -- Total Cycles 134477 ---- Thread 22 ---- PC 5: Stalled ----- 92591 in-flight CPI 1.4522 -- Total Cycles 134477 ---- Thread 23 ---- PC 5: Stalled ----- 92254 in-flight CPI 1.4575 -- Total Cycles 134477 ---- Thread 24 ---- PC 5: Stalled ----- 95043 in-flight CPI 1.4147 -- Total Cycles 134477 ---- Thread 25 ---- PC 5: Stalled ----- 91041 in-flight CPI 1.4768 -- Total Cycles 134477 ---- Thread 26 ---- PC 5: Stalled ----- 93639 in-flight CPI 1.4359 -- Total Cycles 134477 ---- Thread 27 ---- PC 5: Stalled ----- 89968 in-flight CPI 1.4944 -- Total Cycles 134477 ---- Thread 28 ---- PC 5: Stalled ----- 92547 in-flight CPI 1.4528 -- Total Cycles 134477 ---- Thread 29 ---- PC 5: Stalled ----- 88412 in-flight CPI 1.5207 -- Total Cycles 134477 ---- Thread 30 ---- PC 5: Stalled ----- 89406 in-flight CPI 1.5038 -- Total Cycles 134477 ---- Thread 31 ---- PC 5: Stalled ----- 88437 in-flight CPI 1.5204 -- Total Cycles 134477 Total CPI 0.0442 , IPC 22.6311 -- Total Cycles 134477 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7461 (3.482169%) FPSUB: 0 (0.000000%) FPMUL: 31208 (14.565277%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 93267 (43.529214%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5604 (2.615477%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68813 (32.116138%) DIV: 7644 (3.567578%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.124146%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3337061 total) ADD%: 7.514 (250734) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.531 (51081) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.529 (17665) FPSUB%: 0.000 (0) FPMUL%: 4.727 (157759) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.141 (171548) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (589) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.059 (35349) FPLE%: 0.457 (15263) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.836 (94652) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.737 (24582) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.770 (526257) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39313) ORI%: 1.559 (52020) XORI%: 0.000 (0) MULI%: 3.235 (107954) LW%: 1.144 (38192) LWI%: 13.596 (453697) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.291 (9712) SWI%: 4.104 (136946) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.417 (47273) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.313 (10451) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1980) bned%: 0.000 (0) bneid%: 13.877 (463082) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.726 (24224) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.117 (3907) DIV%: 0.012 (414) FPUN%: 1.491 (49753) FPRSUB%: 3.661 (122171) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.967 (99027) FPGE%: 1.034 (34490) SYNC%: 0.000 (0) NOP%: 8.799 (293644) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 66 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 38675 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 11 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1432 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49519 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 10605 XORI 0 MULI 9983 LW 0 LWI 143347 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 61 DIV 16 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.6313 --Total thread-cycles: 4303264 --total thread-cycles issued: 3043417 (70.723455%) --iCache conflicts: 112478 (2.613783%) --thread*cycles of FU dependence: 254223 (5.907678%) --thread*cycles of data dependence: 214263 (4.979081%) --iCache cycles*banks: 4303264 (77.547950% used) Issue breakdown: --thread*cycles of issue worked: 3043417 (70.723455%) --thread*cycles of issue failed: 966203 (22.452794%) --thread*cycles of issue NOP/other: 293644 (6.823751%) Number of thread-cycles not ready: 214263 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3337061 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 8 2: 8 3: 7 4: 4 5: 9 6: 8 7: 9 8: 9 9: 9 10: 7 11: 8 12: 8 13: 8 14: 7 15: 7 16: 8 17: 8 18: 8 19: 9 20: 8 21: 7 22: 6 23: 6 24: 7 25: 7 26: 7 27: 8 28: 8 29: 7 30: 7 31: 6 <=== Core 2 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98490 in-flight CPI 1.2978 -- Total Cycles 127850 ---- Thread 01 ---- PC 5: Stalled ----- 96121 in-flight CPI 1.3298 -- Total Cycles 127850 ---- Thread 02 ---- PC 5: Stalled ----- 98988 in-flight CPI 1.2913 -- Total Cycles 127850 ---- Thread 03 ---- PC 5: Stalled ----- 102024 in-flight CPI 1.2528 -- Total Cycles 127850 ---- Thread 04 ---- PC 5: Stalled ----- 97155 in-flight CPI 1.3157 -- Total Cycles 127850 ---- Thread 05 ---- PC 5: Stalled ----- 101166 in-flight CPI 1.2635 -- Total Cycles 127850 ---- Thread 06 ---- PC 5: Stalled ----- 96666 in-flight CPI 1.3223 -- Total Cycles 127850 ---- Thread 07 ---- PC 5: Stalled ----- 99656 in-flight CPI 1.2827 -- Total Cycles 127850 ---- Thread 08 ---- PC 5: Stalled ----- 93999 in-flight CPI 1.3598 -- Total Cycles 127850 ---- Thread 09 ---- PC 5: Stalled ----- 97726 in-flight CPI 1.3080 -- Total Cycles 127850 ---- Thread 10 ---- PC 5: Stalled ----- 93182 in-flight CPI 1.3718 -- Total Cycles 127850 ---- Thread 11 ---- PC 5: Stalled ----- 97795 in-flight CPI 1.3071 -- Total Cycles 127850 ---- Thread 12 ---- PC 5: Stalled ----- 97404 in-flight CPI 1.3123 -- Total Cycles 127850 ---- Thread 13 ---- PC 5: Stalled ----- 93217 in-flight CPI 1.3713 -- Total Cycles 127850 ---- Thread 14 ---- PC 5: Stalled ----- 99447 in-flight CPI 1.2854 -- Total Cycles 127850 ---- Thread 15 ---- PC 5: Stalled ----- 99537 in-flight CPI 1.2842 -- Total Cycles 127850 ---- Thread 16 ---- PC 5: Stalled ----- 95004 in-flight CPI 1.3455 -- Total Cycles 127850 ---- Thread 17 ---- PC 5: Stalled ----- 95926 in-flight CPI 1.3325 -- Total Cycles 127850 ---- Thread 18 ---- PC 5: Stalled ----- 97356 in-flight CPI 1.3130 -- Total Cycles 127850 ---- Thread 19 ---- PC 5: Stalled ----- 96323 in-flight CPI 1.3271 -- Total Cycles 127850 ---- Thread 20 ---- PC 5: Stalled ----- 92244 in-flight CPI 1.3857 -- Total Cycles 127850 ---- Thread 21 ---- PC 5: Stalled ----- 86258 in-flight CPI 1.4820 -- Total Cycles 127850 ---- Thread 22 ---- PC 5: Stalled ----- 91384 in-flight CPI 1.3988 -- Total Cycles 127850 ---- Thread 23 ---- PC 5: Stalled ----- 93628 in-flight CPI 1.3653 -- Total Cycles 127850 ---- Thread 24 ---- PC 5: Stalled ----- 89971 in-flight CPI 1.4207 -- Total Cycles 127850 ---- Thread 25 ---- PC 5: Stalled ----- 95560 in-flight CPI 1.3376 -- Total Cycles 127850 ---- Thread 26 ---- PC 5: Stalled ----- 87883 in-flight CPI 1.4546 -- Total Cycles 127850 ---- Thread 27 ---- PC 5: Stalled ----- 90873 in-flight CPI 1.4067 -- Total Cycles 127850 ---- Thread 28 ---- PC 5: Stalled ----- 87924 in-flight CPI 1.4539 -- Total Cycles 127850 ---- Thread 29 ---- PC 5: Stalled ----- 88036 in-flight CPI 1.4521 -- Total Cycles 127850 ---- Thread 30 ---- PC 5: Stalled ----- 90761 in-flight CPI 1.4084 -- Total Cycles 127850 ---- Thread 31 ---- PC 5: Stalled ----- 94350 in-flight CPI 1.3547 -- Total Cycles 127850 Total CPI 0.0421 , IPC 23.7515 -- Total Cycles 127850 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7015 (3.887547%) FPSUB: 0 (0.000000%) FPMUL: 30276 (16.778241%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 63798 (35.355338%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5945 (3.294578%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 65435 (36.262524%) DIV: 7712 (4.273807%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.147965%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3329408 total) ADD%: 7.533 (250804) SUB%: 0.000 (0) MUL%: 0.006 (209) BITOR%: 1.537 (51163) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.503 (16743) FPSUB%: 0.000 (0) FPMUL%: 4.646 (154697) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (627) FPMAX%: 0.019 (627) LOAD%: 5.103 (169907) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (609) FPINV%: 0.000 (0) FPCONV%: 0.020 (659) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.048 (34902) FPLE%: 0.459 (15267) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (627) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.853 (94984) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.731 (24339) CMPU%: 0.000 (0) RSUB%: 0.006 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.784 (525519) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.183 (39402) ORI%: 1.539 (51228) XORI%: 0.000 (0) MULI%: 3.255 (108372) LW%: 1.151 (38328) LWI%: 13.653 (454567) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.291 (9691) SWI%: 4.123 (137276) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.427 (47510) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10388) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1785) bned%: 0.000 (0) bneid%: 13.898 (462724) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.731 (24324) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.111 (3707) DIV%: 0.013 (418) FPUN%: 1.498 (49880) FPRSUB%: 3.637 (121099) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (81) FPGT%: 2.975 (99059) FPGE%: 1.040 (34613) SYNC%: 0.000 (0) NOP%: 8.792 (292727) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 42 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 43 FPCMPLT 0 FPMIN 0 FPMAX 414 LOAD 37854 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1581 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49668 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 18 ORI 9891 XORI 0 MULI 10579 LW 0 LWI 143528 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 16 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7517 --Total thread-cycles: 4091200 --total thread-cycles issued: 3036681 (74.224702%) --iCache conflicts: 113593 (2.776520%) --thread*cycles of FU dependence: 253760 (6.202581%) --thread*cycles of data dependence: 180448 (4.410637%) --iCache cycles*banks: 4091200 (81.380524% used) Issue breakdown: --thread*cycles of issue worked: 3036681 (74.224702%) --thread*cycles of issue failed: 761792 (18.620258%) --thread*cycles of issue NOP/other: 292727 (7.155040%) Number of thread-cycles not ready: 180448 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3329408 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 9 3: 10 4: 8 5: 8 6: 8 7: 7 8: 8 9: 7 10: 7 11: 8 12: 8 13: 6 14: 8 15: 8 16: 7 17: 8 18: 8 19: 7 20: 8 21: 5 22: 7 23: 7 24: 8 25: 9 26: 6 27: 7 28: 6 29: 5 30: 7 31: 9 <=== Core 3 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96948 in-flight CPI 1.4718 -- Total Cycles 142720 ---- Thread 01 ---- PC 5: Stalled ----- 97780 in-flight CPI 1.4593 -- Total Cycles 142720 ---- Thread 02 ---- PC 5: Stalled ----- 98459 in-flight CPI 1.4493 -- Total Cycles 142720 ---- Thread 03 ---- PC 5: Stalled ----- 95353 in-flight CPI 1.4965 -- Total Cycles 142720 ---- Thread 04 ---- PC 5: Stalled ----- 90551 in-flight CPI 1.5759 -- Total Cycles 142720 ---- Thread 05 ---- PC 5: Stalled ----- 96878 in-flight CPI 1.4729 -- Total Cycles 142720 ---- Thread 06 ---- PC 5: Stalled ----- 98101 in-flight CPI 1.4545 -- Total Cycles 142720 ---- Thread 07 ---- PC 5: Stalled ----- 96752 in-flight CPI 1.4749 -- Total Cycles 142720 ---- Thread 08 ---- PC 5: Stalled ----- 99033 in-flight CPI 1.4409 -- Total Cycles 142720 ---- Thread 09 ---- PC 5: Stalled ----- 99383 in-flight CPI 1.4358 -- Total Cycles 142720 ---- Thread 10 ---- PC 5: Stalled ----- 93652 in-flight CPI 1.5237 -- Total Cycles 142720 ---- Thread 11 ---- PC 5: Stalled ----- 96964 in-flight CPI 1.4716 -- Total Cycles 142720 ---- Thread 12 ---- PC 5: Stalled ----- 100999 in-flight CPI 1.4128 -- Total Cycles 142720 ---- Thread 13 ---- PC 5: Stalled ----- 96486 in-flight CPI 1.4789 -- Total Cycles 142720 ---- Thread 14 ---- PC 5: Stalled ----- 98298 in-flight CPI 1.4516 -- Total Cycles 142720 ---- Thread 15 ---- PC 5: Stalled ----- 98942 in-flight CPI 1.4421 -- Total Cycles 142720 ---- Thread 16 ---- PC 5: Stalled ----- 98200 in-flight CPI 1.4531 -- Total Cycles 142720 ---- Thread 17 ---- PC 5: Stalled ----- 96202 in-flight CPI 1.4832 -- Total Cycles 142720 ---- Thread 18 ---- PC 5: Stalled ----- 90229 in-flight CPI 1.5815 -- Total Cycles 142720 ---- Thread 19 ---- PC 5: Stalled ----- 97134 in-flight CPI 1.4690 -- Total Cycles 142720 ---- Thread 20 ---- PC 5: Stalled ----- 91428 in-flight CPI 1.5607 -- Total Cycles 142720 ---- Thread 21 ---- PC 5: Stalled ----- 99497 in-flight CPI 1.4341 -- Total Cycles 142720 ---- Thread 22 ---- PC 5: Stalled ----- 94375 in-flight CPI 1.5120 -- Total Cycles 142720 ---- Thread 23 ---- PC 5: Stalled ----- 93480 in-flight CPI 1.5265 -- Total Cycles 142720 ---- Thread 24 ---- PC 5: Stalled ----- 93601 in-flight CPI 1.5245 -- Total Cycles 142720 ---- Thread 25 ---- PC 5: Stalled ----- 91094 in-flight CPI 1.5665 -- Total Cycles 142720 ---- Thread 26 ---- PC 5: Stalled ----- 87309 in-flight CPI 1.6343 -- Total Cycles 142720 ---- Thread 27 ---- PC 5: Stalled ----- 88514 in-flight CPI 1.6122 -- Total Cycles 142720 ---- Thread 28 ---- PC 5: Stalled ----- 90638 in-flight CPI 1.5743 -- Total Cycles 142720 ---- Thread 29 ---- PC 5: Stalled ----- 83422 in-flight CPI 1.7105 -- Total Cycles 142720 ---- Thread 30 ---- PC 5: Stalled ----- 86836 in-flight CPI 1.6432 -- Total Cycles 142720 ---- Thread 31 ---- PC 5: Stalled ----- 95044 in-flight CPI 1.5014 -- Total Cycles 142720 Total CPI 0.0471 , IPC 21.2454 -- Total Cycles 142720 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7883 (3.684265%) FPSUB: 0 (0.000000%) FPMUL: 31997 (14.954385%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 87834 (41.050831%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5473 (2.557907%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72906 (34.073956%) DIV: 7604 (3.553869%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.124787%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3324996 total) ADD%: 7.418 (246635) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.536 (51070) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.562 (18687) FPSUB%: 0.000 (0) FPMUL%: 4.821 (160291) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.190 (172561) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (581) FPINV%: 0.000 (0) FPCONV%: 0.020 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.072 (35629) FPLE%: 0.457 (15211) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.821 (93791) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24831) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.767 (524263) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39127) ORI%: 1.584 (52669) XORI%: 0.000 (0) MULI%: 3.216 (106918) LW%: 1.138 (37846) LWI%: 13.525 (449705) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9632) SWI%: 4.080 (135674) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.408 (46830) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10384) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2077) bned%: 0.000 (0) bneid%: 13.868 (461121) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.722 (24019) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4149) DIV%: 0.012 (412) FPUN%: 1.491 (49583) FPRSUB%: 3.688 (122613) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.953 (98202) FPGE%: 1.034 (34372) SYNC%: 0.000 (0) NOP%: 8.806 (292796) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 59 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 39780 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1412 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49019 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11273 XORI 0 MULI 9351 LW 0 LWI 142341 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 31 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.2456 --Total thread-cycles: 4567040 --total thread-cycles issued: 3032200 (66.393112%) --iCache conflicts: 113366 (2.482264%) --thread*cycles of FU dependence: 253806 (5.557341%) --thread*cycles of data dependence: 213964 (4.684960%) --iCache cycles*banks: 4567040 (72.804880% used) Issue breakdown: --thread*cycles of issue worked: 3032200 (66.393112%) --thread*cycles of issue failed: 1242044 (27.195820%) --thread*cycles of issue NOP/other: 292796 (6.411067%) Number of thread-cycles not ready: 213964 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3324996 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 6 5: 8 6: 8 7: 7 8: 7 9: 8 10: 7 11: 7 12: 9 13: 7 14: 8 15: 9 16: 8 17: 8 18: 7 19: 8 20: 8 21: 8 22: 7 23: 6 24: 8 25: 7 26: 7 27: 6 28: 7 29: 7 30: 7 31: 6 <=== Core 4 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94985 in-flight CPI 1.3482 -- Total Cycles 128083 ---- Thread 01 ---- PC 5: Stalled ----- 95314 in-flight CPI 1.3435 -- Total Cycles 128083 ---- Thread 02 ---- PC 5: Stalled ----- 97670 in-flight CPI 1.3111 -- Total Cycles 128083 ---- Thread 03 ---- PC 5: Stalled ----- 94894 in-flight CPI 1.3495 -- Total Cycles 128083 ---- Thread 04 ---- PC 5: Stalled ----- 100735 in-flight CPI 1.2712 -- Total Cycles 128083 ---- Thread 05 ---- PC 5: Stalled ----- 98548 in-flight CPI 1.2994 -- Total Cycles 128083 ---- Thread 06 ---- PC 5: Stalled ----- 98491 in-flight CPI 1.3002 -- Total Cycles 128083 ---- Thread 07 ---- PC 5: Stalled ----- 101334 in-flight CPI 1.2637 -- Total Cycles 128083 ---- Thread 08 ---- PC 5: Stalled ----- 98541 in-flight CPI 1.2996 -- Total Cycles 128083 ---- Thread 09 ---- PC 5: Stalled ----- 98944 in-flight CPI 1.2942 -- Total Cycles 128083 ---- Thread 10 ---- PC 5: Stalled ----- 99226 in-flight CPI 1.2906 -- Total Cycles 128083 ---- Thread 11 ---- PC 5: Stalled ----- 96894 in-flight CPI 1.3216 -- Total Cycles 128083 ---- Thread 12 ---- PC 5: Stalled ----- 94201 in-flight CPI 1.3594 -- Total Cycles 128083 ---- Thread 13 ---- PC 5: Stalled ----- 96524 in-flight CPI 1.3266 -- Total Cycles 128083 ---- Thread 14 ---- PC 5: Stalled ----- 93802 in-flight CPI 1.3652 -- Total Cycles 128083 ---- Thread 15 ---- PC 5: Stalled ----- 95229 in-flight CPI 1.3447 -- Total Cycles 128083 ---- Thread 16 ---- PC 5: Stalled ----- 97806 in-flight CPI 1.3093 -- Total Cycles 128083 ---- Thread 17 ---- PC 5: Stalled ----- 98766 in-flight CPI 1.2965 -- Total Cycles 128083 ---- Thread 18 ---- PC 5: Stalled ----- 101131 in-flight CPI 1.2663 -- Total Cycles 128083 ---- Thread 19 ---- PC 5: Stalled ----- 92071 in-flight CPI 1.3909 -- Total Cycles 128083 ---- Thread 20 ---- PC 5: Stalled ----- 93114 in-flight CPI 1.3753 -- Total Cycles 128083 ---- Thread 21 ---- PC 5: Stalled ----- 95416 in-flight CPI 1.3421 -- Total Cycles 128083 ---- Thread 22 ---- PC 5: Stalled ----- 89001 in-flight CPI 1.4388 -- Total Cycles 128083 ---- Thread 23 ---- PC 5: Stalled ----- 91215 in-flight CPI 1.4040 -- Total Cycles 128083 ---- Thread 24 ---- PC 5: Stalled ----- 90196 in-flight CPI 1.4198 -- Total Cycles 128083 ---- Thread 25 ---- PC 5: Stalled ----- 87321 in-flight CPI 1.4665 -- Total Cycles 128083 ---- Thread 26 ---- PC 5: Stalled ----- 91451 in-flight CPI 1.4003 -- Total Cycles 128083 ---- Thread 27 ---- PC 5: Stalled ----- 92310 in-flight CPI 1.3872 -- Total Cycles 128083 ---- Thread 28 ---- PC 5: Stalled ----- 83841 in-flight CPI 1.5274 -- Total Cycles 128083 ---- Thread 29 ---- PC 5: Stalled ----- 91908 in-flight CPI 1.3934 -- Total Cycles 128083 ---- Thread 30 ---- PC 5: Stalled ----- 88921 in-flight CPI 1.4401 -- Total Cycles 128083 ---- Thread 31 ---- PC 5: Stalled ----- 89191 in-flight CPI 1.4357 -- Total Cycles 128083 Total CPI 0.0423 , IPC 23.6533 -- Total Cycles 128083 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7683 (4.043663%) FPSUB: 0 (0.000000%) FPMUL: 31565 (16.613070%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 65103 (34.264557%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5932 (3.122089%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71474 (37.617697%) DIV: 7973 (4.196294%) FPUN: 0 (0.000000%) FPRSUB: 271 (0.142631%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3321732 total) ADD%: 7.554 (250939) SUB%: 0.000 (0) MUL%: 0.007 (216) BITOR%: 1.532 (50874) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (18185) FPSUB%: 0.000 (0) FPMUL%: 4.769 (158422) FPCMPLT%: 0.000 (0) FPMIN%: 0.020 (648) FPMAX%: 0.020 (648) LOAD%: 5.149 (171025) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (248) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (617) FPINV%: 0.000 (0) FPCONV%: 0.020 (680) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35477) FPLE%: 0.457 (15175) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.020 (648) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.816 (93536) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24769) CMPU%: 0.000 (0) RSUB%: 0.007 (216) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.747 (523081) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.180 (39186) ORI%: 1.561 (51840) XORI%: 0.000 (0) MULI%: 3.222 (107032) LW%: 1.137 (37760) LWI%: 13.566 (450634) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9592) SWI%: 4.103 (136275) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.407 (46737) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10322) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1754) bned%: 0.000 (0) bneid%: 13.874 (460852) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23781) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4057) DIV%: 0.013 (432) FPUN%: 1.482 (49232) FPRSUB%: 3.670 (121913) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (80) FPGT%: 2.968 (98603) FPGE%: 1.025 (34057) SYNC%: 0.000 (0) NOP%: 8.793 (292093) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 12 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 42 FPCMPLT 0 FPMIN 0 FPMAX 417 LOAD 39421 INTCONV 0 ATOMIC_INC 23 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1589 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49129 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 16 ORI 10842 XORI 0 MULI 9832 LW 0 LWI 142525 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 29 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6535 --Total thread-cycles: 4098656 --total thread-cycles issued: 3029639 (73.917865%) --iCache conflicts: 113681 (2.773617%) --thread*cycles of FU dependence: 254007 (6.197324%) --thread*cycles of data dependence: 190001 (4.635690%) --iCache cycles*banks: 4098656 (81.045201% used) Issue breakdown: --thread*cycles of issue worked: 3029639 (73.917865%) --thread*cycles of issue failed: 776924 (18.955580%) --thread*cycles of issue NOP/other: 292093 (7.126556%) Number of thread-cycles not ready: 190001 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3321732 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 8 3: 7 4: 8 5: 8 6: 8 7: 9 8: 7 9: 8 10: 8 11: 9 12: 7 13: 9 14: 8 15: 9 16: 9 17: 9 18: 8 19: 7 20: 7 21: 7 22: 8 23: 6 24: 7 25: 7 26: 7 27: 8 28: 6 29: 7 30: 8 31: 8 <=== Core 5 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101556 in-flight CPI 1.2600 -- Total Cycles 127991 ---- Thread 01 ---- PC 5: Stalled ----- 94317 in-flight CPI 1.3568 -- Total Cycles 127991 ---- Thread 02 ---- PC 5: Stalled ----- 101034 in-flight CPI 1.2666 -- Total Cycles 127991 ---- Thread 03 ---- PC 5: Stalled ----- 98572 in-flight CPI 1.2982 -- Total Cycles 127991 ---- Thread 04 ---- PC 5: Stalled ----- 99062 in-flight CPI 1.2918 -- Total Cycles 127991 ---- Thread 05 ---- PC 5: Stalled ----- 102982 in-flight CPI 1.2426 -- Total Cycles 127991 ---- Thread 06 ---- PC 5: Stalled ----- 101704 in-flight CPI 1.2582 -- Total Cycles 127991 ---- Thread 07 ---- PC 5: Stalled ----- 100732 in-flight CPI 1.2704 -- Total Cycles 127991 ---- Thread 08 ---- PC 5: Stalled ----- 99081 in-flight CPI 1.2915 -- Total Cycles 127991 ---- Thread 09 ---- PC 5: Stalled ----- 96308 in-flight CPI 1.3287 -- Total Cycles 127991 ---- Thread 10 ---- PC 5: Stalled ----- 92318 in-flight CPI 1.3862 -- Total Cycles 127991 ---- Thread 11 ---- PC 5: Stalled ----- 101560 in-flight CPI 1.2599 -- Total Cycles 127991 ---- Thread 12 ---- PC 5: Stalled ----- 95217 in-flight CPI 1.3439 -- Total Cycles 127991 ---- Thread 13 ---- PC 5: Stalled ----- 96058 in-flight CPI 1.3322 -- Total Cycles 127991 ---- Thread 14 ---- PC 5: Stalled ----- 96147 in-flight CPI 1.3310 -- Total Cycles 127991 ---- Thread 15 ---- PC 5: Stalled ----- 98372 in-flight CPI 1.3008 -- Total Cycles 127991 ---- Thread 16 ---- PC 5: Stalled ----- 95211 in-flight CPI 1.3440 -- Total Cycles 127991 ---- Thread 17 ---- PC 5: Stalled ----- 97520 in-flight CPI 1.3122 -- Total Cycles 127991 ---- Thread 18 ---- PC 5: Stalled ----- 91349 in-flight CPI 1.4009 -- Total Cycles 127991 ---- Thread 19 ---- PC 5: Stalled ----- 91970 in-flight CPI 1.3914 -- Total Cycles 127991 ---- Thread 20 ---- PC 5: Stalled ----- 92085 in-flight CPI 1.3897 -- Total Cycles 127991 ---- Thread 21 ---- PC 5: Stalled ----- 94425 in-flight CPI 1.3552 -- Total Cycles 127991 ---- Thread 22 ---- PC 5: Stalled ----- 89027 in-flight CPI 1.4375 -- Total Cycles 127991 ---- Thread 23 ---- PC 5: Stalled ----- 97298 in-flight CPI 1.3152 -- Total Cycles 127991 ---- Thread 24 ---- PC 5: Stalled ----- 94565 in-flight CPI 1.3532 -- Total Cycles 127991 ---- Thread 25 ---- PC 5: Stalled ----- 91796 in-flight CPI 1.3941 -- Total Cycles 127991 ---- Thread 26 ---- PC 5: Stalled ----- 90844 in-flight CPI 1.4086 -- Total Cycles 127991 ---- Thread 27 ---- PC 5: Stalled ----- 88731 in-flight CPI 1.4422 -- Total Cycles 127991 ---- Thread 28 ---- PC 5: Stalled ----- 85198 in-flight CPI 1.5020 -- Total Cycles 127991 ---- Thread 29 ---- PC 5: Stalled ----- 89295 in-flight CPI 1.4331 -- Total Cycles 127991 ---- Thread 30 ---- PC 5: Stalled ----- 79430 in-flight CPI 1.6112 -- Total Cycles 127991 ---- Thread 31 ---- PC 5: Stalled ----- 92055 in-flight CPI 1.3901 -- Total Cycles 127991 Total CPI 0.0422 , IPC 23.7235 -- Total Cycles 127991 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7531 (3.824240%) FPSUB: 0 (0.000000%) FPMUL: 31345 (15.916985%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73971 (37.562459%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5775 (2.932544%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70241 (35.668366%) DIV: 7795 (3.958299%) FPUN: 0 (0.000000%) FPRSUB: 270 (0.137106%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3328950 total) ADD%: 7.524 (250479) SUB%: 0.000 (0) MUL%: 0.006 (211) BITOR%: 1.539 (51236) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (17929) FPSUB%: 0.000 (0) FPMUL%: 4.752 (158193) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (633) FPMAX%: 0.019 (633) LOAD%: 5.140 (171099) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (602) FPINV%: 0.000 (0) FPCONV%: 0.020 (665) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35428) FPLE%: 0.457 (15203) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (633) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.824 (93996) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.745 (24799) CMPU%: 0.000 (0) RSUB%: 0.006 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.752 (524383) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.182 (39348) ORI%: 1.560 (51928) XORI%: 0.000 (0) MULI%: 3.229 (107496) LW%: 1.140 (37936) LWI%: 13.590 (452409) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9633) SWI%: 4.100 (136489) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.411 (46965) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10365) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.052 (1737) bned%: 0.000 (0) bneid%: 13.881 (462106) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (24058) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (3989) DIV%: 0.013 (422) FPUN%: 1.490 (49613) FPRSUB%: 3.669 (122147) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.964 (98657) FPGE%: 1.034 (34410) SYNC%: 0.000 (0) NOP%: 8.786 (292498) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 11 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 16 FPSUB 0 FPMUL 40 FPCMPLT 0 FPMIN 1 FPMAX 414 LOAD 39058 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1781 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49283 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 10725 XORI 0 MULI 10172 LW 0 LWI 143153 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 88 DIV 34 FPUN 0 FPRSUB 1 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7238 --Total thread-cycles: 4095712 --total thread-cycles issued: 3036452 (74.137342%) --iCache conflicts: 114373 (2.792506%) --thread*cycles of FU dependence: 254833 (6.221946%) --thread*cycles of data dependence: 196928 (4.808151%) --iCache cycles*banks: 4095712 (81.279690% used) Issue breakdown: --thread*cycles of issue worked: 3036452 (74.137342%) --thread*cycles of issue failed: 766762 (18.721092%) --thread*cycles of issue NOP/other: 292498 (7.141567%) Number of thread-cycles not ready: 196928 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3328950 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 8 3: 8 4: 8 5: 8 6: 9 7: 8 8: 8 9: 8 10: 7 11: 10 12: 8 13: 8 14: 7 15: 9 16: 8 17: 8 18: 7 19: 7 20: 7 21: 8 22: 6 23: 7 24: 8 25: 7 26: 8 27: 6 28: 7 29: 7 30: 5 31: 7 <=== Core 6 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99011 in-flight CPI 1.2828 -- Total Cycles 127047 ---- Thread 01 ---- PC 5: Stalled ----- 99992 in-flight CPI 1.2703 -- Total Cycles 127047 ---- Thread 02 ---- PC 5: Stalled ----- 101685 in-flight CPI 1.2492 -- Total Cycles 127047 ---- Thread 03 ---- PC 5: Stalled ----- 100103 in-flight CPI 1.2689 -- Total Cycles 127047 ---- Thread 04 ---- PC 5: Stalled ----- 92631 in-flight CPI 1.3713 -- Total Cycles 127047 ---- Thread 05 ---- PC 5: Stalled ----- 92283 in-flight CPI 1.3765 -- Total Cycles 127047 ---- Thread 06 ---- PC 5: Stalled ----- 96517 in-flight CPI 1.3160 -- Total Cycles 127047 ---- Thread 07 ---- PC 5: Stalled ----- 92175 in-flight CPI 1.3781 -- Total Cycles 127047 ---- Thread 08 ---- PC 5: Stalled ----- 98798 in-flight CPI 1.2857 -- Total Cycles 127047 ---- Thread 09 ---- PC 5: Stalled ----- 95037 in-flight CPI 1.3366 -- Total Cycles 127047 ---- Thread 10 ---- PC 5: Stalled ----- 94806 in-flight CPI 1.3398 -- Total Cycles 127047 ---- Thread 11 ---- PC 5: Stalled ----- 94610 in-flight CPI 1.3426 -- Total Cycles 127047 ---- Thread 12 ---- PC 5: Stalled ----- 100592 in-flight CPI 1.2628 -- Total Cycles 127047 ---- Thread 13 ---- PC 5: Stalled ----- 95744 in-flight CPI 1.3267 -- Total Cycles 127047 ---- Thread 14 ---- PC 5: Stalled ----- 92851 in-flight CPI 1.3681 -- Total Cycles 127047 ---- Thread 15 ---- PC 5: Stalled ----- 89264 in-flight CPI 1.4231 -- Total Cycles 127047 ---- Thread 16 ---- PC 5: Stalled ----- 98165 in-flight CPI 1.2940 -- Total Cycles 127047 ---- Thread 17 ---- PC 5: Stalled ----- 100331 in-flight CPI 1.2660 -- Total Cycles 127047 ---- Thread 18 ---- PC 5: Stalled ----- 91974 in-flight CPI 1.3811 -- Total Cycles 127047 ---- Thread 19 ---- PC 5: Stalled ----- 93035 in-flight CPI 1.3653 -- Total Cycles 127047 ---- Thread 20 ---- PC 5: Stalled ----- 95519 in-flight CPI 1.3298 -- Total Cycles 127047 ---- Thread 21 ---- PC 5: Stalled ----- 92847 in-flight CPI 1.3681 -- Total Cycles 127047 ---- Thread 22 ---- PC 5: Stalled ----- 85521 in-flight CPI 1.4854 -- Total Cycles 127047 ---- Thread 23 ---- PC 5: Stalled ----- 94842 in-flight CPI 1.3393 -- Total Cycles 127047 ---- Thread 24 ---- PC 5: Stalled ----- 93260 in-flight CPI 1.3620 -- Total Cycles 127047 ---- Thread 25 ---- PC 5: Stalled ----- 89462 in-flight CPI 1.4198 -- Total Cycles 127047 ---- Thread 26 ---- PC 5: Stalled ----- 87474 in-flight CPI 1.4521 -- Total Cycles 127047 ---- Thread 27 ---- PC 5: Stalled ----- 92001 in-flight CPI 1.3807 -- Total Cycles 127047 ---- Thread 28 ---- PC 5: Stalled ----- 88268 in-flight CPI 1.4391 -- Total Cycles 127047 ---- Thread 29 ---- PC 5: Stalled ----- 86735 in-flight CPI 1.4645 -- Total Cycles 127047 ---- Thread 30 ---- PC 5: Stalled ----- 90488 in-flight CPI 1.4037 -- Total Cycles 127047 ---- Thread 31 ---- PC 5: Stalled ----- 90909 in-flight CPI 1.3973 -- Total Cycles 127047 Total CPI 0.0422 , IPC 23.6723 -- Total Cycles 127047 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7406 (3.478302%) FPSUB: 0 (0.000000%) FPMUL: 30985 (14.552414%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 92706 (43.540297%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5535 (2.599568%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68419 (32.133665%) DIV: 7606 (3.572234%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.123521%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3297588 total) ADD%: 7.599 (250569) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.536 (50637) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.533 (17573) FPSUB%: 0.000 (0) FPMUL%: 4.736 (156170) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.129 (169119) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (585) FPINV%: 0.000 (0) FPCONV%: 0.020 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (35035) FPLE%: 0.459 (15131) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.821 (93041) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.740 (24415) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.747 (519281) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (38852) ORI%: 1.555 (51272) XORI%: 0.000 (0) MULI%: 3.228 (106432) LW%: 1.139 (37546) LWI%: 13.582 (447873) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9547) SWI%: 4.093 (134981) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.409 (46469) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10280) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1776) bned%: 0.000 (0) bneid%: 13.880 (457701) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (23857) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.118 (3885) DIV%: 0.012 (412) FPUN%: 1.490 (49125) FPRSUB%: 3.663 (120786) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (60) FPGT%: 2.969 (97894) FPGE%: 1.031 (33994) SYNC%: 0.000 (0) NOP%: 8.796 (290040) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 25 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 32 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 39209 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1675 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48926 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 10574 XORI 0 MULI 9938 LW 0 LWI 141644 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 42 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6725 --Total thread-cycles: 4065504 --total thread-cycles issued: 3007548 (73.977249%) --iCache conflicts: 111383 (2.739710%) --thread*cycles of FU dependence: 252621 (6.213768%) --thread*cycles of data dependence: 212920 (5.237235%) --iCache cycles*banks: 4065504 (81.112207% used) Issue breakdown: --thread*cycles of issue worked: 3007548 (73.977249%) --thread*cycles of issue failed: 767916 (18.888581%) --thread*cycles of issue NOP/other: 4562161815689391352 (112216389792984.860000%) Number of thread-cycles not ready: 212920 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3297588 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 10 1: 9 2: 8 3: 8 4: 6 5: 7 6: 9 7: 7 8: 7 9: 7 10: 8 11: 8 12: 8 13: 8 14: 5 15: 6 16: 8 17: 9 18: 7 19: 7 20: 7 21: 8 22: 5 23: 8 24: 8 25: 8 26: 7 27: 7 28: 6 29: 7 30: 8 31: 7 <=== Core 7 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97589 in-flight CPI 1.3061 -- Total Cycles 127488 ---- Thread 01 ---- PC 5: Stalled ----- 95567 in-flight CPI 1.3337 -- Total Cycles 127488 ---- Thread 02 ---- PC 5: Stalled ----- 100486 in-flight CPI 1.2685 -- Total Cycles 127488 ---- Thread 03 ---- PC 5: Stalled ----- 93765 in-flight CPI 1.3594 -- Total Cycles 127488 ---- Thread 04 ---- PC 5: Stalled ----- 99034 in-flight CPI 1.2871 -- Total Cycles 127488 ---- Thread 05 ---- PC 5: Stalled ----- 99239 in-flight CPI 1.2844 -- Total Cycles 127488 ---- Thread 06 ---- PC 5: Stalled ----- 98709 in-flight CPI 1.2913 -- Total Cycles 127488 ---- Thread 07 ---- PC 5: Stalled ----- 100813 in-flight CPI 1.2644 -- Total Cycles 127488 ---- Thread 08 ---- PC 5: Stalled ----- 97804 in-flight CPI 1.3032 -- Total Cycles 127488 ---- Thread 09 ---- PC 5: Stalled ----- 101713 in-flight CPI 1.2532 -- Total Cycles 127488 ---- Thread 10 ---- PC 5: Stalled ----- 97394 in-flight CPI 1.3087 -- Total Cycles 127488 ---- Thread 11 ---- PC 5: Stalled ----- 100204 in-flight CPI 1.2720 -- Total Cycles 127488 ---- Thread 12 ---- PC 5: Stalled ----- 96085 in-flight CPI 1.3266 -- Total Cycles 127488 ---- Thread 13 ---- PC 5: Stalled ----- 95905 in-flight CPI 1.3290 -- Total Cycles 127488 ---- Thread 14 ---- PC 5: Stalled ----- 95462 in-flight CPI 1.3353 -- Total Cycles 127488 ---- Thread 15 ---- PC 5: Stalled ----- 96684 in-flight CPI 1.3184 -- Total Cycles 127488 ---- Thread 16 ---- PC 5: Stalled ----- 96598 in-flight CPI 1.3195 -- Total Cycles 127488 ---- Thread 17 ---- PC 5: Stalled ----- 95802 in-flight CPI 1.3305 -- Total Cycles 127488 ---- Thread 18 ---- PC 5: Stalled ----- 97382 in-flight CPI 1.3089 -- Total Cycles 127488 ---- Thread 19 ---- PC 5: Stalled ----- 92902 in-flight CPI 1.3720 -- Total Cycles 127488 ---- Thread 20 ---- PC 5: Stalled ----- 97404 in-flight CPI 1.3086 -- Total Cycles 127488 ---- Thread 21 ---- PC 5: Stalled ----- 94374 in-flight CPI 1.3506 -- Total Cycles 127488 ---- Thread 22 ---- PC 5: Stalled ----- 88319 in-flight CPI 1.4432 -- Total Cycles 127488 ---- Thread 23 ---- PC 5: Stalled ----- 90814 in-flight CPI 1.4036 -- Total Cycles 127488 ---- Thread 24 ---- PC 5: Stalled ----- 92954 in-flight CPI 1.3712 -- Total Cycles 127488 ---- Thread 25 ---- PC 5: Stalled ----- 92924 in-flight CPI 1.3717 -- Total Cycles 127488 ---- Thread 26 ---- PC 5: Stalled ----- 85285 in-flight CPI 1.4946 -- Total Cycles 127488 ---- Thread 27 ---- PC 5: Stalled ----- 87737 in-flight CPI 1.4528 -- Total Cycles 127488 ---- Thread 28 ---- PC 5: Stalled ----- 87143 in-flight CPI 1.4627 -- Total Cycles 127488 ---- Thread 29 ---- PC 5: Stalled ----- 92006 in-flight CPI 1.3854 -- Total Cycles 127488 ---- Thread 30 ---- PC 5: Stalled ----- 92100 in-flight CPI 1.3840 -- Total Cycles 127488 ---- Thread 31 ---- PC 5: Stalled ----- 83510 in-flight CPI 1.5263 -- Total Cycles 127488 Total CPI 0.0420 , IPC 23.8006 -- Total Cycles 127488 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 6646 (3.614687%) FPSUB: 0 (0.000000%) FPMUL: 29653 (16.127944%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 71176 (38.711853%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5779 (3.143135%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 62434 (33.957174%) DIV: 7899 (4.296180%) FPUN: 0 (0.000000%) FPRSUB: 274 (0.149026%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3326678 total) ADD%: 7.604 (252962) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.554 (51695) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.484 (16101) FPSUB%: 0.000 (0) FPMUL%: 4.585 (152525) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.059 (168292) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (606) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.043 (34702) FPLE%: 0.463 (15419) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.856 (95004) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.723 (24051) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.786 (525153) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.186 (39442) ORI%: 1.525 (50747) XORI%: 0.000 (0) MULI%: 3.262 (108508) LW%: 1.153 (38344) LWI%: 13.658 (454342) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.293 (9754) SWI%: 4.127 (137304) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.426 (47448) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.313 (10409) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.045 (1486) bned%: 0.000 (0) bneid%: 13.935 (463556) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.734 (24422) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.106 (3527) DIV%: 0.013 (428) FPUN%: 1.515 (50383) FPRSUB%: 3.612 (120168) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.981 (99163) FPGE%: 1.051 (34964) SYNC%: 0.000 (0) NOP%: 8.787 (292329) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 41 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 38 FPCMPLT 0 FPMIN 0 FPMAX 414 LOAD 36918 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1402 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49716 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 9380 XORI 0 MULI 10216 LW 0 LWI 143302 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 49 DIV 19 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8009 --Total thread-cycles: 4079616 --total thread-cycles issued: 3034349 (74.378299%) --iCache conflicts: 113394 (2.779526%) --thread*cycles of FU dependence: 251561 (6.166291%) --thread*cycles of data dependence: 183861 (4.506821%) --iCache cycles*banks: 4079616 (81.544685% used) Issue breakdown: --thread*cycles of issue worked: 3034349 (74.378299%) --thread*cycles of issue failed: 752938 (18.456100%) --thread*cycles of issue NOP/other: 4612155766517036521 (113053673838837.670000%) Number of thread-cycles not ready: 183861 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3326678 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 8 3: 7 4: 8 5: 9 6: 8 7: 8 8: 9 9: 8 10: 8 11: 8 12: 7 13: 9 14: 7 15: 7 16: 8 17: 8 18: 8 19: 8 20: 8 21: 8 22: 7 23: 6 24: 8 25: 7 26: 7 27: 6 28: 7 29: 7 30: 7 31: 8 <=== Core 8 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100845 in-flight CPI 1.4990 -- Total Cycles 151199 ---- Thread 01 ---- PC 5: Stalled ----- 97165 in-flight CPI 1.5557 -- Total Cycles 151199 ---- Thread 02 ---- PC 5: Stalled ----- 97134 in-flight CPI 1.5563 -- Total Cycles 151199 ---- Thread 03 ---- PC 5: Stalled ----- 92594 in-flight CPI 1.6326 -- Total Cycles 151199 ---- Thread 04 ---- PC 5: Stalled ----- 98637 in-flight CPI 1.5325 -- Total Cycles 151199 ---- Thread 05 ---- PC 5: Stalled ----- 101089 in-flight CPI 1.4954 -- Total Cycles 151199 ---- Thread 06 ---- PC 5: Stalled ----- 93166 in-flight CPI 1.6227 -- Total Cycles 151199 ---- Thread 07 ---- PC 5: Stalled ----- 95465 in-flight CPI 1.5835 -- Total Cycles 151199 ---- Thread 08 ---- PC 5: Stalled ----- 95557 in-flight CPI 1.5820 -- Total Cycles 151199 ---- Thread 09 ---- PC 5: Stalled ----- 96408 in-flight CPI 1.5680 -- Total Cycles 151199 ---- Thread 10 ---- PC 5: Stalled ----- 97469 in-flight CPI 1.5510 -- Total Cycles 151199 ---- Thread 11 ---- PC 5: Stalled ----- 95470 in-flight CPI 1.5835 -- Total Cycles 151199 ---- Thread 12 ---- PC 5: Stalled ----- 92841 in-flight CPI 1.6283 -- Total Cycles 151199 ---- Thread 13 ---- PC 5: Stalled ----- 97799 in-flight CPI 1.5457 -- Total Cycles 151199 ---- Thread 14 ---- PC 5: Stalled ----- 99290 in-flight CPI 1.5226 -- Total Cycles 151199 ---- Thread 15 ---- PC 5: Stalled ----- 90293 in-flight CPI 1.6743 -- Total Cycles 151199 ---- Thread 16 ---- PC 5: Stalled ----- 99096 in-flight CPI 1.5255 -- Total Cycles 151199 ---- Thread 17 ---- PC 5: Stalled ----- 96448 in-flight CPI 1.5673 -- Total Cycles 151199 ---- Thread 18 ---- PC 5: Stalled ----- 95549 in-flight CPI 1.5821 -- Total Cycles 151199 ---- Thread 19 ---- PC 5: Stalled ----- 94986 in-flight CPI 1.5915 -- Total Cycles 151199 ---- Thread 20 ---- PC 5: Stalled ----- 90418 in-flight CPI 1.6719 -- Total Cycles 151199 ---- Thread 21 ---- PC 5: Stalled ----- 87743 in-flight CPI 1.7230 -- Total Cycles 151199 ---- Thread 22 ---- PC 5: Stalled ----- 95163 in-flight CPI 1.5885 -- Total Cycles 151199 ---- Thread 23 ---- PC 5: Stalled ----- 82391 in-flight CPI 1.8348 -- Total Cycles 151199 ---- Thread 24 ---- PC 5: Stalled ----- 93535 in-flight CPI 1.6162 -- Total Cycles 151199 ---- Thread 25 ---- PC 5: Stalled ----- 89397 in-flight CPI 1.6910 -- Total Cycles 151199 ---- Thread 26 ---- PC 5: Stalled ----- 83686 in-flight CPI 1.8065 -- Total Cycles 151199 ---- Thread 27 ---- PC 5: Stalled ----- 89000 in-flight CPI 1.6986 -- Total Cycles 151199 ---- Thread 28 ---- PC 5: Stalled ----- 93056 in-flight CPI 1.6245 -- Total Cycles 151199 ---- Thread 29 ---- PC 5: Stalled ----- 90548 in-flight CPI 1.6695 -- Total Cycles 151199 ---- Thread 30 ---- PC 5: Stalled ----- 102958 in-flight CPI 1.4684 -- Total Cycles 151199 ---- Thread 31 ---- PC 5: Stalled ----- 87274 in-flight CPI 1.7322 -- Total Cycles 151199 Total CPI 0.0502 , IPC 19.9275 -- Total Cycles 151199 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8049 (3.198274%) FPSUB: 0 (0.000000%) FPMUL: 32164 (12.780380%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 124005 (49.273445%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5526 (2.195759%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74202 (29.484199%) DIV: 7458 (2.963440%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.104503%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3303540 total) ADD%: 7.476 (246982) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.526 (50397) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.573 (18917) FPSUB%: 0.000 (0) FPMUL%: 4.853 (160318) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.202 (171854) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (579) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.075 (35526) FPLE%: 0.455 (15024) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.808 (92762) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.756 (24964) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.749 (520280) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (38915) ORI%: 1.582 (52248) XORI%: 0.000 (0) MULI%: 3.207 (105954) LW%: 1.133 (37428) LWI%: 13.528 (446909) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9489) SWI%: 4.082 (134836) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46360) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10252) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (1998) bned%: 0.000 (0) bneid%: 13.842 (457274) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23701) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4227) DIV%: 0.012 (404) FPUN%: 1.474 (48708) FPRSUB%: 3.702 (122286) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (65) FPGT%: 2.953 (97545) FPGE%: 1.020 (33684) SYNC%: 0.000 (0) NOP%: 8.793 (290464) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 42 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 40266 INTCONV 0 ATOMIC_INC 28 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 4 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1265 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48752 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11465 XORI 0 MULI 9522 LW 0 LWI 141615 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 92 DIV 35 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 19.9278 --Total thread-cycles: 4838368 --total thread-cycles issued: 3013076 (62.274635%) --iCache conflicts: 110075 (2.275044%) --thread*cycles of FU dependence: 253531 (5.240011%) --thread*cycles of data dependence: 251667 (5.201485%) --iCache cycles*banks: 4838368 (68.278643% used) Issue breakdown: --thread*cycles of issue worked: 3013076 (62.274635%) --thread*cycles of issue failed: 1534828 (31.722019%) --thread*cycles of issue NOP/other: -4661559865863016800 (-96345707186039.094000%) Number of thread-cycles not ready: 251667 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3303540 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 8 3: 8 4: 9 5: 8 6: 6 7: 7 8: 8 9: 8 10: 7 11: 7 12: 7 13: 8 14: 7 15: 6 16: 7 17: 9 18: 8 19: 8 20: 8 21: 5 22: 8 23: 6 24: 7 25: 7 26: 5 27: 7 28: 8 29: 8 30: 6 31: 6 <=== Core 9 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98569 in-flight CPI 1.2770 -- Total Cycles 125898 ---- Thread 01 ---- PC 5: Stalled ----- 103014 in-flight CPI 1.2219 -- Total Cycles 125898 ---- Thread 02 ---- PC 5: Stalled ----- 99309 in-flight CPI 1.2675 -- Total Cycles 125898 ---- Thread 03 ---- PC 5: Stalled ----- 99735 in-flight CPI 1.2621 -- Total Cycles 125898 ---- Thread 04 ---- PC 5: Stalled ----- 98202 in-flight CPI 1.2818 -- Total Cycles 125898 ---- Thread 05 ---- PC 5: Stalled ----- 102769 in-flight CPI 1.2248 -- Total Cycles 125898 ---- Thread 06 ---- PC 5: Stalled ----- 98842 in-flight CPI 1.2735 -- Total Cycles 125898 ---- Thread 07 ---- PC 5: Stalled ----- 91070 in-flight CPI 1.3822 -- Total Cycles 125898 ---- Thread 08 ---- PC 5: Stalled ----- 94255 in-flight CPI 1.3354 -- Total Cycles 125898 ---- Thread 09 ---- PC 5: Stalled ----- 96743 in-flight CPI 1.3012 -- Total Cycles 125898 ---- Thread 10 ---- PC 5: Stalled ----- 99398 in-flight CPI 1.2664 -- Total Cycles 125898 ---- Thread 11 ---- PC 5: Stalled ----- 98896 in-flight CPI 1.2728 -- Total Cycles 125898 ---- Thread 12 ---- PC 5: Stalled ----- 97999 in-flight CPI 1.2844 -- Total Cycles 125898 ---- Thread 13 ---- PC 5: Stalled ----- 93748 in-flight CPI 1.3427 -- Total Cycles 125898 ---- Thread 14 ---- PC 5: Stalled ----- 93814 in-flight CPI 1.3418 -- Total Cycles 125898 ---- Thread 15 ---- PC 5: Stalled ----- 97436 in-flight CPI 1.2919 -- Total Cycles 125898 ---- Thread 16 ---- PC 5: Stalled ----- 89369 in-flight CPI 1.4085 -- Total Cycles 125898 ---- Thread 17 ---- PC 5: Stalled ----- 95950 in-flight CPI 1.3119 -- Total Cycles 125898 ---- Thread 18 ---- PC 5: Stalled ----- 93374 in-flight CPI 1.3481 -- Total Cycles 125898 ---- Thread 19 ---- PC 5: Stalled ----- 90296 in-flight CPI 1.3941 -- Total Cycles 125898 ---- Thread 20 ---- PC 5: Stalled ----- 92777 in-flight CPI 1.3568 -- Total Cycles 125898 ---- Thread 21 ---- PC 5: Stalled ----- 93907 in-flight CPI 1.3404 -- Total Cycles 125898 ---- Thread 22 ---- PC 5: Stalled ----- 91143 in-flight CPI 1.3811 -- Total Cycles 125898 ---- Thread 23 ---- PC 5: Stalled ----- 93093 in-flight CPI 1.3521 -- Total Cycles 125898 ---- Thread 24 ---- PC 5: Stalled ----- 90748 in-flight CPI 1.3871 -- Total Cycles 125898 ---- Thread 25 ---- PC 5: Stalled ----- 89155 in-flight CPI 1.4119 -- Total Cycles 125898 ---- Thread 26 ---- PC 5: Stalled ----- 89182 in-flight CPI 1.4114 -- Total Cycles 125898 ---- Thread 27 ---- PC 5: Stalled ----- 86592 in-flight CPI 1.4536 -- Total Cycles 125898 ---- Thread 28 ---- PC 5: Stalled ----- 92774 in-flight CPI 1.3568 -- Total Cycles 125898 ---- Thread 29 ---- PC 5: Stalled ----- 92077 in-flight CPI 1.3670 -- Total Cycles 125898 ---- Thread 30 ---- PC 5: Stalled ----- 88821 in-flight CPI 1.4172 -- Total Cycles 125898 ---- Thread 31 ---- PC 5: Stalled ----- 84959 in-flight CPI 1.4816 -- Total Cycles 125898 Total CPI 0.0417 , IPC 23.9764 -- Total Cycles 125898 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7545 (3.811800%) FPSUB: 0 (0.000000%) FPMUL: 31387 (15.856986%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 75969 (38.380200%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5642 (2.850387%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69489 (35.106447%) DIV: 7639 (3.859289%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.134891%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3309898 total) ADD%: 7.461 (246942) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.542 (51039) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (17959) FPSUB%: 0.000 (0) FPMUL%: 4.765 (157708) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.144 (170251) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (591) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (35252) FPLE%: 0.459 (15205) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.827 (93567) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.743 (24599) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.768 (521899) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.181 (39082) ORI%: 1.566 (51826) XORI%: 0.000 (0) MULI%: 3.227 (106822) LW%: 1.141 (37758) LWI%: 13.573 (449254) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9584) SWI%: 4.092 (135440) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.412 (46752) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10317) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1812) bned%: 0.000 (0) bneid%: 13.894 (459871) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.724 (23969) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.119 (3946) DIV%: 0.013 (414) FPUN%: 1.496 (49508) FPRSUB%: 3.672 (121524) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.964 (98103) FPGE%: 1.036 (34303) SYNC%: 0.000 (0) NOP%: 8.800 (291261) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 51 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 39322 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1483 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49117 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 10773 XORI 0 MULI 10037 LW 0 LWI 142153 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 65 DIV 15 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9767 --Total thread-cycles: 4028736 --total thread-cycles issued: 3018637 (74.927645%) --iCache conflicts: 113057 (2.806265%) --thread*cycles of FU dependence: 253532 (6.293090%) --thread*cycles of data dependence: 197938 (4.913154%) --iCache cycles*banks: 4028736 (82.158027% used) Issue breakdown: --thread*cycles of issue worked: 3018637 (74.927645%) --thread*cycles of issue failed: 718838 (17.842768%) --thread*cycles of issue NOP/other: 4611603692494549437 (114467755953593.110000%) Number of thread-cycles not ready: 197938 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3309898 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 8 3: 8 4: 8 5: 9 6: 8 7: 6 8: 8 9: 6 10: 8 11: 8 12: 8 13: 8 14: 6 15: 8 16: 7 17: 8 18: 7 19: 6 20: 7 21: 8 22: 7 23: 8 24: 7 25: 6 26: 7 27: 8 28: 8 29: 9 30: 7 31: 6 <=== Core 10 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99612 in-flight CPI 1.5210 -- Total Cycles 151544 ---- Thread 01 ---- PC 5: Stalled ----- 100421 in-flight CPI 1.5088 -- Total Cycles 151544 ---- Thread 02 ---- PC 5: Stalled ----- 100996 in-flight CPI 1.5002 -- Total Cycles 151544 ---- Thread 03 ---- PC 5: Stalled ----- 100013 in-flight CPI 1.5150 -- Total Cycles 151544 ---- Thread 04 ---- PC 5: Stalled ----- 103096 in-flight CPI 1.4696 -- Total Cycles 151544 ---- Thread 05 ---- PC 5: Stalled ----- 101166 in-flight CPI 1.4977 -- Total Cycles 151544 ---- Thread 06 ---- PC 5: Stalled ----- 94192 in-flight CPI 1.6086 -- Total Cycles 151544 ---- Thread 07 ---- PC 5: Stalled ----- 98798 in-flight CPI 1.5336 -- Total Cycles 151544 ---- Thread 08 ---- PC 5: Stalled ----- 92931 in-flight CPI 1.6304 -- Total Cycles 151544 ---- Thread 09 ---- PC 5: Stalled ----- 100173 in-flight CPI 1.5125 -- Total Cycles 151544 ---- Thread 10 ---- PC 5: Stalled ----- 95227 in-flight CPI 1.5911 -- Total Cycles 151544 ---- Thread 11 ---- PC 5: Stalled ----- 95423 in-flight CPI 1.5878 -- Total Cycles 151544 ---- Thread 12 ---- PC 5: Stalled ----- 94513 in-flight CPI 1.6030 -- Total Cycles 151544 ---- Thread 13 ---- PC 5: Stalled ----- 99533 in-flight CPI 1.5223 -- Total Cycles 151544 ---- Thread 14 ---- PC 5: Stalled ----- 93910 in-flight CPI 1.6134 -- Total Cycles 151544 ---- Thread 15 ---- PC 5: Stalled ----- 91775 in-flight CPI 1.6510 -- Total Cycles 151544 ---- Thread 16 ---- PC 5: Stalled ----- 96062 in-flight CPI 1.5773 -- Total Cycles 151544 ---- Thread 17 ---- PC 5: Stalled ----- 96010 in-flight CPI 1.5782 -- Total Cycles 151544 ---- Thread 18 ---- PC 5: Stalled ----- 99618 in-flight CPI 1.5209 -- Total Cycles 151544 ---- Thread 19 ---- PC 5: Stalled ----- 99085 in-flight CPI 1.5291 -- Total Cycles 151544 ---- Thread 20 ---- PC 5: Stalled ----- 91078 in-flight CPI 1.6636 -- Total Cycles 151544 ---- Thread 21 ---- PC 5: Stalled ----- 92967 in-flight CPI 1.6298 -- Total Cycles 151544 ---- Thread 22 ---- PC 5: Stalled ----- 97442 in-flight CPI 1.5549 -- Total Cycles 151544 ---- Thread 23 ---- PC 5: Stalled ----- 90203 in-flight CPI 1.6797 -- Total Cycles 151544 ---- Thread 24 ---- PC 5: Stalled ----- 108444 in-flight CPI 1.3973 -- Total Cycles 151544 ---- Thread 25 ---- PC 5: Stalled ----- 92022 in-flight CPI 1.6465 -- Total Cycles 151544 ---- Thread 26 ---- PC 5: Stalled ----- 86547 in-flight CPI 1.7507 -- Total Cycles 151544 ---- Thread 27 ---- PC 5: Stalled ----- 86012 in-flight CPI 1.7616 -- Total Cycles 151544 ---- Thread 28 ---- PC 5: Stalled ----- 93400 in-flight CPI 1.6222 -- Total Cycles 151544 ---- Thread 29 ---- PC 5: Stalled ----- 92850 in-flight CPI 1.6318 -- Total Cycles 151544 ---- Thread 30 ---- PC 5: Stalled ----- 91858 in-flight CPI 1.6495 -- Total Cycles 151544 ---- Thread 31 ---- PC 5: Stalled ----- 85914 in-flight CPI 1.7637 -- Total Cycles 151544 Total CPI 0.0495 , IPC 20.2044 -- Total Cycles 151544 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7977 (3.943349%) FPSUB: 0 (0.000000%) FPMUL: 32355 (15.994365%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 75129 (37.139256%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5654 (2.794997%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73263 (36.216817%) DIV: 7644 (3.778734%) FPUN: 0 (0.000000%) FPRSUB: 268 (0.132483%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3357256 total) ADD%: 7.467 (250681) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.526 (51235) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.559 (18774) FPSUB%: 0.000 (0) FPMUL%: 4.818 (161762) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (621) FPMAX%: 0.018 (621) LOAD%: 5.180 (173906) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (592) FPINV%: 0.000 (0) FPCONV%: 0.019 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.071 (35961) FPLE%: 0.455 (15267) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.816 (94547) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (25211) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.761 (529125) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (39588) ORI%: 1.572 (52785) XORI%: 0.000 (0) MULI%: 3.218 (108022) LW%: 1.136 (38150) LWI%: 13.555 (455081) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9698) SWI%: 4.088 (137261) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.407 (47227) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10469) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1961) bned%: 0.000 (0) bneid%: 13.860 (465315) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (24116) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4170) DIV%: 0.012 (414) FPUN%: 1.477 (49590) FPRSUB%: 3.691 (123915) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.962 (99430) FPGE%: 1.022 (34323) SYNC%: 0.000 (0) NOP%: 8.797 (295344) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 26 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 59 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 39851 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1557 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49607 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 11318 XORI 0 MULI 9713 LW 0 LWI 144119 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 87 DIV 26 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.2046 --Total thread-cycles: 4849408 --total thread-cycles issued: 3061912 (63.139913%) --iCache conflicts: 113342 (2.337234%) --thread*cycles of FU dependence: 256840 (5.296317%) --thread*cycles of data dependence: 202290 (4.171437%) --iCache cycles*banks: 4849408 (69.230883% used) Issue breakdown: --thread*cycles of issue worked: 3061912 (63.139913%) --thread*cycles of issue failed: 1492152 (30.769776%) --thread*cycles of issue NOP/other: 4608659284107231664 (95035502974945.234000%) Number of thread-cycles not ready: 202290 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3357256 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 7 4: 9 5: 8 6: 7 7: 8 8: 7 9: 8 10: 7 11: 9 12: 9 13: 8 14: 7 15: 6 16: 7 17: 6 18: 9 19: 8 20: 7 21: 7 22: 9 23: 7 24: 6 25: 7 26: 6 27: 7 28: 8 29: 8 30: 7 31: 5 <=== Core 11 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100534 in-flight CPI 1.4164 -- Total Cycles 142418 ---- Thread 01 ---- PC 5: Stalled ----- 98010 in-flight CPI 1.4529 -- Total Cycles 142418 ---- Thread 02 ---- PC 5: Stalled ----- 102088 in-flight CPI 1.3948 -- Total Cycles 142418 ---- Thread 03 ---- PC 5: Stalled ----- 100602 in-flight CPI 1.4154 -- Total Cycles 142418 ---- Thread 04 ---- PC 5: Stalled ----- 100268 in-flight CPI 1.4201 -- Total Cycles 142418 ---- Thread 05 ---- PC 5: Stalled ----- 98099 in-flight CPI 1.4515 -- Total Cycles 142418 ---- Thread 06 ---- PC 5: Stalled ----- 98450 in-flight CPI 1.4464 -- Total Cycles 142418 ---- Thread 07 ---- PC 5: Stalled ----- 103357 in-flight CPI 1.3776 -- Total Cycles 142418 ---- Thread 08 ---- PC 5: Stalled ----- 91024 in-flight CPI 1.5644 -- Total Cycles 142418 ---- Thread 09 ---- PC 5: Stalled ----- 90849 in-flight CPI 1.5674 -- Total Cycles 142418 ---- Thread 10 ---- PC 5: Stalled ----- 103286 in-flight CPI 1.3786 -- Total Cycles 142418 ---- Thread 11 ---- PC 5: Stalled ----- 96453 in-flight CPI 1.4763 -- Total Cycles 142418 ---- Thread 12 ---- PC 5: Stalled ----- 100210 in-flight CPI 1.4209 -- Total Cycles 142418 ---- Thread 13 ---- PC 5: Stalled ----- 96007 in-flight CPI 1.4832 -- Total Cycles 142418 ---- Thread 14 ---- PC 5: Stalled ----- 104171 in-flight CPI 1.3670 -- Total Cycles 142418 ---- Thread 15 ---- PC 5: Stalled ----- 94112 in-flight CPI 1.5130 -- Total Cycles 142418 ---- Thread 16 ---- PC 5: Stalled ----- 98460 in-flight CPI 1.4462 -- Total Cycles 142418 ---- Thread 17 ---- PC 5: Stalled ----- 95007 in-flight CPI 1.4987 -- Total Cycles 142418 ---- Thread 18 ---- PC 5: Stalled ----- 95804 in-flight CPI 1.4863 -- Total Cycles 142418 ---- Thread 19 ---- PC 5: Stalled ----- 91960 in-flight CPI 1.5483 -- Total Cycles 142418 ---- Thread 20 ---- PC 5: Stalled ----- 92074 in-flight CPI 1.5465 -- Total Cycles 142418 ---- Thread 21 ---- PC 5: Stalled ----- 91936 in-flight CPI 1.5489 -- Total Cycles 142418 ---- Thread 22 ---- PC 5: Stalled ----- 94826 in-flight CPI 1.5016 -- Total Cycles 142418 ---- Thread 23 ---- PC 5: Stalled ----- 88544 in-flight CPI 1.6081 -- Total Cycles 142418 ---- Thread 24 ---- PC 5: Stalled ----- 98095 in-flight CPI 1.4515 -- Total Cycles 142418 ---- Thread 25 ---- PC 5: Stalled ----- 90246 in-flight CPI 1.5778 -- Total Cycles 142418 ---- Thread 26 ---- PC 5: Stalled ----- 86031 in-flight CPI 1.6551 -- Total Cycles 142418 ---- Thread 27 ---- PC 5: Stalled ----- 89539 in-flight CPI 1.5902 -- Total Cycles 142418 ---- Thread 28 ---- PC 5: Stalled ----- 90746 in-flight CPI 1.5691 -- Total Cycles 142418 ---- Thread 29 ---- PC 5: Stalled ----- 89383 in-flight CPI 1.5930 -- Total Cycles 142418 ---- Thread 30 ---- PC 5: Stalled ----- 88870 in-flight CPI 1.6022 -- Total Cycles 142418 ---- Thread 31 ---- PC 5: Stalled ----- 86086 in-flight CPI 1.6540 -- Total Cycles 142418 Total CPI 0.0468 , IPC 21.3857 -- Total Cycles 142418 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8215 (4.207750%) FPSUB: 0 (0.000000%) FPMUL: 32720 (16.759290%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 64636 (33.106769%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 6055 (3.101391%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75475 (38.658540%) DIV: 7863 (4.027454%) FPUN: 0 (0.000000%) FPRSUB: 271 (0.138807%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3339838 total) ADD%: 7.471 (249506) SUB%: 0.000 (0) MUL%: 0.006 (213) BITOR%: 1.517 (50677) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.580 (19360) FPSUB%: 0.000 (0) FPMUL%: 4.868 (162593) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (639) FPMAX%: 0.019 (639) LOAD%: 5.193 (173446) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (245) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (620) FPINV%: 0.000 (0) FPCONV%: 0.020 (671) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.079 (36037) FPLE%: 0.453 (15116) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (639) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.804 (93658) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (25119) CMPU%: 0.000 (0) RSUB%: 0.006 (213) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.727 (525271) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39201) ORI%: 1.583 (52866) XORI%: 0.000 (0) MULI%: 3.208 (107148) LW%: 1.132 (37804) LWI%: 13.543 (452323) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9575) SWI%: 4.080 (136270) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.402 (46835) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10346) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.063 (2096) bned%: 0.000 (0) bneid%: 13.842 (462316) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.713 (23818) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.129 (4296) DIV%: 0.013 (426) FPUN%: 1.469 (49067) FPRSUB%: 3.704 (123694) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (83) FPGT%: 2.961 (98893) FPGE%: 1.017 (33951) SYNC%: 0.000 (0) NOP%: 8.805 (294072) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 26 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 56 FPCMPLT 0 FPMIN 0 FPMAX 414 LOAD 39921 INTCONV 0 ATOMIC_INC 26 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 9 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1790 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49244 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 11733 XORI 0 MULI 9891 LW 0 LWI 143191 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 39 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.3860 --Total thread-cycles: 4557376 --total thread-cycles issued: 3045766 (66.831572%) --iCache conflicts: 115238 (2.528604%) --thread*cycles of FU dependence: 256450 (5.627142%) --thread*cycles of data dependence: 195235 (4.283934%) --iCache cycles*banks: 4557376 (73.284934% used) Issue breakdown: --thread*cycles of issue worked: 3045766 (66.831572%) --thread*cycles of issue failed: 1217538 (26.715768%) --thread*cycles of issue NOP/other: 294072 (6.452660%) Number of thread-cycles not ready: 195235 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3339838 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 8 4: 8 5: 8 6: 7 7: 10 8: 6 9: 6 10: 8 11: 7 12: 9 13: 7 14: 6 15: 8 16: 8 17: 8 18: 7 19: 9 20: 7 21: 6 22: 8 23: 8 24: 9 25: 8 26: 8 27: 8 28: 8 29: 8 30: 7 31: 8 <=== Core 12 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97989 in-flight CPI 1.3254 -- Total Cycles 129896 ---- Thread 01 ---- PC 5: Stalled ----- 99302 in-flight CPI 1.3078 -- Total Cycles 129896 ---- Thread 02 ---- PC 5: Stalled ----- 100339 in-flight CPI 1.2943 -- Total Cycles 129896 ---- Thread 03 ---- PC 5: Stalled ----- 95328 in-flight CPI 1.3624 -- Total Cycles 129896 ---- Thread 04 ---- PC 5: Stalled ----- 98014 in-flight CPI 1.3251 -- Total Cycles 129896 ---- Thread 05 ---- PC 5: Stalled ----- 98824 in-flight CPI 1.3142 -- Total Cycles 129896 ---- Thread 06 ---- PC 5: Stalled ----- 94179 in-flight CPI 1.3790 -- Total Cycles 129896 ---- Thread 07 ---- PC 5: Stalled ----- 94875 in-flight CPI 1.3688 -- Total Cycles 129896 ---- Thread 08 ---- PC 5: Stalled ----- 96160 in-flight CPI 1.3505 -- Total Cycles 129896 ---- Thread 09 ---- PC 5: Stalled ----- 102704 in-flight CPI 1.2646 -- Total Cycles 129896 ---- Thread 10 ---- PC 5: Stalled ----- 96355 in-flight CPI 1.3478 -- Total Cycles 129896 ---- Thread 11 ---- PC 5: Stalled ----- 98989 in-flight CPI 1.3120 -- Total Cycles 129896 ---- Thread 12 ---- PC 5: Stalled ----- 95479 in-flight CPI 1.3602 -- Total Cycles 129896 ---- Thread 13 ---- PC 5: Stalled ----- 91884 in-flight CPI 1.4135 -- Total Cycles 129896 ---- Thread 14 ---- PC 5: Stalled ----- 98834 in-flight CPI 1.3140 -- Total Cycles 129896 ---- Thread 15 ---- PC 5: Stalled ----- 90713 in-flight CPI 1.4318 -- Total Cycles 129896 ---- Thread 16 ---- PC 5: Stalled ----- 96521 in-flight CPI 1.3455 -- Total Cycles 129896 ---- Thread 17 ---- PC 5: Stalled ----- 98781 in-flight CPI 1.3148 -- Total Cycles 129896 ---- Thread 18 ---- PC 5: Stalled ----- 95686 in-flight CPI 1.3573 -- Total Cycles 129896 ---- Thread 19 ---- PC 5: Stalled ----- 94909 in-flight CPI 1.3684 -- Total Cycles 129896 ---- Thread 20 ---- PC 5: Stalled ----- 97029 in-flight CPI 1.3385 -- Total Cycles 129896 ---- Thread 21 ---- PC 5: Stalled ----- 96792 in-flight CPI 1.3417 -- Total Cycles 129896 ---- Thread 22 ---- PC 5: Stalled ----- 89711 in-flight CPI 1.4476 -- Total Cycles 129896 ---- Thread 23 ---- PC 5: Stalled ----- 100812 in-flight CPI 1.2883 -- Total Cycles 129896 ---- Thread 24 ---- PC 5: Stalled ----- 93954 in-flight CPI 1.3823 -- Total Cycles 129896 ---- Thread 25 ---- PC 5: Stalled ----- 91244 in-flight CPI 1.4233 -- Total Cycles 129896 ---- Thread 26 ---- PC 5: Stalled ----- 89627 in-flight CPI 1.4490 -- Total Cycles 129896 ---- Thread 27 ---- PC 5: Stalled ----- 90719 in-flight CPI 1.4316 -- Total Cycles 129896 ---- Thread 28 ---- PC 5: Stalled ----- 91275 in-flight CPI 1.4229 -- Total Cycles 129896 ---- Thread 29 ---- PC 5: Stalled ----- 86076 in-flight CPI 1.5088 -- Total Cycles 129896 ---- Thread 30 ---- PC 5: Stalled ----- 90134 in-flight CPI 1.4409 -- Total Cycles 129896 ---- Thread 31 ---- PC 5: Stalled ----- 89920 in-flight CPI 1.4443 -- Total Cycles 129896 Total CPI 0.0427 , IPC 23.4320 -- Total Cycles 129896 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7446 (3.716255%) FPSUB: 0 (0.000000%) FPMUL: 31260 (15.601683%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 78928 (39.392503%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5703 (2.846334%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69082 (34.478422%) DIV: 7679 (3.832544%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.132260%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3337131 total) ADD%: 7.494 (250075) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.524 (50866) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.530 (17678) FPSUB%: 0.000 (0) FPMUL%: 4.731 (157894) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.148 (171807) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (596) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (35361) FPLE%: 0.455 (15190) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.839 (94738) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.742 (24771) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.779 (526570) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.184 (39523) ORI%: 1.543 (51484) XORI%: 0.000 (0) MULI%: 3.241 (108158) LW%: 1.146 (38228) LWI%: 13.620 (454528) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9692) SWI%: 4.111 (137180) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.419 (47350) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10420) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1807) bned%: 0.000 (0) bneid%: 13.877 (463110) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23972) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.118 (3922) DIV%: 0.012 (416) FPUN%: 1.479 (49363) FPRSUB%: 3.666 (122349) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.973 (99207) FPGE%: 1.024 (34173) SYNC%: 0.000 (0) NOP%: 8.790 (293349) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 29 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 55 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 39321 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1566 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49559 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 10578 XORI 0 MULI 9896 LW 0 LWI 143547 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 58 DIV 21 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4323 --Total thread-cycles: 4156672 --total thread-cycles issued: 3043782 (73.226418%) --iCache conflicts: 114962 (2.765722%) --thread*cycles of FU dependence: 255106 (6.137266%) --thread*cycles of data dependence: 200363 (4.820274%) --iCache cycles*banks: 4156672 (80.284492% used) Issue breakdown: --thread*cycles of issue worked: 3043782 (73.226418%) --thread*cycles of issue failed: 819541 (19.716278%) --thread*cycles of issue NOP/other: 293349 (7.057304%) Number of thread-cycles not ready: 200363 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3337131 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 7 4: 7 5: 8 6: 8 7: 8 8: 9 9: 7 10: 8 11: 8 12: 8 13: 6 14: 8 15: 5 16: 8 17: 7 18: 6 19: 7 20: 8 21: 8 22: 8 23: 8 24: 8 25: 8 26: 8 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 13 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98594 in-flight CPI 1.3053 -- Total Cycles 128721 ---- Thread 01 ---- PC 5: Stalled ----- 96338 in-flight CPI 1.3359 -- Total Cycles 128721 ---- Thread 02 ---- PC 5: Stalled ----- 92716 in-flight CPI 1.3881 -- Total Cycles 128721 ---- Thread 03 ---- PC 5: Stalled ----- 94390 in-flight CPI 1.3635 -- Total Cycles 128721 ---- Thread 04 ---- PC 5: Stalled ----- 93380 in-flight CPI 1.3782 -- Total Cycles 128721 ---- Thread 05 ---- PC 5: Stalled ----- 102126 in-flight CPI 1.2601 -- Total Cycles 128721 ---- Thread 06 ---- PC 5: Stalled ----- 98599 in-flight CPI 1.3053 -- Total Cycles 128721 ---- Thread 07 ---- PC 5: Stalled ----- 91597 in-flight CPI 1.4051 -- Total Cycles 128721 ---- Thread 08 ---- PC 5: Stalled ----- 92787 in-flight CPI 1.3870 -- Total Cycles 128721 ---- Thread 09 ---- PC 5: Stalled ----- 99032 in-flight CPI 1.2995 -- Total Cycles 128721 ---- Thread 10 ---- PC 5: Stalled ----- 97355 in-flight CPI 1.3219 -- Total Cycles 128721 ---- Thread 11 ---- PC 5: Stalled ----- 96733 in-flight CPI 1.3304 -- Total Cycles 128721 ---- Thread 12 ---- PC 5: Stalled ----- 93656 in-flight CPI 1.3742 -- Total Cycles 128721 ---- Thread 13 ---- PC 5: Stalled ----- 95815 in-flight CPI 1.3431 -- Total Cycles 128721 ---- Thread 14 ---- PC 5: Stalled ----- 94675 in-flight CPI 1.3594 -- Total Cycles 128721 ---- Thread 15 ---- PC 5: Stalled ----- 97253 in-flight CPI 1.3233 -- Total Cycles 128721 ---- Thread 16 ---- PC 5: Stalled ----- 95233 in-flight CPI 1.3514 -- Total Cycles 128721 ---- Thread 17 ---- PC 5: Stalled ----- 97577 in-flight CPI 1.3189 -- Total Cycles 128721 ---- Thread 18 ---- PC 5: Stalled ----- 96370 in-flight CPI 1.3355 -- Total Cycles 128721 ---- Thread 19 ---- PC 5: Stalled ----- 97542 in-flight CPI 1.3194 -- Total Cycles 128721 ---- Thread 20 ---- PC 5: Stalled ----- 94818 in-flight CPI 1.3573 -- Total Cycles 128721 ---- Thread 21 ---- PC 5: Stalled ----- 93111 in-flight CPI 1.3822 -- Total Cycles 128721 ---- Thread 22 ---- PC 5: Stalled ----- 90695 in-flight CPI 1.4191 -- Total Cycles 128721 ---- Thread 23 ---- PC 5: Stalled ----- 97556 in-flight CPI 1.3192 -- Total Cycles 128721 ---- Thread 24 ---- PC 5: Stalled ----- 91773 in-flight CPI 1.4024 -- Total Cycles 128721 ---- Thread 25 ---- PC 5: Stalled ----- 90217 in-flight CPI 1.4265 -- Total Cycles 128721 ---- Thread 26 ---- PC 5: Stalled ----- 92357 in-flight CPI 1.3935 -- Total Cycles 128721 ---- Thread 27 ---- PC 5: Stalled ----- 91809 in-flight CPI 1.4018 -- Total Cycles 128721 ---- Thread 28 ---- PC 5: Stalled ----- 91034 in-flight CPI 1.4137 -- Total Cycles 128721 ---- Thread 29 ---- PC 5: Stalled ----- 90010 in-flight CPI 1.4298 -- Total Cycles 128721 ---- Thread 30 ---- PC 5: Stalled ----- 84833 in-flight CPI 1.5170 -- Total Cycles 128721 ---- Thread 31 ---- PC 5: Stalled ----- 94975 in-flight CPI 1.3550 -- Total Cycles 128721 Total CPI 0.0425 , IPC 23.5046 -- Total Cycles 128721 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8167 (3.870781%) FPSUB: 0 (0.000000%) FPMUL: 32620 (15.460375%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 81939 (38.835306%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5893 (2.793010%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74356 (35.241314%) DIV: 7747 (3.671721%) FPUN: 0 (0.000000%) FPRSUB: 269 (0.127494%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3317779 total) ADD%: 7.454 (247317) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.522 (50485) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.578 (19188) FPSUB%: 0.000 (0) FPMUL%: 4.874 (161710) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.187 (172079) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (608) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.078 (35782) FPLE%: 0.453 (15031) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.804 (93045) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (24981) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.743 (522304) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (38959) ORI%: 1.583 (52505) XORI%: 0.000 (0) MULI%: 3.208 (106424) LW%: 1.132 (37554) LWI%: 13.527 (448780) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9521) SWI%: 4.079 (135341) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.402 (46514) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10311) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2060) bned%: 0.000 (0) bneid%: 13.850 (459527) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.712 (23633) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4231) DIV%: 0.013 (420) FPUN%: 1.473 (48870) FPRSUB%: 3.706 (122941) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (82) FPGT%: 2.961 (98234) FPGE%: 1.020 (33839) SYNC%: 0.000 (0) NOP%: 8.807 (292193) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 31 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 17 FPSUB 0 FPMUL 50 FPCMPLT 0 FPMIN 0 FPMAX 411 LOAD 40053 INTCONV 0 ATOMIC_INC 27 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 8 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1609 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48964 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11610 XORI 0 MULI 9565 LW 0 LWI 142159 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 31 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5048 --Total thread-cycles: 4119072 --total thread-cycles issued: 3025586 (73.453098%) --iCache conflicts: 113566 (2.757077%) --thread*cycles of FU dependence: 254624 (6.181587%) --thread*cycles of data dependence: 210991 (5.122295%) --iCache cycles*banks: 4119072 (80.547536% used) Issue breakdown: --thread*cycles of issue worked: 3025586 (73.453098%) --thread*cycles of issue failed: 801293 (19.453241%) --thread*cycles of issue NOP/other: 292193 (7.093661%) Number of thread-cycles not ready: 210991 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3317779 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 7 4: 8 5: 9 6: 7 7: 6 8: 7 9: 8 10: 8 11: 8 12: 6 13: 9 14: 6 15: 8 16: 8 17: 9 18: 7 19: 8 20: 8 21: 7 22: 6 23: 8 24: 7 25: 7 26: 7 27: 8 28: 8 29: 8 30: 8 31: 8 <=== Core 14 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98524 in-flight CPI 1.2777 -- Total Cycles 125907 ---- Thread 01 ---- PC 5: Stalled ----- 100698 in-flight CPI 1.2501 -- Total Cycles 125907 ---- Thread 02 ---- PC 5: Stalled ----- 95849 in-flight CPI 1.3133 -- Total Cycles 125907 ---- Thread 03 ---- PC 5: Stalled ----- 103345 in-flight CPI 1.2181 -- Total Cycles 125907 ---- Thread 04 ---- PC 5: Stalled ----- 97963 in-flight CPI 1.2850 -- Total Cycles 125907 ---- Thread 05 ---- PC 5: Stalled ----- 95660 in-flight CPI 1.3160 -- Total Cycles 125907 ---- Thread 06 ---- PC 5: Stalled ----- 94320 in-flight CPI 1.3346 -- Total Cycles 125907 ---- Thread 07 ---- PC 5: Stalled ----- 96214 in-flight CPI 1.3084 -- Total Cycles 125907 ---- Thread 08 ---- PC 5: Stalled ----- 96374 in-flight CPI 1.3062 -- Total Cycles 125907 ---- Thread 09 ---- PC 5: Stalled ----- 98649 in-flight CPI 1.2761 -- Total Cycles 125907 ---- Thread 10 ---- PC 5: Stalled ----- 98533 in-flight CPI 1.2776 -- Total Cycles 125907 ---- Thread 11 ---- PC 5: Stalled ----- 94408 in-flight CPI 1.3334 -- Total Cycles 125907 ---- Thread 12 ---- PC 5: Stalled ----- 100147 in-flight CPI 1.2570 -- Total Cycles 125907 ---- Thread 13 ---- PC 5: Stalled ----- 96275 in-flight CPI 1.3076 -- Total Cycles 125907 ---- Thread 14 ---- PC 5: Stalled ----- 99282 in-flight CPI 1.2680 -- Total Cycles 125907 ---- Thread 15 ---- PC 5: Stalled ----- 101281 in-flight CPI 1.2429 -- Total Cycles 125907 ---- Thread 16 ---- PC 5: Stalled ----- 98150 in-flight CPI 1.2826 -- Total Cycles 125907 ---- Thread 17 ---- PC 5: Stalled ----- 96826 in-flight CPI 1.3000 -- Total Cycles 125907 ---- Thread 18 ---- PC 5: Stalled ----- 90865 in-flight CPI 1.3854 -- Total Cycles 125907 ---- Thread 19 ---- PC 5: Stalled ----- 95694 in-flight CPI 1.3155 -- Total Cycles 125907 ---- Thread 20 ---- PC 5: Stalled ----- 93454 in-flight CPI 1.3471 -- Total Cycles 125907 ---- Thread 21 ---- PC 5: Stalled ----- 90420 in-flight CPI 1.3922 -- Total Cycles 125907 ---- Thread 22 ---- PC 5: Stalled ----- 93154 in-flight CPI 1.3514 -- Total Cycles 125907 ---- Thread 23 ---- PC 5: Stalled ----- 93259 in-flight CPI 1.3498 -- Total Cycles 125907 ---- Thread 24 ---- PC 5: Stalled ----- 93103 in-flight CPI 1.3520 -- Total Cycles 125907 ---- Thread 25 ---- PC 5: Stalled ----- 91145 in-flight CPI 1.3811 -- Total Cycles 125907 ---- Thread 26 ---- PC 5: Stalled ----- 93509 in-flight CPI 1.3462 -- Total Cycles 125907 ---- Thread 27 ---- PC 5: Stalled ----- 92629 in-flight CPI 1.3590 -- Total Cycles 125907 ---- Thread 28 ---- PC 5: Stalled ----- 92747 in-flight CPI 1.3572 -- Total Cycles 125907 ---- Thread 29 ---- PC 5: Stalled ----- 87896 in-flight CPI 1.4322 -- Total Cycles 125907 ---- Thread 30 ---- PC 5: Stalled ----- 82593 in-flight CPI 1.5242 -- Total Cycles 125907 ---- Thread 31 ---- PC 5: Stalled ----- 84363 in-flight CPI 1.4921 -- Total Cycles 125907 Total CPI 0.0414 , IPC 24.1281 -- Total Cycles 125907 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7788 (4.114105%) FPSUB: 0 (0.000000%) FPMUL: 31732 (16.762810%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 63768 (33.686212%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 6002 (3.170629%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71986 (38.027470%) DIV: 7757 (4.097728%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.141046%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3330380 total) ADD%: 7.457 (248337) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.536 (51146) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.553 (18429) FPSUB%: 0.000 (0) FPMUL%: 4.792 (159579) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.171 (172223) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (614) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35513) FPLE%: 0.458 (15251) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.826 (94100) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (25008) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.762 (524937) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.184 (39422) ORI%: 1.566 (52150) XORI%: 0.000 (0) MULI%: 3.222 (107298) LW%: 1.140 (37976) LWI%: 13.575 (452086) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9585) SWI%: 4.106 (136747) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.414 (47089) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10326) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1837) bned%: 0.000 (0) bneid%: 13.863 (461682) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.722 (24060) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4092) DIV%: 0.013 (420) FPUN%: 1.486 (49490) FPRSUB%: 3.680 (122544) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (77) FPGT%: 2.954 (98392) FPGE%: 1.028 (34239) SYNC%: 0.000 (0) NOP%: 8.780 (292421) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 24 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 43 FPCMPLT 0 FPMIN 0 FPMAX 410 LOAD 38863 INTCONV 0 ATOMIC_INC 27 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1425 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49320 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 11056 XORI 0 MULI 9578 LW 0 LWI 142818 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 98 DIV 25 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.1284 --Total thread-cycles: 4029024 --total thread-cycles issued: 3037959 (75.401859%) --iCache conflicts: 115286 (2.861388%) --thread*cycles of FU dependence: 253754 (6.298151%) --thread*cycles of data dependence: 189300 (4.698408%) --iCache cycles*banks: 4029024 (82.660515% used) Issue breakdown: --thread*cycles of issue worked: 3037959 (75.401859%) --thread*cycles of issue failed: 698644 (17.340279%) --thread*cycles of issue NOP/other: 292421 (7.257862%) Number of thread-cycles not ready: 189300 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3330380 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 8 3: 9 4: 8 5: 7 6: 8 7: 7 8: 8 9: 8 10: 8 11: 7 12: 8 13: 6 14: 7 15: 9 16: 8 17: 9 18: 8 19: 6 20: 6 21: 7 22: 6 23: 8 24: 9 25: 9 26: 7 27: 7 28: 8 29: 6 30: 6 31: 8 <=== Core 15 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97132 in-flight CPI 1.3256 -- Total Cycles 128783 ---- Thread 01 ---- PC 5: Stalled ----- 99420 in-flight CPI 1.2951 -- Total Cycles 128783 ---- Thread 02 ---- PC 5: Stalled ----- 95383 in-flight CPI 1.3499 -- Total Cycles 128783 ---- Thread 03 ---- PC 5: Stalled ----- 100039 in-flight CPI 1.2871 -- Total Cycles 128783 ---- Thread 04 ---- PC 5: Stalled ----- 97814 in-flight CPI 1.3164 -- Total Cycles 128783 ---- Thread 05 ---- PC 5: Stalled ----- 97595 in-flight CPI 1.3194 -- Total Cycles 128783 ---- Thread 06 ---- PC 5: Stalled ----- 101461 in-flight CPI 1.2690 -- Total Cycles 128783 ---- Thread 07 ---- PC 5: Stalled ----- 95790 in-flight CPI 1.3442 -- Total Cycles 128783 ---- Thread 08 ---- PC 5: Stalled ----- 98382 in-flight CPI 1.3087 -- Total Cycles 128783 ---- Thread 09 ---- PC 5: Stalled ----- 92908 in-flight CPI 1.3859 -- Total Cycles 128783 ---- Thread 10 ---- PC 5: Stalled ----- 92543 in-flight CPI 1.3913 -- Total Cycles 128783 ---- Thread 11 ---- PC 5: Stalled ----- 100055 in-flight CPI 1.2869 -- Total Cycles 128783 ---- Thread 12 ---- PC 5: Stalled ----- 103516 in-flight CPI 1.2438 -- Total Cycles 128783 ---- Thread 13 ---- PC 5: Stalled ----- 94207 in-flight CPI 1.3667 -- Total Cycles 128783 ---- Thread 14 ---- PC 5: Stalled ----- 98026 in-flight CPI 1.3135 -- Total Cycles 128783 ---- Thread 15 ---- PC 5: Stalled ----- 95633 in-flight CPI 1.3464 -- Total Cycles 128783 ---- Thread 16 ---- PC 5: Stalled ----- 93529 in-flight CPI 1.3766 -- Total Cycles 128783 ---- Thread 17 ---- PC 5: Stalled ----- 94898 in-flight CPI 1.3568 -- Total Cycles 128783 ---- Thread 18 ---- PC 5: Stalled ----- 96704 in-flight CPI 1.3315 -- Total Cycles 128783 ---- Thread 19 ---- PC 5: Stalled ----- 94056 in-flight CPI 1.3689 -- Total Cycles 128783 ---- Thread 20 ---- PC 5: Stalled ----- 95272 in-flight CPI 1.3515 -- Total Cycles 128783 ---- Thread 21 ---- PC 5: Stalled ----- 87939 in-flight CPI 1.4642 -- Total Cycles 128783 ---- Thread 22 ---- PC 5: Stalled ----- 90473 in-flight CPI 1.4232 -- Total Cycles 128783 ---- Thread 23 ---- PC 5: Stalled ----- 93460 in-flight CPI 1.3777 -- Total Cycles 128783 ---- Thread 24 ---- PC 5: Stalled ----- 92433 in-flight CPI 1.3930 -- Total Cycles 128783 ---- Thread 25 ---- PC 5: Stalled ----- 89034 in-flight CPI 1.4462 -- Total Cycles 128783 ---- Thread 26 ---- PC 5: Stalled ----- 92209 in-flight CPI 1.3964 -- Total Cycles 128783 ---- Thread 27 ---- PC 5: Stalled ----- 94153 in-flight CPI 1.3675 -- Total Cycles 128783 ---- Thread 28 ---- PC 5: Stalled ----- 92689 in-flight CPI 1.3892 -- Total Cycles 128783 ---- Thread 29 ---- PC 5: Stalled ----- 85819 in-flight CPI 1.5003 -- Total Cycles 128783 ---- Thread 30 ---- PC 5: Stalled ----- 89950 in-flight CPI 1.4314 -- Total Cycles 128783 ---- Thread 31 ---- PC 5: Stalled ----- 89907 in-flight CPI 1.4321 -- Total Cycles 128783 Total CPI 0.0425 , IPC 23.5513 -- Total Cycles 128783 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7819 (4.099534%) FPSUB: 0 (0.000000%) FPMUL: 32037 (16.797131%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 64739 (33.942924%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5882 (3.083957%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72115 (37.810191%) DIV: 7866 (4.124176%) FPUN: 0 (0.000000%) FPRSUB: 271 (0.142086%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3325453 total) ADD%: 7.546 (250933) SUB%: 0.000 (0) MUL%: 0.006 (213) BITOR%: 1.531 (50899) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.558 (18548) FPSUB%: 0.000 (0) FPMUL%: 4.807 (159867) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (639) FPMAX%: 0.019 (639) LOAD%: 5.155 (171417) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (245) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (611) FPINV%: 0.000 (0) FPCONV%: 0.020 (671) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.072 (35633) FPLE%: 0.456 (15164) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (639) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.812 (93513) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24818) CMPU%: 0.000 (0) RSUB%: 0.006 (213) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.739 (523380) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39150) ORI%: 1.568 (52141) XORI%: 0.000 (0) MULI%: 3.216 (106956) LW%: 1.135 (37746) LWI%: 13.549 (450564) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9591) SWI%: 4.093 (136124) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.405 (46723) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10330) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1851) bned%: 0.000 (0) bneid%: 13.862 (460975) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23842) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4097) DIV%: 0.013 (426) FPUN%: 1.482 (49282) FPRSUB%: 3.683 (122468) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.961 (98477) FPGE%: 1.026 (34118) SYNC%: 0.000 (0) NOP%: 8.792 (292385) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 52 FPCMPLT 0 FPMIN 0 FPMAX 416 LOAD 39033 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1901 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49130 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 11094 XORI 0 MULI 9725 LW 0 LWI 142645 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 82 DIV 37 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5516 --Total thread-cycles: 4121056 --total thread-cycles issued: 3033068 (73.599291%) --iCache conflicts: 112334 (2.725855%) --thread*cycles of FU dependence: 254213 (6.168637%) --thread*cycles of data dependence: 190729 (4.628158%) --iCache cycles*banks: 4121056 (80.694972% used) Issue breakdown: --thread*cycles of issue worked: 3033068 (73.599291%) --thread*cycles of issue failed: 795603 (19.305804%) --thread*cycles of issue NOP/other: 292385 (7.094905%) Number of thread-cycles not ready: 190729 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3325453 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 8 4: 7 5: 6 6: 9 7: 8 8: 9 9: 7 10: 9 11: 8 12: 9 13: 8 14: 9 15: 7 16: 8 17: 8 18: 8 19: 8 20: 8 21: 6 22: 6 23: 7 24: 8 25: 6 26: 7 27: 8 28: 7 29: 7 30: 8 31: 8 <=== Core 16 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97482 in-flight CPI 1.3212 -- Total Cycles 128821 ---- Thread 01 ---- PC 5: Stalled ----- 98573 in-flight CPI 1.3066 -- Total Cycles 128821 ---- Thread 02 ---- PC 5: Stalled ----- 97051 in-flight CPI 1.3271 -- Total Cycles 128821 ---- Thread 03 ---- PC 5: Stalled ----- 100619 in-flight CPI 1.2800 -- Total Cycles 128821 ---- Thread 04 ---- PC 5: Stalled ----- 97138 in-flight CPI 1.3259 -- Total Cycles 128821 ---- Thread 05 ---- PC 5: Stalled ----- 100596 in-flight CPI 1.2803 -- Total Cycles 128821 ---- Thread 06 ---- PC 5: Stalled ----- 95166 in-flight CPI 1.3534 -- Total Cycles 128821 ---- Thread 07 ---- PC 5: Stalled ----- 102549 in-flight CPI 1.2559 -- Total Cycles 128821 ---- Thread 08 ---- PC 5: Stalled ----- 102562 in-flight CPI 1.2558 -- Total Cycles 128821 ---- Thread 09 ---- PC 5: Stalled ----- 105208 in-flight CPI 1.2242 -- Total Cycles 128821 ---- Thread 10 ---- PC 5: Stalled ----- 98403 in-flight CPI 1.3089 -- Total Cycles 128821 ---- Thread 11 ---- PC 5: Stalled ----- 100546 in-flight CPI 1.2809 -- Total Cycles 128821 ---- Thread 12 ---- PC 5: Stalled ----- 100189 in-flight CPI 1.2855 -- Total Cycles 128821 ---- Thread 13 ---- PC 5: Stalled ----- 98955 in-flight CPI 1.3016 -- Total Cycles 128821 ---- Thread 14 ---- PC 5: Stalled ----- 94378 in-flight CPI 1.3647 -- Total Cycles 128821 ---- Thread 15 ---- PC 5: Stalled ----- 90963 in-flight CPI 1.4160 -- Total Cycles 128821 ---- Thread 16 ---- PC 5: Stalled ----- 95876 in-flight CPI 1.3434 -- Total Cycles 128821 ---- Thread 17 ---- PC 5: Stalled ----- 91853 in-flight CPI 1.4022 -- Total Cycles 128821 ---- Thread 18 ---- PC 5: Stalled ----- 92056 in-flight CPI 1.3991 -- Total Cycles 128821 ---- Thread 19 ---- PC 5: Stalled ----- 95206 in-flight CPI 1.3528 -- Total Cycles 128821 ---- Thread 20 ---- PC 5: Stalled ----- 98936 in-flight CPI 1.3018 -- Total Cycles 128821 ---- Thread 21 ---- PC 5: Stalled ----- 92479 in-flight CPI 1.3927 -- Total Cycles 128821 ---- Thread 22 ---- PC 5: Stalled ----- 97301 in-flight CPI 1.3237 -- Total Cycles 128821 ---- Thread 23 ---- PC 5: Stalled ----- 92367 in-flight CPI 1.3944 -- Total Cycles 128821 ---- Thread 24 ---- PC 5: Stalled ----- 96600 in-flight CPI 1.3333 -- Total Cycles 128821 ---- Thread 25 ---- PC 5: Stalled ----- 87841 in-flight CPI 1.4663 -- Total Cycles 128821 ---- Thread 26 ---- PC 5: Stalled ----- 85463 in-flight CPI 1.5071 -- Total Cycles 128821 ---- Thread 27 ---- PC 5: Stalled ----- 92134 in-flight CPI 1.3979 -- Total Cycles 128821 ---- Thread 28 ---- PC 5: Stalled ----- 92580 in-flight CPI 1.3912 -- Total Cycles 128821 ---- Thread 29 ---- PC 5: Stalled ----- 94204 in-flight CPI 1.3672 -- Total Cycles 128821 ---- Thread 30 ---- PC 5: Stalled ----- 86623 in-flight CPI 1.4869 -- Total Cycles 128821 ---- Thread 31 ---- PC 5: Stalled ----- 86341 in-flight CPI 1.4917 -- Total Cycles 128821 Total CPI 0.0421 , IPC 23.7448 -- Total Cycles 128821 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7903 (4.157308%) FPSUB: 0 (0.000000%) FPMUL: 32170 (16.922761%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 63108 (33.197439%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5899 (3.103120%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72812 (38.302148%) DIV: 7936 (4.174667%) FPUN: 0 (0.000000%) FPRSUB: 271 (0.142557%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3353786 total) ADD%: 7.471 (250560) SUB%: 0.000 (0) MUL%: 0.006 (215) BITOR%: 1.551 (52013) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.559 (18741) FPSUB%: 0.000 (0) FPMUL%: 4.805 (161155) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (645) FPMAX%: 0.019 (645) LOAD%: 5.145 (172563) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (247) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (614) FPINV%: 0.000 (0) FPCONV%: 0.020 (677) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.072 (35947) FPLE%: 0.455 (15248) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (645) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.813 (94340) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (25098) CMPU%: 0.000 (0) RSUB%: 0.006 (215) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.737 (527787) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (39546) ORI%: 1.589 (53303) XORI%: 0.000 (0) MULI%: 3.215 (107822) LW%: 1.135 (38080) LWI%: 13.537 (454018) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9660) SWI%: 4.089 (137125) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.406 (47150) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10412) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1824) bned%: 0.000 (0) bneid%: 13.887 (465734) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.725 (24306) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4137) DIV%: 0.013 (430) FPUN%: 1.501 (50332) FPRSUB%: 3.678 (123369) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.953 (99027) FPGE%: 1.046 (35084) SYNC%: 0.000 (0) NOP%: 8.793 (294903) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 48 FPCMPLT 0 FPMIN 0 FPMAX 422 LOAD 39290 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 9 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1351 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49474 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11311 XORI 0 MULI 9951 LW 0 LWI 143589 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 89 DIV 27 FPUN 0 FPRSUB 1 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7450 --Total thread-cycles: 4122272 --total thread-cycles issued: 3058883 (74.203813%) --iCache conflicts: 115848 (2.810295%) --thread*cycles of FU dependence: 255656 (6.201823%) --thread*cycles of data dependence: 190099 (4.611510%) --iCache cycles*banks: 4122272 (81.358484% used) Issue breakdown: --thread*cycles of issue worked: 3058883 (74.203813%) --thread*cycles of issue failed: 768486 (18.642292%) --thread*cycles of issue NOP/other: 294903 (7.153895%) Number of thread-cycles not ready: 190099 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3353786 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 7 3: 8 4: 8 5: 9 6: 7 7: 10 8: 9 9: 9 10: 7 11: 9 12: 8 13: 7 14: 7 15: 6 16: 7 17: 8 18: 7 19: 8 20: 8 21: 7 22: 8 23: 8 24: 7 25: 7 26: 6 27: 8 28: 8 29: 8 30: 6 31: 8 <=== Core 17 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94902 in-flight CPI 1.7120 -- Total Cycles 162504 ---- Thread 01 ---- PC 5: Stalled ----- 98440 in-flight CPI 1.6505 -- Total Cycles 162504 ---- Thread 02 ---- PC 5: Stalled ----- 94957 in-flight CPI 1.7110 -- Total Cycles 162504 ---- Thread 03 ---- PC 5: Stalled ----- 94518 in-flight CPI 1.7189 -- Total Cycles 162504 ---- Thread 04 ---- PC 5: Stalled ----- 99699 in-flight CPI 1.6296 -- Total Cycles 162504 ---- Thread 05 ---- PC 5: Stalled ----- 92181 in-flight CPI 1.7626 -- Total Cycles 162504 ---- Thread 06 ---- PC 5: Stalled ----- 95566 in-flight CPI 1.7001 -- Total Cycles 162504 ---- Thread 07 ---- PC 5: Stalled ----- 94347 in-flight CPI 1.7221 -- Total Cycles 162504 ---- Thread 08 ---- PC 5: Stalled ----- 97170 in-flight CPI 1.6720 -- Total Cycles 162504 ---- Thread 09 ---- PC 5: Stalled ----- 93776 in-flight CPI 1.7326 -- Total Cycles 162504 ---- Thread 10 ---- PC 5: Stalled ----- 97046 in-flight CPI 1.6742 -- Total Cycles 162504 ---- Thread 11 ---- PC 5: Stalled ----- 93332 in-flight CPI 1.7408 -- Total Cycles 162504 ---- Thread 12 ---- PC 5: Stalled ----- 86391 in-flight CPI 1.8807 -- Total Cycles 162504 ---- Thread 13 ---- PC 5: Stalled ----- 98595 in-flight CPI 1.6478 -- Total Cycles 162504 ---- Thread 14 ---- PC 5: Stalled ----- 101944 in-flight CPI 1.5938 -- Total Cycles 162504 ---- Thread 15 ---- PC 5: Stalled ----- 95660 in-flight CPI 1.6984 -- Total Cycles 162504 ---- Thread 16 ---- PC 5: Stalled ----- 115731 in-flight CPI 1.4040 -- Total Cycles 162504 ---- Thread 17 ---- PC 5: Stalled ----- 96074 in-flight CPI 1.6912 -- Total Cycles 162504 ---- Thread 18 ---- PC 5: Stalled ----- 91081 in-flight CPI 1.7839 -- Total Cycles 162504 ---- Thread 19 ---- PC 5: Stalled ----- 95767 in-flight CPI 1.6965 -- Total Cycles 162504 ---- Thread 20 ---- PC 5: Stalled ----- 95294 in-flight CPI 1.7050 -- Total Cycles 162504 ---- Thread 21 ---- PC 5: Stalled ----- 97797 in-flight CPI 1.6614 -- Total Cycles 162504 ---- Thread 22 ---- PC 5: Stalled ----- 90014 in-flight CPI 1.8050 -- Total Cycles 162504 ---- Thread 23 ---- PC 5: Stalled ----- 94474 in-flight CPI 1.7198 -- Total Cycles 162504 ---- Thread 24 ---- PC 5: Stalled ----- 87538 in-flight CPI 1.8561 -- Total Cycles 162504 ---- Thread 25 ---- PC 5: Stalled ----- 87746 in-flight CPI 1.8517 -- Total Cycles 162504 ---- Thread 26 ---- PC 5: Stalled ----- 88286 in-flight CPI 1.8402 -- Total Cycles 162504 ---- Thread 27 ---- PC 5: Stalled ----- 86838 in-flight CPI 1.8711 -- Total Cycles 162504 ---- Thread 28 ---- PC 5: Stalled ----- 90811 in-flight CPI 1.7892 -- Total Cycles 162504 ---- Thread 29 ---- PC 5: Stalled ----- 86160 in-flight CPI 1.8857 -- Total Cycles 162504 ---- Thread 30 ---- PC 5: Stalled ----- 84640 in-flight CPI 1.9196 -- Total Cycles 162504 ---- Thread 31 ---- PC 5: Stalled ----- 95106 in-flight CPI 1.7084 -- Total Cycles 162504 Total CPI 0.0539 , IPC 18.5375 -- Total Cycles 162504 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 9068 (3.840745%) FPSUB: 0 (0.000000%) FPMUL: 34265 (14.512918%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 98221 (41.601440%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5606 (2.374418%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 81484 (34.512495%) DIV: 7201 (3.049979%) FPUN: 0 (0.000000%) FPRSUB: 255 (0.108005%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3303453 total) ADD%: 7.392 (244207) SUB%: 0.000 (0) MUL%: 0.006 (195) BITOR%: 1.519 (50177) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.637 (21034) FPSUB%: 0.000 (0) FPMUL%: 5.046 (166676) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (585) FPMAX%: 0.018 (585) LOAD%: 5.270 (174092) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (227) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (576) FPINV%: 0.000 (0) FPCONV%: 0.019 (617) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.101 (36376) FPLE%: 0.452 (14916) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (585) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.769 (91465) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.774 (25572) CMPU%: 0.000 (0) RSUB%: 0.006 (195) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.702 (518711) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.168 (38590) ORI%: 1.626 (53698) XORI%: 0.000 (0) MULI%: 3.169 (104688) LW%: 1.117 (36898) LWI%: 13.432 (443714) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.282 (9305) SWI%: 4.023 (132891) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.386 (45774) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.307 (10152) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.072 (2380) bned%: 0.000 (0) bneid%: 13.804 (456003) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.712 (23529) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.141 (4663) DIV%: 0.012 (390) FPUN%: 1.462 (48290) FPRSUB%: 3.762 (124284) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.933 (96886) FPGE%: 1.010 (33374) SYNC%: 0.000 (0) NOP%: 8.809 (290987) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 5 FPSUB 0 FPMUL 54 FPCMPLT 0 FPMIN 0 FPMAX 379 LOAD 40522 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1789 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48172 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 13077 XORI 0 MULI 8628 LW 0 LWI 140957 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 84 DIV 17 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 18.5377 --Total thread-cycles: 5200128 --total thread-cycles issued: 3012466 (57.930612%) --iCache conflicts: 110222 (2.119602%) --thread*cycles of FU dependence: 253789 (4.880438%) --thread*cycles of data dependence: 236100 (4.540273%) --iCache cycles*banks: 5200128 (63.526994% used) Issue breakdown: --thread*cycles of issue worked: 3012466 (57.930612%) --thread*cycles of issue failed: 1896675 (36.473621%) --thread*cycles of issue NOP/other: 290987 (5.595766%) Number of thread-cycles not ready: 236100 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3303453 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 8 5: 7 6: 7 7: 7 8: 8 9: 7 10: 7 11: 8 12: 6 13: 9 14: 7 15: 8 16: 5 17: 7 18: 6 19: 8 20: 7 21: 7 22: 6 23: 6 24: 6 25: 6 26: 8 27: 6 28: 7 29: 7 30: 7 31: 7 <=== Core 18 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97005 in-flight CPI 1.4355 -- Total Cycles 139287 ---- Thread 01 ---- PC 5: Stalled ----- 95589 in-flight CPI 1.4569 -- Total Cycles 139287 ---- Thread 02 ---- PC 5: Stalled ----- 102841 in-flight CPI 1.3541 -- Total Cycles 139287 ---- Thread 03 ---- PC 5: Stalled ----- 100305 in-flight CPI 1.3884 -- Total Cycles 139287 ---- Thread 04 ---- PC 5: Stalled ----- 95575 in-flight CPI 1.4571 -- Total Cycles 139287 ---- Thread 05 ---- PC 5: Stalled ----- 96802 in-flight CPI 1.4386 -- Total Cycles 139287 ---- Thread 06 ---- PC 5: Stalled ----- 92245 in-flight CPI 1.5097 -- Total Cycles 139287 ---- Thread 07 ---- PC 5: Stalled ----- 94680 in-flight CPI 1.4709 -- Total Cycles 139287 ---- Thread 08 ---- PC 5: Stalled ----- 96716 in-flight CPI 1.4400 -- Total Cycles 139287 ---- Thread 09 ---- PC 5: Stalled ----- 95173 in-flight CPI 1.4633 -- Total Cycles 139287 ---- Thread 10 ---- PC 5: Stalled ----- 96367 in-flight CPI 1.4451 -- Total Cycles 139287 ---- Thread 11 ---- PC 5: Stalled ----- 96737 in-flight CPI 1.4396 -- Total Cycles 139287 ---- Thread 12 ---- PC 5: Stalled ----- 90503 in-flight CPI 1.5387 -- Total Cycles 139287 ---- Thread 13 ---- PC 5: Stalled ----- 101809 in-flight CPI 1.3679 -- Total Cycles 139287 ---- Thread 14 ---- PC 5: Stalled ----- 90840 in-flight CPI 1.5330 -- Total Cycles 139287 ---- Thread 15 ---- PC 5: Stalled ----- 95446 in-flight CPI 1.4591 -- Total Cycles 139287 ---- Thread 16 ---- PC 5: Stalled ----- 87049 in-flight CPI 1.5999 -- Total Cycles 139287 ---- Thread 17 ---- PC 5: Stalled ----- 93390 in-flight CPI 1.4912 -- Total Cycles 139287 ---- Thread 18 ---- PC 5: Stalled ----- 92026 in-flight CPI 1.5133 -- Total Cycles 139287 ---- Thread 19 ---- PC 5: Stalled ----- 90038 in-flight CPI 1.5468 -- Total Cycles 139287 ---- Thread 20 ---- PC 5: Stalled ----- 98916 in-flight CPI 1.4080 -- Total Cycles 139287 ---- Thread 21 ---- PC 5: Stalled ----- 91068 in-flight CPI 1.5292 -- Total Cycles 139287 ---- Thread 22 ---- PC 5: Stalled ----- 93784 in-flight CPI 1.4849 -- Total Cycles 139287 ---- Thread 23 ---- PC 5: Stalled ----- 92967 in-flight CPI 1.4979 -- Total Cycles 139287 ---- Thread 24 ---- PC 5: Stalled ----- 95610 in-flight CPI 1.4566 -- Total Cycles 139287 ---- Thread 25 ---- PC 5: Stalled ----- 87624 in-flight CPI 1.5893 -- Total Cycles 139287 ---- Thread 26 ---- PC 5: Stalled ----- 88656 in-flight CPI 1.5708 -- Total Cycles 139287 ---- Thread 27 ---- PC 5: Stalled ----- 86976 in-flight CPI 1.6013 -- Total Cycles 139287 ---- Thread 28 ---- PC 5: Stalled ----- 83452 in-flight CPI 1.6688 -- Total Cycles 139287 ---- Thread 29 ---- PC 5: Stalled ----- 90648 in-flight CPI 1.5363 -- Total Cycles 139287 ---- Thread 30 ---- PC 5: Stalled ----- 94910 in-flight CPI 1.4673 -- Total Cycles 139287 ---- Thread 31 ---- PC 5: Stalled ----- 91135 in-flight CPI 1.5281 -- Total Cycles 139287 Total CPI 0.0465 , IPC 21.5196 -- Total Cycles 139287 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8460 (3.724641%) FPSUB: 0 (0.000000%) FPMUL: 32876 (14.474148%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 97090 (42.745316%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5260 (2.315793%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 76217 (33.555667%) DIV: 6985 (3.075250%) FPUN: 0 (0.000000%) FPRSUB: 248 (0.109186%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3285784 total) ADD%: 7.418 (243751) SUB%: 0.000 (0) MUL%: 0.006 (189) BITOR%: 1.545 (50763) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.602 (19782) FPSUB%: 0.000 (0) FPMUL%: 4.942 (162373) FPCMPLT%: 0.000 (0) FPMIN%: 0.017 (567) FPMAX%: 0.017 (567) LOAD%: 5.238 (172119) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (221) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (551) FPINV%: 0.000 (0) FPCONV%: 0.018 (599) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.082 (35542) FPLE%: 0.458 (15038) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.017 (567) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.795 (91839) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.772 (25367) CMPU%: 0.000 (0) RSUB%: 0.006 (189) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.757 (517734) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.181 (38817) ORI%: 1.611 (52929) XORI%: 0.000 (0) MULI%: 3.184 (104614) LW%: 1.127 (37038) LWI%: 13.435 (441439) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9385) SWI%: 4.055 (133234) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.397 (45903) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10199) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (1989) bned%: 0.000 (0) bneid%: 13.830 (454435) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (23752) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.133 (4357) DIV%: 0.012 (378) FPUN%: 1.485 (48795) FPRSUB%: 3.724 (122371) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (76) FPGT%: 2.926 (96127) FPGE%: 1.027 (33757) SYNC%: 0.000 (0) NOP%: 8.775 (288335) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 37 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 68 FPCMPLT 0 FPMIN 0 FPMAX 369 LOAD 41196 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 1 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2549 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48083 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 12170 XORI 0 MULI 8747 LW 0 LWI 140006 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 84 DIV 30 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5198 --Total thread-cycles: 4457184 --total thread-cycles issued: 2997449 (67.249838%) --iCache conflicts: 110624 (2.481926%) --thread*cycles of FU dependence: 253411 (5.685451%) --thread*cycles of data dependence: 227136 (5.095953%) --iCache cycles*banks: 4457184 (73.719550% used) Issue breakdown: --thread*cycles of issue worked: 2997449 (67.249838%) --thread*cycles of issue failed: 1171400 (26.281168%) --thread*cycles of issue NOP/other: 288335 (6.468995%) Number of thread-cycles not ready: 227136 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3285784 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 8 3: 8 4: 7 5: 8 6: 6 7: 6 8: 6 9: 7 10: 8 11: 8 12: 8 13: 8 14: 7 15: 7 16: 5 17: 7 18: 7 19: 5 20: 4 21: 7 22: 8 23: 8 24: 7 25: 7 26: 7 27: 5 28: 6 29: 6 30: 7 31: 7 <=== Core 19 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94375 in-flight CPI 1.3771 -- Total Cycles 129991 ---- Thread 01 ---- PC 5: Stalled ----- 97946 in-flight CPI 1.3269 -- Total Cycles 129991 ---- Thread 02 ---- PC 5: Stalled ----- 103316 in-flight CPI 1.2580 -- Total Cycles 129991 ---- Thread 03 ---- PC 5: Stalled ----- 97827 in-flight CPI 1.3286 -- Total Cycles 129991 ---- Thread 04 ---- PC 5: Stalled ----- 95061 in-flight CPI 1.3672 -- Total Cycles 129991 ---- Thread 05 ---- PC 5: Stalled ----- 100605 in-flight CPI 1.2918 -- Total Cycles 129991 ---- Thread 06 ---- PC 5: Stalled ----- 101879 in-flight CPI 1.2757 -- Total Cycles 129991 ---- Thread 07 ---- PC 5: Stalled ----- 93735 in-flight CPI 1.3866 -- Total Cycles 129991 ---- Thread 08 ---- PC 5: Stalled ----- 98418 in-flight CPI 1.3206 -- Total Cycles 129991 ---- Thread 09 ---- PC 5: Stalled ----- 93726 in-flight CPI 1.3867 -- Total Cycles 129991 ---- Thread 10 ---- PC 5: Stalled ----- 93463 in-flight CPI 1.3905 -- Total Cycles 129991 ---- Thread 11 ---- PC 5: Stalled ----- 98039 in-flight CPI 1.3257 -- Total Cycles 129991 ---- Thread 12 ---- PC 5: Stalled ----- 95423 in-flight CPI 1.3620 -- Total Cycles 129991 ---- Thread 13 ---- PC 5: Stalled ----- 96679 in-flight CPI 1.3443 -- Total Cycles 129991 ---- Thread 14 ---- PC 5: Stalled ----- 91019 in-flight CPI 1.4280 -- Total Cycles 129991 ---- Thread 15 ---- PC 5: Stalled ----- 93007 in-flight CPI 1.3974 -- Total Cycles 129991 ---- Thread 16 ---- PC 5: Stalled ----- 91387 in-flight CPI 1.4221 -- Total Cycles 129991 ---- Thread 17 ---- PC 5: Stalled ----- 92878 in-flight CPI 1.3993 -- Total Cycles 129991 ---- Thread 18 ---- PC 5: Stalled ----- 96972 in-flight CPI 1.3403 -- Total Cycles 129991 ---- Thread 19 ---- PC 5: Stalled ----- 92752 in-flight CPI 1.4013 -- Total Cycles 129991 ---- Thread 20 ---- PC 5: Stalled ----- 98557 in-flight CPI 1.3187 -- Total Cycles 129991 ---- Thread 21 ---- PC 5: Stalled ----- 93039 in-flight CPI 1.3969 -- Total Cycles 129991 ---- Thread 22 ---- PC 5: Stalled ----- 93106 in-flight CPI 1.3958 -- Total Cycles 129991 ---- Thread 23 ---- PC 5: Stalled ----- 97453 in-flight CPI 1.3336 -- Total Cycles 129991 ---- Thread 24 ---- PC 5: Stalled ----- 87993 in-flight CPI 1.4770 -- Total Cycles 129991 ---- Thread 25 ---- PC 5: Stalled ----- 93962 in-flight CPI 1.3832 -- Total Cycles 129991 ---- Thread 26 ---- PC 5: Stalled ----- 92224 in-flight CPI 1.4093 -- Total Cycles 129991 ---- Thread 27 ---- PC 5: Stalled ----- 82781 in-flight CPI 1.5701 -- Total Cycles 129991 ---- Thread 28 ---- PC 5: Stalled ----- 85942 in-flight CPI 1.5122 -- Total Cycles 129991 ---- Thread 29 ---- PC 5: Stalled ----- 88105 in-flight CPI 1.4751 -- Total Cycles 129991 ---- Thread 30 ---- PC 5: Stalled ----- 84629 in-flight CPI 1.5358 -- Total Cycles 129991 ---- Thread 31 ---- PC 5: Stalled ----- 90038 in-flight CPI 1.4435 -- Total Cycles 129991 Total CPI 0.0432 , IPC 23.1315 -- Total Cycles 129991 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7997 (3.660239%) FPSUB: 0 (0.000000%) FPMUL: 32070 (14.678488%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 91594 (41.922713%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5476 (2.506373%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73628 (33.699647%) DIV: 7458 (3.413538%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.119002%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3296556 total) ADD%: 7.510 (247568) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.525 (50277) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.571 (18808) FPSUB%: 0.000 (0) FPMUL%: 4.850 (159885) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.204 (171552) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (577) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.074 (35413) FPLE%: 0.456 (15035) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.811 (92667) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.755 (24897) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.744 (519002) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (38859) ORI%: 1.575 (51926) XORI%: 0.000 (0) MULI%: 3.208 (105760) LW%: 1.134 (37390) LWI%: 13.524 (445829) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9499) SWI%: 4.077 (134410) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.404 (46291) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10265) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (1985) bned%: 0.000 (0) bneid%: 13.837 (456131) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23660) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4195) DIV%: 0.012 (404) FPUN%: 1.474 (48606) FPRSUB%: 3.700 (121963) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.950 (97254) FPGE%: 1.018 (33571) SYNC%: 0.000 (0) NOP%: 8.785 (289614) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 16 FPSUB 0 FPMUL 58 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 41207 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1563 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48613 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 11389 XORI 0 MULI 9458 LW 0 LWI 141201 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 29 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.1317 --Total thread-cycles: 4159712 --total thread-cycles issued: 3006942 (72.287264%) --iCache conflicts: 111483 (2.680065%) --thread*cycles of FU dependence: 254076 (6.108019%) --thread*cycles of data dependence: 218483 (5.252359%) --iCache cycles*banks: 4159712 (79.250390% used) Issue breakdown: --thread*cycles of issue worked: 3006942 (72.287264%) --thread*cycles of issue failed: 863156 (20.750379%) --thread*cycles of issue NOP/other: 289614 (6.962357%) Number of thread-cycles not ready: 218483 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3296556 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 7 4: 7 5: 8 6: 9 7: 6 8: 7 9: 7 10: 8 11: 8 12: 7 13: 8 14: 6 15: 6 16: 8 17: 8 18: 7 19: 6 20: 7 21: 8 22: 9 23: 9 24: 7 25: 7 26: 7 27: 5 28: 8 29: 7 30: 6 31: 7 <=== Core 20 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95413 in-flight CPI 1.7139 -- Total Cycles 163555 ---- Thread 01 ---- PC 5: Stalled ----- 100138 in-flight CPI 1.6330 -- Total Cycles 163555 ---- Thread 02 ---- PC 5: Stalled ----- 101536 in-flight CPI 1.6105 -- Total Cycles 163555 ---- Thread 03 ---- PC 5: Stalled ----- 103872 in-flight CPI 1.5742 -- Total Cycles 163555 ---- Thread 04 ---- PC 5: Stalled ----- 99886 in-flight CPI 1.6371 -- Total Cycles 163555 ---- Thread 05 ---- PC 5: Stalled ----- 97746 in-flight CPI 1.6729 -- Total Cycles 163555 ---- Thread 06 ---- PC 5: Stalled ----- 94051 in-flight CPI 1.7387 -- Total Cycles 163555 ---- Thread 07 ---- PC 5: Stalled ----- 98018 in-flight CPI 1.6683 -- Total Cycles 163555 ---- Thread 08 ---- PC 5: Stalled ----- 97133 in-flight CPI 1.6835 -- Total Cycles 163555 ---- Thread 09 ---- PC 5: Stalled ----- 101828 in-flight CPI 1.6059 -- Total Cycles 163555 ---- Thread 10 ---- PC 5: Stalled ----- 93947 in-flight CPI 1.7406 -- Total Cycles 163555 ---- Thread 11 ---- PC 5: Stalled ----- 94027 in-flight CPI 1.7391 -- Total Cycles 163555 ---- Thread 12 ---- PC 5: Stalled ----- 97554 in-flight CPI 1.6762 -- Total Cycles 163555 ---- Thread 13 ---- PC 5: Stalled ----- 94818 in-flight CPI 1.7246 -- Total Cycles 163555 ---- Thread 14 ---- PC 5: Stalled ----- 94626 in-flight CPI 1.7281 -- Total Cycles 163555 ---- Thread 15 ---- PC 5: Stalled ----- 92169 in-flight CPI 1.7742 -- Total Cycles 163555 ---- Thread 16 ---- PC 5: Stalled ----- 96674 in-flight CPI 1.6915 -- Total Cycles 163555 ---- Thread 17 ---- PC 5: Stalled ----- 92856 in-flight CPI 1.7610 -- Total Cycles 163555 ---- Thread 18 ---- PC 5: Stalled ----- 98145 in-flight CPI 1.6661 -- Total Cycles 163555 ---- Thread 19 ---- PC 5: Stalled ----- 90367 in-flight CPI 1.8096 -- Total Cycles 163555 ---- Thread 20 ---- PC 5: Stalled ----- 88694 in-flight CPI 1.8438 -- Total Cycles 163555 ---- Thread 21 ---- PC 5: Stalled ----- 94088 in-flight CPI 1.7380 -- Total Cycles 163555 ---- Thread 22 ---- PC 5: Stalled ----- 100048 in-flight CPI 1.6345 -- Total Cycles 163555 ---- Thread 23 ---- PC 5: Stalled ----- 93808 in-flight CPI 1.7432 -- Total Cycles 163555 ---- Thread 24 ---- PC 5: Stalled ----- 93540 in-flight CPI 1.7482 -- Total Cycles 163555 ---- Thread 25 ---- PC 5: Stalled ----- 96814 in-flight CPI 1.6891 -- Total Cycles 163555 ---- Thread 26 ---- PC 5: Stalled ----- 86923 in-flight CPI 1.8813 -- Total Cycles 163555 ---- Thread 27 ---- PC 5: Stalled ----- 90632 in-flight CPI 1.8042 -- Total Cycles 163555 ---- Thread 28 ---- PC 5: Stalled ----- 89342 in-flight CPI 1.8304 -- Total Cycles 163555 ---- Thread 29 ---- PC 5: Stalled ----- 85246 in-flight CPI 1.9183 -- Total Cycles 163555 ---- Thread 30 ---- PC 5: Stalled ----- 110531 in-flight CPI 1.4795 -- Total Cycles 163555 ---- Thread 31 ---- PC 5: Stalled ----- 88179 in-flight CPI 1.8545 -- Total Cycles 163555 Total CPI 0.0536 , IPC 18.6677 -- Total Cycles 163555 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8219 (3.574613%) FPSUB: 0 (0.000000%) FPMUL: 32700 (14.221905%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 100424 (43.676471%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5578 (2.425987%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75323 (32.759528%) DIV: 7424 (3.228851%) FPUN: 0 (0.000000%) FPRSUB: 259 (0.112644%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3347607 total) ADD%: 7.464 (249866) SUB%: 0.000 (0) MUL%: 0.006 (201) BITOR%: 1.536 (51412) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.574 (19220) FPSUB%: 0.000 (0) FPMUL%: 4.862 (162754) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (603) FPMAX%: 0.018 (603) LOAD%: 5.208 (174338) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (581) FPINV%: 0.000 (0) FPCONV%: 0.019 (635) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.076 (36007) FPLE%: 0.458 (15325) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (603) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.807 (93956) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.760 (25428) CMPU%: 0.000 (0) RSUB%: 0.006 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.752 (527313) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39451) ORI%: 1.584 (53038) XORI%: 0.000 (0) MULI%: 3.205 (107292) LW%: 1.132 (37904) LWI%: 13.504 (452058) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9599) SWI%: 4.060 (135926) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46970) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10403) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2061) bned%: 0.000 (0) bneid%: 13.849 (463610) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24122) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4297) DIV%: 0.012 (402) FPUN%: 1.484 (49664) FPRSUB%: 3.706 (124055) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.946 (98614) FPGE%: 1.026 (34339) SYNC%: 0.000 (0) NOP%: 8.793 (294355) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 51 FPCMPLT 0 FPMIN 0 FPMAX 392 LOAD 40806 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2331 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49265 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 11685 XORI 0 MULI 9684 LW 0 LWI 143478 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 73 DIV 50 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 18.6679 --Total thread-cycles: 5233760 --total thread-cycles issued: 3053252 (58.337639%) --iCache conflicts: 112224 (2.144233%) --thread*cycles of FU dependence: 257901 (4.927643%) --thread*cycles of data dependence: 229927 (4.393151%) --iCache cycles*banks: 5233760 (63.962409% used) Issue breakdown: --thread*cycles of issue worked: 3053252 (58.337639%) --thread*cycles of issue failed: 1886153 (36.038202%) --thread*cycles of issue NOP/other: 294355 (5.624159%) Number of thread-cycles not ready: 229927 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3347607 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 10 4: 8 5: 8 6: 7 7: 7 8: 7 9: 8 10: 8 11: 7 12: 8 13: 7 14: 8 15: 7 16: 7 17: 8 18: 8 19: 7 20: 5 21: 7 22: 6 23: 7 24: 7 25: 7 26: 7 27: 8 28: 6 29: 7 30: 6 31: 7 <=== Core 21 ===> ---- Thread 00 ---- PC 5: Stalled ----- 93780 in-flight CPI 1.4153 -- Total Cycles 132756 ---- Thread 01 ---- PC 5: Stalled ----- 95642 in-flight CPI 1.3878 -- Total Cycles 132756 ---- Thread 02 ---- PC 5: Stalled ----- 101790 in-flight CPI 1.3040 -- Total Cycles 132756 ---- Thread 03 ---- PC 5: Stalled ----- 96390 in-flight CPI 1.3770 -- Total Cycles 132756 ---- Thread 04 ---- PC 5: Stalled ----- 97028 in-flight CPI 1.3680 -- Total Cycles 132756 ---- Thread 05 ---- PC 5: Stalled ----- 99022 in-flight CPI 1.3404 -- Total Cycles 132756 ---- Thread 06 ---- PC 5: Stalled ----- 95054 in-flight CPI 1.3964 -- Total Cycles 132756 ---- Thread 07 ---- PC 5: Stalled ----- 105350 in-flight CPI 1.2599 -- Total Cycles 132756 ---- Thread 08 ---- PC 5: Stalled ----- 97751 in-flight CPI 1.3579 -- Total Cycles 132756 ---- Thread 09 ---- PC 5: Stalled ----- 98513 in-flight CPI 1.3473 -- Total Cycles 132756 ---- Thread 10 ---- PC 5: Stalled ----- 88316 in-flight CPI 1.5030 -- Total Cycles 132756 ---- Thread 11 ---- PC 5: Stalled ----- 97453 in-flight CPI 1.3620 -- Total Cycles 132756 ---- Thread 12 ---- PC 5: Stalled ----- 93219 in-flight CPI 1.4239 -- Total Cycles 132756 ---- Thread 13 ---- PC 5: Stalled ----- 101076 in-flight CPI 1.3133 -- Total Cycles 132756 ---- Thread 14 ---- PC 5: Stalled ----- 99390 in-flight CPI 1.3354 -- Total Cycles 132756 ---- Thread 15 ---- PC 5: Stalled ----- 92553 in-flight CPI 1.4341 -- Total Cycles 132756 ---- Thread 16 ---- PC 5: Stalled ----- 96311 in-flight CPI 1.3781 -- Total Cycles 132756 ---- Thread 17 ---- PC 5: Stalled ----- 97089 in-flight CPI 1.3671 -- Total Cycles 132756 ---- Thread 18 ---- PC 5: Stalled ----- 99270 in-flight CPI 1.3370 -- Total Cycles 132756 ---- Thread 19 ---- PC 5: Stalled ----- 97102 in-flight CPI 1.3669 -- Total Cycles 132756 ---- Thread 20 ---- PC 5: Stalled ----- 94733 in-flight CPI 1.4011 -- Total Cycles 132756 ---- Thread 21 ---- PC 5: Stalled ----- 91526 in-flight CPI 1.4502 -- Total Cycles 132756 ---- Thread 22 ---- PC 5: Stalled ----- 91397 in-flight CPI 1.4523 -- Total Cycles 132756 ---- Thread 23 ---- PC 5: Stalled ----- 92455 in-flight CPI 1.4356 -- Total Cycles 132756 ---- Thread 24 ---- PC 5: Stalled ----- 95021 in-flight CPI 1.3969 -- Total Cycles 132756 ---- Thread 25 ---- PC 5: Stalled ----- 93727 in-flight CPI 1.4161 -- Total Cycles 132756 ---- Thread 26 ---- PC 5: Stalled ----- 88357 in-flight CPI 1.5022 -- Total Cycles 132756 ---- Thread 27 ---- PC 5: Stalled ----- 86535 in-flight CPI 1.5339 -- Total Cycles 132756 ---- Thread 28 ---- PC 5: Stalled ----- 87526 in-flight CPI 1.5165 -- Total Cycles 132756 ---- Thread 29 ---- PC 5: Stalled ----- 92182 in-flight CPI 1.4398 -- Total Cycles 132756 ---- Thread 30 ---- PC 5: Stalled ----- 93378 in-flight CPI 1.4215 -- Total Cycles 132756 ---- Thread 31 ---- PC 5: Stalled ----- 89269 in-flight CPI 1.4869 -- Total Cycles 132756 Total CPI 0.0437 , IPC 22.8899 -- Total Cycles 132756 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8105 (3.919947%) FPSUB: 0 (0.000000%) FPMUL: 32438 (15.688494%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 78353 (37.895078%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5827 (2.818202%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74101 (35.838617%) DIV: 7676 (3.712463%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.127199%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3332022 total) ADD%: 7.479 (249213) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.537 (51211) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.574 (19110) FPSUB%: 0.000 (0) FPMUL%: 4.846 (161464) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.178 (172545) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (602) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.076 (35858) FPLE%: 0.456 (15186) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.805 (93463) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.756 (25184) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.741 (524501) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39215) ORI%: 1.592 (53059) XORI%: 0.000 (0) MULI%: 3.204 (106758) LW%: 1.132 (37718) LWI%: 13.510 (450140) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9549) SWI%: 4.080 (135944) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46734) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10356) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (1986) bned%: 0.000 (0) bneid%: 13.861 (461851) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24018) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4218) DIV%: 0.012 (416) FPUN%: 1.486 (49506) FPRSUB%: 3.691 (122976) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.952 (98373) FPGE%: 1.030 (34320) SYNC%: 0.000 (0) NOP%: 8.799 (293193) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 54 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 40006 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1931 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49192 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 20 ORI 11548 XORI 0 MULI 9666 LW 0 LWI 142442 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 78 DIV 28 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.8902 --Total thread-cycles: 4248192 --total thread-cycles issued: 3038829 (71.532290%) --iCache conflicts: 113443 (2.670383%) --thread*cycles of FU dependence: 255454 (6.013240%) --thread*cycles of data dependence: 206763 (4.867082%) --iCache cycles*banks: 4248192 (78.434638% used) Issue breakdown: --thread*cycles of issue worked: 3038829 (71.532290%) --thread*cycles of issue failed: 916170 (21.566116%) --thread*cycles of issue NOP/other: 293193 (6.901595%) Number of thread-cycles not ready: 206763 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3332022 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 7 4: 8 5: 9 6: 8 7: 9 8: 7 9: 9 10: 5 11: 7 12: 7 13: 5 14: 8 15: 8 16: 9 17: 8 18: 9 19: 8 20: 7 21: 8 22: 7 23: 7 24: 7 25: 8 26: 7 27: 6 28: 7 29: 8 30: 7 31: 7 <=== Core 22 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95310 in-flight CPI 1.3472 -- Total Cycles 128428 ---- Thread 01 ---- PC 5: Stalled ----- 100640 in-flight CPI 1.2759 -- Total Cycles 128428 ---- Thread 02 ---- PC 5: Stalled ----- 100636 in-flight CPI 1.2759 -- Total Cycles 128428 ---- Thread 03 ---- PC 5: Stalled ----- 95625 in-flight CPI 1.3428 -- Total Cycles 128428 ---- Thread 04 ---- PC 5: Stalled ----- 97440 in-flight CPI 1.3178 -- Total Cycles 128428 ---- Thread 05 ---- PC 5: Stalled ----- 102469 in-flight CPI 1.2531 -- Total Cycles 128428 ---- Thread 06 ---- PC 5: Stalled ----- 103534 in-flight CPI 1.2402 -- Total Cycles 128428 ---- Thread 07 ---- PC 5: Stalled ----- 90387 in-flight CPI 1.4206 -- Total Cycles 128428 ---- Thread 08 ---- PC 5: Stalled ----- 97891 in-flight CPI 1.3117 -- Total Cycles 128428 ---- Thread 09 ---- PC 5: Stalled ----- 100426 in-flight CPI 1.2786 -- Total Cycles 128428 ---- Thread 10 ---- PC 5: Stalled ----- 91846 in-flight CPI 1.3981 -- Total Cycles 128428 ---- Thread 11 ---- PC 5: Stalled ----- 95176 in-flight CPI 1.3491 -- Total Cycles 128428 ---- Thread 12 ---- PC 5: Stalled ----- 96716 in-flight CPI 1.3276 -- Total Cycles 128428 ---- Thread 13 ---- PC 5: Stalled ----- 95092 in-flight CPI 1.3503 -- Total Cycles 128428 ---- Thread 14 ---- PC 5: Stalled ----- 97660 in-flight CPI 1.3148 -- Total Cycles 128428 ---- Thread 15 ---- PC 5: Stalled ----- 98496 in-flight CPI 1.3037 -- Total Cycles 128428 ---- Thread 16 ---- PC 5: Stalled ----- 96985 in-flight CPI 1.3239 -- Total Cycles 128428 ---- Thread 17 ---- PC 5: Stalled ----- 95630 in-flight CPI 1.3427 -- Total Cycles 128428 ---- Thread 18 ---- PC 5: Stalled ----- 90481 in-flight CPI 1.4191 -- Total Cycles 128428 ---- Thread 19 ---- PC 5: Stalled ----- 92771 in-flight CPI 1.3841 -- Total Cycles 128428 ---- Thread 20 ---- PC 5: Stalled ----- 93320 in-flight CPI 1.3759 -- Total Cycles 128428 ---- Thread 21 ---- PC 5: Stalled ----- 91135 in-flight CPI 1.4089 -- Total Cycles 128428 ---- Thread 22 ---- PC 5: Stalled ----- 91692 in-flight CPI 1.4004 -- Total Cycles 128428 ---- Thread 23 ---- PC 5: Stalled ----- 94221 in-flight CPI 1.3628 -- Total Cycles 128428 ---- Thread 24 ---- PC 5: Stalled ----- 88890 in-flight CPI 1.4445 -- Total Cycles 128428 ---- Thread 25 ---- PC 5: Stalled ----- 88564 in-flight CPI 1.4498 -- Total Cycles 128428 ---- Thread 26 ---- PC 5: Stalled ----- 96990 in-flight CPI 1.3239 -- Total Cycles 128428 ---- Thread 27 ---- PC 5: Stalled ----- 85959 in-flight CPI 1.4938 -- Total Cycles 128428 ---- Thread 28 ---- PC 5: Stalled ----- 88512 in-flight CPI 1.4507 -- Total Cycles 128428 ---- Thread 29 ---- PC 5: Stalled ----- 91916 in-flight CPI 1.3969 -- Total Cycles 128428 ---- Thread 30 ---- PC 5: Stalled ----- 89378 in-flight CPI 1.4365 -- Total Cycles 128428 ---- Thread 31 ---- PC 5: Stalled ----- 82553 in-flight CPI 1.5554 -- Total Cycles 128428 Total CPI 0.0425 , IPC 23.5068 -- Total Cycles 128428 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7815 (3.740523%) FPSUB: 0 (0.000000%) FPMUL: 31879 (15.258367%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 83561 (39.995118%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5939 (2.842606%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71564 (34.252948%) DIV: 7900 (3.781207%) FPUN: 0 (0.000000%) FPRSUB: 270 (0.129231%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3310711 total) ADD%: 7.547 (249855) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.526 (50519) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.559 (18511) FPSUB%: 0.000 (0) FPMUL%: 4.810 (159251) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.146 (170364) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (614) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.073 (35540) FPLE%: 0.456 (15085) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.808 (92969) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.742 (24570) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.733 (520889) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (38806) ORI%: 1.568 (51897) XORI%: 0.000 (0) MULI%: 3.217 (106508) LW%: 1.134 (37530) LWI%: 13.549 (448580) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9545) SWI%: 4.081 (135096) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46444) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10297) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1951) bned%: 0.000 (0) bneid%: 13.872 (459251) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23701) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4065) DIV%: 0.013 (428) FPUN%: 1.481 (49029) FPRSUB%: 3.683 (121942) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) FPGT%: 2.971 (98366) FPGE%: 1.025 (33944) SYNC%: 0.000 (0) NOP%: 8.812 (291728) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 50 FPCMPLT 0 FPMIN 0 FPMAX 419 LOAD 38998 INTCONV 0 ATOMIC_INC 26 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2346 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48915 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 11129 XORI 0 MULI 9967 LW 0 LWI 142105 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 71 DIV 36 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5070 --Total thread-cycles: 4109696 --total thread-cycles issued: 3018983 (73.460008%) --iCache conflicts: 112193 (2.729959%) --thread*cycles of FU dependence: 254139 (6.183888%) --thread*cycles of data dependence: 208928 (5.083782%) --iCache cycles*banks: 4109696 (80.559316% used) Issue breakdown: --thread*cycles of issue worked: 3018983 (73.460008%) --thread*cycles of issue failed: 798985 (19.441462%) --thread*cycles of issue NOP/other: 291728 (7.098530%) Number of thread-cycles not ready: 208928 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3310711 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 8 4: 7 5: 9 6: 8 7: 7 8: 8 9: 8 10: 6 11: 7 12: 9 13: 8 14: 8 15: 7 16: 8 17: 8 18: 7 19: 8 20: 8 21: 8 22: 7 23: 8 24: 7 25: 7 26: 8 27: 6 28: 7 29: 9 30: 9 31: 7 <=== Core 23 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98693 in-flight CPI 1.2769 -- Total Cycles 126050 ---- Thread 01 ---- PC 5: Stalled ----- 98441 in-flight CPI 1.2803 -- Total Cycles 126050 ---- Thread 02 ---- PC 5: Stalled ----- 99318 in-flight CPI 1.2689 -- Total Cycles 126050 ---- Thread 03 ---- PC 5: Stalled ----- 100654 in-flight CPI 1.2521 -- Total Cycles 126050 ---- Thread 04 ---- PC 5: Stalled ----- 97396 in-flight CPI 1.2939 -- Total Cycles 126050 ---- Thread 05 ---- PC 5: Stalled ----- 99260 in-flight CPI 1.2697 -- Total Cycles 126050 ---- Thread 06 ---- PC 5: Stalled ----- 97391 in-flight CPI 1.2940 -- Total Cycles 126050 ---- Thread 07 ---- PC 5: Stalled ----- 93736 in-flight CPI 1.3445 -- Total Cycles 126050 ---- Thread 08 ---- PC 5: Stalled ----- 95441 in-flight CPI 1.3205 -- Total Cycles 126050 ---- Thread 09 ---- PC 5: Stalled ----- 103636 in-flight CPI 1.2160 -- Total Cycles 126050 ---- Thread 10 ---- PC 5: Stalled ----- 97468 in-flight CPI 1.2930 -- Total Cycles 126050 ---- Thread 11 ---- PC 5: Stalled ----- 95448 in-flight CPI 1.3204 -- Total Cycles 126050 ---- Thread 12 ---- PC 5: Stalled ----- 95585 in-flight CPI 1.3185 -- Total Cycles 126050 ---- Thread 13 ---- PC 5: Stalled ----- 95868 in-flight CPI 1.3146 -- Total Cycles 126050 ---- Thread 14 ---- PC 5: Stalled ----- 96820 in-flight CPI 1.3016 -- Total Cycles 126050 ---- Thread 15 ---- PC 5: Stalled ----- 92250 in-flight CPI 1.3661 -- Total Cycles 126050 ---- Thread 16 ---- PC 5: Stalled ----- 86223 in-flight CPI 1.4617 -- Total Cycles 126050 ---- Thread 17 ---- PC 5: Stalled ----- 91938 in-flight CPI 1.3708 -- Total Cycles 126050 ---- Thread 18 ---- PC 5: Stalled ----- 89904 in-flight CPI 1.4018 -- Total Cycles 126050 ---- Thread 19 ---- PC 5: Stalled ----- 95566 in-flight CPI 1.3187 -- Total Cycles 126050 ---- Thread 20 ---- PC 5: Stalled ----- 94305 in-flight CPI 1.3364 -- Total Cycles 126050 ---- Thread 21 ---- PC 5: Stalled ----- 97723 in-flight CPI 1.2896 -- Total Cycles 126050 ---- Thread 22 ---- PC 5: Stalled ----- 95618 in-flight CPI 1.3180 -- Total Cycles 126050 ---- Thread 23 ---- PC 5: Stalled ----- 94278 in-flight CPI 1.3367 -- Total Cycles 126050 ---- Thread 24 ---- PC 5: Stalled ----- 89021 in-flight CPI 1.4157 -- Total Cycles 126050 ---- Thread 25 ---- PC 5: Stalled ----- 92166 in-flight CPI 1.3674 -- Total Cycles 126050 ---- Thread 26 ---- PC 5: Stalled ----- 93223 in-flight CPI 1.3519 -- Total Cycles 126050 ---- Thread 27 ---- PC 5: Stalled ----- 94558 in-flight CPI 1.3328 -- Total Cycles 126050 ---- Thread 28 ---- PC 5: Stalled ----- 91116 in-flight CPI 1.3831 -- Total Cycles 126050 ---- Thread 29 ---- PC 5: Stalled ----- 86300 in-flight CPI 1.4604 -- Total Cycles 126050 ---- Thread 30 ---- PC 5: Stalled ----- 87822 in-flight CPI 1.4351 -- Total Cycles 126050 ---- Thread 31 ---- PC 5: Stalled ----- 83082 in-flight CPI 1.5169 -- Total Cycles 126050 Total CPI 0.0417 , IPC 23.9652 -- Total Cycles 126050 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7192 (3.875334%) FPSUB: 0 (0.000000%) FPMUL: 30556 (16.464781%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 67550 (36.398612%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5716 (3.080007%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 66663 (35.920661%) DIV: 7641 (4.117273%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.143331%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3312504 total) ADD%: 7.534 (249573) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.528 (50630) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.515 (17062) FPSUB%: 0.000 (0) FPMUL%: 4.686 (155215) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.126 (169800) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (595) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.055 (34953) FPLE%: 0.461 (15267) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.841 (94122) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.736 (24368) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.784 (522854) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.182 (39140) ORI%: 1.537 (50904) XORI%: 0.000 (0) MULI%: 3.246 (107514) LW%: 1.147 (37980) LWI%: 13.625 (451322) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.291 (9652) SWI%: 4.107 (136050) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.420 (47021) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.313 (10375) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1804) bned%: 0.000 (0) bneid%: 13.895 (460276) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (23953) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.114 (3780) DIV%: 0.012 (414) FPUN%: 1.487 (49263) FPRSUB%: 3.650 (120914) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.981 (98736) FPGE%: 1.026 (33996) SYNC%: 0.000 (0) NOP%: 8.804 (291635) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 25 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 6 FPSUB 0 FPMUL 53 FPCMPLT 0 FPMIN 0 FPMAX 402 LOAD 38471 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 11 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1935 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49340 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 10094 XORI 0 MULI 9989 LW 0 LWI 142539 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 70 DIV 31 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9655 --Total thread-cycles: 4033600 --total thread-cycles issued: 3020869 (74.892627%) --iCache conflicts: 112979 (2.800947%) --thread*cycles of FU dependence: 253028 (6.273007%) --thread*cycles of data dependence: 185584 (4.600952%) --iCache cycles*banks: 4033600 (82.123562% used) Issue breakdown: --thread*cycles of issue worked: 3020869 (74.892627%) --thread*cycles of issue failed: 721096 (17.877231%) --thread*cycles of issue NOP/other: 291635 (7.230142%) Number of thread-cycles not ready: 185584 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3312504 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 8 4: 8 5: 8 6: 8 7: 8 8: 6 9: 9 10: 8 11: 7 12: 7 13: 8 14: 8 15: 8 16: 6 17: 7 18: 7 19: 8 20: 7 21: 9 22: 7 23: 8 24: 7 25: 8 26: 8 27: 8 28: 8 29: 6 30: 6 31: 6 <=== Core 24 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96955 in-flight CPI 1.3065 -- Total Cycles 126690 ---- Thread 01 ---- PC 5: Stalled ----- 95770 in-flight CPI 1.3226 -- Total Cycles 126690 ---- Thread 02 ---- PC 5: Stalled ----- 97797 in-flight CPI 1.2952 -- Total Cycles 126690 ---- Thread 03 ---- PC 5: Stalled ----- 95596 in-flight CPI 1.3250 -- Total Cycles 126690 ---- Thread 04 ---- PC 5: Stalled ----- 103282 in-flight CPI 1.2264 -- Total Cycles 126690 ---- Thread 05 ---- PC 5: Stalled ----- 92910 in-flight CPI 1.3633 -- Total Cycles 126690 ---- Thread 06 ---- PC 5: Stalled ----- 95164 in-flight CPI 1.3311 -- Total Cycles 126690 ---- Thread 07 ---- PC 5: Stalled ----- 95684 in-flight CPI 1.3238 -- Total Cycles 126690 ---- Thread 08 ---- PC 5: Stalled ----- 92591 in-flight CPI 1.3680 -- Total Cycles 126690 ---- Thread 09 ---- PC 5: Stalled ----- 96327 in-flight CPI 1.3150 -- Total Cycles 126690 ---- Thread 10 ---- PC 5: Stalled ----- 95559 in-flight CPI 1.3255 -- Total Cycles 126690 ---- Thread 11 ---- PC 5: Stalled ----- 93909 in-flight CPI 1.3488 -- Total Cycles 126690 ---- Thread 12 ---- PC 5: Stalled ----- 97507 in-flight CPI 1.2990 -- Total Cycles 126690 ---- Thread 13 ---- PC 5: Stalled ----- 98934 in-flight CPI 1.2803 -- Total Cycles 126690 ---- Thread 14 ---- PC 5: Stalled ----- 97344 in-flight CPI 1.3012 -- Total Cycles 126690 ---- Thread 15 ---- PC 5: Stalled ----- 95955 in-flight CPI 1.3200 -- Total Cycles 126690 ---- Thread 16 ---- PC 5: Stalled ----- 95579 in-flight CPI 1.3253 -- Total Cycles 126690 ---- Thread 17 ---- PC 5: Stalled ----- 96061 in-flight CPI 1.3186 -- Total Cycles 126690 ---- Thread 18 ---- PC 5: Stalled ----- 87665 in-flight CPI 1.4449 -- Total Cycles 126690 ---- Thread 19 ---- PC 5: Stalled ----- 89300 in-flight CPI 1.4185 -- Total Cycles 126690 ---- Thread 20 ---- PC 5: Stalled ----- 91369 in-flight CPI 1.3863 -- Total Cycles 126690 ---- Thread 21 ---- PC 5: Stalled ----- 92300 in-flight CPI 1.3723 -- Total Cycles 126690 ---- Thread 22 ---- PC 5: Stalled ----- 93674 in-flight CPI 1.3522 -- Total Cycles 126690 ---- Thread 23 ---- PC 5: Stalled ----- 91684 in-flight CPI 1.3815 -- Total Cycles 126690 ---- Thread 24 ---- PC 5: Stalled ----- 87829 in-flight CPI 1.4422 -- Total Cycles 126690 ---- Thread 25 ---- PC 5: Stalled ----- 92724 in-flight CPI 1.3660 -- Total Cycles 126690 ---- Thread 26 ---- PC 5: Stalled ----- 90185 in-flight CPI 1.4046 -- Total Cycles 126690 ---- Thread 27 ---- PC 5: Stalled ----- 87150 in-flight CPI 1.4535 -- Total Cycles 126690 ---- Thread 28 ---- PC 5: Stalled ----- 82640 in-flight CPI 1.5328 -- Total Cycles 126690 ---- Thread 29 ---- PC 5: Stalled ----- 93965 in-flight CPI 1.3479 -- Total Cycles 126690 ---- Thread 30 ---- PC 5: Stalled ----- 84928 in-flight CPI 1.4915 -- Total Cycles 126690 ---- Thread 31 ---- PC 5: Stalled ----- 83362 in-flight CPI 1.5195 -- Total Cycles 126690 Total CPI 0.0425 , IPC 23.5397 -- Total Cycles 126690 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7709 (3.648367%) FPSUB: 0 (0.000000%) FPMUL: 31447 (14.882631%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 88592 (41.927118%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5566 (2.634169%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70387 (33.311406%) DIV: 7342 (3.474681%) FPUN: 0 (0.000000%) FPRSUB: 257 (0.121628%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3269875 total) ADD%: 7.480 (244588) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.546 (50560) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.558 (18242) FPSUB%: 0.000 (0) FPMUL%: 4.807 (157181) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.161 (168766) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (578) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.069 (34960) FPLE%: 0.463 (15127) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.813 (91989) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (24561) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.766 (515516) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.180 (38577) ORI%: 1.578 (51593) XORI%: 0.000 (0) MULI%: 3.214 (105086) LW%: 1.135 (37114) LWI%: 13.524 (442227) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9391) SWI%: 4.077 (133319) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.407 (45998) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10140) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1815) bned%: 0.000 (0) bneid%: 13.881 (453885) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.727 (23761) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4005) DIV%: 0.012 (398) FPUN%: 1.495 (48898) FPRSUB%: 3.682 (120403) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (80) FPGT%: 2.955 (96622) FPGE%: 1.033 (33771) SYNC%: 0.000 (0) NOP%: 8.795 (287579) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 54 FPCMPLT 0 FPMIN 0 FPMAX 390 LOAD 38675 INTCONV 0 ATOMIC_INC 24 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1812 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48292 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 11017 XORI 0 MULI 9599 LW 0 LWI 140009 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 74 DIV 20 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5399 --Total thread-cycles: 4054080 --total thread-cycles issued: 2982296 (73.562831%) --iCache conflicts: 110743 (2.731643%) --thread*cycles of FU dependence: 250039 (6.167589%) --thread*cycles of data dependence: 211300 (5.212033%) --iCache cycles*banks: 4054080 (80.657190% used) Issue breakdown: --thread*cycles of issue worked: 2982296 (73.562831%) --thread*cycles of issue failed: 784205 (19.343600%) --thread*cycles of issue NOP/other: 287579 (7.093570%) Number of thread-cycles not ready: 211300 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3269875 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 7 4: 8 5: 8 6: 7 7: 7 8: 8 9: 7 10: 8 11: 7 12: 9 13: 7 14: 8 15: 8 16: 7 17: 7 18: 7 19: 6 20: 8 21: 7 22: 7 23: 8 24: 6 25: 8 26: 6 27: 6 28: 6 29: 9 30: 6 31: 6 <=== Core 25 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96910 in-flight CPI 1.3386 -- Total Cycles 129743 ---- Thread 01 ---- PC 5: Stalled ----- 97144 in-flight CPI 1.3354 -- Total Cycles 129743 ---- Thread 02 ---- PC 5: Stalled ----- 98862 in-flight CPI 1.3121 -- Total Cycles 129743 ---- Thread 03 ---- PC 5: Stalled ----- 97851 in-flight CPI 1.3257 -- Total Cycles 129743 ---- Thread 04 ---- PC 5: Stalled ----- 101936 in-flight CPI 1.2725 -- Total Cycles 129743 ---- Thread 05 ---- PC 5: Stalled ----- 98501 in-flight CPI 1.3169 -- Total Cycles 129743 ---- Thread 06 ---- PC 5: Stalled ----- 95191 in-flight CPI 1.3627 -- Total Cycles 129743 ---- Thread 07 ---- PC 5: Stalled ----- 100583 in-flight CPI 1.2897 -- Total Cycles 129743 ---- Thread 08 ---- PC 5: Stalled ----- 93488 in-flight CPI 1.3876 -- Total Cycles 129743 ---- Thread 09 ---- PC 5: Stalled ----- 98067 in-flight CPI 1.3227 -- Total Cycles 129743 ---- Thread 10 ---- PC 5: Stalled ----- 90195 in-flight CPI 1.4383 -- Total Cycles 129743 ---- Thread 11 ---- PC 5: Stalled ----- 93263 in-flight CPI 1.3909 -- Total Cycles 129743 ---- Thread 12 ---- PC 5: Stalled ----- 92790 in-flight CPI 1.3980 -- Total Cycles 129743 ---- Thread 13 ---- PC 5: Stalled ----- 95325 in-flight CPI 1.3608 -- Total Cycles 129743 ---- Thread 14 ---- PC 5: Stalled ----- 98577 in-flight CPI 1.3159 -- Total Cycles 129743 ---- Thread 15 ---- PC 5: Stalled ----- 96055 in-flight CPI 1.3504 -- Total Cycles 129743 ---- Thread 16 ---- PC 5: Stalled ----- 91893 in-flight CPI 1.4117 -- Total Cycles 129743 ---- Thread 17 ---- PC 5: Stalled ----- 92097 in-flight CPI 1.4085 -- Total Cycles 129743 ---- Thread 18 ---- PC 5: Stalled ----- 92279 in-flight CPI 1.4057 -- Total Cycles 129743 ---- Thread 19 ---- PC 5: Stalled ----- 87693 in-flight CPI 1.4793 -- Total Cycles 129743 ---- Thread 20 ---- PC 5: Stalled ----- 98533 in-flight CPI 1.3165 -- Total Cycles 129743 ---- Thread 21 ---- PC 5: Stalled ----- 90213 in-flight CPI 1.4379 -- Total Cycles 129743 ---- Thread 22 ---- PC 5: Stalled ----- 100314 in-flight CPI 1.2931 -- Total Cycles 129743 ---- Thread 23 ---- PC 5: Stalled ----- 92215 in-flight CPI 1.4067 -- Total Cycles 129743 ---- Thread 24 ---- PC 5: Stalled ----- 92069 in-flight CPI 1.4089 -- Total Cycles 129743 ---- Thread 25 ---- PC 5: Stalled ----- 91217 in-flight CPI 1.4221 -- Total Cycles 129743 ---- Thread 26 ---- PC 5: Stalled ----- 91069 in-flight CPI 1.4244 -- Total Cycles 129743 ---- Thread 27 ---- PC 5: Stalled ----- 90580 in-flight CPI 1.4321 -- Total Cycles 129743 ---- Thread 28 ---- PC 5: Stalled ----- 86157 in-flight CPI 1.5057 -- Total Cycles 129743 ---- Thread 29 ---- PC 5: Stalled ----- 88088 in-flight CPI 1.4726 -- Total Cycles 129743 ---- Thread 30 ---- PC 5: Stalled ----- 89289 in-flight CPI 1.4528 -- Total Cycles 129743 ---- Thread 31 ---- PC 5: Stalled ----- 91392 in-flight CPI 1.4194 -- Total Cycles 129743 Total CPI 0.0431 , IPC 23.2027 -- Total Cycles 129743 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8464 (4.216903%) FPSUB: 0 (0.000000%) FPMUL: 32995 (16.438650%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 69120 (34.436717%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5471 (2.725742%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 76993 (38.359174%) DIV: 7416 (3.694773%) FPUN: 0 (0.000000%) FPRSUB: 257 (0.128042%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3301376 total) ADD%: 7.433 (245402) SUB%: 0.000 (0) MUL%: 0.006 (201) BITOR%: 1.527 (50424) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.597 (19708) FPSUB%: 0.000 (0) FPMUL%: 4.927 (162655) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (603) FPMAX%: 0.018 (603) LOAD%: 5.222 (172403) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (576) FPINV%: 0.000 (0) FPCONV%: 0.019 (635) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.088 (35916) FPLE%: 0.455 (15021) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (603) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.788 (92041) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.761 (25122) CMPU%: 0.000 (0) RSUB%: 0.006 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.731 (519345) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (38676) ORI%: 1.602 (52877) XORI%: 0.000 (0) MULI%: 3.191 (105356) LW%: 1.125 (37138) LWI%: 13.477 (444914) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9425) SWI%: 4.048 (133646) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.393 (45990) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10229) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.065 (2161) bned%: 0.000 (0) bneid%: 13.845 (457079) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23610) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.133 (4393) DIV%: 0.012 (402) FPUN%: 1.475 (48685) FPRSUB%: 3.724 (122953) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (68) FPGT%: 2.950 (97385) FPGE%: 1.020 (33664) SYNC%: 0.000 (0) NOP%: 8.813 (290937) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 25 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 63 FPCMPLT 0 FPMIN 0 FPMAX 395 LOAD 39957 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1825 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48380 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 12108 XORI 0 MULI 9543 LW 0 LWI 141097 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 71 DIV 34 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.2029 --Total thread-cycles: 4151776 --total thread-cycles issued: 3010439 (72.509668%) --iCache conflicts: 113319 (2.729410%) --thread*cycles of FU dependence: 253567 (6.107435%) --thread*cycles of data dependence: 200716 (4.834461%) --iCache cycles*banks: 4151776 (79.517970% used) Issue breakdown: --thread*cycles of issue worked: 3010439 (72.509668%) --thread*cycles of issue failed: 850400 (20.482801%) --thread*cycles of issue NOP/other: 290937 (7.007531%) Number of thread-cycles not ready: 200716 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3301376 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 9 5: 8 6: 8 7: 8 8: 6 9: 8 10: 6 11: 8 12: 8 13: 7 14: 7 15: 8 16: 6 17: 7 18: 7 19: 6 20: 8 21: 7 22: 8 23: 7 24: 7 25: 7 26: 8 27: 8 28: 6 29: 8 30: 7 31: 7 <=== Core 26 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100504 in-flight CPI 1.2644 -- Total Cycles 127103 ---- Thread 01 ---- PC 5: Stalled ----- 99658 in-flight CPI 1.2751 -- Total Cycles 127103 ---- Thread 02 ---- PC 5: Stalled ----- 94723 in-flight CPI 1.3416 -- Total Cycles 127103 ---- Thread 03 ---- PC 5: Stalled ----- 98092 in-flight CPI 1.2955 -- Total Cycles 127103 ---- Thread 04 ---- PC 5: Stalled ----- 94764 in-flight CPI 1.3410 -- Total Cycles 127103 ---- Thread 05 ---- PC 5: Stalled ----- 100083 in-flight CPI 1.2697 -- Total Cycles 127103 ---- Thread 06 ---- PC 5: Stalled ----- 100172 in-flight CPI 1.2686 -- Total Cycles 127103 ---- Thread 07 ---- PC 5: Stalled ----- 102447 in-flight CPI 1.2404 -- Total Cycles 127103 ---- Thread 08 ---- PC 5: Stalled ----- 96453 in-flight CPI 1.3175 -- Total Cycles 127103 ---- Thread 09 ---- PC 5: Stalled ----- 90290 in-flight CPI 1.4075 -- Total Cycles 127103 ---- Thread 10 ---- PC 5: Stalled ----- 100543 in-flight CPI 1.2639 -- Total Cycles 127103 ---- Thread 11 ---- PC 5: Stalled ----- 93093 in-flight CPI 1.3651 -- Total Cycles 127103 ---- Thread 12 ---- PC 5: Stalled ----- 98631 in-flight CPI 1.2884 -- Total Cycles 127103 ---- Thread 13 ---- PC 5: Stalled ----- 94743 in-flight CPI 1.3413 -- Total Cycles 127103 ---- Thread 14 ---- PC 5: Stalled ----- 98682 in-flight CPI 1.2878 -- Total Cycles 127103 ---- Thread 15 ---- PC 5: Stalled ----- 94790 in-flight CPI 1.3407 -- Total Cycles 127103 ---- Thread 16 ---- PC 5: Stalled ----- 94024 in-flight CPI 1.3516 -- Total Cycles 127103 ---- Thread 17 ---- PC 5: Stalled ----- 96006 in-flight CPI 1.3237 -- Total Cycles 127103 ---- Thread 18 ---- PC 5: Stalled ----- 94904 in-flight CPI 1.3390 -- Total Cycles 127103 ---- Thread 19 ---- PC 5: Stalled ----- 91918 in-flight CPI 1.3825 -- Total Cycles 127103 ---- Thread 20 ---- PC 5: Stalled ----- 95515 in-flight CPI 1.3304 -- Total Cycles 127103 ---- Thread 21 ---- PC 5: Stalled ----- 96612 in-flight CPI 1.3154 -- Total Cycles 127103 ---- Thread 22 ---- PC 5: Stalled ----- 88828 in-flight CPI 1.4306 -- Total Cycles 127103 ---- Thread 23 ---- PC 5: Stalled ----- 90893 in-flight CPI 1.3981 -- Total Cycles 127103 ---- Thread 24 ---- PC 5: Stalled ----- 93469 in-flight CPI 1.3596 -- Total Cycles 127103 ---- Thread 25 ---- PC 5: Stalled ----- 87633 in-flight CPI 1.4501 -- Total Cycles 127103 ---- Thread 26 ---- PC 5: Stalled ----- 91785 in-flight CPI 1.3845 -- Total Cycles 127103 ---- Thread 27 ---- PC 5: Stalled ----- 85887 in-flight CPI 1.4797 -- Total Cycles 127103 ---- Thread 28 ---- PC 5: Stalled ----- 85818 in-flight CPI 1.4808 -- Total Cycles 127103 ---- Thread 29 ---- PC 5: Stalled ----- 89735 in-flight CPI 1.4161 -- Total Cycles 127103 ---- Thread 30 ---- PC 5: Stalled ----- 89216 in-flight CPI 1.4244 -- Total Cycles 127103 ---- Thread 31 ---- PC 5: Stalled ----- 86041 in-flight CPI 1.4770 -- Total Cycles 127103 Total CPI 0.0421 , IPC 23.7328 -- Total Cycles 127103 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7742 (4.179623%) FPSUB: 0 (0.000000%) FPMUL: 31665 (17.094778%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 61307 (33.097413%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5646 (3.048069%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71004 (38.332470%) DIV: 7603 (4.104582%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.143064%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3307527 total) ADD%: 7.541 (249433) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.514 (50084) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.552 (18260) FPSUB%: 0.000 (0) FPMUL%: 4.800 (158762) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.168 (170932) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (590) FPINV%: 0.000 (0) FPCONV%: 0.020 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35318) FPLE%: 0.453 (14974) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.822 (93341) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24662) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.752 (520997) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (38986) ORI%: 1.558 (51523) XORI%: 0.000 (0) MULI%: 3.224 (106622) LW%: 1.139 (37666) LWI%: 13.572 (448890) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9572) SWI%: 4.094 (135421) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.410 (46629) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10312) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1923) bned%: 0.000 (0) bneid%: 13.850 (458107) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.712 (23545) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4037) DIV%: 0.012 (412) FPUN%: 1.468 (48558) FPRSUB%: 3.685 (121889) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.969 (98217) FPGE%: 1.015 (33584) SYNC%: 0.000 (0) NOP%: 8.797 (290957) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 16 FPSUB 0 FPMUL 54 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 39345 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2325 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48813 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11010 XORI 0 MULI 10278 LW 0 LWI 141964 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 88 DIV 32 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7331 --Total thread-cycles: 4067296 --total thread-cycles issued: 3016570 (74.166473%) --iCache conflicts: 113850 (2.799157%) --thread*cycles of FU dependence: 254407 (6.254942%) --thread*cycles of data dependence: 185232 (4.554180%) --iCache cycles*banks: 4067296 (81.320833% used) Issue breakdown: --thread*cycles of issue worked: 3016570 (74.166473%) --thread*cycles of issue failed: 759769 (18.679953%) --thread*cycles of issue NOP/other: 290957 (7.153573%) Number of thread-cycles not ready: 185232 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3307527 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 9 4: 8 5: 9 6: 8 7: 9 8: 7 9: 6 10: 9 11: 6 12: 8 13: 8 14: 7 15: 7 16: 7 17: 7 18: 8 19: 7 20: 8 21: 6 22: 7 23: 7 24: 8 25: 7 26: 7 27: 6 28: 7 29: 8 30: 7 31: 6 <=== Core 27 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102977 in-flight CPI 1.2495 -- Total Cycles 128700 ---- Thread 01 ---- PC 5: Stalled ----- 102493 in-flight CPI 1.2555 -- Total Cycles 128700 ---- Thread 02 ---- PC 5: Stalled ----- 97761 in-flight CPI 1.3162 -- Total Cycles 128700 ---- Thread 03 ---- PC 5: Stalled ----- 100197 in-flight CPI 1.2842 -- Total Cycles 128700 ---- Thread 04 ---- PC 5: Stalled ----- 95615 in-flight CPI 1.3458 -- Total Cycles 128700 ---- Thread 05 ---- PC 5: Stalled ----- 95170 in-flight CPI 1.3521 -- Total Cycles 128700 ---- Thread 06 ---- PC 5: Stalled ----- 98094 in-flight CPI 1.3118 -- Total Cycles 128700 ---- Thread 07 ---- PC 5: Stalled ----- 104650 in-flight CPI 1.2296 -- Total Cycles 128700 ---- Thread 08 ---- PC 5: Stalled ----- 95529 in-flight CPI 1.3470 -- Total Cycles 128700 ---- Thread 09 ---- PC 5: Stalled ----- 93672 in-flight CPI 1.3737 -- Total Cycles 128700 ---- Thread 10 ---- PC 5: Stalled ----- 99616 in-flight CPI 1.2918 -- Total Cycles 128700 ---- Thread 11 ---- PC 5: Stalled ----- 100784 in-flight CPI 1.2767 -- Total Cycles 128700 ---- Thread 12 ---- PC 5: Stalled ----- 103228 in-flight CPI 1.2465 -- Total Cycles 128700 ---- Thread 13 ---- PC 5: Stalled ----- 95531 in-flight CPI 1.3470 -- Total Cycles 128700 ---- Thread 14 ---- PC 5: Stalled ----- 96506 in-flight CPI 1.3333 -- Total Cycles 128700 ---- Thread 15 ---- PC 5: Stalled ----- 98480 in-flight CPI 1.3066 -- Total Cycles 128700 ---- Thread 16 ---- PC 5: Stalled ----- 97017 in-flight CPI 1.3263 -- Total Cycles 128700 ---- Thread 17 ---- PC 5: Stalled ----- 96038 in-flight CPI 1.3398 -- Total Cycles 128700 ---- Thread 18 ---- PC 5: Stalled ----- 96559 in-flight CPI 1.3326 -- Total Cycles 128700 ---- Thread 19 ---- PC 5: Stalled ----- 95523 in-flight CPI 1.3471 -- Total Cycles 128700 ---- Thread 20 ---- PC 5: Stalled ----- 89056 in-flight CPI 1.4449 -- Total Cycles 128700 ---- Thread 21 ---- PC 5: Stalled ----- 88327 in-flight CPI 1.4569 -- Total Cycles 128700 ---- Thread 22 ---- PC 5: Stalled ----- 91913 in-flight CPI 1.3999 -- Total Cycles 128700 ---- Thread 23 ---- PC 5: Stalled ----- 92667 in-flight CPI 1.3886 -- Total Cycles 128700 ---- Thread 24 ---- PC 5: Stalled ----- 96925 in-flight CPI 1.3276 -- Total Cycles 128700 ---- Thread 25 ---- PC 5: Stalled ----- 92340 in-flight CPI 1.3934 -- Total Cycles 128700 ---- Thread 26 ---- PC 5: Stalled ----- 96729 in-flight CPI 1.3303 -- Total Cycles 128700 ---- Thread 27 ---- PC 5: Stalled ----- 91873 in-flight CPI 1.4006 -- Total Cycles 128700 ---- Thread 28 ---- PC 5: Stalled ----- 90798 in-flight CPI 1.4172 -- Total Cycles 128700 ---- Thread 29 ---- PC 5: Stalled ----- 90400 in-flight CPI 1.4234 -- Total Cycles 128700 ---- Thread 30 ---- PC 5: Stalled ----- 91145 in-flight CPI 1.4118 -- Total Cycles 128700 ---- Thread 31 ---- PC 5: Stalled ----- 91695 in-flight CPI 1.4033 -- Total Cycles 128700 Total CPI 0.0419 , IPC 23.8531 -- Total Cycles 128700 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7584 (4.130067%) FPSUB: 0 (0.000000%) FPMUL: 31748 (17.289208%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 59993 (32.670766%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 6024 (3.280528%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70108 (38.179155%) DIV: 7898 (4.301064%) FPUN: 0 (0.000000%) FPRSUB: 274 (0.149214%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3365803 total) ADD%: 7.503 (252534) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.529 (51478) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (18127) FPSUB%: 0.000 (0) FPMUL%: 4.752 (159951) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.136 (172857) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (619) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (35711) FPLE%: 0.455 (15311) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.834 (95389) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.740 (24899) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.758 (530373) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.181 (39737) ORI%: 1.560 (52499) XORI%: 0.000 (0) MULI%: 3.235 (108884) LW%: 1.144 (38498) LWI%: 13.603 (457863) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9739) SWI%: 4.111 (138377) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.417 (47707) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10470) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1886) bned%: 0.000 (0) bneid%: 13.871 (466859) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (24324) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.118 (3979) DIV%: 0.013 (428) FPUN%: 1.486 (50025) FPRSUB%: 3.667 (123430) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (82) FPGT%: 2.966 (99830) FPGE%: 1.031 (34714) SYNC%: 0.000 (0) NOP%: 8.790 (295853) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 38 BITOR 1 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 49 FPCMPLT 0 FPMIN 0 FPMAX 422 LOAD 38803 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1937 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49835 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 16 ORI 10817 XORI 0 MULI 10264 LW 0 LWI 144541 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 79 DIV 13 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8534 --Total thread-cycles: 4118400 --total thread-cycles issued: 3069950 (74.542298%) --iCache conflicts: 117157 (2.844721%) --thread*cycles of FU dependence: 256893 (6.237689%) --thread*cycles of data dependence: 183629 (4.458746%) --iCache cycles*banks: 4118400 (81.726763% used) Issue breakdown: --thread*cycles of issue worked: 3069950 (74.542298%) --thread*cycles of issue failed: 752597 (18.274014%) --thread*cycles of issue NOP/other: 295853 (7.183688%) Number of thread-cycles not ready: 183629 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3365803 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 10 1: 8 2: 8 3: 8 4: 6 5: 7 6: 7 7: 8 8: 8 9: 8 10: 7 11: 9 12: 9 13: 6 14: 8 15: 8 16: 8 17: 9 18: 8 19: 7 20: 7 21: 6 22: 8 23: 7 24: 8 25: 9 26: 8 27: 8 28: 7 29: 7 30: 7 31: 7 <=== Core 28 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102705 in-flight CPI 1.2493 -- Total Cycles 128333 ---- Thread 01 ---- PC 5: Stalled ----- 96083 in-flight CPI 1.3354 -- Total Cycles 128333 ---- Thread 02 ---- PC 5: Stalled ----- 102794 in-flight CPI 1.2482 -- Total Cycles 128333 ---- Thread 03 ---- PC 5: Stalled ----- 93703 in-flight CPI 1.3693 -- Total Cycles 128333 ---- Thread 04 ---- PC 5: Stalled ----- 92176 in-flight CPI 1.3920 -- Total Cycles 128333 ---- Thread 05 ---- PC 5: Stalled ----- 98496 in-flight CPI 1.3027 -- Total Cycles 128333 ---- Thread 06 ---- PC 5: Stalled ----- 93795 in-flight CPI 1.3680 -- Total Cycles 128333 ---- Thread 07 ---- PC 5: Stalled ----- 101774 in-flight CPI 1.2607 -- Total Cycles 128333 ---- Thread 08 ---- PC 5: Stalled ----- 91362 in-flight CPI 1.4044 -- Total Cycles 128333 ---- Thread 09 ---- PC 5: Stalled ----- 92039 in-flight CPI 1.3941 -- Total Cycles 128333 ---- Thread 10 ---- PC 5: Stalled ----- 97569 in-flight CPI 1.3151 -- Total Cycles 128333 ---- Thread 11 ---- PC 5: Stalled ----- 97006 in-flight CPI 1.3227 -- Total Cycles 128333 ---- Thread 12 ---- PC 5: Stalled ----- 93316 in-flight CPI 1.3750 -- Total Cycles 128333 ---- Thread 13 ---- PC 5: Stalled ----- 90029 in-flight CPI 1.4253 -- Total Cycles 128333 ---- Thread 14 ---- PC 5: Stalled ----- 98241 in-flight CPI 1.3060 -- Total Cycles 128333 ---- Thread 15 ---- PC 5: Stalled ----- 97465 in-flight CPI 1.3164 -- Total Cycles 128333 ---- Thread 16 ---- PC 5: Stalled ----- 89012 in-flight CPI 1.4415 -- Total Cycles 128333 ---- Thread 17 ---- PC 5: Stalled ----- 95791 in-flight CPI 1.3394 -- Total Cycles 128333 ---- Thread 18 ---- PC 5: Stalled ----- 91456 in-flight CPI 1.4030 -- Total Cycles 128333 ---- Thread 19 ---- PC 5: Stalled ----- 95415 in-flight CPI 1.3448 -- Total Cycles 128333 ---- Thread 20 ---- PC 5: Stalled ----- 90275 in-flight CPI 1.4214 -- Total Cycles 128333 ---- Thread 21 ---- PC 5: Stalled ----- 85768 in-flight CPI 1.4960 -- Total Cycles 128333 ---- Thread 22 ---- PC 5: Stalled ----- 91272 in-flight CPI 1.4058 -- Total Cycles 128333 ---- Thread 23 ---- PC 5: Stalled ----- 91924 in-flight CPI 1.3958 -- Total Cycles 128333 ---- Thread 24 ---- PC 5: Stalled ----- 89489 in-flight CPI 1.4338 -- Total Cycles 128333 ---- Thread 25 ---- PC 5: Stalled ----- 99122 in-flight CPI 1.2944 -- Total Cycles 128333 ---- Thread 26 ---- PC 5: Stalled ----- 91119 in-flight CPI 1.4082 -- Total Cycles 128333 ---- Thread 27 ---- PC 5: Stalled ----- 95589 in-flight CPI 1.3422 -- Total Cycles 128333 ---- Thread 28 ---- PC 5: Stalled ----- 91855 in-flight CPI 1.3968 -- Total Cycles 128333 ---- Thread 29 ---- PC 5: Stalled ----- 89415 in-flight CPI 1.4350 -- Total Cycles 128333 ---- Thread 30 ---- PC 5: Stalled ----- 93243 in-flight CPI 1.3760 -- Total Cycles 128333 ---- Thread 31 ---- PC 5: Stalled ----- 87458 in-flight CPI 1.4671 -- Total Cycles 128333 Total CPI 0.0427 , IPC 23.4337 -- Total Cycles 128333 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7561 (3.630349%) FPSUB: 0 (0.000000%) FPMUL: 31345 (15.050031%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 86355 (41.462607%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5542 (2.660943%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69679 (33.455769%) DIV: 7530 (3.615464%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.124837%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3297345 total) ADD%: 7.537 (248518) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.528 (50388) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (17902) FPSUB%: 0.000 (0) FPMUL%: 4.770 (157287) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (612) FPMAX%: 0.019 (612) LOAD%: 5.158 (170065) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (583) FPINV%: 0.000 (0) FPCONV%: 0.020 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35146) FPLE%: 0.458 (15104) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.824 (93104) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.745 (24580) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.760 (519654) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.180 (38897) ORI%: 1.554 (51230) XORI%: 0.000 (0) MULI%: 3.226 (106370) LW%: 1.139 (37568) LWI%: 13.574 (447598) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9548) SWI%: 4.091 (134892) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.410 (46507) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10302) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1865) bned%: 0.000 (0) bneid%: 13.869 (457306) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23717) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (3961) DIV%: 0.012 (408) FPUN%: 1.482 (48855) FPRSUB%: 3.675 (121180) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.966 (97791) FPGE%: 1.024 (33751) SYNC%: 0.000 (0) NOP%: 8.794 (289977) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 25 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 50 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 38948 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1469 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 16 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48893 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10753 XORI 0 MULI 9451 LW 0 LWI 141652 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 74 DIV 23 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4339 --Total thread-cycles: 4106656 --total thread-cycles issued: 3007368 (73.231554%) --iCache conflicts: 111376 (2.712085%) --thread*cycles of FU dependence: 251821 (6.132021%) --thread*cycles of data dependence: 208272 (5.071572%) --iCache cycles*banks: 4106656 (80.293480% used) Issue breakdown: --thread*cycles of issue worked: 3007368 (73.231554%) --thread*cycles of issue failed: 809311 (19.707300%) --thread*cycles of issue NOP/other: 289977 (7.061147%) Number of thread-cycles not ready: 208272 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3297345 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 7 4: 7 5: 8 6: 7 7: 8 8: 8 9: 7 10: 7 11: 8 12: 8 13: 6 14: 9 15: 8 16: 6 17: 8 18: 6 19: 7 20: 6 21: 6 22: 7 23: 8 24: 7 25: 9 26: 7 27: 9 28: 8 29: 6 30: 8 31: 7 <=== Core 29 ===> ---- Thread 00 ---- PC 5: Stalled ----- 91668 in-flight CPI 1.3943 -- Total Cycles 127832 ---- Thread 01 ---- PC 5: Stalled ----- 97646 in-flight CPI 1.3089 -- Total Cycles 127832 ---- Thread 02 ---- PC 5: Stalled ----- 99241 in-flight CPI 1.2879 -- Total Cycles 127832 ---- Thread 03 ---- PC 5: Stalled ----- 92257 in-flight CPI 1.3854 -- Total Cycles 127832 ---- Thread 04 ---- PC 5: Stalled ----- 99418 in-flight CPI 1.2856 -- Total Cycles 127832 ---- Thread 05 ---- PC 5: Stalled ----- 102105 in-flight CPI 1.2517 -- Total Cycles 127832 ---- Thread 06 ---- PC 5: Stalled ----- 93750 in-flight CPI 1.3633 -- Total Cycles 127832 ---- Thread 07 ---- PC 5: Stalled ----- 88942 in-flight CPI 1.4371 -- Total Cycles 127832 ---- Thread 08 ---- PC 5: Stalled ----- 96830 in-flight CPI 1.3199 -- Total Cycles 127832 ---- Thread 09 ---- PC 5: Stalled ----- 103521 in-flight CPI 1.2346 -- Total Cycles 127832 ---- Thread 10 ---- PC 5: Stalled ----- 95514 in-flight CPI 1.3381 -- Total Cycles 127832 ---- Thread 11 ---- PC 5: Stalled ----- 92964 in-flight CPI 1.3748 -- Total Cycles 127832 ---- Thread 12 ---- PC 5: Stalled ----- 97842 in-flight CPI 1.3063 -- Total Cycles 127832 ---- Thread 13 ---- PC 5: Stalled ----- 97127 in-flight CPI 1.3158 -- Total Cycles 127832 ---- Thread 14 ---- PC 5: Stalled ----- 98310 in-flight CPI 1.3001 -- Total Cycles 127832 ---- Thread 15 ---- PC 5: Stalled ----- 98443 in-flight CPI 1.2982 -- Total Cycles 127832 ---- Thread 16 ---- PC 5: Stalled ----- 94858 in-flight CPI 1.3474 -- Total Cycles 127832 ---- Thread 17 ---- PC 5: Stalled ----- 93158 in-flight CPI 1.3720 -- Total Cycles 127832 ---- Thread 18 ---- PC 5: Stalled ----- 95452 in-flight CPI 1.3390 -- Total Cycles 127832 ---- Thread 19 ---- PC 5: Stalled ----- 89910 in-flight CPI 1.4216 -- Total Cycles 127832 ---- Thread 20 ---- PC 5: Stalled ----- 89193 in-flight CPI 1.4329 -- Total Cycles 127832 ---- Thread 21 ---- PC 5: Stalled ----- 92250 in-flight CPI 1.3855 -- Total Cycles 127832 ---- Thread 22 ---- PC 5: Stalled ----- 91228 in-flight CPI 1.4010 -- Total Cycles 127832 ---- Thread 23 ---- PC 5: Stalled ----- 92727 in-flight CPI 1.3784 -- Total Cycles 127832 ---- Thread 24 ---- PC 5: Stalled ----- 93742 in-flight CPI 1.3635 -- Total Cycles 127832 ---- Thread 25 ---- PC 5: Stalled ----- 88195 in-flight CPI 1.4491 -- Total Cycles 127832 ---- Thread 26 ---- PC 5: Stalled ----- 84700 in-flight CPI 1.5090 -- Total Cycles 127832 ---- Thread 27 ---- PC 5: Stalled ----- 90508 in-flight CPI 1.4121 -- Total Cycles 127832 ---- Thread 28 ---- PC 5: Stalled ----- 89839 in-flight CPI 1.4226 -- Total Cycles 127832 ---- Thread 29 ---- PC 5: Stalled ----- 91489 in-flight CPI 1.3970 -- Total Cycles 127832 ---- Thread 30 ---- PC 5: Stalled ----- 88424 in-flight CPI 1.4454 -- Total Cycles 127832 ---- Thread 31 ---- PC 5: Stalled ----- 93253 in-flight CPI 1.3705 -- Total Cycles 127832 Total CPI 0.0425 , IPC 23.5077 -- Total Cycles 127832 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8917 (3.944668%) FPSUB: 0 (0.000000%) FPMUL: 33913 (15.002300%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 89763 (39.709005%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5403 (2.390158%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 80649 (35.677189%) DIV: 7151 (3.163431%) FPUN: 0 (0.000000%) FPRSUB: 256 (0.113248%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3294839 total) ADD%: 7.415 (244322) SUB%: 0.000 (0) MUL%: 0.006 (194) BITOR%: 1.536 (50625) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.631 (20790) FPSUB%: 0.000 (0) FPMUL%: 5.024 (165533) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (582) FPMAX%: 0.018 (582) LOAD%: 5.265 (173481) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (226) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (564) FPINV%: 0.000 (0) FPCONV%: 0.019 (614) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.097 (36141) FPLE%: 0.454 (14968) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (582) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.772 (91324) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.777 (25601) CMPU%: 0.000 (0) RSUB%: 0.006 (194) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.712 (517682) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (38662) ORI%: 1.629 (53688) XORI%: 0.000 (0) MULI%: 3.167 (104340) LW%: 1.118 (36840) LWI%: 13.399 (441489) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.283 (9327) SWI%: 4.023 (132537) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.386 (45657) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10169) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.067 (2195) bned%: 0.000 (0) bneid%: 13.818 (455272) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23519) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.140 (4615) DIV%: 0.012 (388) FPUN%: 1.475 (48609) FPRSUB%: 3.752 (123608) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (57) FPGT%: 2.925 (96372) FPGE%: 1.021 (33641) SYNC%: 0.000 (0) NOP%: 8.794 (289753) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 12 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 59 FPCMPLT 0 FPMIN 0 FPMAX 384 LOAD 41082 INTCONV 0 ATOMIC_INC 12 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2237 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48097 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 12855 XORI 0 MULI 8980 LW 0 LWI 140195 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 104 DIV 18 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5079 --Total thread-cycles: 4090624 --total thread-cycles issued: 3005086 (73.462777%) --iCache conflicts: 112011 (2.738237%) --thread*cycles of FU dependence: 254079 (6.211253%) --thread*cycles of data dependence: 226052 (5.526101%) --iCache cycles*banks: 4090624 (80.546904% used) Issue breakdown: --thread*cycles of issue worked: 3005086 (73.462777%) --thread*cycles of issue failed: 795785 (19.453878%) --thread*cycles of issue NOP/other: 289753 (7.083345%) Number of thread-cycles not ready: 226052 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3294839 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 7 2: 7 3: 7 4: 8 5: 9 6: 6 7: 5 8: 7 9: 9 10: 7 11: 7 12: 7 13: 9 14: 7 15: 9 16: 6 17: 7 18: 8 19: 6 20: 7 21: 7 22: 7 23: 6 24: 6 25: 8 26: 5 27: 8 28: 7 29: 6 30: 7 31: 8 <=== Core 30 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102689 in-flight CPI 1.2357 -- Total Cycles 126920 ---- Thread 01 ---- PC 5: Stalled ----- 101445 in-flight CPI 1.2509 -- Total Cycles 126920 ---- Thread 02 ---- PC 5: Stalled ----- 102340 in-flight CPI 1.2399 -- Total Cycles 126920 ---- Thread 03 ---- PC 5: Stalled ----- 103888 in-flight CPI 1.2214 -- Total Cycles 126920 ---- Thread 04 ---- PC 5: Stalled ----- 96930 in-flight CPI 1.3092 -- Total Cycles 126920 ---- Thread 05 ---- PC 5: Stalled ----- 99499 in-flight CPI 1.2753 -- Total Cycles 126920 ---- Thread 06 ---- PC 5: Stalled ----- 101631 in-flight CPI 1.2486 -- Total Cycles 126920 ---- Thread 07 ---- PC 5: Stalled ----- 99384 in-flight CPI 1.2768 -- Total Cycles 126920 ---- Thread 08 ---- PC 5: Stalled ----- 94378 in-flight CPI 1.3446 -- Total Cycles 126920 ---- Thread 09 ---- PC 5: Stalled ----- 96530 in-flight CPI 1.3146 -- Total Cycles 126920 ---- Thread 10 ---- PC 5: Stalled ----- 101591 in-flight CPI 1.2491 -- Total Cycles 126920 ---- Thread 11 ---- PC 5: Stalled ----- 92548 in-flight CPI 1.3712 -- Total Cycles 126920 ---- Thread 12 ---- PC 5: Stalled ----- 99963 in-flight CPI 1.2694 -- Total Cycles 126920 ---- Thread 13 ---- PC 5: Stalled ----- 99941 in-flight CPI 1.2697 -- Total Cycles 126920 ---- Thread 14 ---- PC 5: Stalled ----- 96724 in-flight CPI 1.3119 -- Total Cycles 126920 ---- Thread 15 ---- PC 5: Stalled ----- 91028 in-flight CPI 1.3940 -- Total Cycles 126920 ---- Thread 16 ---- PC 5: Stalled ----- 91495 in-flight CPI 1.3869 -- Total Cycles 126920 ---- Thread 17 ---- PC 5: Stalled ----- 98285 in-flight CPI 1.2911 -- Total Cycles 126920 ---- Thread 18 ---- PC 5: Stalled ----- 99300 in-flight CPI 1.2779 -- Total Cycles 126920 ---- Thread 19 ---- PC 5: Stalled ----- 93290 in-flight CPI 1.3602 -- Total Cycles 126920 ---- Thread 20 ---- PC 5: Stalled ----- 90581 in-flight CPI 1.4009 -- Total Cycles 126920 ---- Thread 21 ---- PC 5: Stalled ----- 95820 in-flight CPI 1.3243 -- Total Cycles 126920 ---- Thread 22 ---- PC 5: Stalled ----- 96982 in-flight CPI 1.3084 -- Total Cycles 126920 ---- Thread 23 ---- PC 5: Stalled ----- 91195 in-flight CPI 1.3915 -- Total Cycles 126920 ---- Thread 24 ---- PC 5: Stalled ----- 88386 in-flight CPI 1.4357 -- Total Cycles 126920 ---- Thread 25 ---- PC 5: Stalled ----- 87840 in-flight CPI 1.4446 -- Total Cycles 126920 ---- Thread 26 ---- PC 5: Stalled ----- 90745 in-flight CPI 1.3983 -- Total Cycles 126920 ---- Thread 27 ---- PC 5: Stalled ----- 86546 in-flight CPI 1.4663 -- Total Cycles 126920 ---- Thread 28 ---- PC 5: Stalled ----- 84640 in-flight CPI 1.4993 -- Total Cycles 126920 ---- Thread 29 ---- PC 5: Stalled ----- 85657 in-flight CPI 1.4815 -- Total Cycles 126920 ---- Thread 30 ---- PC 5: Stalled ----- 92897 in-flight CPI 1.3660 -- Total Cycles 126920 ---- Thread 31 ---- PC 5: Stalled ----- 83670 in-flight CPI 1.5166 -- Total Cycles 126920 Total CPI 0.0418 , IPC 23.9396 -- Total Cycles 126920 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7617 (4.201486%) FPSUB: 0 (0.000000%) FPMUL: 31578 (17.418213%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 57659 (31.804317%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5971 (3.293563%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70373 (38.817274%) DIV: 7828 (4.317872%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.147275%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3332016 total) ADD%: 7.488 (249510) SUB%: 0.000 (0) MUL%: 0.006 (212) BITOR%: 1.535 (51132) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.545 (18166) FPSUB%: 0.000 (0) FPMUL%: 4.768 (158856) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (636) FPMAX%: 0.019 (636) LOAD%: 5.139 (171239) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (244) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (614) FPINV%: 0.000 (0) FPCONV%: 0.020 (668) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35553) FPLE%: 0.457 (15243) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (636) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.823 (94057) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.743 (24758) CMPU%: 0.000 (0) RSUB%: 0.006 (212) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.758 (525056) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39238) ORI%: 1.565 (52150) XORI%: 0.000 (0) MULI%: 3.227 (107540) LW%: 1.139 (37962) LWI%: 13.572 (452217) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9608) SWI%: 4.089 (136234) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.412 (47037) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10370) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1913) bned%: 0.000 (0) bneid%: 13.891 (462845) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24012) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (3997) DIV%: 0.013 (424) FPUN%: 1.490 (49644) FPRSUB%: 3.671 (122306) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.971 (98981) FPGE%: 1.032 (34401) SYNC%: 0.000 (0) NOP%: 8.810 (293542) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 13 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 56 FPCMPLT 0 FPMIN 0 FPMAX 415 LOAD 39099 INTCONV 0 ATOMIC_INC 24 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1955 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49293 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 20 ORI 10841 XORI 0 MULI 9566 LW 0 LWI 143037 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 70 DIV 21 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9399 --Total thread-cycles: 4061440 --total thread-cycles issued: 3038474 (74.812727%) --iCache conflicts: 114314 (2.814617%) --thread*cycles of FU dependence: 254443 (6.264847%) --thread*cycles of data dependence: 181293 (4.463762%) --iCache cycles*banks: 4061440 (82.041049% used) Issue breakdown: --thread*cycles of issue worked: 3038474 (74.812727%) --thread*cycles of issue failed: 729424 (17.959738%) --thread*cycles of issue NOP/other: 293542 (7.227535%) Number of thread-cycles not ready: 181293 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3332016 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 9 3: 9 4: 7 5: 9 6: 9 7: 8 8: 7 9: 7 10: 8 11: 7 12: 8 13: 7 14: 8 15: 7 16: 8 17: 8 18: 9 19: 8 20: 8 21: 8 22: 9 23: 6 24: 7 25: 7 26: 8 27: 6 28: 6 29: 6 30: 7 31: 7 <=== Core 31 ===> ---- Thread 00 ---- PC 5: Stalled ----- 104236 in-flight CPI 1.2274 -- Total Cycles 127966 ---- Thread 01 ---- PC 5: Stalled ----- 93777 in-flight CPI 1.3644 -- Total Cycles 127966 ---- Thread 02 ---- PC 5: Stalled ----- 101187 in-flight CPI 1.2644 -- Total Cycles 127966 ---- Thread 03 ---- PC 5: Stalled ----- 102989 in-flight CPI 1.2423 -- Total Cycles 127966 ---- Thread 04 ---- PC 5: Stalled ----- 102381 in-flight CPI 1.2496 -- Total Cycles 127966 ---- Thread 05 ---- PC 5: Stalled ----- 89096 in-flight CPI 1.4361 -- Total Cycles 127966 ---- Thread 06 ---- PC 5: Stalled ----- 101564 in-flight CPI 1.2597 -- Total Cycles 127966 ---- Thread 07 ---- PC 5: Stalled ----- 103685 in-flight CPI 1.2340 -- Total Cycles 127966 ---- Thread 08 ---- PC 5: Stalled ----- 103345 in-flight CPI 1.2380 -- Total Cycles 127966 ---- Thread 09 ---- PC 5: Stalled ----- 97075 in-flight CPI 1.3180 -- Total Cycles 127966 ---- Thread 10 ---- PC 5: Stalled ----- 99302 in-flight CPI 1.2884 -- Total Cycles 127966 ---- Thread 11 ---- PC 5: Stalled ----- 98383 in-flight CPI 1.3004 -- Total Cycles 127966 ---- Thread 12 ---- PC 5: Stalled ----- 96868 in-flight CPI 1.3208 -- Total Cycles 127966 ---- Thread 13 ---- PC 5: Stalled ----- 97672 in-flight CPI 1.3099 -- Total Cycles 127966 ---- Thread 14 ---- PC 5: Stalled ----- 99277 in-flight CPI 1.2887 -- Total Cycles 127966 ---- Thread 15 ---- PC 5: Stalled ----- 95185 in-flight CPI 1.3441 -- Total Cycles 127966 ---- Thread 16 ---- PC 5: Stalled ----- 98266 in-flight CPI 1.3020 -- Total Cycles 127966 ---- Thread 17 ---- PC 5: Stalled ----- 95986 in-flight CPI 1.3329 -- Total Cycles 127966 ---- Thread 18 ---- PC 5: Stalled ----- 89805 in-flight CPI 1.4247 -- Total Cycles 127966 ---- Thread 19 ---- PC 5: Stalled ----- 91617 in-flight CPI 1.3965 -- Total Cycles 127966 ---- Thread 20 ---- PC 5: Stalled ----- 94148 in-flight CPI 1.3590 -- Total Cycles 127966 ---- Thread 21 ---- PC 5: Stalled ----- 89549 in-flight CPI 1.4288 -- Total Cycles 127966 ---- Thread 22 ---- PC 5: Stalled ----- 96985 in-flight CPI 1.3192 -- Total Cycles 127966 ---- Thread 23 ---- PC 5: Stalled ----- 94343 in-flight CPI 1.3561 -- Total Cycles 127966 ---- Thread 24 ---- PC 5: Stalled ----- 87617 in-flight CPI 1.4603 -- Total Cycles 127966 ---- Thread 25 ---- PC 5: Stalled ----- 94301 in-flight CPI 1.3567 -- Total Cycles 127966 ---- Thread 26 ---- PC 5: Stalled ----- 90681 in-flight CPI 1.4110 -- Total Cycles 127966 ---- Thread 27 ---- PC 5: Stalled ----- 92582 in-flight CPI 1.3819 -- Total Cycles 127966 ---- Thread 28 ---- PC 5: Stalled ----- 90428 in-flight CPI 1.4149 -- Total Cycles 127966 ---- Thread 29 ---- PC 5: Stalled ----- 85907 in-flight CPI 1.4893 -- Total Cycles 127966 ---- Thread 30 ---- PC 5: Stalled ----- 90032 in-flight CPI 1.4211 -- Total Cycles 127966 ---- Thread 31 ---- PC 5: Stalled ----- 89618 in-flight CPI 1.4276 -- Total Cycles 127966 Total CPI 0.0418 , IPC 23.9005 -- Total Cycles 127966 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7947 (4.114611%) FPSUB: 0 (0.000000%) FPMUL: 32293 (16.719909%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 66217 (34.284279%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5739 (2.971404%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73040 (37.816932%) DIV: 7638 (3.954624%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.138241%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3353525 total) ADD%: 7.514 (251980) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.523 (51080) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.560 (18776) FPSUB%: 0.000 (0) FPMUL%: 4.818 (161578) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.175 (173532) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (597) FPINV%: 0.000 (0) FPCONV%: 0.019 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.071 (35907) FPLE%: 0.453 (15206) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.818 (94512) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (25020) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.743 (527953) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39433) ORI%: 1.572 (52711) XORI%: 0.000 (0) MULI%: 3.219 (107942) LW%: 1.137 (38136) LWI%: 13.554 (454542) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9663) SWI%: 4.080 (136812) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.409 (47246) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10414) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2074) bned%: 0.000 (0) bneid%: 13.850 (464460) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (24087) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4157) DIV%: 0.012 (414) FPUN%: 1.478 (49576) FPRSUB%: 3.691 (123765) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.959 (99229) FPGE%: 1.025 (34370) SYNC%: 0.000 (0) NOP%: 8.797 (295017) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 59 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 39616 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1762 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49485 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 11331 XORI 0 MULI 9751 LW 0 LWI 143917 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 83 DIV 30 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9008 --Total thread-cycles: 4094912 --total thread-cycles issued: 3058508 (74.690445%) --iCache conflicts: 114666 (2.800207%) --thread*cycles of FU dependence: 256543 (6.264921%) --thread*cycles of data dependence: 193141 (4.716609%) --iCache cycles*banks: 4094912 (81.895704% used) Issue breakdown: --thread*cycles of issue worked: 3058508 (74.690445%) --thread*cycles of issue failed: 741387 (18.105078%) --thread*cycles of issue NOP/other: 295017 (7.204477%) Number of thread-cycles not ready: 193141 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3353525 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 5 2: 8 3: 7 4: 9 5: 5 6: 8 7: 8 8: 9 9: 8 10: 8 11: 9 12: 8 13: 8 14: 8 15: 8 16: 8 17: 7 18: 7 19: 7 20: 7 21: 6 22: 8 23: 8 24: 6 25: 8 26: 6 27: 7 28: 7 29: 8 30: 7 31: 7 <=== Core 32 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95021 in-flight CPI 1.3314 -- Total Cycles 126531 ---- Thread 01 ---- PC 5: Stalled ----- 97366 in-flight CPI 1.2993 -- Total Cycles 126531 ---- Thread 02 ---- PC 5: Stalled ----- 102275 in-flight CPI 1.2369 -- Total Cycles 126531 ---- Thread 03 ---- PC 5: Stalled ----- 99244 in-flight CPI 1.2747 -- Total Cycles 126531 ---- Thread 04 ---- PC 5: Stalled ----- 91123 in-flight CPI 1.3883 -- Total Cycles 126531 ---- Thread 05 ---- PC 5: Stalled ----- 102103 in-flight CPI 1.2390 -- Total Cycles 126531 ---- Thread 06 ---- PC 5: Stalled ----- 102832 in-flight CPI 1.2302 -- Total Cycles 126531 ---- Thread 07 ---- PC 5: Stalled ----- 98011 in-flight CPI 1.2907 -- Total Cycles 126531 ---- Thread 08 ---- PC 5: Stalled ----- 99408 in-flight CPI 1.2726 -- Total Cycles 126531 ---- Thread 09 ---- PC 5: Stalled ----- 99734 in-flight CPI 1.2685 -- Total Cycles 126531 ---- Thread 10 ---- PC 5: Stalled ----- 93700 in-flight CPI 1.3502 -- Total Cycles 126531 ---- Thread 11 ---- PC 5: Stalled ----- 93449 in-flight CPI 1.3538 -- Total Cycles 126531 ---- Thread 12 ---- PC 5: Stalled ----- 92522 in-flight CPI 1.3673 -- Total Cycles 126531 ---- Thread 13 ---- PC 5: Stalled ----- 98764 in-flight CPI 1.2809 -- Total Cycles 126531 ---- Thread 14 ---- PC 5: Stalled ----- 95559 in-flight CPI 1.3239 -- Total Cycles 126531 ---- Thread 15 ---- PC 5: Stalled ----- 95370 in-flight CPI 1.3265 -- Total Cycles 126531 ---- Thread 16 ---- PC 5: Stalled ----- 92566 in-flight CPI 1.3667 -- Total Cycles 126531 ---- Thread 17 ---- PC 5: Stalled ----- 94943 in-flight CPI 1.3324 -- Total Cycles 126531 ---- Thread 18 ---- PC 5: Stalled ----- 86812 in-flight CPI 1.4573 -- Total Cycles 126531 ---- Thread 19 ---- PC 5: Stalled ----- 93308 in-flight CPI 1.3558 -- Total Cycles 126531 ---- Thread 20 ---- PC 5: Stalled ----- 97680 in-flight CPI 1.2951 -- Total Cycles 126531 ---- Thread 21 ---- PC 5: Stalled ----- 89092 in-flight CPI 1.4200 -- Total Cycles 126531 ---- Thread 22 ---- PC 5: Stalled ----- 97393 in-flight CPI 1.2989 -- Total Cycles 126531 ---- Thread 23 ---- PC 5: Stalled ----- 96609 in-flight CPI 1.3094 -- Total Cycles 126531 ---- Thread 24 ---- PC 5: Stalled ----- 87621 in-flight CPI 1.4438 -- Total Cycles 126531 ---- Thread 25 ---- PC 5: Stalled ----- 87350 in-flight CPI 1.4482 -- Total Cycles 126531 ---- Thread 26 ---- PC 5: Stalled ----- 91940 in-flight CPI 1.3759 -- Total Cycles 126531 ---- Thread 27 ---- PC 5: Stalled ----- 88498 in-flight CPI 1.4295 -- Total Cycles 126531 ---- Thread 28 ---- PC 5: Stalled ----- 87224 in-flight CPI 1.4504 -- Total Cycles 126531 ---- Thread 29 ---- PC 5: Stalled ----- 88863 in-flight CPI 1.4236 -- Total Cycles 126531 ---- Thread 30 ---- PC 5: Stalled ----- 88243 in-flight CPI 1.4336 -- Total Cycles 126531 ---- Thread 31 ---- PC 5: Stalled ----- 84897 in-flight CPI 1.4901 -- Total Cycles 126531 Total CPI 0.0420 , IPC 23.7894 -- Total Cycles 126531 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7428 (3.536469%) FPSUB: 0 (0.000000%) FPMUL: 30880 (14.701962%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 88869 (42.310512%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5694 (2.710912%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69189 (32.940868%) DIV: 7714 (3.672634%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.126643%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3300166 total) ADD%: 7.482 (246903) SUB%: 0.000 (0) MUL%: 0.006 (209) BITOR%: 1.526 (50368) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.532 (17545) FPSUB%: 0.000 (0) FPMUL%: 4.732 (156177) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (627) FPMAX%: 0.019 (627) LOAD%: 5.167 (170513) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (596) FPINV%: 0.000 (0) FPCONV%: 0.020 (659) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.058 (34928) FPLE%: 0.457 (15092) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (627) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.842 (93789) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.742 (24473) CMPU%: 0.000 (0) RSUB%: 0.006 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.779 (520731) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.183 (39040) ORI%: 1.552 (51227) XORI%: 0.000 (0) MULI%: 3.238 (106844) LW%: 1.147 (37850) LWI%: 13.601 (448847) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.292 (9630) SWI%: 4.118 (135890) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.419 (46841) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.314 (10358) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1908) bned%: 0.000 (0) bneid%: 13.864 (457529) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (23806) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.119 (3928) DIV%: 0.013 (418) FPUN%: 1.484 (48969) FPRSUB%: 3.663 (120878) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.964 (97828) FPGE%: 1.027 (33877) SYNC%: 0.000 (0) NOP%: 8.788 (290019) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 10 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 16 FPSUB 0 FPMUL 58 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 38940 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1601 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 14 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48971 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 10527 XORI 0 MULI 9476 LW 0 LWI 141887 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 26 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7896 --Total thread-cycles: 4048992 --total thread-cycles issued: 3010147 (74.343120%) --iCache conflicts: 111835 (2.762045%) --thread*cycles of FU dependence: 252055 (6.225130%) --thread*cycles of data dependence: 210040 (5.187464%) --iCache cycles*banks: 4048992 (81.506656% used) Issue breakdown: --thread*cycles of issue worked: 3010147 (74.343120%) --thread*cycles of issue failed: 748826 (18.494134%) --thread*cycles of issue NOP/other: 290019 (7.162746%) Number of thread-cycles not ready: 210040 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3300166 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 9 3: 8 4: 7 5: 8 6: 8 7: 9 8: 8 9: 7 10: 6 11: 7 12: 8 13: 8 14: 7 15: 7 16: 7 17: 8 18: 6 19: 8 20: 8 21: 6 22: 9 23: 9 24: 7 25: 8 26: 8 27: 7 28: 7 29: 7 30: 8 31: 7 <=== Core 33 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97141 in-flight CPI 1.4803 -- Total Cycles 143822 ---- Thread 01 ---- PC 5: Stalled ----- 94969 in-flight CPI 1.5142 -- Total Cycles 143822 ---- Thread 02 ---- PC 5: Stalled ----- 91999 in-flight CPI 1.5630 -- Total Cycles 143822 ---- Thread 03 ---- PC 5: Stalled ----- 101770 in-flight CPI 1.4129 -- Total Cycles 143822 ---- Thread 04 ---- PC 5: Stalled ----- 93456 in-flight CPI 1.5386 -- Total Cycles 143822 ---- Thread 05 ---- PC 5: Stalled ----- 102256 in-flight CPI 1.4062 -- Total Cycles 143822 ---- Thread 06 ---- PC 5: Stalled ----- 97851 in-flight CPI 1.4695 -- Total Cycles 143822 ---- Thread 07 ---- PC 5: Stalled ----- 98398 in-flight CPI 1.4613 -- Total Cycles 143822 ---- Thread 08 ---- PC 5: Stalled ----- 96503 in-flight CPI 1.4901 -- Total Cycles 143822 ---- Thread 09 ---- PC 5: Stalled ----- 96865 in-flight CPI 1.4845 -- Total Cycles 143822 ---- Thread 10 ---- PC 5: Stalled ----- 96669 in-flight CPI 1.4875 -- Total Cycles 143822 ---- Thread 11 ---- PC 5: Stalled ----- 92893 in-flight CPI 1.5480 -- Total Cycles 143822 ---- Thread 12 ---- PC 5: Stalled ----- 105291 in-flight CPI 1.3658 -- Total Cycles 143822 ---- Thread 13 ---- PC 5: Stalled ----- 95116 in-flight CPI 1.5118 -- Total Cycles 143822 ---- Thread 14 ---- PC 5: Stalled ----- 95637 in-flight CPI 1.5035 -- Total Cycles 143822 ---- Thread 15 ---- PC 5: Stalled ----- 98938 in-flight CPI 1.4533 -- Total Cycles 143822 ---- Thread 16 ---- PC 5: Stalled ----- 94598 in-flight CPI 1.5201 -- Total Cycles 143822 ---- Thread 17 ---- PC 5: Stalled ----- 95229 in-flight CPI 1.5100 -- Total Cycles 143822 ---- Thread 18 ---- PC 5: Stalled ----- 92186 in-flight CPI 1.5599 -- Total Cycles 143822 ---- Thread 19 ---- PC 5: Stalled ----- 93441 in-flight CPI 1.5388 -- Total Cycles 143822 ---- Thread 20 ---- PC 5: Stalled ----- 88154 in-flight CPI 1.6312 -- Total Cycles 143822 ---- Thread 21 ---- PC 5: Stalled ----- 89431 in-flight CPI 1.6079 -- Total Cycles 143822 ---- Thread 22 ---- PC 5: Stalled ----- 88918 in-flight CPI 1.6172 -- Total Cycles 143822 ---- Thread 23 ---- PC 5: Stalled ----- 97184 in-flight CPI 1.4796 -- Total Cycles 143822 ---- Thread 24 ---- PC 5: Stalled ----- 89202 in-flight CPI 1.6120 -- Total Cycles 143822 ---- Thread 25 ---- PC 5: Stalled ----- 89204 in-flight CPI 1.6120 -- Total Cycles 143822 ---- Thread 26 ---- PC 5: Stalled ----- 89171 in-flight CPI 1.6125 -- Total Cycles 143822 ---- Thread 27 ---- PC 5: Stalled ----- 94375 in-flight CPI 1.5236 -- Total Cycles 143822 ---- Thread 28 ---- PC 5: Stalled ----- 93340 in-flight CPI 1.5406 -- Total Cycles 143822 ---- Thread 29 ---- PC 5: Stalled ----- 83704 in-flight CPI 1.7179 -- Total Cycles 143822 ---- Thread 30 ---- PC 5: Stalled ----- 91850 in-flight CPI 1.5656 -- Total Cycles 143822 ---- Thread 31 ---- PC 5: Stalled ----- 91075 in-flight CPI 1.5789 -- Total Cycles 143822 Total CPI 0.0477 , IPC 20.9799 -- Total Cycles 143822 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7840 (3.585131%) FPSUB: 0 (0.000000%) FPMUL: 31778 (14.531669%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 93840 (42.911821%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5610 (2.565381%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71859 (32.860194%) DIV: 7490 (3.425080%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.120724%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3307931 total) ADD%: 7.464 (246913) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.538 (50865) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.561 (18547) FPSUB%: 0.000 (0) FPMUL%: 4.819 (159410) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.179 (171302) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (585) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35305) FPLE%: 0.457 (15118) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.820 (93288) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (24887) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.763 (521431) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.182 (39109) ORI%: 1.579 (52231) XORI%: 0.000 (0) MULI%: 3.214 (106332) LW%: 1.138 (37640) LWI%: 13.538 (447822) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9547) SWI%: 4.091 (135327) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.409 (46622) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10295) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1860) bned%: 0.000 (0) bneid%: 13.857 (458396) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.724 (23950) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4090) DIV%: 0.012 (406) FPUN%: 1.487 (49193) FPRSUB%: 3.688 (122000) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.951 (97602) FPGE%: 1.030 (34075) SYNC%: 0.000 (0) NOP%: 8.782 (290508) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 51 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 38658 INTCONV 0 ATOMIC_INC 24 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1428 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48858 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11178 XORI 0 MULI 9502 LW 0 LWI 141941 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 79 DIV 23 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.9801 --Total thread-cycles: 4602304 --total thread-cycles issued: 3017423 (65.563314%) --iCache conflicts: 110553 (2.402123%) --thread*cycles of FU dependence: 252211 (5.480103%) --thread*cycles of data dependence: 218681 (4.751555%) --iCache cycles*banks: 4602304 (71.876239% used) Issue breakdown: --thread*cycles of issue worked: 3017423 (65.563314%) --thread*cycles of issue failed: 1294373 (28.124457%) --thread*cycles of issue NOP/other: 290508 (6.312230%) Number of thread-cycles not ready: 218681 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3307931 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 6 2: 7 3: 9 4: 8 5: 9 6: 8 7: 8 8: 7 9: 7 10: 8 11: 7 12: 5 13: 8 14: 8 15: 9 16: 7 17: 8 18: 7 19: 9 20: 6 21: 6 22: 6 23: 8 24: 7 25: 7 26: 8 27: 8 28: 6 29: 7 30: 7 31: 6 <=== Core 34 ===> ---- Thread 00 ---- PC 5: Stalled ----- 91836 in-flight CPI 1.4048 -- Total Cycles 129028 ---- Thread 01 ---- PC 5: Stalled ----- 97314 in-flight CPI 1.3257 -- Total Cycles 129028 ---- Thread 02 ---- PC 5: Stalled ----- 101382 in-flight CPI 1.2725 -- Total Cycles 129028 ---- Thread 03 ---- PC 5: Stalled ----- 101048 in-flight CPI 1.2767 -- Total Cycles 129028 ---- Thread 04 ---- PC 5: Stalled ----- 98137 in-flight CPI 1.3146 -- Total Cycles 129028 ---- Thread 05 ---- PC 5: Stalled ----- 92645 in-flight CPI 1.3926 -- Total Cycles 129028 ---- Thread 06 ---- PC 5: Stalled ----- 97323 in-flight CPI 1.3255 -- Total Cycles 129028 ---- Thread 07 ---- PC 5: Stalled ----- 91841 in-flight CPI 1.4047 -- Total Cycles 129028 ---- Thread 08 ---- PC 5: Stalled ----- 97352 in-flight CPI 1.3251 -- Total Cycles 129028 ---- Thread 09 ---- PC 5: Stalled ----- 96338 in-flight CPI 1.3391 -- Total Cycles 129028 ---- Thread 10 ---- PC 5: Stalled ----- 92760 in-flight CPI 1.3907 -- Total Cycles 129028 ---- Thread 11 ---- PC 5: Stalled ----- 97279 in-flight CPI 1.3261 -- Total Cycles 129028 ---- Thread 12 ---- PC 5: Stalled ----- 91918 in-flight CPI 1.4035 -- Total Cycles 129028 ---- Thread 13 ---- PC 5: Stalled ----- 90246 in-flight CPI 1.4295 -- Total Cycles 129028 ---- Thread 14 ---- PC 5: Stalled ----- 98492 in-flight CPI 1.3097 -- Total Cycles 129028 ---- Thread 15 ---- PC 5: Stalled ----- 95228 in-flight CPI 1.3547 -- Total Cycles 129028 ---- Thread 16 ---- PC 5: Stalled ----- 92749 in-flight CPI 1.3910 -- Total Cycles 129028 ---- Thread 17 ---- PC 5: Stalled ----- 94251 in-flight CPI 1.3687 -- Total Cycles 129028 ---- Thread 18 ---- PC 5: Stalled ----- 94391 in-flight CPI 1.3667 -- Total Cycles 129028 ---- Thread 19 ---- PC 5: Stalled ----- 100891 in-flight CPI 1.2787 -- Total Cycles 129028 ---- Thread 20 ---- PC 5: Stalled ----- 94738 in-flight CPI 1.3617 -- Total Cycles 129028 ---- Thread 21 ---- PC 5: Stalled ----- 92822 in-flight CPI 1.3898 -- Total Cycles 129028 ---- Thread 22 ---- PC 5: Stalled ----- 91562 in-flight CPI 1.4089 -- Total Cycles 129028 ---- Thread 23 ---- PC 5: Stalled ----- 94669 in-flight CPI 1.3627 -- Total Cycles 129028 ---- Thread 24 ---- PC 5: Stalled ----- 88538 in-flight CPI 1.4571 -- Total Cycles 129028 ---- Thread 25 ---- PC 5: Stalled ----- 94503 in-flight CPI 1.3651 -- Total Cycles 129028 ---- Thread 26 ---- PC 5: Stalled ----- 89206 in-flight CPI 1.4461 -- Total Cycles 129028 ---- Thread 27 ---- PC 5: Stalled ----- 90954 in-flight CPI 1.4183 -- Total Cycles 129028 ---- Thread 28 ---- PC 5: Stalled ----- 88112 in-flight CPI 1.4641 -- Total Cycles 129028 ---- Thread 29 ---- PC 5: Stalled ----- 90698 in-flight CPI 1.4223 -- Total Cycles 129028 ---- Thread 30 ---- PC 5: Stalled ----- 84205 in-flight CPI 1.5321 -- Total Cycles 129028 ---- Thread 31 ---- PC 5: Stalled ----- 86726 in-flight CPI 1.4875 -- Total Cycles 129028 Total CPI 0.0430 , IPC 23.2560 -- Total Cycles 129028 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8061 (3.743643%) FPSUB: 0 (0.000000%) FPMUL: 32093 (14.904447%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 89178 (41.415535%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5223 (2.425636%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73352 (34.065715%) DIV: 7162 (3.326135%) FPUN: 0 (0.000000%) FPRSUB: 256 (0.118890%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3290552 total) ADD%: 7.485 (246296) SUB%: 0.000 (0) MUL%: 0.006 (194) BITOR%: 1.541 (50721) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.573 (18854) FPSUB%: 0.000 (0) FPMUL%: 4.856 (159802) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (582) FPMAX%: 0.018 (582) LOAD%: 5.204 (171245) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (226) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (554) FPINV%: 0.000 (0) FPCONV%: 0.019 (614) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.077 (35449) FPLE%: 0.463 (15230) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (582) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.802 (92214) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.757 (24913) CMPU%: 0.000 (0) RSUB%: 0.006 (194) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.756 (518465) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (38668) ORI%: 1.588 (52250) XORI%: 0.000 (0) MULI%: 3.198 (105226) LW%: 1.130 (37196) LWI%: 13.473 (443321) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9463) SWI%: 4.049 (133226) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46045) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10273) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.063 (2071) bned%: 0.000 (0) bneid%: 13.863 (456174) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (23789) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4183) DIV%: 0.012 (388) FPUN%: 1.491 (49055) FPRSUB%: 3.700 (121740) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (56) FPGT%: 2.947 (96974) FPGE%: 1.028 (33825) SYNC%: 0.000 (0) NOP%: 8.808 (289816) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 12 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 52 FPCMPLT 0 FPMIN 0 FPMAX 379 LOAD 40090 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 11 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1362 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48426 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 11541 XORI 0 MULI 9309 LW 0 LWI 140607 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 69 DIV 40 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.2563 --Total thread-cycles: 4128896 --total thread-cycles issued: 3000736 (72.676473%) --iCache conflicts: 112551 (2.725934%) --thread*cycles of FU dependence: 251955 (6.102237%) --thread*cycles of data dependence: 215325 (5.215074%) --iCache cycles*banks: 4128896 (79.696461% used) Issue breakdown: --thread*cycles of issue worked: 3000736 (72.676473%) --thread*cycles of issue failed: 838344 (20.304314%) --thread*cycles of issue NOP/other: 289816 (7.019213%) Number of thread-cycles not ready: 215325 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3290552 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 7 2: 6 3: 6 4: 7 5: 4 6: 9 7: 7 8: 8 9: 8 10: 8 11: 8 12: 7 13: 7 14: 9 15: 8 16: 6 17: 8 18: 7 19: 7 20: 8 21: 7 22: 7 23: 8 24: 6 25: 7 26: 7 27: 8 28: 6 29: 8 30: 5 31: 6 <=== Core 35 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96901 in-flight CPI 1.4830 -- Total Cycles 143728 ---- Thread 01 ---- PC 5: Stalled ----- 101307 in-flight CPI 1.4186 -- Total Cycles 143728 ---- Thread 02 ---- PC 5: Stalled ----- 96005 in-flight CPI 1.4968 -- Total Cycles 143728 ---- Thread 03 ---- PC 5: Stalled ----- 97446 in-flight CPI 1.4747 -- Total Cycles 143728 ---- Thread 04 ---- PC 5: Stalled ----- 97140 in-flight CPI 1.4793 -- Total Cycles 143728 ---- Thread 05 ---- PC 5: Stalled ----- 94781 in-flight CPI 1.5162 -- Total Cycles 143728 ---- Thread 06 ---- PC 5: Stalled ----- 100513 in-flight CPI 1.4296 -- Total Cycles 143728 ---- Thread 07 ---- PC 5: Stalled ----- 98890 in-flight CPI 1.4531 -- Total Cycles 143728 ---- Thread 08 ---- PC 5: Stalled ----- 100295 in-flight CPI 1.4328 -- Total Cycles 143728 ---- Thread 09 ---- PC 5: Stalled ----- 100624 in-flight CPI 1.4281 -- Total Cycles 143728 ---- Thread 10 ---- PC 5: Stalled ----- 92234 in-flight CPI 1.5581 -- Total Cycles 143728 ---- Thread 11 ---- PC 5: Stalled ----- 93162 in-flight CPI 1.5425 -- Total Cycles 143728 ---- Thread 12 ---- PC 5: Stalled ----- 94020 in-flight CPI 1.5285 -- Total Cycles 143728 ---- Thread 13 ---- PC 5: Stalled ----- 98038 in-flight CPI 1.4658 -- Total Cycles 143728 ---- Thread 14 ---- PC 5: Stalled ----- 93935 in-flight CPI 1.5299 -- Total Cycles 143728 ---- Thread 15 ---- PC 5: Stalled ----- 94506 in-flight CPI 1.5206 -- Total Cycles 143728 ---- Thread 16 ---- PC 5: Stalled ----- 90719 in-flight CPI 1.5840 -- Total Cycles 143728 ---- Thread 17 ---- PC 5: Stalled ----- 93130 in-flight CPI 1.5430 -- Total Cycles 143728 ---- Thread 18 ---- PC 5: Stalled ----- 95794 in-flight CPI 1.5002 -- Total Cycles 143728 ---- Thread 19 ---- PC 5: Stalled ----- 93794 in-flight CPI 1.5321 -- Total Cycles 143728 ---- Thread 20 ---- PC 5: Stalled ----- 95518 in-flight CPI 1.5045 -- Total Cycles 143728 ---- Thread 21 ---- PC 5: Stalled ----- 92666 in-flight CPI 1.5507 -- Total Cycles 143728 ---- Thread 22 ---- PC 5: Stalled ----- 92383 in-flight CPI 1.5556 -- Total Cycles 143728 ---- Thread 23 ---- PC 5: Stalled ----- 95230 in-flight CPI 1.5090 -- Total Cycles 143728 ---- Thread 24 ---- PC 5: Stalled ----- 96875 in-flight CPI 1.4833 -- Total Cycles 143728 ---- Thread 25 ---- PC 5: Stalled ----- 96362 in-flight CPI 1.4913 -- Total Cycles 143728 ---- Thread 26 ---- PC 5: Stalled ----- 85813 in-flight CPI 1.6745 -- Total Cycles 143728 ---- Thread 27 ---- PC 5: Stalled ----- 92438 in-flight CPI 1.5545 -- Total Cycles 143728 ---- Thread 28 ---- PC 5: Stalled ----- 82331 in-flight CPI 1.7455 -- Total Cycles 143728 ---- Thread 29 ---- PC 5: Stalled ----- 85184 in-flight CPI 1.6870 -- Total Cycles 143728 ---- Thread 30 ---- PC 5: Stalled ----- 93291 in-flight CPI 1.5403 -- Total Cycles 143728 ---- Thread 31 ---- PC 5: Stalled ----- 86739 in-flight CPI 1.6567 -- Total Cycles 143728 Total CPI 0.0476 , IPC 21.0021 -- Total Cycles 143728 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8655 (3.409870%) FPSUB: 0 (0.000000%) FPMUL: 33172 (13.069001%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 120732 (47.565617%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5138 (2.024253%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 78750 (31.025679%) DIV: 7126 (2.807479%) FPUN: 0 (0.000000%) FPRSUB: 249 (0.098100%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3309620 total) ADD%: 7.451 (246584) SUB%: 0.000 (0) MUL%: 0.006 (193) BITOR%: 1.535 (50817) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.603 (19972) FPSUB%: 0.000 (0) FPMUL%: 4.943 (163581) FPCMPLT%: 0.000 (0) FPMIN%: 0.017 (579) FPMAX%: 0.017 (579) LOAD%: 5.264 (174232) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (225) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (549) FPINV%: 0.000 (0) FPCONV%: 0.018 (611) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.086 (35938) FPLE%: 0.458 (15154) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.017 (579) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.788 (92268) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.772 (25542) CMPU%: 0.000 (0) RSUB%: 0.006 (193) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.748 (521187) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (38942) ORI%: 1.610 (53285) XORI%: 0.000 (0) MULI%: 3.177 (105162) LW%: 1.124 (37216) LWI%: 13.420 (444149) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9474) SWI%: 4.047 (133938) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.392 (46063) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10307) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.065 (2164) bned%: 0.000 (0) bneid%: 13.820 (457406) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23732) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.136 (4502) DIV%: 0.012 (386) FPUN%: 1.477 (48898) FPRSUB%: 3.727 (123366) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (58) FPGT%: 2.930 (96972) FPGE%: 1.020 (33744) SYNC%: 0.000 (0) NOP%: 8.792 (290977) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 56 FPCMPLT 0 FPMIN 0 FPMAX 373 LOAD 41365 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 20 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1467 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48405 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 12424 XORI 0 MULI 8790 LW 0 LWI 141020 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 78 DIV 19 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.0023 --Total thread-cycles: 4599296 --total thread-cycles issued: 3018643 (65.632719%) --iCache conflicts: 110586 (2.404411%) --thread*cycles of FU dependence: 254117 (5.525128%) --thread*cycles of data dependence: 253822 (5.518714%) --iCache cycles*banks: 4599296 (71.959970% used) Issue breakdown: --thread*cycles of issue worked: 3018643 (65.632719%) --thread*cycles of issue failed: 1289676 (28.040726%) --thread*cycles of issue NOP/other: 290977 (6.326555%) Number of thread-cycles not ready: 253822 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3309620 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 5 2: 7 3: 8 4: 7 5: 6 6: 9 7: 8 8: 8 9: 8 10: 6 11: 7 12: 6 13: 8 14: 5 15: 6 16: 7 17: 8 18: 6 19: 8 20: 7 21: 8 22: 5 23: 8 24: 8 25: 7 26: 8 27: 8 28: 5 29: 6 30: 8 31: 7 <=== Core 36 ===> ---- Thread 00 ---- PC 5: Stalled ----- 93562 in-flight CPI 1.3708 -- Total Cycles 128277 ---- Thread 01 ---- PC 5: Stalled ----- 99945 in-flight CPI 1.2833 -- Total Cycles 128277 ---- Thread 02 ---- PC 5: Stalled ----- 98093 in-flight CPI 1.3075 -- Total Cycles 128277 ---- Thread 03 ---- PC 5: Stalled ----- 102946 in-flight CPI 1.2458 -- Total Cycles 128277 ---- Thread 04 ---- PC 5: Stalled ----- 102280 in-flight CPI 1.2539 -- Total Cycles 128277 ---- Thread 05 ---- PC 5: Stalled ----- 92732 in-flight CPI 1.3831 -- Total Cycles 128277 ---- Thread 06 ---- PC 5: Stalled ----- 98710 in-flight CPI 1.2993 -- Total Cycles 128277 ---- Thread 07 ---- PC 5: Stalled ----- 97144 in-flight CPI 1.3202 -- Total Cycles 128277 ---- Thread 08 ---- PC 5: Stalled ----- 102184 in-flight CPI 1.2551 -- Total Cycles 128277 ---- Thread 09 ---- PC 5: Stalled ----- 99199 in-flight CPI 1.2929 -- Total Cycles 128277 ---- Thread 10 ---- PC 5: Stalled ----- 94680 in-flight CPI 1.3546 -- Total Cycles 128277 ---- Thread 11 ---- PC 5: Stalled ----- 99269 in-flight CPI 1.2919 -- Total Cycles 128277 ---- Thread 12 ---- PC 5: Stalled ----- 95034 in-flight CPI 1.3496 -- Total Cycles 128277 ---- Thread 13 ---- PC 5: Stalled ----- 91308 in-flight CPI 1.4047 -- Total Cycles 128277 ---- Thread 14 ---- PC 5: Stalled ----- 92711 in-flight CPI 1.3834 -- Total Cycles 128277 ---- Thread 15 ---- PC 5: Stalled ----- 91939 in-flight CPI 1.3950 -- Total Cycles 128277 ---- Thread 16 ---- PC 5: Stalled ----- 90152 in-flight CPI 1.4226 -- Total Cycles 128277 ---- Thread 17 ---- PC 5: Stalled ----- 96573 in-flight CPI 1.3280 -- Total Cycles 128277 ---- Thread 18 ---- PC 5: Stalled ----- 95442 in-flight CPI 1.3438 -- Total Cycles 128277 ---- Thread 19 ---- PC 5: Stalled ----- 94962 in-flight CPI 1.3506 -- Total Cycles 128277 ---- Thread 20 ---- PC 5: Stalled ----- 95418 in-flight CPI 1.3441 -- Total Cycles 128277 ---- Thread 21 ---- PC 5: Stalled ----- 99408 in-flight CPI 1.2902 -- Total Cycles 128277 ---- Thread 22 ---- PC 5: Stalled ----- 88594 in-flight CPI 1.4477 -- Total Cycles 128277 ---- Thread 23 ---- PC 5: Stalled ----- 92629 in-flight CPI 1.3846 -- Total Cycles 128277 ---- Thread 24 ---- PC 5: Stalled ----- 95167 in-flight CPI 1.3476 -- Total Cycles 128277 ---- Thread 25 ---- PC 5: Stalled ----- 92535 in-flight CPI 1.3860 -- Total Cycles 128277 ---- Thread 26 ---- PC 5: Stalled ----- 92462 in-flight CPI 1.3871 -- Total Cycles 128277 ---- Thread 27 ---- PC 5: Stalled ----- 89415 in-flight CPI 1.4343 -- Total Cycles 128277 ---- Thread 28 ---- PC 5: Stalled ----- 87555 in-flight CPI 1.4649 -- Total Cycles 128277 ---- Thread 29 ---- PC 5: Stalled ----- 88961 in-flight CPI 1.4416 -- Total Cycles 128277 ---- Thread 30 ---- PC 5: Stalled ----- 83268 in-flight CPI 1.5403 -- Total Cycles 128277 ---- Thread 31 ---- PC 5: Stalled ----- 89655 in-flight CPI 1.4305 -- Total Cycles 128277 Total CPI 0.0424 , IPC 23.5778 -- Total Cycles 128277 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7611 (3.902517%) FPSUB: 0 (0.000000%) FPMUL: 31427 (16.114096%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 72730 (37.292081%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5631 (2.887278%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69835 (35.807679%) DIV: 7533 (3.862522%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.133827%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3316240 total) ADD%: 7.475 (247874) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.546 (51254) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (18007) FPSUB%: 0.000 (0) FPMUL%: 4.766 (158041) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.153 (170877) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (587) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (35324) FPLE%: 0.461 (15285) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.825 (93689) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (24806) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.776 (523175) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.183 (39227) ORI%: 1.564 (51877) XORI%: 0.000 (0) MULI%: 3.225 (106938) LW%: 1.140 (37802) LWI%: 13.550 (449359) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9608) SWI%: 4.087 (135519) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.411 (46802) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10360) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1780) bned%: 0.000 (0) bneid%: 13.895 (460789) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (23990) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (3970) DIV%: 0.012 (408) FPUN%: 1.496 (49625) FPRSUB%: 3.671 (121745) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.961 (98196) FPGE%: 1.036 (34340) SYNC%: 0.000 (0) NOP%: 8.796 (291696) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 33 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 56 FPCMPLT 0 FPMIN 0 FPMAX 402 LOAD 39195 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2164 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49237 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 19 ORI 10829 XORI 0 MULI 10057 LW 0 LWI 142185 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 78 DIV 18 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5780 --Total thread-cycles: 4104864 --total thread-cycles issued: 3024544 (73.681954%) --iCache conflicts: 113566 (2.766620%) --thread*cycles of FU dependence: 254330 (6.195820%) --thread*cycles of data dependence: 195028 (4.751144%) --iCache cycles*banks: 4104864 (80.788840% used) Issue breakdown: --thread*cycles of issue worked: 3024544 (73.681954%) --thread*cycles of issue failed: 788624 (19.211940%) --thread*cycles of issue NOP/other: 291696 (7.106106%) Number of thread-cycles not ready: 195028 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3316240 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 8 4: 8 5: 6 6: 8 7: 8 8: 8 9: 8 10: 8 11: 9 12: 7 13: 6 14: 7 15: 7 16: 7 17: 9 18: 7 19: 7 20: 8 21: 7 22: 7 23: 8 24: 8 25: 7 26: 6 27: 8 28: 6 29: 8 30: 6 31: 8 <=== Core 37 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95081 in-flight CPI 1.3369 -- Total Cycles 127142 ---- Thread 01 ---- PC 5: Stalled ----- 102502 in-flight CPI 1.2401 -- Total Cycles 127142 ---- Thread 02 ---- PC 5: Stalled ----- 97185 in-flight CPI 1.3079 -- Total Cycles 127142 ---- Thread 03 ---- PC 5: Stalled ----- 95499 in-flight CPI 1.3311 -- Total Cycles 127142 ---- Thread 04 ---- PC 5: Stalled ----- 95269 in-flight CPI 1.3343 -- Total Cycles 127142 ---- Thread 05 ---- PC 5: Stalled ----- 95157 in-flight CPI 1.3359 -- Total Cycles 127142 ---- Thread 06 ---- PC 5: Stalled ----- 99653 in-flight CPI 1.2756 -- Total Cycles 127142 ---- Thread 07 ---- PC 5: Stalled ----- 89621 in-flight CPI 1.4185 -- Total Cycles 127142 ---- Thread 08 ---- PC 5: Stalled ----- 96025 in-flight CPI 1.3238 -- Total Cycles 127142 ---- Thread 09 ---- PC 5: Stalled ----- 99304 in-flight CPI 1.2801 -- Total Cycles 127142 ---- Thread 10 ---- PC 5: Stalled ----- 98995 in-flight CPI 1.2841 -- Total Cycles 127142 ---- Thread 11 ---- PC 5: Stalled ----- 95845 in-flight CPI 1.3263 -- Total Cycles 127142 ---- Thread 12 ---- PC 5: Stalled ----- 98212 in-flight CPI 1.2943 -- Total Cycles 127142 ---- Thread 13 ---- PC 5: Stalled ----- 100556 in-flight CPI 1.2641 -- Total Cycles 127142 ---- Thread 14 ---- PC 5: Stalled ----- 99072 in-flight CPI 1.2831 -- Total Cycles 127142 ---- Thread 15 ---- PC 5: Stalled ----- 95556 in-flight CPI 1.3303 -- Total Cycles 127142 ---- Thread 16 ---- PC 5: Stalled ----- 90023 in-flight CPI 1.4121 -- Total Cycles 127142 ---- Thread 17 ---- PC 5: Stalled ----- 90008 in-flight CPI 1.4123 -- Total Cycles 127142 ---- Thread 18 ---- PC 5: Stalled ----- 93053 in-flight CPI 1.3661 -- Total Cycles 127142 ---- Thread 19 ---- PC 5: Stalled ----- 100315 in-flight CPI 1.2672 -- Total Cycles 127142 ---- Thread 20 ---- PC 5: Stalled ----- 97005 in-flight CPI 1.3104 -- Total Cycles 127142 ---- Thread 21 ---- PC 5: Stalled ----- 88425 in-flight CPI 1.4376 -- Total Cycles 127142 ---- Thread 22 ---- PC 5: Stalled ----- 95568 in-flight CPI 1.3302 -- Total Cycles 127142 ---- Thread 23 ---- PC 5: Stalled ----- 94442 in-flight CPI 1.3460 -- Total Cycles 127142 ---- Thread 24 ---- PC 5: Stalled ----- 95147 in-flight CPI 1.3360 -- Total Cycles 127142 ---- Thread 25 ---- PC 5: Stalled ----- 93634 in-flight CPI 1.3576 -- Total Cycles 127142 ---- Thread 26 ---- PC 5: Stalled ----- 92331 in-flight CPI 1.3768 -- Total Cycles 127142 ---- Thread 27 ---- PC 5: Stalled ----- 86371 in-flight CPI 1.4718 -- Total Cycles 127142 ---- Thread 28 ---- PC 5: Stalled ----- 94735 in-flight CPI 1.3418 -- Total Cycles 127142 ---- Thread 29 ---- PC 5: Stalled ----- 89934 in-flight CPI 1.4135 -- Total Cycles 127142 ---- Thread 30 ---- PC 5: Stalled ----- 88244 in-flight CPI 1.4405 -- Total Cycles 127142 ---- Thread 31 ---- PC 5: Stalled ----- 83270 in-flight CPI 1.5266 -- Total Cycles 127142 Total CPI 0.0420 , IPC 23.8050 -- Total Cycles 127142 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7547 (3.714897%) FPSUB: 0 (0.000000%) FPMUL: 31280 (15.397111%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 80922 (39.832640%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5706 (2.808693%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69676 (34.296965%) DIV: 7760 (3.819744%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.129950%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3317935 total) ADD%: 7.538 (250108) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.542 (51175) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (17868) FPSUB%: 0.000 (0) FPMUL%: 4.749 (157570) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.145 (170716) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (598) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35269) FPLE%: 0.459 (15238) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.828 (93835) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.744 (24683) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.758 (522846) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.183 (39236) ORI%: 1.556 (51621) XORI%: 0.000 (0) MULI%: 3.229 (107148) LW%: 1.141 (37870) LWI%: 13.573 (450359) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9622) SWI%: 4.102 (136094) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.413 (46877) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10351) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1753) bned%: 0.000 (0) bneid%: 13.876 (460407) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.724 (24030) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.119 (3956) DIV%: 0.013 (420) FPUN%: 1.495 (49599) FPRSUB%: 3.664 (121570) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.957 (98102) FPGE%: 1.036 (34361) SYNC%: 0.000 (0) NOP%: 8.779 (291268) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 25 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 6 FPSUB 0 FPMUL 49 FPCMPLT 0 FPMIN 0 FPMAX 414 LOAD 39517 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1486 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49107 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10662 XORI 0 MULI 10027 LW 0 LWI 142452 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 60 DIV 34 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8052 --Total thread-cycles: 4068544 --total thread-cycles issued: 3026667 (74.391895%) --iCache conflicts: 112561 (2.766616%) --thread*cycles of FU dependence: 253910 (6.240808%) --thread*cycles of data dependence: 203155 (4.993310%) --iCache cycles*banks: 4068544 (81.551705% used) Issue breakdown: --thread*cycles of issue worked: 3026667 (74.391895%) --thread*cycles of issue failed: 750609 (18.449082%) --thread*cycles of issue NOP/other: 291268 (7.159023%) Number of thread-cycles not ready: 203155 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3317935 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 9 3: 7 4: 8 5: 8 6: 8 7: 6 8: 7 9: 8 10: 8 11: 7 12: 8 13: 9 14: 8 15: 8 16: 6 17: 7 18: 8 19: 8 20: 8 21: 7 22: 7 23: 8 24: 8 25: 7 26: 7 27: 7 28: 8 29: 7 30: 7 31: 6 <=== Core 38 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100825 in-flight CPI 1.2545 -- Total Cycles 126508 ---- Thread 01 ---- PC 5: Stalled ----- 97758 in-flight CPI 1.2939 -- Total Cycles 126508 ---- Thread 02 ---- PC 5: Stalled ----- 97722 in-flight CPI 1.2943 -- Total Cycles 126508 ---- Thread 03 ---- PC 5: Stalled ----- 98494 in-flight CPI 1.2842 -- Total Cycles 126508 ---- Thread 04 ---- PC 5: Stalled ----- 97122 in-flight CPI 1.3024 -- Total Cycles 126508 ---- Thread 05 ---- PC 5: Stalled ----- 96435 in-flight CPI 1.3116 -- Total Cycles 126508 ---- Thread 06 ---- PC 5: Stalled ----- 94579 in-flight CPI 1.3373 -- Total Cycles 126508 ---- Thread 07 ---- PC 5: Stalled ----- 102462 in-flight CPI 1.2344 -- Total Cycles 126508 ---- Thread 08 ---- PC 5: Stalled ----- 96029 in-flight CPI 1.3172 -- Total Cycles 126508 ---- Thread 09 ---- PC 5: Stalled ----- 96630 in-flight CPI 1.3089 -- Total Cycles 126508 ---- Thread 10 ---- PC 5: Stalled ----- 88616 in-flight CPI 1.4274 -- Total Cycles 126508 ---- Thread 11 ---- PC 5: Stalled ----- 101735 in-flight CPI 1.2433 -- Total Cycles 126508 ---- Thread 12 ---- PC 5: Stalled ----- 98404 in-flight CPI 1.2853 -- Total Cycles 126508 ---- Thread 13 ---- PC 5: Stalled ----- 92095 in-flight CPI 1.3734 -- Total Cycles 126508 ---- Thread 14 ---- PC 5: Stalled ----- 91395 in-flight CPI 1.3840 -- Total Cycles 126508 ---- Thread 15 ---- PC 5: Stalled ----- 90604 in-flight CPI 1.3960 -- Total Cycles 126508 ---- Thread 16 ---- PC 5: Stalled ----- 90472 in-flight CPI 1.3981 -- Total Cycles 126508 ---- Thread 17 ---- PC 5: Stalled ----- 97508 in-flight CPI 1.2972 -- Total Cycles 126508 ---- Thread 18 ---- PC 5: Stalled ----- 90543 in-flight CPI 1.3970 -- Total Cycles 126508 ---- Thread 19 ---- PC 5: Stalled ----- 96081 in-flight CPI 1.3165 -- Total Cycles 126508 ---- Thread 20 ---- PC 5: Stalled ----- 97676 in-flight CPI 1.2950 -- Total Cycles 126508 ---- Thread 21 ---- PC 5: Stalled ----- 99202 in-flight CPI 1.2750 -- Total Cycles 126508 ---- Thread 22 ---- PC 5: Stalled ----- 90773 in-flight CPI 1.3934 -- Total Cycles 126508 ---- Thread 23 ---- PC 5: Stalled ----- 92889 in-flight CPI 1.3617 -- Total Cycles 126508 ---- Thread 24 ---- PC 5: Stalled ----- 88851 in-flight CPI 1.4235 -- Total Cycles 126508 ---- Thread 25 ---- PC 5: Stalled ----- 87103 in-flight CPI 1.4521 -- Total Cycles 126508 ---- Thread 26 ---- PC 5: Stalled ----- 91537 in-flight CPI 1.3818 -- Total Cycles 126508 ---- Thread 27 ---- PC 5: Stalled ----- 91310 in-flight CPI 1.3852 -- Total Cycles 126508 ---- Thread 28 ---- PC 5: Stalled ----- 93011 in-flight CPI 1.3599 -- Total Cycles 126508 ---- Thread 29 ---- PC 5: Stalled ----- 83410 in-flight CPI 1.5165 -- Total Cycles 126508 ---- Thread 30 ---- PC 5: Stalled ----- 87982 in-flight CPI 1.4377 -- Total Cycles 126508 ---- Thread 31 ---- PC 5: Stalled ----- 82030 in-flight CPI 1.5419 -- Total Cycles 126508 Total CPI 0.0421 , IPC 23.7284 -- Total Cycles 126508 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7676 (3.672024%) FPSUB: 0 (0.000000%) FPMUL: 31520 (15.078454%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 85861 (41.073957%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5592 (2.675086%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70674 (33.808840%) DIV: 7457 (3.567260%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.124378%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3291019 total) ADD%: 7.558 (248746) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.549 (50989) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.553 (18197) FPSUB%: 0.000 (0) FPMUL%: 4.791 (157669) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.152 (169550) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (583) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35128) FPLE%: 0.461 (15158) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.813 (92572) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (24622) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.745 (518167) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (38746) ORI%: 1.582 (52064) XORI%: 0.000 (0) MULI%: 3.215 (105792) LW%: 1.135 (37352) LWI%: 13.517 (444863) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9472) SWI%: 4.074 (134078) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.406 (46267) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10229) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1882) bned%: 0.000 (0) bneid%: 13.870 (456454) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.729 (23999) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4020) DIV%: 0.012 (404) FPUN%: 1.501 (49393) FPRSUB%: 3.675 (120953) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (79) FPGT%: 2.949 (97036) FPGE%: 1.040 (34235) SYNC%: 0.000 (0) NOP%: 8.785 (289130) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 11 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 57 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 39446 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1630 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48533 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 10988 XORI 0 MULI 9416 LW 0 LWI 140814 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 74 DIV 34 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7287 --Total thread-cycles: 4048256 --total thread-cycles issued: 3001889 (74.152647%) --iCache conflicts: 111025 (2.742539%) --thread*cycles of FU dependence: 251471 (6.211835%) --thread*cycles of data dependence: 209040 (5.163705%) --iCache cycles*banks: 4048256 (81.295526% used) Issue breakdown: --thread*cycles of issue worked: 3001889 (74.152647%) --thread*cycles of issue failed: 757237 (18.705265%) --thread*cycles of issue NOP/other: 289130 (7.142088%) Number of thread-cycles not ready: 209040 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3291019 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 9 3: 7 4: 7 5: 8 6: 8 7: 9 8: 7 9: 8 10: 5 11: 8 12: 8 13: 8 14: 6 15: 7 16: 7 17: 8 18: 6 19: 7 20: 7 21: 9 22: 7 23: 7 24: 8 25: 7 26: 7 27: 8 28: 8 29: 5 30: 6 31: 7 <=== Core 39 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96964 in-flight CPI 1.4327 -- Total Cycles 138950 ---- Thread 01 ---- PC 5: Stalled ----- 94869 in-flight CPI 1.4644 -- Total Cycles 138950 ---- Thread 02 ---- PC 5: Stalled ----- 101899 in-flight CPI 1.3633 -- Total Cycles 138950 ---- Thread 03 ---- PC 5: Stalled ----- 98864 in-flight CPI 1.4051 -- Total Cycles 138950 ---- Thread 04 ---- PC 5: Stalled ----- 99397 in-flight CPI 1.3977 -- Total Cycles 138950 ---- Thread 05 ---- PC 5: Stalled ----- 94253 in-flight CPI 1.4740 -- Total Cycles 138950 ---- Thread 06 ---- PC 5: Stalled ----- 97120 in-flight CPI 1.4305 -- Total Cycles 138950 ---- Thread 07 ---- PC 5: Stalled ----- 103883 in-flight CPI 1.3374 -- Total Cycles 138950 ---- Thread 08 ---- PC 5: Stalled ----- 95043 in-flight CPI 1.4617 -- Total Cycles 138950 ---- Thread 09 ---- PC 5: Stalled ----- 97376 in-flight CPI 1.4267 -- Total Cycles 138950 ---- Thread 10 ---- PC 5: Stalled ----- 96747 in-flight CPI 1.4359 -- Total Cycles 138950 ---- Thread 11 ---- PC 5: Stalled ----- 96057 in-flight CPI 1.4462 -- Total Cycles 138950 ---- Thread 12 ---- PC 5: Stalled ----- 92204 in-flight CPI 1.5067 -- Total Cycles 138950 ---- Thread 13 ---- PC 5: Stalled ----- 95166 in-flight CPI 1.4598 -- Total Cycles 138950 ---- Thread 14 ---- PC 5: Stalled ----- 99868 in-flight CPI 1.3911 -- Total Cycles 138950 ---- Thread 15 ---- PC 5: Stalled ----- 95458 in-flight CPI 1.4554 -- Total Cycles 138950 ---- Thread 16 ---- PC 5: Stalled ----- 98260 in-flight CPI 1.4139 -- Total Cycles 138950 ---- Thread 17 ---- PC 5: Stalled ----- 97549 in-flight CPI 1.4241 -- Total Cycles 138950 ---- Thread 18 ---- PC 5: Stalled ----- 92791 in-flight CPI 1.4972 -- Total Cycles 138950 ---- Thread 19 ---- PC 5: Stalled ----- 96827 in-flight CPI 1.4348 -- Total Cycles 138950 ---- Thread 20 ---- PC 5: Stalled ----- 93499 in-flight CPI 1.4859 -- Total Cycles 138950 ---- Thread 21 ---- PC 5: Stalled ----- 92990 in-flight CPI 1.4940 -- Total Cycles 138950 ---- Thread 22 ---- PC 5: Stalled ----- 89283 in-flight CPI 1.5561 -- Total Cycles 138950 ---- Thread 23 ---- PC 5: Stalled ----- 92328 in-flight CPI 1.5047 -- Total Cycles 138950 ---- Thread 24 ---- PC 5: Stalled ----- 85682 in-flight CPI 1.6215 -- Total Cycles 138950 ---- Thread 25 ---- PC 5: Stalled ----- 91514 in-flight CPI 1.5180 -- Total Cycles 138950 ---- Thread 26 ---- PC 5: Stalled ----- 88859 in-flight CPI 1.5634 -- Total Cycles 138950 ---- Thread 27 ---- PC 5: Stalled ----- 89409 in-flight CPI 1.5539 -- Total Cycles 138950 ---- Thread 28 ---- PC 5: Stalled ----- 87176 in-flight CPI 1.5936 -- Total Cycles 138950 ---- Thread 29 ---- PC 5: Stalled ----- 96379 in-flight CPI 1.4414 -- Total Cycles 138950 ---- Thread 30 ---- PC 5: Stalled ----- 90005 in-flight CPI 1.5435 -- Total Cycles 138950 ---- Thread 31 ---- PC 5: Stalled ----- 84961 in-flight CPI 1.6352 -- Total Cycles 138950 Total CPI 0.0460 , IPC 21.7576 -- Total Cycles 138950 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7760 (3.792210%) FPSUB: 0 (0.000000%) FPMUL: 31686 (15.484533%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 81007 (39.587060%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5348 (2.613498%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71222 (34.805258%) DIV: 7351 (3.592337%) FPUN: 0 (0.000000%) FPRSUB: 256 (0.125104%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3314851 total) ADD%: 7.468 (247568) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.531 (50744) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.551 (18264) FPSUB%: 0.000 (0) FPMUL%: 4.794 (158924) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.189 (172012) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (567) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35278) FPLE%: 0.461 (15268) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.826 (93674) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (24909) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.779 (523038) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.182 (39166) ORI%: 1.569 (51997) XORI%: 0.000 (0) MULI%: 3.219 (106708) LW%: 1.140 (37788) LWI%: 13.544 (448955) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9612) SWI%: 4.085 (135421) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.411 (46777) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.314 (10396) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1972) bned%: 0.000 (0) bneid%: 13.858 (459388) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.724 (23995) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4053) DIV%: 0.012 (398) FPUN%: 1.483 (49167) FPRSUB%: 3.683 (122094) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.957 (98029) FPGE%: 1.023 (33899) SYNC%: 0.000 (0) NOP%: 8.796 (291574) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 65 FPCMPLT 0 FPMIN 0 FPMAX 382 LOAD 39752 INTCONV 0 ATOMIC_INC 23 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 11 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1645 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49004 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 21 ORI 11160 XORI 0 MULI 9730 LW 0 LWI 142005 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 80 DIV 18 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7579 --Total thread-cycles: 4446400 --total thread-cycles issued: 3023277 (67.993815%) --iCache conflicts: 113111 (2.543878%) --thread*cycles of FU dependence: 253965 (5.711699%) --thread*cycles of data dependence: 204630 (4.602150%) --iCache cycles*banks: 4446400 (74.552065% used) Issue breakdown: --thread*cycles of issue worked: 3023277 (67.993815%) --thread*cycles of issue failed: 1131549 (25.448655%) --thread*cycles of issue NOP/other: 291574 (6.557530%) Number of thread-cycles not ready: 204630 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3314851 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 9 4: 8 5: 7 6: 7 7: 6 8: 7 9: 7 10: 8 11: 8 12: 7 13: 8 14: 8 15: 7 16: 7 17: 8 18: 6 19: 7 20: 7 21: 7 22: 6 23: 7 24: 5 25: 8 26: 8 27: 6 28: 7 29: 8 30: 8 31: 6 <=== Core 40 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98701 in-flight CPI 1.2860 -- Total Cycles 126958 ---- Thread 01 ---- PC 5: Stalled ----- 99855 in-flight CPI 1.2712 -- Total Cycles 126958 ---- Thread 02 ---- PC 5: Stalled ----- 95050 in-flight CPI 1.3355 -- Total Cycles 126958 ---- Thread 03 ---- PC 5: Stalled ----- 94371 in-flight CPI 1.3451 -- Total Cycles 126958 ---- Thread 04 ---- PC 5: Stalled ----- 99816 in-flight CPI 1.2716 -- Total Cycles 126958 ---- Thread 05 ---- PC 5: Stalled ----- 98245 in-flight CPI 1.2920 -- Total Cycles 126958 ---- Thread 06 ---- PC 5: Stalled ----- 102392 in-flight CPI 1.2397 -- Total Cycles 126958 ---- Thread 07 ---- PC 5: Stalled ----- 93551 in-flight CPI 1.3569 -- Total Cycles 126958 ---- Thread 08 ---- PC 5: Stalled ----- 98393 in-flight CPI 1.2901 -- Total Cycles 126958 ---- Thread 09 ---- PC 5: Stalled ----- 99303 in-flight CPI 1.2783 -- Total Cycles 126958 ---- Thread 10 ---- PC 5: Stalled ----- 95956 in-flight CPI 1.3228 -- Total Cycles 126958 ---- Thread 11 ---- PC 5: Stalled ----- 95694 in-flight CPI 1.3264 -- Total Cycles 126958 ---- Thread 12 ---- PC 5: Stalled ----- 96880 in-flight CPI 1.3102 -- Total Cycles 126958 ---- Thread 13 ---- PC 5: Stalled ----- 92440 in-flight CPI 1.3732 -- Total Cycles 126958 ---- Thread 14 ---- PC 5: Stalled ----- 91918 in-flight CPI 1.3809 -- Total Cycles 126958 ---- Thread 15 ---- PC 5: Stalled ----- 91375 in-flight CPI 1.3892 -- Total Cycles 126958 ---- Thread 16 ---- PC 5: Stalled ----- 97280 in-flight CPI 1.3048 -- Total Cycles 126958 ---- Thread 17 ---- PC 5: Stalled ----- 95243 in-flight CPI 1.3328 -- Total Cycles 126958 ---- Thread 18 ---- PC 5: Stalled ----- 91007 in-flight CPI 1.3948 -- Total Cycles 126958 ---- Thread 19 ---- PC 5: Stalled ----- 88905 in-flight CPI 1.4278 -- Total Cycles 126958 ---- Thread 20 ---- PC 5: Stalled ----- 92466 in-flight CPI 1.3727 -- Total Cycles 126958 ---- Thread 21 ---- PC 5: Stalled ----- 91949 in-flight CPI 1.3805 -- Total Cycles 126958 ---- Thread 22 ---- PC 5: Stalled ----- 94374 in-flight CPI 1.3451 -- Total Cycles 126958 ---- Thread 23 ---- PC 5: Stalled ----- 88976 in-flight CPI 1.4266 -- Total Cycles 126958 ---- Thread 24 ---- PC 5: Stalled ----- 87097 in-flight CPI 1.4574 -- Total Cycles 126958 ---- Thread 25 ---- PC 5: Stalled ----- 95149 in-flight CPI 1.3340 -- Total Cycles 126958 ---- Thread 26 ---- PC 5: Stalled ----- 86277 in-flight CPI 1.4713 -- Total Cycles 126958 ---- Thread 27 ---- PC 5: Stalled ----- 90131 in-flight CPI 1.4083 -- Total Cycles 126958 ---- Thread 28 ---- PC 5: Stalled ----- 86139 in-flight CPI 1.4736 -- Total Cycles 126958 ---- Thread 29 ---- PC 5: Stalled ----- 89241 in-flight CPI 1.4224 -- Total Cycles 126958 ---- Thread 30 ---- PC 5: Stalled ----- 84294 in-flight CPI 1.5059 -- Total Cycles 126958 ---- Thread 31 ---- PC 5: Stalled ----- 84672 in-flight CPI 1.4991 -- Total Cycles 126958 Total CPI 0.0425 , IPC 23.5328 -- Total Cycles 126958 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7851 (3.944889%) FPSUB: 0 (0.000000%) FPMUL: 31728 (15.942357%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 74645 (37.506846%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5430 (2.728410%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71792 (36.073300%) DIV: 7313 (3.674560%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.129637%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3276207 total) ADD%: 7.482 (245124) SUB%: 0.000 (0) MUL%: 0.006 (198) BITOR%: 1.542 (50514) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.563 (18431) FPSUB%: 0.000 (0) FPMUL%: 4.825 (158073) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (594) FPMAX%: 0.018 (594) LOAD%: 5.175 (169539) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (230) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (570) FPINV%: 0.000 (0) FPCONV%: 0.019 (626) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.074 (35195) FPLE%: 0.459 (15038) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (594) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.806 (91918) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.756 (24757) CMPU%: 0.000 (0) RSUB%: 0.006 (198) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.755 (516166) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (38608) ORI%: 1.582 (51818) XORI%: 0.000 (0) MULI%: 3.208 (105112) LW%: 1.132 (37084) LWI%: 13.515 (442765) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9418) SWI%: 4.066 (133215) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.402 (45922) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10203) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1876) bned%: 0.000 (0) bneid%: 13.880 (454732) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (23607) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4089) DIV%: 0.012 (396) FPUN%: 1.489 (48792) FPRSUB%: 3.691 (120938) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (64) FPGT%: 2.957 (96886) FPGE%: 1.030 (33754) SYNC%: 0.000 (0) NOP%: 8.805 (288473) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 48 FPCMPLT 0 FPMIN 0 FPMAX 389 LOAD 39508 INTCONV 0 ATOMIC_INC 26 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1349 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48320 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 16 ORI 11224 XORI 0 MULI 9509 LW 0 LWI 140264 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 89 DIV 26 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5331 --Total thread-cycles: 4062656 --total thread-cycles issued: 2987734 (73.541398%) --iCache conflicts: 110967 (2.731390%) --thread*cycles of FU dependence: 250841 (6.174311%) --thread*cycles of data dependence: 199017 (4.898692%) --iCache cycles*banks: 4062656 (80.642786% used) Issue breakdown: --thread*cycles of issue worked: 2987734 (73.541398%) --thread*cycles of issue failed: 786449 (19.358001%) --thread*cycles of issue NOP/other: 288473 (7.100601%) Number of thread-cycles not ready: 199017 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3276207 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 7 4: 9 5: 8 6: 7 7: 7 8: 7 9: 7 10: 8 11: 8 12: 7 13: 6 14: 8 15: 7 16: 9 17: 7 18: 7 19: 6 20: 8 21: 7 22: 6 23: 7 24: 6 25: 9 26: 6 27: 8 28: 6 29: 6 30: 6 31: 8 <=== Core 41 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95443 in-flight CPI 1.3678 -- Total Cycles 130576 ---- Thread 01 ---- PC 5: Stalled ----- 93141 in-flight CPI 1.4016 -- Total Cycles 130576 ---- Thread 02 ---- PC 5: Stalled ----- 102197 in-flight CPI 1.2775 -- Total Cycles 130576 ---- Thread 03 ---- PC 5: Stalled ----- 98171 in-flight CPI 1.3299 -- Total Cycles 130576 ---- Thread 04 ---- PC 5: Stalled ----- 98965 in-flight CPI 1.3192 -- Total Cycles 130576 ---- Thread 05 ---- PC 5: Stalled ----- 98869 in-flight CPI 1.3205 -- Total Cycles 130576 ---- Thread 06 ---- PC 5: Stalled ----- 96390 in-flight CPI 1.3544 -- Total Cycles 130576 ---- Thread 07 ---- PC 5: Stalled ----- 95635 in-flight CPI 1.3651 -- Total Cycles 130576 ---- Thread 08 ---- PC 5: Stalled ----- 97917 in-flight CPI 1.3333 -- Total Cycles 130576 ---- Thread 09 ---- PC 5: Stalled ----- 102777 in-flight CPI 1.2702 -- Total Cycles 130576 ---- Thread 10 ---- PC 5: Stalled ----- 89861 in-flight CPI 1.4529 -- Total Cycles 130576 ---- Thread 11 ---- PC 5: Stalled ----- 93507 in-flight CPI 1.3961 -- Total Cycles 130576 ---- Thread 12 ---- PC 5: Stalled ----- 96569 in-flight CPI 1.3519 -- Total Cycles 130576 ---- Thread 13 ---- PC 5: Stalled ----- 98195 in-flight CPI 1.3295 -- Total Cycles 130576 ---- Thread 14 ---- PC 5: Stalled ----- 90606 in-flight CPI 1.4409 -- Total Cycles 130576 ---- Thread 15 ---- PC 5: Stalled ----- 100842 in-flight CPI 1.2946 -- Total Cycles 130576 ---- Thread 16 ---- PC 5: Stalled ----- 96725 in-flight CPI 1.3497 -- Total Cycles 130576 ---- Thread 17 ---- PC 5: Stalled ----- 93711 in-flight CPI 1.3931 -- Total Cycles 130576 ---- Thread 18 ---- PC 5: Stalled ----- 95357 in-flight CPI 1.3691 -- Total Cycles 130576 ---- Thread 19 ---- PC 5: Stalled ----- 97935 in-flight CPI 1.3331 -- Total Cycles 130576 ---- Thread 20 ---- PC 5: Stalled ----- 93968 in-flight CPI 1.3893 -- Total Cycles 130576 ---- Thread 21 ---- PC 5: Stalled ----- 92362 in-flight CPI 1.4135 -- Total Cycles 130576 ---- Thread 22 ---- PC 5: Stalled ----- 94990 in-flight CPI 1.3744 -- Total Cycles 130576 ---- Thread 23 ---- PC 5: Stalled ----- 90577 in-flight CPI 1.4414 -- Total Cycles 130576 ---- Thread 24 ---- PC 5: Stalled ----- 92583 in-flight CPI 1.4101 -- Total Cycles 130576 ---- Thread 25 ---- PC 5: Stalled ----- 92370 in-flight CPI 1.4133 -- Total Cycles 130576 ---- Thread 26 ---- PC 5: Stalled ----- 90170 in-flight CPI 1.4478 -- Total Cycles 130576 ---- Thread 27 ---- PC 5: Stalled ----- 92975 in-flight CPI 1.4043 -- Total Cycles 130576 ---- Thread 28 ---- PC 5: Stalled ----- 88581 in-flight CPI 1.4738 -- Total Cycles 130576 ---- Thread 29 ---- PC 5: Stalled ----- 90192 in-flight CPI 1.4474 -- Total Cycles 130576 ---- Thread 30 ---- PC 5: Stalled ----- 88250 in-flight CPI 1.4793 -- Total Cycles 130576 ---- Thread 31 ---- PC 5: Stalled ----- 89735 in-flight CPI 1.4549 -- Total Cycles 130576 Total CPI 0.0431 , IPC 23.2057 -- Total Cycles 130576 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7975 (3.923295%) FPSUB: 0 (0.000000%) FPMUL: 32330 (15.904719%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 77066 (37.912561%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5381 (2.647179%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72887 (35.856705%) DIV: 7376 (3.628618%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.126923%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3323058 total) ADD%: 7.431 (246940) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.537 (51073) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.567 (18857) FPSUB%: 0.000 (0) FPMUL%: 4.843 (160924) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.179 (172102) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (570) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.076 (35749) FPLE%: 0.457 (15194) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.810 (93375) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (24897) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.748 (523331) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39018) ORI%: 1.589 (52795) XORI%: 0.000 (0) MULI%: 3.211 (106688) LW%: 1.134 (37670) LWI%: 13.528 (449542) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9566) SWI%: 4.061 (134940) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.404 (46645) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10341) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.063 (2085) bned%: 0.000 (0) bneid%: 13.874 (461057) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.724 (24069) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4152) DIV%: 0.012 (400) FPUN%: 1.490 (49525) FPRSUB%: 3.698 (122882) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.956 (98219) FPGE%: 1.033 (34331) SYNC%: 0.000 (0) NOP%: 8.814 (292892) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 31 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 75 FPCMPLT 0 FPMIN 0 FPMAX 386 LOAD 39605 INTCONV 0 ATOMIC_INC 27 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1295 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 14 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49040 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11462 XORI 0 MULI 9419 LW 0 LWI 142352 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 87 DIV 33 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.2060 --Total thread-cycles: 4178432 --total thread-cycles issued: 3030166 (72.519213%) --iCache conflicts: 114183 (2.732676%) --thread*cycles of FU dependence: 253893 (6.076275%) --thread*cycles of data dependence: 203273 (4.864815%) --iCache cycles*banks: 4178432 (79.529594% used) Issue breakdown: --thread*cycles of issue worked: 3030166 (72.519213%) --thread*cycles of issue failed: 855374 (20.471172%) --thread*cycles of issue NOP/other: 292892 (7.009615%) Number of thread-cycles not ready: 203273 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3323058 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 6 3: 7 4: 7 5: 7 6: 8 7: 7 8: 7 9: 8 10: 6 11: 8 12: 7 13: 8 14: 6 15: 9 16: 9 17: 7 18: 8 19: 7 20: 7 21: 7 22: 8 23: 6 24: 7 25: 8 26: 7 27: 5 28: 7 29: 9 30: 7 31: 6 <=== Core 42 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100622 in-flight CPI 1.2712 -- Total Cycles 127942 ---- Thread 01 ---- PC 5: Stalled ----- 99489 in-flight CPI 1.2858 -- Total Cycles 127942 ---- Thread 02 ---- PC 5: Stalled ----- 98462 in-flight CPI 1.2992 -- Total Cycles 127942 ---- Thread 03 ---- PC 5: Stalled ----- 89244 in-flight CPI 1.4334 -- Total Cycles 127942 ---- Thread 04 ---- PC 5: Stalled ----- 96478 in-flight CPI 1.3259 -- Total Cycles 127942 ---- Thread 05 ---- PC 5: Stalled ----- 96656 in-flight CPI 1.3235 -- Total Cycles 127942 ---- Thread 06 ---- PC 5: Stalled ----- 101750 in-flight CPI 1.2571 -- Total Cycles 127942 ---- Thread 07 ---- PC 5: Stalled ----- 98173 in-flight CPI 1.3030 -- Total Cycles 127942 ---- Thread 08 ---- PC 5: Stalled ----- 96695 in-flight CPI 1.3229 -- Total Cycles 127942 ---- Thread 09 ---- PC 5: Stalled ----- 101334 in-flight CPI 1.2624 -- Total Cycles 127942 ---- Thread 10 ---- PC 5: Stalled ----- 96958 in-flight CPI 1.3193 -- Total Cycles 127942 ---- Thread 11 ---- PC 5: Stalled ----- 98152 in-flight CPI 1.3033 -- Total Cycles 127942 ---- Thread 12 ---- PC 5: Stalled ----- 94953 in-flight CPI 1.3472 -- Total Cycles 127942 ---- Thread 13 ---- PC 5: Stalled ----- 100967 in-flight CPI 1.2669 -- Total Cycles 127942 ---- Thread 14 ---- PC 5: Stalled ----- 92060 in-flight CPI 1.3896 -- Total Cycles 127942 ---- Thread 15 ---- PC 5: Stalled ----- 94944 in-flight CPI 1.3473 -- Total Cycles 127942 ---- Thread 16 ---- PC 5: Stalled ----- 98431 in-flight CPI 1.2996 -- Total Cycles 127942 ---- Thread 17 ---- PC 5: Stalled ----- 93401 in-flight CPI 1.3695 -- Total Cycles 127942 ---- Thread 18 ---- PC 5: Stalled ----- 94505 in-flight CPI 1.3536 -- Total Cycles 127942 ---- Thread 19 ---- PC 5: Stalled ----- 94923 in-flight CPI 1.3476 -- Total Cycles 127942 ---- Thread 20 ---- PC 5: Stalled ----- 87789 in-flight CPI 1.4572 -- Total Cycles 127942 ---- Thread 21 ---- PC 5: Stalled ----- 96157 in-flight CPI 1.3303 -- Total Cycles 127942 ---- Thread 22 ---- PC 5: Stalled ----- 91247 in-flight CPI 1.4019 -- Total Cycles 127942 ---- Thread 23 ---- PC 5: Stalled ----- 95509 in-flight CPI 1.3394 -- Total Cycles 127942 ---- Thread 24 ---- PC 5: Stalled ----- 94905 in-flight CPI 1.3478 -- Total Cycles 127942 ---- Thread 25 ---- PC 5: Stalled ----- 93874 in-flight CPI 1.3626 -- Total Cycles 127942 ---- Thread 26 ---- PC 5: Stalled ----- 91333 in-flight CPI 1.4006 -- Total Cycles 127942 ---- Thread 27 ---- PC 5: Stalled ----- 87414 in-flight CPI 1.4634 -- Total Cycles 127942 ---- Thread 28 ---- PC 5: Stalled ----- 85576 in-flight CPI 1.4948 -- Total Cycles 127942 ---- Thread 29 ---- PC 5: Stalled ----- 85368 in-flight CPI 1.4984 -- Total Cycles 127942 ---- Thread 30 ---- PC 5: Stalled ----- 86672 in-flight CPI 1.4759 -- Total Cycles 127942 ---- Thread 31 ---- PC 5: Stalled ----- 86983 in-flight CPI 1.4706 -- Total Cycles 127942 Total CPI 0.0423 , IPC 23.6168 -- Total Cycles 127942 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7852 (3.996315%) FPSUB: 0 (0.000000%) FPMUL: 31877 (16.223961%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 71435 (36.357205%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5583 (2.841496%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72020 (36.654944%) DIV: 7456 (3.794769%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.131310%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3312949 total) ADD%: 7.475 (247645) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.532 (50770) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.558 (18495) FPSUB%: 0.000 (0) FPMUL%: 4.814 (159470) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.184 (171751) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (582) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35373) FPLE%: 0.458 (15157) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.821 (93447) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (24889) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.766 (522323) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.180 (39094) ORI%: 1.575 (52187) XORI%: 0.000 (0) MULI%: 3.217 (106562) LW%: 1.138 (37702) LWI%: 13.533 (448351) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9568) SWI%: 4.083 (135254) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.409 (46695) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10333) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1971) bned%: 0.000 (0) bneid%: 13.858 (459115) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (23884) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4100) DIV%: 0.012 (404) FPUN%: 1.484 (49176) FPRSUB%: 3.687 (122151) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.955 (97903) FPGE%: 1.027 (34019) SYNC%: 0.000 (0) NOP%: 8.793 (291319) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 44 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 40274 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1764 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48944 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11172 XORI 0 MULI 9681 LW 0 LWI 141945 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 90 DIV 36 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6170 --Total thread-cycles: 4094144 --total thread-cycles issued: 3021630 (73.803706%) --iCache conflicts: 113083 (2.762067%) --thread*cycles of FU dependence: 254438 (6.214681%) --thread*cycles of data dependence: 196481 (4.799074%) --iCache cycles*banks: 4094144 (80.919992% used) Issue breakdown: --thread*cycles of issue worked: 3021630 (73.803706%) --thread*cycles of issue failed: 781195 (19.080790%) --thread*cycles of issue NOP/other: 291319 (7.115504%) Number of thread-cycles not ready: 196481 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3312949 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 7 3: 6 4: 7 5: 7 6: 9 7: 7 8: 8 9: 7 10: 9 11: 8 12: 7 13: 8 14: 6 15: 7 16: 7 17: 8 18: 7 19: 8 20: 6 21: 8 22: 7 23: 7 24: 8 25: 8 26: 6 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 43 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98659 in-flight CPI 1.3107 -- Total Cycles 129331 ---- Thread 01 ---- PC 5: Stalled ----- 98341 in-flight CPI 1.3149 -- Total Cycles 129331 ---- Thread 02 ---- PC 5: Stalled ----- 98402 in-flight CPI 1.3141 -- Total Cycles 129331 ---- Thread 03 ---- PC 5: Stalled ----- 91676 in-flight CPI 1.4105 -- Total Cycles 129331 ---- Thread 04 ---- PC 5: Stalled ----- 101758 in-flight CPI 1.2708 -- Total Cycles 129331 ---- Thread 05 ---- PC 5: Stalled ----- 93757 in-flight CPI 1.3791 -- Total Cycles 129331 ---- Thread 06 ---- PC 5: Stalled ----- 103183 in-flight CPI 1.2531 -- Total Cycles 129331 ---- Thread 07 ---- PC 5: Stalled ----- 102074 in-flight CPI 1.2668 -- Total Cycles 129331 ---- Thread 08 ---- PC 5: Stalled ----- 100193 in-flight CPI 1.2906 -- Total Cycles 129331 ---- Thread 09 ---- PC 5: Stalled ----- 98771 in-flight CPI 1.3091 -- Total Cycles 129331 ---- Thread 10 ---- PC 5: Stalled ----- 98177 in-flight CPI 1.3171 -- Total Cycles 129331 ---- Thread 11 ---- PC 5: Stalled ----- 95701 in-flight CPI 1.3512 -- Total Cycles 129331 ---- Thread 12 ---- PC 5: Stalled ----- 101593 in-flight CPI 1.2727 -- Total Cycles 129331 ---- Thread 13 ---- PC 5: Stalled ----- 98748 in-flight CPI 1.3095 -- Total Cycles 129331 ---- Thread 14 ---- PC 5: Stalled ----- 95580 in-flight CPI 1.3529 -- Total Cycles 129331 ---- Thread 15 ---- PC 5: Stalled ----- 94305 in-flight CPI 1.3712 -- Total Cycles 129331 ---- Thread 16 ---- PC 5: Stalled ----- 89516 in-flight CPI 1.4446 -- Total Cycles 129331 ---- Thread 17 ---- PC 5: Stalled ----- 95189 in-flight CPI 1.3584 -- Total Cycles 129331 ---- Thread 18 ---- PC 5: Stalled ----- 97411 in-flight CPI 1.3274 -- Total Cycles 129331 ---- Thread 19 ---- PC 5: Stalled ----- 99603 in-flight CPI 1.2982 -- Total Cycles 129331 ---- Thread 20 ---- PC 5: Stalled ----- 98176 in-flight CPI 1.3170 -- Total Cycles 129331 ---- Thread 21 ---- PC 5: Stalled ----- 95101 in-flight CPI 1.3597 -- Total Cycles 129331 ---- Thread 22 ---- PC 5: Stalled ----- 91918 in-flight CPI 1.4069 -- Total Cycles 129331 ---- Thread 23 ---- PC 5: Stalled ----- 94222 in-flight CPI 1.3724 -- Total Cycles 129331 ---- Thread 24 ---- PC 5: Stalled ----- 91287 in-flight CPI 1.4165 -- Total Cycles 129331 ---- Thread 25 ---- PC 5: Stalled ----- 92330 in-flight CPI 1.4005 -- Total Cycles 129331 ---- Thread 26 ---- PC 5: Stalled ----- 84843 in-flight CPI 1.5241 -- Total Cycles 129331 ---- Thread 27 ---- PC 5: Stalled ----- 85737 in-flight CPI 1.5082 -- Total Cycles 129331 ---- Thread 28 ---- PC 5: Stalled ----- 89778 in-flight CPI 1.4403 -- Total Cycles 129331 ---- Thread 29 ---- PC 5: Stalled ----- 89376 in-flight CPI 1.4468 -- Total Cycles 129331 ---- Thread 30 ---- PC 5: Stalled ----- 85032 in-flight CPI 1.5207 -- Total Cycles 129331 ---- Thread 31 ---- PC 5: Stalled ----- 82178 in-flight CPI 1.5735 -- Total Cycles 129331 Total CPI 0.0426 , IPC 23.4527 -- Total Cycles 129331 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8503 (3.947704%) FPSUB: 0 (0.000000%) FPMUL: 33065 (15.351152%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 82880 (38.478859%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5557 (2.579959%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 77703 (36.075323%) DIV: 7419 (3.444434%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.122568%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3325549 total) ADD%: 7.473 (248515) SUB%: 0.000 (0) MUL%: 0.006 (201) BITOR%: 1.539 (51169) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.596 (19834) FPSUB%: 0.000 (0) FPMUL%: 4.923 (163729) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (603) FPMAX%: 0.018 (603) LOAD%: 5.225 (173770) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (580) FPINV%: 0.000 (0) FPCONV%: 0.019 (635) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.085 (36082) FPLE%: 0.454 (15095) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (603) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.791 (92806) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.767 (25502) CMPU%: 0.000 (0) RSUB%: 0.006 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.720 (522778) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39083) ORI%: 1.607 (53439) XORI%: 0.000 (0) MULI%: 3.187 (105994) LW%: 1.126 (37444) LWI%: 13.459 (447577) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9491) SWI%: 4.050 (134672) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.395 (46387) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10332) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.065 (2145) bned%: 0.000 (0) bneid%: 13.832 (459998) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23842) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.133 (4435) DIV%: 0.012 (402) FPUN%: 1.483 (49329) FPRSUB%: 3.721 (123758) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.933 (97551) FPGE%: 1.029 (34234) SYNC%: 0.000 (0) NOP%: 8.790 (292331) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 34 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 67 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 40644 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1334 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48766 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 18 ORI 12083 XORI 0 MULI 9348 LW 0 LWI 141985 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 97 DIV 29 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4530 --Total thread-cycles: 4138592 --total thread-cycles issued: 3033218 (73.291061%) --iCache conflicts: 113544 (2.743542%) --thread*cycles of FU dependence: 254857 (6.158061%) --thread*cycles of data dependence: 215391 (5.204451%) --iCache cycles*banks: 4138592 (80.355372% used) Issue breakdown: --thread*cycles of issue worked: 3033218 (73.291061%) --thread*cycles of issue failed: 813043 (19.645401%) --thread*cycles of issue NOP/other: 292331 (7.063538%) Number of thread-cycles not ready: 215391 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3325549 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 6 4: 7 5: 8 6: 9 7: 8 8: 8 9: 8 10: 7 11: 7 12: 10 13: 7 14: 6 15: 7 16: 6 17: 8 18: 8 19: 9 20: 9 21: 7 22: 5 23: 7 24: 7 25: 7 26: 6 27: 7 28: 7 29: 7 30: 7 31: 6 <=== Core 44 ===> ---- Thread 00 ---- PC 5: Stalled ----- 91336 in-flight CPI 1.3912 -- Total Cycles 127087 ---- Thread 01 ---- PC 5: Stalled ----- 92952 in-flight CPI 1.3670 -- Total Cycles 127087 ---- Thread 02 ---- PC 5: Stalled ----- 99135 in-flight CPI 1.2817 -- Total Cycles 127087 ---- Thread 03 ---- PC 5: Stalled ----- 100563 in-flight CPI 1.2635 -- Total Cycles 127087 ---- Thread 04 ---- PC 5: Stalled ----- 103953 in-flight CPI 1.2223 -- Total Cycles 127087 ---- Thread 05 ---- PC 5: Stalled ----- 93545 in-flight CPI 1.3583 -- Total Cycles 127087 ---- Thread 06 ---- PC 5: Stalled ----- 97920 in-flight CPI 1.2976 -- Total Cycles 127087 ---- Thread 07 ---- PC 5: Stalled ----- 98185 in-flight CPI 1.2941 -- Total Cycles 127087 ---- Thread 08 ---- PC 5: Stalled ----- 93457 in-flight CPI 1.3596 -- Total Cycles 127087 ---- Thread 09 ---- PC 5: Stalled ----- 101623 in-flight CPI 1.2503 -- Total Cycles 127087 ---- Thread 10 ---- PC 5: Stalled ----- 93693 in-flight CPI 1.3562 -- Total Cycles 127087 ---- Thread 11 ---- PC 5: Stalled ----- 95808 in-flight CPI 1.3263 -- Total Cycles 127087 ---- Thread 12 ---- PC 5: Stalled ----- 94395 in-flight CPI 1.3461 -- Total Cycles 127087 ---- Thread 13 ---- PC 5: Stalled ----- 97058 in-flight CPI 1.3091 -- Total Cycles 127087 ---- Thread 14 ---- PC 5: Stalled ----- 91424 in-flight CPI 1.3899 -- Total Cycles 127087 ---- Thread 15 ---- PC 5: Stalled ----- 94885 in-flight CPI 1.3392 -- Total Cycles 127087 ---- Thread 16 ---- PC 5: Stalled ----- 96034 in-flight CPI 1.3231 -- Total Cycles 127087 ---- Thread 17 ---- PC 5: Stalled ----- 99637 in-flight CPI 1.2752 -- Total Cycles 127087 ---- Thread 18 ---- PC 5: Stalled ----- 93321 in-flight CPI 1.3616 -- Total Cycles 127087 ---- Thread 19 ---- PC 5: Stalled ----- 99344 in-flight CPI 1.2790 -- Total Cycles 127087 ---- Thread 20 ---- PC 5: Stalled ----- 93207 in-flight CPI 1.3632 -- Total Cycles 127087 ---- Thread 21 ---- PC 5: Stalled ----- 94826 in-flight CPI 1.3399 -- Total Cycles 127087 ---- Thread 22 ---- PC 5: Stalled ----- 88209 in-flight CPI 1.4405 -- Total Cycles 127087 ---- Thread 23 ---- PC 5: Stalled ----- 92171 in-flight CPI 1.3785 -- Total Cycles 127087 ---- Thread 24 ---- PC 5: Stalled ----- 87667 in-flight CPI 1.4494 -- Total Cycles 127087 ---- Thread 25 ---- PC 5: Stalled ----- 93666 in-flight CPI 1.3566 -- Total Cycles 127087 ---- Thread 26 ---- PC 5: Stalled ----- 79574 in-flight CPI 1.5969 -- Total Cycles 127087 ---- Thread 27 ---- PC 5: Stalled ----- 88509 in-flight CPI 1.4356 -- Total Cycles 127087 ---- Thread 28 ---- PC 5: Stalled ----- 85783 in-flight CPI 1.4813 -- Total Cycles 127087 ---- Thread 29 ---- PC 5: Stalled ----- 89191 in-flight CPI 1.4246 -- Total Cycles 127087 ---- Thread 30 ---- PC 5: Stalled ----- 93564 in-flight CPI 1.3581 -- Total Cycles 127087 ---- Thread 31 ---- PC 5: Stalled ----- 88194 in-flight CPI 1.4407 -- Total Cycles 127087 Total CPI 0.0423 , IPC 23.6326 -- Total Cycles 127087 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7651 (3.596865%) FPSUB: 0 (0.000000%) FPMUL: 31305 (14.717013%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 90170 (42.390451%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5633 (2.648169%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70094 (32.952382%) DIV: 7600 (3.572889%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.122230%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3292997 total) ADD%: 7.484 (246462) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.531 (50428) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (18064) FPSUB%: 0.000 (0) FPMUL%: 4.782 (157482) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.162 (169996) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (590) FPINV%: 0.000 (0) FPCONV%: 0.020 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (35067) FPLE%: 0.457 (15053) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.826 (93066) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24593) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.765 (519151) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.182 (38913) ORI%: 1.567 (51599) XORI%: 0.000 (0) MULI%: 3.223 (106120) LW%: 1.140 (37556) LWI%: 13.566 (446713) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9536) SWI%: 4.100 (135009) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.412 (46500) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10277) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1834) bned%: 0.000 (0) bneid%: 13.869 (456720) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (23712) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (3984) DIV%: 0.013 (412) FPUN%: 1.484 (48865) FPRSUB%: 3.676 (121059) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (56) FPGT%: 2.963 (97568) FPGE%: 1.027 (33812) SYNC%: 0.000 (0) NOP%: 8.793 (289550) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 49 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 39315 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1446 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48770 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 10820 XORI 0 MULI 9715 LW 0 LWI 141251 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 60 DIV 25 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6328 --Total thread-cycles: 4066784 --total thread-cycles issued: 3003447 (73.853123%) --iCache conflicts: 111400 (2.739265%) --thread*cycles of FU dependence: 251956 (6.195461%) --thread*cycles of data dependence: 212713 (5.230497%) --iCache cycles*banks: 4066784 (80.973787% used) Issue breakdown: --thread*cycles of issue worked: 3003447 (73.853123%) --thread*cycles of issue failed: 773787 (19.027000%) --thread*cycles of issue NOP/other: 289550 (7.119877%) Number of thread-cycles not ready: 212713 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3292997 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 7 2: 8 3: 8 4: 9 5: 8 6: 9 7: 8 8: 8 9: 9 10: 7 11: 7 12: 7 13: 9 14: 6 15: 6 16: 8 17: 9 18: 6 19: 8 20: 8 21: 9 22: 6 23: 8 24: 7 25: 7 26: 5 27: 7 28: 6 29: 8 30: 7 31: 7 <=== Core 45 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94579 in-flight CPI 1.4763 -- Total Cycles 139648 ---- Thread 01 ---- PC 5: Stalled ----- 98497 in-flight CPI 1.4175 -- Total Cycles 139648 ---- Thread 02 ---- PC 5: Stalled ----- 101317 in-flight CPI 1.3780 -- Total Cycles 139648 ---- Thread 03 ---- PC 5: Stalled ----- 93461 in-flight CPI 1.4939 -- Total Cycles 139648 ---- Thread 04 ---- PC 5: Stalled ----- 94750 in-flight CPI 1.4736 -- Total Cycles 139648 ---- Thread 05 ---- PC 5: Stalled ----- 96550 in-flight CPI 1.4461 -- Total Cycles 139648 ---- Thread 06 ---- PC 5: Stalled ----- 104291 in-flight CPI 1.3387 -- Total Cycles 139648 ---- Thread 07 ---- PC 5: Stalled ----- 96986 in-flight CPI 1.4396 -- Total Cycles 139648 ---- Thread 08 ---- PC 5: Stalled ----- 95247 in-flight CPI 1.4660 -- Total Cycles 139648 ---- Thread 09 ---- PC 5: Stalled ----- 94821 in-flight CPI 1.4725 -- Total Cycles 139648 ---- Thread 10 ---- PC 5: Stalled ----- 102683 in-flight CPI 1.3599 -- Total Cycles 139648 ---- Thread 11 ---- PC 5: Stalled ----- 94342 in-flight CPI 1.4800 -- Total Cycles 139648 ---- Thread 12 ---- PC 5: Stalled ----- 97555 in-flight CPI 1.4312 -- Total Cycles 139648 ---- Thread 13 ---- PC 5: Stalled ----- 97149 in-flight CPI 1.4372 -- Total Cycles 139648 ---- Thread 14 ---- PC 5: Stalled ----- 94846 in-flight CPI 1.4721 -- Total Cycles 139648 ---- Thread 15 ---- PC 5: Stalled ----- 92737 in-flight CPI 1.5056 -- Total Cycles 139648 ---- Thread 16 ---- PC 5: Stalled ----- 88390 in-flight CPI 1.5797 -- Total Cycles 139648 ---- Thread 17 ---- PC 5: Stalled ----- 92107 in-flight CPI 1.5159 -- Total Cycles 139648 ---- Thread 18 ---- PC 5: Stalled ----- 94586 in-flight CPI 1.4762 -- Total Cycles 139648 ---- Thread 19 ---- PC 5: Stalled ----- 92416 in-flight CPI 1.5108 -- Total Cycles 139648 ---- Thread 20 ---- PC 5: Stalled ----- 90322 in-flight CPI 1.5458 -- Total Cycles 139648 ---- Thread 21 ---- PC 5: Stalled ----- 89659 in-flight CPI 1.5574 -- Total Cycles 139648 ---- Thread 22 ---- PC 5: Stalled ----- 87651 in-flight CPI 1.5930 -- Total Cycles 139648 ---- Thread 23 ---- PC 5: Stalled ----- 95604 in-flight CPI 1.4604 -- Total Cycles 139648 ---- Thread 24 ---- PC 5: Stalled ----- 93817 in-flight CPI 1.4883 -- Total Cycles 139648 ---- Thread 25 ---- PC 5: Stalled ----- 95239 in-flight CPI 1.4660 -- Total Cycles 139648 ---- Thread 26 ---- PC 5: Stalled ----- 93965 in-flight CPI 1.4859 -- Total Cycles 139648 ---- Thread 27 ---- PC 5: Stalled ----- 87587 in-flight CPI 1.5941 -- Total Cycles 139648 ---- Thread 28 ---- PC 5: Stalled ----- 90644 in-flight CPI 1.5403 -- Total Cycles 139648 ---- Thread 29 ---- PC 5: Stalled ----- 90937 in-flight CPI 1.5354 -- Total Cycles 139648 ---- Thread 30 ---- PC 5: Stalled ----- 87959 in-flight CPI 1.5873 -- Total Cycles 139648 ---- Thread 31 ---- PC 5: Stalled ----- 88629 in-flight CPI 1.5754 -- Total Cycles 139648 Total CPI 0.0464 , IPC 21.5531 -- Total Cycles 139648 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 9027 (4.022942%) FPSUB: 0 (0.000000%) FPMUL: 33898 (15.106868%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 87492 (38.991390%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4997 (2.226946%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 81642 (36.384299%) DIV: 7083 (3.156586%) FPUN: 0 (0.000000%) FPRSUB: 249 (0.110969%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3300689 total) ADD%: 7.401 (244293) SUB%: 0.000 (0) MUL%: 0.006 (192) BITOR%: 1.532 (50556) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.629 (20762) FPSUB%: 0.000 (0) FPMUL%: 5.025 (165864) FPCMPLT%: 0.000 (0) FPMIN%: 0.017 (576) FPMAX%: 0.017 (576) LOAD%: 5.286 (174461) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (224) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.016 (541) FPINV%: 0.000 (0) FPCONV%: 0.018 (608) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.097 (36203) FPLE%: 0.455 (15012) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.017 (576) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.767 (91342) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.779 (25696) CMPU%: 0.000 (0) RSUB%: 0.006 (192) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.728 (519121) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (38694) ORI%: 1.631 (53833) XORI%: 0.000 (0) MULI%: 3.162 (104360) LW%: 1.116 (36844) LWI%: 13.383 (441722) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9392) SWI%: 4.016 (132570) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.381 (45583) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10250) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.068 (2239) bned%: 0.000 (0) bneid%: 13.819 (456130) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.711 (23463) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.142 (4674) DIV%: 0.012 (384) FPUN%: 1.470 (48514) FPRSUB%: 3.757 (124009) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (58) FPGT%: 2.932 (96787) FPGE%: 1.015 (33502) SYNC%: 0.000 (0) NOP%: 8.810 (290790) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 10 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 68 FPCMPLT 0 FPMIN 0 FPMAX 376 LOAD 41623 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1398 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48182 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 12962 XORI 0 MULI 8732 LW 0 LWI 140493 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 90 DIV 33 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5533 --Total thread-cycles: 4468736 --total thread-cycles issued: 3009899 (67.354594%) --iCache conflicts: 112563 (2.518900%) --thread*cycles of FU dependence: 254058 (5.685232%) --thread*cycles of data dependence: 224388 (5.021286%) --iCache cycles*banks: 4468736 (73.862520% used) Issue breakdown: --thread*cycles of issue worked: 3009899 (67.354594%) --thread*cycles of issue failed: 1168047 (26.138197%) --thread*cycles of issue NOP/other: 290790 (6.507209%) Number of thread-cycles not ready: 224388 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3300689 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 8 2: 9 3: 7 4: 8 5: 8 6: 9 7: 7 8: 6 9: 8 10: 5 11: 6 12: 8 13: 8 14: 7 15: 7 16: 6 17: 6 18: 6 19: 7 20: 7 21: 5 22: 6 23: 7 24: 6 25: 8 26: 8 27: 6 28: 7 29: 7 30: 8 31: 7 <=== Core 46 ===> ---- Thread 00 ---- PC 5: Stalled ----- 108992 in-flight CPI 1.3186 -- Total Cycles 143740 ---- Thread 01 ---- PC 5: Stalled ----- 102095 in-flight CPI 1.4076 -- Total Cycles 143740 ---- Thread 02 ---- PC 5: Stalled ----- 98100 in-flight CPI 1.4650 -- Total Cycles 143740 ---- Thread 03 ---- PC 5: Stalled ----- 102419 in-flight CPI 1.4032 -- Total Cycles 143740 ---- Thread 04 ---- PC 5: Stalled ----- 93009 in-flight CPI 1.5452 -- Total Cycles 143740 ---- Thread 05 ---- PC 5: Stalled ----- 98486 in-flight CPI 1.4593 -- Total Cycles 143740 ---- Thread 06 ---- PC 5: Stalled ----- 99828 in-flight CPI 1.4396 -- Total Cycles 143740 ---- Thread 07 ---- PC 5: Stalled ----- 101670 in-flight CPI 1.4135 -- Total Cycles 143740 ---- Thread 08 ---- PC 5: Stalled ----- 98886 in-flight CPI 1.4534 -- Total Cycles 143740 ---- Thread 09 ---- PC 5: Stalled ----- 94951 in-flight CPI 1.5136 -- Total Cycles 143740 ---- Thread 10 ---- PC 5: Stalled ----- 99570 in-flight CPI 1.4433 -- Total Cycles 143740 ---- Thread 11 ---- PC 5: Stalled ----- 101928 in-flight CPI 1.4100 -- Total Cycles 143740 ---- Thread 12 ---- PC 5: Stalled ----- 96187 in-flight CPI 1.4940 -- Total Cycles 143740 ---- Thread 13 ---- PC 5: Stalled ----- 96629 in-flight CPI 1.4872 -- Total Cycles 143740 ---- Thread 14 ---- PC 5: Stalled ----- 96910 in-flight CPI 1.4830 -- Total Cycles 143740 ---- Thread 15 ---- PC 5: Stalled ----- 89304 in-flight CPI 1.6093 -- Total Cycles 143740 ---- Thread 16 ---- PC 5: Stalled ----- 98507 in-flight CPI 1.4589 -- Total Cycles 143740 ---- Thread 17 ---- PC 5: Stalled ----- 97065 in-flight CPI 1.4806 -- Total Cycles 143740 ---- Thread 18 ---- PC 5: Stalled ----- 91406 in-flight CPI 1.5723 -- Total Cycles 143740 ---- Thread 19 ---- PC 5: Stalled ----- 96346 in-flight CPI 1.4917 -- Total Cycles 143740 ---- Thread 20 ---- PC 5: Stalled ----- 95684 in-flight CPI 1.5019 -- Total Cycles 143740 ---- Thread 21 ---- PC 5: Stalled ----- 87797 in-flight CPI 1.6369 -- Total Cycles 143740 ---- Thread 22 ---- PC 5: Stalled ----- 89802 in-flight CPI 1.6003 -- Total Cycles 143740 ---- Thread 23 ---- PC 5: Stalled ----- 88870 in-flight CPI 1.6172 -- Total Cycles 143740 ---- Thread 24 ---- PC 5: Stalled ----- 94257 in-flight CPI 1.5247 -- Total Cycles 143740 ---- Thread 25 ---- PC 5: Stalled ----- 94103 in-flight CPI 1.5273 -- Total Cycles 143740 ---- Thread 26 ---- PC 5: Stalled ----- 97693 in-flight CPI 1.4711 -- Total Cycles 143740 ---- Thread 27 ---- PC 5: Stalled ----- 88468 in-flight CPI 1.6245 -- Total Cycles 143740 ---- Thread 28 ---- PC 5: Stalled ----- 90646 in-flight CPI 1.5854 -- Total Cycles 143740 ---- Thread 29 ---- PC 5: Stalled ----- 88285 in-flight CPI 1.6279 -- Total Cycles 143740 ---- Thread 30 ---- PC 5: Stalled ----- 87303 in-flight CPI 1.6462 -- Total Cycles 143740 ---- Thread 31 ---- PC 5: Stalled ----- 89443 in-flight CPI 1.6068 -- Total Cycles 143740 Total CPI 0.0470 , IPC 21.2550 -- Total Cycles 143740 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7921 (3.781250%) FPSUB: 0 (0.000000%) FPMUL: 32212 (15.377051%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 84082 (40.138246%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5659 (2.701438%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71845 (34.296667%) DIV: 7500 (3.580277%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.125071%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3349288 total) ADD%: 7.457 (249770) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.544 (51704) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.559 (18723) FPSUB%: 0.000 (0) FPMUL%: 4.810 (161097) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.163 (172936) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (588) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35735) FPLE%: 0.456 (15272) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.823 (94558) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (25215) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.758 (527790) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.183 (39625) ORI%: 1.586 (53130) XORI%: 0.000 (0) MULI%: 3.217 (107748) LW%: 1.139 (38148) LWI%: 13.544 (453636) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9640) SWI%: 4.092 (137063) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.412 (47294) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10419) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1884) bned%: 0.000 (0) bneid%: 13.864 (464360) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.728 (24389) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4088) DIV%: 0.012 (406) FPUN%: 1.494 (50024) FPRSUB%: 3.681 (123299) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.946 (98680) FPGE%: 1.038 (34752) SYNC%: 0.000 (0) NOP%: 8.779 (294040) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 6 FPSUB 0 FPMUL 56 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 40045 INTCONV 0 ATOMIC_INC 12 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1627 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49360 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 16 ORI 11362 XORI 0 MULI 9785 LW 0 LWI 143341 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 44 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.2552 --Total thread-cycles: 4599680 --total thread-cycles issued: 3055248 (66.423056%) --iCache conflicts: 114050 (2.479520%) --thread*cycles of FU dependence: 256194 (5.569822%) --thread*cycles of data dependence: 209481 (4.554252%) --iCache cycles*banks: 4599680 (72.816370% used) Issue breakdown: --thread*cycles of issue worked: 3055248 (66.423056%) --thread*cycles of issue failed: 1250392 (27.184326%) --thread*cycles of issue NOP/other: 294040 (6.392619%) Number of thread-cycles not ready: 209481 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3349288 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 8 4: 7 5: 7 6: 8 7: 8 8: 7 9: 7 10: 8 11: 7 12: 9 13: 8 14: 7 15: 7 16: 8 17: 8 18: 7 19: 7 20: 8 21: 7 22: 8 23: 6 24: 8 25: 6 26: 8 27: 7 28: 7 29: 6 30: 6 31: 7 <=== Core 47 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94384 in-flight CPI 1.3615 -- Total Cycles 128530 ---- Thread 01 ---- PC 5: Stalled ----- 100944 in-flight CPI 1.2730 -- Total Cycles 128530 ---- Thread 02 ---- PC 5: Stalled ----- 98276 in-flight CPI 1.3076 -- Total Cycles 128530 ---- Thread 03 ---- PC 5: Stalled ----- 98458 in-flight CPI 1.3052 -- Total Cycles 128530 ---- Thread 04 ---- PC 5: Stalled ----- 98634 in-flight CPI 1.3028 -- Total Cycles 128530 ---- Thread 05 ---- PC 5: Stalled ----- 98274 in-flight CPI 1.3076 -- Total Cycles 128530 ---- Thread 06 ---- PC 5: Stalled ----- 98882 in-flight CPI 1.2996 -- Total Cycles 128530 ---- Thread 07 ---- PC 5: Stalled ----- 95532 in-flight CPI 1.3452 -- Total Cycles 128530 ---- Thread 08 ---- PC 5: Stalled ----- 98307 in-flight CPI 1.3072 -- Total Cycles 128530 ---- Thread 09 ---- PC 5: Stalled ----- 93559 in-flight CPI 1.3736 -- Total Cycles 128530 ---- Thread 10 ---- PC 5: Stalled ----- 98142 in-flight CPI 1.3094 -- Total Cycles 128530 ---- Thread 11 ---- PC 5: Stalled ----- 99041 in-flight CPI 1.2975 -- Total Cycles 128530 ---- Thread 12 ---- PC 5: Stalled ----- 99586 in-flight CPI 1.2904 -- Total Cycles 128530 ---- Thread 13 ---- PC 5: Stalled ----- 93242 in-flight CPI 1.3782 -- Total Cycles 128530 ---- Thread 14 ---- PC 5: Stalled ----- 98555 in-flight CPI 1.3039 -- Total Cycles 128530 ---- Thread 15 ---- PC 5: Stalled ----- 91591 in-flight CPI 1.4030 -- Total Cycles 128530 ---- Thread 16 ---- PC 5: Stalled ----- 98523 in-flight CPI 1.3043 -- Total Cycles 128530 ---- Thread 17 ---- PC 5: Stalled ----- 96884 in-flight CPI 1.3264 -- Total Cycles 128530 ---- Thread 18 ---- PC 5: Stalled ----- 90841 in-flight CPI 1.4147 -- Total Cycles 128530 ---- Thread 19 ---- PC 5: Stalled ----- 99196 in-flight CPI 1.2955 -- Total Cycles 128530 ---- Thread 20 ---- PC 5: Stalled ----- 94665 in-flight CPI 1.3575 -- Total Cycles 128530 ---- Thread 21 ---- PC 5: Stalled ----- 90310 in-flight CPI 1.4229 -- Total Cycles 128530 ---- Thread 22 ---- PC 5: Stalled ----- 90371 in-flight CPI 1.4220 -- Total Cycles 128530 ---- Thread 23 ---- PC 5: Stalled ----- 96049 in-flight CPI 1.3379 -- Total Cycles 128530 ---- Thread 24 ---- PC 5: Stalled ----- 91137 in-flight CPI 1.4100 -- Total Cycles 128530 ---- Thread 25 ---- PC 5: Stalled ----- 90957 in-flight CPI 1.4128 -- Total Cycles 128530 ---- Thread 26 ---- PC 5: Stalled ----- 95705 in-flight CPI 1.3428 -- Total Cycles 128530 ---- Thread 27 ---- PC 5: Stalled ----- 95049 in-flight CPI 1.3519 -- Total Cycles 128530 ---- Thread 28 ---- PC 5: Stalled ----- 89340 in-flight CPI 1.4384 -- Total Cycles 128530 ---- Thread 29 ---- PC 5: Stalled ----- 93217 in-flight CPI 1.3786 -- Total Cycles 128530 ---- Thread 30 ---- PC 5: Stalled ----- 89702 in-flight CPI 1.4326 -- Total Cycles 128530 ---- Thread 31 ---- PC 5: Stalled ----- 90163 in-flight CPI 1.4253 -- Total Cycles 128530 Total CPI 0.0422 , IPC 23.7151 -- Total Cycles 128530 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7820 (3.797444%) FPSUB: 0 (0.000000%) FPMUL: 31932 (15.506391%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79460 (38.586302%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5976 (2.901985%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72641 (35.274950%) DIV: 7829 (3.801814%) FPUN: 0 (0.000000%) FPRSUB: 270 (0.131114%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3342207 total) ADD%: 7.478 (249919) SUB%: 0.000 (0) MUL%: 0.006 (212) BITOR%: 1.529 (51108) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.552 (18458) FPSUB%: 0.000 (0) FPMUL%: 4.793 (160187) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (636) FPMAX%: 0.019 (636) LOAD%: 5.163 (172556) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (244) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (614) FPINV%: 0.000 (0) FPCONV%: 0.020 (668) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.069 (35713) FPLE%: 0.453 (15125) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (636) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.820 (94262) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24982) CMPU%: 0.000 (0) RSUB%: 0.006 (212) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.747 (526292) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (39401) ORI%: 1.571 (52516) XORI%: 0.000 (0) MULI%: 3.223 (107720) LW%: 1.138 (38044) LWI%: 13.577 (453776) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9619) SWI%: 4.095 (136857) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.411 (47149) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10366) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1949) bned%: 0.000 (0) bneid%: 13.866 (463436) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (24031) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4129) DIV%: 0.013 (424) FPUN%: 1.482 (49539) FPRSUB%: 3.684 (123115) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.963 (99038) FPGE%: 1.030 (34414) SYNC%: 0.000 (0) NOP%: 8.798 (294055) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 48 FPCMPLT 0 FPMIN 0 FPMAX 416 LOAD 39149 INTCONV 0 ATOMIC_INC 23 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 9 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1573 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 13 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49500 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 11106 XORI 0 MULI 9609 LW 0 LWI 143570 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 90 DIV 24 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7153 --Total thread-cycles: 4112960 --total thread-cycles issued: 3048152 (74.110908%) --iCache conflicts: 115479 (2.807686%) --thread*cycles of FU dependence: 255178 (6.204242%) --thread*cycles of data dependence: 205928 (5.006808%) --iCache cycles*banks: 4112960 (81.261160% used) Issue breakdown: --thread*cycles of issue worked: 3048152 (74.110908%) --thread*cycles of issue failed: 770753 (18.739618%) --thread*cycles of issue NOP/other: 294055 (7.149474%) Number of thread-cycles not ready: 205928 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3342207 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 7 4: 8 5: 8 6: 8 7: 7 8: 7 9: 6 10: 8 11: 8 12: 8 13: 7 14: 7 15: 9 16: 9 17: 8 18: 6 19: 8 20: 8 21: 8 22: 7 23: 8 24: 8 25: 8 26: 7 27: 9 28: 7 29: 7 30: 7 31: 7 <=== Core 48 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99599 in-flight CPI 1.5142 -- Total Cycles 150841 ---- Thread 01 ---- PC 5: Stalled ----- 101079 in-flight CPI 1.4921 -- Total Cycles 150841 ---- Thread 02 ---- PC 5: Stalled ----- 99669 in-flight CPI 1.5131 -- Total Cycles 150841 ---- Thread 03 ---- PC 5: Stalled ----- 100999 in-flight CPI 1.4933 -- Total Cycles 150841 ---- Thread 04 ---- PC 5: Stalled ----- 93372 in-flight CPI 1.6152 -- Total Cycles 150841 ---- Thread 05 ---- PC 5: Stalled ----- 102954 in-flight CPI 1.4648 -- Total Cycles 150841 ---- Thread 06 ---- PC 5: Stalled ----- 96690 in-flight CPI 1.5597 -- Total Cycles 150841 ---- Thread 07 ---- PC 5: Stalled ----- 97173 in-flight CPI 1.5520 -- Total Cycles 150841 ---- Thread 08 ---- PC 5: Stalled ----- 98878 in-flight CPI 1.5252 -- Total Cycles 150841 ---- Thread 09 ---- PC 5: Stalled ----- 100076 in-flight CPI 1.5069 -- Total Cycles 150841 ---- Thread 10 ---- PC 5: Stalled ----- 98370 in-flight CPI 1.5331 -- Total Cycles 150841 ---- Thread 11 ---- PC 5: Stalled ----- 95697 in-flight CPI 1.5760 -- Total Cycles 150841 ---- Thread 12 ---- PC 5: Stalled ----- 95733 in-flight CPI 1.5754 -- Total Cycles 150841 ---- Thread 13 ---- PC 5: Stalled ----- 99414 in-flight CPI 1.5170 -- Total Cycles 150841 ---- Thread 14 ---- PC 5: Stalled ----- 98878 in-flight CPI 1.5252 -- Total Cycles 150841 ---- Thread 15 ---- PC 5: Stalled ----- 93061 in-flight CPI 1.6206 -- Total Cycles 150841 ---- Thread 16 ---- PC 5: Stalled ----- 94425 in-flight CPI 1.5972 -- Total Cycles 150841 ---- Thread 17 ---- PC 5: Stalled ----- 93538 in-flight CPI 1.6123 -- Total Cycles 150841 ---- Thread 18 ---- PC 5: Stalled ----- 93598 in-flight CPI 1.6112 -- Total Cycles 150841 ---- Thread 19 ---- PC 5: Stalled ----- 93018 in-flight CPI 1.6213 -- Total Cycles 150841 ---- Thread 20 ---- PC 5: Stalled ----- 95792 in-flight CPI 1.5744 -- Total Cycles 150841 ---- Thread 21 ---- PC 5: Stalled ----- 109555 in-flight CPI 1.3767 -- Total Cycles 150841 ---- Thread 22 ---- PC 5: Stalled ----- 94135 in-flight CPI 1.6021 -- Total Cycles 150841 ---- Thread 23 ---- PC 5: Stalled ----- 87683 in-flight CPI 1.7200 -- Total Cycles 150841 ---- Thread 24 ---- PC 5: Stalled ----- 90996 in-flight CPI 1.6574 -- Total Cycles 150841 ---- Thread 25 ---- PC 5: Stalled ----- 90968 in-flight CPI 1.6579 -- Total Cycles 150841 ---- Thread 26 ---- PC 5: Stalled ----- 89120 in-flight CPI 1.6922 -- Total Cycles 150841 ---- Thread 27 ---- PC 5: Stalled ----- 93107 in-flight CPI 1.6197 -- Total Cycles 150841 ---- Thread 28 ---- PC 5: Stalled ----- 93705 in-flight CPI 1.6095 -- Total Cycles 150841 ---- Thread 29 ---- PC 5: Stalled ----- 93474 in-flight CPI 1.6134 -- Total Cycles 150841 ---- Thread 30 ---- PC 5: Stalled ----- 92758 in-flight CPI 1.6259 -- Total Cycles 150841 ---- Thread 31 ---- PC 5: Stalled ----- 89078 in-flight CPI 1.6930 -- Total Cycles 150841 Total CPI 0.0492 , IPC 20.3337 -- Total Cycles 150841 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8893 (4.228238%) FPSUB: 0 (0.000000%) FPMUL: 33978 (16.155075%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73175 (34.791560%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5556 (2.641639%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 80968 (38.496795%) DIV: 7494 (3.563074%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.123619%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3363334 total) ADD%: 7.331 (246556) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.525 (51307) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.615 (20681) FPSUB%: 0.000 (0) FPMUL%: 4.979 (167452) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.272 (177300) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (582) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.089 (36642) FPLE%: 0.455 (15318) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.791 (93878) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.768 (25835) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.749 (529689) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39483) ORI%: 1.613 (54252) XORI%: 0.000 (0) MULI%: 3.183 (107066) LW%: 1.126 (37876) LWI%: 13.448 (452286) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9612) SWI%: 4.051 (136246) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.395 (46908) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10441) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.069 (2336) bned%: 0.000 (0) bneid%: 13.821 (464858) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (24028) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.138 (4626) DIV%: 0.012 (406) FPUN%: 1.472 (49497) FPRSUB%: 3.741 (125829) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (68) FPGT%: 2.936 (98759) FPGE%: 1.016 (34179) SYNC%: 0.000 (0) NOP%: 8.805 (296133) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 30 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 64 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 42237 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1458 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 13 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49155 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 12773 XORI 0 MULI 9962 LW 0 LWI 143715 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 78 DIV 41 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.3339 --Total thread-cycles: 4826912 --total thread-cycles issued: 3067201 (63.543752%) --iCache conflicts: 115370 (2.390141%) --thread*cycles of FU dependence: 259990 (5.386259%) --thread*cycles of data dependence: 210324 (4.357320%) --iCache cycles*banks: 4826912 (69.679456% used) Issue breakdown: --thread*cycles of issue worked: 3067201 (63.543752%) --thread*cycles of issue failed: 1463578 (30.321207%) --thread*cycles of issue NOP/other: 296133 (6.135040%) Number of thread-cycles not ready: 210324 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3363334 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 6 4: 7 5: 9 6: 8 7: 8 8: 8 9: 9 10: 7 11: 6 12: 7 13: 8 14: 9 15: 7 16: 7 17: 8 18: 8 19: 7 20: 7 21: 6 22: 8 23: 6 24: 7 25: 6 26: 7 27: 8 28: 7 29: 8 30: 7 31: 7 <=== Core 49 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100252 in-flight CPI 1.2919 -- Total Cycles 129541 ---- Thread 01 ---- PC 5: Stalled ----- 102949 in-flight CPI 1.2581 -- Total Cycles 129541 ---- Thread 02 ---- PC 5: Stalled ----- 99396 in-flight CPI 1.3031 -- Total Cycles 129541 ---- Thread 03 ---- PC 5: Stalled ----- 104705 in-flight CPI 1.2370 -- Total Cycles 129541 ---- Thread 04 ---- PC 5: Stalled ----- 93092 in-flight CPI 1.3913 -- Total Cycles 129541 ---- Thread 05 ---- PC 5: Stalled ----- 95263 in-flight CPI 1.3596 -- Total Cycles 129541 ---- Thread 06 ---- PC 5: Stalled ----- 96748 in-flight CPI 1.3387 -- Total Cycles 129541 ---- Thread 07 ---- PC 5: Stalled ----- 93714 in-flight CPI 1.3821 -- Total Cycles 129541 ---- Thread 08 ---- PC 5: Stalled ----- 100856 in-flight CPI 1.2841 -- Total Cycles 129541 ---- Thread 09 ---- PC 5: Stalled ----- 96507 in-flight CPI 1.3420 -- Total Cycles 129541 ---- Thread 10 ---- PC 5: Stalled ----- 97706 in-flight CPI 1.3256 -- Total Cycles 129541 ---- Thread 11 ---- PC 5: Stalled ----- 99827 in-flight CPI 1.2974 -- Total Cycles 129541 ---- Thread 12 ---- PC 5: Stalled ----- 97888 in-flight CPI 1.3231 -- Total Cycles 129541 ---- Thread 13 ---- PC 5: Stalled ----- 95042 in-flight CPI 1.3627 -- Total Cycles 129541 ---- Thread 14 ---- PC 5: Stalled ----- 90615 in-flight CPI 1.4294 -- Total Cycles 129541 ---- Thread 15 ---- PC 5: Stalled ----- 103964 in-flight CPI 1.2458 -- Total Cycles 129541 ---- Thread 16 ---- PC 5: Stalled ----- 96994 in-flight CPI 1.3353 -- Total Cycles 129541 ---- Thread 17 ---- PC 5: Stalled ----- 91369 in-flight CPI 1.4176 -- Total Cycles 129541 ---- Thread 18 ---- PC 5: Stalled ----- 94294 in-flight CPI 1.3736 -- Total Cycles 129541 ---- Thread 19 ---- PC 5: Stalled ----- 87325 in-flight CPI 1.4832 -- Total Cycles 129541 ---- Thread 20 ---- PC 5: Stalled ----- 95807 in-flight CPI 1.3519 -- Total Cycles 129541 ---- Thread 21 ---- PC 5: Stalled ----- 95571 in-flight CPI 1.3552 -- Total Cycles 129541 ---- Thread 22 ---- PC 5: Stalled ----- 91298 in-flight CPI 1.4186 -- Total Cycles 129541 ---- Thread 23 ---- PC 5: Stalled ----- 95173 in-flight CPI 1.3608 -- Total Cycles 129541 ---- Thread 24 ---- PC 5: Stalled ----- 94132 in-flight CPI 1.3759 -- Total Cycles 129541 ---- Thread 25 ---- PC 5: Stalled ----- 92729 in-flight CPI 1.3967 -- Total Cycles 129541 ---- Thread 26 ---- PC 5: Stalled ----- 90415 in-flight CPI 1.4325 -- Total Cycles 129541 ---- Thread 27 ---- PC 5: Stalled ----- 85624 in-flight CPI 1.5127 -- Total Cycles 129541 ---- Thread 28 ---- PC 5: Stalled ----- 87530 in-flight CPI 1.4797 -- Total Cycles 129541 ---- Thread 29 ---- PC 5: Stalled ----- 90319 in-flight CPI 1.4340 -- Total Cycles 129541 ---- Thread 30 ---- PC 5: Stalled ----- 91692 in-flight CPI 1.4125 -- Total Cycles 129541 ---- Thread 31 ---- PC 5: Stalled ----- 90782 in-flight CPI 1.4266 -- Total Cycles 129541 Total CPI 0.0426 , IPC 23.4685 -- Total Cycles 129541 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8183 (4.111131%) FPSUB: 0 (0.000000%) FPMUL: 32610 (16.383230%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 70056 (35.196061%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5658 (2.842573%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74781 (37.569896%) DIV: 7496 (3.765983%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.131126%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3333428 total) ADD%: 7.410 (247009) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.523 (50770) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.577 (19228) FPSUB%: 0.000 (0) FPMUL%: 4.869 (162311) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.207 (173572) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (588) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.077 (35913) FPLE%: 0.452 (15066) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.812 (93743) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.759 (25291) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.757 (525258) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.182 (39388) ORI%: 1.581 (52717) XORI%: 0.000 (0) MULI%: 3.209 (106956) LW%: 1.135 (37822) LWI%: 13.535 (451164) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9600) SWI%: 4.082 (136063) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.405 (46845) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10367) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1957) bned%: 0.000 (0) bneid%: 13.853 (461787) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.712 (23747) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4262) DIV%: 0.012 (406) FPUN%: 1.470 (49001) FPRSUB%: 3.707 (123585) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.957 (98557) FPGE%: 1.018 (33935) SYNC%: 0.000 (0) NOP%: 8.797 (293241) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 11 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 59 FPCMPLT 0 FPMIN 0 FPMAX 390 LOAD 39721 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1381 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49208 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11733 XORI 0 MULI 9205 LW 0 LWI 142662 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 65 DIV 22 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4687 --Total thread-cycles: 4145312 --total thread-cycles issued: 3040187 (73.340366%) --iCache conflicts: 115238 (2.779960%) --thread*cycles of FU dependence: 254535 (6.140310%) --thread*cycles of data dependence: 199045 (4.801689%) --iCache cycles*banks: 4145312 (80.415177% used) Issue breakdown: --thread*cycles of issue worked: 3040187 (73.340366%) --thread*cycles of issue failed: 811884 (19.585595%) --thread*cycles of issue NOP/other: 293241 (7.074039%) Number of thread-cycles not ready: 199045 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3333428 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 8 4: 7 5: 6 6: 8 7: 6 8: 9 9: 8 10: 8 11: 7 12: 9 13: 8 14: 6 15: 9 16: 7 17: 6 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 7 25: 8 26: 7 27: 5 28: 7 29: 7 30: 8 31: 8 <=== Core 50 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103349 in-flight CPI 1.2421 -- Total Cycles 128401 ---- Thread 01 ---- PC 5: Stalled ----- 99470 in-flight CPI 1.2906 -- Total Cycles 128401 ---- Thread 02 ---- PC 5: Stalled ----- 93460 in-flight CPI 1.3736 -- Total Cycles 128401 ---- Thread 03 ---- PC 5: Stalled ----- 102546 in-flight CPI 1.2519 -- Total Cycles 128401 ---- Thread 04 ---- PC 5: Stalled ----- 102254 in-flight CPI 1.2555 -- Total Cycles 128401 ---- Thread 05 ---- PC 5: Stalled ----- 102823 in-flight CPI 1.2485 -- Total Cycles 128401 ---- Thread 06 ---- PC 5: Stalled ----- 93252 in-flight CPI 1.3766 -- Total Cycles 128401 ---- Thread 07 ---- PC 5: Stalled ----- 100747 in-flight CPI 1.2742 -- Total Cycles 128401 ---- Thread 08 ---- PC 5: Stalled ----- 98801 in-flight CPI 1.2993 -- Total Cycles 128401 ---- Thread 09 ---- PC 5: Stalled ----- 95087 in-flight CPI 1.3502 -- Total Cycles 128401 ---- Thread 10 ---- PC 5: Stalled ----- 94577 in-flight CPI 1.3574 -- Total Cycles 128401 ---- Thread 11 ---- PC 5: Stalled ----- 97413 in-flight CPI 1.3178 -- Total Cycles 128401 ---- Thread 12 ---- PC 5: Stalled ----- 92406 in-flight CPI 1.3893 -- Total Cycles 128401 ---- Thread 13 ---- PC 5: Stalled ----- 93203 in-flight CPI 1.3774 -- Total Cycles 128401 ---- Thread 14 ---- PC 5: Stalled ----- 93663 in-flight CPI 1.3706 -- Total Cycles 128401 ---- Thread 15 ---- PC 5: Stalled ----- 99741 in-flight CPI 1.2871 -- Total Cycles 128401 ---- Thread 16 ---- PC 5: Stalled ----- 94358 in-flight CPI 1.3605 -- Total Cycles 128401 ---- Thread 17 ---- PC 5: Stalled ----- 96058 in-flight CPI 1.3365 -- Total Cycles 128401 ---- Thread 18 ---- PC 5: Stalled ----- 95390 in-flight CPI 1.3457 -- Total Cycles 128401 ---- Thread 19 ---- PC 5: Stalled ----- 96421 in-flight CPI 1.3314 -- Total Cycles 128401 ---- Thread 20 ---- PC 5: Stalled ----- 93151 in-flight CPI 1.3782 -- Total Cycles 128401 ---- Thread 21 ---- PC 5: Stalled ----- 97400 in-flight CPI 1.3181 -- Total Cycles 128401 ---- Thread 22 ---- PC 5: Stalled ----- 93194 in-flight CPI 1.3775 -- Total Cycles 128401 ---- Thread 23 ---- PC 5: Stalled ----- 92708 in-flight CPI 1.3847 -- Total Cycles 128401 ---- Thread 24 ---- PC 5: Stalled ----- 90528 in-flight CPI 1.4181 -- Total Cycles 128401 ---- Thread 25 ---- PC 5: Stalled ----- 86110 in-flight CPI 1.4909 -- Total Cycles 128401 ---- Thread 26 ---- PC 5: Stalled ----- 96880 in-flight CPI 1.3251 -- Total Cycles 128401 ---- Thread 27 ---- PC 5: Stalled ----- 89423 in-flight CPI 1.4356 -- Total Cycles 128401 ---- Thread 28 ---- PC 5: Stalled ----- 92450 in-flight CPI 1.3886 -- Total Cycles 128401 ---- Thread 29 ---- PC 5: Stalled ----- 91293 in-flight CPI 1.4062 -- Total Cycles 128401 ---- Thread 30 ---- PC 5: Stalled ----- 89282 in-flight CPI 1.4378 -- Total Cycles 128401 ---- Thread 31 ---- PC 5: Stalled ----- 93061 in-flight CPI 1.3795 -- Total Cycles 128401 Total CPI 0.0421 , IPC 23.7622 -- Total Cycles 128401 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7473 (3.755427%) FPSUB: 0 (0.000000%) FPMUL: 31291 (15.724753%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76482 (38.434711%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 6055 (3.042836%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69526 (34.939093%) DIV: 7894 (3.966994%) FPUN: 0 (0.000000%) FPRSUB: 271 (0.136186%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3344990 total) ADD%: 7.489 (250502) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.528 (51101) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.532 (17797) FPSUB%: 0.000 (0) FPMUL%: 4.728 (158162) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.142 (171991) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (621) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.058 (35387) FPLE%: 0.455 (15206) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.840 (95004) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.741 (24802) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.774 (527623) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.185 (39627) ORI%: 1.553 (51955) XORI%: 0.000 (0) MULI%: 3.241 (108400) LW%: 1.146 (38344) LWI%: 13.618 (455530) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9702) SWI%: 4.121 (137852) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.420 (47515) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10416) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1793) bned%: 0.000 (0) bneid%: 13.874 (464084) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24125) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.118 (3945) DIV%: 0.013 (428) FPUN%: 1.483 (49604) FPRSUB%: 3.660 (122436) FPSQRT%: 0.000 (0) FPNEG%: 0.003 (85) FPGT%: 2.970 (99336) FPGE%: 1.028 (34398) SYNC%: 0.000 (0) NOP%: 8.785 (293849) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 33 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 53 FPCMPLT 0 FPMIN 0 FPMAX 412 LOAD 39647 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1764 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 13 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49650 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 10599 XORI 0 MULI 10113 LW 0 LWI 143788 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 84 DIV 32 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7624 --Total thread-cycles: 4108832 --total thread-cycles issued: 3051141 (74.258110%) --iCache conflicts: 116414 (2.833263%) --thread*cycles of FU dependence: 256242 (6.236371%) --thread*cycles of data dependence: 198992 (4.843031%) --iCache cycles*banks: 4108832 (81.410532% used) Issue breakdown: --thread*cycles of issue worked: 3051141 (74.258110%) --thread*cycles of issue failed: 763842 (18.590247%) --thread*cycles of issue NOP/other: 293849 (7.151643%) Number of thread-cycles not ready: 198992 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3344990 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 7 3: 9 4: 8 5: 9 6: 8 7: 8 8: 8 9: 6 10: 7 11: 8 12: 6 13: 7 14: 7 15: 8 16: 9 17: 7 18: 9 19: 9 20: 7 21: 7 22: 8 23: 8 24: 8 25: 5 26: 8 27: 8 28: 8 29: 7 30: 8 31: 7 <=== Core 51 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95209 in-flight CPI 1.4880 -- Total Cycles 141695 ---- Thread 01 ---- PC 5: Stalled ----- 98259 in-flight CPI 1.4418 -- Total Cycles 141695 ---- Thread 02 ---- PC 5: Stalled ----- 100924 in-flight CPI 1.4037 -- Total Cycles 141695 ---- Thread 03 ---- PC 5: Stalled ----- 93008 in-flight CPI 1.5233 -- Total Cycles 141695 ---- Thread 04 ---- PC 5: Stalled ----- 100726 in-flight CPI 1.4064 -- Total Cycles 141695 ---- Thread 05 ---- PC 5: Stalled ----- 100194 in-flight CPI 1.4139 -- Total Cycles 141695 ---- Thread 06 ---- PC 5: Stalled ----- 97700 in-flight CPI 1.4501 -- Total Cycles 141695 ---- Thread 07 ---- PC 5: Stalled ----- 99394 in-flight CPI 1.4253 -- Total Cycles 141695 ---- Thread 08 ---- PC 5: Stalled ----- 94542 in-flight CPI 1.4985 -- Total Cycles 141695 ---- Thread 09 ---- PC 5: Stalled ----- 101773 in-flight CPI 1.3920 -- Total Cycles 141695 ---- Thread 10 ---- PC 5: Stalled ----- 106295 in-flight CPI 1.3328 -- Total Cycles 141695 ---- Thread 11 ---- PC 5: Stalled ----- 98665 in-flight CPI 1.4359 -- Total Cycles 141695 ---- Thread 12 ---- PC 5: Stalled ----- 90734 in-flight CPI 1.5614 -- Total Cycles 141695 ---- Thread 13 ---- PC 5: Stalled ----- 97100 in-flight CPI 1.4590 -- Total Cycles 141695 ---- Thread 14 ---- PC 5: Stalled ----- 101020 in-flight CPI 1.4024 -- Total Cycles 141695 ---- Thread 15 ---- PC 5: Stalled ----- 101304 in-flight CPI 1.3984 -- Total Cycles 141695 ---- Thread 16 ---- PC 5: Stalled ----- 98589 in-flight CPI 1.4369 -- Total Cycles 141695 ---- Thread 17 ---- PC 5: Stalled ----- 87389 in-flight CPI 1.6211 -- Total Cycles 141695 ---- Thread 18 ---- PC 5: Stalled ----- 98102 in-flight CPI 1.4441 -- Total Cycles 141695 ---- Thread 19 ---- PC 5: Stalled ----- 96325 in-flight CPI 1.4707 -- Total Cycles 141695 ---- Thread 20 ---- PC 5: Stalled ----- 96721 in-flight CPI 1.4647 -- Total Cycles 141695 ---- Thread 21 ---- PC 5: Stalled ----- 94974 in-flight CPI 1.4917 -- Total Cycles 141695 ---- Thread 22 ---- PC 5: Stalled ----- 92067 in-flight CPI 1.5388 -- Total Cycles 141695 ---- Thread 23 ---- PC 5: Stalled ----- 90455 in-flight CPI 1.5661 -- Total Cycles 141695 ---- Thread 24 ---- PC 5: Stalled ----- 89299 in-flight CPI 1.5865 -- Total Cycles 141695 ---- Thread 25 ---- PC 5: Stalled ----- 91675 in-flight CPI 1.5453 -- Total Cycles 141695 ---- Thread 26 ---- PC 5: Stalled ----- 92615 in-flight CPI 1.5296 -- Total Cycles 141695 ---- Thread 27 ---- PC 5: Stalled ----- 90822 in-flight CPI 1.5599 -- Total Cycles 141695 ---- Thread 28 ---- PC 5: Stalled ----- 89402 in-flight CPI 1.5846 -- Total Cycles 141695 ---- Thread 29 ---- PC 5: Stalled ----- 94004 in-flight CPI 1.5070 -- Total Cycles 141695 ---- Thread 30 ---- PC 5: Stalled ----- 91118 in-flight CPI 1.5547 -- Total Cycles 141695 ---- Thread 31 ---- PC 5: Stalled ----- 92526 in-flight CPI 1.5311 -- Total Cycles 141695 Total CPI 0.0463 , IPC 21.6205 -- Total Cycles 141695 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7874 (3.817845%) FPSUB: 0 (0.000000%) FPMUL: 32035 (15.532724%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79554 (38.573133%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5941 (2.880597%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72661 (35.230942%) DIV: 7903 (3.831906%) FPUN: 0 (0.000000%) FPRSUB: 274 (0.132854%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3358825 total) ADD%: 7.493 (251669) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.538 (51666) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.554 (18595) FPSUB%: 0.000 (0) FPMUL%: 4.789 (160851) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.159 (173297) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (615) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35851) FPLE%: 0.453 (15229) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.823 (94804) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (25091) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.747 (528900) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.180 (39634) ORI%: 1.578 (52997) XORI%: 0.000 (0) MULI%: 3.221 (108176) LW%: 1.139 (38264) LWI%: 13.554 (455251) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9696) SWI%: 4.096 (137569) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.411 (47395) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10445) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1919) bned%: 0.000 (0) bneid%: 13.868 (465786) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (24198) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4129) DIV%: 0.013 (428) FPUN%: 1.491 (50082) FPRSUB%: 3.676 (123479) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.955 (99258) FPGE%: 1.038 (34853) SYNC%: 0.000 (0) NOP%: 8.790 (295253) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 59 FPCMPLT 0 FPMIN 0 FPMAX 415 LOAD 39453 INTCONV 0 ATOMIC_INC 30 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1708 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49671 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11206 XORI 0 MULI 9509 LW 0 LWI 144118 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 27 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6207 --Total thread-cycles: 4534240 --total thread-cycles issued: 3063572 (67.565281%) --iCache conflicts: 115223 (2.541176%) --thread*cycles of FU dependence: 256347 (5.653583%) --thread*cycles of data dependence: 206242 (4.548546%) --iCache cycles*banks: 4534240 (74.077618% used) Issue breakdown: --thread*cycles of issue worked: 3063572 (67.565281%) --thread*cycles of issue failed: 1175415 (25.923087%) --thread*cycles of issue NOP/other: 295253 (6.511631%) Number of thread-cycles not ready: 206242 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3358825 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 6 4: 9 5: 9 6: 7 7: 8 8: 7 9: 9 10: 7 11: 7 12: 7 13: 8 14: 7 15: 8 16: 9 17: 7 18: 8 19: 8 20: 7 21: 7 22: 7 23: 9 24: 7 25: 7 26: 8 27: 7 28: 8 29: 8 30: 8 31: 8 <=== Core 52 ===> ---- Thread 00 ---- PC 5: Stalled ----- 91268 in-flight CPI 1.8946 -- Total Cycles 172944 ---- Thread 01 ---- PC 5: Stalled ----- 92059 in-flight CPI 1.8783 -- Total Cycles 172944 ---- Thread 02 ---- PC 5: Stalled ----- 100189 in-flight CPI 1.7258 -- Total Cycles 172944 ---- Thread 03 ---- PC 5: Stalled ----- 93612 in-flight CPI 1.8471 -- Total Cycles 172944 ---- Thread 04 ---- PC 5: Stalled ----- 102078 in-flight CPI 1.6939 -- Total Cycles 172944 ---- Thread 05 ---- PC 5: Stalled ----- 99922 in-flight CPI 1.7305 -- Total Cycles 172944 ---- Thread 06 ---- PC 5: Stalled ----- 92327 in-flight CPI 1.8729 -- Total Cycles 172944 ---- Thread 07 ---- PC 5: Stalled ----- 91775 in-flight CPI 1.8842 -- Total Cycles 172944 ---- Thread 08 ---- PC 5: Stalled ----- 86640 in-flight CPI 1.9958 -- Total Cycles 172944 ---- Thread 09 ---- PC 5: Stalled ----- 100879 in-flight CPI 1.7140 -- Total Cycles 172944 ---- Thread 10 ---- PC 5: Stalled ----- 98796 in-flight CPI 1.7502 -- Total Cycles 172944 ---- Thread 11 ---- PC 5: Stalled ----- 98718 in-flight CPI 1.7516 -- Total Cycles 172944 ---- Thread 12 ---- PC 5: Stalled ----- 95034 in-flight CPI 1.8194 -- Total Cycles 172944 ---- Thread 13 ---- PC 5: Stalled ----- 95941 in-flight CPI 1.8023 -- Total Cycles 172944 ---- Thread 14 ---- PC 5: Stalled ----- 97618 in-flight CPI 1.7713 -- Total Cycles 172944 ---- Thread 15 ---- PC 5: Stalled ----- 100036 in-flight CPI 1.7285 -- Total Cycles 172944 ---- Thread 16 ---- PC 5: Stalled ----- 93536 in-flight CPI 1.8486 -- Total Cycles 172944 ---- Thread 17 ---- PC 5: Stalled ----- 95117 in-flight CPI 1.8179 -- Total Cycles 172944 ---- Thread 18 ---- PC 5: Stalled ----- 92624 in-flight CPI 1.8669 -- Total Cycles 172944 ---- Thread 19 ---- PC 5: Stalled ----- 122827 in-flight CPI 1.4079 -- Total Cycles 172944 ---- Thread 20 ---- PC 5: Stalled ----- 95238 in-flight CPI 1.8155 -- Total Cycles 172944 ---- Thread 21 ---- PC 5: Stalled ----- 93874 in-flight CPI 1.8419 -- Total Cycles 172944 ---- Thread 22 ---- PC 5: Stalled ----- 96229 in-flight CPI 1.7968 -- Total Cycles 172944 ---- Thread 23 ---- PC 5: Stalled ----- 87922 in-flight CPI 1.9666 -- Total Cycles 172944 ---- Thread 24 ---- PC 5: Stalled ----- 90168 in-flight CPI 1.9177 -- Total Cycles 172944 ---- Thread 25 ---- PC 5: Stalled ----- 92503 in-flight CPI 1.8692 -- Total Cycles 172944 ---- Thread 26 ---- PC 5: Stalled ----- 93115 in-flight CPI 1.8570 -- Total Cycles 172944 ---- Thread 27 ---- PC 5: Stalled ----- 83106 in-flight CPI 2.0807 -- Total Cycles 172944 ---- Thread 28 ---- PC 5: Stalled ----- 86131 in-flight CPI 2.0075 -- Total Cycles 172944 ---- Thread 29 ---- PC 5: Stalled ----- 91290 in-flight CPI 1.8940 -- Total Cycles 172944 ---- Thread 30 ---- PC 5: Stalled ----- 89003 in-flight CPI 1.9428 -- Total Cycles 172944 ---- Thread 31 ---- PC 5: Stalled ----- 84131 in-flight CPI 2.0552 -- Total Cycles 172944 Total CPI 0.0572 , IPC 17.4869 -- Total Cycles 172944 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8602 (3.333178%) FPSUB: 0 (0.000000%) FPMUL: 33133 (12.838665%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 125007 (48.438808%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5411 (2.096702%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 78314 (30.345795%) DIV: 7347 (2.846880%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.099972%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3315936 total) ADD%: 7.461 (247393) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.543 (51149) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.601 (19933) FPSUB%: 0.000 (0) FPMUL%: 4.932 (163530) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.238 (173690) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (570) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.087 (36055) FPLE%: 0.457 (15142) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.786 (92384) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.768 (25481) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.731 (521624) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (38966) ORI%: 1.611 (53417) XORI%: 0.000 (0) MULI%: 3.181 (105470) LW%: 1.124 (37272) LWI%: 13.430 (445334) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9473) SWI%: 4.046 (134176) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.392 (46145) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10302) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.064 (2122) bned%: 0.000 (0) bneid%: 13.837 (458842) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23746) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.135 (4471) DIV%: 0.012 (398) FPUN%: 1.485 (49256) FPRSUB%: 3.722 (123421) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (63) FPGT%: 2.932 (97219) FPGE%: 1.029 (34114) SYNC%: 0.000 (0) NOP%: 8.795 (291633) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 51 FPCMPLT 0 FPMIN 0 FPMAX 386 LOAD 40653 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1381 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48646 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 12307 XORI 0 MULI 9221 LW 0 LWI 141467 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 70 DIV 28 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 17.4871 --Total thread-cycles: 5534208 --total thread-cycles issued: 3024303 (54.647440%) --iCache conflicts: 108589 (1.962142%) --thread*cycles of FU dependence: 254305 (4.595147%) --thread*cycles of data dependence: 258072 (4.663215%) --iCache cycles*banks: 5534208 (59.917661% used) Issue breakdown: --thread*cycles of issue worked: 3024303 (54.647440%) --thread*cycles of issue failed: 2218272 (40.082917%) --thread*cycles of issue NOP/other: 291633 (5.269643%) Number of thread-cycles not ready: 258072 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3315936 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 7 2: 8 3: 7 4: 9 5: 7 6: 6 7: 6 8: 6 9: 9 10: 8 11: 8 12: 9 13: 7 14: 8 15: 7 16: 7 17: 7 18: 6 19: 6 20: 8 21: 8 22: 8 23: 7 24: 7 25: 8 26: 7 27: 6 28: 7 29: 8 30: 6 31: 7 <=== Core 53 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102962 in-flight CPI 1.4037 -- Total Cycles 144545 ---- Thread 01 ---- PC 5: Stalled ----- 104596 in-flight CPI 1.3816 -- Total Cycles 144545 ---- Thread 02 ---- PC 5: Stalled ----- 99107 in-flight CPI 1.4581 -- Total Cycles 144545 ---- Thread 03 ---- PC 5: Stalled ----- 95287 in-flight CPI 1.5167 -- Total Cycles 144545 ---- Thread 04 ---- PC 5: Stalled ----- 101709 in-flight CPI 1.4209 -- Total Cycles 144545 ---- Thread 05 ---- PC 5: Stalled ----- 95337 in-flight CPI 1.5159 -- Total Cycles 144545 ---- Thread 06 ---- PC 5: Stalled ----- 90209 in-flight CPI 1.6021 -- Total Cycles 144545 ---- Thread 07 ---- PC 5: Stalled ----- 98825 in-flight CPI 1.4624 -- Total Cycles 144545 ---- Thread 08 ---- PC 5: Stalled ----- 98351 in-flight CPI 1.4694 -- Total Cycles 144545 ---- Thread 09 ---- PC 5: Stalled ----- 94693 in-flight CPI 1.5262 -- Total Cycles 144545 ---- Thread 10 ---- PC 5: Stalled ----- 94286 in-flight CPI 1.5328 -- Total Cycles 144545 ---- Thread 11 ---- PC 5: Stalled ----- 93453 in-flight CPI 1.5464 -- Total Cycles 144545 ---- Thread 12 ---- PC 5: Stalled ----- 95810 in-flight CPI 1.5084 -- Total Cycles 144545 ---- Thread 13 ---- PC 5: Stalled ----- 94991 in-flight CPI 1.5215 -- Total Cycles 144545 ---- Thread 14 ---- PC 5: Stalled ----- 94368 in-flight CPI 1.5314 -- Total Cycles 144545 ---- Thread 15 ---- PC 5: Stalled ----- 98242 in-flight CPI 1.4710 -- Total Cycles 144545 ---- Thread 16 ---- PC 5: Stalled ----- 95426 in-flight CPI 1.5145 -- Total Cycles 144545 ---- Thread 17 ---- PC 5: Stalled ----- 93713 in-flight CPI 1.5422 -- Total Cycles 144545 ---- Thread 18 ---- PC 5: Stalled ----- 91718 in-flight CPI 1.5757 -- Total Cycles 144545 ---- Thread 19 ---- PC 5: Stalled ----- 92597 in-flight CPI 1.5607 -- Total Cycles 144545 ---- Thread 20 ---- PC 5: Stalled ----- 100857 in-flight CPI 1.4330 -- Total Cycles 144545 ---- Thread 21 ---- PC 5: Stalled ----- 91294 in-flight CPI 1.5831 -- Total Cycles 144545 ---- Thread 22 ---- PC 5: Stalled ----- 90476 in-flight CPI 1.5973 -- Total Cycles 144545 ---- Thread 23 ---- PC 5: Stalled ----- 86744 in-flight CPI 1.6661 -- Total Cycles 144545 ---- Thread 24 ---- PC 5: Stalled ----- 92503 in-flight CPI 1.5623 -- Total Cycles 144545 ---- Thread 25 ---- PC 5: Stalled ----- 88547 in-flight CPI 1.6321 -- Total Cycles 144545 ---- Thread 26 ---- PC 5: Stalled ----- 91075 in-flight CPI 1.5868 -- Total Cycles 144545 ---- Thread 27 ---- PC 5: Stalled ----- 86491 in-flight CPI 1.6710 -- Total Cycles 144545 ---- Thread 28 ---- PC 5: Stalled ----- 87567 in-flight CPI 1.6503 -- Total Cycles 144545 ---- Thread 29 ---- PC 5: Stalled ----- 91792 in-flight CPI 1.5744 -- Total Cycles 144545 ---- Thread 30 ---- PC 5: Stalled ----- 87120 in-flight CPI 1.6588 -- Total Cycles 144545 ---- Thread 31 ---- PC 5: Stalled ----- 84761 in-flight CPI 1.7052 -- Total Cycles 144545 Total CPI 0.0481 , IPC 20.7923 -- Total Cycles 144545 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8905 (3.718737%) FPSUB: 0 (0.000000%) FPMUL: 33724 (14.083178%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 103593 (43.260545%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5099 (2.129348%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 80871 (33.771814%) DIV: 7020 (2.931559%) FPUN: 0 (0.000000%) FPRSUB: 251 (0.104818%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3294962 total) ADD%: 7.453 (245574) SUB%: 0.000 (0) MUL%: 0.006 (190) BITOR%: 1.538 (50689) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.626 (20630) FPSUB%: 0.000 (0) FPMUL%: 5.011 (165114) FPCMPLT%: 0.000 (0) FPMIN%: 0.017 (570) FPMAX%: 0.017 (570) LOAD%: 5.282 (174026) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (222) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (544) FPINV%: 0.000 (0) FPCONV%: 0.018 (602) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.093 (36007) FPLE%: 0.457 (15058) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.017 (570) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.772 (91350) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.778 (25623) CMPU%: 0.000 (0) RSUB%: 0.006 (190) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.722 (518032) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (38674) ORI%: 1.628 (53652) XORI%: 0.000 (0) MULI%: 3.162 (104192) LW%: 1.118 (36844) LWI%: 13.383 (440964) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9363) SWI%: 4.027 (132680) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.385 (45623) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10204) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.067 (2220) bned%: 0.000 (0) bneid%: 13.800 (454712) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23705) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.140 (4629) DIV%: 0.012 (380) FPUN%: 1.477 (48669) FPRSUB%: 3.749 (123515) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (58) FPGT%: 2.917 (96125) FPGE%: 1.020 (33611) SYNC%: 0.000 (0) NOP%: 8.786 (289485) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 13 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 16 FPSUB 0 FPMUL 72 FPCMPLT 0 FPMIN 0 FPMAX 369 LOAD 41379 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1279 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48069 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 12814 XORI 0 MULI 8994 LW 0 LWI 140158 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 34 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.7925 --Total thread-cycles: 4625440 --total thread-cycles issued: 3005477 (64.977105%) --iCache conflicts: 110400 (2.386800%) --thread*cycles of FU dependence: 253340 (5.477101%) --thread*cycles of data dependence: 239463 (5.177086%) --iCache cycles*banks: 4625440 (71.236336% used) Issue breakdown: --thread*cycles of issue worked: 3005477 (64.977105%) --thread*cycles of issue failed: 1330478 (28.764355%) --thread*cycles of issue NOP/other: 289485 (6.258540%) Number of thread-cycles not ready: 239463 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3294962 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 9 2: 9 3: 6 4: 8 5: 7 6: 6 7: 8 8: 8 9: 7 10: 6 11: 7 12: 7 13: 6 14: 8 15: 8 16: 7 17: 7 18: 7 19: 8 20: 5 21: 6 22: 7 23: 6 24: 7 25: 7 26: 7 27: 6 28: 8 29: 7 30: 7 31: 4 <=== Core 54 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98392 in-flight CPI 1.3022 -- Total Cycles 128154 ---- Thread 01 ---- PC 5: Stalled ----- 97126 in-flight CPI 1.3192 -- Total Cycles 128154 ---- Thread 02 ---- PC 5: Stalled ----- 88764 in-flight CPI 1.4435 -- Total Cycles 128154 ---- Thread 03 ---- PC 5: Stalled ----- 98100 in-flight CPI 1.3061 -- Total Cycles 128154 ---- Thread 04 ---- PC 5: Stalled ----- 103670 in-flight CPI 1.2359 -- Total Cycles 128154 ---- Thread 05 ---- PC 5: Stalled ----- 101465 in-flight CPI 1.2628 -- Total Cycles 128154 ---- Thread 06 ---- PC 5: Stalled ----- 101941 in-flight CPI 1.2569 -- Total Cycles 128154 ---- Thread 07 ---- PC 5: Stalled ----- 99826 in-flight CPI 1.2835 -- Total Cycles 128154 ---- Thread 08 ---- PC 5: Stalled ----- 99090 in-flight CPI 1.2931 -- Total Cycles 128154 ---- Thread 09 ---- PC 5: Stalled ----- 94470 in-flight CPI 1.3563 -- Total Cycles 128154 ---- Thread 10 ---- PC 5: Stalled ----- 94740 in-flight CPI 1.3524 -- Total Cycles 128154 ---- Thread 11 ---- PC 5: Stalled ----- 94225 in-flight CPI 1.3598 -- Total Cycles 128154 ---- Thread 12 ---- PC 5: Stalled ----- 100767 in-flight CPI 1.2715 -- Total Cycles 128154 ---- Thread 13 ---- PC 5: Stalled ----- 95184 in-flight CPI 1.3461 -- Total Cycles 128154 ---- Thread 14 ---- PC 5: Stalled ----- 96088 in-flight CPI 1.3334 -- Total Cycles 128154 ---- Thread 15 ---- PC 5: Stalled ----- 91533 in-flight CPI 1.3998 -- Total Cycles 128154 ---- Thread 16 ---- PC 5: Stalled ----- 100000 in-flight CPI 1.2813 -- Total Cycles 128154 ---- Thread 17 ---- PC 5: Stalled ----- 90559 in-flight CPI 1.4149 -- Total Cycles 128154 ---- Thread 18 ---- PC 5: Stalled ----- 98200 in-flight CPI 1.3048 -- Total Cycles 128154 ---- Thread 19 ---- PC 5: Stalled ----- 91162 in-flight CPI 1.4055 -- Total Cycles 128154 ---- Thread 20 ---- PC 5: Stalled ----- 94901 in-flight CPI 1.3501 -- Total Cycles 128154 ---- Thread 21 ---- PC 5: Stalled ----- 99332 in-flight CPI 1.2899 -- Total Cycles 128154 ---- Thread 22 ---- PC 5: Stalled ----- 93832 in-flight CPI 1.3655 -- Total Cycles 128154 ---- Thread 23 ---- PC 5: Stalled ----- 88638 in-flight CPI 1.4455 -- Total Cycles 128154 ---- Thread 24 ---- PC 5: Stalled ----- 87084 in-flight CPI 1.4714 -- Total Cycles 128154 ---- Thread 25 ---- PC 5: Stalled ----- 95487 in-flight CPI 1.3419 -- Total Cycles 128154 ---- Thread 26 ---- PC 5: Stalled ----- 92252 in-flight CPI 1.3889 -- Total Cycles 128154 ---- Thread 27 ---- PC 5: Stalled ----- 86917 in-flight CPI 1.4742 -- Total Cycles 128154 ---- Thread 28 ---- PC 5: Stalled ----- 89709 in-flight CPI 1.4283 -- Total Cycles 128154 ---- Thread 29 ---- PC 5: Stalled ----- 91341 in-flight CPI 1.4028 -- Total Cycles 128154 ---- Thread 30 ---- PC 5: Stalled ----- 89441 in-flight CPI 1.4326 -- Total Cycles 128154 ---- Thread 31 ---- PC 5: Stalled ----- 82673 in-flight CPI 1.5499 -- Total Cycles 128154 Total CPI 0.0423 , IPC 23.6237 -- Total Cycles 128154 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7771 (4.004081%) FPSUB: 0 (0.000000%) FPMUL: 31788 (16.379066%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 69650 (35.887818%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5628 (2.899880%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71417 (36.798281%) DIV: 7565 (3.897937%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.132937%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3319554 total) ADD%: 7.541 (250334) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.528 (50733) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.553 (18373) FPSUB%: 0.000 (0) FPMUL%: 4.793 (159121) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (615) FPMAX%: 0.019 (615) LOAD%: 5.165 (171461) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (588) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.069 (35486) FPLE%: 0.455 (15120) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.819 (93585) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.745 (24717) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.751 (522856) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39069) ORI%: 1.569 (52074) XORI%: 0.000 (0) MULI%: 3.221 (106920) LW%: 1.138 (37762) LWI%: 13.546 (449652) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9588) SWI%: 4.081 (135475) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.409 (46759) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10322) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1964) bned%: 0.000 (0) bneid%: 13.863 (460177) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23866) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4062) DIV%: 0.012 (410) FPUN%: 1.483 (49221) FPRSUB%: 3.679 (122119) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.961 (98307) FPGE%: 1.027 (34101) SYNC%: 0.000 (0) NOP%: 8.797 (292030) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 8 FPSUB 0 FPMUL 42 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 40028 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1620 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49034 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 11049 XORI 0 MULI 9518 LW 0 LWI 142031 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 83 DIV 22 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6239 --Total thread-cycles: 4100928 --total thread-cycles issued: 3027524 (73.825339%) --iCache conflicts: 114411 (2.789881%) --thread*cycles of FU dependence: 253925 (6.191891%) --thread*cycles of data dependence: 194077 (4.732514%) --iCache cycles*banks: 4100928 (80.947190% used) Issue breakdown: --thread*cycles of issue worked: 3027524 (73.825339%) --thread*cycles of issue failed: 781374 (19.053590%) --thread*cycles of issue NOP/other: 292030 (7.121071%) Number of thread-cycles not ready: 194077 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3319554 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 7 3: 7 4: 9 5: 8 6: 8 7: 8 8: 8 9: 7 10: 8 11: 8 12: 8 13: 8 14: 8 15: 8 16: 8 17: 6 18: 7 19: 7 20: 8 21: 7 22: 7 23: 7 24: 6 25: 7 26: 8 27: 7 28: 7 29: 7 30: 7 31: 5 <=== Core 55 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99267 in-flight CPI 1.3898 -- Total Cycles 137987 ---- Thread 01 ---- PC 5: Stalled ----- 94100 in-flight CPI 1.4661 -- Total Cycles 137987 ---- Thread 02 ---- PC 5: Stalled ----- 94126 in-flight CPI 1.4657 -- Total Cycles 137987 ---- Thread 03 ---- PC 5: Stalled ----- 97003 in-flight CPI 1.4222 -- Total Cycles 137987 ---- Thread 04 ---- PC 5: Stalled ----- 94737 in-flight CPI 1.4563 -- Total Cycles 137987 ---- Thread 05 ---- PC 5: Stalled ----- 95905 in-flight CPI 1.4385 -- Total Cycles 137987 ---- Thread 06 ---- PC 5: Stalled ----- 93608 in-flight CPI 1.4738 -- Total Cycles 137987 ---- Thread 07 ---- PC 5: Stalled ----- 101997 in-flight CPI 1.3527 -- Total Cycles 137987 ---- Thread 08 ---- PC 5: Stalled ----- 97540 in-flight CPI 1.4144 -- Total Cycles 137987 ---- Thread 09 ---- PC 5: Stalled ----- 95493 in-flight CPI 1.4447 -- Total Cycles 137987 ---- Thread 10 ---- PC 5: Stalled ----- 100558 in-flight CPI 1.3720 -- Total Cycles 137987 ---- Thread 11 ---- PC 5: Stalled ----- 94392 in-flight CPI 1.4616 -- Total Cycles 137987 ---- Thread 12 ---- PC 5: Stalled ----- 97391 in-flight CPI 1.4166 -- Total Cycles 137987 ---- Thread 13 ---- PC 5: Stalled ----- 101701 in-flight CPI 1.3565 -- Total Cycles 137987 ---- Thread 14 ---- PC 5: Stalled ----- 95514 in-flight CPI 1.4444 -- Total Cycles 137987 ---- Thread 15 ---- PC 5: Stalled ----- 98079 in-flight CPI 1.4066 -- Total Cycles 137987 ---- Thread 16 ---- PC 5: Stalled ----- 90947 in-flight CPI 1.5169 -- Total Cycles 137987 ---- Thread 17 ---- PC 5: Stalled ----- 99179 in-flight CPI 1.3910 -- Total Cycles 137987 ---- Thread 18 ---- PC 5: Stalled ----- 95636 in-flight CPI 1.4426 -- Total Cycles 137987 ---- Thread 19 ---- PC 5: Stalled ----- 98928 in-flight CPI 1.3946 -- Total Cycles 137987 ---- Thread 20 ---- PC 5: Stalled ----- 98259 in-flight CPI 1.4040 -- Total Cycles 137987 ---- Thread 21 ---- PC 5: Stalled ----- 95396 in-flight CPI 1.4462 -- Total Cycles 137987 ---- Thread 22 ---- PC 5: Stalled ----- 95491 in-flight CPI 1.4447 -- Total Cycles 137987 ---- Thread 23 ---- PC 5: Stalled ----- 92607 in-flight CPI 1.4898 -- Total Cycles 137987 ---- Thread 24 ---- PC 5: Stalled ----- 93563 in-flight CPI 1.4745 -- Total Cycles 137987 ---- Thread 25 ---- PC 5: Stalled ----- 88863 in-flight CPI 1.5525 -- Total Cycles 137987 ---- Thread 26 ---- PC 5: Stalled ----- 94241 in-flight CPI 1.4640 -- Total Cycles 137987 ---- Thread 27 ---- PC 5: Stalled ----- 92048 in-flight CPI 1.4988 -- Total Cycles 137987 ---- Thread 28 ---- PC 5: Stalled ----- 91709 in-flight CPI 1.5043 -- Total Cycles 137987 ---- Thread 29 ---- PC 5: Stalled ----- 87921 in-flight CPI 1.5692 -- Total Cycles 137987 ---- Thread 30 ---- PC 5: Stalled ----- 82529 in-flight CPI 1.6717 -- Total Cycles 137987 ---- Thread 31 ---- PC 5: Stalled ----- 88651 in-flight CPI 1.5562 -- Total Cycles 137987 Total CPI 0.0454 , IPC 22.0161 -- Total Cycles 137987 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7637 (3.694244%) FPSUB: 0 (0.000000%) FPMUL: 31519 (15.246678%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 84069 (40.666676%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5593 (2.705500%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70154 (33.935577%) DIV: 7490 (3.623136%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.128188%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3330744 total) ADD%: 7.469 (248765) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.530 (50972) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.541 (18027) FPSUB%: 0.000 (0) FPMUL%: 4.762 (158611) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.170 (172208) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (584) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35421) FPLE%: 0.458 (15269) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.834 (94388) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (24935) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.787 (525823) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.185 (39485) ORI%: 1.553 (51735) XORI%: 0.000 (0) MULI%: 3.231 (107604) LW%: 1.143 (38080) LWI%: 13.577 (452200) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9664) SWI%: 4.102 (136625) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.416 (47164) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.313 (10409) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1820) bned%: 0.000 (0) bneid%: 13.875 (462138) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23929) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (3989) DIV%: 0.012 (406) FPUN%: 1.482 (49364) FPRSUB%: 3.672 (122301) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.963 (98700) FPGE%: 1.024 (34095) SYNC%: 0.000 (0) NOP%: 8.790 (292756) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 56 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 39855 INTCONV 0 ATOMIC_INC 27 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1613 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49384 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 10854 XORI 0 MULI 9647 LW 0 LWI 142854 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 81 DIV 33 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.0163 --Total thread-cycles: 4415584 --total thread-cycles issued: 3037988 (68.801499%) --iCache conflicts: 113000 (2.559118%) --thread*cycles of FU dependence: 254892 (5.772555%) --thread*cycles of data dependence: 206727 (4.681759%) --iCache cycles*banks: 4415584 (75.432287% used) Issue breakdown: --thread*cycles of issue worked: 3037988 (68.801499%) --thread*cycles of issue failed: 1084840 (24.568438%) --thread*cycles of issue NOP/other: 292756 (6.630063%) Number of thread-cycles not ready: 206727 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3330744 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 8 4: 7 5: 8 6: 7 7: 5 8: 7 9: 9 10: 8 11: 7 12: 8 13: 8 14: 7 15: 8 16: 8 17: 8 18: 7 19: 8 20: 8 21: 8 22: 8 23: 7 24: 8 25: 7 26: 6 27: 7 28: 7 29: 6 30: 6 31: 7 <=== Core 56 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96845 in-flight CPI 1.3052 -- Total Cycles 126423 ---- Thread 01 ---- PC 5: Stalled ----- 98210 in-flight CPI 1.2871 -- Total Cycles 126423 ---- Thread 02 ---- PC 5: Stalled ----- 102459 in-flight CPI 1.2336 -- Total Cycles 126423 ---- Thread 03 ---- PC 5: Stalled ----- 94971 in-flight CPI 1.3309 -- Total Cycles 126423 ---- Thread 04 ---- PC 5: Stalled ----- 99293 in-flight CPI 1.2730 -- Total Cycles 126423 ---- Thread 05 ---- PC 5: Stalled ----- 98588 in-flight CPI 1.2821 -- Total Cycles 126423 ---- Thread 06 ---- PC 5: Stalled ----- 95121 in-flight CPI 1.3288 -- Total Cycles 126423 ---- Thread 07 ---- PC 5: Stalled ----- 94370 in-flight CPI 1.3394 -- Total Cycles 126423 ---- Thread 08 ---- PC 5: Stalled ----- 93828 in-flight CPI 1.3471 -- Total Cycles 126423 ---- Thread 09 ---- PC 5: Stalled ----- 94694 in-flight CPI 1.3348 -- Total Cycles 126423 ---- Thread 10 ---- PC 5: Stalled ----- 102758 in-flight CPI 1.2301 -- Total Cycles 126423 ---- Thread 11 ---- PC 5: Stalled ----- 98872 in-flight CPI 1.2784 -- Total Cycles 126423 ---- Thread 12 ---- PC 5: Stalled ----- 97400 in-flight CPI 1.2977 -- Total Cycles 126423 ---- Thread 13 ---- PC 5: Stalled ----- 95962 in-flight CPI 1.3172 -- Total Cycles 126423 ---- Thread 14 ---- PC 5: Stalled ----- 96152 in-flight CPI 1.3146 -- Total Cycles 126423 ---- Thread 15 ---- PC 5: Stalled ----- 95934 in-flight CPI 1.3176 -- Total Cycles 126423 ---- Thread 16 ---- PC 5: Stalled ----- 98841 in-flight CPI 1.2788 -- Total Cycles 126423 ---- Thread 17 ---- PC 5: Stalled ----- 90970 in-flight CPI 1.3895 -- Total Cycles 126423 ---- Thread 18 ---- PC 5: Stalled ----- 94635 in-flight CPI 1.3357 -- Total Cycles 126423 ---- Thread 19 ---- PC 5: Stalled ----- 97686 in-flight CPI 1.2939 -- Total Cycles 126423 ---- Thread 20 ---- PC 5: Stalled ----- 93394 in-flight CPI 1.3534 -- Total Cycles 126423 ---- Thread 21 ---- PC 5: Stalled ----- 94324 in-flight CPI 1.3400 -- Total Cycles 126423 ---- Thread 22 ---- PC 5: Stalled ----- 90467 in-flight CPI 1.3972 -- Total Cycles 126423 ---- Thread 23 ---- PC 5: Stalled ----- 93696 in-flight CPI 1.3491 -- Total Cycles 126423 ---- Thread 24 ---- PC 5: Stalled ----- 87465 in-flight CPI 1.4452 -- Total Cycles 126423 ---- Thread 25 ---- PC 5: Stalled ----- 88402 in-flight CPI 1.4298 -- Total Cycles 126423 ---- Thread 26 ---- PC 5: Stalled ----- 86044 in-flight CPI 1.4690 -- Total Cycles 126423 ---- Thread 27 ---- PC 5: Stalled ----- 91614 in-flight CPI 1.3797 -- Total Cycles 126423 ---- Thread 28 ---- PC 5: Stalled ----- 92773 in-flight CPI 1.3625 -- Total Cycles 126423 ---- Thread 29 ---- PC 5: Stalled ----- 92331 in-flight CPI 1.3690 -- Total Cycles 126423 ---- Thread 30 ---- PC 5: Stalled ----- 89671 in-flight CPI 1.4096 -- Total Cycles 126423 ---- Thread 31 ---- PC 5: Stalled ----- 86896 in-flight CPI 1.4546 -- Total Cycles 126423 Total CPI 0.0418 , IPC 23.9296 -- Total Cycles 126423 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7102 (3.670455%) FPSUB: 0 (0.000000%) FPMUL: 30646 (15.838463%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 75302 (38.917572%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5835 (3.015644%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 66427 (34.330796%) DIV: 7902 (4.083911%) FPUN: 0 (0.000000%) FPRSUB: 277 (0.143159%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3316862 total) ADD%: 7.553 (250519) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.542 (51134) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.516 (17102) FPSUB%: 0.000 (0) FPMUL%: 4.684 (155364) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.099 (169134) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (609) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.055 (34990) FPLE%: 0.458 (15185) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.840 (94214) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.731 (24244) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.766 (522920) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.181 (39170) ORI%: 1.546 (51289) XORI%: 0.000 (0) MULI%: 3.247 (107714) LW%: 1.147 (38028) LWI%: 13.623 (451865) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.291 (9656) SWI%: 4.113 (136407) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.419 (47075) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10344) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.052 (1713) bned%: 0.000 (0) bneid%: 13.900 (461052) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.726 (24092) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.113 (3762) DIV%: 0.013 (428) FPUN%: 1.501 (49778) FPRSUB%: 3.645 (120907) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.972 (98579) FPGE%: 1.043 (34593) SYNC%: 0.000 (0) NOP%: 8.790 (291554) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 33 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 40 FPCMPLT 0 FPMIN 0 FPMAX 414 LOAD 38247 INTCONV 0 ATOMIC_INC 23 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1500 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49402 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 16 ORI 10079 XORI 0 MULI 10373 LW 0 LWI 142768 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 71 DIV 21 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9299 --Total thread-cycles: 4045536 --total thread-cycles issued: 3025308 (74.781389%) --iCache conflicts: 113272 (2.799926%) --thread*cycles of FU dependence: 253048 (6.254993%) --thread*cycles of data dependence: 193491 (4.782827%) --iCache cycles*banks: 4045536 (81.988987% used) Issue breakdown: --thread*cycles of issue worked: 3025308 (74.781389%) --thread*cycles of issue failed: 728674 (18.011804%) --thread*cycles of issue NOP/other: 291554 (7.206808%) Number of thread-cycles not ready: 193491 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3316862 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 9 3: 8 4: 8 5: 8 6: 8 7: 8 8: 8 9: 7 10: 8 11: 9 12: 8 13: 8 14: 7 15: 7 16: 8 17: 6 18: 7 19: 9 20: 8 21: 8 22: 8 23: 7 24: 6 25: 7 26: 7 27: 8 28: 7 29: 8 30: 8 31: 8 <=== Core 57 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102064 in-flight CPI 1.2659 -- Total Cycles 129234 ---- Thread 01 ---- PC 5: Stalled ----- 93744 in-flight CPI 1.3783 -- Total Cycles 129234 ---- Thread 02 ---- PC 5: Stalled ----- 102987 in-flight CPI 1.2546 -- Total Cycles 129234 ---- Thread 03 ---- PC 5: Stalled ----- 96853 in-flight CPI 1.3341 -- Total Cycles 129234 ---- Thread 04 ---- PC 5: Stalled ----- 94681 in-flight CPI 1.3647 -- Total Cycles 129234 ---- Thread 05 ---- PC 5: Stalled ----- 95759 in-flight CPI 1.3493 -- Total Cycles 129234 ---- Thread 06 ---- PC 5: Stalled ----- 96123 in-flight CPI 1.3442 -- Total Cycles 129234 ---- Thread 07 ---- PC 5: Stalled ----- 100836 in-flight CPI 1.2814 -- Total Cycles 129234 ---- Thread 08 ---- PC 5: Stalled ----- 100144 in-flight CPI 1.2902 -- Total Cycles 129234 ---- Thread 09 ---- PC 5: Stalled ----- 98884 in-flight CPI 1.3067 -- Total Cycles 129234 ---- Thread 10 ---- PC 5: Stalled ----- 87084 in-flight CPI 1.4838 -- Total Cycles 129234 ---- Thread 11 ---- PC 5: Stalled ----- 99504 in-flight CPI 1.2985 -- Total Cycles 129234 ---- Thread 12 ---- PC 5: Stalled ----- 95013 in-flight CPI 1.3599 -- Total Cycles 129234 ---- Thread 13 ---- PC 5: Stalled ----- 98953 in-flight CPI 1.3058 -- Total Cycles 129234 ---- Thread 14 ---- PC 5: Stalled ----- 94235 in-flight CPI 1.3712 -- Total Cycles 129234 ---- Thread 15 ---- PC 5: Stalled ----- 97865 in-flight CPI 1.3203 -- Total Cycles 129234 ---- Thread 16 ---- PC 5: Stalled ----- 94755 in-flight CPI 1.3637 -- Total Cycles 129234 ---- Thread 17 ---- PC 5: Stalled ----- 95319 in-flight CPI 1.3555 -- Total Cycles 129234 ---- Thread 18 ---- PC 5: Stalled ----- 96316 in-flight CPI 1.3415 -- Total Cycles 129234 ---- Thread 19 ---- PC 5: Stalled ----- 94955 in-flight CPI 1.3608 -- Total Cycles 129234 ---- Thread 20 ---- PC 5: Stalled ----- 96839 in-flight CPI 1.3343 -- Total Cycles 129234 ---- Thread 21 ---- PC 5: Stalled ----- 98062 in-flight CPI 1.3176 -- Total Cycles 129234 ---- Thread 22 ---- PC 5: Stalled ----- 94713 in-flight CPI 1.3642 -- Total Cycles 129234 ---- Thread 23 ---- PC 5: Stalled ----- 93786 in-flight CPI 1.3777 -- Total Cycles 129234 ---- Thread 24 ---- PC 5: Stalled ----- 89060 in-flight CPI 1.4508 -- Total Cycles 129234 ---- Thread 25 ---- PC 5: Stalled ----- 89206 in-flight CPI 1.4484 -- Total Cycles 129234 ---- Thread 26 ---- PC 5: Stalled ----- 85658 in-flight CPI 1.5085 -- Total Cycles 129234 ---- Thread 27 ---- PC 5: Stalled ----- 97745 in-flight CPI 1.3219 -- Total Cycles 129234 ---- Thread 28 ---- PC 5: Stalled ----- 91353 in-flight CPI 1.4144 -- Total Cycles 129234 ---- Thread 29 ---- PC 5: Stalled ----- 89415 in-flight CPI 1.4450 -- Total Cycles 129234 ---- Thread 30 ---- PC 5: Stalled ----- 90478 in-flight CPI 1.4281 -- Total Cycles 129234 ---- Thread 31 ---- PC 5: Stalled ----- 88762 in-flight CPI 1.4557 -- Total Cycles 129234 Total CPI 0.0425 , IPC 23.5365 -- Total Cycles 129234 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7245 (3.942407%) FPSUB: 0 (0.000000%) FPMUL: 30758 (16.737135%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 65032 (35.387520%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5623 (3.059786%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67284 (36.612959%) DIV: 7563 (4.115448%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.144745%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3334770 total) ADD%: 7.473 (249192) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.535 (51183) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.518 (17273) FPSUB%: 0.000 (0) FPMUL%: 4.694 (156525) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.141 (171454) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (588) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.054 (35134) FPLE%: 0.460 (15325) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.851 (95085) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.740 (24668) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.802 (526967) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.188 (39603) ORI%: 1.544 (51484) XORI%: 0.000 (0) MULI%: 3.246 (108262) LW%: 1.150 (38362) LWI%: 13.615 (454026) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.292 (9731) SWI%: 4.119 (137366) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.425 (47516) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.313 (10446) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1766) bned%: 0.000 (0) bneid%: 13.891 (463221) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.726 (24196) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.114 (3818) DIV%: 0.012 (410) FPUN%: 1.491 (49737) FPRSUB%: 3.651 (121740) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (60) FPGT%: 2.968 (98981) FPGE%: 1.032 (34412) SYNC%: 0.000 (0) NOP%: 8.786 (293004) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 26 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 56 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 38408 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 8 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1378 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49498 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 10278 XORI 0 MULI 9777 LW 0 LWI 143254 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 81 DIV 33 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5367 --Total thread-cycles: 4135488 --total thread-cycles issued: 3041766 (73.552771%) --iCache conflicts: 115253 (2.786926%) --thread*cycles of FU dependence: 253254 (6.123921%) --thread*cycles of data dependence: 183771 (4.443756%) --iCache cycles*banks: 4135488 (80.638657% used) Issue breakdown: --thread*cycles of issue worked: 3041766 (73.552771%) --thread*cycles of issue failed: 800718 (19.362116%) --thread*cycles of issue NOP/other: 293004 (7.085113%) Number of thread-cycles not ready: 183771 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3334770 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 8 3: 7 4: 7 5: 8 6: 8 7: 7 8: 8 9: 8 10: 5 11: 8 12: 8 13: 7 14: 7 15: 8 16: 6 17: 8 18: 7 19: 7 20: 7 21: 8 22: 8 23: 8 24: 7 25: 7 26: 6 27: 7 28: 8 29: 8 30: 7 31: 7 <=== Core 58 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94755 in-flight CPI 1.7072 -- Total Cycles 161800 ---- Thread 01 ---- PC 5: Stalled ----- 102978 in-flight CPI 1.5709 -- Total Cycles 161800 ---- Thread 02 ---- PC 5: Stalled ----- 95787 in-flight CPI 1.6888 -- Total Cycles 161800 ---- Thread 03 ---- PC 5: Stalled ----- 93743 in-flight CPI 1.7256 -- Total Cycles 161800 ---- Thread 04 ---- PC 5: Stalled ----- 95510 in-flight CPI 1.6937 -- Total Cycles 161800 ---- Thread 05 ---- PC 5: Stalled ----- 93928 in-flight CPI 1.7222 -- Total Cycles 161800 ---- Thread 06 ---- PC 5: Stalled ----- 103099 in-flight CPI 1.5691 -- Total Cycles 161800 ---- Thread 07 ---- PC 5: Stalled ----- 100185 in-flight CPI 1.6147 -- Total Cycles 161800 ---- Thread 08 ---- PC 5: Stalled ----- 100867 in-flight CPI 1.6038 -- Total Cycles 161800 ---- Thread 09 ---- PC 5: Stalled ----- 94632 in-flight CPI 1.7095 -- Total Cycles 161800 ---- Thread 10 ---- PC 5: Stalled ----- 97594 in-flight CPI 1.6577 -- Total Cycles 161800 ---- Thread 11 ---- PC 5: Stalled ----- 100401 in-flight CPI 1.6112 -- Total Cycles 161800 ---- Thread 12 ---- PC 5: Stalled ----- 96930 in-flight CPI 1.6690 -- Total Cycles 161800 ---- Thread 13 ---- PC 5: Stalled ----- 98419 in-flight CPI 1.6437 -- Total Cycles 161800 ---- Thread 14 ---- PC 5: Stalled ----- 95060 in-flight CPI 1.7018 -- Total Cycles 161800 ---- Thread 15 ---- PC 5: Stalled ----- 98380 in-flight CPI 1.6443 -- Total Cycles 161800 ---- Thread 16 ---- PC 5: Stalled ----- 87438 in-flight CPI 1.8502 -- Total Cycles 161800 ---- Thread 17 ---- PC 5: Stalled ----- 90696 in-flight CPI 1.7837 -- Total Cycles 161800 ---- Thread 18 ---- PC 5: Stalled ----- 97184 in-flight CPI 1.6646 -- Total Cycles 161800 ---- Thread 19 ---- PC 5: Stalled ----- 96849 in-flight CPI 1.6703 -- Total Cycles 161800 ---- Thread 20 ---- PC 5: Stalled ----- 93330 in-flight CPI 1.7333 -- Total Cycles 161800 ---- Thread 21 ---- PC 5: Stalled ----- 87061 in-flight CPI 1.8582 -- Total Cycles 161800 ---- Thread 22 ---- PC 5: Stalled ----- 94204 in-flight CPI 1.7172 -- Total Cycles 161800 ---- Thread 23 ---- PC 5: Stalled ----- 95424 in-flight CPI 1.6953 -- Total Cycles 161800 ---- Thread 24 ---- PC 5: Stalled ----- 115360 in-flight CPI 1.4024 -- Total Cycles 161800 ---- Thread 25 ---- PC 5: Stalled ----- 92684 in-flight CPI 1.7455 -- Total Cycles 161800 ---- Thread 26 ---- PC 5: Stalled ----- 92809 in-flight CPI 1.7430 -- Total Cycles 161800 ---- Thread 27 ---- PC 5: Stalled ----- 92532 in-flight CPI 1.7482 -- Total Cycles 161800 ---- Thread 28 ---- PC 5: Stalled ----- 90648 in-flight CPI 1.7845 -- Total Cycles 161800 ---- Thread 29 ---- PC 5: Stalled ----- 88599 in-flight CPI 1.8259 -- Total Cycles 161800 ---- Thread 30 ---- PC 5: Stalled ----- 88649 in-flight CPI 1.8248 -- Total Cycles 161800 ---- Thread 31 ---- PC 5: Stalled ----- 91858 in-flight CPI 1.7610 -- Total Cycles 161800 Total CPI 0.0529 , IPC 18.9009 -- Total Cycles 161800 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8284 (4.059133%) FPSUB: 0 (0.000000%) FPMUL: 32987 (16.163522%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73317 (35.925089%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5656 (2.771421%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75929 (37.204961%) DIV: 7646 (3.746515%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.129359%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3353809 total) ADD%: 7.461 (250242) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.526 (51164) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.581 (19473) FPSUB%: 0.000 (0) FPMUL%: 4.876 (163540) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.199 (174376) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (592) FPINV%: 0.000 (0) FPCONV%: 0.019 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.081 (36264) FPLE%: 0.455 (15257) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.798 (93852) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.756 (25345) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.740 (527879) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39344) ORI%: 1.589 (53278) XORI%: 0.000 (0) MULI%: 3.203 (107416) LW%: 1.129 (37872) LWI%: 13.506 (452975) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9624) SWI%: 4.062 (136225) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.398 (46884) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10436) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.063 (2125) bned%: 0.000 (0) bneid%: 13.856 (464708) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23968) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.129 (4327) DIV%: 0.012 (414) FPUN%: 1.475 (49485) FPRSUB%: 3.708 (124345) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.959 (99242) FPGE%: 1.021 (34228) SYNC%: 0.000 (0) NOP%: 8.814 (295595) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 57 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 40298 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 7 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1613 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49336 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11871 XORI 0 MULI 9582 LW 0 LWI 143545 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 81 DIV 28 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 18.9011 --Total thread-cycles: 5177600 --total thread-cycles issued: 3058214 (59.066247%) --iCache conflicts: 113420 (2.190590%) --thread*cycles of FU dependence: 256889 (4.961546%) --thread*cycles of data dependence: 204083 (3.941653%) --iCache cycles*banks: 5177600 (64.775977% used) Issue breakdown: --thread*cycles of issue worked: 3058214 (59.066247%) --thread*cycles of issue failed: 1823791 (35.224641%) --thread*cycles of issue NOP/other: 295595 (5.709112%) Number of thread-cycles not ready: 204083 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3353809 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 8 3: 8 4: 8 5: 8 6: 8 7: 7 8: 8 9: 7 10: 6 11: 8 12: 7 13: 8 14: 7 15: 8 16: 6 17: 7 18: 7 19: 8 20: 8 21: 6 22: 8 23: 7 24: 6 25: 6 26: 8 27: 8 28: 8 29: 7 30: 8 31: 8 <=== Core 59 ===> ---- Thread 00 ---- PC 5: Stalled ----- 93916 in-flight CPI 1.4660 -- Total Cycles 137708 ---- Thread 01 ---- PC 5: Stalled ----- 97333 in-flight CPI 1.4145 -- Total Cycles 137708 ---- Thread 02 ---- PC 5: Stalled ----- 99831 in-flight CPI 1.3792 -- Total Cycles 137708 ---- Thread 03 ---- PC 5: Stalled ----- 101852 in-flight CPI 1.3517 -- Total Cycles 137708 ---- Thread 04 ---- PC 5: Stalled ----- 100028 in-flight CPI 1.3764 -- Total Cycles 137708 ---- Thread 05 ---- PC 5: Stalled ----- 93166 in-flight CPI 1.4778 -- Total Cycles 137708 ---- Thread 06 ---- PC 5: Stalled ----- 98535 in-flight CPI 1.3973 -- Total Cycles 137708 ---- Thread 07 ---- PC 5: Stalled ----- 93314 in-flight CPI 1.4755 -- Total Cycles 137708 ---- Thread 08 ---- PC 5: Stalled ----- 99992 in-flight CPI 1.3769 -- Total Cycles 137708 ---- Thread 09 ---- PC 5: Stalled ----- 96185 in-flight CPI 1.4315 -- Total Cycles 137708 ---- Thread 10 ---- PC 5: Stalled ----- 96861 in-flight CPI 1.4215 -- Total Cycles 137708 ---- Thread 11 ---- PC 5: Stalled ----- 95347 in-flight CPI 1.4440 -- Total Cycles 137708 ---- Thread 12 ---- PC 5: Stalled ----- 98580 in-flight CPI 1.3966 -- Total Cycles 137708 ---- Thread 13 ---- PC 5: Stalled ----- 93606 in-flight CPI 1.4709 -- Total Cycles 137708 ---- Thread 14 ---- PC 5: Stalled ----- 96335 in-flight CPI 1.4292 -- Total Cycles 137708 ---- Thread 15 ---- PC 5: Stalled ----- 91796 in-flight CPI 1.4998 -- Total Cycles 137708 ---- Thread 16 ---- PC 5: Stalled ----- 93443 in-flight CPI 1.4735 -- Total Cycles 137708 ---- Thread 17 ---- PC 5: Stalled ----- 96739 in-flight CPI 1.4232 -- Total Cycles 137708 ---- Thread 18 ---- PC 5: Stalled ----- 99434 in-flight CPI 1.3847 -- Total Cycles 137708 ---- Thread 19 ---- PC 5: Stalled ----- 95463 in-flight CPI 1.4422 -- Total Cycles 137708 ---- Thread 20 ---- PC 5: Stalled ----- 87431 in-flight CPI 1.5749 -- Total Cycles 137708 ---- Thread 21 ---- PC 5: Stalled ----- 91768 in-flight CPI 1.5003 -- Total Cycles 137708 ---- Thread 22 ---- PC 5: Stalled ----- 93482 in-flight CPI 1.4728 -- Total Cycles 137708 ---- Thread 23 ---- PC 5: Stalled ----- 92966 in-flight CPI 1.4811 -- Total Cycles 137708 ---- Thread 24 ---- PC 5: Stalled ----- 97199 in-flight CPI 1.4165 -- Total Cycles 137708 ---- Thread 25 ---- PC 5: Stalled ----- 88113 in-flight CPI 1.5626 -- Total Cycles 137708 ---- Thread 26 ---- PC 5: Stalled ----- 96722 in-flight CPI 1.4236 -- Total Cycles 137708 ---- Thread 27 ---- PC 5: Stalled ----- 91700 in-flight CPI 1.5015 -- Total Cycles 137708 ---- Thread 28 ---- PC 5: Stalled ----- 88430 in-flight CPI 1.5570 -- Total Cycles 137708 ---- Thread 29 ---- PC 5: Stalled ----- 87883 in-flight CPI 1.5667 -- Total Cycles 137708 ---- Thread 30 ---- PC 5: Stalled ----- 87020 in-flight CPI 1.5822 -- Total Cycles 137708 ---- Thread 31 ---- PC 5: Stalled ----- 88821 in-flight CPI 1.5501 -- Total Cycles 137708 Total CPI 0.0455 , IPC 21.9584 -- Total Cycles 137708 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8227 (3.955612%) FPSUB: 0 (0.000000%) FPMUL: 32613 (15.680609%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 77960 (37.483833%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5702 (2.741570%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75647 (36.371723%) DIV: 7567 (3.638278%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.128376%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3316045 total) ADD%: 7.476 (247907) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.536 (50923) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.584 (19375) FPSUB%: 0.000 (0) FPMUL%: 4.883 (161935) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (615) FPMAX%: 0.019 (615) LOAD%: 5.200 (172428) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (592) FPINV%: 0.000 (0) FPCONV%: 0.020 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.082 (35884) FPLE%: 0.458 (15180) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.791 (92565) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.759 (25179) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.738 (521894) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (38944) ORI%: 1.593 (52825) XORI%: 0.000 (0) MULI%: 3.193 (105896) LW%: 1.126 (37354) LWI%: 13.481 (447047) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9507) SWI%: 4.058 (134549) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.394 (46232) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10305) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (1996) bned%: 0.000 (0) bneid%: 13.862 (459655) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23725) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4310) DIV%: 0.012 (410) FPUN%: 1.482 (49128) FPRSUB%: 3.707 (122937) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.954 (97968) FPGE%: 1.024 (33948) SYNC%: 0.000 (0) NOP%: 8.810 (292139) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 29 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 63 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 41016 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1405 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48796 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11752 XORI 0 MULI 9311 LW 0 LWI 141858 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 80 DIV 26 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.9587 --Total thread-cycles: 4406656 --total thread-cycles issued: 3023906 (68.621331%) --iCache conflicts: 112660 (2.556587%) --thread*cycles of FU dependence: 254811 (5.782412%) --thread*cycles of data dependence: 207983 (4.719747%) --iCache cycles*banks: 4406656 (75.251551% used) Issue breakdown: --thread*cycles of issue worked: 3023906 (68.621331%) --thread*cycles of issue failed: 1090611 (24.749175%) --thread*cycles of issue NOP/other: 292139 (6.629494%) Number of thread-cycles not ready: 207983 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3316045 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 9 4: 9 5: 7 6: 8 7: 6 8: 9 9: 7 10: 7 11: 8 12: 9 13: 7 14: 7 15: 8 16: 7 17: 8 18: 8 19: 8 20: 5 21: 7 22: 8 23: 6 24: 8 25: 7 26: 6 27: 7 28: 7 29: 7 30: 7 31: 8 <=== Core 60 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94981 in-flight CPI 1.3581 -- Total Cycles 129020 ---- Thread 01 ---- PC 5: Stalled ----- 98125 in-flight CPI 1.3146 -- Total Cycles 129020 ---- Thread 02 ---- PC 5: Stalled ----- 94990 in-flight CPI 1.3580 -- Total Cycles 129020 ---- Thread 03 ---- PC 5: Stalled ----- 98883 in-flight CPI 1.3046 -- Total Cycles 129020 ---- Thread 04 ---- PC 5: Stalled ----- 101917 in-flight CPI 1.2657 -- Total Cycles 129020 ---- Thread 05 ---- PC 5: Stalled ----- 95509 in-flight CPI 1.3506 -- Total Cycles 129020 ---- Thread 06 ---- PC 5: Stalled ----- 102249 in-flight CPI 1.2616 -- Total Cycles 129020 ---- Thread 07 ---- PC 5: Stalled ----- 98030 in-flight CPI 1.3159 -- Total Cycles 129020 ---- Thread 08 ---- PC 5: Stalled ----- 102575 in-flight CPI 1.2576 -- Total Cycles 129020 ---- Thread 09 ---- PC 5: Stalled ----- 94581 in-flight CPI 1.3639 -- Total Cycles 129020 ---- Thread 10 ---- PC 5: Stalled ----- 92138 in-flight CPI 1.4000 -- Total Cycles 129020 ---- Thread 11 ---- PC 5: Stalled ----- 93393 in-flight CPI 1.3812 -- Total Cycles 129020 ---- Thread 12 ---- PC 5: Stalled ----- 98027 in-flight CPI 1.3159 -- Total Cycles 129020 ---- Thread 13 ---- PC 5: Stalled ----- 96362 in-flight CPI 1.3387 -- Total Cycles 129020 ---- Thread 14 ---- PC 5: Stalled ----- 90693 in-flight CPI 1.4224 -- Total Cycles 129020 ---- Thread 15 ---- PC 5: Stalled ----- 97739 in-flight CPI 1.3198 -- Total Cycles 129020 ---- Thread 16 ---- PC 5: Stalled ----- 93538 in-flight CPI 1.3791 -- Total Cycles 129020 ---- Thread 17 ---- PC 5: Stalled ----- 97827 in-flight CPI 1.3186 -- Total Cycles 129020 ---- Thread 18 ---- PC 5: Stalled ----- 98114 in-flight CPI 1.3147 -- Total Cycles 129020 ---- Thread 19 ---- PC 5: Stalled ----- 94069 in-flight CPI 1.3713 -- Total Cycles 129020 ---- Thread 20 ---- PC 5: Stalled ----- 95577 in-flight CPI 1.3496 -- Total Cycles 129020 ---- Thread 21 ---- PC 5: Stalled ----- 91423 in-flight CPI 1.4110 -- Total Cycles 129020 ---- Thread 22 ---- PC 5: Stalled ----- 94014 in-flight CPI 1.3721 -- Total Cycles 129020 ---- Thread 23 ---- PC 5: Stalled ----- 97317 in-flight CPI 1.3255 -- Total Cycles 129020 ---- Thread 24 ---- PC 5: Stalled ----- 97072 in-flight CPI 1.3289 -- Total Cycles 129020 ---- Thread 25 ---- PC 5: Stalled ----- 90225 in-flight CPI 1.4297 -- Total Cycles 129020 ---- Thread 26 ---- PC 5: Stalled ----- 84172 in-flight CPI 1.5326 -- Total Cycles 129020 ---- Thread 27 ---- PC 5: Stalled ----- 89545 in-flight CPI 1.4406 -- Total Cycles 129020 ---- Thread 28 ---- PC 5: Stalled ----- 93609 in-flight CPI 1.3780 -- Total Cycles 129020 ---- Thread 29 ---- PC 5: Stalled ----- 91448 in-flight CPI 1.4106 -- Total Cycles 129020 ---- Thread 30 ---- PC 5: Stalled ----- 85302 in-flight CPI 1.5122 -- Total Cycles 129020 ---- Thread 31 ---- PC 5: Stalled ----- 85645 in-flight CPI 1.5062 -- Total Cycles 129020 Total CPI 0.0426 , IPC 23.4819 -- Total Cycles 129020 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8132 (4.120640%) FPSUB: 0 (0.000000%) FPMUL: 32525 (16.481039%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 69173 (35.051280%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5760 (2.918702%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74115 (37.555486%) DIV: 7383 (3.741107%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.131747%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3322053 total) ADD%: 7.456 (247693) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.525 (50664) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.577 (19157) FPSUB%: 0.000 (0) FPMUL%: 4.870 (161786) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.188 (172363) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (590) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.077 (35784) FPLE%: 0.452 (15026) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.807 (93255) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.756 (25099) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.736 (522761) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39087) ORI%: 1.589 (52774) XORI%: 0.000 (0) MULI%: 3.208 (106588) LW%: 1.132 (37622) LWI%: 13.538 (449744) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9490) SWI%: 4.065 (135036) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.405 (46669) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10263) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2059) bned%: 0.000 (0) bneid%: 13.850 (460099) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23888) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4225) DIV%: 0.012 (400) FPUN%: 1.475 (48999) FPRSUB%: 3.708 (123185) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (76) FPGT%: 2.955 (98174) FPGE%: 1.023 (33973) SYNC%: 0.000 (0) NOP%: 8.801 (292364) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 8 FPSUB 0 FPMUL 43 FPCMPLT 0 FPMIN 0 FPMAX 391 LOAD 39629 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1200 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48942 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 6 ORI 11596 XORI 0 MULI 9533 LW 0 LWI 142462 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 87 DIV 28 FPUN 0 FPRSUB 1 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4821 --Total thread-cycles: 4128640 --total thread-cycles issued: 3029689 (73.382252%) --iCache conflicts: 113815 (2.756719%) --thread*cycles of FU dependence: 253987 (6.151832%) --thread*cycles of data dependence: 197348 (4.779976%) --iCache cycles*banks: 4128640 (80.464390% used) Issue breakdown: --thread*cycles of issue worked: 3029689 (73.382252%) --thread*cycles of issue failed: 806587 (19.536385%) --thread*cycles of issue NOP/other: 292364 (7.081363%) Number of thread-cycles not ready: 197348 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3322053 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 8 5: 8 6: 8 7: 7 8: 8 9: 7 10: 7 11: 8 12: 8 13: 7 14: 6 15: 7 16: 7 17: 9 18: 8 19: 8 20: 8 21: 7 22: 8 23: 7 24: 7 25: 7 26: 5 27: 6 28: 8 29: 7 30: 7 31: 6 <=== Core 61 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102056 in-flight CPI 1.3271 -- Total Cycles 135470 ---- Thread 01 ---- PC 5: Stalled ----- 99703 in-flight CPI 1.3585 -- Total Cycles 135470 ---- Thread 02 ---- PC 5: Stalled ----- 100155 in-flight CPI 1.3524 -- Total Cycles 135470 ---- Thread 03 ---- PC 5: Stalled ----- 97885 in-flight CPI 1.3837 -- Total Cycles 135470 ---- Thread 04 ---- PC 5: Stalled ----- 98383 in-flight CPI 1.3768 -- Total Cycles 135470 ---- Thread 05 ---- PC 5: Stalled ----- 94101 in-flight CPI 1.4393 -- Total Cycles 135470 ---- Thread 06 ---- PC 5: Stalled ----- 102520 in-flight CPI 1.3212 -- Total Cycles 135470 ---- Thread 07 ---- PC 5: Stalled ----- 95304 in-flight CPI 1.4212 -- Total Cycles 135470 ---- Thread 08 ---- PC 5: Stalled ----- 100789 in-flight CPI 1.3438 -- Total Cycles 135470 ---- Thread 09 ---- PC 5: Stalled ----- 94300 in-flight CPI 1.4364 -- Total Cycles 135470 ---- Thread 10 ---- PC 5: Stalled ----- 97962 in-flight CPI 1.3826 -- Total Cycles 135470 ---- Thread 11 ---- PC 5: Stalled ----- 96062 in-flight CPI 1.4100 -- Total Cycles 135470 ---- Thread 12 ---- PC 5: Stalled ----- 99243 in-flight CPI 1.3648 -- Total Cycles 135470 ---- Thread 13 ---- PC 5: Stalled ----- 100812 in-flight CPI 1.3435 -- Total Cycles 135470 ---- Thread 14 ---- PC 5: Stalled ----- 98719 in-flight CPI 1.3720 -- Total Cycles 135470 ---- Thread 15 ---- PC 5: Stalled ----- 94105 in-flight CPI 1.4394 -- Total Cycles 135470 ---- Thread 16 ---- PC 5: Stalled ----- 97716 in-flight CPI 1.3861 -- Total Cycles 135470 ---- Thread 17 ---- PC 5: Stalled ----- 102369 in-flight CPI 1.3231 -- Total Cycles 135470 ---- Thread 18 ---- PC 5: Stalled ----- 91335 in-flight CPI 1.4829 -- Total Cycles 135470 ---- Thread 19 ---- PC 5: Stalled ----- 93197 in-flight CPI 1.4533 -- Total Cycles 135470 ---- Thread 20 ---- PC 5: Stalled ----- 90981 in-flight CPI 1.4887 -- Total Cycles 135470 ---- Thread 21 ---- PC 5: Stalled ----- 90267 in-flight CPI 1.5005 -- Total Cycles 135470 ---- Thread 22 ---- PC 5: Stalled ----- 88182 in-flight CPI 1.5360 -- Total Cycles 135470 ---- Thread 23 ---- PC 5: Stalled ----- 88786 in-flight CPI 1.5256 -- Total Cycles 135470 ---- Thread 24 ---- PC 5: Stalled ----- 91051 in-flight CPI 1.4876 -- Total Cycles 135470 ---- Thread 25 ---- PC 5: Stalled ----- 90351 in-flight CPI 1.4991 -- Total Cycles 135470 ---- Thread 26 ---- PC 5: Stalled ----- 85233 in-flight CPI 1.5892 -- Total Cycles 135470 ---- Thread 27 ---- PC 5: Stalled ----- 94991 in-flight CPI 1.4259 -- Total Cycles 135470 ---- Thread 28 ---- PC 5: Stalled ----- 93780 in-flight CPI 1.4443 -- Total Cycles 135470 ---- Thread 29 ---- PC 5: Stalled ----- 87444 in-flight CPI 1.5489 -- Total Cycles 135470 ---- Thread 30 ---- PC 5: Stalled ----- 92830 in-flight CPI 1.4591 -- Total Cycles 135470 ---- Thread 31 ---- PC 5: Stalled ----- 90341 in-flight CPI 1.4993 -- Total Cycles 135470 Total CPI 0.0445 , IPC 22.4514 -- Total Cycles 135470 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8129 (3.699894%) FPSUB: 0 (0.000000%) FPMUL: 32560 (14.819602%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 91669 (41.722915%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5526 (2.515145%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74218 (33.780136%) DIV: 7349 (3.344879%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.117428%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3334709 total) ADD%: 7.434 (247886) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.530 (51016) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.574 (19125) FPSUB%: 0.000 (0) FPMUL%: 4.859 (162039) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.204 (173527) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (576) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.074 (35802) FPLE%: 0.455 (15158) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.816 (93899) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.754 (25135) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.757 (525458) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39278) ORI%: 1.589 (53001) XORI%: 0.000 (0) MULI%: 3.211 (107076) LW%: 1.136 (37878) LWI%: 13.521 (450901) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9590) SWI%: 4.073 (135823) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.408 (46943) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10355) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.064 (2119) bned%: 0.000 (0) bneid%: 13.843 (461639) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.722 (24082) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4231) DIV%: 0.012 (398) FPUN%: 1.482 (49417) FPRSUB%: 3.703 (123477) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.946 (98246) FPGE%: 1.027 (34259) SYNC%: 0.000 (0) NOP%: 8.791 (293159) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 50 FPCMPLT 0 FPMIN 0 FPMAX 391 LOAD 39700 INTCONV 0 ATOMIC_INC 26 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1233 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49107 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 11620 XORI 0 MULI 9280 LW 0 LWI 142675 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 26 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.4517 --Total thread-cycles: 4335040 --total thread-cycles issued: 3041550 (70.161982%) --iCache conflicts: 114713 (2.646181%) --thread*cycles of FU dependence: 254270 (5.865459%) --thread*cycles of data dependence: 219709 (5.068212%) --iCache cycles*banks: 4335040 (76.925265% used) Issue breakdown: --thread*cycles of issue worked: 3041550 (70.161982%) --thread*cycles of issue failed: 1000331 (23.075473%) --thread*cycles of issue NOP/other: 293159 (6.762544%) Number of thread-cycles not ready: 219709 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3334709 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 6 3: 8 4: 5 5: 8 6: 8 7: 8 8: 8 9: 6 10: 9 11: 7 12: 8 13: 9 14: 9 15: 6 16: 7 17: 8 18: 8 19: 7 20: 7 21: 6 22: 6 23: 6 24: 7 25: 7 26: 6 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 62 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97869 in-flight CPI 1.3149 -- Total Cycles 128711 ---- Thread 01 ---- PC 5: Stalled ----- 104254 in-flight CPI 1.2344 -- Total Cycles 128711 ---- Thread 02 ---- PC 5: Stalled ----- 96295 in-flight CPI 1.3364 -- Total Cycles 128711 ---- Thread 03 ---- PC 5: Stalled ----- 103637 in-flight CPI 1.2416 -- Total Cycles 128711 ---- Thread 04 ---- PC 5: Stalled ----- 99553 in-flight CPI 1.2927 -- Total Cycles 128711 ---- Thread 05 ---- PC 5: Stalled ----- 95158 in-flight CPI 1.3523 -- Total Cycles 128711 ---- Thread 06 ---- PC 5: Stalled ----- 105708 in-flight CPI 1.2174 -- Total Cycles 128711 ---- Thread 07 ---- PC 5: Stalled ----- 99228 in-flight CPI 1.2969 -- Total Cycles 128711 ---- Thread 08 ---- PC 5: Stalled ----- 93593 in-flight CPI 1.3750 -- Total Cycles 128711 ---- Thread 09 ---- PC 5: Stalled ----- 95775 in-flight CPI 1.3437 -- Total Cycles 128711 ---- Thread 10 ---- PC 5: Stalled ----- 96533 in-flight CPI 1.3331 -- Total Cycles 128711 ---- Thread 11 ---- PC 5: Stalled ----- 95477 in-flight CPI 1.3478 -- Total Cycles 128711 ---- Thread 12 ---- PC 5: Stalled ----- 94418 in-flight CPI 1.3629 -- Total Cycles 128711 ---- Thread 13 ---- PC 5: Stalled ----- 96678 in-flight CPI 1.3311 -- Total Cycles 128711 ---- Thread 14 ---- PC 5: Stalled ----- 97180 in-flight CPI 1.3242 -- Total Cycles 128711 ---- Thread 15 ---- PC 5: Stalled ----- 91473 in-flight CPI 1.4068 -- Total Cycles 128711 ---- Thread 16 ---- PC 5: Stalled ----- 97373 in-flight CPI 1.3216 -- Total Cycles 128711 ---- Thread 17 ---- PC 5: Stalled ----- 97414 in-flight CPI 1.3210 -- Total Cycles 128711 ---- Thread 18 ---- PC 5: Stalled ----- 91815 in-flight CPI 1.4016 -- Total Cycles 128711 ---- Thread 19 ---- PC 5: Stalled ----- 91391 in-flight CPI 1.4081 -- Total Cycles 128711 ---- Thread 20 ---- PC 5: Stalled ----- 88818 in-flight CPI 1.4489 -- Total Cycles 128711 ---- Thread 21 ---- PC 5: Stalled ----- 90559 in-flight CPI 1.4210 -- Total Cycles 128711 ---- Thread 22 ---- PC 5: Stalled ----- 94637 in-flight CPI 1.3598 -- Total Cycles 128711 ---- Thread 23 ---- PC 5: Stalled ----- 94882 in-flight CPI 1.3563 -- Total Cycles 128711 ---- Thread 24 ---- PC 5: Stalled ----- 91070 in-flight CPI 1.4130 -- Total Cycles 128711 ---- Thread 25 ---- PC 5: Stalled ----- 87237 in-flight CPI 1.4751 -- Total Cycles 128711 ---- Thread 26 ---- PC 5: Stalled ----- 91037 in-flight CPI 1.4136 -- Total Cycles 128711 ---- Thread 27 ---- PC 5: Stalled ----- 89985 in-flight CPI 1.4301 -- Total Cycles 128711 ---- Thread 28 ---- PC 5: Stalled ----- 92501 in-flight CPI 1.3912 -- Total Cycles 128711 ---- Thread 29 ---- PC 5: Stalled ----- 90598 in-flight CPI 1.4204 -- Total Cycles 128711 ---- Thread 30 ---- PC 5: Stalled ----- 88118 in-flight CPI 1.4604 -- Total Cycles 128711 ---- Thread 31 ---- PC 5: Stalled ----- 86253 in-flight CPI 1.4920 -- Total Cycles 128711 Total CPI 0.0425 , IPC 23.5184 -- Total Cycles 128711 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8115 (4.265126%) FPSUB: 0 (0.000000%) FPMUL: 32393 (17.025291%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 62231 (32.707711%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5508 (2.894925%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74224 (39.011058%) DIV: 7530 (3.957659%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.138229%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3319446 total) ADD%: 7.467 (247871) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.540 (51103) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.574 (19069) FPSUB%: 0.000 (0) FPMUL%: 4.859 (161286) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.182 (172028) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (581) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.079 (35810) FPLE%: 0.456 (15142) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.800 (92959) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.754 (25017) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.740 (522495) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (38990) ORI%: 1.591 (52822) XORI%: 0.000 (0) MULI%: 3.204 (106358) LW%: 1.130 (37510) LWI%: 13.503 (448230) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9533) SWI%: 4.060 (134781) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46433) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10313) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (2005) bned%: 0.000 (0) bneid%: 13.869 (460385) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (23898) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4228) DIV%: 0.012 (408) FPUN%: 1.489 (49419) FPRSUB%: 3.702 (122877) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (57) FPGT%: 2.953 (98024) FPGE%: 1.033 (34277) SYNC%: 0.000 (0) NOP%: 8.806 (292317) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 75 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 40393 INTCONV 0 ATOMIC_INC 34 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1599 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48983 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11529 XORI 0 MULI 10075 LW 0 LWI 142098 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 78 DIV 32 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5186 --Total thread-cycles: 4118752 --total thread-cycles issued: 3027129 (73.496268%) --iCache conflicts: 114811 (2.787519%) --thread*cycles of FU dependence: 255385 (6.200543%) --thread*cycles of data dependence: 190264 (4.619458%) --iCache cycles*banks: 4118752 (80.594267% used) Issue breakdown: --thread*cycles of issue worked: 3027129 (73.496268%) --thread*cycles of issue failed: 799306 (19.406510%) --thread*cycles of issue NOP/other: 292317 (7.097223%) Number of thread-cycles not ready: 190264 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3319446 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 10 4: 7 5: 8 6: 8 7: 7 8: 6 9: 7 10: 7 11: 8 12: 8 13: 7 14: 8 15: 7 16: 7 17: 8 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 8 25: 7 26: 7 27: 7 28: 8 29: 8 30: 7 31: 6 <=== Core 63 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98138 in-flight CPI 1.3027 -- Total Cycles 127874 ---- Thread 01 ---- PC 5: Stalled ----- 97382 in-flight CPI 1.3129 -- Total Cycles 127874 ---- Thread 02 ---- PC 5: Stalled ----- 95446 in-flight CPI 1.3395 -- Total Cycles 127874 ---- Thread 03 ---- PC 5: Stalled ----- 103024 in-flight CPI 1.2409 -- Total Cycles 127874 ---- Thread 04 ---- PC 5: Stalled ----- 102297 in-flight CPI 1.2498 -- Total Cycles 127874 ---- Thread 05 ---- PC 5: Stalled ----- 98069 in-flight CPI 1.3037 -- Total Cycles 127874 ---- Thread 06 ---- PC 5: Stalled ----- 104478 in-flight CPI 1.2237 -- Total Cycles 127874 ---- Thread 07 ---- PC 5: Stalled ----- 94640 in-flight CPI 1.3509 -- Total Cycles 127874 ---- Thread 08 ---- PC 5: Stalled ----- 95266 in-flight CPI 1.3421 -- Total Cycles 127874 ---- Thread 09 ---- PC 5: Stalled ----- 100101 in-flight CPI 1.2772 -- Total Cycles 127874 ---- Thread 10 ---- PC 5: Stalled ----- 98685 in-flight CPI 1.2956 -- Total Cycles 127874 ---- Thread 11 ---- PC 5: Stalled ----- 94951 in-flight CPI 1.3465 -- Total Cycles 127874 ---- Thread 12 ---- PC 5: Stalled ----- 95536 in-flight CPI 1.3383 -- Total Cycles 127874 ---- Thread 13 ---- PC 5: Stalled ----- 96715 in-flight CPI 1.3219 -- Total Cycles 127874 ---- Thread 14 ---- PC 5: Stalled ----- 95362 in-flight CPI 1.3407 -- Total Cycles 127874 ---- Thread 15 ---- PC 5: Stalled ----- 98210 in-flight CPI 1.3018 -- Total Cycles 127874 ---- Thread 16 ---- PC 5: Stalled ----- 93316 in-flight CPI 1.3701 -- Total Cycles 127874 ---- Thread 17 ---- PC 5: Stalled ----- 100416 in-flight CPI 1.2732 -- Total Cycles 127874 ---- Thread 18 ---- PC 5: Stalled ----- 96375 in-flight CPI 1.3266 -- Total Cycles 127874 ---- Thread 19 ---- PC 5: Stalled ----- 90579 in-flight CPI 1.4114 -- Total Cycles 127874 ---- Thread 20 ---- PC 5: Stalled ----- 90319 in-flight CPI 1.4155 -- Total Cycles 127874 ---- Thread 21 ---- PC 5: Stalled ----- 96107 in-flight CPI 1.3303 -- Total Cycles 127874 ---- Thread 22 ---- PC 5: Stalled ----- 91621 in-flight CPI 1.3954 -- Total Cycles 127874 ---- Thread 23 ---- PC 5: Stalled ----- 95527 in-flight CPI 1.3383 -- Total Cycles 127874 ---- Thread 24 ---- PC 5: Stalled ----- 89972 in-flight CPI 1.4211 -- Total Cycles 127874 ---- Thread 25 ---- PC 5: Stalled ----- 95773 in-flight CPI 1.3349 -- Total Cycles 127874 ---- Thread 26 ---- PC 5: Stalled ----- 85816 in-flight CPI 1.4898 -- Total Cycles 127874 ---- Thread 27 ---- PC 5: Stalled ----- 90426 in-flight CPI 1.4139 -- Total Cycles 127874 ---- Thread 28 ---- PC 5: Stalled ----- 87129 in-flight CPI 1.4675 -- Total Cycles 127874 ---- Thread 29 ---- PC 5: Stalled ----- 87693 in-flight CPI 1.4579 -- Total Cycles 127874 ---- Thread 30 ---- PC 5: Stalled ----- 92674 in-flight CPI 1.3796 -- Total Cycles 127874 ---- Thread 31 ---- PC 5: Stalled ----- 90752 in-flight CPI 1.4087 -- Total Cycles 127874 Total CPI 0.0420 , IPC 23.7997 -- Total Cycles 127874 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7743 (3.950510%) FPSUB: 0 (0.000000%) FPMUL: 31839 (16.244388%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 71281 (36.367857%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5859 (2.989286%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71307 (36.381122%) DIV: 7706 (3.931633%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.135204%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3337165 total) ADD%: 7.508 (250559) SUB%: 0.000 (0) MUL%: 0.006 (209) BITOR%: 1.529 (51010) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.551 (18375) FPSUB%: 0.000 (0) FPMUL%: 4.785 (159696) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (627) FPMAX%: 0.019 (627) LOAD%: 5.162 (172268) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (605) FPINV%: 0.000 (0) FPCONV%: 0.020 (659) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35618) FPLE%: 0.459 (15302) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (627) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.822 (94159) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24904) CMPU%: 0.000 (0) RSUB%: 0.006 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.757 (525837) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (39332) ORI%: 1.562 (52118) XORI%: 0.000 (0) MULI%: 3.223 (107560) LW%: 1.139 (37998) LWI%: 13.564 (452640) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9637) SWI%: 4.085 (136316) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.410 (47065) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10400) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1933) bned%: 0.000 (0) bneid%: 13.871 (462903) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23992) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4053) DIV%: 0.013 (418) FPUN%: 1.482 (49467) FPRSUB%: 3.678 (122726) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.967 (98998) FPGE%: 1.024 (34165) SYNC%: 0.000 (0) NOP%: 8.802 (293743) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 38 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 16 FPSUB 0 FPMUL 52 FPCMPLT 0 FPMIN 0 FPMAX 408 LOAD 39994 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1527 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49401 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11033 XORI 0 MULI 9853 LW 0 LWI 143180 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 86 DIV 27 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8000 --Total thread-cycles: 4091968 --total thread-cycles issued: 3043422 (74.375508%) --iCache conflicts: 113862 (2.782573%) --thread*cycles of FU dependence: 255700 (6.248827%) --thread*cycles of data dependence: 196000 (4.789871%) --iCache cycles*banks: 4091968 (81.554817% used) Issue breakdown: --thread*cycles of issue worked: 3043422 (74.375508%) --thread*cycles of issue failed: 754803 (18.445965%) --thread*cycles of issue NOP/other: 293743 (7.178526%) Number of thread-cycles not ready: 196000 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3337165 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 8 3: 9 4: 8 5: 8 6: 9 7: 8 8: 7 9: 9 10: 7 11: 6 12: 6 13: 8 14: 7 15: 8 16: 8 17: 7 18: 7 19: 8 20: 7 21: 8 22: 8 23: 8 24: 6 25: 8 26: 7 27: 7 28: 5 29: 8 30: 7 31: 8 <=== Core 64 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98175 in-flight CPI 1.4646 -- Total Cycles 143815 ---- Thread 01 ---- PC 5: Stalled ----- 100216 in-flight CPI 1.4348 -- Total Cycles 143815 ---- Thread 02 ---- PC 5: Stalled ----- 102664 in-flight CPI 1.4006 -- Total Cycles 143815 ---- Thread 03 ---- PC 5: Stalled ----- 98331 in-flight CPI 1.4622 -- Total Cycles 143815 ---- Thread 04 ---- PC 5: Stalled ----- 100647 in-flight CPI 1.4287 -- Total Cycles 143815 ---- Thread 05 ---- PC 5: Stalled ----- 97359 in-flight CPI 1.4769 -- Total Cycles 143815 ---- Thread 06 ---- PC 5: Stalled ----- 95057 in-flight CPI 1.5126 -- Total Cycles 143815 ---- Thread 07 ---- PC 5: Stalled ----- 109008 in-flight CPI 1.3191 -- Total Cycles 143815 ---- Thread 08 ---- PC 5: Stalled ----- 98073 in-flight CPI 1.4661 -- Total Cycles 143815 ---- Thread 09 ---- PC 5: Stalled ----- 104868 in-flight CPI 1.3711 -- Total Cycles 143815 ---- Thread 10 ---- PC 5: Stalled ----- 92665 in-flight CPI 1.5517 -- Total Cycles 143815 ---- Thread 11 ---- PC 5: Stalled ----- 104658 in-flight CPI 1.3739 -- Total Cycles 143815 ---- Thread 12 ---- PC 5: Stalled ----- 96078 in-flight CPI 1.4966 -- Total Cycles 143815 ---- Thread 13 ---- PC 5: Stalled ----- 93607 in-flight CPI 1.5361 -- Total Cycles 143815 ---- Thread 14 ---- PC 5: Stalled ----- 94795 in-flight CPI 1.5169 -- Total Cycles 143815 ---- Thread 15 ---- PC 5: Stalled ----- 91919 in-flight CPI 1.5644 -- Total Cycles 143815 ---- Thread 16 ---- PC 5: Stalled ----- 94950 in-flight CPI 1.5144 -- Total Cycles 143815 ---- Thread 17 ---- PC 5: Stalled ----- 96956 in-flight CPI 1.4830 -- Total Cycles 143815 ---- Thread 18 ---- PC 5: Stalled ----- 97104 in-flight CPI 1.4808 -- Total Cycles 143815 ---- Thread 19 ---- PC 5: Stalled ----- 88724 in-flight CPI 1.6206 -- Total Cycles 143815 ---- Thread 20 ---- PC 5: Stalled ----- 92165 in-flight CPI 1.5601 -- Total Cycles 143815 ---- Thread 21 ---- PC 5: Stalled ----- 95027 in-flight CPI 1.5132 -- Total Cycles 143815 ---- Thread 22 ---- PC 5: Stalled ----- 91939 in-flight CPI 1.5640 -- Total Cycles 143815 ---- Thread 23 ---- PC 5: Stalled ----- 96119 in-flight CPI 1.4960 -- Total Cycles 143815 ---- Thread 24 ---- PC 5: Stalled ----- 87032 in-flight CPI 1.6521 -- Total Cycles 143815 ---- Thread 25 ---- PC 5: Stalled ----- 96527 in-flight CPI 1.4895 -- Total Cycles 143815 ---- Thread 26 ---- PC 5: Stalled ----- 88601 in-flight CPI 1.6229 -- Total Cycles 143815 ---- Thread 27 ---- PC 5: Stalled ----- 88862 in-flight CPI 1.6181 -- Total Cycles 143815 ---- Thread 28 ---- PC 5: Stalled ----- 85209 in-flight CPI 1.6874 -- Total Cycles 143815 ---- Thread 29 ---- PC 5: Stalled ----- 92773 in-flight CPI 1.5500 -- Total Cycles 143815 ---- Thread 30 ---- PC 5: Stalled ----- 88285 in-flight CPI 1.6286 -- Total Cycles 143815 ---- Thread 31 ---- PC 5: Stalled ----- 89152 in-flight CPI 1.6130 -- Total Cycles 143815 Total CPI 0.0472 , IPC 21.1946 -- Total Cycles 143815 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8549 (4.090783%) FPSUB: 0 (0.000000%) FPMUL: 33438 (16.000421%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 75338 (36.049995%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5589 (2.674393%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 78354 (37.493181%) DIV: 7451 (3.565379%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.125848%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3342040 total) ADD%: 7.443 (248752) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.532 (51186) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.600 (20045) FPSUB%: 0.000 (0) FPMUL%: 4.935 (164942) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.233 (174881) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (583) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.086 (36303) FPLE%: 0.456 (15246) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.790 (93242) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.765 (25555) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.735 (525877) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39292) ORI%: 1.598 (53410) XORI%: 0.000 (0) MULI%: 3.190 (106622) LW%: 1.126 (37620) LWI%: 13.467 (450064) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9545) SWI%: 4.048 (135298) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.394 (46596) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10336) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.063 (2091) bned%: 0.000 (0) bneid%: 13.836 (462390) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23885) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.134 (4473) DIV%: 0.012 (404) FPUN%: 1.476 (49312) FPRSUB%: 3.727 (124570) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (78) FPGT%: 2.941 (98297) FPGE%: 1.019 (34066) SYNC%: 0.000 (0) NOP%: 8.794 (293889) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 3 FPSUB 0 FPMUL 51 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 40823 INTCONV 0 ATOMIC_INC 26 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1705 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49060 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 12277 XORI 0 MULI 9620 LW 0 LWI 142906 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 23 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.1948 --Total thread-cycles: 4602080 --total thread-cycles issued: 3048151 (66.234203%) --iCache conflicts: 114609 (2.490374%) --thread*cycles of FU dependence: 257046 (5.585431%) --thread*cycles of data dependence: 208982 (4.541034%) --iCache cycles*banks: 4602080 (72.620902% used) Issue breakdown: --thread*cycles of issue worked: 3048151 (66.234203%) --thread*cycles of issue failed: 1260040 (27.379793%) --thread*cycles of issue NOP/other: 293889 (6.386004%) Number of thread-cycles not ready: 208982 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3342040 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 9 4: 7 5: 8 6: 8 7: 6 8: 8 9: 9 10: 7 11: 8 12: 7 13: 7 14: 7 15: 6 16: 7 17: 9 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 7 25: 9 26: 7 27: 7 28: 8 29: 5 30: 8 31: 5 <=== Core 65 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103906 in-flight CPI 1.3631 -- Total Cycles 141665 ---- Thread 01 ---- PC 5: Stalled ----- 98463 in-flight CPI 1.4384 -- Total Cycles 141665 ---- Thread 02 ---- PC 5: Stalled ----- 101057 in-flight CPI 1.4016 -- Total Cycles 141665 ---- Thread 03 ---- PC 5: Stalled ----- 101735 in-flight CPI 1.3923 -- Total Cycles 141665 ---- Thread 04 ---- PC 5: Stalled ----- 98969 in-flight CPI 1.4311 -- Total Cycles 141665 ---- Thread 05 ---- PC 5: Stalled ----- 98032 in-flight CPI 1.4448 -- Total Cycles 141665 ---- Thread 06 ---- PC 5: Stalled ----- 91923 in-flight CPI 1.5409 -- Total Cycles 141665 ---- Thread 07 ---- PC 5: Stalled ----- 95074 in-flight CPI 1.4897 -- Total Cycles 141665 ---- Thread 08 ---- PC 5: Stalled ----- 99614 in-flight CPI 1.4219 -- Total Cycles 141665 ---- Thread 09 ---- PC 5: Stalled ----- 94513 in-flight CPI 1.4986 -- Total Cycles 141665 ---- Thread 10 ---- PC 5: Stalled ----- 92502 in-flight CPI 1.5312 -- Total Cycles 141665 ---- Thread 11 ---- PC 5: Stalled ----- 103126 in-flight CPI 1.3734 -- Total Cycles 141665 ---- Thread 12 ---- PC 5: Stalled ----- 93105 in-flight CPI 1.5212 -- Total Cycles 141665 ---- Thread 13 ---- PC 5: Stalled ----- 99626 in-flight CPI 1.4216 -- Total Cycles 141665 ---- Thread 14 ---- PC 5: Stalled ----- 94605 in-flight CPI 1.4971 -- Total Cycles 141665 ---- Thread 15 ---- PC 5: Stalled ----- 98336 in-flight CPI 1.4403 -- Total Cycles 141665 ---- Thread 16 ---- PC 5: Stalled ----- 90174 in-flight CPI 1.5707 -- Total Cycles 141665 ---- Thread 17 ---- PC 5: Stalled ----- 97828 in-flight CPI 1.4479 -- Total Cycles 141665 ---- Thread 18 ---- PC 5: Stalled ----- 91049 in-flight CPI 1.5556 -- Total Cycles 141665 ---- Thread 19 ---- PC 5: Stalled ----- 93394 in-flight CPI 1.5166 -- Total Cycles 141665 ---- Thread 20 ---- PC 5: Stalled ----- 94692 in-flight CPI 1.4958 -- Total Cycles 141665 ---- Thread 21 ---- PC 5: Stalled ----- 93187 in-flight CPI 1.5200 -- Total Cycles 141665 ---- Thread 22 ---- PC 5: Stalled ----- 91997 in-flight CPI 1.5397 -- Total Cycles 141665 ---- Thread 23 ---- PC 5: Stalled ----- 90538 in-flight CPI 1.5644 -- Total Cycles 141665 ---- Thread 24 ---- PC 5: Stalled ----- 100232 in-flight CPI 1.4132 -- Total Cycles 141665 ---- Thread 25 ---- PC 5: Stalled ----- 94037 in-flight CPI 1.5062 -- Total Cycles 141665 ---- Thread 26 ---- PC 5: Stalled ----- 90789 in-flight CPI 1.5601 -- Total Cycles 141665 ---- Thread 27 ---- PC 5: Stalled ----- 93722 in-flight CPI 1.5113 -- Total Cycles 141665 ---- Thread 28 ---- PC 5: Stalled ----- 87603 in-flight CPI 1.6169 -- Total Cycles 141665 ---- Thread 29 ---- PC 5: Stalled ----- 91494 in-flight CPI 1.5480 -- Total Cycles 141665 ---- Thread 30 ---- PC 5: Stalled ----- 86688 in-flight CPI 1.6339 -- Total Cycles 141665 ---- Thread 31 ---- PC 5: Stalled ----- 94220 in-flight CPI 1.5032 -- Total Cycles 141665 Total CPI 0.0465 , IPC 21.5070 -- Total Cycles 141665 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8224 (4.112576%) FPSUB: 0 (0.000000%) FPMUL: 32792 (16.398296%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 69258 (34.633849%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5606 (2.803392%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 76263 (38.136839%) DIV: 7565 (3.783030%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.132018%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3340843 total) ADD%: 7.483 (249990) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.534 (51254) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.580 (19379) FPSUB%: 0.000 (0) FPMUL%: 4.874 (162837) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.204 (173852) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (587) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.082 (36135) FPLE%: 0.456 (15220) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.797 (93440) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.757 (25295) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.731 (525537) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39237) ORI%: 1.591 (53142) XORI%: 0.000 (0) MULI%: 3.200 (106922) LW%: 1.129 (37704) LWI%: 13.500 (451018) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9579) SWI%: 4.060 (135631) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.397 (46680) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10362) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2071) bned%: 0.000 (0) bneid%: 13.851 (462748) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23974) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4348) DIV%: 0.012 (410) FPUN%: 1.482 (49516) FPRSUB%: 3.708 (123878) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.948 (98495) FPGE%: 1.027 (34296) SYNC%: 0.000 (0) NOP%: 8.800 (293998) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 6 FPSUB 0 FPMUL 44 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 39646 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1496 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49079 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 11835 XORI 0 MULI 9676 LW 0 LWI 142863 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 84 DIV 28 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5072 --Total thread-cycles: 4533280 --total thread-cycles issued: 3046845 (67.210607%) --iCache conflicts: 114144 (2.517912%) --thread*cycles of FU dependence: 255250 (5.630581%) --thread*cycles of data dependence: 199972 (4.411199%) --iCache cycles*banks: 4533280 (73.696639% used) Issue breakdown: --thread*cycles of issue worked: 3046845 (67.210607%) --thread*cycles of issue failed: 1192437 (26.304067%) --thread*cycles of issue NOP/other: 293998 (6.485326%) Number of thread-cycles not ready: 199972 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3340843 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 7 3: 7 4: 8 5: 8 6: 6 7: 8 8: 8 9: 7 10: 7 11: 9 12: 8 13: 9 14: 8 15: 8 16: 7 17: 7 18: 8 19: 7 20: 7 21: 7 22: 6 23: 7 24: 6 25: 7 26: 7 27: 7 28: 6 29: 8 30: 7 31: 8 <=== Core 66 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98963 in-flight CPI 1.2932 -- Total Cycles 128003 ---- Thread 01 ---- PC 5: Stalled ----- 97756 in-flight CPI 1.3092 -- Total Cycles 128003 ---- Thread 02 ---- PC 5: Stalled ----- 96882 in-flight CPI 1.3210 -- Total Cycles 128003 ---- Thread 03 ---- PC 5: Stalled ----- 98643 in-flight CPI 1.2974 -- Total Cycles 128003 ---- Thread 04 ---- PC 5: Stalled ----- 92897 in-flight CPI 1.3777 -- Total Cycles 128003 ---- Thread 05 ---- PC 5: Stalled ----- 89610 in-flight CPI 1.4282 -- Total Cycles 128003 ---- Thread 06 ---- PC 5: Stalled ----- 100440 in-flight CPI 1.2742 -- Total Cycles 128003 ---- Thread 07 ---- PC 5: Stalled ----- 97669 in-flight CPI 1.3103 -- Total Cycles 128003 ---- Thread 08 ---- PC 5: Stalled ----- 97270 in-flight CPI 1.3157 -- Total Cycles 128003 ---- Thread 09 ---- PC 5: Stalled ----- 96813 in-flight CPI 1.3219 -- Total Cycles 128003 ---- Thread 10 ---- PC 5: Stalled ----- 98809 in-flight CPI 1.2952 -- Total Cycles 128003 ---- Thread 11 ---- PC 5: Stalled ----- 95323 in-flight CPI 1.3426 -- Total Cycles 128003 ---- Thread 12 ---- PC 5: Stalled ----- 97268 in-flight CPI 1.3158 -- Total Cycles 128003 ---- Thread 13 ---- PC 5: Stalled ----- 98371 in-flight CPI 1.3010 -- Total Cycles 128003 ---- Thread 14 ---- PC 5: Stalled ----- 97473 in-flight CPI 1.3130 -- Total Cycles 128003 ---- Thread 15 ---- PC 5: Stalled ----- 96192 in-flight CPI 1.3304 -- Total Cycles 128003 ---- Thread 16 ---- PC 5: Stalled ----- 93091 in-flight CPI 1.3748 -- Total Cycles 128003 ---- Thread 17 ---- PC 5: Stalled ----- 95990 in-flight CPI 1.3332 -- Total Cycles 128003 ---- Thread 18 ---- PC 5: Stalled ----- 99608 in-flight CPI 1.2849 -- Total Cycles 128003 ---- Thread 19 ---- PC 5: Stalled ----- 91354 in-flight CPI 1.4009 -- Total Cycles 128003 ---- Thread 20 ---- PC 5: Stalled ----- 93981 in-flight CPI 1.3618 -- Total Cycles 128003 ---- Thread 21 ---- PC 5: Stalled ----- 96192 in-flight CPI 1.3304 -- Total Cycles 128003 ---- Thread 22 ---- PC 5: Stalled ----- 91986 in-flight CPI 1.3913 -- Total Cycles 128003 ---- Thread 23 ---- PC 5: Stalled ----- 97529 in-flight CPI 1.3122 -- Total Cycles 128003 ---- Thread 24 ---- PC 5: Stalled ----- 88942 in-flight CPI 1.4389 -- Total Cycles 128003 ---- Thread 25 ---- PC 5: Stalled ----- 92804 in-flight CPI 1.3790 -- Total Cycles 128003 ---- Thread 26 ---- PC 5: Stalled ----- 89530 in-flight CPI 1.4294 -- Total Cycles 128003 ---- Thread 27 ---- PC 5: Stalled ----- 91778 in-flight CPI 1.3945 -- Total Cycles 128003 ---- Thread 28 ---- PC 5: Stalled ----- 86780 in-flight CPI 1.4747 -- Total Cycles 128003 ---- Thread 29 ---- PC 5: Stalled ----- 86761 in-flight CPI 1.4751 -- Total Cycles 128003 ---- Thread 30 ---- PC 5: Stalled ----- 93997 in-flight CPI 1.3615 -- Total Cycles 128003 ---- Thread 31 ---- PC 5: Stalled ----- 90228 in-flight CPI 1.4184 -- Total Cycles 128003 Total CPI 0.0422 , IPC 23.6829 -- Total Cycles 128003 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7393 (3.779614%) FPSUB: 0 (0.000000%) FPMUL: 30999 (15.847997%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 75372 (38.533348%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5564 (2.844552%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68483 (35.011401%) DIV: 7529 (3.849143%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.133945%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3323794 total) ADD%: 7.520 (249946) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.537 (51072) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.525 (17463) FPSUB%: 0.000 (0) FPMUL%: 4.718 (156818) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.147 (171082) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (584) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.058 (35162) FPLE%: 0.460 (15300) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.837 (94289) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.743 (24693) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.785 (524651) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.184 (39356) ORI%: 1.548 (51441) XORI%: 0.000 (0) MULI%: 3.237 (107600) LW%: 1.145 (38042) LWI%: 13.586 (451578) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.291 (9666) SWI%: 4.098 (136220) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.417 (47101) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.313 (10400) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1790) bned%: 0.000 (0) bneid%: 13.887 (461575) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (24032) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.117 (3890) DIV%: 0.012 (408) FPUN%: 1.491 (49554) FPRSUB%: 3.660 (121650) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.967 (98633) FPGE%: 1.031 (34254) SYNC%: 0.000 (0) NOP%: 8.793 (292252) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 24 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 8 FPSUB 0 FPMUL 48 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 39363 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1453 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49338 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10457 XORI 0 MULI 10064 LW 0 LWI 142765 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 64 DIV 25 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6832 --Total thread-cycles: 4096096 --total thread-cycles issued: 3031542 (74.010521%) --iCache conflicts: 113919 (2.781160%) --thread*cycles of FU dependence: 254072 (6.202784%) --thread*cycles of data dependence: 195602 (4.775328%) --iCache cycles*banks: 4096096 (81.146194% used) Issue breakdown: --thread*cycles of issue worked: 3031542 (74.010521%) --thread*cycles of issue failed: 772302 (18.854587%) --thread*cycles of issue NOP/other: 292252 (7.134891%) Number of thread-cycles not ready: 195602 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3323794 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 8 4: 7 5: 6 6: 8 7: 8 8: 8 9: 7 10: 8 11: 7 12: 7 13: 8 14: 8 15: 8 16: 7 17: 8 18: 7 19: 8 20: 6 21: 8 22: 7 23: 8 24: 8 25: 7 26: 8 27: 5 28: 8 29: 6 30: 8 31: 7 <=== Core 67 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98689 in-flight CPI 1.3435 -- Total Cycles 132606 ---- Thread 01 ---- PC 5: Stalled ----- 97343 in-flight CPI 1.3620 -- Total Cycles 132606 ---- Thread 02 ---- PC 5: Stalled ----- 104476 in-flight CPI 1.2689 -- Total Cycles 132606 ---- Thread 03 ---- PC 5: Stalled ----- 99574 in-flight CPI 1.3316 -- Total Cycles 132606 ---- Thread 04 ---- PC 5: Stalled ----- 99330 in-flight CPI 1.3347 -- Total Cycles 132606 ---- Thread 05 ---- PC 5: Stalled ----- 101832 in-flight CPI 1.3020 -- Total Cycles 132606 ---- Thread 06 ---- PC 5: Stalled ----- 98529 in-flight CPI 1.3456 -- Total Cycles 132606 ---- Thread 07 ---- PC 5: Stalled ----- 100451 in-flight CPI 1.3199 -- Total Cycles 132606 ---- Thread 08 ---- PC 5: Stalled ----- 97186 in-flight CPI 1.3642 -- Total Cycles 132606 ---- Thread 09 ---- PC 5: Stalled ----- 101639 in-flight CPI 1.3044 -- Total Cycles 132606 ---- Thread 10 ---- PC 5: Stalled ----- 93038 in-flight CPI 1.4250 -- Total Cycles 132606 ---- Thread 11 ---- PC 5: Stalled ----- 95458 in-flight CPI 1.3889 -- Total Cycles 132606 ---- Thread 12 ---- PC 5: Stalled ----- 97121 in-flight CPI 1.3651 -- Total Cycles 132606 ---- Thread 13 ---- PC 5: Stalled ----- 99639 in-flight CPI 1.3307 -- Total Cycles 132606 ---- Thread 14 ---- PC 5: Stalled ----- 98457 in-flight CPI 1.3466 -- Total Cycles 132606 ---- Thread 15 ---- PC 5: Stalled ----- 97494 in-flight CPI 1.3599 -- Total Cycles 132606 ---- Thread 16 ---- PC 5: Stalled ----- 96060 in-flight CPI 1.3802 -- Total Cycles 132606 ---- Thread 17 ---- PC 5: Stalled ----- 97403 in-flight CPI 1.3611 -- Total Cycles 132606 ---- Thread 18 ---- PC 5: Stalled ----- 90754 in-flight CPI 1.4608 -- Total Cycles 132606 ---- Thread 19 ---- PC 5: Stalled ----- 95282 in-flight CPI 1.3914 -- Total Cycles 132606 ---- Thread 20 ---- PC 5: Stalled ----- 97599 in-flight CPI 1.3584 -- Total Cycles 132606 ---- Thread 21 ---- PC 5: Stalled ----- 91963 in-flight CPI 1.4417 -- Total Cycles 132606 ---- Thread 22 ---- PC 5: Stalled ----- 87784 in-flight CPI 1.5104 -- Total Cycles 132606 ---- Thread 23 ---- PC 5: Stalled ----- 88550 in-flight CPI 1.4972 -- Total Cycles 132606 ---- Thread 24 ---- PC 5: Stalled ----- 87610 in-flight CPI 1.5134 -- Total Cycles 132606 ---- Thread 25 ---- PC 5: Stalled ----- 96154 in-flight CPI 1.3788 -- Total Cycles 132606 ---- Thread 26 ---- PC 5: Stalled ----- 89489 in-flight CPI 1.4815 -- Total Cycles 132606 ---- Thread 27 ---- PC 5: Stalled ----- 91134 in-flight CPI 1.4548 -- Total Cycles 132606 ---- Thread 28 ---- PC 5: Stalled ----- 91722 in-flight CPI 1.4454 -- Total Cycles 132606 ---- Thread 29 ---- PC 5: Stalled ----- 91078 in-flight CPI 1.4557 -- Total Cycles 132606 ---- Thread 30 ---- PC 5: Stalled ----- 88973 in-flight CPI 1.4901 -- Total Cycles 132606 ---- Thread 31 ---- PC 5: Stalled ----- 84421 in-flight CPI 1.5704 -- Total Cycles 132606 Total CPI 0.0435 , IPC 22.9764 -- Total Cycles 132606 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8094 (4.077192%) FPSUB: 0 (0.000000%) FPMUL: 32429 (16.335464%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 69688 (35.103945%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5985 (3.014825%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74256 (37.404984%) DIV: 7799 (3.928591%) FPUN: 0 (0.000000%) FPRSUB: 268 (0.135000%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3340837 total) ADD%: 7.464 (249367) SUB%: 0.000 (0) MUL%: 0.006 (211) BITOR%: 1.540 (51461) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.569 (18999) FPSUB%: 0.000 (0) FPMUL%: 4.836 (161568) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (633) FPMAX%: 0.019 (633) LOAD%: 5.179 (173010) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (614) FPINV%: 0.000 (0) FPCONV%: 0.020 (665) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.074 (35871) FPLE%: 0.457 (15269) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (633) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.806 (93751) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.757 (25298) CMPU%: 0.000 (0) RSUB%: 0.006 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.750 (526190) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (39382) ORI%: 1.586 (52973) XORI%: 0.000 (0) MULI%: 3.205 (107064) LW%: 1.133 (37838) LWI%: 13.512 (451414) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9587) SWI%: 4.085 (136464) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46876) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10397) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1931) bned%: 0.000 (0) bneid%: 13.868 (463317) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23996) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4225) DIV%: 0.013 (422) FPUN%: 1.488 (49706) FPRSUB%: 3.691 (123312) FPSQRT%: 0.000 (0) FPNEG%: 0.003 (84) FPGT%: 2.955 (98717) FPGE%: 1.031 (34437) SYNC%: 0.000 (0) NOP%: 8.799 (293972) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 60 FPCMPLT 0 FPMIN 0 FPMAX 414 LOAD 39531 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1382 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49344 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 18 ORI 11474 XORI 0 MULI 9726 LW 0 LWI 143065 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 17 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.9767 --Total thread-cycles: 4243392 --total thread-cycles issued: 3046865 (71.802582%) --iCache conflicts: 113359 (2.671424%) --thread*cycles of FU dependence: 255198 (6.014010%) --thread*cycles of data dependence: 198519 (4.678309%) --iCache cycles*banks: 4243392 (78.731095% used) Issue breakdown: --thread*cycles of issue worked: 3046865 (71.802582%) --thread*cycles of issue failed: 902555 (21.269659%) --thread*cycles of issue NOP/other: 293972 (6.927760%) Number of thread-cycles not ready: 198519 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3340837 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 10 3: 6 4: 8 5: 8 6: 8 7: 7 8: 8 9: 8 10: 8 11: 7 12: 7 13: 6 14: 8 15: 8 16: 8 17: 8 18: 8 19: 8 20: 8 21: 7 22: 6 23: 8 24: 6 25: 8 26: 7 27: 8 28: 8 29: 8 30: 8 31: 8 <=== Core 68 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100737 in-flight CPI 1.3112 -- Total Cycles 132109 ---- Thread 01 ---- PC 5: Stalled ----- 98620 in-flight CPI 1.3393 -- Total Cycles 132109 ---- Thread 02 ---- PC 5: Stalled ----- 99569 in-flight CPI 1.3266 -- Total Cycles 132109 ---- Thread 03 ---- PC 5: Stalled ----- 99697 in-flight CPI 1.3249 -- Total Cycles 132109 ---- Thread 04 ---- PC 5: Stalled ----- 96145 in-flight CPI 1.3738 -- Total Cycles 132109 ---- Thread 05 ---- PC 5: Stalled ----- 100647 in-flight CPI 1.3124 -- Total Cycles 132109 ---- Thread 06 ---- PC 5: Stalled ----- 96304 in-flight CPI 1.3715 -- Total Cycles 132109 ---- Thread 07 ---- PC 5: Stalled ----- 99839 in-flight CPI 1.3230 -- Total Cycles 132109 ---- Thread 08 ---- PC 5: Stalled ----- 98376 in-flight CPI 1.3426 -- Total Cycles 132109 ---- Thread 09 ---- PC 5: Stalled ----- 97758 in-flight CPI 1.3512 -- Total Cycles 132109 ---- Thread 10 ---- PC 5: Stalled ----- 92063 in-flight CPI 1.4347 -- Total Cycles 132109 ---- Thread 11 ---- PC 5: Stalled ----- 95254 in-flight CPI 1.3867 -- Total Cycles 132109 ---- Thread 12 ---- PC 5: Stalled ----- 101233 in-flight CPI 1.3047 -- Total Cycles 132109 ---- Thread 13 ---- PC 5: Stalled ----- 91558 in-flight CPI 1.4427 -- Total Cycles 132109 ---- Thread 14 ---- PC 5: Stalled ----- 98042 in-flight CPI 1.3472 -- Total Cycles 132109 ---- Thread 15 ---- PC 5: Stalled ----- 96055 in-flight CPI 1.3751 -- Total Cycles 132109 ---- Thread 16 ---- PC 5: Stalled ----- 92143 in-flight CPI 1.4335 -- Total Cycles 132109 ---- Thread 17 ---- PC 5: Stalled ----- 97314 in-flight CPI 1.3574 -- Total Cycles 132109 ---- Thread 18 ---- PC 5: Stalled ----- 91850 in-flight CPI 1.4380 -- Total Cycles 132109 ---- Thread 19 ---- PC 5: Stalled ----- 92018 in-flight CPI 1.4355 -- Total Cycles 132109 ---- Thread 20 ---- PC 5: Stalled ----- 95914 in-flight CPI 1.3772 -- Total Cycles 132109 ---- Thread 21 ---- PC 5: Stalled ----- 95798 in-flight CPI 1.3788 -- Total Cycles 132109 ---- Thread 22 ---- PC 5: Stalled ----- 97132 in-flight CPI 1.3598 -- Total Cycles 132109 ---- Thread 23 ---- PC 5: Stalled ----- 94088 in-flight CPI 1.4039 -- Total Cycles 132109 ---- Thread 24 ---- PC 5: Stalled ----- 92823 in-flight CPI 1.4229 -- Total Cycles 132109 ---- Thread 25 ---- PC 5: Stalled ----- 87916 in-flight CPI 1.5024 -- Total Cycles 132109 ---- Thread 26 ---- PC 5: Stalled ----- 86425 in-flight CPI 1.5283 -- Total Cycles 132109 ---- Thread 27 ---- PC 5: Stalled ----- 93874 in-flight CPI 1.4071 -- Total Cycles 132109 ---- Thread 28 ---- PC 5: Stalled ----- 85373 in-flight CPI 1.5472 -- Total Cycles 132109 ---- Thread 29 ---- PC 5: Stalled ----- 86457 in-flight CPI 1.5277 -- Total Cycles 132109 ---- Thread 30 ---- PC 5: Stalled ----- 91072 in-flight CPI 1.4503 -- Total Cycles 132109 ---- Thread 31 ---- PC 5: Stalled ----- 83667 in-flight CPI 1.5787 -- Total Cycles 132109 Total CPI 0.0437 , IPC 22.9077 -- Total Cycles 132109 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8347 (4.019338%) FPSUB: 0 (0.000000%) FPMUL: 32879 (15.832254%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 77318 (37.231005%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5474 (2.635900%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75932 (36.563603%) DIV: 7460 (3.592220%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.125680%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3318765 total) ADD%: 7.439 (246890) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.538 (51044) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.587 (19487) FPSUB%: 0.000 (0) FPMUL%: 4.895 (162447) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.204 (172699) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (577) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.082 (35913) FPLE%: 0.457 (15151) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.793 (92687) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.759 (25198) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.740 (522362) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (38981) ORI%: 1.605 (53268) XORI%: 0.000 (0) MULI%: 3.194 (106008) LW%: 1.127 (37398) LWI%: 13.477 (447280) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9501) SWI%: 4.051 (134439) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.395 (46301) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10291) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2030) bned%: 0.000 (0) bneid%: 13.862 (460063) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23878) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4330) DIV%: 0.012 (404) FPUN%: 1.484 (49259) FPRSUB%: 3.711 (123169) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.951 (97943) FPGE%: 1.028 (34108) SYNC%: 0.000 (0) NOP%: 8.810 (292398) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 60 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 39912 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1536 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48751 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 18 ORI 11914 XORI 0 MULI 9441 LW 0 LWI 141784 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 80 DIV 35 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.9079 --Total thread-cycles: 4227488 --total thread-cycles issued: 3026367 (71.587832%) --iCache conflicts: 113274 (2.679464%) --thread*cycles of FU dependence: 254014 (6.008627%) --thread*cycles of data dependence: 207671 (4.912397%) --iCache cycles*banks: 4227488 (78.505178% used) Issue breakdown: --thread*cycles of issue worked: 3026367 (71.587832%) --thread*cycles of issue failed: 908723 (21.495578%) --thread*cycles of issue NOP/other: 292398 (6.916590%) Number of thread-cycles not ready: 207671 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3318765 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 8 3: 7 4: 8 5: 7 6: 8 7: 7 8: 8 9: 7 10: 7 11: 7 12: 9 13: 6 14: 9 15: 8 16: 7 17: 6 18: 8 19: 6 20: 6 21: 8 22: 8 23: 6 24: 8 25: 7 26: 7 27: 7 28: 6 29: 8 30: 7 31: 6 <=== Core 69 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95013 in-flight CPI 1.3411 -- Total Cycles 127456 ---- Thread 01 ---- PC 5: Stalled ----- 104980 in-flight CPI 1.2138 -- Total Cycles 127456 ---- Thread 02 ---- PC 5: Stalled ----- 91301 in-flight CPI 1.3958 -- Total Cycles 127456 ---- Thread 03 ---- PC 5: Stalled ----- 97332 in-flight CPI 1.3093 -- Total Cycles 127456 ---- Thread 04 ---- PC 5: Stalled ----- 99336 in-flight CPI 1.2829 -- Total Cycles 127456 ---- Thread 05 ---- PC 5: Stalled ----- 96146 in-flight CPI 1.3254 -- Total Cycles 127456 ---- Thread 06 ---- PC 5: Stalled ----- 95007 in-flight CPI 1.3413 -- Total Cycles 127456 ---- Thread 07 ---- PC 5: Stalled ----- 97764 in-flight CPI 1.3035 -- Total Cycles 127456 ---- Thread 08 ---- PC 5: Stalled ----- 95919 in-flight CPI 1.3286 -- Total Cycles 127456 ---- Thread 09 ---- PC 5: Stalled ----- 97085 in-flight CPI 1.3126 -- Total Cycles 127456 ---- Thread 10 ---- PC 5: Stalled ----- 97907 in-flight CPI 1.3015 -- Total Cycles 127456 ---- Thread 11 ---- PC 5: Stalled ----- 97723 in-flight CPI 1.3040 -- Total Cycles 127456 ---- Thread 12 ---- PC 5: Stalled ----- 96925 in-flight CPI 1.3147 -- Total Cycles 127456 ---- Thread 13 ---- PC 5: Stalled ----- 96211 in-flight CPI 1.3245 -- Total Cycles 127456 ---- Thread 14 ---- PC 5: Stalled ----- 93627 in-flight CPI 1.3610 -- Total Cycles 127456 ---- Thread 15 ---- PC 5: Stalled ----- 97323 in-flight CPI 1.3093 -- Total Cycles 127456 ---- Thread 16 ---- PC 5: Stalled ----- 96344 in-flight CPI 1.3227 -- Total Cycles 127456 ---- Thread 17 ---- PC 5: Stalled ----- 93799 in-flight CPI 1.3585 -- Total Cycles 127456 ---- Thread 18 ---- PC 5: Stalled ----- 92559 in-flight CPI 1.3769 -- Total Cycles 127456 ---- Thread 19 ---- PC 5: Stalled ----- 91187 in-flight CPI 1.3975 -- Total Cycles 127456 ---- Thread 20 ---- PC 5: Stalled ----- 96800 in-flight CPI 1.3165 -- Total Cycles 127456 ---- Thread 21 ---- PC 5: Stalled ----- 90519 in-flight CPI 1.4078 -- Total Cycles 127456 ---- Thread 22 ---- PC 5: Stalled ----- 88089 in-flight CPI 1.4467 -- Total Cycles 127456 ---- Thread 23 ---- PC 5: Stalled ----- 89873 in-flight CPI 1.4179 -- Total Cycles 127456 ---- Thread 24 ---- PC 5: Stalled ----- 93573 in-flight CPI 1.3618 -- Total Cycles 127456 ---- Thread 25 ---- PC 5: Stalled ----- 95630 in-flight CPI 1.3326 -- Total Cycles 127456 ---- Thread 26 ---- PC 5: Stalled ----- 91456 in-flight CPI 1.3934 -- Total Cycles 127456 ---- Thread 27 ---- PC 5: Stalled ----- 90173 in-flight CPI 1.4132 -- Total Cycles 127456 ---- Thread 28 ---- PC 5: Stalled ----- 92630 in-flight CPI 1.3757 -- Total Cycles 127456 ---- Thread 29 ---- PC 5: Stalled ----- 90704 in-flight CPI 1.4049 -- Total Cycles 127456 ---- Thread 30 ---- PC 5: Stalled ----- 83192 in-flight CPI 1.5318 -- Total Cycles 127456 ---- Thread 31 ---- PC 5: Stalled ----- 91787 in-flight CPI 1.3884 -- Total Cycles 127456 Total CPI 0.0422 , IPC 23.6824 -- Total Cycles 127456 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8270 (3.897065%) FPSUB: 0 (0.000000%) FPMUL: 32739 (15.427570%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 82425 (38.841059%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5603 (2.640297%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75537 (35.595233%) DIV: 7376 (3.475786%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.122991%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3309705 total) ADD%: 7.435 (246075) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.530 (50646) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.586 (19400) FPSUB%: 0.000 (0) FPMUL%: 4.901 (162198) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.207 (172338) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (581) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.080 (35751) FPLE%: 0.454 (15025) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.799 (92640) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.760 (25164) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.737 (520853) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (38907) ORI%: 1.597 (52867) XORI%: 0.000 (0) MULI%: 3.200 (105910) LW%: 1.129 (37376) LWI%: 13.505 (446971) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9446) SWI%: 4.062 (134452) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (46335) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10247) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.063 (2096) bned%: 0.000 (0) bneid%: 13.841 (458088) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (23827) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4309) DIV%: 0.012 (400) FPUN%: 1.478 (48915) FPRSUB%: 3.718 (123045) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.947 (97527) FPGE%: 1.024 (33890) SYNC%: 0.000 (0) NOP%: 8.798 (291191) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 16 FPSUB 0 FPMUL 49 FPCMPLT 0 FPMIN 0 FPMAX 384 LOAD 41116 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1172 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48630 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 11825 XORI 0 MULI 9654 LW 0 LWI 141759 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 72 DIV 23 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6826 --Total thread-cycles: 4078592 --total thread-cycles issued: 3018514 (74.008727%) --iCache conflicts: 112686 (2.762865%) --thread*cycles of FU dependence: 254774 (6.246616%) --thread*cycles of data dependence: 212211 (5.203046%) --iCache cycles*banks: 4078592 (81.149009% used) Issue breakdown: --thread*cycles of issue worked: 3018514 (74.008727%) --thread*cycles of issue failed: 768887 (18.851775%) --thread*cycles of issue NOP/other: 291191 (7.139498%) Number of thread-cycles not ready: 212211 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3309705 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 9 2: 6 3: 7 4: 7 5: 7 6: 7 7: 7 8: 7 9: 7 10: 9 11: 8 12: 8 13: 7 14: 8 15: 9 16: 7 17: 8 18: 5 19: 7 20: 7 21: 7 22: 6 23: 7 24: 8 25: 7 26: 6 27: 7 28: 8 29: 7 30: 6 31: 7 <=== Core 70 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99030 in-flight CPI 1.3004 -- Total Cycles 128806 ---- Thread 01 ---- PC 5: Stalled ----- 91918 in-flight CPI 1.4011 -- Total Cycles 128806 ---- Thread 02 ---- PC 5: Stalled ----- 99592 in-flight CPI 1.2930 -- Total Cycles 128806 ---- Thread 03 ---- PC 5: Stalled ----- 95222 in-flight CPI 1.3524 -- Total Cycles 128806 ---- Thread 04 ---- PC 5: Stalled ----- 102198 in-flight CPI 1.2601 -- Total Cycles 128806 ---- Thread 05 ---- PC 5: Stalled ----- 98104 in-flight CPI 1.3127 -- Total Cycles 128806 ---- Thread 06 ---- PC 5: Stalled ----- 103096 in-flight CPI 1.2491 -- Total Cycles 128806 ---- Thread 07 ---- PC 5: Stalled ----- 96389 in-flight CPI 1.3360 -- Total Cycles 128806 ---- Thread 08 ---- PC 5: Stalled ----- 96346 in-flight CPI 1.3366 -- Total Cycles 128806 ---- Thread 09 ---- PC 5: Stalled ----- 96968 in-flight CPI 1.3281 -- Total Cycles 128806 ---- Thread 10 ---- PC 5: Stalled ----- 93017 in-flight CPI 1.3845 -- Total Cycles 128806 ---- Thread 11 ---- PC 5: Stalled ----- 96438 in-flight CPI 1.3354 -- Total Cycles 128806 ---- Thread 12 ---- PC 5: Stalled ----- 95969 in-flight CPI 1.3419 -- Total Cycles 128806 ---- Thread 13 ---- PC 5: Stalled ----- 98545 in-flight CPI 1.3068 -- Total Cycles 128806 ---- Thread 14 ---- PC 5: Stalled ----- 90626 in-flight CPI 1.4211 -- Total Cycles 128806 ---- Thread 15 ---- PC 5: Stalled ----- 100356 in-flight CPI 1.2832 -- Total Cycles 128806 ---- Thread 16 ---- PC 5: Stalled ----- 92526 in-flight CPI 1.3920 -- Total Cycles 128806 ---- Thread 17 ---- PC 5: Stalled ----- 91250 in-flight CPI 1.4113 -- Total Cycles 128806 ---- Thread 18 ---- PC 5: Stalled ----- 92666 in-flight CPI 1.3898 -- Total Cycles 128806 ---- Thread 19 ---- PC 5: Stalled ----- 92686 in-flight CPI 1.3894 -- Total Cycles 128806 ---- Thread 20 ---- PC 5: Stalled ----- 92789 in-flight CPI 1.3879 -- Total Cycles 128806 ---- Thread 21 ---- PC 5: Stalled ----- 96659 in-flight CPI 1.3323 -- Total Cycles 128806 ---- Thread 22 ---- PC 5: Stalled ----- 94279 in-flight CPI 1.3660 -- Total Cycles 128806 ---- Thread 23 ---- PC 5: Stalled ----- 91275 in-flight CPI 1.4109 -- Total Cycles 128806 ---- Thread 24 ---- PC 5: Stalled ----- 90634 in-flight CPI 1.4209 -- Total Cycles 128806 ---- Thread 25 ---- PC 5: Stalled ----- 94706 in-flight CPI 1.3598 -- Total Cycles 128806 ---- Thread 26 ---- PC 5: Stalled ----- 84861 in-flight CPI 1.5176 -- Total Cycles 128806 ---- Thread 27 ---- PC 5: Stalled ----- 92974 in-flight CPI 1.3852 -- Total Cycles 128806 ---- Thread 28 ---- PC 5: Stalled ----- 86971 in-flight CPI 1.4807 -- Total Cycles 128806 ---- Thread 29 ---- PC 5: Stalled ----- 92697 in-flight CPI 1.3892 -- Total Cycles 128806 ---- Thread 30 ---- PC 5: Stalled ----- 92784 in-flight CPI 1.3880 -- Total Cycles 128806 ---- Thread 31 ---- PC 5: Stalled ----- 87742 in-flight CPI 1.4677 -- Total Cycles 128806 Total CPI 0.0426 , IPC 23.4607 -- Total Cycles 128806 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7821 (4.234067%) FPSUB: 0 (0.000000%) FPMUL: 31964 (17.304402%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 59356 (32.133654%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5877 (3.181641%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71791 (38.865610%) DIV: 7639 (4.135538%) FPUN: 0 (0.000000%) FPRSUB: 268 (0.145088%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3313544 total) ADD%: 7.395 (245040) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.526 (50552) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.561 (18574) FPSUB%: 0.000 (0) FPMUL%: 4.822 (159784) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.166 (171192) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (604) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.071 (35496) FPLE%: 0.452 (14991) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.823 (93547) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24721) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.758 (522152) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.180 (39087) ORI%: 1.573 (52136) XORI%: 0.000 (0) MULI%: 3.225 (106872) LW%: 1.139 (37750) LWI%: 13.589 (450290) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9529) SWI%: 4.096 (135727) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.413 (46808) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10237) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1924) bned%: 0.000 (0) bneid%: 13.871 (459633) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23819) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4083) DIV%: 0.012 (414) FPUN%: 1.479 (49008) FPRSUB%: 3.693 (122373) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.965 (98237) FPGE%: 1.027 (34017) SYNC%: 0.000 (0) NOP%: 8.801 (291610) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 8 FPSUB 0 FPMUL 46 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 38976 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1559 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49028 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 11175 XORI 0 MULI 9751 LW 0 LWI 142472 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 84 DIV 28 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4609 --Total thread-cycles: 4121792 --total thread-cycles issued: 3021934 (73.316024%) --iCache conflicts: 113382 (2.750794%) --thread*cycles of FU dependence: 253616 (6.153052%) --thread*cycles of data dependence: 184716 (4.481449%) --iCache cycles*banks: 4121792 (80.391635% used) Issue breakdown: --thread*cycles of issue worked: 3021934 (73.316024%) --thread*cycles of issue failed: 808248 (19.609141%) --thread*cycles of issue NOP/other: 291610 (7.074835%) Number of thread-cycles not ready: 184716 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3313544 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 9 3: 8 4: 8 5: 8 6: 9 7: 8 8: 8 9: 7 10: 7 11: 8 12: 8 13: 8 14: 6 15: 8 16: 5 17: 7 18: 7 19: 8 20: 8 21: 8 22: 7 23: 8 24: 8 25: 7 26: 6 27: 6 28: 7 29: 8 30: 7 31: 7 <=== Core 71 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96444 in-flight CPI 1.3199 -- Total Cycles 127320 ---- Thread 01 ---- PC 5: Stalled ----- 97335 in-flight CPI 1.3078 -- Total Cycles 127320 ---- Thread 02 ---- PC 5: Stalled ----- 98626 in-flight CPI 1.2907 -- Total Cycles 127320 ---- Thread 03 ---- PC 5: Stalled ----- 97284 in-flight CPI 1.3085 -- Total Cycles 127320 ---- Thread 04 ---- PC 5: Stalled ----- 102581 in-flight CPI 1.2409 -- Total Cycles 127320 ---- Thread 05 ---- PC 5: Stalled ----- 102907 in-flight CPI 1.2370 -- Total Cycles 127320 ---- Thread 06 ---- PC 5: Stalled ----- 100075 in-flight CPI 1.2720 -- Total Cycles 127320 ---- Thread 07 ---- PC 5: Stalled ----- 96241 in-flight CPI 1.3227 -- Total Cycles 127320 ---- Thread 08 ---- PC 5: Stalled ----- 99660 in-flight CPI 1.2773 -- Total Cycles 127320 ---- Thread 09 ---- PC 5: Stalled ----- 96302 in-flight CPI 1.3218 -- Total Cycles 127320 ---- Thread 10 ---- PC 5: Stalled ----- 96618 in-flight CPI 1.3175 -- Total Cycles 127320 ---- Thread 11 ---- PC 5: Stalled ----- 94975 in-flight CPI 1.3403 -- Total Cycles 127320 ---- Thread 12 ---- PC 5: Stalled ----- 96942 in-flight CPI 1.3131 -- Total Cycles 127320 ---- Thread 13 ---- PC 5: Stalled ----- 97148 in-flight CPI 1.3103 -- Total Cycles 127320 ---- Thread 14 ---- PC 5: Stalled ----- 95549 in-flight CPI 1.3322 -- Total Cycles 127320 ---- Thread 15 ---- PC 5: Stalled ----- 99728 in-flight CPI 1.2764 -- Total Cycles 127320 ---- Thread 16 ---- PC 5: Stalled ----- 101278 in-flight CPI 1.2569 -- Total Cycles 127320 ---- Thread 17 ---- PC 5: Stalled ----- 99226 in-flight CPI 1.2828 -- Total Cycles 127320 ---- Thread 18 ---- PC 5: Stalled ----- 98431 in-flight CPI 1.2932 -- Total Cycles 127320 ---- Thread 19 ---- PC 5: Stalled ----- 97708 in-flight CPI 1.3028 -- Total Cycles 127320 ---- Thread 20 ---- PC 5: Stalled ----- 89918 in-flight CPI 1.4157 -- Total Cycles 127320 ---- Thread 21 ---- PC 5: Stalled ----- 94996 in-flight CPI 1.3400 -- Total Cycles 127320 ---- Thread 22 ---- PC 5: Stalled ----- 95695 in-flight CPI 1.3302 -- Total Cycles 127320 ---- Thread 23 ---- PC 5: Stalled ----- 91006 in-flight CPI 1.3988 -- Total Cycles 127320 ---- Thread 24 ---- PC 5: Stalled ----- 93569 in-flight CPI 1.3604 -- Total Cycles 127320 ---- Thread 25 ---- PC 5: Stalled ----- 92487 in-flight CPI 1.3764 -- Total Cycles 127320 ---- Thread 26 ---- PC 5: Stalled ----- 87835 in-flight CPI 1.4493 -- Total Cycles 127320 ---- Thread 27 ---- PC 5: Stalled ----- 83745 in-flight CPI 1.5201 -- Total Cycles 127320 ---- Thread 28 ---- PC 5: Stalled ----- 85537 in-flight CPI 1.4882 -- Total Cycles 127320 ---- Thread 29 ---- PC 5: Stalled ----- 89265 in-flight CPI 1.4261 -- Total Cycles 127320 ---- Thread 30 ---- PC 5: Stalled ----- 89909 in-flight CPI 1.4158 -- Total Cycles 127320 ---- Thread 31 ---- PC 5: Stalled ----- 91311 in-flight CPI 1.3941 -- Total Cycles 127320 Total CPI 0.0417 , IPC 23.9624 -- Total Cycles 127320 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7188 (3.658220%) FPSUB: 0 (0.000000%) FPMUL: 30790 (15.670088%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 78179 (39.787978%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5554 (2.826621%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 66829 (34.011573%) DIV: 7683 (3.910143%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.135377%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3345219 total) ADD%: 7.526 (251768) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.551 (51870) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.512 (17124) FPSUB%: 0.000 (0) FPMUL%: 4.675 (156401) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.114 (171084) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (588) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.056 (35314) FPLE%: 0.464 (15534) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.838 (94928) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.739 (24719) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.797 (528445) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.185 (39656) ORI%: 1.538 (51444) XORI%: 0.000 (0) MULI%: 3.243 (108480) LW%: 1.145 (38304) LWI%: 13.595 (454795) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.292 (9766) SWI%: 4.100 (137157) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.416 (47382) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.314 (10495) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.048 (1610) bned%: 0.000 (0) bneid%: 13.923 (465756) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.726 (24272) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.113 (3789) DIV%: 0.012 (416) FPUN%: 1.504 (50310) FPRSUB%: 3.645 (121947) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.974 (99478) FPGE%: 1.040 (34776) SYNC%: 0.000 (0) NOP%: 8.797 (294264) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 24 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 42 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 38468 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1372 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 3 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49848 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 6 ORI 10131 XORI 0 MULI 9504 LW 0 LWI 143887 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 30 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9627 --Total thread-cycles: 4074240 --total thread-cycles issued: 3050955 (74.884027%) --iCache conflicts: 112716 (2.766553%) --thread*cycles of FU dependence: 253852 (6.230659%) --thread*cycles of data dependence: 196489 (4.822715%) --iCache cycles*banks: 4074240 (82.107362% used) Issue breakdown: --thread*cycles of issue worked: 3050955 (74.884027%) --thread*cycles of issue failed: 729021 (17.893423%) --thread*cycles of issue NOP/other: 294264 (7.222549%) Number of thread-cycles not ready: 196489 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3345219 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 8 4: 9 5: 9 6: 8 7: 7 8: 7 9: 8 10: 7 11: 7 12: 7 13: 8 14: 8 15: 8 16: 8 17: 9 18: 8 19: 7 20: 6 21: 9 22: 8 23: 6 24: 8 25: 7 26: 7 27: 5 28: 7 29: 7 30: 8 31: 7 <=== Core 72 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96532 in-flight CPI 1.4523 -- Total Cycles 140225 ---- Thread 01 ---- PC 5: Stalled ----- 94777 in-flight CPI 1.4793 -- Total Cycles 140225 ---- Thread 02 ---- PC 5: Stalled ----- 102969 in-flight CPI 1.3615 -- Total Cycles 140225 ---- Thread 03 ---- PC 5: Stalled ----- 107108 in-flight CPI 1.3090 -- Total Cycles 140225 ---- Thread 04 ---- PC 5: Stalled ----- 88982 in-flight CPI 1.5757 -- Total Cycles 140225 ---- Thread 05 ---- PC 5: Stalled ----- 95021 in-flight CPI 1.4755 -- Total Cycles 140225 ---- Thread 06 ---- PC 5: Stalled ----- 98018 in-flight CPI 1.4303 -- Total Cycles 140225 ---- Thread 07 ---- PC 5: Stalled ----- 97902 in-flight CPI 1.4320 -- Total Cycles 140225 ---- Thread 08 ---- PC 5: Stalled ----- 100229 in-flight CPI 1.3988 -- Total Cycles 140225 ---- Thread 09 ---- PC 5: Stalled ----- 92691 in-flight CPI 1.5125 -- Total Cycles 140225 ---- Thread 10 ---- PC 5: Stalled ----- 90666 in-flight CPI 1.5464 -- Total Cycles 140225 ---- Thread 11 ---- PC 5: Stalled ----- 95943 in-flight CPI 1.4613 -- Total Cycles 140225 ---- Thread 12 ---- PC 5: Stalled ----- 99271 in-flight CPI 1.4123 -- Total Cycles 140225 ---- Thread 13 ---- PC 5: Stalled ----- 97089 in-flight CPI 1.4441 -- Total Cycles 140225 ---- Thread 14 ---- PC 5: Stalled ----- 99872 in-flight CPI 1.4037 -- Total Cycles 140225 ---- Thread 15 ---- PC 5: Stalled ----- 94937 in-flight CPI 1.4767 -- Total Cycles 140225 ---- Thread 16 ---- PC 5: Stalled ----- 99056 in-flight CPI 1.4154 -- Total Cycles 140225 ---- Thread 17 ---- PC 5: Stalled ----- 100348 in-flight CPI 1.3971 -- Total Cycles 140225 ---- Thread 18 ---- PC 5: Stalled ----- 97237 in-flight CPI 1.4418 -- Total Cycles 140225 ---- Thread 19 ---- PC 5: Stalled ----- 89522 in-flight CPI 1.5661 -- Total Cycles 140225 ---- Thread 20 ---- PC 5: Stalled ----- 94266 in-flight CPI 1.4872 -- Total Cycles 140225 ---- Thread 21 ---- PC 5: Stalled ----- 92635 in-flight CPI 1.5135 -- Total Cycles 140225 ---- Thread 22 ---- PC 5: Stalled ----- 87355 in-flight CPI 1.6049 -- Total Cycles 140225 ---- Thread 23 ---- PC 5: Stalled ----- 95835 in-flight CPI 1.4629 -- Total Cycles 140225 ---- Thread 24 ---- PC 5: Stalled ----- 89588 in-flight CPI 1.5650 -- Total Cycles 140225 ---- Thread 25 ---- PC 5: Stalled ----- 90906 in-flight CPI 1.5422 -- Total Cycles 140225 ---- Thread 26 ---- PC 5: Stalled ----- 91347 in-flight CPI 1.5349 -- Total Cycles 140225 ---- Thread 27 ---- PC 5: Stalled ----- 91027 in-flight CPI 1.5401 -- Total Cycles 140225 ---- Thread 28 ---- PC 5: Stalled ----- 89506 in-flight CPI 1.5664 -- Total Cycles 140225 ---- Thread 29 ---- PC 5: Stalled ----- 92051 in-flight CPI 1.5230 -- Total Cycles 140225 ---- Thread 30 ---- PC 5: Stalled ----- 90535 in-flight CPI 1.5485 -- Total Cycles 140225 ---- Thread 31 ---- PC 5: Stalled ----- 94620 in-flight CPI 1.4817 -- Total Cycles 140225 Total CPI 0.0462 , IPC 21.6680 -- Total Cycles 140225 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8437 (3.945436%) FPSUB: 0 (0.000000%) FPMUL: 33028 (15.445048%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 82917 (38.774890%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5496 (2.570122%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 76289 (35.675405%) DIV: 7415 (3.467513%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.121585%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3331639 total) ADD%: 7.451 (248240) SUB%: 0.000 (0) MUL%: 0.006 (201) BITOR%: 1.534 (51118) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.590 (19658) FPSUB%: 0.000 (0) FPMUL%: 4.905 (163407) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (603) FPMAX%: 0.018 (603) LOAD%: 5.218 (173838) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (577) FPINV%: 0.000 (0) FPCONV%: 0.019 (635) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.083 (36094) FPLE%: 0.458 (15249) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (603) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.796 (93146) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.763 (25427) CMPU%: 0.000 (0) RSUB%: 0.006 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.744 (524535) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39233) ORI%: 1.593 (53084) XORI%: 0.000 (0) MULI%: 3.192 (106330) LW%: 1.128 (37580) LWI%: 13.472 (448841) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9563) SWI%: 4.056 (135131) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.396 (46516) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10372) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (2009) bned%: 0.000 (0) bneid%: 13.852 (461506) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23806) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.131 (4350) DIV%: 0.012 (402) FPUN%: 1.479 (49264) FPRSUB%: 3.714 (123733) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.947 (98174) FPGE%: 1.021 (34015) SYNC%: 0.000 (0) NOP%: 8.800 (293195) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 8 FPSUB 0 FPMUL 61 FPCMPLT 0 FPMIN 0 FPMAX 392 LOAD 40815 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1393 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48991 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 12105 XORI 0 MULI 9697 LW 0 LWI 142157 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 52 DIV 28 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6682 --Total thread-cycles: 4487200 --total thread-cycles issued: 3038444 (67.713585%) --iCache conflicts: 113490 (2.529194%) --thread*cycles of FU dependence: 255799 (5.700637%) --thread*cycles of data dependence: 213842 (4.765600%) --iCache cycles*banks: 4487200 (74.248329% used) Issue breakdown: --thread*cycles of issue worked: 3038444 (67.713585%) --thread*cycles of issue failed: 1155561 (25.752385%) --thread*cycles of issue NOP/other: 293195 (6.534030%) Number of thread-cycles not ready: 213842 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3331639 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 6 2: 9 3: 6 4: 5 5: 6 6: 8 7: 8 8: 8 9: 8 10: 5 11: 7 12: 8 13: 7 14: 9 15: 8 16: 7 17: 8 18: 9 19: 6 20: 8 21: 6 22: 7 23: 7 24: 6 25: 8 26: 6 27: 8 28: 7 29: 8 30: 8 31: 8 <=== Core 73 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95047 in-flight CPI 1.3484 -- Total Cycles 128190 ---- Thread 01 ---- PC 5: Stalled ----- 97094 in-flight CPI 1.3200 -- Total Cycles 128190 ---- Thread 02 ---- PC 5: Stalled ----- 95343 in-flight CPI 1.3442 -- Total Cycles 128190 ---- Thread 03 ---- PC 5: Stalled ----- 97430 in-flight CPI 1.3155 -- Total Cycles 128190 ---- Thread 04 ---- PC 5: Stalled ----- 103917 in-flight CPI 1.2333 -- Total Cycles 128190 ---- Thread 05 ---- PC 5: Stalled ----- 97352 in-flight CPI 1.3165 -- Total Cycles 128190 ---- Thread 06 ---- PC 5: Stalled ----- 92592 in-flight CPI 1.3843 -- Total Cycles 128190 ---- Thread 07 ---- PC 5: Stalled ----- 95010 in-flight CPI 1.3490 -- Total Cycles 128190 ---- Thread 08 ---- PC 5: Stalled ----- 96195 in-flight CPI 1.3323 -- Total Cycles 128190 ---- Thread 09 ---- PC 5: Stalled ----- 98853 in-flight CPI 1.2966 -- Total Cycles 128190 ---- Thread 10 ---- PC 5: Stalled ----- 95043 in-flight CPI 1.3485 -- Total Cycles 128190 ---- Thread 11 ---- PC 5: Stalled ----- 101405 in-flight CPI 1.2639 -- Total Cycles 128190 ---- Thread 12 ---- PC 5: Stalled ----- 92400 in-flight CPI 1.3871 -- Total Cycles 128190 ---- Thread 13 ---- PC 5: Stalled ----- 93697 in-flight CPI 1.3679 -- Total Cycles 128190 ---- Thread 14 ---- PC 5: Stalled ----- 95031 in-flight CPI 1.3487 -- Total Cycles 128190 ---- Thread 15 ---- PC 5: Stalled ----- 96645 in-flight CPI 1.3261 -- Total Cycles 128190 ---- Thread 16 ---- PC 5: Stalled ----- 94001 in-flight CPI 1.3635 -- Total Cycles 128190 ---- Thread 17 ---- PC 5: Stalled ----- 93656 in-flight CPI 1.3685 -- Total Cycles 128190 ---- Thread 18 ---- PC 5: Stalled ----- 101044 in-flight CPI 1.2684 -- Total Cycles 128190 ---- Thread 19 ---- PC 5: Stalled ----- 97276 in-flight CPI 1.3175 -- Total Cycles 128190 ---- Thread 20 ---- PC 5: Stalled ----- 88611 in-flight CPI 1.4464 -- Total Cycles 128190 ---- Thread 21 ---- PC 5: Stalled ----- 93963 in-flight CPI 1.3640 -- Total Cycles 128190 ---- Thread 22 ---- PC 5: Stalled ----- 95695 in-flight CPI 1.3393 -- Total Cycles 128190 ---- Thread 23 ---- PC 5: Stalled ----- 89465 in-flight CPI 1.4326 -- Total Cycles 128190 ---- Thread 24 ---- PC 5: Stalled ----- 89403 in-flight CPI 1.4336 -- Total Cycles 128190 ---- Thread 25 ---- PC 5: Stalled ----- 89884 in-flight CPI 1.4259 -- Total Cycles 128190 ---- Thread 26 ---- PC 5: Stalled ----- 94089 in-flight CPI 1.3622 -- Total Cycles 128190 ---- Thread 27 ---- PC 5: Stalled ----- 89431 in-flight CPI 1.4332 -- Total Cycles 128190 ---- Thread 28 ---- PC 5: Stalled ----- 85170 in-flight CPI 1.5049 -- Total Cycles 128190 ---- Thread 29 ---- PC 5: Stalled ----- 89109 in-flight CPI 1.4384 -- Total Cycles 128190 ---- Thread 30 ---- PC 5: Stalled ----- 87201 in-flight CPI 1.4698 -- Total Cycles 128190 ---- Thread 31 ---- PC 5: Stalled ----- 91486 in-flight CPI 1.4009 -- Total Cycles 128190 Total CPI 0.0425 , IPC 23.5048 -- Total Cycles 128190 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8061 (3.901724%) FPSUB: 0 (0.000000%) FPMUL: 32227 (15.598666%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79063 (38.268450%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5529 (2.676173%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74042 (35.838161%) DIV: 7417 (3.590012%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.126814%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3303952 total) ADD%: 7.448 (246091) SUB%: 0.000 (0) MUL%: 0.006 (201) BITOR%: 1.530 (50557) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.572 (18909) FPSUB%: 0.000 (0) FPMUL%: 4.857 (160460) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (603) FPMAX%: 0.018 (603) LOAD%: 5.198 (171750) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (579) FPINV%: 0.000 (0) FPCONV%: 0.019 (635) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.077 (35569) FPLE%: 0.454 (15005) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (603) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.805 (92691) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.759 (25081) CMPU%: 0.000 (0) RSUB%: 0.006 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.748 (520312) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (38942) ORI%: 1.584 (52348) XORI%: 0.000 (0) MULI%: 3.205 (105900) LW%: 1.132 (37398) LWI%: 13.525 (446857) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9496) SWI%: 4.074 (134608) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.402 (46312) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10295) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (1980) bned%: 0.000 (0) bneid%: 13.856 (457795) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23693) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4220) DIV%: 0.012 (402) FPUN%: 1.477 (48812) FPRSUB%: 3.704 (122393) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.955 (97632) FPGE%: 1.023 (33807) SYNC%: 0.000 (0) NOP%: 8.802 (290811) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 54 FPCMPLT 0 FPMIN 0 FPMAX 392 LOAD 39679 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1296 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48749 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11518 XORI 0 MULI 9599 LW 0 LWI 141661 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 83 DIV 30 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5051 --Total thread-cycles: 4102080 --total thread-cycles issued: 3013141 (73.453979%) --iCache conflicts: 111639 (2.721522%) --thread*cycles of FU dependence: 253162 (6.171552%) --thread*cycles of data dependence: 206601 (5.036494%) --iCache cycles*banks: 4102080 (80.544114% used) Issue breakdown: --thread*cycles of issue worked: 3013141 (73.453979%) --thread*cycles of issue failed: 798128 (19.456666%) --thread*cycles of issue NOP/other: 290811 (7.089355%) Number of thread-cycles not ready: 206601 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3303952 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 7 4: 9 5: 7 6: 5 7: 8 8: 8 9: 7 10: 7 11: 8 12: 6 13: 8 14: 7 15: 9 16: 7 17: 7 18: 9 19: 9 20: 6 21: 7 22: 8 23: 7 24: 7 25: 7 26: 8 27: 6 28: 6 29: 6 30: 7 31: 7 <=== Core 74 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100892 in-flight CPI 1.2619 -- Total Cycles 127340 ---- Thread 01 ---- PC 5: Stalled ----- 100057 in-flight CPI 1.2724 -- Total Cycles 127340 ---- Thread 02 ---- PC 5: Stalled ----- 98225 in-flight CPI 1.2962 -- Total Cycles 127340 ---- Thread 03 ---- PC 5: Stalled ----- 99722 in-flight CPI 1.2767 -- Total Cycles 127340 ---- Thread 04 ---- PC 5: Stalled ----- 104230 in-flight CPI 1.2215 -- Total Cycles 127340 ---- Thread 05 ---- PC 5: Stalled ----- 95834 in-flight CPI 1.3285 -- Total Cycles 127340 ---- Thread 06 ---- PC 5: Stalled ----- 97267 in-flight CPI 1.3089 -- Total Cycles 127340 ---- Thread 07 ---- PC 5: Stalled ----- 95795 in-flight CPI 1.3290 -- Total Cycles 127340 ---- Thread 08 ---- PC 5: Stalled ----- 95331 in-flight CPI 1.3355 -- Total Cycles 127340 ---- Thread 09 ---- PC 5: Stalled ----- 100382 in-flight CPI 1.2683 -- Total Cycles 127340 ---- Thread 10 ---- PC 5: Stalled ----- 98098 in-flight CPI 1.2979 -- Total Cycles 127340 ---- Thread 11 ---- PC 5: Stalled ----- 98762 in-flight CPI 1.2892 -- Total Cycles 127340 ---- Thread 12 ---- PC 5: Stalled ----- 99589 in-flight CPI 1.2784 -- Total Cycles 127340 ---- Thread 13 ---- PC 5: Stalled ----- 97521 in-flight CPI 1.3055 -- Total Cycles 127340 ---- Thread 14 ---- PC 5: Stalled ----- 96265 in-flight CPI 1.3225 -- Total Cycles 127340 ---- Thread 15 ---- PC 5: Stalled ----- 92894 in-flight CPI 1.3706 -- Total Cycles 127340 ---- Thread 16 ---- PC 5: Stalled ----- 98237 in-flight CPI 1.2960 -- Total Cycles 127340 ---- Thread 17 ---- PC 5: Stalled ----- 99766 in-flight CPI 1.2761 -- Total Cycles 127340 ---- Thread 18 ---- PC 5: Stalled ----- 90450 in-flight CPI 1.4075 -- Total Cycles 127340 ---- Thread 19 ---- PC 5: Stalled ----- 94228 in-flight CPI 1.3512 -- Total Cycles 127340 ---- Thread 20 ---- PC 5: Stalled ----- 97459 in-flight CPI 1.3063 -- Total Cycles 127340 ---- Thread 21 ---- PC 5: Stalled ----- 97235 in-flight CPI 1.3094 -- Total Cycles 127340 ---- Thread 22 ---- PC 5: Stalled ----- 98254 in-flight CPI 1.2958 -- Total Cycles 127340 ---- Thread 23 ---- PC 5: Stalled ----- 97616 in-flight CPI 1.3042 -- Total Cycles 127340 ---- Thread 24 ---- PC 5: Stalled ----- 92052 in-flight CPI 1.3831 -- Total Cycles 127340 ---- Thread 25 ---- PC 5: Stalled ----- 95189 in-flight CPI 1.3375 -- Total Cycles 127340 ---- Thread 26 ---- PC 5: Stalled ----- 88777 in-flight CPI 1.4341 -- Total Cycles 127340 ---- Thread 27 ---- PC 5: Stalled ----- 87507 in-flight CPI 1.4549 -- Total Cycles 127340 ---- Thread 28 ---- PC 5: Stalled ----- 88233 in-flight CPI 1.4430 -- Total Cycles 127340 ---- Thread 29 ---- PC 5: Stalled ----- 93654 in-flight CPI 1.3594 -- Total Cycles 127340 ---- Thread 30 ---- PC 5: Stalled ----- 92963 in-flight CPI 1.3696 -- Total Cycles 127340 ---- Thread 31 ---- PC 5: Stalled ----- 89847 in-flight CPI 1.4170 -- Total Cycles 127340 Total CPI 0.0414 , IPC 24.1315 -- Total Cycles 127340 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7238 (3.702340%) FPSUB: 0 (0.000000%) FPMUL: 31037 (15.875866%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 77042 (39.408076%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5776 (2.954506%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 66434 (33.981933%) DIV: 7706 (3.941728%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.135551%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3369527 total) ADD%: 7.525 (253562) SUB%: 0.000 (0) MUL%: 0.006 (209) BITOR%: 1.531 (51602) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.513 (17277) FPSUB%: 0.000 (0) FPMUL%: 4.679 (157661) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (627) FPMAX%: 0.019 (627) LOAD%: 5.106 (172044) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (600) FPINV%: 0.000 (0) FPCONV%: 0.020 (659) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.054 (35524) FPLE%: 0.457 (15396) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (627) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.848 (95949) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.733 (24686) CMPU%: 0.000 (0) RSUB%: 0.006 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.775 (531530) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.182 (39820) ORI%: 1.545 (52065) XORI%: 0.000 (0) MULI%: 3.248 (109454) LW%: 1.149 (38714) LWI%: 13.640 (459590) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.291 (9814) SWI%: 4.116 (138697) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.423 (47957) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.313 (10552) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1843) bned%: 0.000 (0) bneid%: 13.899 (468319) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.727 (24486) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.112 (3765) DIV%: 0.012 (418) FPUN%: 1.492 (50287) FPRSUB%: 3.644 (122778) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.977 (100313) FPGE%: 1.035 (34891) SYNC%: 0.000 (0) NOP%: 8.802 (296569) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 12 BITOR 1 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 48 FPCMPLT 0 FPMIN 0 FPMAX 407 LOAD 38221 INTCONV 0 ATOMIC_INC 23 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 9 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1367 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 50224 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 10211 XORI 0 MULI 10493 LW 0 LWI 144952 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 25 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.1317 --Total thread-cycles: 4074880 --total thread-cycles issued: 3072958 (75.412233%) --iCache conflicts: 116038 (2.847642%) --thread*cycles of FU dependence: 256107 (6.285019%) --thread*cycles of data dependence: 195498 (4.797638%) --iCache cycles*banks: 4074880 (82.690999% used) Issue breakdown: --thread*cycles of issue worked: 3072958 (75.412233%) --thread*cycles of issue failed: 705353 (17.309786%) --thread*cycles of issue NOP/other: 296569 (7.277981%) Number of thread-cycles not ready: 195498 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3369527 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 8 4: 9 5: 7 6: 8 7: 8 8: 7 9: 8 10: 7 11: 7 12: 7 13: 8 14: 8 15: 6 16: 8 17: 8 18: 8 19: 7 20: 9 21: 8 22: 8 23: 8 24: 7 25: 8 26: 7 27: 7 28: 6 29: 8 30: 6 31: 7 <=== Core 75 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97811 in-flight CPI 1.3239 -- Total Cycles 129510 ---- Thread 01 ---- PC 5: Stalled ----- 102451 in-flight CPI 1.2639 -- Total Cycles 129510 ---- Thread 02 ---- PC 5: Stalled ----- 99436 in-flight CPI 1.3022 -- Total Cycles 129510 ---- Thread 03 ---- PC 5: Stalled ----- 98199 in-flight CPI 1.3186 -- Total Cycles 129510 ---- Thread 04 ---- PC 5: Stalled ----- 97413 in-flight CPI 1.3292 -- Total Cycles 129510 ---- Thread 05 ---- PC 5: Stalled ----- 104812 in-flight CPI 1.2354 -- Total Cycles 129510 ---- Thread 06 ---- PC 5: Stalled ----- 94996 in-flight CPI 1.3631 -- Total Cycles 129510 ---- Thread 07 ---- PC 5: Stalled ----- 94261 in-flight CPI 1.3737 -- Total Cycles 129510 ---- Thread 08 ---- PC 5: Stalled ----- 100853 in-flight CPI 1.2839 -- Total Cycles 129510 ---- Thread 09 ---- PC 5: Stalled ----- 98562 in-flight CPI 1.3137 -- Total Cycles 129510 ---- Thread 10 ---- PC 5: Stalled ----- 104562 in-flight CPI 1.2384 -- Total Cycles 129510 ---- Thread 11 ---- PC 5: Stalled ----- 96685 in-flight CPI 1.3392 -- Total Cycles 129510 ---- Thread 12 ---- PC 5: Stalled ----- 97809 in-flight CPI 1.3239 -- Total Cycles 129510 ---- Thread 13 ---- PC 5: Stalled ----- 93367 in-flight CPI 1.3869 -- Total Cycles 129510 ---- Thread 14 ---- PC 5: Stalled ----- 95613 in-flight CPI 1.3543 -- Total Cycles 129510 ---- Thread 15 ---- PC 5: Stalled ----- 96651 in-flight CPI 1.3397 -- Total Cycles 129510 ---- Thread 16 ---- PC 5: Stalled ----- 92246 in-flight CPI 1.4037 -- Total Cycles 129510 ---- Thread 17 ---- PC 5: Stalled ----- 95046 in-flight CPI 1.3624 -- Total Cycles 129510 ---- Thread 18 ---- PC 5: Stalled ----- 96258 in-flight CPI 1.3452 -- Total Cycles 129510 ---- Thread 19 ---- PC 5: Stalled ----- 96681 in-flight CPI 1.3393 -- Total Cycles 129510 ---- Thread 20 ---- PC 5: Stalled ----- 90638 in-flight CPI 1.4286 -- Total Cycles 129510 ---- Thread 21 ---- PC 5: Stalled ----- 93320 in-flight CPI 1.3876 -- Total Cycles 129510 ---- Thread 22 ---- PC 5: Stalled ----- 91419 in-flight CPI 1.4164 -- Total Cycles 129510 ---- Thread 23 ---- PC 5: Stalled ----- 93489 in-flight CPI 1.3850 -- Total Cycles 129510 ---- Thread 24 ---- PC 5: Stalled ----- 88175 in-flight CPI 1.4685 -- Total Cycles 129510 ---- Thread 25 ---- PC 5: Stalled ----- 90499 in-flight CPI 1.4308 -- Total Cycles 129510 ---- Thread 26 ---- PC 5: Stalled ----- 90279 in-flight CPI 1.4343 -- Total Cycles 129510 ---- Thread 27 ---- PC 5: Stalled ----- 91674 in-flight CPI 1.4125 -- Total Cycles 129510 ---- Thread 28 ---- PC 5: Stalled ----- 87190 in-flight CPI 1.4851 -- Total Cycles 129510 ---- Thread 29 ---- PC 5: Stalled ----- 89438 in-flight CPI 1.4478 -- Total Cycles 129510 ---- Thread 30 ---- PC 5: Stalled ----- 85490 in-flight CPI 1.5147 -- Total Cycles 129510 ---- Thread 31 ---- PC 5: Stalled ----- 88169 in-flight CPI 1.4686 -- Total Cycles 129510 Total CPI 0.0427 , IPC 23.4273 -- Total Cycles 129510 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7614 (4.012162%) FPSUB: 0 (0.000000%) FPMUL: 31440 (16.567162%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 66311 (34.942273%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5656 (2.980403%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70763 (37.288234%) DIV: 7720 (4.068018%) FPUN: 0 (0.000000%) FPRSUB: 269 (0.141748%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3326884 total) ADD%: 7.462 (248268) SUB%: 0.000 (0) MUL%: 0.006 (209) BITOR%: 1.536 (51097) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.542 (18040) FPSUB%: 0.000 (0) FPMUL%: 4.764 (158481) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (627) FPMAX%: 0.019 (627) LOAD%: 5.160 (171665) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (594) FPINV%: 0.000 (0) FPCONV%: 0.020 (659) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (35415) FPLE%: 0.460 (15301) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (627) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.826 (94034) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.743 (24716) CMPU%: 0.000 (0) RSUB%: 0.006 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.770 (524664) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (39240) ORI%: 1.566 (52087) XORI%: 0.000 (0) MULI%: 3.228 (107378) LW%: 1.141 (37948) LWI%: 13.575 (451618) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9642) SWI%: 4.096 (136283) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.412 (46974) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10371) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1899) bned%: 0.000 (0) bneid%: 13.880 (461776) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.725 (24116) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4022) DIV%: 0.013 (418) FPUN%: 1.491 (49596) FPRSUB%: 3.673 (122210) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (63) FPGT%: 2.964 (98613) FPGE%: 1.031 (34295) SYNC%: 0.000 (0) NOP%: 8.800 (292765) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 11 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 35 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 38663 INTCONV 0 ATOMIC_INC 26 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1867 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 13 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49253 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 10786 XORI 0 MULI 9772 LW 0 LWI 142916 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 58 DIV 18 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4275 --Total thread-cycles: 4144320 --total thread-cycles issued: 3034119 (73.211504%) --iCache conflicts: 113565 (2.740257%) --thread*cycles of FU dependence: 253887 (6.126144%) --thread*cycles of data dependence: 189773 (4.579111%) --iCache cycles*banks: 4144320 (80.276523% used) Issue breakdown: --thread*cycles of issue worked: 3034119 (73.211504%) --thread*cycles of issue failed: 817436 (19.724249%) --thread*cycles of issue NOP/other: 292765 (7.064247%) Number of thread-cycles not ready: 189773 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3326884 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 8 4: 8 5: 8 6: 6 7: 8 8: 8 9: 8 10: 8 11: 8 12: 8 13: 7 14: 8 15: 8 16: 8 17: 7 18: 8 19: 8 20: 7 21: 7 22: 7 23: 8 24: 7 25: 8 26: 7 27: 7 28: 7 29: 7 30: 5 31: 8 <=== Core 76 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100139 in-flight CPI 1.2782 -- Total Cycles 128025 ---- Thread 01 ---- PC 5: Stalled ----- 93929 in-flight CPI 1.3627 -- Total Cycles 128025 ---- Thread 02 ---- PC 5: Stalled ----- 95502 in-flight CPI 1.3403 -- Total Cycles 128025 ---- Thread 03 ---- PC 5: Stalled ----- 95086 in-flight CPI 1.3462 -- Total Cycles 128025 ---- Thread 04 ---- PC 5: Stalled ----- 93782 in-flight CPI 1.3649 -- Total Cycles 128025 ---- Thread 05 ---- PC 5: Stalled ----- 103210 in-flight CPI 1.2402 -- Total Cycles 128025 ---- Thread 06 ---- PC 5: Stalled ----- 92935 in-flight CPI 1.3773 -- Total Cycles 128025 ---- Thread 07 ---- PC 5: Stalled ----- 87994 in-flight CPI 1.4548 -- Total Cycles 128025 ---- Thread 08 ---- PC 5: Stalled ----- 98041 in-flight CPI 1.3056 -- Total Cycles 128025 ---- Thread 09 ---- PC 5: Stalled ----- 92714 in-flight CPI 1.3806 -- Total Cycles 128025 ---- Thread 10 ---- PC 5: Stalled ----- 99731 in-flight CPI 1.2835 -- Total Cycles 128025 ---- Thread 11 ---- PC 5: Stalled ----- 93752 in-flight CPI 1.3653 -- Total Cycles 128025 ---- Thread 12 ---- PC 5: Stalled ----- 101972 in-flight CPI 1.2553 -- Total Cycles 128025 ---- Thread 13 ---- PC 5: Stalled ----- 92492 in-flight CPI 1.3839 -- Total Cycles 128025 ---- Thread 14 ---- PC 5: Stalled ----- 95439 in-flight CPI 1.3412 -- Total Cycles 128025 ---- Thread 15 ---- PC 5: Stalled ----- 97892 in-flight CPI 1.3076 -- Total Cycles 128025 ---- Thread 16 ---- PC 5: Stalled ----- 96402 in-flight CPI 1.3278 -- Total Cycles 128025 ---- Thread 17 ---- PC 5: Stalled ----- 92842 in-flight CPI 1.3787 -- Total Cycles 128025 ---- Thread 18 ---- PC 5: Stalled ----- 98974 in-flight CPI 1.2933 -- Total Cycles 128025 ---- Thread 19 ---- PC 5: Stalled ----- 92556 in-flight CPI 1.3829 -- Total Cycles 128025 ---- Thread 20 ---- PC 5: Stalled ----- 97879 in-flight CPI 1.3078 -- Total Cycles 128025 ---- Thread 21 ---- PC 5: Stalled ----- 94798 in-flight CPI 1.3502 -- Total Cycles 128025 ---- Thread 22 ---- PC 5: Stalled ----- 95837 in-flight CPI 1.3356 -- Total Cycles 128025 ---- Thread 23 ---- PC 5: Stalled ----- 86509 in-flight CPI 1.4797 -- Total Cycles 128025 ---- Thread 24 ---- PC 5: Stalled ----- 89796 in-flight CPI 1.4255 -- Total Cycles 128025 ---- Thread 25 ---- PC 5: Stalled ----- 96896 in-flight CPI 1.3210 -- Total Cycles 128025 ---- Thread 26 ---- PC 5: Stalled ----- 94589 in-flight CPI 1.3532 -- Total Cycles 128025 ---- Thread 27 ---- PC 5: Stalled ----- 92458 in-flight CPI 1.3844 -- Total Cycles 128025 ---- Thread 28 ---- PC 5: Stalled ----- 91241 in-flight CPI 1.4029 -- Total Cycles 128025 ---- Thread 29 ---- PC 5: Stalled ----- 86345 in-flight CPI 1.4825 -- Total Cycles 128025 ---- Thread 30 ---- PC 5: Stalled ----- 87975 in-flight CPI 1.4550 -- Total Cycles 128025 ---- Thread 31 ---- PC 5: Stalled ----- 87320 in-flight CPI 1.4659 -- Total Cycles 128025 Total CPI 0.0424 , IPC 23.5701 -- Total Cycles 128025 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7994 (3.853644%) FPSUB: 0 (0.000000%) FPMUL: 32100 (15.474354%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 81391 (39.235924%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5577 (2.688488%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72808 (35.098342%) DIV: 7315 (3.526321%) FPUN: 0 (0.000000%) FPRSUB: 255 (0.122927%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3308444 total) ADD%: 7.468 (247068) SUB%: 0.000 (0) MUL%: 0.006 (198) BITOR%: 1.535 (50772) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.567 (18760) FPSUB%: 0.000 (0) FPMUL%: 4.842 (160208) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (594) FPMAX%: 0.018 (594) LOAD%: 5.182 (171439) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (230) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (578) FPINV%: 0.000 (0) FPCONV%: 0.019 (626) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.073 (35504) FPLE%: 0.453 (15003) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (594) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.812 (93028) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.758 (25084) CMPU%: 0.000 (0) RSUB%: 0.006 (198) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.750 (521072) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.181 (39071) ORI%: 1.585 (52425) XORI%: 0.000 (0) MULI%: 3.210 (106206) LW%: 1.134 (37528) LWI%: 13.533 (447741) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9489) SWI%: 4.077 (134897) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.406 (46525) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10279) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1930) bned%: 0.000 (0) bneid%: 13.856 (458422) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (23815) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4149) DIV%: 0.012 (396) FPUN%: 1.482 (49031) FPRSUB%: 3.699 (122367) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.950 (97609) FPGE%: 1.029 (34028) SYNC%: 0.000 (0) NOP%: 8.790 (290823) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 40 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 54 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 38962 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1456 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48843 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11408 XORI 0 MULI 9083 LW 0 LWI 141731 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 30 FPUN 0 FPRSUB 1 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5704 --Total thread-cycles: 4096800 --total thread-cycles issued: 3017621 (73.658001%) --iCache conflicts: 112880 (2.755321%) --thread*cycles of FU dependence: 252153 (6.154877%) --thread*cycles of data dependence: 207440 (5.063464%) --iCache cycles*banks: 4096800 (80.757567% used) Issue breakdown: --thread*cycles of issue worked: 3017621 (73.658001%) --thread*cycles of issue failed: 788356 (19.243214%) --thread*cycles of issue NOP/other: 290823 (7.098784%) Number of thread-cycles not ready: 207440 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3308444 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 7 4: 8 5: 9 6: 7 7: 5 8: 8 9: 7 10: 7 11: 7 12: 8 13: 7 14: 7 15: 6 16: 8 17: 7 18: 8 19: 8 20: 7 21: 8 22: 7 23: 6 24: 7 25: 7 26: 8 27: 7 28: 7 29: 6 30: 6 31: 7 <=== Core 77 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94810 in-flight CPI 1.3459 -- Total Cycles 127632 ---- Thread 01 ---- PC 5: Stalled ----- 95439 in-flight CPI 1.3371 -- Total Cycles 127632 ---- Thread 02 ---- PC 5: Stalled ----- 101723 in-flight CPI 1.2544 -- Total Cycles 127632 ---- Thread 03 ---- PC 5: Stalled ----- 101112 in-flight CPI 1.2620 -- Total Cycles 127632 ---- Thread 04 ---- PC 5: Stalled ----- 97938 in-flight CPI 1.3029 -- Total Cycles 127632 ---- Thread 05 ---- PC 5: Stalled ----- 100041 in-flight CPI 1.2756 -- Total Cycles 127632 ---- Thread 06 ---- PC 5: Stalled ----- 97050 in-flight CPI 1.3149 -- Total Cycles 127632 ---- Thread 07 ---- PC 5: Stalled ----- 94952 in-flight CPI 1.3439 -- Total Cycles 127632 ---- Thread 08 ---- PC 5: Stalled ----- 99746 in-flight CPI 1.2794 -- Total Cycles 127632 ---- Thread 09 ---- PC 5: Stalled ----- 95816 in-flight CPI 1.3318 -- Total Cycles 127632 ---- Thread 10 ---- PC 5: Stalled ----- 98810 in-flight CPI 1.2915 -- Total Cycles 127632 ---- Thread 11 ---- PC 5: Stalled ----- 98561 in-flight CPI 1.2947 -- Total Cycles 127632 ---- Thread 12 ---- PC 5: Stalled ----- 97363 in-flight CPI 1.3106 -- Total Cycles 127632 ---- Thread 13 ---- PC 5: Stalled ----- 96709 in-flight CPI 1.3195 -- Total Cycles 127632 ---- Thread 14 ---- PC 5: Stalled ----- 99644 in-flight CPI 1.2806 -- Total Cycles 127632 ---- Thread 15 ---- PC 5: Stalled ----- 94059 in-flight CPI 1.3567 -- Total Cycles 127632 ---- Thread 16 ---- PC 5: Stalled ----- 89531 in-flight CPI 1.4253 -- Total Cycles 127632 ---- Thread 17 ---- PC 5: Stalled ----- 87400 in-flight CPI 1.4601 -- Total Cycles 127632 ---- Thread 18 ---- PC 5: Stalled ----- 95828 in-flight CPI 1.3316 -- Total Cycles 127632 ---- Thread 19 ---- PC 5: Stalled ----- 91278 in-flight CPI 1.3980 -- Total Cycles 127632 ---- Thread 20 ---- PC 5: Stalled ----- 90628 in-flight CPI 1.4081 -- Total Cycles 127632 ---- Thread 21 ---- PC 5: Stalled ----- 96233 in-flight CPI 1.3260 -- Total Cycles 127632 ---- Thread 22 ---- PC 5: Stalled ----- 90821 in-flight CPI 1.4050 -- Total Cycles 127632 ---- Thread 23 ---- PC 5: Stalled ----- 91525 in-flight CPI 1.3943 -- Total Cycles 127632 ---- Thread 24 ---- PC 5: Stalled ----- 91221 in-flight CPI 1.3989 -- Total Cycles 127632 ---- Thread 25 ---- PC 5: Stalled ----- 95062 in-flight CPI 1.3424 -- Total Cycles 127632 ---- Thread 26 ---- PC 5: Stalled ----- 91948 in-flight CPI 1.3878 -- Total Cycles 127632 ---- Thread 27 ---- PC 5: Stalled ----- 94216 in-flight CPI 1.3544 -- Total Cycles 127632 ---- Thread 28 ---- PC 5: Stalled ----- 87794 in-flight CPI 1.4535 -- Total Cycles 127632 ---- Thread 29 ---- PC 5: Stalled ----- 88163 in-flight CPI 1.4474 -- Total Cycles 127632 ---- Thread 30 ---- PC 5: Stalled ----- 92923 in-flight CPI 1.3733 -- Total Cycles 127632 ---- Thread 31 ---- PC 5: Stalled ----- 83919 in-flight CPI 1.5207 -- Total Cycles 127632 Total CPI 0.0422 , IPC 23.6840 -- Total Cycles 127632 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7541 (3.752637%) FPSUB: 0 (0.000000%) FPMUL: 31241 (15.546499%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 78402 (39.015287%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5624 (2.798678%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70202 (34.934711%) DIV: 7671 (3.817330%) FPUN: 0 (0.000000%) FPRSUB: 271 (0.134858%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3313666 total) ADD%: 7.546 (250041) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.542 (51108) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (17845) FPSUB%: 0.000 (0) FPMUL%: 4.746 (157281) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.155 (170824) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (592) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (35180) FPLE%: 0.459 (15202) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.829 (93728) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24740) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.761 (522261) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.182 (39184) ORI%: 1.560 (51690) XORI%: 0.000 (0) MULI%: 3.229 (107008) LW%: 1.141 (37824) LWI%: 13.565 (449497) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9596) SWI%: 4.099 (135833) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.413 (46836) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10348) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1828) bned%: 0.000 (0) bneid%: 13.867 (459498) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.725 (24014) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (3988) DIV%: 0.013 (416) FPUN%: 1.495 (49540) FPRSUB%: 3.665 (121452) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.953 (97845) FPGE%: 1.036 (34338) SYNC%: 0.000 (0) NOP%: 8.775 (290779) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 45 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 38665 INTCONV 0 ATOMIC_INC 33 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 11 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1412 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49055 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10608 XORI 0 MULI 9659 LW 0 LWI 142051 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 55 DIV 31 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6842 --Total thread-cycles: 4084224 --total thread-cycles issued: 3022887 (74.013742%) --iCache conflicts: 113697 (2.783809%) --thread*cycles of FU dependence: 252081 (6.172066%) --thread*cycles of data dependence: 200952 (4.920200%) --iCache cycles*banks: 4084224 (81.134091% used) Issue breakdown: --thread*cycles of issue worked: 3022887 (74.013742%) --thread*cycles of issue failed: 770558 (18.866693%) --thread*cycles of issue NOP/other: 290779 (7.119565%) Number of thread-cycles not ready: 200952 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3313666 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 9 3: 9 4: 8 5: 8 6: 7 7: 7 8: 7 9: 8 10: 7 11: 8 12: 8 13: 8 14: 9 15: 7 16: 7 17: 6 18: 9 19: 7 20: 6 21: 8 22: 9 23: 7 24: 8 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 6 <=== Core 78 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101266 in-flight CPI 1.5023 -- Total Cycles 152165 ---- Thread 01 ---- PC 5: Stalled ----- 103832 in-flight CPI 1.4651 -- Total Cycles 152165 ---- Thread 02 ---- PC 5: Stalled ----- 97991 in-flight CPI 1.5525 -- Total Cycles 152165 ---- Thread 03 ---- PC 5: Stalled ----- 107018 in-flight CPI 1.4217 -- Total Cycles 152165 ---- Thread 04 ---- PC 5: Stalled ----- 101422 in-flight CPI 1.5000 -- Total Cycles 152165 ---- Thread 05 ---- PC 5: Stalled ----- 96991 in-flight CPI 1.5686 -- Total Cycles 152165 ---- Thread 06 ---- PC 5: Stalled ----- 103137 in-flight CPI 1.4750 -- Total Cycles 152165 ---- Thread 07 ---- PC 5: Stalled ----- 101522 in-flight CPI 1.4986 -- Total Cycles 152165 ---- Thread 08 ---- PC 5: Stalled ----- 103698 in-flight CPI 1.4671 -- Total Cycles 152165 ---- Thread 09 ---- PC 5: Stalled ----- 94606 in-flight CPI 1.6081 -- Total Cycles 152165 ---- Thread 10 ---- PC 5: Stalled ----- 98170 in-flight CPI 1.5497 -- Total Cycles 152165 ---- Thread 11 ---- PC 5: Stalled ----- 93257 in-flight CPI 1.6314 -- Total Cycles 152165 ---- Thread 12 ---- PC 5: Stalled ----- 91623 in-flight CPI 1.6605 -- Total Cycles 152165 ---- Thread 13 ---- PC 5: Stalled ----- 102113 in-flight CPI 1.4898 -- Total Cycles 152165 ---- Thread 14 ---- PC 5: Stalled ----- 96369 in-flight CPI 1.5787 -- Total Cycles 152165 ---- Thread 15 ---- PC 5: Stalled ----- 97702 in-flight CPI 1.5572 -- Total Cycles 152165 ---- Thread 16 ---- PC 5: Stalled ----- 98345 in-flight CPI 1.5470 -- Total Cycles 152165 ---- Thread 17 ---- PC 5: Stalled ----- 98810 in-flight CPI 1.5397 -- Total Cycles 152165 ---- Thread 18 ---- PC 5: Stalled ----- 97070 in-flight CPI 1.5673 -- Total Cycles 152165 ---- Thread 19 ---- PC 5: Stalled ----- 94312 in-flight CPI 1.6131 -- Total Cycles 152165 ---- Thread 20 ---- PC 5: Stalled ----- 96765 in-flight CPI 1.5722 -- Total Cycles 152165 ---- Thread 21 ---- PC 5: Stalled ----- 90208 in-flight CPI 1.6865 -- Total Cycles 152165 ---- Thread 22 ---- PC 5: Stalled ----- 94720 in-flight CPI 1.6062 -- Total Cycles 152165 ---- Thread 23 ---- PC 5: Stalled ----- 87068 in-flight CPI 1.7474 -- Total Cycles 152165 ---- Thread 24 ---- PC 5: Stalled ----- 86647 in-flight CPI 1.7559 -- Total Cycles 152165 ---- Thread 25 ---- PC 5: Stalled ----- 93463 in-flight CPI 1.6277 -- Total Cycles 152165 ---- Thread 26 ---- PC 5: Stalled ----- 92872 in-flight CPI 1.6381 -- Total Cycles 152165 ---- Thread 27 ---- PC 5: Stalled ----- 93344 in-flight CPI 1.6298 -- Total Cycles 152165 ---- Thread 28 ---- PC 5: Stalled ----- 89799 in-flight CPI 1.6941 -- Total Cycles 152165 ---- Thread 29 ---- PC 5: Stalled ----- 88717 in-flight CPI 1.7148 -- Total Cycles 152165 ---- Thread 30 ---- PC 5: Stalled ----- 88754 in-flight CPI 1.7141 -- Total Cycles 152165 ---- Thread 31 ---- PC 5: Stalled ----- 83827 in-flight CPI 1.8150 -- Total Cycles 152165 Total CPI 0.0496 , IPC 20.1493 -- Total Cycles 152165 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7378 (3.719369%) FPSUB: 0 (0.000000%) FPMUL: 31241 (15.749091%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 77427 (39.032198%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5785 (2.916312%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68478 (34.520863%) DIV: 7791 (3.927569%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.134599%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3361960 total) ADD%: 7.505 (252329) SUB%: 0.000 (0) MUL%: 0.006 (211) BITOR%: 1.545 (51941) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.522 (17557) FPSUB%: 0.000 (0) FPMUL%: 4.703 (158129) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (633) FPMAX%: 0.019 (633) LOAD%: 5.117 (172039) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (603) FPINV%: 0.000 (0) FPCONV%: 0.020 (665) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.059 (35605) FPLE%: 0.459 (15433) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (633) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.834 (95286) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.736 (24748) CMPU%: 0.000 (0) RSUB%: 0.006 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.769 (530150) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.180 (39678) ORI%: 1.557 (52334) XORI%: 0.000 (0) MULI%: 3.239 (108902) LW%: 1.144 (38452) LWI%: 13.606 (457413) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9764) SWI%: 4.101 (137862) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.416 (47609) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10486) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1809) bned%: 0.000 (0) bneid%: 13.906 (467514) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.729 (24507) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.116 (3884) DIV%: 0.013 (422) FPUN%: 1.502 (50504) FPRSUB%: 3.653 (122816) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (65) FPGT%: 2.970 (99834) FPGE%: 1.043 (35071) SYNC%: 0.000 (0) NOP%: 8.801 (295889) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 40 FPCMPLT 0 FPMIN 0 FPMAX 410 LOAD 38345 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1476 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 50037 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 10472 XORI 0 MULI 10224 LW 0 LWI 144464 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 53 DIV 33 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.1495 --Total thread-cycles: 4869280 --total thread-cycles issued: 3066071 (62.967646%) --iCache conflicts: 115351 (2.368954%) --thread*cycles of FU dependence: 255652 (5.250304%) --thread*cycles of data dependence: 198367 (4.073847%) --iCache cycles*banks: 4869280 (69.044951% used) Issue breakdown: --thread*cycles of issue worked: 3066071 (62.967646%) --thread*cycles of issue failed: 1507320 (30.955706%) --thread*cycles of issue NOP/other: 295889 (6.076648%) Number of thread-cycles not ready: 198367 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3361960 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 10 2: 8 3: 6 4: 8 5: 7 6: 9 7: 8 8: 8 9: 7 10: 9 11: 7 12: 6 13: 9 14: 7 15: 7 16: 8 17: 8 18: 8 19: 8 20: 8 21: 7 22: 7 23: 6 24: 6 25: 8 26: 8 27: 8 28: 8 29: 8 30: 7 31: 5 <=== Core 79 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102757 in-flight CPI 1.2303 -- Total Cycles 126442 ---- Thread 01 ---- PC 5: Stalled ----- 95363 in-flight CPI 1.3257 -- Total Cycles 126442 ---- Thread 02 ---- PC 5: Stalled ----- 98584 in-flight CPI 1.2823 -- Total Cycles 126442 ---- Thread 03 ---- PC 5: Stalled ----- 94657 in-flight CPI 1.3356 -- Total Cycles 126442 ---- Thread 04 ---- PC 5: Stalled ----- 96304 in-flight CPI 1.3128 -- Total Cycles 126442 ---- Thread 05 ---- PC 5: Stalled ----- 98276 in-flight CPI 1.2863 -- Total Cycles 126442 ---- Thread 06 ---- PC 5: Stalled ----- 101366 in-flight CPI 1.2471 -- Total Cycles 126442 ---- Thread 07 ---- PC 5: Stalled ----- 101581 in-flight CPI 1.2445 -- Total Cycles 126442 ---- Thread 08 ---- PC 5: Stalled ----- 94779 in-flight CPI 1.3338 -- Total Cycles 126442 ---- Thread 09 ---- PC 5: Stalled ----- 94930 in-flight CPI 1.3317 -- Total Cycles 126442 ---- Thread 10 ---- PC 5: Stalled ----- 97165 in-flight CPI 1.3011 -- Total Cycles 126442 ---- Thread 11 ---- PC 5: Stalled ----- 99585 in-flight CPI 1.2694 -- Total Cycles 126442 ---- Thread 12 ---- PC 5: Stalled ----- 98732 in-flight CPI 1.2804 -- Total Cycles 126442 ---- Thread 13 ---- PC 5: Stalled ----- 95345 in-flight CPI 1.3259 -- Total Cycles 126442 ---- Thread 14 ---- PC 5: Stalled ----- 91595 in-flight CPI 1.3802 -- Total Cycles 126442 ---- Thread 15 ---- PC 5: Stalled ----- 95861 in-flight CPI 1.3188 -- Total Cycles 126442 ---- Thread 16 ---- PC 5: Stalled ----- 98033 in-flight CPI 1.2895 -- Total Cycles 126442 ---- Thread 17 ---- PC 5: Stalled ----- 88503 in-flight CPI 1.4285 -- Total Cycles 126442 ---- Thread 18 ---- PC 5: Stalled ----- 89012 in-flight CPI 1.4203 -- Total Cycles 126442 ---- Thread 19 ---- PC 5: Stalled ----- 97989 in-flight CPI 1.2902 -- Total Cycles 126442 ---- Thread 20 ---- PC 5: Stalled ----- 90338 in-flight CPI 1.3994 -- Total Cycles 126442 ---- Thread 21 ---- PC 5: Stalled ----- 89342 in-flight CPI 1.4151 -- Total Cycles 126442 ---- Thread 22 ---- PC 5: Stalled ----- 93783 in-flight CPI 1.3480 -- Total Cycles 126442 ---- Thread 23 ---- PC 5: Stalled ----- 97428 in-flight CPI 1.2975 -- Total Cycles 126442 ---- Thread 24 ---- PC 5: Stalled ----- 92922 in-flight CPI 1.3605 -- Total Cycles 126442 ---- Thread 25 ---- PC 5: Stalled ----- 88262 in-flight CPI 1.4324 -- Total Cycles 126442 ---- Thread 26 ---- PC 5: Stalled ----- 94517 in-flight CPI 1.3375 -- Total Cycles 126442 ---- Thread 27 ---- PC 5: Stalled ----- 93047 in-flight CPI 1.3586 -- Total Cycles 126442 ---- Thread 28 ---- PC 5: Stalled ----- 88425 in-flight CPI 1.4297 -- Total Cycles 126442 ---- Thread 29 ---- PC 5: Stalled ----- 88160 in-flight CPI 1.4340 -- Total Cycles 126442 ---- Thread 30 ---- PC 5: Stalled ----- 90747 in-flight CPI 1.3931 -- Total Cycles 126442 ---- Thread 31 ---- PC 5: Stalled ----- 87683 in-flight CPI 1.4418 -- Total Cycles 126442 Total CPI 0.0418 , IPC 23.9289 -- Total Cycles 126442 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7861 (4.077155%) FPSUB: 0 (0.000000%) FPMUL: 31805 (16.495856%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 68142 (35.342261%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5625 (2.917440%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71694 (37.184527%) DIV: 7422 (3.849465%) FPUN: 0 (0.000000%) FPRSUB: 257 (0.133295%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3317095 total) ADD%: 7.433 (246547) SUB%: 0.000 (0) MUL%: 0.006 (201) BITOR%: 1.534 (50880) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.558 (18501) FPSUB%: 0.000 (0) FPMUL%: 4.811 (159601) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (603) FPMAX%: 0.018 (603) LOAD%: 5.184 (171955) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (584) FPINV%: 0.000 (0) FPCONV%: 0.019 (635) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35401) FPLE%: 0.457 (15167) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (603) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.826 (93756) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (24991) CMPU%: 0.000 (0) RSUB%: 0.006 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.772 (523170) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.184 (39266) ORI%: 1.573 (52170) XORI%: 0.000 (0) MULI%: 3.219 (106788) LW%: 1.140 (37824) LWI%: 13.554 (449609) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9565) SWI%: 4.093 (135762) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.413 (46887) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10328) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1912) bned%: 0.000 (0) bneid%: 13.860 (459750) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.724 (24000) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4081) DIV%: 0.012 (402) FPUN%: 1.484 (49237) FPRSUB%: 3.687 (122294) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (79) FPGT%: 2.952 (97922) FPGE%: 1.027 (34070) SYNC%: 0.000 (0) NOP%: 8.785 (291421) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 24 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 59 FPCMPLT 0 FPMIN 0 FPMAX 389 LOAD 39293 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1152 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48990 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11177 XORI 0 MULI 9801 LW 0 LWI 142296 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 76 DIV 33 FPUN 0 FPRSUB 0 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9292 --Total thread-cycles: 4046144 --total thread-cycles issued: 3025674 (74.779197%) --iCache conflicts: 114251 (2.823701%) --thread*cycles of FU dependence: 253366 (6.261913%) --thread*cycles of data dependence: 192806 (4.765179%) --iCache cycles*banks: 4046144 (81.982426% used) Issue breakdown: --thread*cycles of issue worked: 3025674 (74.779197%) --thread*cycles of issue failed: 729049 (18.018365%) --thread*cycles of issue NOP/other: 291421 (7.202438%) Number of thread-cycles not ready: 192806 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3317095 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 9 3: 7 4: 6 5: 8 6: 8 7: 8 8: 7 9: 7 10: 8 11: 8 12: 8 13: 7 14: 7 15: 7 16: 8 17: 6 18: 6 19: 7 20: 8 21: 5 22: 8 23: 8 24: 7 25: 6 26: 8 27: 8 28: 7 29: 7 30: 7 31: 7 ## Core 0 ## Module Utilization FP AddSub: 10.90 FP MinMax: 0.02 FP Compare: 4.45 Int AddSub: 17.07 FP Mul: 12.39 Int Mul: 32.59 FP InvSqrt: 0.35 FP Div: 2.87 Conversion Unit: 0.01 ## Core 1 ## Module Utilization FP AddSub: 13.00 FP MinMax: 0.03 FP Compare: 5.44 Int AddSub: 20.83 FP Mul: 14.66 Int Mul: 40.22 FP InvSqrt: 0.44 FP Div: 3.21 Conversion Unit: 0.02 ## Core 2 ## Module Utilization FP AddSub: 13.48 FP MinMax: 0.03 FP Compare: 5.71 Int AddSub: 21.90 FP Mul: 15.12 Int Mul: 42.46 FP InvSqrt: 0.48 FP Div: 3.23 Conversion Unit: 0.02 ## Core 3 ## Module Utilization FP AddSub: 12.38 FP MinMax: 0.03 FP Compare: 5.10 Int AddSub: 19.48 FP Mul: 14.04 Int Mul: 37.53 FP InvSqrt: 0.41 FP Div: 3.20 Conversion Unit: 0.01 ## Core 4 ## Module Utilization FP AddSub: 13.67 FP MinMax: 0.03 FP Compare: 5.68 Int AddSub: 21.78 FP Mul: 15.46 Int Mul: 41.87 FP InvSqrt: 0.48 FP Div: 3.50 Conversion Unit: 0.02 ## Core 5 ## Module Utilization FP AddSub: 13.68 FP MinMax: 0.03 FP Compare: 5.70 Int AddSub: 21.82 FP Mul: 15.45 Int Mul: 42.08 FP InvSqrt: 0.47 FP Div: 3.45 Conversion Unit: 0.02 ## Core 6 ## Module Utilization FP AddSub: 13.61 FP MinMax: 0.03 FP Compare: 5.69 Int AddSub: 21.83 FP Mul: 15.37 Int Mul: 41.97 FP InvSqrt: 0.46 FP Div: 3.38 Conversion Unit: 0.02 ## Core 7 ## Module Utilization FP AddSub: 13.36 FP MinMax: 0.03 FP Compare: 5.75 Int AddSub: 22.00 FP Mul: 14.95 Int Mul: 42.64 FP InvSqrt: 0.48 FP Div: 3.10 Conversion Unit: 0.02 ## Core 8 ## Module Utilization FP AddSub: 11.67 FP MinMax: 0.03 FP Compare: 4.77 Int AddSub: 18.30 FP Mul: 13.25 Int Mul: 35.10 FP InvSqrt: 0.38 FP Div: 3.06 Conversion Unit: 0.01 ## Core 9 ## Module Utilization FP AddSub: 13.85 FP MinMax: 0.03 FP Compare: 5.77 Int AddSub: 22.02 FP Mul: 15.66 Int Mul: 42.51 FP InvSqrt: 0.47 FP Div: 3.46 Conversion Unit: 0.02 ## Core 10 ## Module Utilization FP AddSub: 11.77 FP MinMax: 0.03 FP Compare: 4.84 Int AddSub: 18.55 FP Mul: 13.34 Int Mul: 35.71 FP InvSqrt: 0.39 FP Div: 3.02 Conversion Unit: 0.01 ## Core 11 ## Module Utilization FP AddSub: 12.56 FP MinMax: 0.03 FP Compare: 5.12 Int AddSub: 19.61 FP Mul: 14.27 Int Mul: 37.69 FP InvSqrt: 0.44 FP Div: 3.32 Conversion Unit: 0.01 ## Core 12 ## Module Utilization FP AddSub: 13.47 FP MinMax: 0.03 FP Compare: 5.61 Int AddSub: 21.56 FP Mul: 15.19 Int Mul: 41.71 FP InvSqrt: 0.46 FP Div: 3.34 Conversion Unit: 0.02 ## Core 13 ## Module Utilization FP AddSub: 13.80 FP MinMax: 0.03 FP Compare: 5.63 Int AddSub: 21.55 FP Mul: 15.70 Int Mul: 41.42 FP InvSqrt: 0.47 FP Div: 3.61 Conversion Unit: 0.02 ## Core 14 ## Module Utilization FP AddSub: 14.00 FP MinMax: 0.03 FP Compare: 5.78 Int AddSub: 22.15 FP Mul: 15.84 Int Mul: 42.69 FP InvSqrt: 0.49 FP Div: 3.58 Conversion Unit: 0.02 ## Core 15 ## Module Utilization FP AddSub: 13.69 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.67 FP Mul: 15.52 Int Mul: 41.61 FP InvSqrt: 0.47 FP Div: 3.51 Conversion Unit: 0.02 ## Core 16 ## Module Utilization FP AddSub: 13.79 FP MinMax: 0.03 FP Compare: 5.72 Int AddSub: 21.78 FP Mul: 15.64 Int Mul: 41.93 FP InvSqrt: 0.48 FP Div: 3.55 Conversion Unit: 0.02 ## Core 17 ## Module Utilization FP AddSub: 11.18 FP MinMax: 0.02 FP Compare: 4.42 Int AddSub: 16.93 FP Mul: 12.82 Int Mul: 32.27 FP InvSqrt: 0.35 FP Div: 3.11 Conversion Unit: 0.01 ## Core 18 ## Module Utilization FP AddSub: 12.76 FP MinMax: 0.03 FP Compare: 5.15 Int AddSub: 19.72 FP Mul: 14.57 Int Mul: 37.62 FP InvSqrt: 0.40 FP Div: 3.40 Conversion Unit: 0.01 ## Core 19 ## Module Utilization FP AddSub: 13.54 FP MinMax: 0.03 FP Compare: 5.53 Int AddSub: 21.26 FP Mul: 15.37 Int Mul: 40.76 FP InvSqrt: 0.44 FP Div: 3.54 Conversion Unit: 0.02 ## Core 20 ## Module Utilization FP AddSub: 10.95 FP MinMax: 0.02 FP Compare: 4.47 Int AddSub: 17.13 FP Mul: 12.44 Int Mul: 32.86 FP InvSqrt: 0.36 FP Div: 2.87 Conversion Unit: 0.01 ## Core 21 ## Module Utilization FP AddSub: 13.38 FP MinMax: 0.03 FP Compare: 5.49 Int AddSub: 21.01 FP Mul: 15.20 Int Mul: 40.29 FP InvSqrt: 0.45 FP Div: 3.49 Conversion Unit: 0.02 ## Core 22 ## Module Utilization FP AddSub: 13.67 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.62 FP Mul: 15.50 Int Mul: 41.55 FP InvSqrt: 0.48 FP Div: 3.50 Conversion Unit: 0.02 ## Core 23 ## Module Utilization FP AddSub: 13.68 FP MinMax: 0.03 FP Compare: 5.76 Int AddSub: 22.09 FP Mul: 15.39 Int Mul: 42.73 FP InvSqrt: 0.47 FP Div: 3.33 Conversion Unit: 0.02 ## Core 24 ## Module Utilization FP AddSub: 13.68 FP MinMax: 0.03 FP Compare: 5.66 Int AddSub: 21.63 FP Mul: 15.51 Int Mul: 41.55 FP InvSqrt: 0.46 FP Div: 3.48 Conversion Unit: 0.02 ## Core 25 ## Module Utilization FP AddSub: 13.74 FP MinMax: 0.03 FP Compare: 5.56 Int AddSub: 21.25 FP Mul: 15.67 Int Mul: 40.68 FP InvSqrt: 0.44 FP Div: 3.70 Conversion Unit: 0.02 ## Core 26 ## Module Utilization FP AddSub: 13.78 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.85 FP Mul: 15.61 Int Mul: 42.02 FP InvSqrt: 0.46 FP Div: 3.50 Conversion Unit: 0.02 ## Core 27 ## Module Utilization FP AddSub: 13.75 FP MinMax: 0.03 FP Compare: 5.72 Int AddSub: 21.94 FP Mul: 15.54 Int Mul: 42.38 FP InvSqrt: 0.48 FP Div: 3.42 Conversion Unit: 0.02 ## Core 28 ## Module Utilization FP AddSub: 13.55 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.58 FP Mul: 15.32 Int Mul: 41.52 FP InvSqrt: 0.45 FP Div: 3.40 Conversion Unit: 0.02 ## Core 29 ## Module Utilization FP AddSub: 14.12 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.49 FP Mul: 16.19 Int Mul: 40.89 FP InvSqrt: 0.44 FP Div: 3.91 Conversion Unit: 0.02 ## Core 30 ## Module Utilization FP AddSub: 13.83 FP MinMax: 0.03 FP Compare: 5.76 Int AddSub: 22.00 FP Mul: 15.65 Int Mul: 42.45 FP InvSqrt: 0.48 FP Div: 3.48 Conversion Unit: 0.02 ## Core 31 ## Module Utilization FP AddSub: 13.92 FP MinMax: 0.03 FP Compare: 5.72 Int AddSub: 21.97 FP Mul: 15.78 Int Mul: 42.26 FP InvSqrt: 0.47 FP Div: 3.57 Conversion Unit: 0.02 ## Core 32 ## Module Utilization FP AddSub: 13.67 FP MinMax: 0.03 FP Compare: 5.70 Int AddSub: 21.88 FP Mul: 15.43 Int Mul: 42.30 FP InvSqrt: 0.47 FP Div: 3.43 Conversion Unit: 0.02 ## Core 33 ## Module Utilization FP AddSub: 12.22 FP MinMax: 0.03 FP Compare: 5.03 Int AddSub: 19.27 FP Mul: 13.85 Int Mul: 37.04 FP InvSqrt: 0.41 FP Div: 3.13 Conversion Unit: 0.01 ## Core 34 ## Module Utilization FP AddSub: 13.62 FP MinMax: 0.03 FP Compare: 5.58 Int AddSub: 21.36 FP Mul: 15.48 Int Mul: 40.85 FP InvSqrt: 0.43 FP Div: 3.54 Conversion Unit: 0.01 ## Core 35 ## Module Utilization FP AddSub: 12.47 FP MinMax: 0.03 FP Compare: 5.02 Int AddSub: 19.26 FP Mul: 14.23 Int Mul: 36.65 FP InvSqrt: 0.38 FP Div: 3.40 Conversion Unit: 0.01 ## Core 36 ## Module Utilization FP AddSub: 13.62 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.68 FP Mul: 15.40 Int Mul: 41.76 FP InvSqrt: 0.46 FP Div: 3.41 Conversion Unit: 0.02 ## Core 37 ## Module Utilization FP AddSub: 13.71 FP MinMax: 0.03 FP Compare: 5.72 Int AddSub: 21.92 FP Mul: 15.49 Int Mul: 42.22 FP InvSqrt: 0.47 FP Div: 3.44 Conversion Unit: 0.02 ## Core 38 ## Module Utilization FP AddSub: 13.75 FP MinMax: 0.03 FP Compare: 5.71 Int AddSub: 21.84 FP Mul: 15.58 Int Mul: 41.89 FP InvSqrt: 0.46 FP Div: 3.50 Conversion Unit: 0.02 ## Core 39 ## Module Utilization FP AddSub: 12.63 FP MinMax: 0.03 FP Compare: 5.21 Int AddSub: 20.00 FP Mul: 14.30 Int Mul: 38.47 FP InvSqrt: 0.41 FP Div: 3.20 Conversion Unit: 0.01 ## Core 40 ## Module Utilization FP AddSub: 13.72 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.62 FP Mul: 15.56 Int Mul: 41.47 FP InvSqrt: 0.45 FP Div: 3.53 Conversion Unit: 0.02 ## Core 41 ## Module Utilization FP AddSub: 13.57 FP MinMax: 0.03 FP Compare: 5.58 Int AddSub: 21.27 FP Mul: 15.41 Int Mul: 40.93 FP InvSqrt: 0.44 FP Div: 3.49 Conversion Unit: 0.02 ## Core 42 ## Module Utilization FP AddSub: 13.74 FP MinMax: 0.03 FP Compare: 5.66 Int AddSub: 21.70 FP Mul: 15.58 Int Mul: 41.72 FP InvSqrt: 0.45 FP Div: 3.52 Conversion Unit: 0.02 ## Core 43 ## Module Utilization FP AddSub: 13.88 FP MinMax: 0.03 FP Compare: 5.61 Int AddSub: 21.50 FP Mul: 15.82 Int Mul: 41.06 FP InvSqrt: 0.45 FP Div: 3.74 Conversion Unit: 0.02 ## Core 44 ## Module Utilization FP AddSub: 13.68 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.72 FP Mul: 15.49 Int Mul: 41.83 FP InvSqrt: 0.46 FP Div: 3.46 Conversion Unit: 0.02 ## Core 45 ## Module Utilization FP AddSub: 12.96 FP MinMax: 0.03 FP Compare: 5.15 Int AddSub: 19.71 FP Mul: 14.85 Int Mul: 37.43 FP InvSqrt: 0.39 FP Div: 3.62 Conversion Unit: 0.01 ## Core 46 ## Module Utilization FP AddSub: 12.35 FP MinMax: 0.03 FP Compare: 5.10 Int AddSub: 19.51 FP Mul: 14.01 Int Mul: 37.55 FP InvSqrt: 0.41 FP Div: 3.13 Conversion Unit: 0.01 ## Core 47 ## Module Utilization FP AddSub: 13.77 FP MinMax: 0.03 FP Compare: 5.69 Int AddSub: 21.78 FP Mul: 15.58 Int Mul: 41.99 FP InvSqrt: 0.48 FP Div: 3.54 Conversion Unit: 0.02 ## Core 48 ## Module Utilization FP AddSub: 12.14 FP MinMax: 0.03 FP Compare: 4.86 Int AddSub: 18.57 FP Mul: 13.88 Int Mul: 35.56 FP InvSqrt: 0.39 FP Div: 3.34 Conversion Unit: 0.01 ## Core 49 ## Module Utilization FP AddSub: 13.78 FP MinMax: 0.03 FP Compare: 5.61 Int AddSub: 21.51 FP Mul: 15.66 Int Mul: 41.36 FP InvSqrt: 0.45 FP Div: 3.60 Conversion Unit: 0.02 ## Core 50 ## Module Utilization FP AddSub: 13.65 FP MinMax: 0.03 FP Compare: 5.70 Int AddSub: 21.86 FP Mul: 15.40 Int Mul: 42.29 FP InvSqrt: 0.48 FP Div: 3.41 Conversion Unit: 0.02 ## Core 51 ## Module Utilization FP AddSub: 12.53 FP MinMax: 0.03 FP Compare: 5.19 Int AddSub: 19.86 FP Mul: 14.19 Int Mul: 38.25 FP InvSqrt: 0.43 FP Div: 3.22 Conversion Unit: 0.01 ## Core 52 ## Module Utilization FP AddSub: 10.36 FP MinMax: 0.02 FP Compare: 4.19 Int AddSub: 16.03 FP Mul: 11.82 Int Mul: 30.55 FP InvSqrt: 0.33 FP Div: 2.82 Conversion Unit: 0.01 ## Core 53 ## Module Utilization FP AddSub: 12.47 FP MinMax: 0.02 FP Compare: 4.96 Int AddSub: 19.04 FP Mul: 14.28 Int Mul: 36.11 FP InvSqrt: 0.38 FP Div: 3.47 Conversion Unit: 0.01 ## Core 54 ## Module Utilization FP AddSub: 13.70 FP MinMax: 0.03 FP Compare: 5.66 Int AddSub: 21.74 FP Mul: 15.52 Int Mul: 41.80 FP InvSqrt: 0.46 FP Div: 3.49 Conversion Unit: 0.02 ## Core 55 ## Module Utilization FP AddSub: 12.71 FP MinMax: 0.03 FP Compare: 5.27 Int AddSub: 20.25 FP Mul: 14.37 Int Mul: 39.06 FP InvSqrt: 0.42 FP Div: 3.19 Conversion Unit: 0.01 ## Core 56 ## Module Utilization FP AddSub: 13.65 FP MinMax: 0.03 FP Compare: 5.76 Int AddSub: 22.05 FP Mul: 15.36 Int Mul: 42.69 FP InvSqrt: 0.48 FP Div: 3.31 Conversion Unit: 0.02 ## Core 57 ## Module Utilization FP AddSub: 13.45 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.67 FP Mul: 15.14 Int Mul: 41.97 FP InvSqrt: 0.45 FP Div: 3.27 Conversion Unit: 0.02 ## Core 58 ## Module Utilization FP AddSub: 11.11 FP MinMax: 0.02 FP Compare: 4.53 Int AddSub: 17.33 FP Mul: 12.63 Int Mul: 33.26 FP InvSqrt: 0.37 FP Div: 2.93 Conversion Unit: 0.01 ## Core 59 ## Module Utilization FP AddSub: 12.92 FP MinMax: 0.03 FP Compare: 5.27 Int AddSub: 20.15 FP Mul: 14.70 Int Mul: 38.52 FP InvSqrt: 0.43 FP Div: 3.43 Conversion Unit: 0.01 ## Core 60 ## Module Utilization FP AddSub: 13.79 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.53 FP Mul: 15.67 Int Mul: 41.38 FP InvSqrt: 0.46 FP Div: 3.58 Conversion Unit: 0.02 ## Core 61 ## Module Utilization FP AddSub: 13.16 FP MinMax: 0.03 FP Compare: 5.37 Int AddSub: 20.59 FP Mul: 14.95 Int Mul: 39.59 FP InvSqrt: 0.43 FP Div: 3.42 Conversion Unit: 0.01 ## Core 62 ## Module Utilization FP AddSub: 13.79 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.57 FP Mul: 15.66 Int Mul: 41.40 FP InvSqrt: 0.45 FP Div: 3.60 Conversion Unit: 0.02 ## Core 63 ## Module Utilization FP AddSub: 13.79 FP MinMax: 0.03 FP Compare: 5.71 Int AddSub: 21.89 FP Mul: 15.61 Int Mul: 42.14 FP InvSqrt: 0.47 FP Div: 3.50 Conversion Unit: 0.02 ## Core 64 ## Module Utilization FP AddSub: 12.57 FP MinMax: 0.03 FP Compare: 5.07 Int AddSub: 19.42 FP Mul: 14.34 Int Mul: 37.14 FP InvSqrt: 0.41 FP Div: 3.39 Conversion Unit: 0.01 ## Core 65 ## Module Utilization FP AddSub: 12.64 FP MinMax: 0.03 FP Compare: 5.16 Int AddSub: 19.73 FP Mul: 14.37 Int Mul: 37.81 FP InvSqrt: 0.41 FP Div: 3.36 Conversion Unit: 0.01 ## Core 66 ## Module Utilization FP AddSub: 13.58 FP MinMax: 0.03 FP Compare: 5.69 Int AddSub: 21.82 FP Mul: 15.31 Int Mul: 42.11 FP InvSqrt: 0.46 FP Div: 3.36 Conversion Unit: 0.02 ## Core 67 ## Module Utilization FP AddSub: 13.41 FP MinMax: 0.03 FP Compare: 5.52 Int AddSub: 21.09 FP Mul: 15.23 Int Mul: 40.45 FP InvSqrt: 0.46 FP Div: 3.50 Conversion Unit: 0.02 ## Core 68 ## Module Utilization FP AddSub: 13.50 FP MinMax: 0.03 FP Compare: 5.50 Int AddSub: 20.99 FP Mul: 15.37 Int Mul: 40.20 FP InvSqrt: 0.44 FP Div: 3.58 Conversion Unit: 0.02 ## Core 69 ## Module Utilization FP AddSub: 13.97 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.70 FP Mul: 15.91 Int Mul: 41.63 FP InvSqrt: 0.46 FP Div: 3.69 Conversion Unit: 0.02 ## Core 70 ## Module Utilization FP AddSub: 13.68 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.49 FP Mul: 15.51 Int Mul: 41.57 FP InvSqrt: 0.47 FP Div: 3.49 Conversion Unit: 0.02 ## Core 71 ## Module Utilization FP AddSub: 13.65 FP MinMax: 0.03 FP Compare: 5.78 Int AddSub: 22.09 FP Mul: 15.36 Int Mul: 42.68 FP InvSqrt: 0.46 FP Div: 3.30 Conversion Unit: 0.02 ## Core 72 ## Module Utilization FP AddSub: 12.78 FP MinMax: 0.03 FP Compare: 5.19 Int AddSub: 19.87 FP Mul: 14.57 Int Mul: 37.99 FP InvSqrt: 0.41 FP Div: 3.39 Conversion Unit: 0.01 ## Core 73 ## Module Utilization FP AddSub: 13.78 FP MinMax: 0.03 FP Compare: 5.63 Int AddSub: 21.56 FP Mul: 15.65 Int Mul: 41.38 FP InvSqrt: 0.45 FP Div: 3.61 Conversion Unit: 0.02 ## Core 74 ## Module Utilization FP AddSub: 13.75 FP MinMax: 0.03 FP Compare: 5.80 Int AddSub: 22.23 FP Mul: 15.48 Int Mul: 43.06 FP InvSqrt: 0.47 FP Div: 3.28 Conversion Unit: 0.02 ## Core 75 ## Module Utilization FP AddSub: 13.54 FP MinMax: 0.03 FP Compare: 5.63 Int AddSub: 21.52 FP Mul: 15.30 Int Mul: 41.54 FP InvSqrt: 0.46 FP Div: 3.43 Conversion Unit: 0.02 ## Core 76 ## Module Utilization FP AddSub: 13.78 FP MinMax: 0.03 FP Compare: 5.64 Int AddSub: 21.64 FP Mul: 15.64 Int Mul: 41.56 FP InvSqrt: 0.45 FP Div: 3.55 Conversion Unit: 0.02 ## Core 77 ## Module Utilization FP AddSub: 13.64 FP MinMax: 0.03 FP Compare: 5.68 Int AddSub: 21.82 FP Mul: 15.40 Int Mul: 42.00 FP InvSqrt: 0.46 FP Div: 3.45 Conversion Unit: 0.02 ## Core 78 ## Module Utilization FP AddSub: 11.53 FP MinMax: 0.03 FP Compare: 4.86 Int AddSub: 18.54 FP Mul: 12.99 Int Mul: 35.85 FP InvSqrt: 0.40 FP Div: 2.83 Conversion Unit: 0.01 ## Core 79 ## Module Utilization FP AddSub: 13.92 FP MinMax: 0.03 FP Compare: 5.73 Int AddSub: 21.96 FP Mul: 15.78 Int Mul: 42.31 FP InvSqrt: 0.46 FP Div: 3.55 Conversion Unit: 0.02 L1 accesses: 13887476 L1 hits: 13130029 L1 misses: 757447 L1 bank conflicts: 2972985 L1 stores: 49152 L1 near hit: 0 L1 hit rate: 0.945458 -= L2 #0 =- L2 accesses: 187725 L2 hits: 161431 L2 misses: 26294 L2 stores: 12414 L2 bank conflicts: 23439 L2 hit rate: 0.859933 L2 memory faults: 449 L2 bandwidth limited stalls: 41496 -= L2 #1 =- L2 accesses: 188360 L2 hits: 162184 L2 misses: 26176 L2 stores: 12243 L2 bank conflicts: 23307 L2 hit rate: 0.861032 L2 memory faults: 424 L2 bandwidth limited stalls: 43675 -= L2 #2 =- L2 accesses: 192185 L2 hits: 166078 L2 misses: 26107 L2 stores: 12228 L2 bank conflicts: 24383 L2 hit rate: 0.864157 L2 memory faults: 461 L2 bandwidth limited stalls: 38883 -= L2 #3 =- L2 accesses: 187352 L2 hits: 162468 L2 misses: 24884 L2 stores: 12267 L2 bank conflicts: 23262 L2 hit rate: 0.867180 L2 memory faults: 491 L2 bandwidth limited stalls: 34295 Bandwidth numbers for 1000MHz clock: register to L1 bandwidth: 321201683782.033510 L1 to L2 bandwidth: 559253955037.468750 L2 to memory bandwidth: 76573966139.328339 Core size: 0.9818 L2 size: 0.0000 4-L2 size: 0.0000 80-core chip size: 78.5458 FPS Statistics: FPS assuming 1000MHz clock: 5782.2185