--load-assembly ../../llvm_trax/examples/project4_noInh/project4_noInh_rt-llvm.s --config-file ../trunk/configs/default.config --model ../trunk/test_models/conference.obj --view-file ../trunk/views/conference.view --light-file ../trunk/lights/conference.light --num-cores 20 --num-thread-procs 32 --num-l2s 4 --num-icaches 2 --num-icache-banks 16
Loading core 0.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 1.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 2.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 3.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 4.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 5.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 6.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 7.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 8.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 9.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 10.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 11.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 12.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 13.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 14.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 15.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 16.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 17.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 18.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 19.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 20.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 21.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 22.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 23.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 24.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 25.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 26.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 27.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 28.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 29.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 30.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 31.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 32.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 33.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 34.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 35.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 36.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 37.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 38.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 39.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 40.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 41.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 42.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 43.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 44.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 45.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 46.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 47.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 48.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 49.
Found Unit FPADD with latency 2 and issue width 8
Found Unit FPMIN with latency 1 and issue width 32
Found Unit FPCMP with latency 1 and issue width 32
Found Unit INTADD with latency 1 and issue width 32
Found Unit FPMUL with latency 2 and issue width 8
Found Unit INTMUL with latency 1 and issue width 2
Found Unit FPINV with latency 20 and issue width 1
Found Unit CONV with latency 1 and issue width 32
Found Unit BLT with latency 1 and issue width 32
Found Unit BITWISE with latency 1 and issue width 32
Found Unit SPHERE with latency 40 and issue width 4
Found Unit DEBUG with latency 1 and issue width 100
Size estimate (HW config): 0.3588
Loading core 50.
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 51. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 52. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 53. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 54. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 55. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 56. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 57. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 58. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 59. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 60. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 61. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 62. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 63. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 64. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 65. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 66. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 67. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 68. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 69. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 70. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 71. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 72. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 73. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 74. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 75. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 76. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 77. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 78. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 79. 
Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Center is 1.5233 1.618 1.7711 Corner is 1.9101 3.20911 0.248412 Across is 1.25037 -1.56095 0 Up is 0.523201 0.419102 1.88431 U is 0.360963 -0.450621 0 V is 0.15104 0.120988 0.543969 radius is 0 Work queue starts at 30 (0x0000001e) FB starts at 32 (0x00000020) FB ends at 49183 (0x0000c01f) loading model ../trunk/test_models/conference.obj MTL file: "../trunk/test_models/conference.mtl" loading material file ../trunk/test_models/conference.mtl Found 43 total materials Found 282664 total triangles vertex min/max = x: (-0.177790, 11.125200) y: (-0.164592, 7.010400) z: (-0.005078, 2.712720) Materials start at 49184 (0x0000c020) Materials end at 50284 (0x0000c46c) Starting BVH build. BVH build complete with 265647 nodes. 
Scene starts at 50285 (0x0000c46d) BVH bounds [-0.177790 -0.164592 -0.005078] [11.125200 7.010400 2.712720] Triangles start at 2175464 (0x002131e8) Scene ends at 11316197 (0x00acabe5) Starting camera at 11316198 (0x00acabe6) Camera ended at 11316220 (0x00acabfc) Background Color 0x00acabfd to 0x00acabff Light at 0x00acac00 to 0x00acac02 Permutation table from 0x00acac03 to 0x00acae02 Hammersley table from 0x00acae03 to 0x00acb002 Memory used: 11317251 (0x00acb003) Image size: 49152 start_wq: 30 start_fb: 32 start_scene: 50288 start_camera: 11316198 start_matls: 49184 start_bg_color: 11316221 start_light: 11316224 start_permutation: 11316227 Loading assembly file ../../llvm_trax/examples/project4_noInh/project4_noInh_rt-llvm.s using 36 registers Number of instructions: 1249 Creating thread 0... Creating thread 1... Creating thread 2... Creating thread 3... Core 0 running... Core 1 running... Creating thread 4... Core 2 running... Core 3 running... Creating thread 5... Core 4 running... Creating thread 6... Core 5 running... Creating thread 7... Core 6 running... Creating thread 8... Creating thread 9... Core 7 running... Creating thread 10... Core 8 running... Core 9 running... Creating thread 11... Creating thread 12... Core 10 running... Creating thread 13... Core 11 running... Core 12 running... Creating thread 14... Core 13 running... Creating thread 15... Core 14 running... Creating thread 16... Core 15 running... Creating thread 17... Creating thread 18... Core 16 running... Core 17 running... Creating thread 19... Core 18 running... Creating thread 20... Creating thread 21... Core 19 running... Core 20 running... Creating thread 22... Creating thread 23... Core 21 running... Core 22 running... Creating thread 24... Core 23 running... Creating thread 25... Creating thread 26... Core 24 running... Core 25 running... Creating thread 27... Creating thread 28... Core 26 running... Core 27 running... Creating thread 29... Creating thread 30... Core 28 running... 
Core 29 running... Creating thread 31... Core 30 running... Creating thread 32... Creating thread 33... Core 31 running... Core 32 running... Creating thread 34... Creating thread 35... Core 33 running... Core 34 running... Creating thread 36... Creating thread 37... Core 35 running... Core 36 running... Creating thread 38... Core 37 running... Creating thread 39... Core 38 running... Creating thread 40... Core 39 running... Creating thread 41... Creating thread 42... Core 40 running... Creating thread 43... Core 41 running... Creating thread 44... Core 42 running... Core 43 running... Creating thread 45... Core 44 running... Creating thread 46... Creating thread 47... Core 45 running... Core 46 running... Creating thread 48... Creating thread 49... Core 47 running... Core 48 running... Creating thread 50... Creating thread 51... Core 49 running... Creating thread 52... Core 50 running... Creating thread 53... Core 51 running... Creating thread 54... Core 52 running... Core 53 running... Creating thread 55... Core 54 running... Creating thread 56... Core 55 running... Creating thread 57... Core 56 running... Creating thread 58... Creating thread 59... Core 57 running... Core 58 running... Creating thread 60... Core 59 running... Creating thread 61... Core 60 running... Creating thread 62... Core 61 running... Creating thread 63... Creating thread 64... Core 62 running... Core 63 running... Creating thread 65... Creating thread 66... Creating thread 67... Core 64 running... Core 65 running... Creating thread 68... Core 66 running... Creating thread 69... Core 68 running... Core 67 running... Creating thread 70... Core 69 running... Creating thread 71... Creating thread 72... Core 70 running... Creating thread 73... Core 71 running... Creating thread 74... Core 72 running... Core 73 running... Creating thread 75... Core 74 running... Creating thread 76... Core 75 running... Creating thread 77... Core 76 running... Creating thread 78... Creating thread 79... 
Core 77 running... Core 78 running... Core 79 running... <=== Core 0 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97197 in-flight CPI 1.3083 -- Total Cycles 127190 ---- Thread 01 ---- PC 5: Stalled ----- 98415 in-flight CPI 1.2921 -- Total Cycles 127190 ---- Thread 02 ---- PC 5: Stalled ----- 95713 in-flight CPI 1.3286 -- Total Cycles 127190 ---- Thread 03 ---- PC 5: Stalled ----- 94111 in-flight CPI 1.3513 -- Total Cycles 127190 ---- Thread 04 ---- PC 5: Stalled ----- 94816 in-flight CPI 1.3412 -- Total Cycles 127190 ---- Thread 05 ---- PC 5: Stalled ----- 96876 in-flight CPI 1.3127 -- Total Cycles 127190 ---- Thread 06 ---- PC 5: Stalled ----- 96924 in-flight CPI 1.3120 -- Total Cycles 127190 ---- Thread 07 ---- PC 5: Stalled ----- 102188 in-flight CPI 1.2444 -- Total Cycles 127190 ---- Thread 08 ---- PC 5: Stalled ----- 102384 in-flight CPI 1.2420 -- Total Cycles 127190 ---- Thread 09 ---- PC 5: Stalled ----- 98911 in-flight CPI 1.2857 -- Total Cycles 127190 ---- Thread 10 ---- PC 5: Stalled ----- 97498 in-flight CPI 1.3043 -- Total Cycles 127190 ---- Thread 11 ---- PC 5: Stalled ----- 97445 in-flight CPI 1.3050 -- Total Cycles 127190 ---- Thread 12 ---- PC 5: Stalled ----- 95244 in-flight CPI 1.3352 -- Total Cycles 127190 ---- Thread 13 ---- PC 5: Stalled ----- 98214 in-flight CPI 1.2948 -- Total Cycles 127190 ---- Thread 14 ---- PC 5: Stalled ----- 97340 in-flight CPI 1.3064 -- Total Cycles 127190 ---- Thread 15 ---- PC 5: Stalled ----- 100505 in-flight CPI 1.2653 -- Total Cycles 127190 ---- Thread 16 ---- PC 5: Stalled ----- 89999 in-flight CPI 1.4130 -- Total Cycles 127190 ---- Thread 17 ---- PC 5: Stalled ----- 96528 in-flight CPI 1.3174 -- Total Cycles 127190 ---- Thread 18 ---- PC 5: Stalled ----- 92279 in-flight CPI 1.3781 -- Total Cycles 127190 ---- Thread 19 ---- PC 5: Stalled ----- 93750 in-flight CPI 1.3565 -- Total Cycles 127190 ---- Thread 20 ---- PC 5: Stalled ----- 97840 in-flight CPI 1.2998 -- Total Cycles 127190 ---- Thread 21 ---- PC 5: 
Stalled ----- 92762 in-flight CPI 1.3709 -- Total Cycles 127190 ---- Thread 22 ---- PC 5: Stalled ----- 95127 in-flight CPI 1.3368 -- Total Cycles 127190 ---- Thread 23 ---- PC 5: Stalled ----- 94704 in-flight CPI 1.3428 -- Total Cycles 127190 ---- Thread 24 ---- PC 5: Stalled ----- 89663 in-flight CPI 1.4182 -- Total Cycles 127190 ---- Thread 25 ---- PC 5: Stalled ----- 96008 in-flight CPI 1.3246 -- Total Cycles 127190 ---- Thread 26 ---- PC 5: Stalled ----- 95609 in-flight CPI 1.3300 -- Total Cycles 127190 ---- Thread 27 ---- PC 5: Stalled ----- 89116 in-flight CPI 1.4269 -- Total Cycles 127190 ---- Thread 28 ---- PC 5: Stalled ----- 92262 in-flight CPI 1.3783 -- Total Cycles 127190 ---- Thread 29 ---- PC 5: Stalled ----- 88331 in-flight CPI 1.4397 -- Total Cycles 127190 ---- Thread 30 ---- PC 5: Stalled ----- 91563 in-flight CPI 1.3889 -- Total Cycles 127190 ---- Thread 31 ---- PC 5: Stalled ----- 89109 in-flight CPI 1.4271 -- Total Cycles 127190 Total CPI 0.0417 , IPC 23.9720 -- Total Cycles 127190 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7763 (3.808174%) FPSUB: 0 (0.000000%) FPMUL: 31619 (15.510839%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79486 (38.992205%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5693 (2.792726%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) 
TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71380 (35.015771%) DIV: 7642 (3.748817%) FPUN: 0 (0.000000%) FPRSUB: 268 (0.131469%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3340966 total) ADD%: 7.404 (247370) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.531 (51157) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.546 (18238) FPSUB%: 0.000 (0) FPMUL%: 4.755 (158867) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) 
FPMAX%: 0.019 (621) LOAD%: 5.146 (171917) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (594) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.059 (35386) FPLE%: 0.455 (15206) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.813 (93977) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24941) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.690 (524189) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39328) ORI%: 1.560 (52128) XORI%: 0.000 (0) MULI%: 3.208 (107182) LW%: 1.135 (37922) LWI%: 13.485 (450536) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9620) SWI%: 4.079 (136292) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.406 (46965) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10383) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1875) bned%: 0.000 (0) bneid%: 13.795 (460879) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23973) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4058) DIV%: 0.012 (414) FPUN%: 1.483 (49544) FPRSUB%: 4.199 (140295) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (63) FPGT%: 2.938 (98150) FPGE%: 1.028 (34338) SYNC%: 0.000 (0) NOP%: 8.737 
(291914) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 5 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 40823 INTCONV 0 ATOMIC_INC 34 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1588 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48774 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 11005 XORI 0 MULI 9743 LW 0 LWI 142386 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 76 DIV 30 FPUN 0 FPRSUB 57 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9722 --Total thread-cycles: 4070080 --total thread-cycles issued: 3049052 (74.913810%) --iCache conflicts: 111708 (2.744614%) --thread*cycles of FU dependence: 255000 (6.265233%) --thread*cycles of data dependence: 203851 (5.008526%) --iCache cycles*banks: 4070080 (82.086789% used) Issue breakdown: --thread*cycles of issue worked: 3049052 (74.913810%) --thread*cycles of issue failed: 729114 (17.913997%) --thread*cycles of issue NOP/other: 291914 (7.172193%) Number of thread-cycles not ready: 203851 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3340966 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 7 4: 7 5: 7 6: 8 7: 8 8: 9 9: 8 10: 8 11: 9 12: 7 13: 8 14: 8 15: 8 16: 7 17: 8 18: 6 19: 7 20: 7 21: 6 22: 8 23: 6 24: 8 25: 7 26: 8 27: 8 28: 8 29: 6 30: 7 31: 7 <=== Core 1 ===> ---- Thread 
00 ---- PC 5: Stalled ----- 103283 in-flight CPI 1.2507 -- Total Cycles 129192 ---- Thread 01 ---- PC 5: Stalled ----- 93976 in-flight CPI 1.3745 -- Total Cycles 129192 ---- Thread 02 ---- PC 5: Stalled ----- 99381 in-flight CPI 1.2998 -- Total Cycles 129192 ---- Thread 03 ---- PC 5: Stalled ----- 101122 in-flight CPI 1.2773 -- Total Cycles 129192 ---- Thread 04 ---- PC 5: Stalled ----- 103187 in-flight CPI 1.2517 -- Total Cycles 129192 ---- Thread 05 ---- PC 5: Stalled ----- 93385 in-flight CPI 1.3831 -- Total Cycles 129192 ---- Thread 06 ---- PC 5: Stalled ----- 99549 in-flight CPI 1.2976 -- Total Cycles 129192 ---- Thread 07 ---- PC 5: Stalled ----- 96474 in-flight CPI 1.3389 -- Total Cycles 129192 ---- Thread 08 ---- PC 5: Stalled ----- 104535 in-flight CPI 1.2356 -- Total Cycles 129192 ---- Thread 09 ---- PC 5: Stalled ----- 94829 in-flight CPI 1.3621 -- Total Cycles 129192 ---- Thread 10 ---- PC 5: Stalled ----- 96360 in-flight CPI 1.3405 -- Total Cycles 129192 ---- Thread 11 ---- PC 5: Stalled ----- 94043 in-flight CPI 1.3735 -- Total Cycles 129192 ---- Thread 12 ---- PC 5: Stalled ----- 98458 in-flight CPI 1.3119 -- Total Cycles 129192 ---- Thread 13 ---- PC 5: Stalled ----- 93706 in-flight CPI 1.3785 -- Total Cycles 129192 ---- Thread 14 ---- PC 5: Stalled ----- 94994 in-flight CPI 1.3598 -- Total Cycles 129192 ---- Thread 15 ---- PC 5: Stalled ----- 90566 in-flight CPI 1.4262 -- Total Cycles 129192 ---- Thread 16 ---- PC 5: Stalled ----- 96015 in-flight CPI 1.3453 -- Total Cycles 129192 ---- Thread 17 ---- PC 5: Stalled ----- 98159 in-flight CPI 1.3159 -- Total Cycles 129192 ---- Thread 18 ---- PC 5: Stalled ----- 94064 in-flight CPI 1.3732 -- Total Cycles 129192 ---- Thread 19 ---- PC 5: Stalled ----- 87853 in-flight CPI 1.4703 -- Total Cycles 129192 ---- Thread 20 ---- PC 5: Stalled ----- 92639 in-flight CPI 1.3943 -- Total Cycles 129192 ---- Thread 21 ---- PC 5: Stalled ----- 93925 in-flight CPI 1.3753 -- Total Cycles 129192 ---- Thread 22 ---- PC 5: 
Stalled ----- 90404 in-flight CPI 1.4287 -- Total Cycles 129192 ---- Thread 23 ---- PC 5: Stalled ----- 97980 in-flight CPI 1.3183 -- Total Cycles 129192 ---- Thread 24 ---- PC 5: Stalled ----- 91431 in-flight CPI 1.4127 -- Total Cycles 129192 ---- Thread 25 ---- PC 5: Stalled ----- 92160 in-flight CPI 1.4015 -- Total Cycles 129192 ---- Thread 26 ---- PC 5: Stalled ----- 86693 in-flight CPI 1.4900 -- Total Cycles 129192 ---- Thread 27 ---- PC 5: Stalled ----- 89786 in-flight CPI 1.4386 -- Total Cycles 129192 ---- Thread 28 ---- PC 5: Stalled ----- 82725 in-flight CPI 1.5615 -- Total Cycles 129192 ---- Thread 29 ---- PC 5: Stalled ----- 92863 in-flight CPI 1.3910 -- Total Cycles 129192 ---- Thread 30 ---- PC 5: Stalled ----- 84342 in-flight CPI 1.5315 -- Total Cycles 129192 ---- Thread 31 ---- PC 5: Stalled ----- 89649 in-flight CPI 1.4408 -- Total Cycles 129192 Total CPI 0.0428 , IPC 23.3689 -- Total Cycles 129192 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8062 (3.671271%) FPSUB: 0 (0.000000%) FPMUL: 32033 (14.587176%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 92781 (42.250577%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5567 (2.535098%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 
(0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73510 (33.474956%) DIV: 7382 (3.361612%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.119309%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3308574 total) ADD%: 7.409 (245134) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.518 (50208) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.569 (18810) FPSUB%: 0.000 (0) FPMUL%: 4.823 (159588) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.175 (171210) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) 
INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (580) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35352) FPLE%: 0.452 (14956) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.797 (92525) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (24850) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.658 (518053) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (38749) ORI%: 1.574 (52062) XORI%: 0.000 (0) MULI%: 3.192 (105600) LW%: 1.128 (37330) LWI%: 13.450 (445015) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9457) SWI%: 4.049 (133951) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.398 (46254) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10229) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2059) bned%: 0.000 (0) bneid%: 13.767 (455501) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23607) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4189) DIV%: 0.012 (400) FPUN%: 1.469 (48589) FPRSUB%: 4.258 (140882) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.936 (97128) FPGE%: 1.017 (33633) SYNC%: 0.000 (0) NOP%: 8.748 (289438) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles 
contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 391 LOAD 39913 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1608 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48127 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 11473 XORI 0 MULI 9084 LW 0 LWI 140849 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 71 DIV 33 FPUN 0 FPRSUB 69 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.3692 --Total thread-cycles: 4134144 --total thread-cycles issued: 3019136 (73.029290%) --iCache conflicts: 108540 (2.625453%) --thread*cycles of FU dependence: 251729 (6.089024%) --thread*cycles of data dependence: 219597 (5.311789%) --iCache cycles*banks: 4134144 (80.031223% used) Issue breakdown: --thread*cycles of issue worked: 3019136 (73.029290%) --thread*cycles of issue failed: 825570 (19.969551%) --thread*cycles of issue NOP/other: 289438 (7.001159%) Number of thread-cycles not ready: 219597 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3308574 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 8 4: 9 5: 8 6: 7 7: 8 8: 9 9: 7 10: 7 11: 7 12: 8 13: 6 14: 6 15: 7 16: 8 17: 7 18: 8 19: 7 20: 8 21: 6 22: 8 23: 9 24: 7 25: 8 26: 6 27: 7 28: 6 29: 6 30: 6 31: 7 <=== Core 2 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101403 in-flight CPI 1.4214 -- Total Cycles 144160 ---- 
Thread 01 ---- PC 5: Stalled ----- 104226 in-flight CPI 1.3829 -- Total Cycles 144160 ---- Thread 02 ---- PC 5: Stalled ----- 95953 in-flight CPI 1.5021 -- Total Cycles 144160 ---- Thread 03 ---- PC 5: Stalled ----- 97249 in-flight CPI 1.4821 -- Total Cycles 144160 ---- Thread 04 ---- PC 5: Stalled ----- 97153 in-flight CPI 1.4836 -- Total Cycles 144160 ---- Thread 05 ---- PC 5: Stalled ----- 102420 in-flight CPI 1.4072 -- Total Cycles 144160 ---- Thread 06 ---- PC 5: Stalled ----- 99794 in-flight CPI 1.4443 -- Total Cycles 144160 ---- Thread 07 ---- PC 5: Stalled ----- 102988 in-flight CPI 1.3995 -- Total Cycles 144160 ---- Thread 08 ---- PC 5: Stalled ----- 92476 in-flight CPI 1.5587 -- Total Cycles 144160 ---- Thread 09 ---- PC 5: Stalled ----- 100007 in-flight CPI 1.4412 -- Total Cycles 144160 ---- Thread 10 ---- PC 5: Stalled ----- 97808 in-flight CPI 1.4736 -- Total Cycles 144160 ---- Thread 11 ---- PC 5: Stalled ----- 92905 in-flight CPI 1.5515 -- Total Cycles 144160 ---- Thread 12 ---- PC 5: Stalled ----- 97118 in-flight CPI 1.4841 -- Total Cycles 144160 ---- Thread 13 ---- PC 5: Stalled ----- 92337 in-flight CPI 1.5610 -- Total Cycles 144160 ---- Thread 14 ---- PC 5: Stalled ----- 97925 in-flight CPI 1.4719 -- Total Cycles 144160 ---- Thread 15 ---- PC 5: Stalled ----- 91881 in-flight CPI 1.5687 -- Total Cycles 144160 ---- Thread 16 ---- PC 5: Stalled ----- 98629 in-flight CPI 1.4614 -- Total Cycles 144160 ---- Thread 17 ---- PC 5: Stalled ----- 96416 in-flight CPI 1.4949 -- Total Cycles 144160 ---- Thread 18 ---- PC 5: Stalled ----- 92976 in-flight CPI 1.5502 -- Total Cycles 144160 ---- Thread 19 ---- PC 5: Stalled ----- 88033 in-flight CPI 1.6374 -- Total Cycles 144160 ---- Thread 20 ---- PC 5: Stalled ----- 93993 in-flight CPI 1.5334 -- Total Cycles 144160 ---- Thread 21 ---- PC 5: Stalled ----- 92061 in-flight CPI 1.5656 -- Total Cycles 144160 ---- Thread 22 ---- PC 5: Stalled ----- 90664 in-flight CPI 1.5898 -- Total Cycles 144160 ---- Thread 23 ---- 
PC 5: Stalled ----- 102398 in-flight CPI 1.4077 -- Total Cycles 144160 ---- Thread 24 ---- PC 5: Stalled ----- 91870 in-flight CPI 1.5689 -- Total Cycles 144160 ---- Thread 25 ---- PC 5: Stalled ----- 93508 in-flight CPI 1.5413 -- Total Cycles 144160 ---- Thread 26 ---- PC 5: Stalled ----- 93557 in-flight CPI 1.5406 -- Total Cycles 144160 ---- Thread 27 ---- PC 5: Stalled ----- 92194 in-flight CPI 1.5633 -- Total Cycles 144160 ---- Thread 28 ---- PC 5: Stalled ----- 92246 in-flight CPI 1.5625 -- Total Cycles 144160 ---- Thread 29 ---- PC 5: Stalled ----- 91897 in-flight CPI 1.5683 -- Total Cycles 144160 ---- Thread 30 ---- PC 5: Stalled ----- 84737 in-flight CPI 1.7010 -- Total Cycles 144160 ---- Thread 31 ---- PC 5: Stalled ----- 84275 in-flight CPI 1.7103 -- Total Cycles 144160 Total CPI 0.0474 , IPC 21.1131 -- Total Cycles 144160 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8395 (3.968629%) FPSUB: 0 (0.000000%) FPMUL: 32813 (15.511927%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79893 (37.768397%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5775 (2.730058%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 
(0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 76751 (36.283056%) DIV: 7640 (3.611713%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.126221%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3335523 total) ADD%: 7.390 (246508) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.520 (50705) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.587 (19595) FPSUB%: 0.000 (0) FPMUL%: 4.874 (162584) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.185 (172963) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) 
ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (598) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.077 (35916) FPLE%: 0.452 (15066) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.779 (92692) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.756 (25214) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.641 (521694) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.168 (38954) ORI%: 1.586 (52890) XORI%: 0.000 (0) MULI%: 3.177 (105982) LW%: 1.122 (37408) LWI%: 13.413 (447393) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9484) SWI%: 4.041 (134782) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.389 (46334) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10279) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.063 (2089) bned%: 0.000 (0) bneid%: 13.760 (458954) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.710 (23680) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.131 (4376) DIV%: 0.012 (414) FPUN%: 1.468 (48952) FPRSUB%: 4.294 (143230) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (79) FPGT%: 2.931 (97752) FPGE%: 1.016 (33886) SYNC%: 0.000 (0) NOP%: 8.748 (291805) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 
FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 40615 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1629 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48378 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11947 XORI 0 MULI 9269 LW 0 LWI 141792 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 79 DIV 32 FPUN 0 FPRSUB 72 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.1133 --Total thread-cycles: 4613120 --total thread-cycles issued: 3043718 (65.979597%) --iCache conflicts: 111240 (2.411383%) --thread*cycles of FU dependence: 254296 (5.512451%) --thread*cycles of data dependence: 211534 (4.585487%) --iCache cycles*banks: 4613120 (72.305836% used) Issue breakdown: --thread*cycles of issue worked: 3043718 (65.979597%) --thread*cycles of issue failed: 1277597 (27.694857%) --thread*cycles of issue NOP/other: 291805 (6.325545%) Number of thread-cycles not ready: 211534 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3335523 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 7 3: 7 4: 6 5: 9 6: 9 7: 8 8: 6 9: 8 10: 8 11: 6 12: 8 13: 7 14: 8 15: 7 16: 8 17: 8 18: 8 19: 5 20: 8 21: 8 22: 7 23: 5 24: 7 25: 9 26: 8 27: 9 28: 7 29: 9 30: 6 31: 6 <=== Core 3 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98970 in-flight CPI 1.3077 -- Total Cycles 129449 ---- Thread 01 ---- PC 5: Stalled ----- 103635 in-flight CPI 1.2488 -- Total Cycles 129449 ---- Thread 02 
---- PC 5: Stalled ----- 100496 in-flight CPI 1.2878 -- Total Cycles 129449 ---- Thread 03 ---- PC 5: Stalled ----- 101077 in-flight CPI 1.2805 -- Total Cycles 129449 ---- Thread 04 ---- PC 5: Stalled ----- 100122 in-flight CPI 1.2926 -- Total Cycles 129449 ---- Thread 05 ---- PC 5: Stalled ----- 100496 in-flight CPI 1.2878 -- Total Cycles 129449 ---- Thread 06 ---- PC 5: Stalled ----- 101155 in-flight CPI 1.2795 -- Total Cycles 129449 ---- Thread 07 ---- PC 5: Stalled ----- 96608 in-flight CPI 1.3397 -- Total Cycles 129449 ---- Thread 08 ---- PC 5: Stalled ----- 102103 in-flight CPI 1.2676 -- Total Cycles 129449 ---- Thread 09 ---- PC 5: Stalled ----- 96997 in-flight CPI 1.3343 -- Total Cycles 129449 ---- Thread 10 ---- PC 5: Stalled ----- 94983 in-flight CPI 1.3626 -- Total Cycles 129449 ---- Thread 11 ---- PC 5: Stalled ----- 99683 in-flight CPI 1.2983 -- Total Cycles 129449 ---- Thread 12 ---- PC 5: Stalled ----- 100727 in-flight CPI 1.2849 -- Total Cycles 129449 ---- Thread 13 ---- PC 5: Stalled ----- 97032 in-flight CPI 1.3338 -- Total Cycles 129449 ---- Thread 14 ---- PC 5: Stalled ----- 98938 in-flight CPI 1.3081 -- Total Cycles 129449 ---- Thread 15 ---- PC 5: Stalled ----- 93661 in-flight CPI 1.3818 -- Total Cycles 129449 ---- Thread 16 ---- PC 5: Stalled ----- 94110 in-flight CPI 1.3752 -- Total Cycles 129449 ---- Thread 17 ---- PC 5: Stalled ----- 99292 in-flight CPI 1.3035 -- Total Cycles 129449 ---- Thread 18 ---- PC 5: Stalled ----- 97355 in-flight CPI 1.3294 -- Total Cycles 129449 ---- Thread 19 ---- PC 5: Stalled ----- 91974 in-flight CPI 1.4072 -- Total Cycles 129449 ---- Thread 20 ---- PC 5: Stalled ----- 99366 in-flight CPI 1.3025 -- Total Cycles 129449 ---- Thread 21 ---- PC 5: Stalled ----- 90929 in-flight CPI 1.4234 -- Total Cycles 129449 ---- Thread 22 ---- PC 5: Stalled ----- 90659 in-flight CPI 1.4276 -- Total Cycles 129449 ---- Thread 23 ---- PC 5: Stalled ----- 95275 in-flight CPI 1.3584 -- Total Cycles 129449 ---- Thread 24 ---- PC 5: 
Stalled ----- 90649 in-flight CPI 1.4278 -- Total Cycles 129449 ---- Thread 25 ---- PC 5: Stalled ----- 91600 in-flight CPI 1.4131 -- Total Cycles 129449 ---- Thread 26 ---- PC 5: Stalled ----- 92239 in-flight CPI 1.4032 -- Total Cycles 129449 ---- Thread 27 ---- PC 5: Stalled ----- 91702 in-flight CPI 1.4114 -- Total Cycles 129449 ---- Thread 28 ---- PC 5: Stalled ----- 91729 in-flight CPI 1.4110 -- Total Cycles 129449 ---- Thread 29 ---- PC 5: Stalled ----- 91969 in-flight CPI 1.4073 -- Total Cycles 129449 ---- Thread 30 ---- PC 5: Stalled ----- 88934 in-flight CPI 1.4553 -- Total Cycles 129449 ---- Thread 31 ---- PC 5: Stalled ----- 85924 in-flight CPI 1.5063 -- Total Cycles 129449 Total CPI 0.0422 , IPC 23.7233 -- Total Cycles 129449 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7967 (4.088430%) FPSUB: 0 (0.000000%) FPMUL: 32299 (16.574895%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 68052 (34.922280%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5715 (2.932770%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 
(0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72882 (37.400894%) DIV: 7683 (3.942689%) FPUN: 0 (0.000000%) FPRSUB: 269 (0.138043%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3365658 total) ADD%: 7.471 (251445) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.526 (51365) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.559 (18829) FPSUB%: 0.000 (0) FPMUL%: 4.791 (161234) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.138 (172943) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (596) FPINV%: 0.000 (0) FPCONV%: 0.019 (656) FPEQ%: 
0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35896) FPLE%: 0.454 (15287) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.796 (94088) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.745 (25089) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.661 (527095) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (39378) ORI%: 1.572 (52911) XORI%: 0.000 (0) MULI%: 3.192 (107448) LW%: 1.128 (37968) LWI%: 13.447 (452588) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9641) SWI%: 4.055 (136464) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.397 (47011) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10423) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1971) bned%: 0.000 (0) bneid%: 13.792 (464178) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (24077) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4147) DIV%: 0.012 (416) FPUN%: 1.478 (49747) FPRSUB%: 4.217 (141940) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) FPGT%: 2.942 (99032) FPGE%: 1.024 (34460) SYNC%: 0.000 (0) NOP%: 8.754 (294645) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 402 LOAD 40578 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 
BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1557 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49030 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 18 ORI 11337 XORI 0 MULI 9702 LW 0 LWI 142966 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 28 FPUN 0 FPRSUB 54 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7235 --Total thread-cycles: 4142368 --total thread-cycles issued: 3071013 (74.136653%) --iCache conflicts: 112723 (2.721221%) --thread*cycles of FU dependence: 255830 (6.175936%) --thread*cycles of data dependence: 194867 (4.704242%) --iCache cycles*banks: 4142368 (81.250386% used) Issue breakdown: --thread*cycles of issue worked: 3071013 (74.136653%) --thread*cycles of issue failed: 776710 (18.750386%) --thread*cycles of issue NOP/other: 294645 (7.112961%) Number of thread-cycles not ready: 194867 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3365658 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 9 3: 8 4: 9 5: 9 6: 8 7: 7 8: 7 9: 7 10: 7 11: 9 12: 7 13: 8 14: 8 15: 8 16: 9 17: 8 18: 8 19: 7 20: 8 21: 7 22: 7 23: 8 24: 6 25: 4 26: 7 27: 7 28: 7 29: 6 30: 7 31: 7 <=== Core 4 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95151 in-flight CPI 1.6429 -- Total Cycles 156355 ---- Thread 01 ---- PC 5: Stalled ----- 97040 in-flight CPI 1.6109 -- Total Cycles 156355 ---- Thread 02 ---- PC 5: Stalled ----- 100331 in-flight CPI 1.5580 -- Total Cycles 156355 ---- Thread 03 
---- PC 5: Stalled ----- 97917 in-flight CPI 1.5965 -- Total Cycles 156355 ---- Thread 04 ---- PC 5: Stalled ----- 100034 in-flight CPI 1.5627 -- Total Cycles 156355 ---- Thread 05 ---- PC 5: Stalled ----- 97980 in-flight CPI 1.5954 -- Total Cycles 156355 ---- Thread 06 ---- PC 5: Stalled ----- 95917 in-flight CPI 1.6298 -- Total Cycles 156355 ---- Thread 07 ---- PC 5: Stalled ----- 99932 in-flight CPI 1.5643 -- Total Cycles 156355 ---- Thread 08 ---- PC 5: Stalled ----- 97366 in-flight CPI 1.6055 -- Total Cycles 156355 ---- Thread 09 ---- PC 5: Stalled ----- 97262 in-flight CPI 1.6073 -- Total Cycles 156355 ---- Thread 10 ---- PC 5: Stalled ----- 97229 in-flight CPI 1.6078 -- Total Cycles 156355 ---- Thread 11 ---- PC 5: Stalled ----- 100285 in-flight CPI 1.5588 -- Total Cycles 156355 ---- Thread 12 ---- PC 5: Stalled ----- 103172 in-flight CPI 1.5152 -- Total Cycles 156355 ---- Thread 13 ---- PC 5: Stalled ----- 101141 in-flight CPI 1.5456 -- Total Cycles 156355 ---- Thread 14 ---- PC 5: Stalled ----- 98977 in-flight CPI 1.5794 -- Total Cycles 156355 ---- Thread 15 ---- PC 5: Stalled ----- 94874 in-flight CPI 1.6477 -- Total Cycles 156355 ---- Thread 16 ---- PC 5: Stalled ----- 88298 in-flight CPI 1.7706 -- Total Cycles 156355 ---- Thread 17 ---- PC 5: Stalled ----- 91622 in-flight CPI 1.7063 -- Total Cycles 156355 ---- Thread 18 ---- PC 5: Stalled ----- 98327 in-flight CPI 1.5899 -- Total Cycles 156355 ---- Thread 19 ---- PC 5: Stalled ----- 98690 in-flight CPI 1.5840 -- Total Cycles 156355 ---- Thread 20 ---- PC 5: Stalled ----- 96735 in-flight CPI 1.6159 -- Total Cycles 156355 ---- Thread 21 ---- PC 5: Stalled ----- 84601 in-flight CPI 1.8479 -- Total Cycles 156355 ---- Thread 22 ---- PC 5: Stalled ----- 90175 in-flight CPI 1.7336 -- Total Cycles 156355 ---- Thread 23 ---- PC 5: Stalled ----- 105814 in-flight CPI 1.4775 -- Total Cycles 156355 ---- Thread 24 ---- PC 5: Stalled ----- 94640 in-flight CPI 1.6517 -- Total Cycles 156355 ---- Thread 25 ---- PC 5: 
Stalled ----- 93907 in-flight CPI 1.6647 -- Total Cycles 156355 ---- Thread 26 ---- PC 5: Stalled ----- 91825 in-flight CPI 1.7024 -- Total Cycles 156355 ---- Thread 27 ---- PC 5: Stalled ----- 85120 in-flight CPI 1.8365 -- Total Cycles 156355 ---- Thread 28 ---- PC 5: Stalled ----- 95326 in-flight CPI 1.6399 -- Total Cycles 156355 ---- Thread 29 ---- PC 5: Stalled ----- 93428 in-flight CPI 1.6732 -- Total Cycles 156355 ---- Thread 30 ---- PC 5: Stalled ----- 84444 in-flight CPI 1.8513 -- Total Cycles 156355 ---- Thread 31 ---- PC 5: Stalled ----- 86703 in-flight CPI 1.8030 -- Total Cycles 156355 Total CPI 0.0512 , IPC 19.5378 -- Total Cycles 156355 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8090 (3.286040%) FPSUB: 0 (0.000000%) FPMUL: 32355 (13.142128%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 118337 (48.066761%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5743 (2.332723%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 
(0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73717 (29.942768%) DIV: 7681 (3.119910%) FPUN: 0 (0.000000%) FPRSUB: 270 (0.109670%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3347903 total) ADD%: 7.424 (248532) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.512 (50615) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.567 (18993) FPSUB%: 0.000 (0) FPMUL%: 4.817 (161279) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.163 (172849) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (598) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35759) FPLE%: 0.452 (15122) EQ%: 0.000 (0) 
NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.799 (93713) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24993) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.662 (524332) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (39161) ORI%: 1.569 (52543) XORI%: 0.000 (0) MULI%: 3.194 (106924) LW%: 1.130 (37818) LWI%: 13.460 (450617) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9590) SWI%: 4.062 (135994) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46839) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10369) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2085) bned%: 0.000 (0) bneid%: 13.768 (460949) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.712 (23826) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4195) DIV%: 0.012 (416) FPUN%: 1.466 (49064) FPRSUB%: 4.239 (141933) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (65) FPGT%: 2.942 (98492) FPGE%: 1.014 (33942) SYNC%: 0.000 (0) NOP%: 8.752 (293016) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 6 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 39923 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 
FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2116 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48746 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 11560 XORI 0 MULI 8960 LW 0 LWI 142523 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 54 DIV 23 FPUN 0 FPRSUB 44 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 19.5380 --Total thread-cycles: 5003360 --total thread-cycles issued: 3054887 (61.056710%) --iCache conflicts: 108459 (2.167723%) --thread*cycles of FU dependence: 254442 (5.085423%) --thread*cycles of data dependence: 246193 (4.920553%) --iCache cycles*banks: 5003360 (66.913734% used) Issue breakdown: --thread*cycles of issue worked: 3054887 (61.056710%) --thread*cycles of issue failed: 1655457 (33.086906%) --thread*cycles of issue NOP/other: 293016 (5.856385%) Number of thread-cycles not ready: 246193 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3347903 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 8 4: 8 5: 10 6: 8 7: 8 8: 9 9: 7 10: 8 11: 8 12: 8 13: 9 14: 7 15: 8 16: 5 17: 6 18: 6 19: 8 20: 9 21: 6 22: 7 23: 5 24: 9 25: 7 26: 7 27: 7 28: 7 29: 7 30: 6 31: 7 <=== Core 5 ===> ---- Thread 00 ---- PC 5: Stalled ----- 93422 in-flight CPI 1.3745 -- Total Cycles 128434 ---- Thread 01 ---- PC 5: Stalled ----- 98086 in-flight CPI 1.3091 -- Total Cycles 128434 ---- Thread 02 ---- PC 5: Stalled ----- 100325 in-flight CPI 1.2799 -- Total Cycles 128434 ---- Thread 03 ---- PC 5: Stalled ----- 101842 in-flight CPI 1.2608 -- Total Cycles 128434 ---- Thread 
04 ---- PC 5: Stalled ----- 101989 in-flight CPI 1.2591 -- Total Cycles 128434 ---- Thread 05 ---- PC 5: Stalled ----- 98645 in-flight CPI 1.3018 -- Total Cycles 128434 ---- Thread 06 ---- PC 5: Stalled ----- 99917 in-flight CPI 1.2852 -- Total Cycles 128434 ---- Thread 07 ---- PC 5: Stalled ----- 97169 in-flight CPI 1.3215 -- Total Cycles 128434 ---- Thread 08 ---- PC 5: Stalled ----- 103471 in-flight CPI 1.2410 -- Total Cycles 128434 ---- Thread 09 ---- PC 5: Stalled ----- 101886 in-flight CPI 1.2604 -- Total Cycles 128434 ---- Thread 10 ---- PC 5: Stalled ----- 96238 in-flight CPI 1.3343 -- Total Cycles 128434 ---- Thread 11 ---- PC 5: Stalled ----- 96545 in-flight CPI 1.3300 -- Total Cycles 128434 ---- Thread 12 ---- PC 5: Stalled ----- 93834 in-flight CPI 1.3685 -- Total Cycles 128434 ---- Thread 13 ---- PC 5: Stalled ----- 102500 in-flight CPI 1.2528 -- Total Cycles 128434 ---- Thread 14 ---- PC 5: Stalled ----- 91039 in-flight CPI 1.4106 -- Total Cycles 128434 ---- Thread 15 ---- PC 5: Stalled ----- 100191 in-flight CPI 1.2816 -- Total Cycles 128434 ---- Thread 16 ---- PC 5: Stalled ----- 99231 in-flight CPI 1.2940 -- Total Cycles 128434 ---- Thread 17 ---- PC 5: Stalled ----- 94089 in-flight CPI 1.3648 -- Total Cycles 128434 ---- Thread 18 ---- PC 5: Stalled ----- 94519 in-flight CPI 1.3585 -- Total Cycles 128434 ---- Thread 19 ---- PC 5: Stalled ----- 90443 in-flight CPI 1.4198 -- Total Cycles 128434 ---- Thread 20 ---- PC 5: Stalled ----- 99085 in-flight CPI 1.2959 -- Total Cycles 128434 ---- Thread 21 ---- PC 5: Stalled ----- 92791 in-flight CPI 1.3839 -- Total Cycles 128434 ---- Thread 22 ---- PC 5: Stalled ----- 93539 in-flight CPI 1.3728 -- Total Cycles 128434 ---- Thread 23 ---- PC 5: Stalled ----- 96672 in-flight CPI 1.3283 -- Total Cycles 128434 ---- Thread 24 ---- PC 5: Stalled ----- 92045 in-flight CPI 1.3951 -- Total Cycles 128434 ---- Thread 25 ---- PC 5: Stalled ----- 96484 in-flight CPI 1.3308 -- Total Cycles 128434 ---- Thread 26 ---- PC 5: 
Stalled ----- 94598 in-flight CPI 1.3574 -- Total Cycles 128434 ---- Thread 27 ---- PC 5: Stalled ----- 87736 in-flight CPI 1.4635 -- Total Cycles 128434 ---- Thread 28 ---- PC 5: Stalled ----- 92414 in-flight CPI 1.3895 -- Total Cycles 128434 ---- Thread 29 ---- PC 5: Stalled ----- 93089 in-flight CPI 1.3794 -- Total Cycles 128434 ---- Thread 30 ---- PC 5: Stalled ----- 87471 in-flight CPI 1.4680 -- Total Cycles 128434 ---- Thread 31 ---- PC 5: Stalled ----- 80802 in-flight CPI 1.5893 -- Total Cycles 128434 Total CPI 0.0419 , IPC 23.8464 -- Total Cycles 128434 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7441 (3.893997%) FPSUB: 0 (0.000000%) FPMUL: 31080 (16.264672%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 69233 (36.230762%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5974 (3.126292%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) 
RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69193 (36.209829%) DIV: 7892 (4.130013%) FPUN: 0 (0.000000%) FPRSUB: 276 (0.144435%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3356224 total) ADD%: 7.493 (251497) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.522 (51079) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.525 (17630) FPSUB%: 0.000 (0) FPMUL%: 4.696 (157594) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.111 (171537) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (617) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.052 (35322) FPLE%: 0.454 (15248) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 
(642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.821 (94674) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.738 (24758) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.690 (526592) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39529) ORI%: 1.537 (51594) XORI%: 0.000 (0) MULI%: 3.220 (108074) LW%: 1.139 (38212) LWI%: 13.542 (454499) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9698) SWI%: 4.096 (137476) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.410 (47317) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10420) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.052 (1743) bned%: 0.000 (0) bneid%: 13.807 (463408) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (24017) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.117 (3925) DIV%: 0.013 (428) FPUN%: 1.476 (49548) FPRSUB%: 4.151 (139318) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.957 (99250) FPGE%: 1.022 (34300) SYNC%: 0.000 (0) NOP%: 8.744 (293475) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 37 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 418 LOAD 39142 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1915 LOADIMM 0 SPHERE_TEST 0 
TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49302 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 10503 XORI 0 MULI 9691 LW 0 LWI 143670 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 79 DIV 34 FPUN 0 FPRSUB 60 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8467 --Total thread-cycles: 4109888 --total thread-cycles issued: 3062749 (74.521471%) --iCache conflicts: 110458 (2.687616%) --thread*cycles of FU dependence: 254938 (6.203040%) --thread*cycles of data dependence: 191089 (4.649494%) --iCache cycles*banks: 4109888 (81.662955% used) Issue breakdown: --thread*cycles of issue worked: 3062749 (74.521471%) --thread*cycles of issue failed: 753664 (18.337823%) --thread*cycles of issue NOP/other: 293475 (7.140706%) Number of thread-cycles not ready: 191089 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3356224 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 10 4: 8 5: 7 6: 8 7: 7 8: 9 9: 7 10: 7 11: 8 12: 8 13: 7 14: 6 15: 8 16: 8 17: 7 18: 8 19: 8 20: 9 21: 7 22: 8 23: 8 24: 7 25: 9 26: 8 27: 8 28: 7 29: 8 30: 7 31: 5 <=== Core 6 ===> ---- Thread 00 ---- PC 5: Stalled ----- 90639 in-flight CPI 1.6649 -- Total Cycles 150931 ---- Thread 01 ---- PC 5: Stalled ----- 94527 in-flight CPI 1.5964 -- Total Cycles 150931 ---- Thread 02 ---- PC 5: Stalled ----- 103918 in-flight CPI 1.4521 -- Total Cycles 150931 ---- Thread 03 ---- PC 5: Stalled ----- 107251 in-flight CPI 1.4072 -- Total Cycles 150931 ---- Thread 04 ---- PC 5: Stalled ----- 99569 in-flight CPI 1.5155 -- Total Cycles 150931 ---- Thread 05 
---- PC 5: Stalled ----- 97398 in-flight CPI 1.5494 -- Total Cycles 150931 ---- Thread 06 ---- PC 5: Stalled ----- 100362 in-flight CPI 1.5036 -- Total Cycles 150931 ---- Thread 07 ---- PC 5: Stalled ----- 93279 in-flight CPI 1.6178 -- Total Cycles 150931 ---- Thread 08 ---- PC 5: Stalled ----- 96777 in-flight CPI 1.5593 -- Total Cycles 150931 ---- Thread 09 ---- PC 5: Stalled ----- 101091 in-flight CPI 1.4928 -- Total Cycles 150931 ---- Thread 10 ---- PC 5: Stalled ----- 92699 in-flight CPI 1.6279 -- Total Cycles 150931 ---- Thread 11 ---- PC 5: Stalled ----- 95404 in-flight CPI 1.5817 -- Total Cycles 150931 ---- Thread 12 ---- PC 5: Stalled ----- 96305 in-flight CPI 1.5670 -- Total Cycles 150931 ---- Thread 13 ---- PC 5: Stalled ----- 99930 in-flight CPI 1.5100 -- Total Cycles 150931 ---- Thread 14 ---- PC 5: Stalled ----- 94319 in-flight CPI 1.5999 -- Total Cycles 150931 ---- Thread 15 ---- PC 5: Stalled ----- 92398 in-flight CPI 1.6331 -- Total Cycles 150931 ---- Thread 16 ---- PC 5: Stalled ----- 84474 in-flight CPI 1.7865 -- Total Cycles 150931 ---- Thread 17 ---- PC 5: Stalled ----- 96000 in-flight CPI 1.5719 -- Total Cycles 150931 ---- Thread 18 ---- PC 5: Stalled ----- 91105 in-flight CPI 1.6564 -- Total Cycles 150931 ---- Thread 19 ---- PC 5: Stalled ----- 94939 in-flight CPI 1.5895 -- Total Cycles 150931 ---- Thread 20 ---- PC 5: Stalled ----- 94860 in-flight CPI 1.5908 -- Total Cycles 150931 ---- Thread 21 ---- PC 5: Stalled ----- 98203 in-flight CPI 1.5367 -- Total Cycles 150931 ---- Thread 22 ---- PC 5: Stalled ----- 92882 in-flight CPI 1.6247 -- Total Cycles 150931 ---- Thread 23 ---- PC 5: Stalled ----- 98452 in-flight CPI 1.5327 -- Total Cycles 150931 ---- Thread 24 ---- PC 5: Stalled ----- 93139 in-flight CPI 1.6202 -- Total Cycles 150931 ---- Thread 25 ---- PC 5: Stalled ----- 88826 in-flight CPI 1.6989 -- Total Cycles 150931 ---- Thread 26 ---- PC 5: Stalled ----- 86462 in-flight CPI 1.7453 -- Total Cycles 150931 ---- Thread 27 ---- PC 5: 
Stalled ----- 90774 in-flight CPI 1.6624 -- Total Cycles 150931 ---- Thread 28 ---- PC 5: Stalled ----- 89842 in-flight CPI 1.6797 -- Total Cycles 150931 ---- Thread 29 ---- PC 5: Stalled ----- 88473 in-flight CPI 1.7056 -- Total Cycles 150931 ---- Thread 30 ---- PC 5: Stalled ----- 90900 in-flight CPI 1.6602 -- Total Cycles 150931 ---- Thread 31 ---- PC 5: Stalled ----- 92144 in-flight CPI 1.6376 -- Total Cycles 150931 Total CPI 0.0498 , IPC 20.0612 -- Total Cycles 150931 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8092 (3.319305%) FPSUB: 0 (0.000000%) FPMUL: 32111 (13.171798%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 117297 (48.114740%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5239 (2.149016%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 
(0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73739 (30.247430%) DIV: 7054 (2.893521%) FPUN: 0 (0.000000%) FPRSUB: 254 (0.104190%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3318072 total) ADD%: 7.364 (244358) SUB%: 0.000 (0) MUL%: 0.006 (191) BITOR%: 1.521 (50471) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.565 (18746) FPSUB%: 0.000 (0) FPMUL%: 4.822 (159981) FPCMPLT%: 0.000 (0) FPMIN%: 0.017 (573) FPMAX%: 0.017 (573) LOAD%: 5.197 (172446) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (223) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (552) FPINV%: 0.000 (0) FPCONV%: 0.018 (605) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35377) FPLE%: 0.456 (15122) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.017 (573) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) 
MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.801 (92941) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (24991) CMPU%: 0.000 (0) RSUB%: 0.006 (191) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.687 (520499) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (38936) ORI%: 1.571 (52125) XORI%: 0.000 (0) MULI%: 3.192 (105922) LW%: 1.130 (37482) LWI%: 13.433 (445721) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9513) SWI%: 4.042 (134132) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46434) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10274) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.063 (2093) bned%: 0.000 (0) bneid%: 13.768 (456841) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23735) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4208) DIV%: 0.012 (382) FPUN%: 1.472 (48826) FPRSUB%: 4.272 (141756) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.931 (97248) FPGE%: 1.016 (33704) SYNC%: 0.000 (0) NOP%: 8.745 (290158) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 5 FPCMPLT 0 FPMIN 0 FPMAX 376 LOAD 41298 INTCONV 0 ATOMIC_INC 29 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1565 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 
0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 13 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48326 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 11530 XORI 0 MULI 8561 LW 0 LWI 141269 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 81 DIV 34 FPUN 0 FPRSUB 72 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.0614 --Total thread-cycles: 4829792 --total thread-cycles issued: 3027914 (62.692431%) --iCache conflicts: 107749 (2.230924%) --thread*cycles of FU dependence: 253242 (5.243331%) --thread*cycles of data dependence: 243786 (5.047547%) --iCache cycles*banks: 4829792 (68.700764% used) Issue breakdown: --thread*cycles of issue worked: 3027914 (62.692431%) --thread*cycles of issue failed: 1511720 (31.299899%) --thread*cycles of issue NOP/other: 290158 (6.007671%) Number of thread-cycles not ready: 243786 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3318072 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 7 2: 9 3: 4 4: 9 5: 7 6: 7 7: 6 8: 7 9: 7 10: 6 11: 8 12: 7 13: 9 14: 7 15: 8 16: 5 17: 8 18: 6 19: 7 20: 7 21: 7 22: 6 23: 8 24: 7 25: 7 26: 7 27: 7 28: 6 29: 7 30: 6 31: 8 <=== Core 7 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103576 in-flight CPI 1.2333 -- Total Cycles 127764 ---- Thread 01 ---- PC 5: Stalled ----- 100462 in-flight CPI 1.2716 -- Total Cycles 127764 ---- Thread 02 ---- PC 5: Stalled ----- 96787 in-flight CPI 1.3198 -- Total Cycles 127764 ---- Thread 03 ---- PC 5: Stalled ----- 96138 in-flight CPI 1.3287 -- Total Cycles 127764 ---- Thread 04 ---- PC 5: Stalled ----- 97730 in-flight CPI 1.3071 -- Total Cycles 127764 ---- Thread 05 ---- PC 5: Stalled ----- 98306 in-flight CPI 1.2994 -- Total
Cycles 127764 ---- Thread 06 ---- PC 5: Stalled ----- 100413 in-flight CPI 1.2721 -- Total Cycles 127764 ---- Thread 07 ---- PC 5: Stalled ----- 93911 in-flight CPI 1.3602 -- Total Cycles 127764 ---- Thread 08 ---- PC 5: Stalled ----- 95973 in-flight CPI 1.3310 -- Total Cycles 127764 ---- Thread 09 ---- PC 5: Stalled ----- 98784 in-flight CPI 1.2932 -- Total Cycles 127764 ---- Thread 10 ---- PC 5: Stalled ----- 94729 in-flight CPI 1.3485 -- Total Cycles 127764 ---- Thread 11 ---- PC 5: Stalled ----- 102819 in-flight CPI 1.2424 -- Total Cycles 127764 ---- Thread 12 ---- PC 5: Stalled ----- 96482 in-flight CPI 1.3240 -- Total Cycles 127764 ---- Thread 13 ---- PC 5: Stalled ----- 94897 in-flight CPI 1.3461 -- Total Cycles 127764 ---- Thread 14 ---- PC 5: Stalled ----- 97195 in-flight CPI 1.3143 -- Total Cycles 127764 ---- Thread 15 ---- PC 5: Stalled ----- 98136 in-flight CPI 1.3017 -- Total Cycles 127764 ---- Thread 16 ---- PC 5: Stalled ----- 98671 in-flight CPI 1.2946 -- Total Cycles 127764 ---- Thread 17 ---- PC 5: Stalled ----- 90817 in-flight CPI 1.4067 -- Total Cycles 127764 ---- Thread 18 ---- PC 5: Stalled ----- 91186 in-flight CPI 1.4009 -- Total Cycles 127764 ---- Thread 19 ---- PC 5: Stalled ----- 98903 in-flight CPI 1.2916 -- Total Cycles 127764 ---- Thread 20 ---- PC 5: Stalled ----- 94922 in-flight CPI 1.3458 -- Total Cycles 127764 ---- Thread 21 ---- PC 5: Stalled ----- 96137 in-flight CPI 1.3288 -- Total Cycles 127764 ---- Thread 22 ---- PC 5: Stalled ----- 96010 in-flight CPI 1.3305 -- Total Cycles 127764 ---- Thread 23 ---- PC 5: Stalled ----- 96163 in-flight CPI 1.3283 -- Total Cycles 127764 ---- Thread 24 ---- PC 5: Stalled ----- 97617 in-flight CPI 1.3085 -- Total Cycles 127764 ---- Thread 25 ---- PC 5: Stalled ----- 92479 in-flight CPI 1.3813 -- Total Cycles 127764 ---- Thread 26 ---- PC 5: Stalled ----- 92602 in-flight CPI 1.3796 -- Total Cycles 127764 ---- Thread 27 ---- PC 5: Stalled ----- 90591 in-flight CPI 1.4101 -- Total Cycles 127764 
---- Thread 28 ---- PC 5: Stalled ----- 94800 in-flight CPI 1.3474 -- Total Cycles 127764 ---- Thread 29 ---- PC 5: Stalled ----- 92211 in-flight CPI 1.3853 -- Total Cycles 127764 ---- Thread 30 ---- PC 5: Stalled ----- 89167 in-flight CPI 1.4325 -- Total Cycles 127764 ---- Thread 31 ---- PC 5: Stalled ----- 89720 in-flight CPI 1.4238 -- Total Cycles 127764 Total CPI 0.0416 , IPC 24.0201 -- Total Cycles 127764 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8025 (4.173840%) FPSUB: 0 (0.000000%) FPMUL: 32336 (16.818104%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 64242 (33.412563%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5864 (3.049894%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 
(0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73778 (38.372281%) DIV: 7754 (4.032891%) FPUN: 0 (0.000000%) FPRSUB: 270 (0.140428%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3362992 total) ADD%: 7.469 (251196) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.533 (51550) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.561 (18851) FPSUB%: 0.000 (0) FPMUL%: 4.794 (161233) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.133 (172636) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (606) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35908) FPLE%: 0.452 (15185) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 
0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.791 (93860) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (25117) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.636 (525832) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.169 (39324) ORI%: 1.579 (53096) XORI%: 0.000 (0) MULI%: 3.193 (107378) LW%: 1.126 (37880) LWI%: 13.463 (452764) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9590) SWI%: 4.055 (136355) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.396 (46932) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10365) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1972) bned%: 0.000 (0) bneid%: 13.785 (463594) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (24204) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4199) DIV%: 0.012 (420) FPUN%: 1.484 (49896) FPRSUB%: 4.230 (142260) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.934 (98669) FPGE%: 1.032 (34711) SYNC%: 0.000 (0) NOP%: 8.743 (294028) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 414 LOAD 39625 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 10 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1609 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 
RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48993 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 11395 XORI 0 MULI 9511 LW 0 LWI 143177 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 90 DIV 19 FPUN 0 FPRSUB 54 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.0204 --Total thread-cycles: 4088448 --total thread-cycles issued: 3068964 (75.064279%) --iCache conflicts: 111996 (2.739328%) --thread*cycles of FU dependence: 255001 (6.237110%) --thread*cycles of data dependence: 192269 (4.702738%) --iCache cycles*banks: 4088448 (82.256739% used) Issue breakdown: --thread*cycles of issue worked: 3068964 (75.064279%) --thread*cycles of issue failed: 725456 (17.744044%) --thread*cycles of issue NOP/other: 4619233815175658636 (112982574687892.780000%) Number of thread-cycles not ready: 192269 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3362992 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 8 3: 9 4: 7 5: 8 6: 9 7: 7 8: 7 9: 7 10: 6 11: 8 12: 8 13: 8 14: 8 15: 8 16: 7 17: 5 18: 7 19: 7 20: 7 21: 7 22: 8 23: 9 24: 9 25: 8 26: 5 27: 7 28: 8 29: 8 30: 9 31: 7 <=== Core 8 ===> ---- Thread 00 ---- PC 5: Stalled ----- 90510 in-flight CPI 1.5093 -- Total Cycles 136625 ---- Thread 01 ---- PC 5: Stalled ----- 103155 in-flight CPI 1.3243 -- Total Cycles 136625 ---- Thread 02 ---- PC 5: Stalled ----- 96598 in-flight CPI 1.4142 -- Total Cycles 136625 ---- Thread 03 ---- PC 5: Stalled ----- 96344 in-flight CPI 1.4179 -- Total Cycles 136625 ---- Thread 04 ---- PC 5: Stalled ----- 98924 in-flight CPI 1.3808 -- Total Cycles 136625 ---- Thread 05 ---- PC 5: Stalled ----- 97641 in-flight CPI 1.3990 -- Total Cycles 136625 ---- Thread 06 ---- PC 5: Stalled ----- 93903 
in-flight CPI 1.4547 -- Total Cycles 136625 ---- Thread 07 ---- PC 5: Stalled ----- 101037 in-flight CPI 1.3520 -- Total Cycles 136625 ---- Thread 08 ---- PC 5: Stalled ----- 93145 in-flight CPI 1.4665 -- Total Cycles 136625 ---- Thread 09 ---- PC 5: Stalled ----- 98589 in-flight CPI 1.3855 -- Total Cycles 136625 ---- Thread 10 ---- PC 5: Stalled ----- 96154 in-flight CPI 1.4207 -- Total Cycles 136625 ---- Thread 11 ---- PC 5: Stalled ----- 101307 in-flight CPI 1.3484 -- Total Cycles 136625 ---- Thread 12 ---- PC 5: Stalled ----- 102241 in-flight CPI 1.3360 -- Total Cycles 136625 ---- Thread 13 ---- PC 5: Stalled ----- 94502 in-flight CPI 1.4455 -- Total Cycles 136625 ---- Thread 14 ---- PC 5: Stalled ----- 99942 in-flight CPI 1.3668 -- Total Cycles 136625 ---- Thread 15 ---- PC 5: Stalled ----- 98003 in-flight CPI 1.3939 -- Total Cycles 136625 ---- Thread 16 ---- PC 5: Stalled ----- 94884 in-flight CPI 1.4396 -- Total Cycles 136625 ---- Thread 17 ---- PC 5: Stalled ----- 98311 in-flight CPI 1.3894 -- Total Cycles 136625 ---- Thread 18 ---- PC 5: Stalled ----- 94928 in-flight CPI 1.4390 -- Total Cycles 136625 ---- Thread 19 ---- PC 5: Stalled ----- 91807 in-flight CPI 1.4879 -- Total Cycles 136625 ---- Thread 20 ---- PC 5: Stalled ----- 91413 in-flight CPI 1.4944 -- Total Cycles 136625 ---- Thread 21 ---- PC 5: Stalled ----- 90426 in-flight CPI 1.5106 -- Total Cycles 136625 ---- Thread 22 ---- PC 5: Stalled ----- 90456 in-flight CPI 1.5101 -- Total Cycles 136625 ---- Thread 23 ---- PC 5: Stalled ----- 87274 in-flight CPI 1.5652 -- Total Cycles 136625 ---- Thread 24 ---- PC 5: Stalled ----- 97460 in-flight CPI 1.4017 -- Total Cycles 136625 ---- Thread 25 ---- PC 5: Stalled ----- 93215 in-flight CPI 1.4655 -- Total Cycles 136625 ---- Thread 26 ---- PC 5: Stalled ----- 91402 in-flight CPI 1.4945 -- Total Cycles 136625 ---- Thread 27 ---- PC 5: Stalled ----- 88553 in-flight CPI 1.5426 -- Total Cycles 136625 ---- Thread 28 ---- PC 5: Stalled ----- 85168 in-flight CPI 
1.6039 -- Total Cycles 136625 ---- Thread 29 ---- PC 5: Stalled ----- 92226 in-flight CPI 1.4812 -- Total Cycles 136625 ---- Thread 30 ---- PC 5: Stalled ----- 89699 in-flight CPI 1.5230 -- Total Cycles 136625 ---- Thread 31 ---- PC 5: Stalled ----- 94370 in-flight CPI 1.4474 -- Total Cycles 136625 Total CPI 0.0450 , IPC 22.2075 -- Total Cycles 136625 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8581 (3.598024%) FPSUB: 0 (0.000000%) FPMUL: 33103 (13.880130%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 106791 (44.777603%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5159 (2.163175%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 
(0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 77567 (32.523942%) DIV: 7037 (2.950623%) FPUN: 0 (0.000000%) FPRSUB: 254 (0.106503%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3324108 total) ADD%: 7.462 (248061) SUB%: 0.000 (0) MUL%: 0.006 (191) BITOR%: 1.539 (51159) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.600 (19934) FPSUB%: 0.000 (0) FPMUL%: 4.911 (163237) FPCMPLT%: 0.000 (0) FPMIN%: 0.017 (573) FPMAX%: 0.017 (573) LOAD%: 5.202 (172926) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (223) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.016 (548) FPINV%: 0.000 (0) FPCONV%: 0.018 (605) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.077 (35817) FPLE%: 0.456 (15162) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.017 (573) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) 
RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.764 (91891) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.769 (25560) CMPU%: 0.000 (0) RSUB%: 0.006 (191) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.632 (519625) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (38935) ORI%: 1.605 (53356) XORI%: 0.000 (0) MULI%: 3.156 (104900) LW%: 1.115 (37062) LWI%: 13.326 (442980) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.283 (9407) SWI%: 4.012 (133355) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.381 (45906) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10239) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1967) bned%: 0.000 (0) bneid%: 13.743 (456837) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23831) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.133 (4434) DIV%: 0.011 (382) FPUN%: 1.477 (49095) FPRSUB%: 4.329 (143890) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (63) FPGT%: 2.907 (96643) FPGE%: 1.021 (33933) SYNC%: 0.000 (0) NOP%: 8.723 (289948) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 376 LOAD 41496 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1741 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 47930 
ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 12343 XORI 0 MULI 8849 LW 0 LWI 140508 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 86 DIV 27 FPUN 0 FPRSUB 65 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.2078 --Total thread-cycles: 4372000 --total thread-cycles issued: 3034160 (69.399817%) --iCache conflicts: 107114 (2.450000%) --thread*cycles of FU dependence: 253542 (5.799222%) --thread*cycles of data dependence: 238492 (5.454986%) --iCache cycles*banks: 4372000 (76.032479% used) Issue breakdown: --thread*cycles of issue worked: 3034160 (69.399817%) --thread*cycles of issue failed: 1047892 (23.968253%) --thread*cycles of issue NOP/other: 4611615627134921884 (105480686805464.810000%) Number of thread-cycles not ready: 238492 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3324108 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 5 2: 6 3: 7 4: 8 5: 7 6: 8 7: 8 8: 7 9: 8 10: 7 11: 8 12: 9 13: 7 14: 7 15: 7 16: 8 17: 8 18: 7 19: 7 20: 6 21: 8 22: 7 23: 6 24: 5 25: 6 26: 7 27: 6 28: 6 29: 7 30: 5 31: 9 <=== Core 9 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94800 in-flight CPI 1.3601 -- Total Cycles 128960 ---- Thread 01 ---- PC 5: Stalled ----- 95523 in-flight CPI 1.3498 -- Total Cycles 128960 ---- Thread 02 ---- PC 5: Stalled ----- 102450 in-flight CPI 1.2585 -- Total Cycles 128960 ---- Thread 03 ---- PC 5: Stalled ----- 100579 in-flight CPI 1.2819 -- Total Cycles 128960 ---- Thread 04 ---- PC 5: Stalled ----- 96103 in-flight CPI 1.3416 -- Total Cycles 128960 ---- Thread 05 ---- PC 5: Stalled ----- 100609 in-flight CPI 1.2815 -- Total Cycles 128960 ---- Thread 06 ---- PC 5: Stalled ----- 94121 in-flight CPI 1.3699 -- Total Cycles 128960 ---- Thread 07 
---- PC 5: Stalled ----- 101917 in-flight CPI 1.2651 -- Total Cycles 128960 ---- Thread 08 ---- PC 5: Stalled ----- 98949 in-flight CPI 1.3030 -- Total Cycles 128960 ---- Thread 09 ---- PC 5: Stalled ----- 96684 in-flight CPI 1.3336 -- Total Cycles 128960 ---- Thread 10 ---- PC 5: Stalled ----- 97460 in-flight CPI 1.3229 -- Total Cycles 128960 ---- Thread 11 ---- PC 5: Stalled ----- 95122 in-flight CPI 1.3555 -- Total Cycles 128960 ---- Thread 12 ---- PC 5: Stalled ----- 100500 in-flight CPI 1.2829 -- Total Cycles 128960 ---- Thread 13 ---- PC 5: Stalled ----- 92589 in-flight CPI 1.3926 -- Total Cycles 128960 ---- Thread 14 ---- PC 5: Stalled ----- 98978 in-flight CPI 1.3027 -- Total Cycles 128960 ---- Thread 15 ---- PC 5: Stalled ----- 96796 in-flight CPI 1.3321 -- Total Cycles 128960 ---- Thread 16 ---- PC 5: Stalled ----- 98995 in-flight CPI 1.3024 -- Total Cycles 128960 ---- Thread 17 ---- PC 5: Stalled ----- 91653 in-flight CPI 1.4068 -- Total Cycles 128960 ---- Thread 18 ---- PC 5: Stalled ----- 87791 in-flight CPI 1.4687 -- Total Cycles 128960 ---- Thread 19 ---- PC 5: Stalled ----- 96285 in-flight CPI 1.3391 -- Total Cycles 128960 ---- Thread 20 ---- PC 5: Stalled ----- 90016 in-flight CPI 1.4324 -- Total Cycles 128960 ---- Thread 21 ---- PC 5: Stalled ----- 88784 in-flight CPI 1.4523 -- Total Cycles 128960 ---- Thread 22 ---- PC 5: Stalled ----- 92604 in-flight CPI 1.3924 -- Total Cycles 128960 ---- Thread 23 ---- PC 5: Stalled ----- 95423 in-flight CPI 1.3512 -- Total Cycles 128960 ---- Thread 24 ---- PC 5: Stalled ----- 93264 in-flight CPI 1.3825 -- Total Cycles 128960 ---- Thread 25 ---- PC 5: Stalled ----- 95666 in-flight CPI 1.3478 -- Total Cycles 128960 ---- Thread 26 ---- PC 5: Stalled ----- 96546 in-flight CPI 1.3355 -- Total Cycles 128960 ---- Thread 27 ---- PC 5: Stalled ----- 92984 in-flight CPI 1.3866 -- Total Cycles 128960 ---- Thread 28 ---- PC 5: Stalled ----- 93310 in-flight CPI 1.3818 -- Total Cycles 128960 ---- Thread 29 ---- PC 5: 
Stalled ----- 92819 in-flight CPI 1.3891 -- Total Cycles 128960 ---- Thread 30 ---- PC 5: Stalled ----- 95966 in-flight CPI 1.3435 -- Total Cycles 128960 ---- Thread 31 ---- PC 5: Stalled ----- 89633 in-flight CPI 1.4384 -- Total Cycles 128960 Total CPI 0.0422 , IPC 23.6934 -- Total Cycles 128960 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7729 (4.148752%) FPSUB: 0 (0.000000%) FPMUL: 31876 (17.110313%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 61881 (33.216316%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5962 (3.200266%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) 
ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70678 (37.938346%) DIV: 7897 (4.238930%) FPUN: 0 (0.000000%) FPRSUB: 274 (0.147077%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3349143 total) ADD%: 7.470 (250193) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.531 (51278) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (18319) FPSUB%: 0.000 (0) FPMUL%: 4.758 (159368) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.093 (170567) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (616) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35694) FPLE%: 0.452 (15146) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 
(0) ADDK%: 2.798 (93704) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.734 (24595) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.648 (524080) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.167 (39086) ORI%: 1.567 (52485) XORI%: 0.000 (0) MULI%: 3.208 (107432) LW%: 1.129 (37824) LWI%: 13.500 (452150) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9592) SWI%: 4.058 (135914) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46840) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10314) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1885) bned%: 0.000 (0) bneid%: 13.825 (463012) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (24039) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (4013) DIV%: 0.013 (428) FPUN%: 1.487 (49802) FPRSUB%: 4.184 (140119) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.955 (98966) FPGE%: 1.035 (34656) SYNC%: 0.000 (0) NOP%: 8.766 (293582) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 5 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 418 LOAD 38723 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1508 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49007 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 
ANDNI 0 ANDI 11 ORI 10956 XORI 0 MULI 9585 LW 0 LWI 142895 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 80 DIV 23 FPUN 0 FPRSUB 40 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6937 --Total thread-cycles: 4126720 --total thread-cycles issued: 3055561 (74.043332%) --iCache conflicts: 112145 (2.717534%) --thread*cycles of FU dependence: 253326 (6.138677%) --thread*cycles of data dependence: 186297 (4.514409%) --iCache cycles*banks: 4126720 (81.158281% used) Issue breakdown: --thread*cycles of issue worked: 3055561 (74.043332%) --thread*cycles of issue failed: 777577 (18.842495%) --thread*cycles of issue NOP/other: 4611603692494551758 (111749856847436.980000%) Number of thread-cycles not ready: 186297 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3349143 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 8 4: 8 5: 8 6: 8 7: 8 8: 8 9: 7 10: 9 11: 7 12: 9 13: 7 14: 8 15: 6 16: 8 17: 7 18: 7 19: 8 20: 7 21: 6 22: 7 23: 7 24: 7 25: 8 26: 8 27: 8 28: 8 29: 8 30: 8 31: 8 <=== Core 10 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99619 in-flight CPI 1.7677 -- Total Cycles 176133 ---- Thread 01 ---- PC 5: Stalled ----- 100367 in-flight CPI 1.7546 -- Total Cycles 176133 ---- Thread 02 ---- PC 5: Stalled ----- 97514 in-flight CPI 1.8059 -- Total Cycles 176133 ---- Thread 03 ---- PC 5: Stalled ----- 101484 in-flight CPI 1.7352 -- Total Cycles 176133 ---- Thread 04 ---- PC 5: Stalled ----- 101474 in-flight CPI 1.7354 -- Total Cycles 176133 ---- Thread 05 ---- PC 5: Stalled ----- 99216 in-flight CPI 1.7748 -- Total Cycles 176133 ---- Thread 06 ---- PC 5: Stalled ----- 92690 in-flight CPI 1.8998 -- Total Cycles 176133 ---- Thread 07 ---- PC 5: Stalled ----- 101371 in-flight CPI 1.7372 -- Total 
Cycles 176133 ---- Thread 08 ---- PC 5: Stalled ----- 100072 in-flight CPI 1.7597 -- Total Cycles 176133 ---- Thread 09 ---- PC 5: Stalled ----- 93789 in-flight CPI 1.8776 -- Total Cycles 176133 ---- Thread 10 ---- PC 5: Stalled ----- 98226 in-flight CPI 1.7928 -- Total Cycles 176133 ---- Thread 11 ---- PC 5: Stalled ----- 95266 in-flight CPI 1.8485 -- Total Cycles 176133 ---- Thread 12 ---- PC 5: Stalled ----- 100003 in-flight CPI 1.7609 -- Total Cycles 176133 ---- Thread 13 ---- PC 5: Stalled ----- 95475 in-flight CPI 1.8445 -- Total Cycles 176133 ---- Thread 14 ---- PC 5: Stalled ----- 95680 in-flight CPI 1.8405 -- Total Cycles 176133 ---- Thread 15 ---- PC 5: Stalled ----- 98162 in-flight CPI 1.7940 -- Total Cycles 176133 ---- Thread 16 ---- PC 5: Stalled ----- 102423 in-flight CPI 1.7193 -- Total Cycles 176133 ---- Thread 17 ---- PC 5: Stalled ----- 90611 in-flight CPI 1.9434 -- Total Cycles 176133 ---- Thread 18 ---- PC 5: Stalled ----- 97725 in-flight CPI 1.8019 -- Total Cycles 176133 ---- Thread 19 ---- PC 5: Stalled ----- 96331 in-flight CPI 1.8280 -- Total Cycles 176133 ---- Thread 20 ---- PC 5: Stalled ----- 123320 in-flight CPI 1.4281 -- Total Cycles 176133 ---- Thread 21 ---- PC 5: Stalled ----- 96476 in-flight CPI 1.8252 -- Total Cycles 176133 ---- Thread 22 ---- PC 5: Stalled ----- 117889 in-flight CPI 1.4939 -- Total Cycles 176133 ---- Thread 23 ---- PC 5: Stalled ----- 96028 in-flight CPI 1.8339 -- Total Cycles 176133 ---- Thread 24 ---- PC 5: Stalled ----- 97088 in-flight CPI 1.8139 -- Total Cycles 176133 ---- Thread 25 ---- PC 5: Stalled ----- 89464 in-flight CPI 1.9685 -- Total Cycles 176133 ---- Thread 26 ---- PC 5: Stalled ----- 104333 in-flight CPI 1.6880 -- Total Cycles 176133 ---- Thread 27 ---- PC 5: Stalled ----- 89860 in-flight CPI 1.9597 -- Total Cycles 176133 ---- Thread 28 ---- PC 5: Stalled ----- 94836 in-flight CPI 1.8569 -- Total Cycles 176133 ---- Thread 29 ---- PC 5: Stalled ----- 87671 in-flight CPI 2.0086 -- Total Cycles 176133 
---- Thread 30 ---- PC 5: Stalled ----- 83965 in-flight CPI 2.0974 -- Total Cycles 176133 ---- Thread 31 ---- PC 5: Stalled ----- 89816 in-flight CPI 1.9607 -- Total Cycles 176133 Total CPI 0.0563 , IPC 17.7639 -- Total Cycles 176133 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8639 (3.632091%) FPSUB: 0 (0.000000%) FPMUL: 33759 (14.193280%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 102989 (43.299615%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5836 (2.453627%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 
0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 78720 (33.096211%) DIV: 7641 (3.212502%) FPUN: 0 (0.000000%) FPRSUB: 268 (0.112675%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3429206 total) ADD%: 7.382 (253148) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.511 (51832) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.586 (20086) FPSUB%: 0.000 (0) FPMUL%: 4.874 (167125) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (621) FPMAX%: 0.018 (621) LOAD%: 5.192 (178059) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (601) FPINV%: 0.000 (0) FPCONV%: 0.019 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.076 (36903) FPLE%: 0.450 (15442) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.784 (95467) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) 
ANDN%: 0.000 (0) CMP%: 0.752 (25795) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.632 (536062) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.163 (39876) ORI%: 1.593 (54627) XORI%: 0.000 (0) MULI%: 3.179 (109010) LW%: 1.123 (38518) LWI%: 13.424 (460353) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9740) SWI%: 4.036 (138400) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.392 (47746) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10581) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.071 (2442) bned%: 0.000 (0) bneid%: 13.743 (471289) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (24565) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.131 (4492) DIV%: 0.012 (414) FPUN%: 1.466 (50267) FPRSUB%: 4.299 (147438) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (79) FPGT%: 2.928 (100418) FPGE%: 1.016 (34825) SYNC%: 0.000 (0) NOP%: 8.758 (300341) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 407 LOAD 40627 INTCONV 0 ATOMIC_INC 23 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1663 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49738 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 12315 XORI 0 MULI 8665 LW 0 LWI 145768 lbu 0 
lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 89 DIV 22 FPUN 0 FPRSUB 81 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 17.7641 --Total thread-cycles: 5636256 --total thread-cycles issued: 3128865 (55.513181%) --iCache conflicts: 109584 (1.944269%) --thread*cycles of FU dependence: 259482 (4.603801%) --thread*cycles of data dependence: 237852 (4.220035%) --iCache cycles*banks: 5636256 (60.842481% used) Issue breakdown: --thread*cycles of issue worked: 3128865 (55.513181%) --thread*cycles of issue failed: 2207050 (39.158087%) --thread*cycles of issue NOP/other: 300341 (5.328732%) Number of thread-cycles not ready: 237852 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3429206 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 8 4: 9 5: 9 6: 8 7: 7 8: 9 9: 7 10: 8 11: 7 12: 8 13: 7 14: 7 15: 7 16: 9 17: 8 18: 9 19: 8 20: 5 21: 9 22: 6 23: 7 24: 7 25: 6 26: 6 27: 8 28: 8 29: 7 30: 6 31: 6 <=== Core 11 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98829 in-flight CPI 1.3930 -- Total Cycles 137695 ---- Thread 01 ---- PC 5: Stalled ----- 94049 in-flight CPI 1.4638 -- Total Cycles 137695 ---- Thread 02 ---- PC 5: Stalled ----- 98527 in-flight CPI 1.3972 -- Total Cycles 137695 ---- Thread 03 ---- PC 5: Stalled ----- 95440 in-flight CPI 1.4424 -- Total Cycles 137695 ---- Thread 04 ---- PC 5: Stalled ----- 97199 in-flight CPI 1.4163 -- Total Cycles 137695 ---- Thread 05 ---- PC 5: Stalled ----- 100773 in-flight CPI 1.3661 -- Total Cycles 137695 ---- Thread 06 ---- PC 5: Stalled ----- 102405 in-flight CPI 1.3443 -- Total Cycles 137695 ---- Thread 07 ---- PC 5: Stalled ----- 96735 in-flight CPI 1.4232 -- Total Cycles 137695 ---- Thread 08 ---- PC 5: Stalled ----- 94178 in-flight CPI 1.4618 -- Total 
Cycles 137695 ---- Thread 09 ---- PC 5: Stalled ----- 96335 in-flight CPI 1.4291 -- Total Cycles 137695 ---- Thread 10 ---- PC 5: Stalled ----- 92169 in-flight CPI 1.4937 -- Total Cycles 137695 ---- Thread 11 ---- PC 5: Stalled ----- 96265 in-flight CPI 1.4301 -- Total Cycles 137695 ---- Thread 12 ---- PC 5: Stalled ----- 96873 in-flight CPI 1.4212 -- Total Cycles 137695 ---- Thread 13 ---- PC 5: Stalled ----- 100998 in-flight CPI 1.3631 -- Total Cycles 137695 ---- Thread 14 ---- PC 5: Stalled ----- 98659 in-flight CPI 1.3954 -- Total Cycles 137695 ---- Thread 15 ---- PC 5: Stalled ----- 94471 in-flight CPI 1.4573 -- Total Cycles 137695 ---- Thread 16 ---- PC 5: Stalled ----- 91451 in-flight CPI 1.5055 -- Total Cycles 137695 ---- Thread 17 ---- PC 5: Stalled ----- 97627 in-flight CPI 1.4102 -- Total Cycles 137695 ---- Thread 18 ---- PC 5: Stalled ----- 92167 in-flight CPI 1.4937 -- Total Cycles 137695 ---- Thread 19 ---- PC 5: Stalled ----- 93156 in-flight CPI 1.4779 -- Total Cycles 137695 ---- Thread 20 ---- PC 5: Stalled ----- 94970 in-flight CPI 1.4496 -- Total Cycles 137695 ---- Thread 21 ---- PC 5: Stalled ----- 93497 in-flight CPI 1.4725 -- Total Cycles 137695 ---- Thread 22 ---- PC 5: Stalled ----- 96490 in-flight CPI 1.4268 -- Total Cycles 137695 ---- Thread 23 ---- PC 5: Stalled ----- 93166 in-flight CPI 1.4776 -- Total Cycles 137695 ---- Thread 24 ---- PC 5: Stalled ----- 92007 in-flight CPI 1.4963 -- Total Cycles 137695 ---- Thread 25 ---- PC 5: Stalled ----- 87743 in-flight CPI 1.5690 -- Total Cycles 137695 ---- Thread 26 ---- PC 5: Stalled ----- 83664 in-flight CPI 1.6455 -- Total Cycles 137695 ---- Thread 27 ---- PC 5: Stalled ----- 93953 in-flight CPI 1.4654 -- Total Cycles 137695 ---- Thread 28 ---- PC 5: Stalled ----- 90302 in-flight CPI 1.5245 -- Total Cycles 137695 ---- Thread 29 ---- PC 5: Stalled ----- 90155 in-flight CPI 1.5270 -- Total Cycles 137695 ---- Thread 30 ---- PC 5: Stalled ----- 85902 in-flight CPI 1.6027 -- Total Cycles 137695 ---- 
Thread 31 ---- PC 5: Stalled ----- 83389 in-flight CPI 1.6510 -- Total Cycles 137695 Total CPI 0.0457 , IPC 21.8897 -- Total Cycles 137695 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7930 (3.621468%) FPSUB: 0 (0.000000%) FPMUL: 31766 (14.506878%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 93482 (42.691303%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5517 (2.519500%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 
0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72555 (33.134373%) DIV: 7459 (3.406372%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.120107%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3302882 total) ADD%: 7.435 (245571) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.512 (49947) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.563 (18581) FPSUB%: 0.000 (0) FPMUL%: 4.810 (158864) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.171 (170776) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (579) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35207) FPLE%: 0.453 (14976) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.801 (92512) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (24719) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 
0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.669 (517537) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (38742) ORI%: 1.560 (51510) XORI%: 0.000 (0) MULI%: 3.196 (105554) LW%: 1.130 (37328) LWI%: 13.466 (444764) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9482) SWI%: 4.063 (134193) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46217) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10232) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1962) bned%: 0.000 (0) bneid%: 13.765 (454650) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.711 (23494) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4131) DIV%: 0.012 (404) FPUN%: 1.463 (48331) FPRSUB%: 4.243 (140144) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.941 (97127) FPGE%: 1.010 (33355) SYNC%: 0.000 (0) NOP%: 8.742 (288732) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 390 LOAD 39764 INTCONV 0 ATOMIC_INC 33 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1765 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 14 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48185 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11281 XORI 0 MULI 9320 LW 0 LWI 140669 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 
bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 27 FPUN 0 FPRSUB 52 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8899 --Total thread-cycles: 4406240 --total thread-cycles issued: 3014150 (68.406396%) --iCache conflicts: 108411 (2.460397%) --thread*cycles of FU dependence: 251629 (5.710742%) --thread*cycles of data dependence: 218972 (4.969589%) --iCache cycles*banks: 4406240 (74.959920% used) Issue breakdown: --thread*cycles of issue worked: 3014150 (68.406396%) --thread*cycles of issue failed: 1103358 (25.040806%) --thread*cycles of issue NOP/other: 288732 (6.552798%) Number of thread-cycles not ready: 218972 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3302882 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 8 4: 8 5: 8 6: 9 7: 7 8: 8 9: 7 10: 7 11: 7 12: 7 13: 8 14: 8 15: 7 16: 6 17: 6 18: 8 19: 7 20: 7 21: 7 22: 8 23: 8 24: 7 25: 8 26: 6 27: 5 28: 8 29: 8 30: 6 31: 6 <=== Core 12 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98653 in-flight CPI 1.6429 -- Total Cycles 162107 ---- Thread 01 ---- PC 5: Stalled ----- 94733 in-flight CPI 1.7108 -- Total Cycles 162107 ---- Thread 02 ---- PC 5: Stalled ----- 102722 in-flight CPI 1.5778 -- Total Cycles 162107 ---- Thread 03 ---- PC 5: Stalled ----- 108219 in-flight CPI 1.4978 -- Total Cycles 162107 ---- Thread 04 ---- PC 5: Stalled ----- 95773 in-flight CPI 1.6924 -- Total Cycles 162107 ---- Thread 05 ---- PC 5: Stalled ----- 102097 in-flight CPI 1.5875 -- Total Cycles 162107 ---- Thread 06 ---- PC 5: Stalled ----- 104046 in-flight CPI 1.5577 -- Total Cycles 162107 ---- Thread 07 ---- PC 5: Stalled ----- 97548 in-flight CPI 1.6615 -- Total Cycles 162107 ---- Thread 08 ---- PC 5: Stalled ----- 96931 in-flight CPI 1.6721 -- Total Cycles 162107 ---- Thread 09 ---- PC 5: Stalled ----- 94628 in-flight CPI 1.7129 -- Total Cycles 
162107 ---- Thread 10 ---- PC 5: Stalled ----- 103339 in-flight CPI 1.5684 -- Total Cycles 162107 ---- Thread 11 ---- PC 5: Stalled ----- 102410 in-flight CPI 1.5826 -- Total Cycles 162107 ---- Thread 12 ---- PC 5: Stalled ----- 96510 in-flight CPI 1.6794 -- Total Cycles 162107 ---- Thread 13 ---- PC 5: Stalled ----- 93957 in-flight CPI 1.7251 -- Total Cycles 162107 ---- Thread 14 ---- PC 5: Stalled ----- 100786 in-flight CPI 1.6081 -- Total Cycles 162107 ---- Thread 15 ---- PC 5: Stalled ----- 92036 in-flight CPI 1.7610 -- Total Cycles 162107 ---- Thread 16 ---- PC 5: Stalled ----- 93712 in-flight CPI 1.7295 -- Total Cycles 162107 ---- Thread 17 ---- PC 5: Stalled ----- 95892 in-flight CPI 1.6902 -- Total Cycles 162107 ---- Thread 18 ---- PC 5: Stalled ----- 101951 in-flight CPI 1.5897 -- Total Cycles 162107 ---- Thread 19 ---- PC 5: Stalled ----- 91969 in-flight CPI 1.7623 -- Total Cycles 162107 ---- Thread 20 ---- PC 5: Stalled ----- 86591 in-flight CPI 1.8718 -- Total Cycles 162107 ---- Thread 21 ---- PC 5: Stalled ----- 117010 in-flight CPI 1.3853 -- Total Cycles 162107 ---- Thread 22 ---- PC 5: Stalled ----- 99274 in-flight CPI 1.6327 -- Total Cycles 162107 ---- Thread 23 ---- PC 5: Stalled ----- 93518 in-flight CPI 1.7331 -- Total Cycles 162107 ---- Thread 24 ---- PC 5: Stalled ----- 90530 in-flight CPI 1.7903 -- Total Cycles 162107 ---- Thread 25 ---- PC 5: Stalled ----- 87968 in-flight CPI 1.8425 -- Total Cycles 162107 ---- Thread 26 ---- PC 5: Stalled ----- 87259 in-flight CPI 1.8574 -- Total Cycles 162107 ---- Thread 27 ---- PC 5: Stalled ----- 92844 in-flight CPI 1.7456 -- Total Cycles 162107 ---- Thread 28 ---- PC 5: Stalled ----- 93795 in-flight CPI 1.7280 -- Total Cycles 162107 ---- Thread 29 ---- PC 5: Stalled ----- 87053 in-flight CPI 1.8617 -- Total Cycles 162107 ---- Thread 30 ---- PC 5: Stalled ----- 85894 in-flight CPI 1.8870 -- Total Cycles 162107 ---- Thread 31 ---- PC 5: Stalled ----- 83846 in-flight CPI 1.9331 -- Total Cycles 162107 Total 
CPI 0.0527 , IPC 18.9631 -- Total Cycles 162107 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8639 (3.896602%) FPSUB: 0 (0.000000%) FPMUL: 33522 (15.120024%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 88090 (39.732799%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5459 (2.462270%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) 
beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 78240 (35.289979%) DIV: 7491 (3.378799%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.119528%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3369089 total) ADD%: 7.419 (249936) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.510 (50861) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.596 (20076) FPSUB%: 0.000 (0) FPMUL%: 4.906 (165280) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.200 (175194) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (577) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.080 (36390) FPLE%: 0.452 (15220) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.773 (93418) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.755 (25437) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 
0.000 (0) ADDI%: 15.630 (526584) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.163 (39186) ORI%: 1.591 (53609) XORI%: 0.000 (0) MULI%: 3.169 (106752) LW%: 1.119 (37692) LWI%: 13.390 (451114) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9586) SWI%: 4.026 (135625) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.385 (46653) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10404) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.067 (2272) bned%: 0.000 (0) bneid%: 13.740 (462902) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.710 (23904) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.133 (4465) DIV%: 0.012 (406) FPUN%: 1.459 (49164) FPRSUB%: 4.318 (145480) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (58) FPGT%: 2.930 (98709) FPGE%: 1.008 (33944) SYNC%: 0.000 (0) NOP%: 8.756 (294986) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 40782 INTCONV 0 ATOMIC_INC 24 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1691 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48702 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 12358 XORI 0 MULI 9075 LW 0 LWI 143044 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 
brki 0 rtsd 0 FPDIV 87 DIV 21 FPUN 0 FPRSUB 77 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 18.9633 --Total thread-cycles: 5187424 --total thread-cycles issued: 3074103 (59.260685%) --iCache conflicts: 109333 (2.107655%) --thread*cycles of FU dependence: 256328 (4.941335%) --thread*cycles of data dependence: 221706 (4.273913%) --iCache cycles*banks: 5187424 (64.947862% used) Issue breakdown: --thread*cycles of issue worked: 3074103 (59.260685%) --thread*cycles of issue failed: 1818335 (35.052755%) --thread*cycles of issue NOP/other: 294986 (5.686560%) Number of thread-cycles not ready: 221706 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3369089 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 6 4: 6 5: 8 6: 9 7: 8 8: 8 9: 6 10: 9 11: 8 12: 8 13: 6 14: 8 15: 8 16: 7 17: 8 18: 9 19: 7 20: 6 21: 5 22: 6 23: 7 24: 8 25: 7 26: 7 27: 8 28: 8 29: 8 30: 6 31: 6 <=== Core 13 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101406 in-flight CPI 1.2809 -- Total Cycles 129919 ---- Thread 01 ---- PC 5: Stalled ----- 107456 in-flight CPI 1.2088 -- Total Cycles 129919 ---- Thread 02 ---- PC 5: Stalled ----- 94342 in-flight CPI 1.3769 -- Total Cycles 129919 ---- Thread 03 ---- PC 5: Stalled ----- 98903 in-flight CPI 1.3133 -- Total Cycles 129919 ---- Thread 04 ---- PC 5: Stalled ----- 106717 in-flight CPI 1.2172 -- Total Cycles 129919 ---- Thread 05 ---- PC 5: Stalled ----- 98811 in-flight CPI 1.3146 -- Total Cycles 129919 ---- Thread 06 ---- PC 5: Stalled ----- 99609 in-flight CPI 1.3040 -- Total Cycles 129919 ---- Thread 07 ---- PC 5: Stalled ----- 102853 in-flight CPI 1.2629 -- Total Cycles 129919 ---- Thread 08 ---- PC 5: Stalled ----- 97725 in-flight CPI 1.3292 -- Total Cycles 129919 ---- Thread 09 ---- PC 5: Stalled ----- 89997 in-flight CPI 1.4434 -- Total Cycles 129919 ---- Thread 10 ---- PC 5: Stalled ----- 100395 in-flight CPI 1.2938 -- Total Cycles 
129919 ---- Thread 11 ---- PC 5: Stalled ----- 100394 in-flight CPI 1.2938 -- Total Cycles 129919 ---- Thread 12 ---- PC 5: Stalled ----- 92533 in-flight CPI 1.4037 -- Total Cycles 129919 ---- Thread 13 ---- PC 5: Stalled ----- 101660 in-flight CPI 1.2777 -- Total Cycles 129919 ---- Thread 14 ---- PC 5: Stalled ----- 95236 in-flight CPI 1.3639 -- Total Cycles 129919 ---- Thread 15 ---- PC 5: Stalled ----- 87773 in-flight CPI 1.4800 -- Total Cycles 129919 ---- Thread 16 ---- PC 5: Stalled ----- 99198 in-flight CPI 1.3095 -- Total Cycles 129919 ---- Thread 17 ---- PC 5: Stalled ----- 92449 in-flight CPI 1.4051 -- Total Cycles 129919 ---- Thread 18 ---- PC 5: Stalled ----- 91811 in-flight CPI 1.4149 -- Total Cycles 129919 ---- Thread 19 ---- PC 5: Stalled ----- 98966 in-flight CPI 1.3125 -- Total Cycles 129919 ---- Thread 20 ---- PC 5: Stalled ----- 91797 in-flight CPI 1.4150 -- Total Cycles 129919 ---- Thread 21 ---- PC 5: Stalled ----- 89749 in-flight CPI 1.4473 -- Total Cycles 129919 ---- Thread 22 ---- PC 5: Stalled ----- 94265 in-flight CPI 1.3780 -- Total Cycles 129919 ---- Thread 23 ---- PC 5: Stalled ----- 94395 in-flight CPI 1.3761 -- Total Cycles 129919 ---- Thread 24 ---- PC 5: Stalled ----- 93569 in-flight CPI 1.3882 -- Total Cycles 129919 ---- Thread 25 ---- PC 5: Stalled ----- 92138 in-flight CPI 1.4098 -- Total Cycles 129919 ---- Thread 26 ---- PC 5: Stalled ----- 88937 in-flight CPI 1.4605 -- Total Cycles 129919 ---- Thread 27 ---- PC 5: Stalled ----- 93124 in-flight CPI 1.3948 -- Total Cycles 129919 ---- Thread 28 ---- PC 5: Stalled ----- 90449 in-flight CPI 1.4361 -- Total Cycles 129919 ---- Thread 29 ---- PC 5: Stalled ----- 92810 in-flight CPI 1.3996 -- Total Cycles 129919 ---- Thread 30 ---- PC 5: Stalled ----- 88132 in-flight CPI 1.4739 -- Total Cycles 129919 ---- Thread 31 ---- PC 5: Stalled ----- 91553 in-flight CPI 1.4188 -- Total Cycles 129919 Total CPI 0.0425 , IPC 23.5509 -- Total Cycles 129919 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 
8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7690 (3.930287%) FPSUB: 0 (0.000000%) FPMUL: 31672 (16.187264%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 71741 (36.666156%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5730 (2.928550%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 
(0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70992 (36.283349%) DIV: 7567 (3.867423%) FPUN: 0 (0.000000%) FPRSUB: 268 (0.136972%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3353383 total) ADD%: 7.436 (249354) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.520 (50963) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.542 (18183) FPSUB%: 0.000 (0) FPMUL%: 4.751 (159316) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.134 (172165) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (594) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (35580) FPLE%: 0.453 (15191) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.812 (94285) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.740 (24824) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.676 (525660) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) 
RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39316) ORI%: 1.556 (52189) XORI%: 0.000 (0) MULI%: 3.210 (107652) LW%: 1.134 (38042) LWI%: 13.508 (452962) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9639) SWI%: 4.067 (136388) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.406 (47134) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10380) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1984) bned%: 0.000 (0) bneid%: 13.798 (462707) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (24068) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (4037) DIV%: 0.012 (410) FPUN%: 1.476 (49484) FPRSUB%: 4.198 (140764) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.949 (98898) FPGE%: 1.023 (34293) SYNC%: 0.000 (0) NOP%: 8.756 (293616) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 7 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 39378 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 9 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1384 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49097 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 10927 XORI 0 MULI 9751 LW 0 LWI 143149 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 23 FPUN 0 FPRSUB 51 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 
HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5512 --Total thread-cycles: 4157408 --total thread-cycles issued: 3059767 (73.597949%) --iCache conflicts: 111395 (2.679434%) --thread*cycles of FU dependence: 254332 (6.117562%) --thread*cycles of data dependence: 195660 (4.706298%) --iCache cycles*banks: 4157408 (80.661196% used) Issue breakdown: --thread*cycles of issue worked: 3059767 (73.597949%) --thread*cycles of issue failed: 804025 (19.339574%) --thread*cycles of issue NOP/other: 293616 (7.062477%) Number of thread-cycles not ready: 195660 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3353383 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 7 3: 8 4: 9 5: 8 6: 8 7: 8 8: 8 9: 6 10: 8 11: 8 12: 8 13: 8 14: 7 15: 5 16: 7 17: 7 18: 6 19: 9 20: 8 21: 7 22: 7 23: 7 24: 7 25: 7 26: 7 27: 8 28: 7 29: 7 30: 6 31: 7 <=== Core 14 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100374 in-flight CPI 1.2742 -- Total Cycles 127925 ---- Thread 01 ---- PC 5: Stalled ----- 98664 in-flight CPI 1.2963 -- Total Cycles 127925 ---- Thread 02 ---- PC 5: Stalled ----- 103941 in-flight CPI 1.2305 -- Total Cycles 127925 ---- Thread 03 ---- PC 5: Stalled ----- 96243 in-flight CPI 1.3289 -- Total Cycles 127925 ---- Thread 04 ---- PC 5: Stalled ----- 98089 in-flight CPI 1.3039 -- Total Cycles 127925 ---- Thread 05 ---- PC 5: Stalled ----- 102264 in-flight CPI 1.2507 -- Total Cycles 127925 ---- Thread 06 ---- PC 5: Stalled ----- 99360 in-flight CPI 1.2872 -- Total Cycles 127925 ---- Thread 07 ---- PC 5: Stalled ----- 97291 in-flight CPI 1.3147 -- Total Cycles 127925 ---- Thread 08 ---- PC 5: Stalled ----- 95457 in-flight CPI 1.3399 -- Total Cycles 127925 ---- Thread 09 ---- PC 5: Stalled ----- 100536 in-flight CPI 1.2721 -- Total Cycles 127925 ---- Thread 10 ---- PC 5: Stalled ----- 98681 in-flight CPI 1.2961 -- Total Cycles 127925 ---- Thread 11 ---- PC 5: Stalled ----- 94922 in-flight CPI 1.3475 -- Total Cycles 
127925 ---- Thread 12 ---- PC 5: Stalled ----- 96630 in-flight CPI 1.3236 -- Total Cycles 127925 ---- Thread 13 ---- PC 5: Stalled ----- 90094 in-flight CPI 1.4197 -- Total Cycles 127925 ---- Thread 14 ---- PC 5: Stalled ----- 95490 in-flight CPI 1.3394 -- Total Cycles 127925 ---- Thread 15 ---- PC 5: Stalled ----- 97971 in-flight CPI 1.3055 -- Total Cycles 127925 ---- Thread 16 ---- PC 5: Stalled ----- 98641 in-flight CPI 1.2966 -- Total Cycles 127925 ---- Thread 17 ---- PC 5: Stalled ----- 97973 in-flight CPI 1.3055 -- Total Cycles 127925 ---- Thread 18 ---- PC 5: Stalled ----- 94563 in-flight CPI 1.3526 -- Total Cycles 127925 ---- Thread 19 ---- PC 5: Stalled ----- 94891 in-flight CPI 1.3479 -- Total Cycles 127925 ---- Thread 20 ---- PC 5: Stalled ----- 89968 in-flight CPI 1.4216 -- Total Cycles 127925 ---- Thread 21 ---- PC 5: Stalled ----- 91638 in-flight CPI 1.3957 -- Total Cycles 127925 ---- Thread 22 ---- PC 5: Stalled ----- 89520 in-flight CPI 1.4288 -- Total Cycles 127925 ---- Thread 23 ---- PC 5: Stalled ----- 90949 in-flight CPI 1.4063 -- Total Cycles 127925 ---- Thread 24 ---- PC 5: Stalled ----- 95715 in-flight CPI 1.3363 -- Total Cycles 127925 ---- Thread 25 ---- PC 5: Stalled ----- 93853 in-flight CPI 1.3628 -- Total Cycles 127925 ---- Thread 26 ---- PC 5: Stalled ----- 87459 in-flight CPI 1.4624 -- Total Cycles 127925 ---- Thread 27 ---- PC 5: Stalled ----- 85930 in-flight CPI 1.4884 -- Total Cycles 127925 ---- Thread 28 ---- PC 5: Stalled ----- 88378 in-flight CPI 1.4472 -- Total Cycles 127925 ---- Thread 29 ---- PC 5: Stalled ----- 93963 in-flight CPI 1.3612 -- Total Cycles 127925 ---- Thread 30 ---- PC 5: Stalled ----- 90544 in-flight CPI 1.4125 -- Total Cycles 127925 ---- Thread 31 ---- PC 5: Stalled ----- 87314 in-flight CPI 1.4649 -- Total Cycles 127925 Total CPI 0.0421 , IPC 23.7472 -- Total Cycles 127925 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls 
(caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7904 (4.006082%) FPSUB: 0 (0.000000%) FPMUL: 31947 (16.192093%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 71972 (36.478459%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5508 (2.791688%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 
0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72183 (36.585403%) DIV: 7520 (3.811455%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.134820%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3329323 total) ADD%: 7.441 (247728) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.536 (51153) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.557 (18550) FPSUB%: 0.000 (0) FPMUL%: 4.790 (159466) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.138 (171060) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (581) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35526) FPLE%: 0.457 (15211) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.794 (93029) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.745 (24788) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.660 (521387) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 
1.169 (38930) ORI%: 1.573 (52376) XORI%: 0.000 (0) MULI%: 3.195 (106362) LW%: 1.127 (37538) LWI%: 13.449 (447760) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9530) SWI%: 4.046 (134711) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.396 (46478) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10292) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1956) bned%: 0.000 (0) bneid%: 13.800 (459445) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (23989) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4109) DIV%: 0.012 (408) FPUN%: 1.489 (49557) FPRSUB%: 4.224 (140626) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (53) FPGT%: 2.936 (97753) FPGE%: 1.032 (34346) SYNC%: 0.000 (0) NOP%: 8.753 (291405) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 39870 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 11 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1949 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48504 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 16 ORI 11245 XORI 0 MULI 9670 LW 0 LWI 141859 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 78 DIV 23 FPUN 0 FPRSUB 52 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7475 --Total thread-cycles: 
4093600 --total thread-cycles issued: 3037918 (74.211403%) --iCache conflicts: 111110 (2.714237%) --thread*cycles of FU dependence: 253749 (6.198676%) --thread*cycles of data dependence: 197300 (4.819719%) --iCache cycles*banks: 4093600 (81.330736% used) Issue breakdown: --thread*cycles of issue worked: 3037918 (74.211403%) --thread*cycles of issue failed: 764277 (18.670046%) --thread*cycles of issue NOP/other: 291405 (7.118551%) Number of thread-cycles not ready: 197300 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3329323 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 9 5: 8 6: 8 7: 6 8: 6 9: 9 10: 8 11: 6 12: 8 13: 5 14: 7 15: 8 16: 8 17: 8 18: 6 19: 7 20: 7 21: 7 22: 7 23: 8 24: 8 25: 8 26: 7 27: 7 28: 7 29: 7 30: 8 31: 6 <=== Core 15 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99964 in-flight CPI 1.2858 -- Total Cycles 128555 ---- Thread 01 ---- PC 5: Stalled ----- 100211 in-flight CPI 1.2826 -- Total Cycles 128555 ---- Thread 02 ---- PC 5: Stalled ----- 98968 in-flight CPI 1.2987 -- Total Cycles 128555 ---- Thread 03 ---- PC 5: Stalled ----- 99099 in-flight CPI 1.2970 -- Total Cycles 128555 ---- Thread 04 ---- PC 5: Stalled ----- 100743 in-flight CPI 1.2758 -- Total Cycles 128555 ---- Thread 05 ---- PC 5: Stalled ----- 89669 in-flight CPI 1.4334 -- Total Cycles 128555 ---- Thread 06 ---- PC 5: Stalled ----- 102456 in-flight CPI 1.2545 -- Total Cycles 128555 ---- Thread 07 ---- PC 5: Stalled ----- 97162 in-flight CPI 1.3229 -- Total Cycles 128555 ---- Thread 08 ---- PC 5: Stalled ----- 93782 in-flight CPI 1.3705 -- Total Cycles 128555 ---- Thread 09 ---- PC 5: Stalled ----- 99943 in-flight CPI 1.2861 -- Total Cycles 128555 ---- Thread 10 ---- PC 5: Stalled ----- 98442 in-flight CPI 1.3056 -- Total Cycles 128555 ---- Thread 11 ---- PC 5: Stalled ----- 101663 in-flight CPI 1.2643 -- Total Cycles 128555 ---- Thread 12 ---- PC 5: Stalled ----- 100555 in-flight CPI 1.2782 -- Total Cycles 
128555 ---- Thread 13 ---- PC 5: Stalled ----- 97042 in-flight CPI 1.3246 -- Total Cycles 128555 ---- Thread 14 ---- PC 5: Stalled ----- 88500 in-flight CPI 1.4524 -- Total Cycles 128555 ---- Thread 15 ---- PC 5: Stalled ----- 94453 in-flight CPI 1.3608 -- Total Cycles 128555 ---- Thread 16 ---- PC 5: Stalled ----- 91845 in-flight CPI 1.3994 -- Total Cycles 128555 ---- Thread 17 ---- PC 5: Stalled ----- 96819 in-flight CPI 1.3276 -- Total Cycles 128555 ---- Thread 18 ---- PC 5: Stalled ----- 97775 in-flight CPI 1.3146 -- Total Cycles 128555 ---- Thread 19 ---- PC 5: Stalled ----- 88757 in-flight CPI 1.4481 -- Total Cycles 128555 ---- Thread 20 ---- PC 5: Stalled ----- 96724 in-flight CPI 1.3289 -- Total Cycles 128555 ---- Thread 21 ---- PC 5: Stalled ----- 88868 in-flight CPI 1.4463 -- Total Cycles 128555 ---- Thread 22 ---- PC 5: Stalled ----- 89320 in-flight CPI 1.4390 -- Total Cycles 128555 ---- Thread 23 ---- PC 5: Stalled ----- 95474 in-flight CPI 1.3462 -- Total Cycles 128555 ---- Thread 24 ---- PC 5: Stalled ----- 94790 in-flight CPI 1.3560 -- Total Cycles 128555 ---- Thread 25 ---- PC 5: Stalled ----- 95118 in-flight CPI 1.3513 -- Total Cycles 128555 ---- Thread 26 ---- PC 5: Stalled ----- 92858 in-flight CPI 1.3842 -- Total Cycles 128555 ---- Thread 27 ---- PC 5: Stalled ----- 92726 in-flight CPI 1.3861 -- Total Cycles 128555 ---- Thread 28 ---- PC 5: Stalled ----- 93439 in-flight CPI 1.3756 -- Total Cycles 128555 ---- Thread 29 ---- PC 5: Stalled ----- 89204 in-flight CPI 1.4409 -- Total Cycles 128555 ---- Thread 30 ---- PC 5: Stalled ----- 81344 in-flight CPI 1.5802 -- Total Cycles 128555 ---- Thread 31 ---- PC 5: Stalled ----- 90279 in-flight CPI 1.4237 -- Total Cycles 128555 Total CPI 0.0423 , IPC 23.6359 -- Total Cycles 128555 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 
(0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8441 (3.818075%) FPSUB: 0 (0.000000%) FPMUL: 32792 (14.832640%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 90948 (41.138050%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5248 (2.373801%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) 
brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 76277 (34.501990%) DIV: 7121 (3.221006%) FPUN: 0 (0.000000%) FPRSUB: 253 (0.114438%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3329976 total) ADD%: 7.344 (244562) SUB%: 0.000 (0) MUL%: 0.006 (193) BITOR%: 1.524 (50754) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.588 (19571) FPSUB%: 0.000 (0) FPMUL%: 4.886 (162709) FPCMPLT%: 0.000 (0) FPMIN%: 0.017 (579) FPMAX%: 0.017 (579) LOAD%: 5.203 (173273) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (225) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (555) FPINV%: 0.000 (0) FPCONV%: 0.018 (611) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.076 (35839) FPLE%: 0.455 (15149) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.017 (579) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.780 (92568) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.762 (25384) CMPU%: 0.000 (0) RSUB%: 0.006 (193) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.663 (521565) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (38999) ORI%: 1.588 (52879) XORI%: 0.000 (0) MULI%: 3.172 (105638) LW%: 
1.121 (37336) LWI%: 13.390 (445900) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9486) SWI%: 4.025 (134040) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.389 (46237) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10308) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2077) bned%: 0.000 (0) bneid%: 13.766 (458396) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.713 (23756) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.131 (4357) DIV%: 0.012 (386) FPUN%: 1.468 (48898) FPRSUB%: 4.312 (143602) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (64) FPGT%: 2.927 (97479) FPGE%: 1.013 (33749) SYNC%: 0.000 (0) NOP%: 8.751 (291405) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 30 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 373 LOAD 41458 INTCONV 0 ATOMIC_INC 32 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1401 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48358 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 12080 XORI 0 MULI 8769 LW 0 LWI 141296 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 90 DIV 21 FPUN 0 FPRSUB 62 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6362 --Total thread-cycles: 4113760 --total thread-cycles issued: 3038571 (73.863594%) --iCache conflicts: 
110248 (2.679981%) --thread*cycles of FU dependence: 254039 (6.175348%) --thread*cycles of data dependence: 221080 (5.374159%) --iCache cycles*banks: 4113760 (80.948038% used) Issue breakdown: --thread*cycles of issue worked: 3038571 (73.863594%) --thread*cycles of issue failed: 783784 (19.052740%) --thread*cycles of issue NOP/other: 291405 (7.083666%) Number of thread-cycles not ready: 221080 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3329976 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 9 3: 8 4: 8 5: 6 6: 8 7: 6 8: 7 9: 7 10: 9 11: 8 12: 8 13: 5 14: 5 15: 7 16: 7 17: 7 18: 7 19: 7 20: 7 21: 7 22: 7 23: 8 24: 6 25: 7 26: 6 27: 8 28: 7 29: 6 30: 5 31: 7 <=== Core 16 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95149 in-flight CPI 1.3640 -- Total Cycles 129809 ---- Thread 01 ---- PC 5: Stalled ----- 99195 in-flight CPI 1.3084 -- Total Cycles 129809 ---- Thread 02 ---- PC 5: Stalled ----- 102700 in-flight CPI 1.2637 -- Total Cycles 129809 ---- Thread 03 ---- PC 5: Stalled ----- 100853 in-flight CPI 1.2869 -- Total Cycles 129809 ---- Thread 04 ---- PC 5: Stalled ----- 99511 in-flight CPI 1.3042 -- Total Cycles 129809 ---- Thread 05 ---- PC 5: Stalled ----- 107253 in-flight CPI 1.2100 -- Total Cycles 129809 ---- Thread 06 ---- PC 5: Stalled ----- 88456 in-flight CPI 1.4673 -- Total Cycles 129809 ---- Thread 07 ---- PC 5: Stalled ----- 98187 in-flight CPI 1.3218 -- Total Cycles 129809 ---- Thread 08 ---- PC 5: Stalled ----- 94094 in-flight CPI 1.3793 -- Total Cycles 129809 ---- Thread 09 ---- PC 5: Stalled ----- 98150 in-flight CPI 1.3223 -- Total Cycles 129809 ---- Thread 10 ---- PC 5: Stalled ----- 100628 in-flight CPI 1.2897 -- Total Cycles 129809 ---- Thread 11 ---- PC 5: Stalled ----- 91233 in-flight CPI 1.4226 -- Total Cycles 129809 ---- Thread 12 ---- PC 5: Stalled ----- 97131 in-flight CPI 1.3362 -- Total Cycles 129809 ---- Thread 13 ---- PC 5: Stalled ----- 96345 in-flight CPI 1.3471 -- Total 
Cycles 129809 ---- Thread 14 ---- PC 5: Stalled ----- 92183 in-flight CPI 1.4079 -- Total Cycles 129809 ---- Thread 15 ---- PC 5: Stalled ----- 96250 in-flight CPI 1.3485 -- Total Cycles 129809 ---- Thread 16 ---- PC 5: Stalled ----- 95681 in-flight CPI 1.3565 -- Total Cycles 129809 ---- Thread 17 ---- PC 5: Stalled ----- 96889 in-flight CPI 1.3395 -- Total Cycles 129809 ---- Thread 18 ---- PC 5: Stalled ----- 98481 in-flight CPI 1.3179 -- Total Cycles 129809 ---- Thread 19 ---- PC 5: Stalled ----- 92731 in-flight CPI 1.3996 -- Total Cycles 129809 ---- Thread 20 ---- PC 5: Stalled ----- 94602 in-flight CPI 1.3719 -- Total Cycles 129809 ---- Thread 21 ---- PC 5: Stalled ----- 91995 in-flight CPI 1.4108 -- Total Cycles 129809 ---- Thread 22 ---- PC 5: Stalled ----- 89432 in-flight CPI 1.4512 -- Total Cycles 129809 ---- Thread 23 ---- PC 5: Stalled ----- 90276 in-flight CPI 1.4377 -- Total Cycles 129809 ---- Thread 24 ---- PC 5: Stalled ----- 88065 in-flight CPI 1.4738 -- Total Cycles 129809 ---- Thread 25 ---- PC 5: Stalled ----- 93193 in-flight CPI 1.3926 -- Total Cycles 129809 ---- Thread 26 ---- PC 5: Stalled ----- 87713 in-flight CPI 1.4797 -- Total Cycles 129809 ---- Thread 27 ---- PC 5: Stalled ----- 87418 in-flight CPI 1.4846 -- Total Cycles 129809 ---- Thread 28 ---- PC 5: Stalled ----- 89429 in-flight CPI 1.4513 -- Total Cycles 129809 ---- Thread 29 ---- PC 5: Stalled ----- 88883 in-flight CPI 1.4602 -- Total Cycles 129809 ---- Thread 30 ---- PC 5: Stalled ----- 89653 in-flight CPI 1.4476 -- Total Cycles 129809 ---- Thread 31 ---- PC 5: Stalled ----- 84935 in-flight CPI 1.5281 -- Total Cycles 129809 Total CPI 0.0430 , IPC 23.2437 -- Total Cycles 129809 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 
7864 (3.851183%) FPSUB: 0 (0.000000%) FPMUL: 31754 (15.550669%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79647 (39.004980%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5736 (2.809052%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 
(0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71589 (35.058791%) DIV: 7346 (3.597506%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.127818%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3306177 total) ADD%: 7.415 (245150) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.528 (50525) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.560 (18531) FPSUB%: 0.000 (0) FPMUL%: 4.797 (158589) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.144 (170074) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (587) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (35215) FPLE%: 0.452 (14953) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.802 (92634) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (24836) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.660 (517763) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (38848) ORI%: 1.574 (52043) XORI%: 0.000 (0) MULI%: 3.197 (105698) LW%: 1.130 (37372) LWI%: 13.476 (445526) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9415) 
SWI%: 4.063 (134345) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46370) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10186) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1904) bned%: 0.000 (0) bneid%: 13.784 (455720) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (23811) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4076) DIV%: 0.012 (398) FPUN%: 1.478 (48850) FPRSUB%: 4.227 (139755) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (78) FPGT%: 2.934 (96997) FPGE%: 1.025 (33897) SYNC%: 0.000 (0) NOP%: 8.738 (288886) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 30 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 386 LOAD 39479 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 9 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1511 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48256 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 16 ORI 11227 XORI 0 MULI 9098 LW 0 LWI 140740 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 20 FPUN 0 FPRSUB 65 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.2439 --Total thread-cycles: 4153888 --total thread-cycles issued: 3017291 (72.637755%) --iCache conflicts: 109125 (2.627057%) --thread*cycles of FU dependence: 250966 (6.041713%) --thread*cycles of 
data dependence: 204197 (4.915804%) --iCache cycles*banks: 4153888 (79.593119% used) Issue breakdown: --thread*cycles of issue worked: 3017291 (72.637755%) --thread*cycles of issue failed: 847711 (20.407652%) --thread*cycles of issue NOP/other: 288886 (6.954593%) Number of thread-cycles not ready: 204197 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3306177 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 8 4: 8 5: 10 6: 6 7: 8 8: 7 9: 8 10: 8 11: 6 12: 7 13: 7 14: 7 15: 6 16: 7 17: 7 18: 8 19: 7 20: 7 21: 6 22: 7 23: 7 24: 6 25: 8 26: 6 27: 7 28: 7 29: 7 30: 7 31: 6 <=== Core 17 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100636 in-flight CPI 1.4222 -- Total Cycles 143144 ---- Thread 01 ---- PC 5: Stalled ----- 98567 in-flight CPI 1.4519 -- Total Cycles 143144 ---- Thread 02 ---- PC 5: Stalled ----- 100317 in-flight CPI 1.4266 -- Total Cycles 143144 ---- Thread 03 ---- PC 5: Stalled ----- 100395 in-flight CPI 1.4255 -- Total Cycles 143144 ---- Thread 04 ---- PC 5: Stalled ----- 100638 in-flight CPI 1.4221 -- Total Cycles 143144 ---- Thread 05 ---- PC 5: Stalled ----- 97901 in-flight CPI 1.4618 -- Total Cycles 143144 ---- Thread 06 ---- PC 5: Stalled ----- 106750 in-flight CPI 1.3408 -- Total Cycles 143144 ---- Thread 07 ---- PC 5: Stalled ----- 99442 in-flight CPI 1.4391 -- Total Cycles 143144 ---- Thread 08 ---- PC 5: Stalled ----- 100778 in-flight CPI 1.4201 -- Total Cycles 143144 ---- Thread 09 ---- PC 5: Stalled ----- 102635 in-flight CPI 1.3944 -- Total Cycles 143144 ---- Thread 10 ---- PC 5: Stalled ----- 97067 in-flight CPI 1.4744 -- Total Cycles 143144 ---- Thread 11 ---- PC 5: Stalled ----- 100421 in-flight CPI 1.4252 -- Total Cycles 143144 ---- Thread 12 ---- PC 5: Stalled ----- 95787 in-flight CPI 1.4941 -- Total Cycles 143144 ---- Thread 13 ---- PC 5: Stalled ----- 99169 in-flight CPI 1.4431 -- Total Cycles 143144 ---- Thread 14 ---- PC 5: Stalled ----- 92250 in-flight CPI 1.5514 -- 
Total Cycles 143144 ---- Thread 15 ---- PC 5: Stalled ----- 98305 in-flight CPI 1.4558 -- Total Cycles 143144 ---- Thread 16 ---- PC 5: Stalled ----- 90413 in-flight CPI 1.5830 -- Total Cycles 143144 ---- Thread 17 ---- PC 5: Stalled ----- 99018 in-flight CPI 1.4453 -- Total Cycles 143144 ---- Thread 18 ---- PC 5: Stalled ----- 97614 in-flight CPI 1.4661 -- Total Cycles 143144 ---- Thread 19 ---- PC 5: Stalled ----- 93283 in-flight CPI 1.5342 -- Total Cycles 143144 ---- Thread 20 ---- PC 5: Stalled ----- 96029 in-flight CPI 1.4904 -- Total Cycles 143144 ---- Thread 21 ---- PC 5: Stalled ----- 96083 in-flight CPI 1.4894 -- Total Cycles 143144 ---- Thread 22 ---- PC 5: Stalled ----- 88559 in-flight CPI 1.6162 -- Total Cycles 143144 ---- Thread 23 ---- PC 5: Stalled ----- 89114 in-flight CPI 1.6060 -- Total Cycles 143144 ---- Thread 24 ---- PC 5: Stalled ----- 90288 in-flight CPI 1.5852 -- Total Cycles 143144 ---- Thread 25 ---- PC 5: Stalled ----- 96569 in-flight CPI 1.4820 -- Total Cycles 143144 ---- Thread 26 ---- PC 5: Stalled ----- 88692 in-flight CPI 1.6137 -- Total Cycles 143144 ---- Thread 27 ---- PC 5: Stalled ----- 88148 in-flight CPI 1.6237 -- Total Cycles 143144 ---- Thread 28 ---- PC 5: Stalled ----- 91731 in-flight CPI 1.5602 -- Total Cycles 143144 ---- Thread 29 ---- PC 5: Stalled ----- 86196 in-flight CPI 1.6604 -- Total Cycles 143144 ---- Thread 30 ---- PC 5: Stalled ----- 84946 in-flight CPI 1.6848 -- Total Cycles 143144 ---- Thread 31 ---- PC 5: Stalled ----- 93826 in-flight CPI 1.5254 -- Total Cycles 143144 Total CPI 0.0467 , IPC 21.3919 -- Total Cycles 143144 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7345 (3.589546%) FPSUB: 0 (0.000000%) FPMUL: 30880 (15.091241%) FPCMPLT: 0 (0.000000%) 
FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 85400 (41.735493%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5593 (2.733333%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 
(0.000000%) rtsd: 0 (0.000000%) FPDIV: 67616 (33.044345%) DIV: 7522 (3.676047%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.129996%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3355309 total) ADD%: 7.451 (250011) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.534 (51471) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.515 (17289) FPSUB%: 0.000 (0) FPMUL%: 4.672 (156751) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.120 (171796) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (585) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.047 (35146) FPLE%: 0.460 (15432) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.834 (95074) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.739 (24784) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.722 (527522) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.183 (39689) ORI%: 1.534 (51458) XORI%: 0.000 (0) MULI%: 3.227 (108266) LW%: 1.143 (38356) LWI%: 13.535 (454155) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9741) SWI%: 4.090 (137243) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.416 
(47498) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10457) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.051 (1703) bned%: 0.000 (0) bneid%: 13.821 (463738) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (24250) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.114 (3839) DIV%: 0.012 (408) FPUN%: 1.488 (49934) FPRSUB%: 4.138 (138850) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.949 (98939) FPGE%: 1.028 (34502) SYNC%: 0.000 (0) NOP%: 8.736 (293130) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 38882 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 9 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1571 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49323 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 10402 XORI 0 MULI 9568 LW 0 LWI 143380 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 56 DIV 34 FPUN 0 FPRSUB 54 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.3921 --Total thread-cycles: 4580608 --total thread-cycles issued: 3062179 (66.850929%) --iCache conflicts: 111628 (2.436969%) --thread*cycles of FU dependence: 253767 (5.540029%) --thread*cycles of data dependence: 204622 (4.467136%) --iCache cycles*banks: 4580608 (73.250996% used) 
Issue breakdown: --thread*cycles of issue worked: 3062179 (66.850929%) --thread*cycles of issue failed: 1225299 (26.749702%) --thread*cycles of issue NOP/other: 293130 (6.399369%) Number of thread-cycles not ready: 204622 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3355309 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 8 3: 8 4: 7 5: 8 6: 6 7: 9 8: 8 9: 9 10: 8 11: 8 12: 7 13: 9 14: 7 15: 8 16: 5 17: 9 18: 8 19: 7 20: 7 21: 9 22: 5 23: 7 24: 6 25: 8 26: 6 27: 6 28: 7 29: 6 30: 7 31: 7 <=== Core 18 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99168 in-flight CPI 1.4364 -- Total Cycles 142473 ---- Thread 01 ---- PC 5: Stalled ----- 97383 in-flight CPI 1.4628 -- Total Cycles 142473 ---- Thread 02 ---- PC 5: Stalled ----- 98045 in-flight CPI 1.4528 -- Total Cycles 142473 ---- Thread 03 ---- PC 5: Stalled ----- 98483 in-flight CPI 1.4464 -- Total Cycles 142473 ---- Thread 04 ---- PC 5: Stalled ----- 95350 in-flight CPI 1.4940 -- Total Cycles 142473 ---- Thread 05 ---- PC 5: Stalled ----- 96688 in-flight CPI 1.4733 -- Total Cycles 142473 ---- Thread 06 ---- PC 5: Stalled ----- 99897 in-flight CPI 1.4259 -- Total Cycles 142473 ---- Thread 07 ---- PC 5: Stalled ----- 98124 in-flight CPI 1.4517 -- Total Cycles 142473 ---- Thread 08 ---- PC 5: Stalled ----- 99032 in-flight CPI 1.4384 -- Total Cycles 142473 ---- Thread 09 ---- PC 5: Stalled ----- 99632 in-flight CPI 1.4297 -- Total Cycles 142473 ---- Thread 10 ---- PC 5: Stalled ----- 99592 in-flight CPI 1.4303 -- Total Cycles 142473 ---- Thread 11 ---- PC 5: Stalled ----- 98859 in-flight CPI 1.4409 -- Total Cycles 142473 ---- Thread 12 ---- PC 5: Stalled ----- 96681 in-flight CPI 1.4734 -- Total Cycles 142473 ---- Thread 13 ---- PC 5: Stalled ----- 99913 in-flight CPI 1.4256 -- Total Cycles 142473 ---- Thread 14 ---- PC 5: Stalled ----- 91419 in-flight CPI 1.5582 -- Total Cycles 142473 ---- Thread 15 ---- PC 5: Stalled ----- 96441 in-flight CPI 1.4770 -- Total 
Cycles 142473 ---- Thread 16 ---- PC 5: Stalled ----- 93174 in-flight CPI 1.5288 -- Total Cycles 142473 ---- Thread 17 ---- PC 5: Stalled ----- 90177 in-flight CPI 1.5796 -- Total Cycles 142473 ---- Thread 18 ---- PC 5: Stalled ----- 100296 in-flight CPI 1.4203 -- Total Cycles 142473 ---- Thread 19 ---- PC 5: Stalled ----- 95305 in-flight CPI 1.4946 -- Total Cycles 142473 ---- Thread 20 ---- PC 5: Stalled ----- 95080 in-flight CPI 1.4981 -- Total Cycles 142473 ---- Thread 21 ---- PC 5: Stalled ----- 97659 in-flight CPI 1.4586 -- Total Cycles 142473 ---- Thread 22 ---- PC 5: Stalled ----- 93804 in-flight CPI 1.5185 -- Total Cycles 142473 ---- Thread 23 ---- PC 5: Stalled ----- 92587 in-flight CPI 1.5385 -- Total Cycles 142473 ---- Thread 24 ---- PC 5: Stalled ----- 94305 in-flight CPI 1.5105 -- Total Cycles 142473 ---- Thread 25 ---- PC 5: Stalled ----- 92922 in-flight CPI 1.5329 -- Total Cycles 142473 ---- Thread 26 ---- PC 5: Stalled ----- 95043 in-flight CPI 1.4987 -- Total Cycles 142473 ---- Thread 27 ---- PC 5: Stalled ----- 101856 in-flight CPI 1.3986 -- Total Cycles 142473 ---- Thread 28 ---- PC 5: Stalled ----- 88792 in-flight CPI 1.6043 -- Total Cycles 142473 ---- Thread 29 ---- PC 5: Stalled ----- 88087 in-flight CPI 1.6172 -- Total Cycles 142473 ---- Thread 30 ---- PC 5: Stalled ----- 88531 in-flight CPI 1.6091 -- Total Cycles 142473 ---- Thread 31 ---- PC 5: Stalled ----- 88753 in-flight CPI 1.6050 -- Total Cycles 142473 Total CPI 0.0465 , IPC 21.4893 -- Total Cycles 142473 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7794 (3.915461%) FPSUB: 0 (0.000000%) FPMUL: 31891 (16.021039%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 74317 (37.334532%) INTCONV: 0 (0.000000%) 
ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5772 (2.899672%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71333 (35.835464%) DIV: 7678 (3.857187%) FPUN: 0 (0.000000%) 
FPRSUB: 272 (0.136644%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3355680 total) ADD%: 7.455 (250150) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.534 (51460) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (18420) FPSUB%: 0.000 (0) FPMUL%: 4.763 (159835) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.113 (171588) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (599) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (35733) FPLE%: 0.453 (15213) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.801 (93988) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.739 (24796) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.655 (525329) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.168 (39189) ORI%: 1.575 (52842) XORI%: 0.000 (0) MULI%: 3.203 (107472) LW%: 1.130 (37928) LWI%: 13.476 (452225) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9606) SWI%: 4.057 (136132) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (46989) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10369) bled%: 0.000 (0) 
bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (2012) bned%: 0.000 (0) bneid%: 13.810 (463426) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.722 (24237) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4055) DIV%: 0.012 (416) FPUN%: 1.490 (49991) FPRSUB%: 4.196 (140799) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.943 (98774) FPGE%: 1.036 (34778) SYNC%: 0.000 (0) NOP%: 8.761 (293978) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 31 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 4 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 39616 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1749 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48930 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 11118 XORI 0 MULI 9461 LW 0 LWI 142742 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 72 DIV 21 FPUN 0 FPRSUB 48 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.4895 --Total thread-cycles: 4559136 --total thread-cycles issued: 3061702 (67.155312%) --iCache conflicts: 110819 (2.430702%) --thread*cycles of FU dependence: 254262 (5.576978%) --thread*cycles of data dependence: 199057 (4.366112%) --iCache cycles*banks: 4559136 (73.604121% used) Issue breakdown: --thread*cycles of issue worked: 3061702 (67.155312%) --thread*cycles of issue 
failed: 1203456 (26.396580%) --thread*cycles of issue NOP/other: 293978 (6.448108%) Number of thread-cycles not ready: 199057 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3355680 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 9 3: 8 4: 6 5: 7 6: 8 7: 8 8: 8 9: 8 10: 8 11: 8 12: 7 13: 10 14: 7 15: 8 16: 7 17: 7 18: 8 19: 9 20: 8 21: 7 22: 8 23: 7 24: 7 25: 8 26: 8 27: 6 28: 7 29: 6 30: 6 31: 7 <=== Core 19 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99956 in-flight CPI 1.4229 -- Total Cycles 142252 ---- Thread 01 ---- PC 5: Stalled ----- 100277 in-flight CPI 1.4184 -- Total Cycles 142252 ---- Thread 02 ---- PC 5: Stalled ----- 97805 in-flight CPI 1.4542 -- Total Cycles 142252 ---- Thread 03 ---- PC 5: Stalled ----- 102151 in-flight CPI 1.3923 -- Total Cycles 142252 ---- Thread 04 ---- PC 5: Stalled ----- 95190 in-flight CPI 1.4941 -- Total Cycles 142252 ---- Thread 05 ---- PC 5: Stalled ----- 104249 in-flight CPI 1.3642 -- Total Cycles 142252 ---- Thread 06 ---- PC 5: Stalled ----- 97372 in-flight CPI 1.4606 -- Total Cycles 142252 ---- Thread 07 ---- PC 5: Stalled ----- 104647 in-flight CPI 1.3591 -- Total Cycles 142252 ---- Thread 08 ---- PC 5: Stalled ----- 95577 in-flight CPI 1.4880 -- Total Cycles 142252 ---- Thread 09 ---- PC 5: Stalled ----- 108993 in-flight CPI 1.3050 -- Total Cycles 142252 ---- Thread 10 ---- PC 5: Stalled ----- 102761 in-flight CPI 1.3840 -- Total Cycles 142252 ---- Thread 11 ---- PC 5: Stalled ----- 96740 in-flight CPI 1.4702 -- Total Cycles 142252 ---- Thread 12 ---- PC 5: Stalled ----- 94397 in-flight CPI 1.5066 -- Total Cycles 142252 ---- Thread 13 ---- PC 5: Stalled ----- 98919 in-flight CPI 1.4377 -- Total Cycles 142252 ---- Thread 14 ---- PC 5: Stalled ----- 90982 in-flight CPI 1.5633 -- Total Cycles 142252 ---- Thread 15 ---- PC 5: Stalled ----- 98911 in-flight CPI 1.4379 -- Total Cycles 142252 ---- Thread 16 ---- PC 5: Stalled ----- 96338 in-flight CPI 1.4763 -- Total 
Cycles 142252 ---- Thread 17 ---- PC 5: Stalled ----- 93888 in-flight CPI 1.5149 -- Total Cycles 142252 ---- Thread 18 ---- PC 5: Stalled ----- 101064 in-flight CPI 1.4074 -- Total Cycles 142252 ---- Thread 19 ---- PC 5: Stalled ----- 93335 in-flight CPI 1.5238 -- Total Cycles 142252 ---- Thread 20 ---- PC 5: Stalled ----- 90207 in-flight CPI 1.5767 -- Total Cycles 142252 ---- Thread 21 ---- PC 5: Stalled ----- 98837 in-flight CPI 1.4390 -- Total Cycles 142252 ---- Thread 22 ---- PC 5: Stalled ----- 91751 in-flight CPI 1.5501 -- Total Cycles 142252 ---- Thread 23 ---- PC 5: Stalled ----- 93110 in-flight CPI 1.5276 -- Total Cycles 142252 ---- Thread 24 ---- PC 5: Stalled ----- 87481 in-flight CPI 1.6257 -- Total Cycles 142252 ---- Thread 25 ---- PC 5: Stalled ----- 93064 in-flight CPI 1.5282 -- Total Cycles 142252 ---- Thread 26 ---- PC 5: Stalled ----- 92169 in-flight CPI 1.5432 -- Total Cycles 142252 ---- Thread 27 ---- PC 5: Stalled ----- 89700 in-flight CPI 1.5856 -- Total Cycles 142252 ---- Thread 28 ---- PC 5: Stalled ----- 90530 in-flight CPI 1.5709 -- Total Cycles 142252 ---- Thread 29 ---- PC 5: Stalled ----- 85605 in-flight CPI 1.6615 -- Total Cycles 142252 ---- Thread 30 ---- PC 5: Stalled ----- 90099 in-flight CPI 1.5786 -- Total Cycles 142252 ---- Thread 31 ---- PC 5: Stalled ----- 84314 in-flight CPI 1.6868 -- Total Cycles 142252 Total CPI 0.0465 , IPC 21.5179 -- Total Cycles 142252 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8406 (3.996121%) FPSUB: 0 (0.000000%) FPMUL: 32959 (15.668350%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79788 (37.930346%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 
(0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5444 (2.588018%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 76187 (36.218470%) DIV: 7308 (3.474144%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.124552%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) 
FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3354421 total) ADD%: 7.387 (247786) SUB%: 0.000 (0) MUL%: 0.006 (198) BITOR%: 1.519 (50938) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.583 (19568) FPSUB%: 0.000 (0) FPMUL%: 4.871 (163386) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (594) FPMAX%: 0.018 (594) LOAD%: 5.185 (173922) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (230) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (571) FPINV%: 0.000 (0) FPCONV%: 0.019 (626) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.075 (36059) FPLE%: 0.452 (15153) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (594) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.784 (93398) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (25217) CMPU%: 0.000 (0) RSUB%: 0.006 (198) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.646 (524837) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.166 (39120) ORI%: 1.589 (53294) XORI%: 0.000 (0) MULI%: 3.181 (106710) LW%: 1.123 (37676) LWI%: 13.421 (450212) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9534) SWI%: 4.034 (135316) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.392 (46695) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.307 (10310) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.066 (2208) bned%: 0.000 (0) bneid%: 13.754 
(461378) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (24064) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4349) DIV%: 0.012 (396) FPUN%: 1.470 (49296) FPRSUB%: 4.296 (144097) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.927 (98181) FPGE%: 1.018 (34143) SYNC%: 0.000 (0) NOP%: 8.747 (293408) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 36 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 14 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 383 LOAD 39948 INTCONV 0 ATOMIC_INC 23 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1513 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48635 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 12053 XORI 0 MULI 9561 LW 0 LWI 142505 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 79 DIV 26 FPUN 0 FPRSUB 67 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5181 --Total thread-cycles: 4552064 --total thread-cycles issued: 3061013 (67.244507%) --iCache conflicts: 110968 (2.437751%) --thread*cycles of FU dependence: 254906 (5.599789%) --thread*cycles of data dependence: 210354 (4.621069%) --iCache cycles*banks: 4552064 (73.690814% used) Issue breakdown: --thread*cycles of issue worked: 3061013 (67.244507%) --thread*cycles of issue failed: 1197643 (26.309889%) --thread*cycles of issue NOP/other: 293408 (6.445604%) Number 
of thread-cycles not ready: 210354 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3354421 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 7 4: 7 5: 9 6: 8 7: 7 8: 8 9: 6 10: 8 11: 7 12: 8 13: 9 14: 6 15: 7 16: 7 17: 6 18: 6 19: 7 20: 7 21: 8 22: 8 23: 5 24: 8 25: 8 26: 6 27: 6 28: 9 29: 6 30: 7 31: 7 <=== Core 20 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101560 in-flight CPI 1.2529 -- Total Cycles 127271 ---- Thread 01 ---- PC 5: Stalled ----- 96852 in-flight CPI 1.3139 -- Total Cycles 127271 ---- Thread 02 ---- PC 5: Stalled ----- 94373 in-flight CPI 1.3484 -- Total Cycles 127271 ---- Thread 03 ---- PC 5: Stalled ----- 102384 in-flight CPI 1.2428 -- Total Cycles 127271 ---- Thread 04 ---- PC 5: Stalled ----- 103490 in-flight CPI 1.2295 -- Total Cycles 127271 ---- Thread 05 ---- PC 5: Stalled ----- 96624 in-flight CPI 1.3169 -- Total Cycles 127271 ---- Thread 06 ---- PC 5: Stalled ----- 103147 in-flight CPI 1.2336 -- Total Cycles 127271 ---- Thread 07 ---- PC 5: Stalled ----- 102018 in-flight CPI 1.2473 -- Total Cycles 127271 ---- Thread 08 ---- PC 5: Stalled ----- 103604 in-flight CPI 1.2282 -- Total Cycles 127271 ---- Thread 09 ---- PC 5: Stalled ----- 95923 in-flight CPI 1.3266 -- Total Cycles 127271 ---- Thread 10 ---- PC 5: Stalled ----- 95014 in-flight CPI 1.3392 -- Total Cycles 127271 ---- Thread 11 ---- PC 5: Stalled ----- 100148 in-flight CPI 1.2706 -- Total Cycles 127271 ---- Thread 12 ---- PC 5: Stalled ----- 93697 in-flight CPI 1.3581 -- Total Cycles 127271 ---- Thread 13 ---- PC 5: Stalled ----- 96555 in-flight CPI 1.3179 -- Total Cycles 127271 ---- Thread 14 ---- PC 5: Stalled ----- 93705 in-flight CPI 1.3580 -- Total Cycles 127271 ---- Thread 15 ---- PC 5: Stalled ----- 96242 in-flight CPI 1.3221 -- Total Cycles 127271 ---- Thread 16 ---- PC 5: Stalled ----- 98617 in-flight CPI 1.2903 -- Total Cycles 127271 ---- Thread 17 ---- PC 5: Stalled ----- 93189 in-flight CPI 1.3654 -- Total 
Cycles 127271 ---- Thread 18 ---- PC 5: Stalled ----- 93800 in-flight CPI 1.3566 -- Total Cycles 127271 ---- Thread 19 ---- PC 5: Stalled ----- 97271 in-flight CPI 1.3082 -- Total Cycles 127271 ---- Thread 20 ---- PC 5: Stalled ----- 97034 in-flight CPI 1.3114 -- Total Cycles 127271 ---- Thread 21 ---- PC 5: Stalled ----- 93528 in-flight CPI 1.3606 -- Total Cycles 127271 ---- Thread 22 ---- PC 5: Stalled ----- 97511 in-flight CPI 1.3049 -- Total Cycles 127271 ---- Thread 23 ---- PC 5: Stalled ----- 99049 in-flight CPI 1.2846 -- Total Cycles 127271 ---- Thread 24 ---- PC 5: Stalled ----- 95486 in-flight CPI 1.3326 -- Total Cycles 127271 ---- Thread 25 ---- PC 5: Stalled ----- 90828 in-flight CPI 1.4009 -- Total Cycles 127271 ---- Thread 26 ---- PC 5: Stalled ----- 92433 in-flight CPI 1.3766 -- Total Cycles 127271 ---- Thread 27 ---- PC 5: Stalled ----- 94298 in-flight CPI 1.3494 -- Total Cycles 127271 ---- Thread 28 ---- PC 5: Stalled ----- 91816 in-flight CPI 1.3859 -- Total Cycles 127271 ---- Thread 29 ---- PC 5: Stalled ----- 91809 in-flight CPI 1.3860 -- Total Cycles 127271 ---- Thread 30 ---- PC 5: Stalled ----- 88508 in-flight CPI 1.4377 -- Total Cycles 127271 ---- Thread 31 ---- PC 5: Stalled ----- 90474 in-flight CPI 1.4065 -- Total Cycles 127271 Total CPI 0.0413 , IPC 24.2126 -- Total Cycles 127271 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7326 (3.783935%) FPSUB: 0 (0.000000%) FPMUL: 31069 (16.047374%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73771 (38.103281%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5994 (3.095946%) 
FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67391 (34.807962%) DIV: 7782 (4.019462%) FPUN: 0 (0.000000%) FPRSUB: 275 (0.142040%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 
(0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3376905 total) ADD%: 7.484 (252719) SUB%: 0.000 (0) MUL%: 0.006 (211) BITOR%: 1.521 (51376) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.517 (17448) FPSUB%: 0.000 (0) FPMUL%: 4.675 (157858) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (633) FPMAX%: 0.019 (633) LOAD%: 5.092 (171964) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (614) FPINV%: 0.000 (0) FPCONV%: 0.020 (665) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.048 (35395) FPLE%: 0.452 (15272) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (633) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.834 (95716) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.733 (24737) CMPU%: 0.000 (0) RSUB%: 0.006 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.701 (530220) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (39816) ORI%: 1.536 (51863) XORI%: 0.000 (0) MULI%: 3.231 (109110) LW%: 1.144 (38624) LWI%: 13.566 (458120) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9760) SWI%: 4.101 (138470) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.418 (47882) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10470) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1781) bned%: 0.000 (0) bneid%: 13.816 (466558) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 
(24297) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.113 (3822) DIV%: 0.012 (422) FPUN%: 1.480 (49973) FPRSUB%: 4.125 (139310) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (81) FPGT%: 2.959 (99916) FPGE%: 1.028 (34701) SYNC%: 0.000 (0) NOP%: 8.744 (295285) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 31 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 412 LOAD 39332 INTCONV 0 ATOMIC_INC 23 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1766 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49744 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 10359 XORI 0 MULI 9858 LW 0 LWI 144554 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 72 DIV 33 FPUN 0 FPRSUB 52 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.2129 --Total thread-cycles: 4072672 --total thread-cycles issued: 3081620 (75.665804%) --iCache conflicts: 111946 (2.748711%) --thread*cycles of FU dependence: 256294 (6.293018%) --thread*cycles of data dependence: 193608 (4.753832%) --iCache cycles*banks: 4072672 (82.916989% used) Issue breakdown: --thread*cycles of issue worked: 3081620 (75.665804%) --thread*cycles of issue failed: 695767 (17.083797%) --thread*cycles of issue NOP/other: 295285 (7.250400%) Number of thread-cycles not ready: 193608 Number of thread-cycles not fetched: 0 SIMD stalls 
when issuing: 0 SIMD issues: 3376905 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 6 2: 6 3: 9 4: 9 5: 8 6: 9 7: 8 8: 8 9: 7 10: 8 11: 8 12: 7 13: 8 14: 6 15: 8 16: 8 17: 8 18: 7 19: 8 20: 7 21: 6 22: 8 23: 9 24: 7 25: 8 26: 8 27: 7 28: 8 29: 7 30: 6 31: 7 <=== Core 21 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96539 in-flight CPI 1.3912 -- Total Cycles 134327 ---- Thread 01 ---- PC 5: Stalled ----- 99425 in-flight CPI 1.3507 -- Total Cycles 134327 ---- Thread 02 ---- PC 5: Stalled ----- 93328 in-flight CPI 1.4390 -- Total Cycles 134327 ---- Thread 03 ---- PC 5: Stalled ----- 97267 in-flight CPI 1.3808 -- Total Cycles 134327 ---- Thread 04 ---- PC 5: Stalled ----- 94682 in-flight CPI 1.4184 -- Total Cycles 134327 ---- Thread 05 ---- PC 5: Stalled ----- 98033 in-flight CPI 1.3700 -- Total Cycles 134327 ---- Thread 06 ---- PC 5: Stalled ----- 101431 in-flight CPI 1.3241 -- Total Cycles 134327 ---- Thread 07 ---- PC 5: Stalled ----- 98569 in-flight CPI 1.3625 -- Total Cycles 134327 ---- Thread 08 ---- PC 5: Stalled ----- 97971 in-flight CPI 1.3708 -- Total Cycles 134327 ---- Thread 09 ---- PC 5: Stalled ----- 101418 in-flight CPI 1.3243 -- Total Cycles 134327 ---- Thread 10 ---- PC 5: Stalled ----- 96456 in-flight CPI 1.3924 -- Total Cycles 134327 ---- Thread 11 ---- PC 5: Stalled ----- 95556 in-flight CPI 1.4055 -- Total Cycles 134327 ---- Thread 12 ---- PC 5: Stalled ----- 102810 in-flight CPI 1.3062 -- Total Cycles 134327 ---- Thread 13 ---- PC 5: Stalled ----- 95535 in-flight CPI 1.4058 -- Total Cycles 134327 ---- Thread 14 ---- PC 5: Stalled ----- 98483 in-flight CPI 1.3637 -- Total Cycles 134327 ---- Thread 15 ---- PC 5: Stalled ----- 100642 in-flight CPI 1.3344 -- Total Cycles 134327 ---- Thread 16 ---- PC 5: Stalled ----- 97432 in-flight CPI 1.3784 -- Total Cycles 134327 ---- Thread 17 ---- PC 5: Stalled ----- 93062 in-flight CPI 1.4431 -- Total Cycles 134327 ---- Thread 18 ---- PC 5: Stalled ----- 90579 in-flight CPI 1.4827 -- Total 
Cycles 134327 ---- Thread 19 ---- PC 5: Stalled ----- 93442 in-flight CPI 1.4372 -- Total Cycles 134327 ---- Thread 20 ---- PC 5: Stalled ----- 94372 in-flight CPI 1.4231 -- Total Cycles 134327 ---- Thread 21 ---- PC 5: Stalled ----- 90620 in-flight CPI 1.4820 -- Total Cycles 134327 ---- Thread 22 ---- PC 5: Stalled ----- 92709 in-flight CPI 1.4487 -- Total Cycles 134327 ---- Thread 23 ---- PC 5: Stalled ----- 91333 in-flight CPI 1.4705 -- Total Cycles 134327 ---- Thread 24 ---- PC 5: Stalled ----- 93432 in-flight CPI 1.4374 -- Total Cycles 134327 ---- Thread 25 ---- PC 5: Stalled ----- 90062 in-flight CPI 1.4912 -- Total Cycles 134327 ---- Thread 26 ---- PC 5: Stalled ----- 94520 in-flight CPI 1.4209 -- Total Cycles 134327 ---- Thread 27 ---- PC 5: Stalled ----- 91951 in-flight CPI 1.4606 -- Total Cycles 134327 ---- Thread 28 ---- PC 5: Stalled ----- 90793 in-flight CPI 1.4792 -- Total Cycles 134327 ---- Thread 29 ---- PC 5: Stalled ----- 90024 in-flight CPI 1.4918 -- Total Cycles 134327 ---- Thread 30 ---- PC 5: Stalled ----- 89600 in-flight CPI 1.4989 -- Total Cycles 134327 ---- Thread 31 ---- PC 5: Stalled ----- 88417 in-flight CPI 1.5189 -- Total Cycles 134327 Total CPI 0.0442 , IPC 22.6394 -- Total Cycles 134327 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7423 (3.997738%) FPSUB: 0 (0.000000%) FPMUL: 31034 (16.713701%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 64408 (34.687635%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5834 (3.141965%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 
(0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68810 (37.058380%) DIV: 7893 (4.250862%) FPUN: 0 (0.000000%) FPRSUB: 278 (0.149720%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3332695 total) ADD%: 7.486 (249497) 
SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.534 (51122) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.530 (17668) FPSUB%: 0.000 (0) FPMUL%: 4.705 (156799) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.099 (169950) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (609) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.055 (35144) FPLE%: 0.456 (15213) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.816 (93834) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.734 (24447) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.680 (522578) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39085) ORI%: 1.555 (51814) XORI%: 0.000 (0) MULI%: 3.216 (107196) LW%: 1.136 (37876) LWI%: 13.514 (450380) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9622) SWI%: 4.086 (136183) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.407 (46881) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10346) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1822) bned%: 0.000 (0) bneid%: 13.816 (460434) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (24106) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 
(0) rtsd%: 0.001 (32) FPDIV%: 0.117 (3902) DIV%: 0.013 (428) FPUN%: 1.491 (49699) FPRSUB%: 4.148 (138248) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (78) FPGT%: 2.950 (98318) FPGE%: 1.035 (34486) SYNC%: 0.000 (0) NOP%: 8.748 (291560) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 418 LOAD 37690 INTCONV 0 ATOMIC_INC 26 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1767 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48838 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 10548 XORI 0 MULI 9556 LW 0 LWI 142246 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 59 DIV 33 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.6396 --Total thread-cycles: 4298464 --total thread-cycles issued: 3041135 (70.749342%) --iCache conflicts: 110958 (2.581341%) --thread*cycles of FU dependence: 251322 (5.846786%) --thread*cycles of data dependence: 185680 (4.319683%) --iCache cycles*banks: 4298464 (77.532975% used) Issue breakdown: --thread*cycles of issue worked: 3041135 (70.749342%) --thread*cycles of issue failed: 965769 (22.467770%) --thread*cycles of issue NOP/other: 291560 (6.782888%) Number of thread-cycles not ready: 185680 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3332695 SIMD fetches beyond the first: 0 ATOMIC_INC called by 
threads: 0: 7 1: 9 2: 7 3: 6 4: 8 5: 8 6: 6 7: 9 8: 8 9: 7 10: 7 11: 7 12: 10 13: 8 14: 9 15: 8 16: 8 17: 8 18: 8 19: 8 20: 8 21: 8 22: 6 23: 7 24: 8 25: 7 26: 7 27: 7 28: 8 29: 8 30: 8 31: 8 <=== Core 22 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100574 in-flight CPI 1.2825 -- Total Cycles 129006 ---- Thread 01 ---- PC 5: Stalled ----- 99199 in-flight CPI 1.3002 -- Total Cycles 129006 ---- Thread 02 ---- PC 5: Stalled ----- 99302 in-flight CPI 1.2988 -- Total Cycles 129006 ---- Thread 03 ---- PC 5: Stalled ----- 93479 in-flight CPI 1.3799 -- Total Cycles 129006 ---- Thread 04 ---- PC 5: Stalled ----- 92292 in-flight CPI 1.3976 -- Total Cycles 129006 ---- Thread 05 ---- PC 5: Stalled ----- 99392 in-flight CPI 1.2977 -- Total Cycles 129006 ---- Thread 06 ---- PC 5: Stalled ----- 97922 in-flight CPI 1.3172 -- Total Cycles 129006 ---- Thread 07 ---- PC 5: Stalled ----- 99662 in-flight CPI 1.2942 -- Total Cycles 129006 ---- Thread 08 ---- PC 5: Stalled ----- 101605 in-flight CPI 1.2695 -- Total Cycles 129006 ---- Thread 09 ---- PC 5: Stalled ----- 96671 in-flight CPI 1.3343 -- Total Cycles 129006 ---- Thread 10 ---- PC 5: Stalled ----- 100640 in-flight CPI 1.2816 -- Total Cycles 129006 ---- Thread 11 ---- PC 5: Stalled ----- 101473 in-flight CPI 1.2711 -- Total Cycles 129006 ---- Thread 12 ---- PC 5: Stalled ----- 98774 in-flight CPI 1.3058 -- Total Cycles 129006 ---- Thread 13 ---- PC 5: Stalled ----- 96362 in-flight CPI 1.3385 -- Total Cycles 129006 ---- Thread 14 ---- PC 5: Stalled ----- 98120 in-flight CPI 1.3145 -- Total Cycles 129006 ---- Thread 15 ---- PC 5: Stalled ----- 102136 in-flight CPI 1.2628 -- Total Cycles 129006 ---- Thread 16 ---- PC 5: Stalled ----- 90217 in-flight CPI 1.4297 -- Total Cycles 129006 ---- Thread 17 ---- PC 5: Stalled ----- 95609 in-flight CPI 1.3490 -- Total Cycles 129006 ---- Thread 18 ---- PC 5: Stalled ----- 99828 in-flight CPI 1.2920 -- Total Cycles 129006 ---- Thread 19 ---- PC 5: Stalled ----- 97324 in-flight CPI 1.3253 -- 
Total Cycles 129006 ---- Thread 20 ---- PC 5: Stalled ----- 94705 in-flight CPI 1.3620 -- Total Cycles 129006 ---- Thread 21 ---- PC 5: Stalled ----- 92060 in-flight CPI 1.4010 -- Total Cycles 129006 ---- Thread 22 ---- PC 5: Stalled ----- 90132 in-flight CPI 1.4310 -- Total Cycles 129006 ---- Thread 23 ---- PC 5: Stalled ----- 93511 in-flight CPI 1.3793 -- Total Cycles 129006 ---- Thread 24 ---- PC 5: Stalled ----- 90651 in-flight CPI 1.4229 -- Total Cycles 129006 ---- Thread 25 ---- PC 5: Stalled ----- 91064 in-flight CPI 1.4165 -- Total Cycles 129006 ---- Thread 26 ---- PC 5: Stalled ----- 95822 in-flight CPI 1.3460 -- Total Cycles 129006 ---- Thread 27 ---- PC 5: Stalled ----- 87484 in-flight CPI 1.4744 -- Total Cycles 129006 ---- Thread 28 ---- PC 5: Stalled ----- 85867 in-flight CPI 1.5021 -- Total Cycles 129006 ---- Thread 29 ---- PC 5: Stalled ----- 91478 in-flight CPI 1.4100 -- Total Cycles 129006 ---- Thread 30 ---- PC 5: Stalled ----- 89816 in-flight CPI 1.4361 -- Total Cycles 129006 ---- Thread 31 ---- PC 5: Stalled ----- 91044 in-flight CPI 1.4167 -- Total Cycles 129006 Total CPI 0.0422 , IPC 23.6794 -- Total Cycles 129006 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7345 (3.828073%) FPSUB: 0 (0.000000%) FPMUL: 30946 (16.128461%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 72364 (37.714726%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5680 (2.960307%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) 
LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67585 (35.224004%) DIV: 7682 (4.003711%) FPUN: 0 (0.000000%) FPRSUB: 270 (0.140719%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3347751 total) ADD%: 7.517 (251650) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.538 (51474) BITAND%: 0.000 (0) BITSLEFT%: 
0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.521 (17449) FPSUB%: 0.000 (0) FPMUL%: 4.678 (156618) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.098 (170678) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (594) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.051 (35183) FPLE%: 0.460 (15402) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.822 (94478) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.733 (24548) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.698 (525546) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39321) ORI%: 1.550 (51879) XORI%: 0.000 (0) MULI%: 3.219 (107752) LW%: 1.139 (38124) LWI%: 13.503 (452045) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9689) SWI%: 4.079 (136568) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.410 (47195) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10426) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1825) bned%: 0.000 (0) bneid%: 13.823 (462776) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.726 (24310) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.115 (3835) DIV%: 0.012 (416) FPUN%: 1.496 (50069) 
FPRSUB%: 4.129 (138229) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.949 (98740) FPGE%: 1.036 (34667) SYNC%: 0.000 (0) NOP%: 8.750 (292912) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 37 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 38619 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1502 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49039 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 10436 XORI 0 MULI 9476 LW 0 LWI 142766 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 69 DIV 31 FPUN 0 FPRSUB 43 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6796 --Total thread-cycles: 4128192 --total thread-cycles issued: 3054839 (73.999441%) --iCache conflicts: 110046 (2.665719%) --thread*cycles of FU dependence: 252490 (6.116237%) --thread*cycles of data dependence: 191872 (4.647846%) --iCache cycles*banks: 4128192 (81.095622% used) Issue breakdown: --thread*cycles of issue worked: 3054839 (73.999441%) --thread*cycles of issue failed: 780441 (18.905153%) --thread*cycles of issue NOP/other: 292912 (7.095406%) Number of thread-cycles not ready: 191872 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3347751 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 6 4: 7 5: 9 6: 8 7: 8 8: 7 9: 6 10: 8 11: 9 12: 9 13: 7 14: 8 
15: 9 16: 6 17: 8 18: 8 19: 8 20: 7 21: 8 22: 8 23: 7 24: 7 25: 5 26: 8 27: 6 28: 7 29: 7 30: 7 31: 8 <=== Core 23 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98090 in-flight CPI 1.3183 -- Total Cycles 129332 ---- Thread 01 ---- PC 5: Stalled ----- 103594 in-flight CPI 1.2482 -- Total Cycles 129332 ---- Thread 02 ---- PC 5: Stalled ----- 97357 in-flight CPI 1.3282 -- Total Cycles 129332 ---- Thread 03 ---- PC 5: Stalled ----- 98820 in-flight CPI 1.3085 -- Total Cycles 129332 ---- Thread 04 ---- PC 5: Stalled ----- 100186 in-flight CPI 1.2907 -- Total Cycles 129332 ---- Thread 05 ---- PC 5: Stalled ----- 101274 in-flight CPI 1.2768 -- Total Cycles 129332 ---- Thread 06 ---- PC 5: Stalled ----- 97210 in-flight CPI 1.3302 -- Total Cycles 129332 ---- Thread 07 ---- PC 5: Stalled ----- 102117 in-flight CPI 1.2663 -- Total Cycles 129332 ---- Thread 08 ---- PC 5: Stalled ----- 97491 in-flight CPI 1.3263 -- Total Cycles 129332 ---- Thread 09 ---- PC 5: Stalled ----- 102760 in-flight CPI 1.2584 -- Total Cycles 129332 ---- Thread 10 ---- PC 5: Stalled ----- 95994 in-flight CPI 1.3470 -- Total Cycles 129332 ---- Thread 11 ---- PC 5: Stalled ----- 97303 in-flight CPI 1.3289 -- Total Cycles 129332 ---- Thread 12 ---- PC 5: Stalled ----- 95687 in-flight CPI 1.3514 -- Total Cycles 129332 ---- Thread 13 ---- PC 5: Stalled ----- 99019 in-flight CPI 1.3059 -- Total Cycles 129332 ---- Thread 14 ---- PC 5: Stalled ----- 91145 in-flight CPI 1.4188 -- Total Cycles 129332 ---- Thread 15 ---- PC 5: Stalled ----- 92852 in-flight CPI 1.3926 -- Total Cycles 129332 ---- Thread 16 ---- PC 5: Stalled ----- 94647 in-flight CPI 1.3662 -- Total Cycles 129332 ---- Thread 17 ---- PC 5: Stalled ----- 99270 in-flight CPI 1.3026 -- Total Cycles 129332 ---- Thread 18 ---- PC 5: Stalled ----- 88554 in-flight CPI 1.4602 -- Total Cycles 129332 ---- Thread 19 ---- PC 5: Stalled ----- 89110 in-flight CPI 1.4511 -- Total Cycles 129332 ---- Thread 20 ---- PC 5: Stalled ----- 95222 in-flight CPI 1.3580 -- 
Total Cycles 129332 ---- Thread 21 ---- PC 5: Stalled ----- 90669 in-flight CPI 1.4261 -- Total Cycles 129332 ---- Thread 22 ---- PC 5: Stalled ----- 90638 in-flight CPI 1.4267 -- Total Cycles 129332 ---- Thread 23 ---- PC 5: Stalled ----- 94045 in-flight CPI 1.3749 -- Total Cycles 129332 ---- Thread 24 ---- PC 5: Stalled ----- 89953 in-flight CPI 1.4376 -- Total Cycles 129332 ---- Thread 25 ---- PC 5: Stalled ----- 91197 in-flight CPI 1.4179 -- Total Cycles 129332 ---- Thread 26 ---- PC 5: Stalled ----- 93932 in-flight CPI 1.3766 -- Total Cycles 129332 ---- Thread 27 ---- PC 5: Stalled ----- 93994 in-flight CPI 1.3757 -- Total Cycles 129332 ---- Thread 28 ---- PC 5: Stalled ----- 93090 in-flight CPI 1.3890 -- Total Cycles 129332 ---- Thread 29 ---- PC 5: Stalled ----- 90137 in-flight CPI 1.4345 -- Total Cycles 129332 ---- Thread 30 ---- PC 5: Stalled ----- 88405 in-flight CPI 1.4626 -- Total Cycles 129332 ---- Thread 31 ---- PC 5: Stalled ----- 88531 in-flight CPI 1.4606 -- Total Cycles 129332 Total CPI 0.0425 , IPC 23.5275 -- Total Cycles 129332 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7920 (3.728375%) FPSUB: 0 (0.000000%) FPMUL: 31925 (15.028834%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 86778 (40.851124%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5528 (2.602330%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 
(0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72442 (34.102389%) DIV: 7567 (3.562198%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.124750%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3334939 total) ADD%: 7.408 (247063) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.530 (51033) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.559 (18631) FPSUB%: 0.000 (0) FPMUL%: 4.791 (159781) 
FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.148 (171691) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (583) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (35524) FPLE%: 0.456 (15211) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.800 (93365) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.743 (24781) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.669 (522561) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.169 (38991) ORI%: 1.577 (52581) XORI%: 0.000 (0) MULI%: 3.194 (106512) LW%: 1.130 (37674) LWI%: 13.451 (448589) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9569) SWI%: 4.055 (135236) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46641) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10327) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2033) bned%: 0.000 (0) bneid%: 13.793 (459995) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24059) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4123) DIV%: 0.012 (410) FPUN%: 1.485 (49511) FPRSUB%: 4.223 (140848) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (62) FPGT%: 2.938 (97988) FPGE%: 1.029 
(34300) SYNC%: 0.000 (0) NOP%: 8.757 (292031) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 17 FPSUB 0 FPMUL 4 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 39390 INTCONV 0 ATOMIC_INC 28 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1919 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48580 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 11348 XORI 0 MULI 9351 LW 0 LWI 141878 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 25 FPUN 0 FPRSUB 62 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5277 --Total thread-cycles: 4138624 --total thread-cycles issued: 3042908 (73.524630%) --iCache conflicts: 110168 (2.661948%) --thread*cycles of FU dependence: 253160 (6.117009%) --thread*cycles of data dependence: 212425 (5.132745%) --iCache cycles*banks: 4138624 (80.581638% used) Issue breakdown: --thread*cycles of issue worked: 3042908 (73.524630%) --thread*cycles of issue failed: 803685 (19.419135%) --thread*cycles of issue NOP/other: 292031 (7.056234%) Number of thread-cycles not ready: 212425 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3334939 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 6 3: 7 4: 8 5: 8 6: 8 7: 8 8: 8 9: 7 10: 8 11: 8 12: 7 13: 7 14: 6 15: 8 16: 7 17: 8 18: 7 19: 7 20: 7 21: 8 22: 6 23: 8 24: 6 25: 8 26: 7 27: 8 28: 8 29: 8 30: 
8 31: 7 <=== Core 24 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101249 in-flight CPI 1.2833 -- Total Cycles 129954 ---- Thread 01 ---- PC 5: Stalled ----- 94136 in-flight CPI 1.3803 -- Total Cycles 129954 ---- Thread 02 ---- PC 5: Stalled ----- 98744 in-flight CPI 1.3159 -- Total Cycles 129954 ---- Thread 03 ---- PC 5: Stalled ----- 97143 in-flight CPI 1.3375 -- Total Cycles 129954 ---- Thread 04 ---- PC 5: Stalled ----- 100676 in-flight CPI 1.2906 -- Total Cycles 129954 ---- Thread 05 ---- PC 5: Stalled ----- 99491 in-flight CPI 1.3059 -- Total Cycles 129954 ---- Thread 06 ---- PC 5: Stalled ----- 98365 in-flight CPI 1.3209 -- Total Cycles 129954 ---- Thread 07 ---- PC 5: Stalled ----- 95273 in-flight CPI 1.3638 -- Total Cycles 129954 ---- Thread 08 ---- PC 5: Stalled ----- 97633 in-flight CPI 1.3308 -- Total Cycles 129954 ---- Thread 09 ---- PC 5: Stalled ----- 95926 in-flight CPI 1.3545 -- Total Cycles 129954 ---- Thread 10 ---- PC 5: Stalled ----- 95427 in-flight CPI 1.3616 -- Total Cycles 129954 ---- Thread 11 ---- PC 5: Stalled ----- 93653 in-flight CPI 1.3873 -- Total Cycles 129954 ---- Thread 12 ---- PC 5: Stalled ----- 101339 in-flight CPI 1.2822 -- Total Cycles 129954 ---- Thread 13 ---- PC 5: Stalled ----- 92954 in-flight CPI 1.3978 -- Total Cycles 129954 ---- Thread 14 ---- PC 5: Stalled ----- 91528 in-flight CPI 1.4197 -- Total Cycles 129954 ---- Thread 15 ---- PC 5: Stalled ----- 100694 in-flight CPI 1.2904 -- Total Cycles 129954 ---- Thread 16 ---- PC 5: Stalled ----- 92127 in-flight CPI 1.4103 -- Total Cycles 129954 ---- Thread 17 ---- PC 5: Stalled ----- 96265 in-flight CPI 1.3497 -- Total Cycles 129954 ---- Thread 18 ---- PC 5: Stalled ----- 99498 in-flight CPI 1.3058 -- Total Cycles 129954 ---- Thread 19 ---- PC 5: Stalled ----- 89418 in-flight CPI 1.4531 -- Total Cycles 129954 ---- Thread 20 ---- PC 5: Stalled ----- 96022 in-flight CPI 1.3531 -- Total Cycles 129954 ---- Thread 21 ---- PC 5: Stalled ----- 96558 in-flight CPI 1.3456 -- Total 
Cycles 129954 ---- Thread 22 ---- PC 5: Stalled ----- 97618 in-flight CPI 1.3309 -- Total Cycles 129954 ---- Thread 23 ---- PC 5: Stalled ----- 96030 in-flight CPI 1.3530 -- Total Cycles 129954 ---- Thread 24 ---- PC 5: Stalled ----- 91846 in-flight CPI 1.4147 -- Total Cycles 129954 ---- Thread 25 ---- PC 5: Stalled ----- 88331 in-flight CPI 1.4709 -- Total Cycles 129954 ---- Thread 26 ---- PC 5: Stalled ----- 94220 in-flight CPI 1.3791 -- Total Cycles 129954 ---- Thread 27 ---- PC 5: Stalled ----- 91003 in-flight CPI 1.4278 -- Total Cycles 129954 ---- Thread 28 ---- PC 5: Stalled ----- 96716 in-flight CPI 1.3434 -- Total Cycles 129954 ---- Thread 29 ---- PC 5: Stalled ----- 87405 in-flight CPI 1.4865 -- Total Cycles 129954 ---- Thread 30 ---- PC 5: Stalled ----- 90930 in-flight CPI 1.4289 -- Total Cycles 129954 ---- Thread 31 ---- PC 5: Stalled ----- 87473 in-flight CPI 1.4854 -- Total Cycles 129954 Total CPI 0.0427 , IPC 23.4409 -- Total Cycles 129954 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7674 (3.726274%) FPSUB: 0 (0.000000%) FPMUL: 31538 (15.313946%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 83298 (40.447114%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5510 (2.675498%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 
(0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70160 (34.067679%) DIV: 7499 (3.641299%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.128191%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3338788 total) ADD%: 7.456 (248925) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.520 (50751) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.541 (18053) FPSUB%: 0.000 (0) FPMUL%: 4.746 (158457) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.133 (171396) INTCONV%: 
0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (580) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (35390) FPLE%: 0.457 (15250) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.811 (93843) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.738 (24655) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.685 (523696) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (39077) ORI%: 1.552 (51828) XORI%: 0.000 (0) MULI%: 3.211 (107214) LW%: 1.134 (37862) LWI%: 13.495 (450554) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9615) SWI%: 4.060 (135571) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.404 (46882) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10374) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (2019) bned%: 0.000 (0) bneid%: 13.800 (460767) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23980) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.119 (3989) DIV%: 0.012 (406) FPUN%: 1.478 (49333) FPRSUB%: 4.191 (139932) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.951 (98539) FPGE%: 1.021 (34083) SYNC%: 0.000 (0) NOP%: 8.760 (292488) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 
(0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 395 LOAD 40120 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2082 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48862 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 10915 XORI 0 MULI 9417 LW 0 LWI 142521 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 76 DIV 19 FPUN 0 FPRSUB 61 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4412 --Total thread-cycles: 4158528 --total thread-cycles issued: 3046300 (73.254286%) --iCache conflicts: 109686 (2.637616%) --thread*cycles of FU dependence: 254556 (6.121301%) --thread*cycles of data dependence: 205943 (4.952305%) --iCache cycles*banks: 4158528 (80.288506% used) Issue breakdown: --thread*cycles of issue worked: 3046300 (73.254286%) --thread*cycles of issue failed: 819740 (19.712264%) --thread*cycles of issue NOP/other: 292488 (7.033450%) Number of thread-cycles not ready: 205943 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3338788 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 8 4: 8 5: 8 6: 8 7: 6 8: 8 9: 7 10: 7 11: 8 12: 7 13: 7 14: 5 15: 7 16: 8 17: 8 18: 8 19: 6 20: 7 21: 8 22: 9 23: 7 24: 7 25: 8 26: 6 27: 7 28: 9 29: 7 30: 8 31: 6 <=== Core 25 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96032 in-flight CPI 1.3451 -- 
Total Cycles 129191 ---- Thread 01 ---- PC 5: Stalled ----- 96134 in-flight CPI 1.3436 -- Total Cycles 129191 ---- Thread 02 ---- PC 5: Stalled ----- 104594 in-flight CPI 1.2349 -- Total Cycles 129191 ---- Thread 03 ---- PC 5: Stalled ----- 95867 in-flight CPI 1.3473 -- Total Cycles 129191 ---- Thread 04 ---- PC 5: Stalled ----- 96273 in-flight CPI 1.3417 -- Total Cycles 129191 ---- Thread 05 ---- PC 5: Stalled ----- 102982 in-flight CPI 1.2542 -- Total Cycles 129191 ---- Thread 06 ---- PC 5: Stalled ----- 95734 in-flight CPI 1.3492 -- Total Cycles 129191 ---- Thread 07 ---- PC 5: Stalled ----- 101394 in-flight CPI 1.2739 -- Total Cycles 129191 ---- Thread 08 ---- PC 5: Stalled ----- 102121 in-flight CPI 1.2648 -- Total Cycles 129191 ---- Thread 09 ---- PC 5: Stalled ----- 95671 in-flight CPI 1.3501 -- Total Cycles 129191 ---- Thread 10 ---- PC 5: Stalled ----- 95780 in-flight CPI 1.3486 -- Total Cycles 129191 ---- Thread 11 ---- PC 5: Stalled ----- 99356 in-flight CPI 1.3000 -- Total Cycles 129191 ---- Thread 12 ---- PC 5: Stalled ----- 101251 in-flight CPI 1.2757 -- Total Cycles 129191 ---- Thread 13 ---- PC 5: Stalled ----- 104311 in-flight CPI 1.2383 -- Total Cycles 129191 ---- Thread 14 ---- PC 5: Stalled ----- 100957 in-flight CPI 1.2794 -- Total Cycles 129191 ---- Thread 15 ---- PC 5: Stalled ----- 95548 in-flight CPI 1.3518 -- Total Cycles 129191 ---- Thread 16 ---- PC 5: Stalled ----- 98803 in-flight CPI 1.3073 -- Total Cycles 129191 ---- Thread 17 ---- PC 5: Stalled ----- 91749 in-flight CPI 1.4078 -- Total Cycles 129191 ---- Thread 18 ---- PC 5: Stalled ----- 94807 in-flight CPI 1.3624 -- Total Cycles 129191 ---- Thread 19 ---- PC 5: Stalled ----- 92444 in-flight CPI 1.3972 -- Total Cycles 129191 ---- Thread 20 ---- PC 5: Stalled ----- 88953 in-flight CPI 1.4521 -- Total Cycles 129191 ---- Thread 21 ---- PC 5: Stalled ----- 95305 in-flight CPI 1.3554 -- Total Cycles 129191 ---- Thread 22 ---- PC 5: Stalled ----- 90916 in-flight CPI 1.4207 -- Total Cycles 
129191 ---- Thread 23 ---- PC 5: Stalled ----- 90832 in-flight CPI 1.4221 -- Total Cycles 129191 ---- Thread 24 ---- PC 5: Stalled ----- 91214 in-flight CPI 1.4161 -- Total Cycles 129191 ---- Thread 25 ---- PC 5: Stalled ----- 92536 in-flight CPI 1.3959 -- Total Cycles 129191 ---- Thread 26 ---- PC 5: Stalled ----- 93233 in-flight CPI 1.3854 -- Total Cycles 129191 ---- Thread 27 ---- PC 5: Stalled ----- 94573 in-flight CPI 1.3658 -- Total Cycles 129191 ---- Thread 28 ---- PC 5: Stalled ----- 86447 in-flight CPI 1.4942 -- Total Cycles 129191 ---- Thread 29 ---- PC 5: Stalled ----- 89843 in-flight CPI 1.4377 -- Total Cycles 129191 ---- Thread 30 ---- PC 5: Stalled ----- 90699 in-flight CPI 1.4241 -- Total Cycles 129191 ---- Thread 31 ---- PC 5: Stalled ----- 89972 in-flight CPI 1.4356 -- Total Cycles 129191 Total CPI 0.0423 , IPC 23.6619 -- Total Cycles 129191 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7333 (3.977307%) FPSUB: 0 (0.000000%) FPMUL: 31076 (16.855145%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 64400 (34.929571%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5881 (3.189764%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) 
JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67659 (36.697203%) DIV: 7750 (4.203481%) FPUN: 0 (0.000000%) FPRSUB: 272 (0.147529%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3350294 total) ADD%: 7.486 (250795) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.530 (51263) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.520 (17430) FPSUB%: 0.000 (0) FPMUL%: 4.683 (156884) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.089 (170500) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 
0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (607) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.053 (35289) FPLE%: 0.456 (15267) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.821 (94510) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.734 (24593) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.693 (525760) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39413) ORI%: 1.540 (51611) XORI%: 0.000 (0) MULI%: 3.226 (108082) LW%: 1.138 (38140) LWI%: 13.545 (453783) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9675) SWI%: 4.083 (136808) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.410 (47238) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10389) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.051 (1720) bned%: 0.000 (0) bneid%: 13.833 (463462) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (24109) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.114 (3836) DIV%: 0.013 (420) FPUN%: 1.486 (49780) FPRSUB%: 4.134 (138516) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.960 (99184) FPGE%: 1.030 (34513) SYNC%: 0.000 (0) NOP%: 8.755 (293333) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 
0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 411 LOAD 39094 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 8 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1550 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49196 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 10390 XORI 0 MULI 10049 LW 0 LWI 143254 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 30 FPUN 0 FPRSUB 58 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6622 --Total thread-cycles: 4134112 --total thread-cycles issued: 3056961 (73.944804%) --iCache conflicts: 112680 (2.725616%) --thread*cycles of FU dependence: 254198 (6.148793%) --thread*cycles of data dependence: 184371 (4.459749%) --iCache cycles*banks: 4134112 (81.041007% used) Issue breakdown: --thread*cycles of issue worked: 3056961 (73.944804%) --thread*cycles of issue failed: 783818 (18.959767%) --thread*cycles of issue NOP/other: 293333 (7.095429%) Number of thread-cycles not ready: 184371 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3350294 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 8 4: 7 5: 9 6: 7 7: 9 8: 9 9: 8 10: 8 11: 8 12: 9 13: 8 14: 8 15: 8 16: 7 17: 7 18: 8 19: 8 20: 6 21: 6 22: 7 23: 6 24: 8 25: 7 26: 8 27: 7 28: 6 29: 8 30: 8 31: 7 <=== Core 26 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96089 in-flight CPI 1.3377 -- Total Cycles 128557 ---- Thread 01 ---- PC 5: Stalled ----- 95425 in-flight CPI 1.3470 -- 
Total Cycles 128557 ---- Thread 02 ---- PC 5: Stalled ----- 100042 in-flight CPI 1.2848 -- Total Cycles 128557 ---- Thread 03 ---- PC 5: Stalled ----- 99622 in-flight CPI 1.2902 -- Total Cycles 128557 ---- Thread 04 ---- PC 5: Stalled ----- 100133 in-flight CPI 1.2836 -- Total Cycles 128557 ---- Thread 05 ---- PC 5: Stalled ----- 94219 in-flight CPI 1.3642 -- Total Cycles 128557 ---- Thread 06 ---- PC 5: Stalled ----- 101481 in-flight CPI 1.2665 -- Total Cycles 128557 ---- Thread 07 ---- PC 5: Stalled ----- 100695 in-flight CPI 1.2765 -- Total Cycles 128557 ---- Thread 08 ---- PC 5: Stalled ----- 102836 in-flight CPI 1.2499 -- Total Cycles 128557 ---- Thread 09 ---- PC 5: Stalled ----- 98026 in-flight CPI 1.3112 -- Total Cycles 128557 ---- Thread 10 ---- PC 5: Stalled ----- 92263 in-flight CPI 1.3931 -- Total Cycles 128557 ---- Thread 11 ---- PC 5: Stalled ----- 94761 in-flight CPI 1.3564 -- Total Cycles 128557 ---- Thread 12 ---- PC 5: Stalled ----- 99306 in-flight CPI 1.2943 -- Total Cycles 128557 ---- Thread 13 ---- PC 5: Stalled ----- 97601 in-flight CPI 1.3169 -- Total Cycles 128557 ---- Thread 14 ---- PC 5: Stalled ----- 96068 in-flight CPI 1.3380 -- Total Cycles 128557 ---- Thread 15 ---- PC 5: Stalled ----- 100318 in-flight CPI 1.2812 -- Total Cycles 128557 ---- Thread 16 ---- PC 5: Stalled ----- 97167 in-flight CPI 1.3228 -- Total Cycles 128557 ---- Thread 17 ---- PC 5: Stalled ----- 87630 in-flight CPI 1.4669 -- Total Cycles 128557 ---- Thread 18 ---- PC 5: Stalled ----- 96057 in-flight CPI 1.3381 -- Total Cycles 128557 ---- Thread 19 ---- PC 5: Stalled ----- 96373 in-flight CPI 1.3337 -- Total Cycles 128557 ---- Thread 20 ---- PC 5: Stalled ----- 96324 in-flight CPI 1.3344 -- Total Cycles 128557 ---- Thread 21 ---- PC 5: Stalled ----- 93082 in-flight CPI 1.3809 -- Total Cycles 128557 ---- Thread 22 ---- PC 5: Stalled ----- 99168 in-flight CPI 1.2961 -- Total Cycles 128557 ---- Thread 23 ---- PC 5: Stalled ----- 97265 in-flight CPI 1.3215 -- Total Cycles 
128557 ---- Thread 24 ---- PC 5: Stalled ----- 90047 in-flight CPI 1.4274 -- Total Cycles 128557 ---- Thread 25 ---- PC 5: Stalled ----- 91326 in-flight CPI 1.4074 -- Total Cycles 128557 ---- Thread 26 ---- PC 5: Stalled ----- 95048 in-flight CPI 1.3523 -- Total Cycles 128557 ---- Thread 27 ---- PC 5: Stalled ----- 94205 in-flight CPI 1.3644 -- Total Cycles 128557 ---- Thread 28 ---- PC 5: Stalled ----- 93483 in-flight CPI 1.3749 -- Total Cycles 128557 ---- Thread 29 ---- PC 5: Stalled ----- 88222 in-flight CPI 1.4569 -- Total Cycles 128557 ---- Thread 30 ---- PC 5: Stalled ----- 87444 in-flight CPI 1.4699 -- Total Cycles 128557 ---- Thread 31 ---- PC 5: Stalled ----- 83006 in-flight CPI 1.5486 -- Total Cycles 128557 Total CPI 0.0421 , IPC 23.7661 -- Total Cycles 128557 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7571 (3.762025%) FPSUB: 0 (0.000000%) FPMUL: 31361 (15.583260%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79008 (39.259024%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5678 (2.821394%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 
(0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69720 (34.643823%) DIV: 7639 (3.795814%) FPUN: 0 (0.000000%) FPRSUB: 271 (0.134660%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3348229 total) ADD%: 7.457 (249671) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.528 (51148) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.534 (17894) FPSUB%: 0.000 (0) FPMUL%: 4.726 (158224) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.122 (171496) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (593) FPINV%: 
0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.058 (35419) FPLE%: 0.455 (15248) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.815 (94257) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.741 (24827) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.691 (525373) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39421) ORI%: 1.548 (51833) XORI%: 0.000 (0) MULI%: 3.212 (107560) LW%: 1.136 (38034) LWI%: 13.507 (452234) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9660) SWI%: 4.080 (136604) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.406 (47092) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10397) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1770) bned%: 0.000 (0) bneid%: 13.814 (462523) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23968) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.118 (3961) DIV%: 0.012 (414) FPUN%: 1.480 (49554) FPRSUB%: 4.173 (139730) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.951 (98808) FPGE%: 1.025 (34306) SYNC%: 0.000 (0) NOP%: 8.747 (292876) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 39605 
INTCONV 0 ATOMIC_INC 24 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1759 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49040 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10767 XORI 0 MULI 9531 LW 0 LWI 142890 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 54 DIV 27 FPUN 0 FPRSUB 53 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7663 --Total thread-cycles: 4113824 --total thread-cycles issued: 3055353 (74.270387%) --iCache conflicts: 112022 (2.723063%) --thread*cycles of FU dependence: 254235 (6.180016%) --thread*cycles of data dependence: 201248 (4.891993%) --iCache cycles*banks: 4113824 (81.390478% used) Issue breakdown: --thread*cycles of issue worked: 3055353 (74.270387%) --thread*cycles of issue failed: 765595 (18.610300%) --thread*cycles of issue NOP/other: 292876 (7.119313%) Number of thread-cycles not ready: 201248 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3348229 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 6 2: 8 3: 9 4: 8 5: 7 6: 9 7: 7 8: 7 9: 8 10: 7 11: 7 12: 8 13: 8 14: 7 15: 8 16: 8 17: 5 18: 8 19: 7 20: 7 21: 7 22: 9 23: 8 24: 7 25: 7 26: 8 27: 8 28: 9 29: 8 30: 7 31: 5 <=== Core 27 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101556 in-flight CPI 1.5301 -- Total Cycles 155418 ---- Thread 01 ---- PC 5: Stalled ----- 92762 in-flight CPI 1.6752 -- Total Cycles 155418 ---- Thread 02 ---- PC 5: Stalled ----- 96334 in-flight CPI 1.6131 -- 
Total Cycles 155418 ---- Thread 03 ---- PC 5: Stalled ----- 95050 in-flight CPI 1.6348 -- Total Cycles 155418 ---- Thread 04 ---- PC 5: Stalled ----- 95077 in-flight CPI 1.6344 -- Total Cycles 155418 ---- Thread 05 ---- PC 5: Stalled ----- 102059 in-flight CPI 1.5225 -- Total Cycles 155418 ---- Thread 06 ---- PC 5: Stalled ----- 94445 in-flight CPI 1.6454 -- Total Cycles 155418 ---- Thread 07 ---- PC 5: Stalled ----- 99808 in-flight CPI 1.5568 -- Total Cycles 155418 ---- Thread 08 ---- PC 5: Stalled ----- 96472 in-flight CPI 1.6108 -- Total Cycles 155418 ---- Thread 09 ---- PC 5: Stalled ----- 102974 in-flight CPI 1.5091 -- Total Cycles 155418 ---- Thread 10 ---- PC 5: Stalled ----- 95977 in-flight CPI 1.6190 -- Total Cycles 155418 ---- Thread 11 ---- PC 5: Stalled ----- 91442 in-flight CPI 1.6993 -- Total Cycles 155418 ---- Thread 12 ---- PC 5: Stalled ----- 96495 in-flight CPI 1.6104 -- Total Cycles 155418 ---- Thread 13 ---- PC 5: Stalled ----- 96834 in-flight CPI 1.6047 -- Total Cycles 155418 ---- Thread 14 ---- PC 5: Stalled ----- 94812 in-flight CPI 1.6389 -- Total Cycles 155418 ---- Thread 15 ---- PC 5: Stalled ----- 100416 in-flight CPI 1.5474 -- Total Cycles 155418 ---- Thread 16 ---- PC 5: Stalled ----- 93777 in-flight CPI 1.6570 -- Total Cycles 155418 ---- Thread 17 ---- PC 5: Stalled ----- 92386 in-flight CPI 1.6820 -- Total Cycles 155418 ---- Thread 18 ---- PC 5: Stalled ----- 95577 in-flight CPI 1.6258 -- Total Cycles 155418 ---- Thread 19 ---- PC 5: Stalled ----- 92178 in-flight CPI 1.6858 -- Total Cycles 155418 ---- Thread 20 ---- PC 5: Stalled ----- 93678 in-flight CPI 1.6588 -- Total Cycles 155418 ---- Thread 21 ---- PC 5: Stalled ----- 97835 in-flight CPI 1.5883 -- Total Cycles 155418 ---- Thread 22 ---- PC 5: Stalled ----- 91331 in-flight CPI 1.7015 -- Total Cycles 155418 ---- Thread 23 ---- PC 5: Stalled ----- 96962 in-flight CPI 1.6026 -- Total Cycles 155418 ---- Thread 24 ---- PC 5: Stalled ----- 93050 in-flight CPI 1.6700 -- Total Cycles 
155418 ---- Thread 25 ---- PC 5: Stalled ----- 108595 in-flight CPI 1.4310 -- Total Cycles 155418 ---- Thread 26 ---- PC 5: Stalled ----- 84360 in-flight CPI 1.8420 -- Total Cycles 155418 ---- Thread 27 ---- PC 5: Stalled ----- 91649 in-flight CPI 1.6954 -- Total Cycles 155418 ---- Thread 28 ---- PC 5: Stalled ----- 85501 in-flight CPI 1.8174 -- Total Cycles 155418 ---- Thread 29 ---- PC 5: Stalled ----- 85645 in-flight CPI 1.8144 -- Total Cycles 155418 ---- Thread 30 ---- PC 5: Stalled ----- 90834 in-flight CPI 1.7107 -- Total Cycles 155418 ---- Thread 31 ---- PC 5: Stalled ----- 91215 in-flight CPI 1.7035 -- Total Cycles 155418 Total CPI 0.0512 , IPC 19.5448 -- Total Cycles 155418 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8878 (3.697580%) FPSUB: 0 (0.000000%) FPMUL: 33589 (13.989413%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 104545 (43.541730%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5271 (2.195308%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) 
ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 80407 (33.488544%) DIV: 7156 (2.980388%) FPUN: 0 (0.000000%) FPRSUB: 257 (0.107037%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3328436 total) ADD%: 7.399 (246280) SUB%: 0.000 (0) MUL%: 0.006 (194) BITOR%: 1.523 (50689) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.616 (20508) FPSUB%: 0.000 (0) FPMUL%: 4.958 (165022) FPCMPLT%: 0.000 (0) FPMIN%: 0.017 (582) FPMAX%: 0.017 (582) LOAD%: 5.236 (174293) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (226) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (557) FPINV%: 0.000 (0) FPCONV%: 0.018 (614) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.084 (36089) FPLE%: 
0.453 (15085) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.017 (582) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.758 (91804) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.769 (25580) CMPU%: 0.000 (0) RSUB%: 0.006 (194) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.623 (519995) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.164 (38757) ORI%: 1.615 (53750) XORI%: 0.000 (0) MULI%: 3.148 (104770) LW%: 1.113 (37032) LWI%: 13.314 (443144) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.283 (9409) SWI%: 4.011 (133501) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.378 (45856) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10256) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.068 (2274) bned%: 0.000 (0) bneid%: 13.715 (456490) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23760) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.138 (4599) DIV%: 0.012 (388) FPUN%: 1.465 (48770) FPRSUB%: 4.370 (145456) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.906 (96729) FPGE%: 1.012 (33685) SYNC%: 0.000 (0) NOP%: 8.736 (290768) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 5 FPCMPLT 0 FPMIN 0 FPMAX 377 LOAD 41086 INTCONV 0 ATOMIC_INC 30 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 
FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1498 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 47781 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 12770 XORI 0 MULI 8391 LW 0 LWI 140695 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 71 DIV 25 FPUN 0 FPRSUB 63 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 19.5450 --Total thread-cycles: 4973376 --total thread-cycles issued: 3037668 (61.078591%) --iCache conflicts: 108630 (2.184231%) --thread*cycles of FU dependence: 252864 (5.084353%) --thread*cycles of data dependence: 240103 (4.827767%) --iCache cycles*banks: 4973376 (66.925726% used) Issue breakdown: --thread*cycles of issue worked: 3037668 (61.078591%) --thread*cycles of issue failed: 1644940 (33.074917%) --thread*cycles of issue NOP/other: 290768 (5.846491%) Number of thread-cycles not ready: 240103 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3328436 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 6 2: 7 3: 7 4: 6 5: 8 6: 6 7: 9 8: 6 9: 7 10: 8 11: 7 12: 7 13: 7 14: 7 15: 8 16: 8 17: 7 18: 8 19: 7 20: 7 21: 8 22: 6 23: 8 24: 6 25: 5 26: 7 27: 8 28: 7 29: 6 30: 7 31: 8 <=== Core 28 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102647 in-flight CPI 1.2423 -- Total Cycles 127541 ---- Thread 01 ---- PC 5: Stalled ----- 98545 in-flight CPI 1.2940 -- Total Cycles 127541 ---- Thread 02 ---- PC 5: Stalled ----- 102725 in-flight CPI 1.2413 -- Total Cycles 127541 ---- Thread 03 ---- PC 5: Stalled ----- 102458 in-flight CPI 
1.2446 -- Total Cycles 127541 ---- Thread 04 ---- PC 5: Stalled ----- 95036 in-flight CPI 1.3418 -- Total Cycles 127541 ---- Thread 05 ---- PC 5: Stalled ----- 102456 in-flight CPI 1.2446 -- Total Cycles 127541 ---- Thread 06 ---- PC 5: Stalled ----- 95770 in-flight CPI 1.3315 -- Total Cycles 127541 ---- Thread 07 ---- PC 5: Stalled ----- 104920 in-flight CPI 1.2154 -- Total Cycles 127541 ---- Thread 08 ---- PC 5: Stalled ----- 99452 in-flight CPI 1.2822 -- Total Cycles 127541 ---- Thread 09 ---- PC 5: Stalled ----- 102462 in-flight CPI 1.2446 -- Total Cycles 127541 ---- Thread 10 ---- PC 5: Stalled ----- 99180 in-flight CPI 1.2857 -- Total Cycles 127541 ---- Thread 11 ---- PC 5: Stalled ----- 99822 in-flight CPI 1.2774 -- Total Cycles 127541 ---- Thread 12 ---- PC 5: Stalled ----- 98619 in-flight CPI 1.2930 -- Total Cycles 127541 ---- Thread 13 ---- PC 5: Stalled ----- 97378 in-flight CPI 1.3095 -- Total Cycles 127541 ---- Thread 14 ---- PC 5: Stalled ----- 94800 in-flight CPI 1.3451 -- Total Cycles 127541 ---- Thread 15 ---- PC 5: Stalled ----- 98866 in-flight CPI 1.2898 -- Total Cycles 127541 ---- Thread 16 ---- PC 5: Stalled ----- 93070 in-flight CPI 1.3701 -- Total Cycles 127541 ---- Thread 17 ---- PC 5: Stalled ----- 95754 in-flight CPI 1.3317 -- Total Cycles 127541 ---- Thread 18 ---- PC 5: Stalled ----- 97123 in-flight CPI 1.3130 -- Total Cycles 127541 ---- Thread 19 ---- PC 5: Stalled ----- 94643 in-flight CPI 1.3474 -- Total Cycles 127541 ---- Thread 20 ---- PC 5: Stalled ----- 96718 in-flight CPI 1.3184 -- Total Cycles 127541 ---- Thread 21 ---- PC 5: Stalled ----- 88820 in-flight CPI 1.4357 -- Total Cycles 127541 ---- Thread 22 ---- PC 5: Stalled ----- 94459 in-flight CPI 1.3500 -- Total Cycles 127541 ---- Thread 23 ---- PC 5: Stalled ----- 96105 in-flight CPI 1.3269 -- Total Cycles 127541 ---- Thread 24 ---- PC 5: Stalled ----- 91031 in-flight CPI 1.4008 -- Total Cycles 127541 ---- Thread 25 ---- PC 5: Stalled ----- 94522 in-flight CPI 1.3491 -- Total 
Cycles 127541 ---- Thread 26 ---- PC 5: Stalled ----- 93105 in-flight CPI 1.3697 -- Total Cycles 127541 ---- Thread 27 ---- PC 5: Stalled ----- 92665 in-flight CPI 1.3761 -- Total Cycles 127541 ---- Thread 28 ---- PC 5: Stalled ----- 92426 in-flight CPI 1.3796 -- Total Cycles 127541 ---- Thread 29 ---- PC 5: Stalled ----- 86309 in-flight CPI 1.4775 -- Total Cycles 127541 ---- Thread 30 ---- PC 5: Stalled ----- 84941 in-flight CPI 1.5013 -- Total Cycles 127541 ---- Thread 31 ---- PC 5: Stalled ----- 81425 in-flight CPI 1.5662 -- Total Cycles 127541 Total CPI 0.0416 , IPC 24.0614 -- Total Cycles 127541 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8178 (4.179870%) FPSUB: 0 (0.000000%) FPMUL: 32662 (16.693926%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 66386 (33.930652%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5753 (2.940425%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 
(0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74856 (38.259767%) DIV: 7555 (3.861448%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.133911%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3362573 total) ADD%: 7.431 (249887) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.542 (51867) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.572 (19237) FPSUB%: 0.000 (0) FPMUL%: 4.829 (162367) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.149 (173141) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (595) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.071 (36019) FPLE%: 0.453 (15232) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) 
LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.787 (93710) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (25323) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.644 (526050) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39396) ORI%: 1.590 (53465) XORI%: 0.000 (0) MULI%: 3.184 (107048) LW%: 1.124 (37812) LWI%: 13.414 (451069) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9562) SWI%: 4.044 (135978) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.394 (46867) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10340) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1940) bned%: 0.000 (0) bneid%: 13.786 (463552) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (24216) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4265) DIV%: 0.012 (410) FPUN%: 1.489 (50078) FPRSUB%: 4.253 (143021) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.922 (98265) FPGE%: 1.036 (34846) SYNC%: 0.000 (0) NOP%: 8.735 (293706) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 40091 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 
STORE 1737 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48890 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11627 XORI 0 MULI 9307 LW 0 LWI 142782 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 69 DIV 40 FPUN 0 FPRSUB 56 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.0616 --Total thread-cycles: 4081312 --total thread-cycles issued: 3068867 (75.193149%) --iCache conflicts: 113313 (2.776387%) --thread*cycles of FU dependence: 255111 (6.250711%) --thread*cycles of data dependence: 195652 (4.793851%) --iCache cycles*banks: 4081312 (82.390295% used) Issue breakdown: --thread*cycles of issue worked: 3068867 (75.193149%) --thread*cycles of issue failed: 718739 (17.610489%) --thread*cycles of issue NOP/other: 293706 (7.196362%) Number of thread-cycles not ready: 195652 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3362573 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 9 3: 8 4: 6 5: 8 6: 8 7: 8 8: 8 9: 7 10: 9 11: 8 12: 8 13: 9 14: 7 15: 8 16: 8 17: 7 18: 6 19: 7 20: 8 21: 6 22: 8 23: 7 24: 7 25: 8 26: 6 27: 8 28: 8 29: 5 30: 6 31: 5 <=== Core 29 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96865 in-flight CPI 1.5609 -- Total Cycles 151223 ---- Thread 01 ---- PC 5: Stalled ----- 102429 in-flight CPI 1.4761 -- Total Cycles 151223 ---- Thread 02 ---- PC 5: Stalled ----- 92507 in-flight CPI 1.6345 -- Total Cycles 151223 ---- Thread 03 ---- PC 5: Stalled ----- 101153 in-flight CPI 1.4947 -- Total Cycles 151223 ---- Thread 04 ---- PC 5: Stalled ----- 95258 in-flight CPI 1.5872 -- 
Total Cycles 151223 ---- Thread 05 ---- PC 5: Stalled ----- 94470 in-flight CPI 1.6005 -- Total Cycles 151223 ---- Thread 06 ---- PC 5: Stalled ----- 94821 in-flight CPI 1.5946 -- Total Cycles 151223 ---- Thread 07 ---- PC 5: Stalled ----- 97337 in-flight CPI 1.5533 -- Total Cycles 151223 ---- Thread 08 ---- PC 5: Stalled ----- 96988 in-flight CPI 1.5589 -- Total Cycles 151223 ---- Thread 09 ---- PC 5: Stalled ----- 99637 in-flight CPI 1.5174 -- Total Cycles 151223 ---- Thread 10 ---- PC 5: Stalled ----- 95952 in-flight CPI 1.5758 -- Total Cycles 151223 ---- Thread 11 ---- PC 5: Stalled ----- 94846 in-flight CPI 1.5941 -- Total Cycles 151223 ---- Thread 12 ---- PC 5: Stalled ----- 91249 in-flight CPI 1.6568 -- Total Cycles 151223 ---- Thread 13 ---- PC 5: Stalled ----- 93160 in-flight CPI 1.6230 -- Total Cycles 151223 ---- Thread 14 ---- PC 5: Stalled ----- 98839 in-flight CPI 1.5297 -- Total Cycles 151223 ---- Thread 15 ---- PC 5: Stalled ----- 98598 in-flight CPI 1.5334 -- Total Cycles 151223 ---- Thread 16 ---- PC 5: Stalled ----- 94514 in-flight CPI 1.5998 -- Total Cycles 151223 ---- Thread 17 ---- PC 5: Stalled ----- 97190 in-flight CPI 1.5556 -- Total Cycles 151223 ---- Thread 18 ---- PC 5: Stalled ----- 97066 in-flight CPI 1.5577 -- Total Cycles 151223 ---- Thread 19 ---- PC 5: Stalled ----- 95453 in-flight CPI 1.5839 -- Total Cycles 151223 ---- Thread 20 ---- PC 5: Stalled ----- 100014 in-flight CPI 1.5117 -- Total Cycles 151223 ---- Thread 21 ---- PC 5: Stalled ----- 95499 in-flight CPI 1.5832 -- Total Cycles 151223 ---- Thread 22 ---- PC 5: Stalled ----- 97562 in-flight CPI 1.5497 -- Total Cycles 151223 ---- Thread 23 ---- PC 5: Stalled ----- 101494 in-flight CPI 1.4898 -- Total Cycles 151223 ---- Thread 24 ---- PC 5: Stalled ----- 91702 in-flight CPI 1.6488 -- Total Cycles 151223 ---- Thread 25 ---- PC 5: Stalled ----- 89244 in-flight CPI 1.6941 -- Total Cycles 151223 ---- Thread 26 ---- PC 5: Stalled ----- 93210 in-flight CPI 1.6221 -- Total Cycles 
151223 ---- Thread 27 ---- PC 5: Stalled ----- 96743 in-flight CPI 1.5628 -- Total Cycles 151223 ---- Thread 28 ---- PC 5: Stalled ----- 96449 in-flight CPI 1.5676 -- Total Cycles 151223 ---- Thread 29 ---- PC 5: Stalled ----- 91874 in-flight CPI 1.6457 -- Total Cycles 151223 ---- Thread 30 ---- PC 5: Stalled ----- 92912 in-flight CPI 1.6273 -- Total Cycles 151223 ---- Thread 31 ---- PC 5: Stalled ----- 90863 in-flight CPI 1.6639 -- Total Cycles 151223 Total CPI 0.0493 , IPC 20.2777 -- Total Cycles 151223 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7814 (3.552416%) FPSUB: 0 (0.000000%) FPMUL: 31904 (14.504258%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 95089 (43.229543%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5809 (2.640899%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) 
sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71514 (32.511832%) DIV: 7566 (3.439669%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.121384%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3360537 total) ADD%: 7.417 (249261) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.507 (50636) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (18380) FPSUB%: 0.000 (0) FPMUL%: 4.767 (160212) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.152 (173123) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (598) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.059 (35595) FPLE%: 0.452 (15178) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 
0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.815 (94595) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (25093) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.691 (527317) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39542) ORI%: 1.550 (52090) XORI%: 0.000 (0) MULI%: 3.209 (107852) LW%: 1.136 (38166) LWI%: 13.512 (454076) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9648) SWI%: 4.083 (137227) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.408 (47315) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10421) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1951) bned%: 0.000 (0) bneid%: 13.777 (462995) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.712 (23932) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4069) DIV%: 0.012 (410) FPUN%: 1.460 (49055) FPRSUB%: 4.207 (141379) FPSQRT%: 0.000 (0) FPNEG%: 0.003 (86) FPGT%: 2.952 (99199) FPGE%: 1.008 (33877) SYNC%: 0.000 (0) NOP%: 8.749 (294024) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 24 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 40223 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1514 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 
0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49175 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 11066 XORI 0 MULI 9103 LW 0 LWI 143620 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 72 DIV 31 FPUN 0 FPRSUB 58 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.2780 --Total thread-cycles: 4839136 --total thread-cycles issued: 3066513 (63.369019%) --iCache conflicts: 110040 (2.273960%) --thread*cycles of FU dependence: 255353 (5.276830%) --thread*cycles of data dependence: 219963 (4.545502%) --iCache cycles*banks: 4839136 (69.445641% used) Issue breakdown: --thread*cycles of issue worked: 3066513 (63.369019%) --thread*cycles of issue failed: 1478599 (30.555021%) --thread*cycles of issue NOP/other: 294024 (6.075961%) Number of thread-cycles not ready: 219963 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3360537 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 6 3: 8 4: 8 5: 7 6: 7 7: 8 8: 7 9: 8 10: 7 11: 7 12: 9 13: 6 14: 8 15: 8 16: 6 17: 8 18: 7 19: 8 20: 8 21: 7 22: 8 23: 6 24: 7 25: 8 26: 7 27: 8 28: 8 29: 6 30: 7 31: 9 <=== Core 30 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100255 in-flight CPI 1.2973 -- Total Cycles 130087 ---- Thread 01 ---- PC 5: Stalled ----- 100646 in-flight CPI 1.2923 -- Total Cycles 130087 ---- Thread 02 ---- PC 5: Stalled ----- 100203 in-flight CPI 1.2979 -- Total Cycles 130087 ---- Thread 03 ---- PC 5: Stalled ----- 104176 in-flight CPI 1.2484 -- Total Cycles 130087 ---- Thread 04 ---- PC 5: Stalled ----- 98378 in-flight CPI 1.3221 -- Total Cycles 130087 ---- Thread 05 ---- PC 5: Stalled ----- 99103 in-flight CPI 1.3124 -- 
Total Cycles 130087 ---- Thread 06 ---- PC 5: Stalled ----- 101310 in-flight CPI 1.2838 -- Total Cycles 130087 ---- Thread 07 ---- PC 5: Stalled ----- 99579 in-flight CPI 1.3061 -- Total Cycles 130087 ---- Thread 08 ---- PC 5: Stalled ----- 103815 in-flight CPI 1.2528 -- Total Cycles 130087 ---- Thread 09 ---- PC 5: Stalled ----- 104293 in-flight CPI 1.2471 -- Total Cycles 130087 ---- Thread 10 ---- PC 5: Stalled ----- 91089 in-flight CPI 1.4279 -- Total Cycles 130087 ---- Thread 11 ---- PC 5: Stalled ----- 98592 in-flight CPI 1.3192 -- Total Cycles 130087 ---- Thread 12 ---- PC 5: Stalled ----- 95060 in-flight CPI 1.3682 -- Total Cycles 130087 ---- Thread 13 ---- PC 5: Stalled ----- 105609 in-flight CPI 1.2315 -- Total Cycles 130087 ---- Thread 14 ---- PC 5: Stalled ----- 93607 in-flight CPI 1.3894 -- Total Cycles 130087 ---- Thread 15 ---- PC 5: Stalled ----- 97641 in-flight CPI 1.3320 -- Total Cycles 130087 ---- Thread 16 ---- PC 5: Stalled ----- 96744 in-flight CPI 1.3444 -- Total Cycles 130087 ---- Thread 17 ---- PC 5: Stalled ----- 94737 in-flight CPI 1.3729 -- Total Cycles 130087 ---- Thread 18 ---- PC 5: Stalled ----- 91182 in-flight CPI 1.4264 -- Total Cycles 130087 ---- Thread 19 ---- PC 5: Stalled ----- 92603 in-flight CPI 1.4045 -- Total Cycles 130087 ---- Thread 20 ---- PC 5: Stalled ----- 92875 in-flight CPI 1.4004 -- Total Cycles 130087 ---- Thread 21 ---- PC 5: Stalled ----- 96163 in-flight CPI 1.3525 -- Total Cycles 130087 ---- Thread 22 ---- PC 5: Stalled ----- 93178 in-flight CPI 1.3959 -- Total Cycles 130087 ---- Thread 23 ---- PC 5: Stalled ----- 86736 in-flight CPI 1.4996 -- Total Cycles 130087 ---- Thread 24 ---- PC 5: Stalled ----- 87398 in-flight CPI 1.4882 -- Total Cycles 130087 ---- Thread 25 ---- PC 5: Stalled ----- 92157 in-flight CPI 1.4113 -- Total Cycles 130087 ---- Thread 26 ---- PC 5: Stalled ----- 90276 in-flight CPI 1.4407 -- Total Cycles 130087 ---- Thread 27 ---- PC 5: Stalled ----- 86644 in-flight CPI 1.5012 -- Total Cycles 
130087 ---- Thread 28 ---- PC 5: Stalled ----- 91327 in-flight CPI 1.4242 -- Total Cycles 130087 ---- Thread 29 ---- PC 5: Stalled ----- 91041 in-flight CPI 1.4286 -- Total Cycles 130087 ---- Thread 30 ---- PC 5: Stalled ----- 92955 in-flight CPI 1.3992 -- Total Cycles 130087 ---- Thread 31 ---- PC 5: Stalled ----- 77973 in-flight CPI 1.6682 -- Total Cycles 130087 Total CPI 0.0427 , IPC 23.4298 -- Total Cycles 130087 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7617 (3.779868%) FPSUB: 0 (0.000000%) FPMUL: 31469 (15.616207%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 78350 (38.880480%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5711 (2.834032%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 
(0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70416 (34.943304%) DIV: 7681 (3.811627%) FPUN: 0 (0.000000%) FPRSUB: 271 (0.134481%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3340871 total) ADD%: 7.457 (249115) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.511 (50465) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (18004) FPSUB%: 0.000 (0) FPMUL%: 4.742 (158431) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.131 (171406) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (596) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (35418) FPLE%: 0.454 (15170) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 
0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.811 (93903) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.735 (24543) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.676 (523713) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.168 (39032) ORI%: 1.551 (51809) XORI%: 0.000 (0) MULI%: 3.213 (107354) LW%: 1.134 (37894) LWI%: 13.521 (451713) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9632) SWI%: 4.070 (135975) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.404 (46909) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10368) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2052) bned%: 0.000 (0) bneid%: 13.795 (460879) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23926) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (4002) DIV%: 0.012 (416) FPUN%: 1.470 (49119) FPRSUB%: 4.189 (139963) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (80) FPGT%: 2.959 (98853) FPGE%: 1.016 (33949) SYNC%: 0.000 (0) NOP%: 8.767 (292902) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 13 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 4 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 39815 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1337 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 
RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48908 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10800 XORI 0 MULI 9422 LW 0 LWI 142726 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 65 DIV 33 FPUN 0 FPRSUB 55 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4301 --Total thread-cycles: 4162784 --total thread-cycles issued: 3047969 (73.219485%) --iCache conflicts: 109331 (2.626391%) --thread*cycles of FU dependence: 253672 (6.093806%) --thread*cycles of data dependence: 201515 (4.840871%) --iCache cycles*banks: 4162784 (80.256458% used) Issue breakdown: --thread*cycles of issue worked: 3047969 (73.219485%) --thread*cycles of issue failed: 821913 (19.744311%) --thread*cycles of issue NOP/other: 292902 (7.036205%) Number of thread-cycles not ready: 201515 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3340871 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 10 4: 8 5: 8 6: 7 7: 8 8: 9 9: 9 10: 6 11: 8 12: 7 13: 9 14: 8 15: 9 16: 8 17: 7 18: 7 19: 8 20: 7 21: 8 22: 6 23: 6 24: 7 25: 7 26: 7 27: 6 28: 6 29: 7 30: 8 31: 4 <=== Core 31 ===> ---- Thread 00 ---- PC 5: Stalled ----- 93223 in-flight CPI 1.6591 -- Total Cycles 154692 ---- Thread 01 ---- PC 5: Stalled ----- 99058 in-flight CPI 1.5613 -- Total Cycles 154692 ---- Thread 02 ---- PC 5: Stalled ----- 100976 in-flight CPI 1.5316 -- Total Cycles 154692 ---- Thread 03 ---- PC 5: Stalled ----- 93435 in-flight CPI 1.6554 -- Total Cycles 154692 ---- Thread 04 ---- PC 5: Stalled ----- 101064 in-flight CPI 1.5303 -- Total Cycles 154692 ---- Thread 05 ---- PC 5: Stalled ----- 93735 in-flight CPI 1.6500 -- Total Cycles 154692 ---- Thread 06 ---- PC 5: Stalled ----- 101008 in-flight CPI 1.5312 -- 
Total Cycles 154692 ---- Thread 07 ---- PC 5: Stalled ----- 98254 in-flight CPI 1.5741 -- Total Cycles 154692 ---- Thread 08 ---- PC 5: Stalled ----- 93718 in-flight CPI 1.6503 -- Total Cycles 154692 ---- Thread 09 ---- PC 5: Stalled ----- 102047 in-flight CPI 1.5156 -- Total Cycles 154692 ---- Thread 10 ---- PC 5: Stalled ----- 98773 in-flight CPI 1.5658 -- Total Cycles 154692 ---- Thread 11 ---- PC 5: Stalled ----- 93641 in-flight CPI 1.6517 -- Total Cycles 154692 ---- Thread 12 ---- PC 5: Stalled ----- 93308 in-flight CPI 1.6575 -- Total Cycles 154692 ---- Thread 13 ---- PC 5: Stalled ----- 94641 in-flight CPI 1.6342 -- Total Cycles 154692 ---- Thread 14 ---- PC 5: Stalled ----- 95063 in-flight CPI 1.6270 -- Total Cycles 154692 ---- Thread 15 ---- PC 5: Stalled ----- 98961 in-flight CPI 1.5629 -- Total Cycles 154692 ---- Thread 16 ---- PC 5: Stalled ----- 97756 in-flight CPI 1.5822 -- Total Cycles 154692 ---- Thread 17 ---- PC 5: Stalled ----- 95245 in-flight CPI 1.6239 -- Total Cycles 154692 ---- Thread 18 ---- PC 5: Stalled ----- 94601 in-flight CPI 1.6349 -- Total Cycles 154692 ---- Thread 19 ---- PC 5: Stalled ----- 95004 in-flight CPI 1.6279 -- Total Cycles 154692 ---- Thread 20 ---- PC 5: Stalled ----- 94208 in-flight CPI 1.6417 -- Total Cycles 154692 ---- Thread 21 ---- PC 5: Stalled ----- 95750 in-flight CPI 1.6153 -- Total Cycles 154692 ---- Thread 22 ---- PC 5: Stalled ----- 92997 in-flight CPI 1.6631 -- Total Cycles 154692 ---- Thread 23 ---- PC 5: Stalled ----- 92532 in-flight CPI 1.6714 -- Total Cycles 154692 ---- Thread 24 ---- PC 5: Stalled ----- 109972 in-flight CPI 1.4065 -- Total Cycles 154692 ---- Thread 25 ---- PC 5: Stalled ----- 90444 in-flight CPI 1.7101 -- Total Cycles 154692 ---- Thread 26 ---- PC 5: Stalled ----- 88126 in-flight CPI 1.7551 -- Total Cycles 154692 ---- Thread 27 ---- PC 5: Stalled ----- 92451 in-flight CPI 1.6729 -- Total Cycles 154692 ---- Thread 28 ---- PC 5: Stalled ----- 93273 in-flight CPI 1.6581 -- Total Cycles 
154692 ---- Thread 29 ---- PC 5: Stalled ----- 86571 in-flight CPI 1.7865 -- Total Cycles 154692 ---- Thread 30 ---- PC 5: Stalled ----- 91928 in-flight CPI 1.6825 -- Total Cycles 154692 ---- Thread 31 ---- PC 5: Stalled ----- 91073 in-flight CPI 1.6982 -- Total Cycles 154692 Total CPI 0.0507 , IPC 19.7387 -- Total Cycles 154692 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8313 (3.877423%) FPSUB: 0 (0.000000%) FPMUL: 32747 (15.274144%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 83397 (38.898762%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5831 (2.719746%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 
(0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 76118 (35.503626%) DIV: 7717 (3.599431%) FPUN: 0 (0.000000%) FPRSUB: 272 (0.126869%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3346371 total) ADD%: 7.486 (250521) SUB%: 0.000 (0) MUL%: 0.006 (209) BITOR%: 1.512 (50595) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.580 (19423) FPSUB%: 0.000 (0) FPMUL%: 4.853 (162412) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (627) FPMAX%: 0.019 (627) LOAD%: 5.170 (173019) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (603) FPINV%: 0.000 (0) FPCONV%: 0.020 (659) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.076 (36012) FPLE%: 0.452 (15119) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (627) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 
0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.777 (92944) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (25137) CMPU%: 0.000 (0) RSUB%: 0.006 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.628 (522973) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.165 (38996) ORI%: 1.573 (52639) XORI%: 0.000 (0) MULI%: 3.179 (106380) LW%: 1.121 (37512) LWI%: 13.425 (449252) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9513) SWI%: 4.042 (135267) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.388 (46458) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10299) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2088) bned%: 0.000 (0) bneid%: 13.757 (460360) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.708 (23678) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4337) DIV%: 0.012 (418) FPUN%: 1.461 (48902) FPRSUB%: 4.276 (143102) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.939 (98355) FPGE%: 1.010 (33783) SYNC%: 0.000 (0) NOP%: 8.753 (292908) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 40311 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1716 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48528 ADDIC 0 ADDIK 0 ADDIKC 0 
RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 6 ORI 11838 XORI 0 MULI 8696 LW 0 LWI 142107 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 72 DIV 20 FPUN 0 FPRSUB 48 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 19.7389 --Total thread-cycles: 4950144 --total thread-cycles issued: 3053463 (61.684327%) --iCache conflicts: 109749 (2.217087%) --thread*cycles of FU dependence: 253816 (5.127447%) --thread*cycles of data dependence: 214395 (4.331086%) --iCache cycles*banks: 4950144 (67.602134% used) Issue breakdown: --thread*cycles of issue worked: 3053463 (61.684327%) --thread*cycles of issue failed: 1603773 (32.398512%) --thread*cycles of issue NOP/other: 292908 (5.917161%) Number of thread-cycles not ready: 214395 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3346371 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 6 4: 8 5: 7 6: 8 7: 8 8: 7 9: 8 10: 8 11: 7 12: 9 13: 8 14: 7 15: 7 16: 7 17: 7 18: 8 19: 8 20: 8 21: 8 22: 7 23: 8 24: 6 25: 7 26: 6 27: 8 28: 8 29: 8 30: 7 31: 8 <=== Core 32 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99680 in-flight CPI 1.2800 -- Total Cycles 127607 ---- Thread 01 ---- PC 5: Stalled ----- 102470 in-flight CPI 1.2451 -- Total Cycles 127607 ---- Thread 02 ---- PC 5: Stalled ----- 98322 in-flight CPI 1.2977 -- Total Cycles 127607 ---- Thread 03 ---- PC 5: Stalled ----- 102077 in-flight CPI 1.2498 -- Total Cycles 127607 ---- Thread 04 ---- PC 5: Stalled ----- 98648 in-flight CPI 1.2933 -- Total Cycles 127607 ---- Thread 05 ---- PC 5: Stalled ----- 95023 in-flight CPI 1.3427 -- Total Cycles 127607 ---- Thread 06 ---- PC 5: Stalled ----- 94807 in-flight CPI 1.3457 -- Total Cycles 127607 ---- Thread 07 ---- PC 5: Stalled ----- 100153 in-flight CPI 1.2739 -- 
Total Cycles 127607 ---- Thread 08 ---- PC 5: Stalled ----- 91166 in-flight CPI 1.3995 -- Total Cycles 127607 ---- Thread 09 ---- PC 5: Stalled ----- 98886 in-flight CPI 1.2902 -- Total Cycles 127607 ---- Thread 10 ---- PC 5: Stalled ----- 92103 in-flight CPI 1.3852 -- Total Cycles 127607 ---- Thread 11 ---- PC 5: Stalled ----- 99504 in-flight CPI 1.2822 -- Total Cycles 127607 ---- Thread 12 ---- PC 5: Stalled ----- 93157 in-flight CPI 1.3695 -- Total Cycles 127607 ---- Thread 13 ---- PC 5: Stalled ----- 96209 in-flight CPI 1.3261 -- Total Cycles 127607 ---- Thread 14 ---- PC 5: Stalled ----- 95878 in-flight CPI 1.3307 -- Total Cycles 127607 ---- Thread 15 ---- PC 5: Stalled ----- 96092 in-flight CPI 1.3277 -- Total Cycles 127607 ---- Thread 16 ---- PC 5: Stalled ----- 94272 in-flight CPI 1.3533 -- Total Cycles 127607 ---- Thread 17 ---- PC 5: Stalled ----- 96403 in-flight CPI 1.3234 -- Total Cycles 127607 ---- Thread 18 ---- PC 5: Stalled ----- 96287 in-flight CPI 1.3251 -- Total Cycles 127607 ---- Thread 19 ---- PC 5: Stalled ----- 95274 in-flight CPI 1.3391 -- Total Cycles 127607 ---- Thread 20 ---- PC 5: Stalled ----- 91817 in-flight CPI 1.3896 -- Total Cycles 127607 ---- Thread 21 ---- PC 5: Stalled ----- 91287 in-flight CPI 1.3977 -- Total Cycles 127607 ---- Thread 22 ---- PC 5: Stalled ----- 96375 in-flight CPI 1.3238 -- Total Cycles 127607 ---- Thread 23 ---- PC 5: Stalled ----- 93168 in-flight CPI 1.3694 -- Total Cycles 127607 ---- Thread 24 ---- PC 5: Stalled ----- 93107 in-flight CPI 1.3703 -- Total Cycles 127607 ---- Thread 25 ---- PC 5: Stalled ----- 93358 in-flight CPI 1.3667 -- Total Cycles 127607 ---- Thread 26 ---- PC 5: Stalled ----- 85965 in-flight CPI 1.4842 -- Total Cycles 127607 ---- Thread 27 ---- PC 5: Stalled ----- 89663 in-flight CPI 1.4229 -- Total Cycles 127607 ---- Thread 28 ---- PC 5: Stalled ----- 90875 in-flight CPI 1.4040 -- Total Cycles 127607 ---- Thread 29 ---- PC 5: Stalled ----- 87951 in-flight CPI 1.4507 -- Total Cycles 127607 
---- Thread 30 ---- PC 5: Stalled ----- 90408 in-flight CPI 1.4112 -- Total Cycles 127607 ---- Thread 31 ---- PC 5: Stalled ----- 89666 in-flight CPI 1.4229 -- Total Cycles 127607 Total CPI 0.0421 , IPC 23.7493 -- Total Cycles 127607 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8366 (3.969237%) FPSUB: 0 (0.000000%) FPMUL: 32742 (15.534395%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 81425 (38.631975%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5320 (2.524066%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 
(0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75426 (35.785758%) DIV: 7233 (3.431687%) FPUN: 0 (0.000000%) FPRSUB: 259 (0.122882%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3321535 total) ADD%: 7.432 (246860) SUB%: 0.000 (0) MUL%: 0.006 (196) BITOR%: 1.517 (50390) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.585 (19434) FPSUB%: 0.000 (0) FPMUL%: 4.877 (162002) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (588) FPMAX%: 0.018 (588) LOAD%: 5.186 (172261) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (228) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (562) FPINV%: 0.000 (0) FPCONV%: 0.019 (620) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.076 (35738) FPLE%: 0.452 (15014) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (588) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.778 (92271) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) 
ANDN%: 0.000 (0) CMP%: 0.757 (25133) CMPU%: 0.000 (0) RSUB%: 0.006 (196) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.641 (519506) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.167 (38749) ORI%: 1.586 (52687) XORI%: 0.000 (0) MULI%: 3.172 (105374) LW%: 1.121 (37222) LWI%: 13.394 (444873) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9482) SWI%: 4.025 (133698) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.387 (46065) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10309) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.065 (2143) bned%: 0.000 (0) bneid%: 13.759 (457013) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.709 (23559) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4305) DIV%: 0.012 (392) FPUN%: 1.465 (48667) FPRSUB%: 4.296 (142703) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (55) FPGT%: 2.933 (97419) FPGE%: 1.013 (33653) SYNC%: 0.000 (0) NOP%: 8.758 (290896) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 385 LOAD 41442 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1306 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 14 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48258 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 21 ORI 11986 XORI 0 MULI 9095 LW 0 LWI 140990 lbu 0 
lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 62 DIV 18 FPUN 0 FPRSUB 70 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7496 --Total thread-cycles: 4083424 --total thread-cycles issued: 3030639 (74.218083%) --iCache conflicts: 109603 (2.684096%) --thread*cycles of FU dependence: 253712 (6.213217%) --thread*cycles of data dependence: 210771 (5.161624%) --iCache cycles*banks: 4083424 (81.342692% used) Issue breakdown: --thread*cycles of issue worked: 3030639 (74.218083%) --thread*cycles of issue failed: 761889 (18.658092%) --thread*cycles of issue NOP/other: 290896 (7.123825%) Number of thread-cycles not ready: 210771 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3321535 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 6 3: 9 4: 7 5: 7 6: 8 7: 8 8: 6 9: 8 10: 7 11: 7 12: 8 13: 7 14: 7 15: 8 16: 8 17: 8 18: 7 19: 8 20: 7 21: 6 22: 8 23: 7 24: 7 25: 6 26: 6 27: 7 28: 5 29: 6 30: 7 31: 7 <=== Core 33 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94583 in-flight CPI 1.3566 -- Total Cycles 128335 ---- Thread 01 ---- PC 5: Stalled ----- 96553 in-flight CPI 1.3289 -- Total Cycles 128335 ---- Thread 02 ---- PC 5: Stalled ----- 95580 in-flight CPI 1.3425 -- Total Cycles 128335 ---- Thread 03 ---- PC 5: Stalled ----- 91814 in-flight CPI 1.3976 -- Total Cycles 128335 ---- Thread 04 ---- PC 5: Stalled ----- 98810 in-flight CPI 1.2986 -- Total Cycles 128335 ---- Thread 05 ---- PC 5: Stalled ----- 102835 in-flight CPI 1.2477 -- Total Cycles 128335 ---- Thread 06 ---- PC 5: Stalled ----- 97799 in-flight CPI 1.3121 -- Total Cycles 128335 ---- Thread 07 ---- PC 5: Stalled ----- 98924 in-flight CPI 1.2971 -- Total Cycles 128335 ---- Thread 08 ---- PC 5: Stalled ----- 100697 in-flight CPI 1.2742 -- Total Cycles 
128335 ---- Thread 09 ---- PC 5: Stalled ----- 100535 in-flight CPI 1.2763 -- Total Cycles 128335 ---- Thread 10 ---- PC 5: Stalled ----- 95543 in-flight CPI 1.3430 -- Total Cycles 128335 ---- Thread 11 ---- PC 5: Stalled ----- 96437 in-flight CPI 1.3305 -- Total Cycles 128335 ---- Thread 12 ---- PC 5: Stalled ----- 94929 in-flight CPI 1.3516 -- Total Cycles 128335 ---- Thread 13 ---- PC 5: Stalled ----- 99514 in-flight CPI 1.2894 -- Total Cycles 128335 ---- Thread 14 ---- PC 5: Stalled ----- 88742 in-flight CPI 1.4459 -- Total Cycles 128335 ---- Thread 15 ---- PC 5: Stalled ----- 97619 in-flight CPI 1.3144 -- Total Cycles 128335 ---- Thread 16 ---- PC 5: Stalled ----- 92382 in-flight CPI 1.3890 -- Total Cycles 128335 ---- Thread 17 ---- PC 5: Stalled ----- 97079 in-flight CPI 1.3217 -- Total Cycles 128335 ---- Thread 18 ---- PC 5: Stalled ----- 94657 in-flight CPI 1.3556 -- Total Cycles 128335 ---- Thread 19 ---- PC 5: Stalled ----- 96694 in-flight CPI 1.3270 -- Total Cycles 128335 ---- Thread 20 ---- PC 5: Stalled ----- 88487 in-flight CPI 1.4501 -- Total Cycles 128335 ---- Thread 21 ---- PC 5: Stalled ----- 93506 in-flight CPI 1.3722 -- Total Cycles 128335 ---- Thread 22 ---- PC 5: Stalled ----- 92838 in-flight CPI 1.3822 -- Total Cycles 128335 ---- Thread 23 ---- PC 5: Stalled ----- 96297 in-flight CPI 1.3324 -- Total Cycles 128335 ---- Thread 24 ---- PC 5: Stalled ----- 94859 in-flight CPI 1.3526 -- Total Cycles 128335 ---- Thread 25 ---- PC 5: Stalled ----- 90213 in-flight CPI 1.4223 -- Total Cycles 128335 ---- Thread 26 ---- PC 5: Stalled ----- 89574 in-flight CPI 1.4325 -- Total Cycles 128335 ---- Thread 27 ---- PC 5: Stalled ----- 95372 in-flight CPI 1.3454 -- Total Cycles 128335 ---- Thread 28 ---- PC 5: Stalled ----- 94427 in-flight CPI 1.3589 -- Total Cycles 128335 ---- Thread 29 ---- PC 5: Stalled ----- 93338 in-flight CPI 1.3748 -- Total Cycles 128335 ---- Thread 30 ---- PC 5: Stalled ----- 88204 in-flight CPI 1.4547 -- Total Cycles 128335 ---- Thread 
31 ---- PC 5: Stalled ----- 87721 in-flight CPI 1.4628 -- Total Cycles 128335 Total CPI 0.0423 , IPC 23.6654 -- Total Cycles 128335 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8657 (3.714494%) FPSUB: 0 (0.000000%) FPMUL: 33247 (14.265425%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 100387 (43.073457%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5421 (2.326010%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 
(0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 77868 (33.411139%) DIV: 7222 (3.098773%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.110701%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3328476 total) ADD%: 7.359 (244949) SUB%: 0.000 (0) MUL%: 0.006 (196) BITOR%: 1.517 (50492) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.601 (19994) FPSUB%: 0.000 (0) FPMUL%: 4.921 (163809) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (588) FPMAX%: 0.018 (588) LOAD%: 5.211 (173448) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (228) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (567) FPINV%: 0.000 (0) FPCONV%: 0.019 (620) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.082 (36002) FPLE%: 0.452 (15039) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (588) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.772 (92256) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.761 (25336) CMPU%: 0.000 (0) RSUB%: 0.006 (196) RSUBC%: 0.000 (0) RSUBK%: 
0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.640 (520590) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.167 (38830) ORI%: 1.596 (53107) XORI%: 0.000 (0) MULI%: 3.166 (105386) LW%: 1.118 (37216) LWI%: 13.376 (445209) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9451) SWI%: 4.017 (133715) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.385 (46092) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10261) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.066 (2187) bned%: 0.000 (0) bneid%: 13.749 (457628) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.708 (23582) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.134 (4448) DIV%: 0.012 (392) FPUN%: 1.463 (48682) FPRSUB%: 4.335 (144299) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.925 (97368) FPGE%: 1.011 (33643) SYNC%: 0.000 (0) NOP%: 8.753 (291326) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 26 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 381 LOAD 41177 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1423 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48101 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 12388 XORI 0 MULI 8855 LW 0 LWI 141234 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 
bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 86 DIV 34 FPUN 0 FPRSUB 56 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6656 --Total thread-cycles: 4106720 --total thread-cycles issued: 3037150 (73.955614%) --iCache conflicts: 109830 (2.674397%) --thread*cycles of FU dependence: 253832 (6.180894%) --thread*cycles of data dependence: 233060 (5.675089%) --iCache cycles*banks: 4106720 (81.050279% used) Issue breakdown: --thread*cycles of issue worked: 3037150 (73.955614%) --thread*cycles of issue failed: 778244 (18.950501%) --thread*cycles of issue NOP/other: 291326 (7.093885%) Number of thread-cycles not ready: 233060 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3328476 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 6 4: 7 5: 8 6: 6 7: 7 8: 8 9: 8 10: 6 11: 7 12: 8 13: 8 14: 6 15: 7 16: 6 17: 9 18: 6 19: 8 20: 6 21: 7 22: 6 23: 8 24: 9 25: 7 26: 6 27: 8 28: 7 29: 6 30: 8 31: 6 <=== Core 34 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95330 in-flight CPI 1.3585 -- Total Cycles 129528 ---- Thread 01 ---- PC 5: Stalled ----- 100539 in-flight CPI 1.2881 -- Total Cycles 129528 ---- Thread 02 ---- PC 5: Stalled ----- 101661 in-flight CPI 1.2738 -- Total Cycles 129528 ---- Thread 03 ---- PC 5: Stalled ----- 97317 in-flight CPI 1.3307 -- Total Cycles 129528 ---- Thread 04 ---- PC 5: Stalled ----- 103548 in-flight CPI 1.2506 -- Total Cycles 129528 ---- Thread 05 ---- PC 5: Stalled ----- 102239 in-flight CPI 1.2667 -- Total Cycles 129528 ---- Thread 06 ---- PC 5: Stalled ----- 93202 in-flight CPI 1.3895 -- Total Cycles 129528 ---- Thread 07 ---- PC 5: Stalled ----- 100052 in-flight CPI 1.2944 -- Total Cycles 129528 ---- Thread 08 ---- PC 5: Stalled ----- 97830 in-flight CPI 1.3237 -- Total Cycles 129528 ---- Thread 09 ---- PC 5: Stalled ----- 101773 in-flight CPI 1.2724 -- Total Cycles 
129528 ---- Thread 10 ---- PC 5: Stalled ----- 98049 in-flight CPI 1.3208 -- Total Cycles 129528 ---- Thread 11 ---- PC 5: Stalled ----- 94043 in-flight CPI 1.3771 -- Total Cycles 129528 ---- Thread 12 ---- PC 5: Stalled ----- 99978 in-flight CPI 1.2953 -- Total Cycles 129528 ---- Thread 13 ---- PC 5: Stalled ----- 93399 in-flight CPI 1.3866 -- Total Cycles 129528 ---- Thread 14 ---- PC 5: Stalled ----- 94820 in-flight CPI 1.3659 -- Total Cycles 129528 ---- Thread 15 ---- PC 5: Stalled ----- 95593 in-flight CPI 1.3548 -- Total Cycles 129528 ---- Thread 16 ---- PC 5: Stalled ----- 99072 in-flight CPI 1.3072 -- Total Cycles 129528 ---- Thread 17 ---- PC 5: Stalled ----- 90999 in-flight CPI 1.4231 -- Total Cycles 129528 ---- Thread 18 ---- PC 5: Stalled ----- 95696 in-flight CPI 1.3532 -- Total Cycles 129528 ---- Thread 19 ---- PC 5: Stalled ----- 96660 in-flight CPI 1.3398 -- Total Cycles 129528 ---- Thread 20 ---- PC 5: Stalled ----- 95161 in-flight CPI 1.3610 -- Total Cycles 129528 ---- Thread 21 ---- PC 5: Stalled ----- 92712 in-flight CPI 1.3968 -- Total Cycles 129528 ---- Thread 22 ---- PC 5: Stalled ----- 90114 in-flight CPI 1.4372 -- Total Cycles 129528 ---- Thread 23 ---- PC 5: Stalled ----- 90613 in-flight CPI 1.4292 -- Total Cycles 129528 ---- Thread 24 ---- PC 5: Stalled ----- 93992 in-flight CPI 1.3778 -- Total Cycles 129528 ---- Thread 25 ---- PC 5: Stalled ----- 91385 in-flight CPI 1.4172 -- Total Cycles 129528 ---- Thread 26 ---- PC 5: Stalled ----- 93746 in-flight CPI 1.3814 -- Total Cycles 129528 ---- Thread 27 ---- PC 5: Stalled ----- 88946 in-flight CPI 1.4560 -- Total Cycles 129528 ---- Thread 28 ---- PC 5: Stalled ----- 93689 in-flight CPI 1.3823 -- Total Cycles 129528 ---- Thread 29 ---- PC 5: Stalled ----- 90430 in-flight CPI 1.4321 -- Total Cycles 129528 ---- Thread 30 ---- PC 5: Stalled ----- 90836 in-flight CPI 1.4257 -- Total Cycles 129528 ---- Thread 31 ---- PC 5: Stalled ----- 87086 in-flight CPI 1.4871 -- Total Cycles 129528 Total CPI 
0.0425 , IPC 23.5553 -- Total Cycles 129528 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8099 (3.954841%) FPSUB: 0 (0.000000%) FPMUL: 32296 (15.770532%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76649 (37.428645%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5730 (2.798029%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 
(0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74219 (36.242047%) DIV: 7528 (3.676015%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.129891%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3343070 total) ADD%: 7.446 (248910) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.510 (50465) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.568 (18999) FPSUB%: 0.000 (0) FPMUL%: 4.823 (161249) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.176 (173024) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (593) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35639) FPLE%: 0.451 (15069) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.801 (93639) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (25050) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) 
ADDI%: 15.657 (523411) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39172) ORI%: 1.572 (52553) XORI%: 0.000 (0) MULI%: 3.192 (106700) LW%: 1.130 (37782) LWI%: 13.458 (449907) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9556) SWI%: 4.068 (135985) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.401 (46830) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10317) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2089) bned%: 0.000 (0) bneid%: 13.744 (459469) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23880) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4229) DIV%: 0.012 (408) FPUN%: 1.462 (48871) FPRSUB%: 4.253 (142196) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.932 (98035) FPGE%: 1.011 (33802) SYNC%: 0.000 (0) NOP%: 8.733 (291948) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 18 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 39814 INTCONV 0 ATOMIC_INC 24 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1869 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48654 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 20 ORI 11535 XORI 0 MULI 9291 LW 0 LWI 142174 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 
rtsd 0 FPDIV 66 DIV 35 FPUN 0 FPRSUB 43 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5555 --Total thread-cycles: 4144896 --total thread-cycles issued: 3051122 (73.611545%) --iCache conflicts: 110586 (2.668004%) --thread*cycles of FU dependence: 254003 (6.128091%) --thread*cycles of data dependence: 204787 (4.940703%) --iCache cycles*banks: 4144896 (80.655872% used) Issue breakdown: --thread*cycles of issue worked: 3051122 (73.611545%) --thread*cycles of issue failed: 801826 (19.344900%) --thread*cycles of issue NOP/other: 291948 (7.043554%) Number of thread-cycles not ready: 204787 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3343070 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 8 4: 9 5: 8 6: 7 7: 8 8: 9 9: 9 10: 7 11: 7 12: 9 13: 7 14: 5 15: 6 16: 7 17: 7 18: 9 19: 7 20: 6 21: 8 22: 6 23: 7 24: 8 25: 5 26: 8 27: 7 28: 7 29: 8 30: 7 31: 6 <=== Core 35 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95206 in-flight CPI 1.3332 -- Total Cycles 126948 ---- Thread 01 ---- PC 5: Stalled ----- 100005 in-flight CPI 1.2692 -- Total Cycles 126948 ---- Thread 02 ---- PC 5: Stalled ----- 96647 in-flight CPI 1.3133 -- Total Cycles 126948 ---- Thread 03 ---- PC 5: Stalled ----- 96685 in-flight CPI 1.3128 -- Total Cycles 126948 ---- Thread 04 ---- PC 5: Stalled ----- 99667 in-flight CPI 1.2735 -- Total Cycles 126948 ---- Thread 05 ---- PC 5: Stalled ----- 95100 in-flight CPI 1.3347 -- Total Cycles 126948 ---- Thread 06 ---- PC 5: Stalled ----- 97151 in-flight CPI 1.3064 -- Total Cycles 126948 ---- Thread 07 ---- PC 5: Stalled ----- 95002 in-flight CPI 1.3360 -- Total Cycles 126948 ---- Thread 08 ---- PC 5: Stalled ----- 99115 in-flight CPI 1.2806 -- Total Cycles 126948 ---- Thread 09 ---- PC 5: Stalled ----- 96786 in-flight CPI 1.3114 -- Total Cycles 126948 ---- Thread 10 ---- PC 5: Stalled ----- 96845 in-flight CPI 1.3105 -- Total Cycles 126948 ---- 
Thread 11 ---- PC 5: Stalled ----- 98644 in-flight CPI 1.2867 -- Total Cycles 126948 ---- Thread 12 ---- PC 5: Stalled ----- 95750 in-flight CPI 1.3256 -- Total Cycles 126948 ---- Thread 13 ---- PC 5: Stalled ----- 96650 in-flight CPI 1.3132 -- Total Cycles 126948 ---- Thread 14 ---- PC 5: Stalled ----- 97580 in-flight CPI 1.3007 -- Total Cycles 126948 ---- Thread 15 ---- PC 5: Stalled ----- 99013 in-flight CPI 1.2819 -- Total Cycles 126948 ---- Thread 16 ---- PC 5: Stalled ----- 98811 in-flight CPI 1.2845 -- Total Cycles 126948 ---- Thread 17 ---- PC 5: Stalled ----- 93749 in-flight CPI 1.3539 -- Total Cycles 126948 ---- Thread 18 ---- PC 5: Stalled ----- 100856 in-flight CPI 1.2584 -- Total Cycles 126948 ---- Thread 19 ---- PC 5: Stalled ----- 93693 in-flight CPI 1.3547 -- Total Cycles 126948 ---- Thread 20 ---- PC 5: Stalled ----- 91758 in-flight CPI 1.3833 -- Total Cycles 126948 ---- Thread 21 ---- PC 5: Stalled ----- 94936 in-flight CPI 1.3370 -- Total Cycles 126948 ---- Thread 22 ---- PC 5: Stalled ----- 91590 in-flight CPI 1.3858 -- Total Cycles 126948 ---- Thread 23 ---- PC 5: Stalled ----- 92895 in-flight CPI 1.3663 -- Total Cycles 126948 ---- Thread 24 ---- PC 5: Stalled ----- 91826 in-flight CPI 1.3823 -- Total Cycles 126948 ---- Thread 25 ---- PC 5: Stalled ----- 92337 in-flight CPI 1.3745 -- Total Cycles 126948 ---- Thread 26 ---- PC 5: Stalled ----- 92850 in-flight CPI 1.3670 -- Total Cycles 126948 ---- Thread 27 ---- PC 5: Stalled ----- 89762 in-flight CPI 1.4140 -- Total Cycles 126948 ---- Thread 28 ---- PC 5: Stalled ----- 89556 in-flight CPI 1.4173 -- Total Cycles 126948 ---- Thread 29 ---- PC 5: Stalled ----- 87400 in-flight CPI 1.4522 -- Total Cycles 126948 ---- Thread 30 ---- PC 5: Stalled ----- 84842 in-flight CPI 1.4961 -- Total Cycles 126948 ---- Thread 31 ---- PC 5: Stalled ----- 90778 in-flight CPI 1.3981 -- Total Cycles 126948 Total CPI 0.0418 , IPC 23.8999 -- Total Cycles 126948 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 
13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7911 (3.787215%) FPSUB: 0 (0.000000%) FPMUL: 31895 (15.269021%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 82968 (39.719083%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5567 (2.665077%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 
(0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72718 (34.812123%) DIV: 7562 (3.620139%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.127342%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3324928 total) ADD%: 7.403 (246138) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.529 (50826) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.560 (18626) FPSUB%: 0.000 (0) FPMUL%: 4.797 (159508) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.157 (171457) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (585) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35435) FPLE%: 0.455 (15121) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.800 (93110) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24840) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.667 (520925) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) 
RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (38949) ORI%: 1.575 (52364) XORI%: 0.000 (0) MULI%: 3.195 (106246) LW%: 1.130 (37572) LWI%: 13.455 (447377) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9534) SWI%: 4.060 (134984) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46525) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10307) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (2006) bned%: 0.000 (0) bneid%: 13.784 (458306) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23920) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4139) DIV%: 0.012 (410) FPUN%: 1.481 (49244) FPRSUB%: 4.230 (140652) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.934 (97567) FPGE%: 1.026 (34123) SYNC%: 0.000 (0) NOP%: 8.747 (290828) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 402 LOAD 39763 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1367 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48489 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 20 ORI 11273 XORI 0 MULI 8948 LW 0 LWI 141450 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 70 DIV 34 FPUN 0 FPRSUB 50 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 
PROF 0 --Average #threads Issuing each cycle: 23.9001 --Total thread-cycles: 4062336 --total thread-cycles issued: 3034100 (74.688554%) --iCache conflicts: 111388 (2.741969%) --thread*cycles of FU dependence: 251942 (6.201900%) --thread*cycles of data dependence: 208887 (5.142041%) --iCache cycles*banks: 4062336 (81.848473% used) Issue breakdown: --thread*cycles of issue worked: 3034100 (74.688554%) --thread*cycles of issue failed: 737408 (18.152314%) --thread*cycles of issue NOP/other: 290828 (7.159132%) Number of thread-cycles not ready: 208887 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3324928 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 7 4: 8 5: 7 6: 8 7: 7 8: 8 9: 7 10: 9 11: 8 12: 7 13: 8 14: 8 15: 8 16: 8 17: 6 18: 9 19: 6 20: 7 21: 6 22: 8 23: 7 24: 6 25: 8 26: 8 27: 7 28: 7 29: 7 30: 6 31: 8 <=== Core 36 ===> ---- Thread 00 ---- PC 5: Stalled ----- 89562 in-flight CPI 1.4284 -- Total Cycles 127953 ---- Thread 01 ---- PC 5: Stalled ----- 101668 in-flight CPI 1.2583 -- Total Cycles 127953 ---- Thread 02 ---- PC 5: Stalled ----- 102908 in-flight CPI 1.2431 -- Total Cycles 127953 ---- Thread 03 ---- PC 5: Stalled ----- 93416 in-flight CPI 1.3694 -- Total Cycles 127953 ---- Thread 04 ---- PC 5: Stalled ----- 99220 in-flight CPI 1.2894 -- Total Cycles 127953 ---- Thread 05 ---- PC 5: Stalled ----- 95423 in-flight CPI 1.3407 -- Total Cycles 127953 ---- Thread 06 ---- PC 5: Stalled ----- 97054 in-flight CPI 1.3181 -- Total Cycles 127953 ---- Thread 07 ---- PC 5: Stalled ----- 98069 in-flight CPI 1.3045 -- Total Cycles 127953 ---- Thread 08 ---- PC 5: Stalled ----- 99589 in-flight CPI 1.2846 -- Total Cycles 127953 ---- Thread 09 ---- PC 5: Stalled ----- 98205 in-flight CPI 1.3027 -- Total Cycles 127953 ---- Thread 10 ---- PC 5: Stalled ----- 98706 in-flight CPI 1.2961 -- Total Cycles 127953 ---- Thread 11 ---- PC 5: Stalled ----- 98730 in-flight CPI 1.2957 -- Total Cycles 127953 ---- Thread 
12 ---- PC 5: Stalled ----- 95077 in-flight CPI 1.3455 -- Total Cycles 127953 ---- Thread 13 ---- PC 5: Stalled ----- 92550 in-flight CPI 1.3823 -- Total Cycles 127953 ---- Thread 14 ---- PC 5: Stalled ----- 89320 in-flight CPI 1.4323 -- Total Cycles 127953 ---- Thread 15 ---- PC 5: Stalled ----- 100465 in-flight CPI 1.2733 -- Total Cycles 127953 ---- Thread 16 ---- PC 5: Stalled ----- 96457 in-flight CPI 1.3263 -- Total Cycles 127953 ---- Thread 17 ---- PC 5: Stalled ----- 100793 in-flight CPI 1.2692 -- Total Cycles 127953 ---- Thread 18 ---- PC 5: Stalled ----- 94986 in-flight CPI 1.3468 -- Total Cycles 127953 ---- Thread 19 ---- PC 5: Stalled ----- 94786 in-flight CPI 1.3497 -- Total Cycles 127953 ---- Thread 20 ---- PC 5: Stalled ----- 96788 in-flight CPI 1.3217 -- Total Cycles 127953 ---- Thread 21 ---- PC 5: Stalled ----- 89348 in-flight CPI 1.4318 -- Total Cycles 127953 ---- Thread 22 ---- PC 5: Stalled ----- 91260 in-flight CPI 1.4018 -- Total Cycles 127953 ---- Thread 23 ---- PC 5: Stalled ----- 97563 in-flight CPI 1.3113 -- Total Cycles 127953 ---- Thread 24 ---- PC 5: Stalled ----- 91810 in-flight CPI 1.3934 -- Total Cycles 127953 ---- Thread 25 ---- PC 5: Stalled ----- 91642 in-flight CPI 1.3960 -- Total Cycles 127953 ---- Thread 26 ---- PC 5: Stalled ----- 93044 in-flight CPI 1.3749 -- Total Cycles 127953 ---- Thread 27 ---- PC 5: Stalled ----- 82606 in-flight CPI 1.5488 -- Total Cycles 127953 ---- Thread 28 ---- PC 5: Stalled ----- 91021 in-flight CPI 1.4055 -- Total Cycles 127953 ---- Thread 29 ---- PC 5: Stalled ----- 89912 in-flight CPI 1.4228 -- Total Cycles 127953 ---- Thread 30 ---- PC 5: Stalled ----- 91413 in-flight CPI 1.3994 -- Total Cycles 127953 ---- Thread 31 ---- PC 5: Stalled ----- 77843 in-flight CPI 1.6436 -- Total Cycles 127953 Total CPI 0.0423 , IPC 23.6164 -- Total Cycles 127953 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): 
ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8115 (3.675671%) FPSUB: 0 (0.000000%) FPMUL: 32239 (14.602584%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 93026 (42.135921%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5661 (2.564137%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 
(0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73980 (33.509077%) DIV: 7490 (3.392579%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.120031%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3311031 total) ADD%: 7.455 (246832) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.527 (50565) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.577 (19095) FPSUB%: 0.000 (0) FPMUL%: 4.844 (160402) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.163 (170954) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (587) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.071 (35456) FPLE%: 0.454 (15043) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.785 (92228) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (24935) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.648 (518094) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 
(38743) ORI%: 1.577 (52206) XORI%: 0.000 (0) MULI%: 3.184 (105410) LW%: 1.124 (37216) LWI%: 13.422 (444411) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9427) SWI%: 4.047 (133986) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.393 (46108) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10211) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1966) bned%: 0.000 (0) bneid%: 13.763 (455704) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23691) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4216) DIV%: 0.012 (406) FPUN%: 1.475 (48835) FPRSUB%: 4.262 (141107) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (77) FPGT%: 2.928 (96935) FPGE%: 1.021 (33792) SYNC%: 0.000 (0) NOP%: 8.734 (289188) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 6 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 40614 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1556 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48081 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 11525 XORI 0 MULI 8812 LW 0 LWI 140932 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 26 FPUN 0 FPRSUB 54 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6166 --Total thread-cycles: 4094496 
--total thread-cycles issued: 3021843 (73.802563%) --iCache conflicts: 107838 (2.633731%) --thread*cycles of FU dependence: 252158 (6.158462%) --thread*cycles of data dependence: 220776 (5.392019%) --iCache cycles*banks: 4094496 (80.866192% used) Issue breakdown: --thread*cycles of issue worked: 3021843 (73.802563%) --thread*cycles of issue failed: 783465 (19.134589%) --thread*cycles of issue NOP/other: 289188 (7.062847%) Number of thread-cycles not ready: 220776 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3311031 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 8 4: 7 5: 6 6: 7 7: 8 8: 8 9: 7 10: 7 11: 8 12: 8 13: 7 14: 7 15: 9 16: 8 17: 8 18: 9 19: 7 20: 9 21: 7 22: 8 23: 7 24: 8 25: 7 26: 7 27: 5 28: 6 29: 7 30: 8 31: 4 <=== Core 37 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101604 in-flight CPI 1.6105 -- Total Cycles 163669 ---- Thread 01 ---- PC 5: Stalled ----- 95275 in-flight CPI 1.7175 -- Total Cycles 163669 ---- Thread 02 ---- PC 5: Stalled ----- 94646 in-flight CPI 1.7290 -- Total Cycles 163669 ---- Thread 03 ---- PC 5: Stalled ----- 120927 in-flight CPI 1.3533 -- Total Cycles 163669 ---- Thread 04 ---- PC 5: Stalled ----- 98473 in-flight CPI 1.6618 -- Total Cycles 163669 ---- Thread 05 ---- PC 5: Stalled ----- 94498 in-flight CPI 1.7316 -- Total Cycles 163669 ---- Thread 06 ---- PC 5: Stalled ----- 95755 in-flight CPI 1.7090 -- Total Cycles 163669 ---- Thread 07 ---- PC 5: Stalled ----- 100037 in-flight CPI 1.6358 -- Total Cycles 163669 ---- Thread 08 ---- PC 5: Stalled ----- 100658 in-flight CPI 1.6257 -- Total Cycles 163669 ---- Thread 09 ---- PC 5: Stalled ----- 96713 in-flight CPI 1.6920 -- Total Cycles 163669 ---- Thread 10 ---- PC 5: Stalled ----- 92801 in-flight CPI 1.7633 -- Total Cycles 163669 ---- Thread 11 ---- PC 5: Stalled ----- 96665 in-flight CPI 1.6928 -- Total Cycles 163669 ---- Thread 12 ---- PC 5: Stalled ----- 89557 in-flight CPI 1.8273 -- Total Cycles 163669 ---- 
Thread 13 ---- PC 5: Stalled ----- 98112 in-flight CPI 1.6679 -- Total Cycles 163669 ---- Thread 14 ---- PC 5: Stalled ----- 100447 in-flight CPI 1.6291 -- Total Cycles 163669 ---- Thread 15 ---- PC 5: Stalled ----- 94341 in-flight CPI 1.7346 -- Total Cycles 163669 ---- Thread 16 ---- PC 5: Stalled ----- 86722 in-flight CPI 1.8871 -- Total Cycles 163669 ---- Thread 17 ---- PC 5: Stalled ----- 98255 in-flight CPI 1.6654 -- Total Cycles 163669 ---- Thread 18 ---- PC 5: Stalled ----- 97828 in-flight CPI 1.6727 -- Total Cycles 163669 ---- Thread 19 ---- PC 5: Stalled ----- 90335 in-flight CPI 1.8115 -- Total Cycles 163669 ---- Thread 20 ---- PC 5: Stalled ----- 94549 in-flight CPI 1.7307 -- Total Cycles 163669 ---- Thread 21 ---- PC 5: Stalled ----- 90316 in-flight CPI 1.8118 -- Total Cycles 163669 ---- Thread 22 ---- PC 5: Stalled ----- 94308 in-flight CPI 1.7352 -- Total Cycles 163669 ---- Thread 23 ---- PC 5: Stalled ----- 95680 in-flight CPI 1.7102 -- Total Cycles 163669 ---- Thread 24 ---- PC 5: Stalled ----- 93649 in-flight CPI 1.7473 -- Total Cycles 163669 ---- Thread 25 ---- PC 5: Stalled ----- 91587 in-flight CPI 1.7867 -- Total Cycles 163669 ---- Thread 26 ---- PC 5: Stalled ----- 88553 in-flight CPI 1.8480 -- Total Cycles 163669 ---- Thread 27 ---- PC 5: Stalled ----- 87528 in-flight CPI 1.8695 -- Total Cycles 163669 ---- Thread 28 ---- PC 5: Stalled ----- 91869 in-flight CPI 1.7812 -- Total Cycles 163669 ---- Thread 29 ---- PC 5: Stalled ----- 90915 in-flight CPI 1.8000 -- Total Cycles 163669 ---- Thread 30 ---- PC 5: Stalled ----- 89384 in-flight CPI 1.8308 -- Total Cycles 163669 ---- Thread 31 ---- PC 5: Stalled ----- 90263 in-flight CPI 1.8129 -- Total Cycles 163669 Total CPI 0.0538 , IPC 18.5912 -- Total Cycles 163669 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 
(0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8323 (3.775220%) FPSUB: 0 (0.000000%) FPMUL: 32702 (14.833261%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 90871 (41.218067%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5389 (2.444390%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 
0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75578 (34.281334%) DIV: 7342 (3.330249%) FPUN: 0 (0.000000%) FPRSUB: 259 (0.117479%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3334673 total) ADD%: 7.415 (247257) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.520 (50694) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.580 (19352) FPSUB%: 0.000 (0) FPMUL%: 4.857 (161953) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.185 (172906) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (569) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.074 (35812) FPLE%: 0.454 (15149) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.783 (92794) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.755 (25180) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.649 (521842) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.168 (38951) ORI%: 1.583 (52792) XORI%: 0.000 (0) MULI%: 3.180 (106026) LW%: 1.123 (37436) LWI%: 13.413 
(447267) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9496) SWI%: 4.030 (134395) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.390 (46367) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10304) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.064 (2119) bned%: 0.000 (0) bneid%: 13.762 (458934) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23857) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.129 (4311) DIV%: 0.012 (398) FPUN%: 1.469 (48980) FPRSUB%: 4.286 (142909) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (63) FPGT%: 2.932 (97758) FPGE%: 1.015 (33831) SYNC%: 0.000 (0) NOP%: 8.751 (291826) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 389 LOAD 39738 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 10 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1624 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48380 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 11916 XORI 0 MULI 8887 LW 0 LWI 141543 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 89 DIV 21 FPUN 0 FPRSUB 71 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 18.5914 --Total thread-cycles: 5237408 --total thread-cycles issued: 3042847 (58.098338%) --iCache conflicts: 108724 (2.075912%) 
--thread*cycles of FU dependence: 252778 (4.826395%) --thread*cycles of data dependence: 220464 (4.209410%) --iCache cycles*banks: 5237408 (63.670904% used) Issue breakdown: --thread*cycles of issue worked: 3042847 (58.098338%) --thread*cycles of issue failed: 1902735 (36.329707%) --thread*cycles of issue NOP/other: 291826 (5.571955%) Number of thread-cycles not ready: 220464 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3334673 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 6 3: 6 4: 6 5: 8 6: 7 7: 8 8: 8 9: 8 10: 7 11: 8 12: 6 13: 7 14: 8 15: 6 16: 5 17: 8 18: 8 19: 7 20: 8 21: 8 22: 7 23: 9 24: 8 25: 8 26: 6 27: 8 28: 7 29: 6 30: 6 31: 7 <=== Core 38 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101003 in-flight CPI 1.2615 -- Total Cycles 127436 ---- Thread 01 ---- PC 5: Stalled ----- 95240 in-flight CPI 1.3379 -- Total Cycles 127436 ---- Thread 02 ---- PC 5: Stalled ----- 93377 in-flight CPI 1.3645 -- Total Cycles 127436 ---- Thread 03 ---- PC 5: Stalled ----- 98800 in-flight CPI 1.2895 -- Total Cycles 127436 ---- Thread 04 ---- PC 5: Stalled ----- 93175 in-flight CPI 1.3675 -- Total Cycles 127436 ---- Thread 05 ---- PC 5: Stalled ----- 96830 in-flight CPI 1.3158 -- Total Cycles 127436 ---- Thread 06 ---- PC 5: Stalled ----- 102625 in-flight CPI 1.2415 -- Total Cycles 127436 ---- Thread 07 ---- PC 5: Stalled ----- 101163 in-flight CPI 1.2595 -- Total Cycles 127436 ---- Thread 08 ---- PC 5: Stalled ----- 101367 in-flight CPI 1.2569 -- Total Cycles 127436 ---- Thread 09 ---- PC 5: Stalled ----- 98893 in-flight CPI 1.2884 -- Total Cycles 127436 ---- Thread 10 ---- PC 5: Stalled ----- 96171 in-flight CPI 1.3249 -- Total Cycles 127436 ---- Thread 11 ---- PC 5: Stalled ----- 103166 in-flight CPI 1.2350 -- Total Cycles 127436 ---- Thread 12 ---- PC 5: Stalled ----- 99509 in-flight CPI 1.2804 -- Total Cycles 127436 ---- Thread 13 ---- PC 5: Stalled ----- 95694 in-flight CPI 1.3314 -- Total Cycles 127436 ---- 
Thread 14 ---- PC 5: Stalled ----- 96607 in-flight CPI 1.3189 -- Total Cycles 127436 ---- Thread 15 ---- PC 5: Stalled ----- 97526 in-flight CPI 1.3064 -- Total Cycles 127436 ---- Thread 16 ---- PC 5: Stalled ----- 94494 in-flight CPI 1.3483 -- Total Cycles 127436 ---- Thread 17 ---- PC 5: Stalled ----- 91012 in-flight CPI 1.4000 -- Total Cycles 127436 ---- Thread 18 ---- PC 5: Stalled ----- 92706 in-flight CPI 1.3743 -- Total Cycles 127436 ---- Thread 19 ---- PC 5: Stalled ----- 94221 in-flight CPI 1.3523 -- Total Cycles 127436 ---- Thread 20 ---- PC 5: Stalled ----- 95986 in-flight CPI 1.3274 -- Total Cycles 127436 ---- Thread 21 ---- PC 5: Stalled ----- 92286 in-flight CPI 1.3806 -- Total Cycles 127436 ---- Thread 22 ---- PC 5: Stalled ----- 93549 in-flight CPI 1.3620 -- Total Cycles 127436 ---- Thread 23 ---- PC 5: Stalled ----- 97174 in-flight CPI 1.3112 -- Total Cycles 127436 ---- Thread 24 ---- PC 5: Stalled ----- 86770 in-flight CPI 1.4685 -- Total Cycles 127436 ---- Thread 25 ---- PC 5: Stalled ----- 95863 in-flight CPI 1.3291 -- Total Cycles 127436 ---- Thread 26 ---- PC 5: Stalled ----- 88543 in-flight CPI 1.4391 -- Total Cycles 127436 ---- Thread 27 ---- PC 5: Stalled ----- 88767 in-flight CPI 1.4354 -- Total Cycles 127436 ---- Thread 28 ---- PC 5: Stalled ----- 87363 in-flight CPI 1.4584 -- Total Cycles 127436 ---- Thread 29 ---- PC 5: Stalled ----- 89670 in-flight CPI 1.4209 -- Total Cycles 127436 ---- Thread 30 ---- PC 5: Stalled ----- 88164 in-flight CPI 1.4452 -- Total Cycles 127436 ---- Thread 31 ---- PC 5: Stalled ----- 90214 in-flight CPI 1.4123 -- Total Cycles 127436 Total CPI 0.0419 , IPC 23.8433 -- Total Cycles 127436 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8284 (3.862149%) 
FPSUB: 0 (0.000000%) FPMUL: 32629 (15.212222%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 84657 (39.468605%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5564 (2.594036%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 
(0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75447 (35.174738%) DIV: 7643 (3.563303%) FPUN: 0 (0.000000%) FPRSUB: 268 (0.124946%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3329955 total) ADD%: 7.464 (248545) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.528 (50886) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.582 (19370) FPSUB%: 0.000 (0) FPMUL%: 4.858 (161769) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.166 (172024) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (587) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.076 (35831) FPLE%: 0.456 (15183) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.774 (92387) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (25035) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.641 (520833) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.168 (38880) ORI%: 1.581 (52644) XORI%: 0.000 (0) MULI%: 3.174 (105696) LW%: 1.120 (37286) LWI%: 13.392 (445958) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9505) SWI%: 4.032 (134264) 
sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.385 (46119) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10266) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1920) bned%: 0.000 (0) bneid%: 13.779 (458847) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.709 (23622) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.129 (4300) DIV%: 0.012 (414) FPUN%: 1.474 (49085) FPRSUB%: 4.274 (142310) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (61) FPGT%: 2.936 (97755) FPGE%: 1.018 (33902) SYNC%: 0.000 (0) NOP%: 8.751 (291406) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 32 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 4 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 40912 INTCONV 0 ATOMIC_INC 24 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1429 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48375 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 5 ORI 11820 XORI 0 MULI 9106 LW 0 LWI 141348 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 86 DIV 26 FPUN 0 FPRSUB 48 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8435 --Total thread-cycles: 4077952 --total thread-cycles issued: 3038549 (74.511642%) --iCache conflicts: 109728 (2.690762%) --thread*cycles of FU dependence: 253671 (6.220549%) --thread*cycles of data dependence: 214492 
(5.259797%) --iCache cycles*banks: 4077952 (81.658318% used) Issue breakdown: --thread*cycles of issue worked: 3038549 (74.511642%) --thread*cycles of issue failed: 747997 (18.342467%) --thread*cycles of issue NOP/other: 291406 (7.145891%) Number of thread-cycles not ready: 214492 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3329955 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 6 2: 7 3: 9 4: 6 5: 8 6: 10 7: 8 8: 9 9: 8 10: 7 11: 8 12: 8 13: 8 14: 8 15: 8 16: 8 17: 7 18: 9 19: 7 20: 8 21: 7 22: 8 23: 7 24: 5 25: 7 26: 5 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 39 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102666 in-flight CPI 1.2428 -- Total Cycles 127616 ---- Thread 01 ---- PC 5: Stalled ----- 101899 in-flight CPI 1.2521 -- Total Cycles 127616 ---- Thread 02 ---- PC 5: Stalled ----- 102608 in-flight CPI 1.2435 -- Total Cycles 127616 ---- Thread 03 ---- PC 5: Stalled ----- 94463 in-flight CPI 1.3507 -- Total Cycles 127616 ---- Thread 04 ---- PC 5: Stalled ----- 100155 in-flight CPI 1.2739 -- Total Cycles 127616 ---- Thread 05 ---- PC 5: Stalled ----- 92387 in-flight CPI 1.3811 -- Total Cycles 127616 ---- Thread 06 ---- PC 5: Stalled ----- 97440 in-flight CPI 1.3095 -- Total Cycles 127616 ---- Thread 07 ---- PC 5: Stalled ----- 97043 in-flight CPI 1.3148 -- Total Cycles 127616 ---- Thread 08 ---- PC 5: Stalled ----- 91763 in-flight CPI 1.3905 -- Total Cycles 127616 ---- Thread 09 ---- PC 5: Stalled ----- 93607 in-flight CPI 1.3630 -- Total Cycles 127616 ---- Thread 10 ---- PC 5: Stalled ----- 90061 in-flight CPI 1.4168 -- Total Cycles 127616 ---- Thread 11 ---- PC 5: Stalled ----- 102818 in-flight CPI 1.2409 -- Total Cycles 127616 ---- Thread 12 ---- PC 5: Stalled ----- 94775 in-flight CPI 1.3463 -- Total Cycles 127616 ---- Thread 13 ---- PC 5: Stalled ----- 94839 in-flight CPI 1.3454 -- Total Cycles 127616 ---- Thread 14 ---- PC 5: Stalled ----- 101638 in-flight CPI 1.2554 -- Total Cycles 127616 ---- 
Thread 15 ---- PC 5: Stalled ----- 98710 in-flight CPI 1.2926 -- Total Cycles 127616 ---- Thread 16 ---- PC 5: Stalled ----- 93536 in-flight CPI 1.3641 -- Total Cycles 127616 ---- Thread 17 ---- PC 5: Stalled ----- 95855 in-flight CPI 1.3311 -- Total Cycles 127616 ---- Thread 18 ---- PC 5: Stalled ----- 92894 in-flight CPI 1.3735 -- Total Cycles 127616 ---- Thread 19 ---- PC 5: Stalled ----- 91004 in-flight CPI 1.4021 -- Total Cycles 127616 ---- Thread 20 ---- PC 5: Stalled ----- 96100 in-flight CPI 1.3277 -- Total Cycles 127616 ---- Thread 21 ---- PC 5: Stalled ----- 96902 in-flight CPI 1.3167 -- Total Cycles 127616 ---- Thread 22 ---- PC 5: Stalled ----- 99627 in-flight CPI 1.2807 -- Total Cycles 127616 ---- Thread 23 ---- PC 5: Stalled ----- 90457 in-flight CPI 1.4105 -- Total Cycles 127616 ---- Thread 24 ---- PC 5: Stalled ----- 93655 in-flight CPI 1.3624 -- Total Cycles 127616 ---- Thread 25 ---- PC 5: Stalled ----- 95650 in-flight CPI 1.3339 -- Total Cycles 127616 ---- Thread 26 ---- PC 5: Stalled ----- 88454 in-flight CPI 1.4425 -- Total Cycles 127616 ---- Thread 27 ---- PC 5: Stalled ----- 92617 in-flight CPI 1.3776 -- Total Cycles 127616 ---- Thread 28 ---- PC 5: Stalled ----- 90448 in-flight CPI 1.4107 -- Total Cycles 127616 ---- Thread 29 ---- PC 5: Stalled ----- 93433 in-flight CPI 1.3656 -- Total Cycles 127616 ---- Thread 30 ---- PC 5: Stalled ----- 89585 in-flight CPI 1.4243 -- Total Cycles 127616 ---- Thread 31 ---- PC 5: Stalled ----- 88414 in-flight CPI 1.4431 -- Total Cycles 127616 Total CPI 0.0419 , IPC 23.8690 -- Total Cycles 127616 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7933 (4.219569%) FPSUB: 0 (0.000000%) FPMUL: 32007 (17.024547%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) 
FPMAX: 0 (0.000000%) LOAD: 61621 (32.776256%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5910 (3.143533%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 
(0.000000%) FPDIV: 72670 (38.653227%) DIV: 7598 (4.041382%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.141486%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3338018 total) ADD%: 7.450 (248684) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.527 (50969) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.558 (18640) FPSUB%: 0.000 (0) FPMUL%: 4.793 (159989) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.138 (171505) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (604) FPINV%: 0.000 (0) FPCONV%: 0.019 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (35554) FPLE%: 0.453 (15105) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.797 (93351) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24926) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.654 (522534) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (39097) ORI%: 1.571 (52438) XORI%: 0.000 (0) MULI%: 3.195 (106652) LW%: 1.129 (37670) LWI%: 13.471 (449678) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9514) SWI%: 4.061 (135572) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46707) bged%: 0.000 
(0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10268) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1943) bned%: 0.000 (0) bneid%: 13.784 (460117) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23944) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4136) DIV%: 0.012 (412) FPUN%: 1.478 (49336) FPRSUB%: 4.228 (141118) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (81) FPGT%: 2.939 (98096) FPGE%: 1.025 (34231) SYNC%: 0.000 (0) NOP%: 8.745 (291897) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 39431 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 10 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1498 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48742 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 18 ORI 11306 XORI 0 MULI 9354 LW 0 LWI 142229 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 78 DIV 30 FPUN 0 FPRSUB 77 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8692 --Total thread-cycles: 4083712 --total thread-cycles issued: 3046121 (74.591964%) --iCache conflicts: 111738 (2.736187%) --thread*cycles of FU dependence: 253239 (6.201196%) --thread*cycles of data dependence: 188005 (4.603777%) --iCache cycles*banks: 4083712 (81.740583% used) Issue breakdown: 
--thread*cycles of issue worked: 3046121 (74.591964%) --thread*cycles of issue failed: 745694 (18.260201%) --thread*cycles of issue NOP/other: 291897 (7.147835%) Number of thread-cycles not ready: 188005 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3338018 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 8 5: 7 6: 6 7: 7 8: 7 9: 8 10: 6 11: 9 12: 7 13: 7 14: 8 15: 8 16: 8 17: 8 18: 7 19: 7 20: 7 21: 7 22: 8 23: 7 24: 7 25: 8 26: 7 27: 8 28: 7 29: 8 30: 7 31: 7 <=== Core 40 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103086 in-flight CPI 1.2450 -- Total Cycles 128366 ---- Thread 01 ---- PC 5: Stalled ----- 99687 in-flight CPI 1.2874 -- Total Cycles 128366 ---- Thread 02 ---- PC 5: Stalled ----- 97404 in-flight CPI 1.3177 -- Total Cycles 128366 ---- Thread 03 ---- PC 5: Stalled ----- 95498 in-flight CPI 1.3439 -- Total Cycles 128366 ---- Thread 04 ---- PC 5: Stalled ----- 98220 in-flight CPI 1.3067 -- Total Cycles 128366 ---- Thread 05 ---- PC 5: Stalled ----- 100869 in-flight CPI 1.2723 -- Total Cycles 128366 ---- Thread 06 ---- PC 5: Stalled ----- 99556 in-flight CPI 1.2891 -- Total Cycles 128366 ---- Thread 07 ---- PC 5: Stalled ----- 101866 in-flight CPI 1.2599 -- Total Cycles 128366 ---- Thread 08 ---- PC 5: Stalled ----- 104594 in-flight CPI 1.2270 -- Total Cycles 128366 ---- Thread 09 ---- PC 5: Stalled ----- 100123 in-flight CPI 1.2818 -- Total Cycles 128366 ---- Thread 10 ---- PC 5: Stalled ----- 94954 in-flight CPI 1.3516 -- Total Cycles 128366 ---- Thread 11 ---- PC 5: Stalled ----- 96138 in-flight CPI 1.3350 -- Total Cycles 128366 ---- Thread 12 ---- PC 5: Stalled ----- 95310 in-flight CPI 1.3466 -- Total Cycles 128366 ---- Thread 13 ---- PC 5: Stalled ----- 97569 in-flight CPI 1.3154 -- Total Cycles 128366 ---- Thread 14 ---- PC 5: Stalled ----- 94890 in-flight CPI 1.3526 -- Total Cycles 128366 ---- Thread 15 ---- PC 5: Stalled ----- 87658 in-flight CPI 1.4642 -- Total Cycles 128366 
---- Thread 16 ---- PC 5: Stalled ----- 92324 in-flight CPI 1.3902 -- Total Cycles 128366 ---- Thread 17 ---- PC 5: Stalled ----- 94016 in-flight CPI 1.3651 -- Total Cycles 128366 ---- Thread 18 ---- PC 5: Stalled ----- 96993 in-flight CPI 1.3232 -- Total Cycles 128366 ---- Thread 19 ---- PC 5: Stalled ----- 95444 in-flight CPI 1.3447 -- Total Cycles 128366 ---- Thread 20 ---- PC 5: Stalled ----- 92387 in-flight CPI 1.3892 -- Total Cycles 128366 ---- Thread 21 ---- PC 5: Stalled ----- 89722 in-flight CPI 1.4304 -- Total Cycles 128366 ---- Thread 22 ---- PC 5: Stalled ----- 95534 in-flight CPI 1.3434 -- Total Cycles 128366 ---- Thread 23 ---- PC 5: Stalled ----- 92753 in-flight CPI 1.3837 -- Total Cycles 128366 ---- Thread 24 ---- PC 5: Stalled ----- 93169 in-flight CPI 1.3775 -- Total Cycles 128366 ---- Thread 25 ---- PC 5: Stalled ----- 89069 in-flight CPI 1.4410 -- Total Cycles 128366 ---- Thread 26 ---- PC 5: Stalled ----- 94853 in-flight CPI 1.3530 -- Total Cycles 128366 ---- Thread 27 ---- PC 5: Stalled ----- 92374 in-flight CPI 1.3894 -- Total Cycles 128366 ---- Thread 28 ---- PC 5: Stalled ----- 92326 in-flight CPI 1.3901 -- Total Cycles 128366 ---- Thread 29 ---- PC 5: Stalled ----- 93081 in-flight CPI 1.3788 -- Total Cycles 128366 ---- Thread 30 ---- PC 5: Stalled ----- 92621 in-flight CPI 1.3857 -- Total Cycles 128366 ---- Thread 31 ---- PC 5: Stalled ----- 90245 in-flight CPI 1.4221 -- Total Cycles 128366 Total CPI 0.0420 , IPC 23.7985 -- Total Cycles 128366 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7504 (3.821805%) FPSUB: 0 (0.000000%) FPMUL: 31276 (15.928942%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 74307 (37.844734%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 
(0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5857 (2.982984%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69301 (35.295166%) DIV: 7829 (3.987329%) FPUN: 0 (0.000000%) FPRSUB: 273 
(0.139040%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3348296 total) ADD%: 7.455 (249599) SUB%: 0.000 (0) MUL%: 0.006 (212) BITOR%: 1.519 (50868) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.530 (17761) FPSUB%: 0.000 (0) FPMUL%: 4.712 (157785) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (636) FPMAX%: 0.019 (636) LOAD%: 5.111 (171125) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (244) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (608) FPINV%: 0.000 (0) FPCONV%: 0.020 (668) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.057 (35387) FPLE%: 0.454 (15213) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (636) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.817 (94327) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.734 (24575) CMPU%: 0.000 (0) RSUB%: 0.006 (212) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.684 (525152) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39251) ORI%: 1.546 (51766) XORI%: 0.000 (0) MULI%: 3.221 (107848) LW%: 1.137 (38070) LWI%: 13.542 (453432) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9652) SWI%: 4.082 (136676) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.408 (47149) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10381) bled%: 0.000 (0) bleid%: 0.000 
(0) bltd%: 0.000 (0) bltid%: 0.057 (1905) bned%: 0.000 (0) bneid%: 13.813 (462495) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (24006) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.117 (3933) DIV%: 0.013 (424) FPUN%: 1.478 (49475) FPRSUB%: 4.161 (139329) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.960 (99101) FPGE%: 1.023 (34262) SYNC%: 0.000 (0) NOP%: 8.760 (293327) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 413 LOAD 39048 INTCONV 0 ATOMIC_INC 27 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1574 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49139 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 6 ORI 10609 XORI 0 MULI 9601 LW 0 LWI 143304 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 65 DIV 30 FPUN 0 FPRSUB 40 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7987 --Total thread-cycles: 4107712 --total thread-cycles issued: 3054969 (74.371548%) --iCache conflicts: 110678 (2.694395%) --thread*cycles of FU dependence: 253937 (6.181957%) --thread*cycles of data dependence: 196347 (4.779960%) --iCache cycles*banks: 4107712 (81.513212% used) Issue breakdown: --thread*cycles of issue worked: 3054969 (74.371548%) --thread*cycles of issue failed: 759416 
(18.487567%) --thread*cycles of issue NOP/other: 293327 (7.140885%) Number of thread-cycles not ready: 196347 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3348296 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 7 3: 9 4: 8 5: 9 6: 9 7: 8 8: 10 9: 8 10: 7 11: 8 12: 8 13: 7 14: 7 15: 6 16: 6 17: 8 18: 7 19: 8 20: 7 21: 8 22: 7 23: 8 24: 7 25: 6 26: 8 27: 6 28: 8 29: 7 30: 7 31: 8 <=== Core 41 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96204 in-flight CPI 1.4644 -- Total Cycles 140904 ---- Thread 01 ---- PC 5: Stalled ----- 100701 in-flight CPI 1.3989 -- Total Cycles 140904 ---- Thread 02 ---- PC 5: Stalled ----- 100322 in-flight CPI 1.4043 -- Total Cycles 140904 ---- Thread 03 ---- PC 5: Stalled ----- 93532 in-flight CPI 1.5062 -- Total Cycles 140904 ---- Thread 04 ---- PC 5: Stalled ----- 97986 in-flight CPI 1.4377 -- Total Cycles 140904 ---- Thread 05 ---- PC 5: Stalled ----- 101622 in-flight CPI 1.3862 -- Total Cycles 140904 ---- Thread 06 ---- PC 5: Stalled ----- 95100 in-flight CPI 1.4814 -- Total Cycles 140904 ---- Thread 07 ---- PC 5: Stalled ----- 100516 in-flight CPI 1.4015 -- Total Cycles 140904 ---- Thread 08 ---- PC 5: Stalled ----- 100170 in-flight CPI 1.4064 -- Total Cycles 140904 ---- Thread 09 ---- PC 5: Stalled ----- 96070 in-flight CPI 1.4664 -- Total Cycles 140904 ---- Thread 10 ---- PC 5: Stalled ----- 102454 in-flight CPI 1.3750 -- Total Cycles 140904 ---- Thread 11 ---- PC 5: Stalled ----- 98887 in-flight CPI 1.4246 -- Total Cycles 140904 ---- Thread 12 ---- PC 5: Stalled ----- 91967 in-flight CPI 1.5318 -- Total Cycles 140904 ---- Thread 13 ---- PC 5: Stalled ----- 95628 in-flight CPI 1.4732 -- Total Cycles 140904 ---- Thread 14 ---- PC 5: Stalled ----- 100444 in-flight CPI 1.4025 -- Total Cycles 140904 ---- Thread 15 ---- PC 5: Stalled ----- 99208 in-flight CPI 1.4200 -- Total Cycles 140904 ---- Thread 16 ---- PC 5: Stalled ----- 98065 in-flight CPI 1.4366 -- Total Cycles 140904 
---- Thread 17 ---- PC 5: Stalled ----- 91365 in-flight CPI 1.5419 -- Total Cycles 140904 ---- Thread 18 ---- PC 5: Stalled ----- 98120 in-flight CPI 1.4358 -- Total Cycles 140904 ---- Thread 19 ---- PC 5: Stalled ----- 93805 in-flight CPI 1.5018 -- Total Cycles 140904 ---- Thread 20 ---- PC 5: Stalled ----- 95305 in-flight CPI 1.4782 -- Total Cycles 140904 ---- Thread 21 ---- PC 5: Stalled ----- 99871 in-flight CPI 1.4106 -- Total Cycles 140904 ---- Thread 22 ---- PC 5: Stalled ----- 94016 in-flight CPI 1.4985 -- Total Cycles 140904 ---- Thread 23 ---- PC 5: Stalled ----- 93124 in-flight CPI 1.5128 -- Total Cycles 140904 ---- Thread 24 ---- PC 5: Stalled ----- 89204 in-flight CPI 1.5792 -- Total Cycles 140904 ---- Thread 25 ---- PC 5: Stalled ----- 89484 in-flight CPI 1.5743 -- Total Cycles 140904 ---- Thread 26 ---- PC 5: Stalled ----- 93425 in-flight CPI 1.5079 -- Total Cycles 140904 ---- Thread 27 ---- PC 5: Stalled ----- 93723 in-flight CPI 1.5031 -- Total Cycles 140904 ---- Thread 28 ---- PC 5: Stalled ----- 97502 in-flight CPI 1.4450 -- Total Cycles 140904 ---- Thread 29 ---- PC 5: Stalled ----- 92068 in-flight CPI 1.5301 -- Total Cycles 140904 ---- Thread 30 ---- PC 5: Stalled ----- 90351 in-flight CPI 1.5592 -- Total Cycles 140904 ---- Thread 31 ---- PC 5: Stalled ----- 95836 in-flight CPI 1.4700 -- Total Cycles 140904 Total CPI 0.0458 , IPC 21.8351 -- Total Cycles 140904 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7830 (3.612957%) FPSUB: 0 (0.000000%) FPMUL: 31909 (14.723606%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 90230 (41.634367%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) 
ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5800 (2.676264%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72886 (33.631414%) DIV: 7790 (3.594500%) FPUN: 0 (0.000000%) FPRSUB: 275 (0.126892%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 
(0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3371173 total) ADD%: 7.496 (252717) SUB%: 0.000 (0) MUL%: 0.006 (211) BITOR%: 1.526 (51429) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.547 (18436) FPSUB%: 0.000 (0) FPMUL%: 4.757 (160356) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (633) FPMAX%: 0.019 (633) LOAD%: 5.147 (173523) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (604) FPINV%: 0.000 (0) FPCONV%: 0.020 (665) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.059 (35692) FPLE%: 0.455 (15339) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (633) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.804 (94521) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (25202) CMPU%: 0.000 (0) RSUB%: 0.006 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.671 (528302) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39660) ORI%: 1.559 (52549) XORI%: 0.000 (0) MULI%: 3.201 (107904) LW%: 1.132 (38146) LWI%: 13.471 (454117) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9708) SWI%: 4.073 (137309) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (47207) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10458) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1811) bned%: 0.000 (0) bneid%: 13.781 (464591) 
brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (24095) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4145) DIV%: 0.013 (422) FPUN%: 1.474 (49704) FPRSUB%: 4.207 (141823) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (68) FPGT%: 2.942 (99180) FPGE%: 1.019 (34365) SYNC%: 0.000 (0) NOP%: 8.735 (294465) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 11 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 412 LOAD 40331 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 10 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1621 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49254 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 19 ORI 11084 XORI 0 MULI 9425 LW 0 LWI 143798 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 18 FPUN 0 FPRSUB 57 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8353 --Total thread-cycles: 4508928 --total thread-cycles issued: 3076708 (68.235909%) --iCache conflicts: 109430 (2.426963%) --thread*cycles of FU dependence: 256150 (5.680951%) --thread*cycles of data dependence: 216720 (4.806464%) --iCache cycles*banks: 4508928 (74.767328% used) Issue breakdown: --thread*cycles of issue worked: 3076708 (68.235909%) --thread*cycles of issue failed: 1137755 (25.233381%) --thread*cycles of issue NOP/other: 294465 (6.530710%) Number of 
thread-cycles not ready: 216720 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3371173 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 6 3: 7 4: 8 5: 9 6: 7 7: 9 8: 8 9: 8 10: 8 11: 8 12: 7 13: 8 14: 8 15: 9 16: 8 17: 8 18: 8 19: 7 20: 7 21: 8 22: 7 23: 7 24: 8 25: 7 26: 7 27: 7 28: 5 29: 8 30: 8 31: 7 <=== Core 42 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96436 in-flight CPI 1.4410 -- Total Cycles 138983 ---- Thread 01 ---- PC 5: Stalled ----- 88531 in-flight CPI 1.5696 -- Total Cycles 138983 ---- Thread 02 ---- PC 5: Stalled ----- 93271 in-flight CPI 1.4898 -- Total Cycles 138983 ---- Thread 03 ---- PC 5: Stalled ----- 95970 in-flight CPI 1.4479 -- Total Cycles 138983 ---- Thread 04 ---- PC 5: Stalled ----- 97334 in-flight CPI 1.4277 -- Total Cycles 138983 ---- Thread 05 ---- PC 5: Stalled ----- 98546 in-flight CPI 1.4101 -- Total Cycles 138983 ---- Thread 06 ---- PC 5: Stalled ----- 99374 in-flight CPI 1.3983 -- Total Cycles 138983 ---- Thread 07 ---- PC 5: Stalled ----- 99696 in-flight CPI 1.3938 -- Total Cycles 138983 ---- Thread 08 ---- PC 5: Stalled ----- 102721 in-flight CPI 1.3528 -- Total Cycles 138983 ---- Thread 09 ---- PC 5: Stalled ----- 96821 in-flight CPI 1.4351 -- Total Cycles 138983 ---- Thread 10 ---- PC 5: Stalled ----- 95716 in-flight CPI 1.4518 -- Total Cycles 138983 ---- Thread 11 ---- PC 5: Stalled ----- 96871 in-flight CPI 1.4345 -- Total Cycles 138983 ---- Thread 12 ---- PC 5: Stalled ----- 101408 in-flight CPI 1.3703 -- Total Cycles 138983 ---- Thread 13 ---- PC 5: Stalled ----- 98190 in-flight CPI 1.4152 -- Total Cycles 138983 ---- Thread 14 ---- PC 5: Stalled ----- 93385 in-flight CPI 1.4881 -- Total Cycles 138983 ---- Thread 15 ---- PC 5: Stalled ----- 95790 in-flight CPI 1.4506 -- Total Cycles 138983 ---- Thread 16 ---- PC 5: Stalled ----- 97846 in-flight CPI 1.4201 -- Total Cycles 138983 ---- Thread 17 ---- PC 5: Stalled ----- 98694 in-flight CPI 1.4079 -- Total Cycles 
138983 ---- Thread 18 ---- PC 5: Stalled ----- 90159 in-flight CPI 1.5414 -- Total Cycles 138983 ---- Thread 19 ---- PC 5: Stalled ----- 92938 in-flight CPI 1.4952 -- Total Cycles 138983 ---- Thread 20 ---- PC 5: Stalled ----- 92233 in-flight CPI 1.5066 -- Total Cycles 138983 ---- Thread 21 ---- PC 5: Stalled ----- 96435 in-flight CPI 1.4409 -- Total Cycles 138983 ---- Thread 22 ---- PC 5: Stalled ----- 101092 in-flight CPI 1.3746 -- Total Cycles 138983 ---- Thread 23 ---- PC 5: Stalled ----- 84392 in-flight CPI 1.6466 -- Total Cycles 138983 ---- Thread 24 ---- PC 5: Stalled ----- 87188 in-flight CPI 1.5939 -- Total Cycles 138983 ---- Thread 25 ---- PC 5: Stalled ----- 91115 in-flight CPI 1.5251 -- Total Cycles 138983 ---- Thread 26 ---- PC 5: Stalled ----- 83526 in-flight CPI 1.6637 -- Total Cycles 138983 ---- Thread 27 ---- PC 5: Stalled ----- 91177 in-flight CPI 1.5240 -- Total Cycles 138983 ---- Thread 28 ---- PC 5: Stalled ----- 90453 in-flight CPI 1.5362 -- Total Cycles 138983 ---- Thread 29 ---- PC 5: Stalled ----- 93828 in-flight CPI 1.4810 -- Total Cycles 138983 ---- Thread 30 ---- PC 5: Stalled ----- 85102 in-flight CPI 1.6329 -- Total Cycles 138983 ---- Thread 31 ---- PC 5: Stalled ----- 89309 in-flight CPI 1.5559 -- Total Cycles 138983 Total CPI 0.0461 , IPC 21.7010 -- Total Cycles 138983 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8849 (3.738772%) FPSUB: 0 (0.000000%) FPMUL: 33484 (14.147252%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 101257 (42.781876%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5253 (2.219434%) FPINV: 0 
(0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 80422 (33.978925%) DIV: 7159 (3.024734%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.109007%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) 
PROF: 0 (0.000000%) Dynamic Instruction Mix: (3304966 total) ADD%: 7.338 (242514) SUB%: 0.000 (0) MUL%: 0.006 (194) BITOR%: 1.520 (50251) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.619 (20460) FPSUB%: 0.000 (0) FPMUL%: 4.972 (164311) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (582) FPMAX%: 0.018 (582) LOAD%: 5.247 (173416) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (226) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (556) FPINV%: 0.000 (0) FPCONV%: 0.019 (614) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.086 (35902) FPLE%: 0.455 (15039) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (582) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.757 (91119) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.770 (25435) CMPU%: 0.000 (0) RSUB%: 0.006 (194) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.639 (516862) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.167 (38557) ORI%: 1.608 (53153) XORI%: 0.000 (0) MULI%: 3.148 (104046) LW%: 1.112 (36758) LWI%: 13.321 (440248) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.283 (9354) SWI%: 4.010 (132516) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.377 (45499) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.307 (10160) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.065 (2160) bned%: 0.000 (0) bneid%: 13.726 (453633) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.709 (23422) 
braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.139 (4601) DIV%: 0.012 (388) FPUN%: 1.460 (48258) FPRSUB%: 4.384 (144885) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.913 (96268) FPGE%: 1.005 (33219) SYNC%: 0.000 (0) NOP%: 8.739 (288837) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 370 LOAD 41690 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 10 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1397 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 13 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 47618 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 12648 XORI 0 MULI 8447 LW 0 LWI 139850 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 88 DIV 23 FPUN 0 FPRSUB 67 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7013 --Total thread-cycles: 4447456 --total thread-cycles issued: 3016129 (67.816950%) --iCache conflicts: 109818 (2.469232%) --thread*cycles of FU dependence: 252281 (5.672479%) --thread*cycles of data dependence: 236682 (5.321739%) --iCache cycles*banks: 4447456 (74.312101% used) Issue breakdown: --thread*cycles of issue worked: 3016129 (67.816950%) --thread*cycles of issue failed: 1142490 (25.688618%) --thread*cycles of issue NOP/other: 288837 (6.494432%) Number of thread-cycles not ready: 236682 Number of thread-cycles not fetched: 0 SIMD stalls when 
issuing: 0 SIMD issues: 3304966 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 6 2: 7 3: 8 4: 6 5: 8 6: 8 7: 8 8: 8 9: 9 10: 7 11: 7 12: 8 13: 7 14: 6 15: 8 16: 9 17: 8 18: 5 19: 7 20: 8 21: 8 22: 6 23: 6 24: 5 25: 7 26: 5 27: 7 28: 7 29: 7 30: 6 31: 7 <=== Core 43 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97144 in-flight CPI 1.3194 -- Total Cycles 128196 ---- Thread 01 ---- PC 5: Stalled ----- 100052 in-flight CPI 1.2810 -- Total Cycles 128196 ---- Thread 02 ---- PC 5: Stalled ----- 103220 in-flight CPI 1.2417 -- Total Cycles 128196 ---- Thread 03 ---- PC 5: Stalled ----- 95759 in-flight CPI 1.3385 -- Total Cycles 128196 ---- Thread 04 ---- PC 5: Stalled ----- 100785 in-flight CPI 1.2717 -- Total Cycles 128196 ---- Thread 05 ---- PC 5: Stalled ----- 95282 in-flight CPI 1.3452 -- Total Cycles 128196 ---- Thread 06 ---- PC 5: Stalled ----- 93615 in-flight CPI 1.3691 -- Total Cycles 128196 ---- Thread 07 ---- PC 5: Stalled ----- 93496 in-flight CPI 1.3709 -- Total Cycles 128196 ---- Thread 08 ---- PC 5: Stalled ----- 95648 in-flight CPI 1.3401 -- Total Cycles 128196 ---- Thread 09 ---- PC 5: Stalled ----- 103745 in-flight CPI 1.2354 -- Total Cycles 128196 ---- Thread 10 ---- PC 5: Stalled ----- 96332 in-flight CPI 1.3305 -- Total Cycles 128196 ---- Thread 11 ---- PC 5: Stalled ----- 99964 in-flight CPI 1.2822 -- Total Cycles 128196 ---- Thread 12 ---- PC 5: Stalled ----- 92118 in-flight CPI 1.3914 -- Total Cycles 128196 ---- Thread 13 ---- PC 5: Stalled ----- 97495 in-flight CPI 1.3147 -- Total Cycles 128196 ---- Thread 14 ---- PC 5: Stalled ----- 92474 in-flight CPI 1.3861 -- Total Cycles 128196 ---- Thread 15 ---- PC 5: Stalled ----- 99468 in-flight CPI 1.2886 -- Total Cycles 128196 ---- Thread 16 ---- PC 5: Stalled ----- 96718 in-flight CPI 1.3252 -- Total Cycles 128196 ---- Thread 17 ---- PC 5: Stalled ----- 96488 in-flight CPI 1.3284 -- Total Cycles 128196 ---- Thread 18 ---- PC 5: Stalled ----- 98338 in-flight CPI 1.3033 -- Total 
Cycles 128196 ---- Thread 19 ---- PC 5: Stalled ----- 95977 in-flight CPI 1.3355 -- Total Cycles 128196 ---- Thread 20 ---- PC 5: Stalled ----- 96021 in-flight CPI 1.3349 -- Total Cycles 128196 ---- Thread 21 ---- PC 5: Stalled ----- 91002 in-flight CPI 1.4084 -- Total Cycles 128196 ---- Thread 22 ---- PC 5: Stalled ----- 89012 in-flight CPI 1.4400 -- Total Cycles 128196 ---- Thread 23 ---- PC 5: Stalled ----- 97901 in-flight CPI 1.3092 -- Total Cycles 128196 ---- Thread 24 ---- PC 5: Stalled ----- 91891 in-flight CPI 1.3948 -- Total Cycles 128196 ---- Thread 25 ---- PC 5: Stalled ----- 96779 in-flight CPI 1.3244 -- Total Cycles 128196 ---- Thread 26 ---- PC 5: Stalled ----- 92221 in-flight CPI 1.3898 -- Total Cycles 128196 ---- Thread 27 ---- PC 5: Stalled ----- 95853 in-flight CPI 1.3372 -- Total Cycles 128196 ---- Thread 28 ---- PC 5: Stalled ----- 93088 in-flight CPI 1.3769 -- Total Cycles 128196 ---- Thread 29 ---- PC 5: Stalled ----- 88675 in-flight CPI 1.4454 -- Total Cycles 128196 ---- Thread 30 ---- PC 5: Stalled ----- 90916 in-flight CPI 1.4098 -- Total Cycles 128196 ---- Thread 31 ---- PC 5: Stalled ----- 90165 in-flight CPI 1.4215 -- Total Cycles 128196 Total CPI 0.0419 , IPC 23.8557 -- Total Cycles 128196 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8104 (4.332670%) FPSUB: 0 (0.000000%) FPMUL: 32552 (17.403392%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 58512 (31.282479%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 6009 (3.212613%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 
(0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73921 (39.520648%) DIV: 7677 (4.104382%) FPUN: 0 (0.000000%) FPRSUB: 269 (0.143816%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3351370 total) ADD%: 7.417 (248570) 
SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.541 (51630) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.572 (19158) FPSUB%: 0.000 (0) FPMUL%: 4.825 (161717) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.131 (171969) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (612) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.072 (35927) FPLE%: 0.454 (15203) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.786 (93373) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (25121) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.640 (524165) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (39195) ORI%: 1.587 (53194) XORI%: 0.000 (0) MULI%: 3.187 (106820) LW%: 1.124 (37682) LWI%: 13.438 (450372) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9509) SWI%: 4.045 (135572) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.394 (46730) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.307 (10280) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1935) bned%: 0.000 (0) bneid%: 13.798 (462413) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24161) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 
(0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4209) DIV%: 0.012 (416) FPUN%: 1.489 (49909) FPRSUB%: 4.241 (142148) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (79) FPGT%: 2.930 (98211) FPGE%: 1.036 (34706) SYNC%: 0.000 (0) NOP%: 8.746 (293104) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 5 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 408 LOAD 39266 INTCONV 0 ATOMIC_INC 23 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1422 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48797 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 18 ORI 11554 XORI 0 MULI 9309 LW 0 LWI 142526 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 20 FPUN 0 FPRSUB 56 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8560 --Total thread-cycles: 4102272 --total thread-cycles issued: 3058266 (74.550542%) --iCache conflicts: 112656 (2.746186%) --thread*cycles of FU dependence: 253555 (6.180843%) --thread*cycles of data dependence: 187044 (4.559522%) --iCache cycles*banks: 4102272 (81.696241% used) Issue breakdown: --thread*cycles of issue worked: 3058266 (74.550542%) --thread*cycles of issue failed: 750902 (18.304540%) --thread*cycles of issue NOP/other: 293104 (7.144919%) Number of thread-cycles not ready: 187044 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3351370 SIMD fetches beyond the first: 0 ATOMIC_INC called by 
threads: 0: 8 1: 9 2: 8 3: 8 4: 8 5: 7 6: 8 7: 7 8: 6 9: 9 10: 8 11: 7 12: 7 13: 7 14: 6 15: 8 16: 8 17: 8 18: 9 19: 7 20: 7 21: 8 22: 6 23: 8 24: 7 25: 7 26: 8 27: 7 28: 7 29: 7 30: 7 31: 8 <=== Core 44 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97887 in-flight CPI 1.3817 -- Total Cycles 135274 ---- Thread 01 ---- PC 5: Stalled ----- 100880 in-flight CPI 1.3408 -- Total Cycles 135274 ---- Thread 02 ---- PC 5: Stalled ----- 99809 in-flight CPI 1.3551 -- Total Cycles 135274 ---- Thread 03 ---- PC 5: Stalled ----- 92595 in-flight CPI 1.4607 -- Total Cycles 135274 ---- Thread 04 ---- PC 5: Stalled ----- 97829 in-flight CPI 1.3826 -- Total Cycles 135274 ---- Thread 05 ---- PC 5: Stalled ----- 100039 in-flight CPI 1.3520 -- Total Cycles 135274 ---- Thread 06 ---- PC 5: Stalled ----- 99537 in-flight CPI 1.3588 -- Total Cycles 135274 ---- Thread 07 ---- PC 5: Stalled ----- 101435 in-flight CPI 1.3333 -- Total Cycles 135274 ---- Thread 08 ---- PC 5: Stalled ----- 95713 in-flight CPI 1.4131 -- Total Cycles 135274 ---- Thread 09 ---- PC 5: Stalled ----- 103326 in-flight CPI 1.3089 -- Total Cycles 135274 ---- Thread 10 ---- PC 5: Stalled ----- 103539 in-flight CPI 1.3062 -- Total Cycles 135274 ---- Thread 11 ---- PC 5: Stalled ----- 93869 in-flight CPI 1.4408 -- Total Cycles 135274 ---- Thread 12 ---- PC 5: Stalled ----- 94717 in-flight CPI 1.4279 -- Total Cycles 135274 ---- Thread 13 ---- PC 5: Stalled ----- 95981 in-flight CPI 1.4091 -- Total Cycles 135274 ---- Thread 14 ---- PC 5: Stalled ----- 98170 in-flight CPI 1.3777 -- Total Cycles 135274 ---- Thread 15 ---- PC 5: Stalled ----- 92821 in-flight CPI 1.4571 -- Total Cycles 135274 ---- Thread 16 ---- PC 5: Stalled ----- 96120 in-flight CPI 1.4071 -- Total Cycles 135274 ---- Thread 17 ---- PC 5: Stalled ----- 97197 in-flight CPI 1.3915 -- Total Cycles 135274 ---- Thread 18 ---- PC 5: Stalled ----- 95273 in-flight CPI 1.4196 -- Total Cycles 135274 ---- Thread 19 ---- PC 5: Stalled ----- 95295 in-flight CPI 1.4192 -- Total 
Cycles 135274 ---- Thread 20 ---- PC 5: Stalled ----- 96132 in-flight CPI 1.4069 -- Total Cycles 135274 ---- Thread 21 ---- PC 5: Stalled ----- 95587 in-flight CPI 1.4149 -- Total Cycles 135274 ---- Thread 22 ---- PC 5: Stalled ----- 94221 in-flight CPI 1.4354 -- Total Cycles 135274 ---- Thread 23 ---- PC 5: Stalled ----- 89911 in-flight CPI 1.5043 -- Total Cycles 135274 ---- Thread 24 ---- PC 5: Stalled ----- 90583 in-flight CPI 1.4932 -- Total Cycles 135274 ---- Thread 25 ---- PC 5: Stalled ----- 92329 in-flight CPI 1.4649 -- Total Cycles 135274 ---- Thread 26 ---- PC 5: Stalled ----- 90828 in-flight CPI 1.4890 -- Total Cycles 135274 ---- Thread 27 ---- PC 5: Stalled ----- 93735 in-flight CPI 1.4429 -- Total Cycles 135274 ---- Thread 28 ---- PC 5: Stalled ----- 92074 in-flight CPI 1.4689 -- Total Cycles 135274 ---- Thread 29 ---- PC 5: Stalled ----- 90815 in-flight CPI 1.4892 -- Total Cycles 135274 ---- Thread 30 ---- PC 5: Stalled ----- 87088 in-flight CPI 1.5531 -- Total Cycles 135274 ---- Thread 31 ---- PC 5: Stalled ----- 94171 in-flight CPI 1.4362 -- Total Cycles 135274 Total CPI 0.0442 , IPC 22.6212 -- Total Cycles 135274 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7950 (3.767683%) FPSUB: 0 (0.000000%) FPMUL: 32060 (15.193953%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 85052 (40.308050%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5507 (2.609891%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 
(0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72598 (34.405820%) DIV: 7571 (3.588067%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.126537%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3353402 total) ADD%: 7.430 (249162) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.522 (51047) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) 
BITSRIGHT%: 0.000 (0) FPADD%: 0.556 (18659) FPSUB%: 0.000 (0) FPMUL%: 4.787 (160534) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.155 (172871) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (582) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35636) FPLE%: 0.455 (15253) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.802 (93965) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (25079) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.680 (525804) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39375) ORI%: 1.567 (52538) XORI%: 0.000 (0) MULI%: 3.197 (107198) LW%: 1.131 (37914) LWI%: 13.461 (451403) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9648) SWI%: 4.067 (136380) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46921) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10419) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1911) bned%: 0.000 (0) bneid%: 13.783 (462215) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23997) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4133) DIV%: 0.012 (410) FPUN%: 1.473 (49381) FPRSUB%: 4.222 
(141588) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (63) FPGT%: 2.942 (98672) FPGE%: 1.018 (34128) SYNC%: 0.000 (0) NOP%: 8.746 (293281) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 31 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 40362 INTCONV 0 ATOMIC_INC 33 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1768 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48865 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 11325 XORI 0 MULI 9307 LW 0 LWI 142692 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 59 DIV 17 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.6215 --Total thread-cycles: 4328768 --total thread-cycles issued: 3060121 (70.692654%) --iCache conflicts: 110507 (2.552851%) --thread*cycles of FU dependence: 254976 (5.890267%) --thread*cycles of data dependence: 211005 (4.874482%) --iCache cycles*banks: 4328768 (77.468555% used) Issue breakdown: --thread*cycles of issue worked: 3060121 (70.692654%) --thread*cycles of issue failed: 975366 (22.532185%) --thread*cycles of issue NOP/other: 293281 (6.775161%) Number of thread-cycles not ready: 211005 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3353402 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 6 2: 7 3: 6 4: 6 5: 8 6: 8 7: 9 8: 7 9: 9 10: 9 11: 8 12: 7 13: 7 14: 7 15: 7 16: 7 17: 
8 18: 8 19: 8 20: 8 21: 8 22: 8 23: 7 24: 6 25: 7 26: 8 27: 6 28: 7 29: 8 30: 6 31: 8 <=== Core 45 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96322 in-flight CPI 1.3357 -- Total Cycles 128679 ---- Thread 01 ---- PC 5: Stalled ----- 102134 in-flight CPI 1.2597 -- Total Cycles 128679 ---- Thread 02 ---- PC 5: Stalled ----- 95780 in-flight CPI 1.3433 -- Total Cycles 128679 ---- Thread 03 ---- PC 5: Stalled ----- 102269 in-flight CPI 1.2580 -- Total Cycles 128679 ---- Thread 04 ---- PC 5: Stalled ----- 92749 in-flight CPI 1.3872 -- Total Cycles 128679 ---- Thread 05 ---- PC 5: Stalled ----- 96533 in-flight CPI 1.3327 -- Total Cycles 128679 ---- Thread 06 ---- PC 5: Stalled ----- 96463 in-flight CPI 1.3337 -- Total Cycles 128679 ---- Thread 07 ---- PC 5: Stalled ----- 96853 in-flight CPI 1.3283 -- Total Cycles 128679 ---- Thread 08 ---- PC 5: Stalled ----- 101324 in-flight CPI 1.2697 -- Total Cycles 128679 ---- Thread 09 ---- PC 5: Stalled ----- 100260 in-flight CPI 1.2832 -- Total Cycles 128679 ---- Thread 10 ---- PC 5: Stalled ----- 94700 in-flight CPI 1.3586 -- Total Cycles 128679 ---- Thread 11 ---- PC 5: Stalled ----- 101435 in-flight CPI 1.2683 -- Total Cycles 128679 ---- Thread 12 ---- PC 5: Stalled ----- 93241 in-flight CPI 1.3798 -- Total Cycles 128679 ---- Thread 13 ---- PC 5: Stalled ----- 92669 in-flight CPI 1.3884 -- Total Cycles 128679 ---- Thread 14 ---- PC 5: Stalled ----- 97802 in-flight CPI 1.3155 -- Total Cycles 128679 ---- Thread 15 ---- PC 5: Stalled ----- 100155 in-flight CPI 1.2846 -- Total Cycles 128679 ---- Thread 16 ---- PC 5: Stalled ----- 92462 in-flight CPI 1.3915 -- Total Cycles 128679 ---- Thread 17 ---- PC 5: Stalled ----- 95718 in-flight CPI 1.3441 -- Total Cycles 128679 ---- Thread 18 ---- PC 5: Stalled ----- 95367 in-flight CPI 1.3490 -- Total Cycles 128679 ---- Thread 19 ---- PC 5: Stalled ----- 91652 in-flight CPI 1.4038 -- Total Cycles 128679 ---- Thread 20 ---- PC 5: Stalled ----- 97637 in-flight CPI 1.3177 -- Total Cycles 128679 
---- Thread 21 ---- PC 5: Stalled ----- 97078 in-flight CPI 1.3253 -- Total Cycles 128679 ---- Thread 22 ---- PC 5: Stalled ----- 90751 in-flight CPI 1.4176 -- Total Cycles 128679 ---- Thread 23 ---- PC 5: Stalled ----- 90538 in-flight CPI 1.4210 -- Total Cycles 128679 ---- Thread 24 ---- PC 5: Stalled ----- 98255 in-flight CPI 1.3094 -- Total Cycles 128679 ---- Thread 25 ---- PC 5: Stalled ----- 93385 in-flight CPI 1.3777 -- Total Cycles 128679 ---- Thread 26 ---- PC 5: Stalled ----- 91030 in-flight CPI 1.4133 -- Total Cycles 128679 ---- Thread 27 ---- PC 5: Stalled ----- 92324 in-flight CPI 1.3935 -- Total Cycles 128679 ---- Thread 28 ---- PC 5: Stalled ----- 94264 in-flight CPI 1.3648 -- Total Cycles 128679 ---- Thread 29 ---- PC 5: Stalled ----- 88830 in-flight CPI 1.4483 -- Total Cycles 128679 ---- Thread 30 ---- PC 5: Stalled ----- 89634 in-flight CPI 1.4353 -- Total Cycles 128679 ---- Thread 31 ---- PC 5: Stalled ----- 91692 in-flight CPI 1.4031 -- Total Cycles 128679 Total CPI 0.0422 , IPC 23.7168 -- Total Cycles 128679 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7249 (3.724273%) FPSUB: 0 (0.000000%) FPMUL: 30627 (15.735042%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76904 (39.510486%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5571 (2.862178%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) 
SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 66577 (34.204848%) DIV: 7451 (3.828054%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.135120%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3343794 total) ADD%: 7.496 (250662) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.540 (51510) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.512 (17130) FPSUB%: 0.000 (0) FPMUL%: 4.660 (155835) FPCMPLT%: 
0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.099 (170516) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (582) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.046 (34960) FPLE%: 0.458 (15308) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.833 (94742) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.737 (24651) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.713 (525411) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.183 (39550) ORI%: 1.541 (51516) XORI%: 0.000 (0) MULI%: 3.229 (107966) LW%: 1.143 (38220) LWI%: 13.531 (452459) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9693) SWI%: 4.089 (136729) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.416 (47347) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10401) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.049 (1655) bned%: 0.000 (0) bneid%: 13.824 (462246) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.726 (24280) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.113 (3779) DIV%: 0.012 (404) FPUN%: 1.495 (49978) FPRSUB%: 4.126 (137957) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.946 (98494) FPGE%: 1.037 (34670) SYNC%: 
0.000 (0) NOP%: 8.729 (291882) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 389 LOAD 38271 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1470 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49087 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 10281 XORI 0 MULI 9507 LW 0 LWI 142842 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 53 DIV 32 FPUN 0 FPRSUB 50 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7171 --Total thread-cycles: 4117728 --total thread-cycles issued: 3051912 (74.116406%) --iCache conflicts: 112211 (2.725071%) --thread*cycles of FU dependence: 252062 (6.121385%) --thread*cycles of data dependence: 194642 (4.726927%) --iCache cycles*banks: 4117728 (81.205607% used) Issue breakdown: --thread*cycles of issue worked: 3051912 (74.116406%) --thread*cycles of issue failed: 773934 (18.795171%) --thread*cycles of issue NOP/other: 291882 (7.088424%) Number of thread-cycles not ready: 194642 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3343794 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 6 3: 8 4: 6 5: 8 6: 7 7: 8 8: 8 9: 7 10: 6 11: 9 12: 7 13: 6 14: 7 15: 7 16: 7 17: 8 18: 8 19: 6 20: 8 21: 8 22: 8 23: 7 24: 8 25: 8 26: 7 27: 7 28: 8 29: 7 30: 7 31: 7 <=== Core 
46 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99969 in-flight CPI 1.2715 -- Total Cycles 127135 ---- Thread 01 ---- PC 5: Stalled ----- 102275 in-flight CPI 1.2429 -- Total Cycles 127135 ---- Thread 02 ---- PC 5: Stalled ----- 102436 in-flight CPI 1.2408 -- Total Cycles 127135 ---- Thread 03 ---- PC 5: Stalled ----- 98773 in-flight CPI 1.2869 -- Total Cycles 127135 ---- Thread 04 ---- PC 5: Stalled ----- 100181 in-flight CPI 1.2688 -- Total Cycles 127135 ---- Thread 05 ---- PC 5: Stalled ----- 94439 in-flight CPI 1.3459 -- Total Cycles 127135 ---- Thread 06 ---- PC 5: Stalled ----- 98088 in-flight CPI 1.2958 -- Total Cycles 127135 ---- Thread 07 ---- PC 5: Stalled ----- 94935 in-flight CPI 1.3389 -- Total Cycles 127135 ---- Thread 08 ---- PC 5: Stalled ----- 99364 in-flight CPI 1.2792 -- Total Cycles 127135 ---- Thread 09 ---- PC 5: Stalled ----- 92929 in-flight CPI 1.3678 -- Total Cycles 127135 ---- Thread 10 ---- PC 5: Stalled ----- 98517 in-flight CPI 1.2902 -- Total Cycles 127135 ---- Thread 11 ---- PC 5: Stalled ----- 95680 in-flight CPI 1.3285 -- Total Cycles 127135 ---- Thread 12 ---- PC 5: Stalled ----- 97804 in-flight CPI 1.2996 -- Total Cycles 127135 ---- Thread 13 ---- PC 5: Stalled ----- 96095 in-flight CPI 1.3228 -- Total Cycles 127135 ---- Thread 14 ---- PC 5: Stalled ----- 95751 in-flight CPI 1.3275 -- Total Cycles 127135 ---- Thread 15 ---- PC 5: Stalled ----- 93623 in-flight CPI 1.3577 -- Total Cycles 127135 ---- Thread 16 ---- PC 5: Stalled ----- 96741 in-flight CPI 1.3140 -- Total Cycles 127135 ---- Thread 17 ---- PC 5: Stalled ----- 99788 in-flight CPI 1.2738 -- Total Cycles 127135 ---- Thread 18 ---- PC 5: Stalled ----- 95566 in-flight CPI 1.3301 -- Total Cycles 127135 ---- Thread 19 ---- PC 5: Stalled ----- 95023 in-flight CPI 1.3377 -- Total Cycles 127135 ---- Thread 20 ---- PC 5: Stalled ----- 94054 in-flight CPI 1.3515 -- Total Cycles 127135 ---- Thread 21 ---- PC 5: Stalled ----- 97917 in-flight CPI 1.2981 -- Total Cycles 127135 ---- 
Thread 22 ---- PC 5: Stalled ----- 96600 in-flight CPI 1.3159 -- Total Cycles 127135 ---- Thread 23 ---- PC 5: Stalled ----- 95244 in-flight CPI 1.3346 -- Total Cycles 127135 ---- Thread 24 ---- PC 5: Stalled ----- 94119 in-flight CPI 1.3505 -- Total Cycles 127135 ---- Thread 25 ---- PC 5: Stalled ----- 93299 in-flight CPI 1.3625 -- Total Cycles 127135 ---- Thread 26 ---- PC 5: Stalled ----- 89940 in-flight CPI 1.4133 -- Total Cycles 127135 ---- Thread 27 ---- PC 5: Stalled ----- 90001 in-flight CPI 1.4123 -- Total Cycles 127135 ---- Thread 28 ---- PC 5: Stalled ----- 93455 in-flight CPI 1.3601 -- Total Cycles 127135 ---- Thread 29 ---- PC 5: Stalled ----- 91448 in-flight CPI 1.3900 -- Total Cycles 127135 ---- Thread 30 ---- PC 5: Stalled ----- 85847 in-flight CPI 1.4808 -- Total Cycles 127135 ---- Thread 31 ---- PC 5: Stalled ----- 84097 in-flight CPI 1.5115 -- Total Cycles 127135 Total CPI 0.0416 , IPC 24.0262 -- Total Cycles 127135 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7504 (3.783879%) FPSUB: 0 (0.000000%) FPMUL: 31163 (15.713890%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76651 (38.651136%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5618 (2.832867%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 
0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69362 (34.975670%) DIV: 7746 (3.905907%) FPUN: 0 (0.000000%) FPRSUB: 271 (0.136651%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3347642 total) ADD%: 7.516 (251623) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.530 (51235) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.528 (17682) FPSUB%: 0.000 (0) FPMUL%: 4.706 (157530) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.117 (171295) INTCONV%: 0.000 (0) ATOMIC_INC%: 
0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (593) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.057 (35385) FPLE%: 0.459 (15378) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.812 (94140) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.738 (24708) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.687 (525149) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39342) ORI%: 1.541 (51595) XORI%: 0.000 (0) MULI%: 3.212 (107534) LW%: 1.135 (37992) LWI%: 13.495 (451756) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9708) SWI%: 4.071 (136298) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46969) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10441) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.052 (1750) bned%: 0.000 (0) bneid%: 13.822 (462696) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23950) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.118 (3938) DIV%: 0.013 (420) FPUN%: 1.484 (49675) FPRSUB%: 4.160 (139252) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.955 (98918) FPGE%: 1.025 (34297) SYNC%: 0.000 (0) NOP%: 8.753 (293014) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of 
thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 8 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 38748 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1489 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49091 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 26 ORI 10588 XORI 0 MULI 9413 LW 0 LWI 142871 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 91 DIV 29 FPUN 0 FPRSUB 57 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.0265 --Total thread-cycles: 4068320 --total thread-cycles issued: 3054628 (75.083278%) --iCache conflicts: 110583 (2.718149%) --thread*cycles of FU dependence: 252879 (6.215809%) --thread*cycles of data dependence: 198315 (4.874617%) --iCache cycles*banks: 4068320 (82.286398% used) Issue breakdown: --thread*cycles of issue worked: 3054628 (75.083278%) --thread*cycles of issue failed: 720678 (17.714388%) --thread*cycles of issue NOP/other: 293014 (7.202334%) Number of thread-cycles not ready: 198315 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3347642 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 9 3: 8 4: 7 5: 8 6: 9 7: 8 8: 8 9: 8 10: 9 11: 7 12: 8 13: 7 14: 8 15: 8 16: 7 17: 8 18: 8 19: 7 20: 7 21: 8 22: 7 23: 8 24: 8 25: 6 26: 7 27: 7 28: 8 29: 8 30: 5 31: 6 <=== Core 47 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99094 in-flight CPI 1.3117 -- Total Cycles 
130012 ---- Thread 01 ---- PC 5: Stalled ----- 95900 in-flight CPI 1.3555 -- Total Cycles 130012 ---- Thread 02 ---- PC 5: Stalled ----- 87955 in-flight CPI 1.4779 -- Total Cycles 130012 ---- Thread 03 ---- PC 5: Stalled ----- 99498 in-flight CPI 1.3064 -- Total Cycles 130012 ---- Thread 04 ---- PC 5: Stalled ----- 102015 in-flight CPI 1.2742 -- Total Cycles 130012 ---- Thread 05 ---- PC 5: Stalled ----- 99495 in-flight CPI 1.3065 -- Total Cycles 130012 ---- Thread 06 ---- PC 5: Stalled ----- 95277 in-flight CPI 1.3643 -- Total Cycles 130012 ---- Thread 07 ---- PC 5: Stalled ----- 101569 in-flight CPI 1.2798 -- Total Cycles 130012 ---- Thread 08 ---- PC 5: Stalled ----- 102783 in-flight CPI 1.2646 -- Total Cycles 130012 ---- Thread 09 ---- PC 5: Stalled ----- 96716 in-flight CPI 1.3440 -- Total Cycles 130012 ---- Thread 10 ---- PC 5: Stalled ----- 99421 in-flight CPI 1.3075 -- Total Cycles 130012 ---- Thread 11 ---- PC 5: Stalled ----- 105633 in-flight CPI 1.2305 -- Total Cycles 130012 ---- Thread 12 ---- PC 5: Stalled ----- 94314 in-flight CPI 1.3783 -- Total Cycles 130012 ---- Thread 13 ---- PC 5: Stalled ----- 98790 in-flight CPI 1.3158 -- Total Cycles 130012 ---- Thread 14 ---- PC 5: Stalled ----- 100929 in-flight CPI 1.2879 -- Total Cycles 130012 ---- Thread 15 ---- PC 5: Stalled ----- 95356 in-flight CPI 1.3632 -- Total Cycles 130012 ---- Thread 16 ---- PC 5: Stalled ----- 96464 in-flight CPI 1.3475 -- Total Cycles 130012 ---- Thread 17 ---- PC 5: Stalled ----- 90309 in-flight CPI 1.4394 -- Total Cycles 130012 ---- Thread 18 ---- PC 5: Stalled ----- 90912 in-flight CPI 1.4298 -- Total Cycles 130012 ---- Thread 19 ---- PC 5: Stalled ----- 98475 in-flight CPI 1.3200 -- Total Cycles 130012 ---- Thread 20 ---- PC 5: Stalled ----- 92242 in-flight CPI 1.4092 -- Total Cycles 130012 ---- Thread 21 ---- PC 5: Stalled ----- 100462 in-flight CPI 1.2939 -- Total Cycles 130012 ---- Thread 22 ---- PC 5: Stalled ----- 89841 in-flight CPI 1.4469 -- Total Cycles 130012 ---- 
Thread 23 ---- PC 5: Stalled ----- 92401 in-flight CPI 1.4068 -- Total Cycles 130012 ---- Thread 24 ---- PC 5: Stalled ----- 95530 in-flight CPI 1.3607 -- Total Cycles 130012 ---- Thread 25 ---- PC 5: Stalled ----- 92425 in-flight CPI 1.4065 -- Total Cycles 130012 ---- Thread 26 ---- PC 5: Stalled ----- 89160 in-flight CPI 1.4579 -- Total Cycles 130012 ---- Thread 27 ---- PC 5: Stalled ----- 89895 in-flight CPI 1.4460 -- Total Cycles 130012 ---- Thread 28 ---- PC 5: Stalled ----- 93097 in-flight CPI 1.3963 -- Total Cycles 130012 ---- Thread 29 ---- PC 5: Stalled ----- 90015 in-flight CPI 1.4440 -- Total Cycles 130012 ---- Thread 30 ---- PC 5: Stalled ----- 83631 in-flight CPI 1.5543 -- Total Cycles 130012 ---- Thread 31 ---- PC 5: Stalled ----- 81412 in-flight CPI 1.5968 -- Total Cycles 130012 Total CPI 0.0427 , IPC 23.3945 -- Total Cycles 130012 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8051 (3.863930%) FPSUB: 0 (0.000000%) FPMUL: 32216 (15.461478%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 80902 (38.827431%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5752 (2.760567%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 
(0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73685 (35.363764%) DIV: 7491 (3.595168%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.127662%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3332899 total) ADD%: 7.366 (245509) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.526 (50856) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.569 (18960) FPSUB%: 0.000 (0) FPMUL%: 4.827 (160878) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.168 (172250) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) 
ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (592) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35593) FPLE%: 0.455 (15161) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.798 (93268) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (25070) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.673 (522374) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39161) ORI%: 1.570 (52336) XORI%: 0.000 (0) MULI%: 3.193 (106436) LW%: 1.129 (37632) LWI%: 13.452 (448352) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9526) SWI%: 4.055 (135156) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46638) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10276) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1918) bned%: 0.000 (0) bneid%: 13.781 (459308) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23837) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4199) DIV%: 0.012 (406) FPUN%: 1.474 (49124) FPRSUB%: 4.254 (141775) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (77) FPGT%: 2.934 (97789) FPGE%: 1.019 (33963) SYNC%: 0.000 (0) NOP%: 8.739 (291274) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 
0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 395 LOAD 40603 INTCONV 0 ATOMIC_INC 26 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 19 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1760 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48579 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 11422 XORI 0 MULI 9135 LW 0 LWI 141960 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 90 DIV 36 FPUN 0 FPRSUB 59 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.3948 --Total thread-cycles: 4160384 --total thread-cycles issued: 3041625 (73.109237%) --iCache conflicts: 110484 (2.655620%) --thread*cycles of FU dependence: 254147 (6.108739%) --thread*cycles of data dependence: 208363 (5.008264%) --iCache cycles*banks: 4160384 (80.111139% used) Issue breakdown: --thread*cycles of issue worked: 3041625 (73.109237%) --thread*cycles of issue failed: 827485 (19.889630%) --thread*cycles of issue NOP/other: 291274 (7.001133%) Number of thread-cycles not ready: 208363 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3332899 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 6 3: 8 4: 8 5: 6 6: 8 7: 8 8: 9 9: 8 10: 7 11: 9 12: 7 13: 8 14: 8 15: 8 16: 9 17: 6 18: 8 19: 7 20: 7 21: 8 22: 7 23: 6 24: 7 25: 6 26: 7 27: 7 28: 7 29: 8 30: 6 31: 5 <=== Core 48 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97451 in-flight CPI 1.4914 -- Total Cycles 145372 ---- Thread 01 ---- PC 5: Stalled ----- 101876 in-flight CPI 1.4267 -- Total Cycles 
145372 ---- Thread 02 ---- PC 5: Stalled ----- 99965 in-flight CPI 1.4540 -- Total Cycles 145372 ---- Thread 03 ---- PC 5: Stalled ----- 92939 in-flight CPI 1.5639 -- Total Cycles 145372 ---- Thread 04 ---- PC 5: Stalled ----- 98916 in-flight CPI 1.4693 -- Total Cycles 145372 ---- Thread 05 ---- PC 5: Stalled ----- 97331 in-flight CPI 1.4933 -- Total Cycles 145372 ---- Thread 06 ---- PC 5: Stalled ----- 98537 in-flight CPI 1.4751 -- Total Cycles 145372 ---- Thread 07 ---- PC 5: Stalled ----- 97149 in-flight CPI 1.4961 -- Total Cycles 145372 ---- Thread 08 ---- PC 5: Stalled ----- 101316 in-flight CPI 1.4345 -- Total Cycles 145372 ---- Thread 09 ---- PC 5: Stalled ----- 94413 in-flight CPI 1.5395 -- Total Cycles 145372 ---- Thread 10 ---- PC 5: Stalled ----- 104665 in-flight CPI 1.3886 -- Total Cycles 145372 ---- Thread 11 ---- PC 5: Stalled ----- 97197 in-flight CPI 1.4953 -- Total Cycles 145372 ---- Thread 12 ---- PC 5: Stalled ----- 94844 in-flight CPI 1.5325 -- Total Cycles 145372 ---- Thread 13 ---- PC 5: Stalled ----- 95561 in-flight CPI 1.5210 -- Total Cycles 145372 ---- Thread 14 ---- PC 5: Stalled ----- 91162 in-flight CPI 1.5943 -- Total Cycles 145372 ---- Thread 15 ---- PC 5: Stalled ----- 93923 in-flight CPI 1.5475 -- Total Cycles 145372 ---- Thread 16 ---- PC 5: Stalled ----- 102547 in-flight CPI 1.4174 -- Total Cycles 145372 ---- Thread 17 ---- PC 5: Stalled ----- 92442 in-flight CPI 1.5723 -- Total Cycles 145372 ---- Thread 18 ---- PC 5: Stalled ----- 98863 in-flight CPI 1.4702 -- Total Cycles 145372 ---- Thread 19 ---- PC 5: Stalled ----- 92737 in-flight CPI 1.5673 -- Total Cycles 145372 ---- Thread 20 ---- PC 5: Stalled ----- 93124 in-flight CPI 1.5607 -- Total Cycles 145372 ---- Thread 21 ---- PC 5: Stalled ----- 93679 in-flight CPI 1.5515 -- Total Cycles 145372 ---- Thread 22 ---- PC 5: Stalled ----- 90204 in-flight CPI 1.6114 -- Total Cycles 145372 ---- Thread 23 ---- PC 5: Stalled ----- 95287 in-flight CPI 1.5253 -- Total Cycles 145372 ---- 
Thread 24 ---- PC 5: Stalled ----- 90143 in-flight CPI 1.6124 -- Total Cycles 145372 ---- Thread 25 ---- PC 5: Stalled ----- 95663 in-flight CPI 1.5193 -- Total Cycles 145372 ---- Thread 26 ---- PC 5: Stalled ----- 97933 in-flight CPI 1.4841 -- Total Cycles 145372 ---- Thread 27 ---- PC 5: Stalled ----- 84829 in-flight CPI 1.7134 -- Total Cycles 145372 ---- Thread 28 ---- PC 5: Stalled ----- 89839 in-flight CPI 1.6178 -- Total Cycles 145372 ---- Thread 29 ---- PC 5: Stalled ----- 85968 in-flight CPI 1.6907 -- Total Cycles 145372 ---- Thread 30 ---- PC 5: Stalled ----- 90810 in-flight CPI 1.6005 -- Total Cycles 145372 ---- Thread 31 ---- PC 5: Stalled ----- 83973 in-flight CPI 1.7309 -- Total Cycles 145372 Total CPI 0.0479 , IPC 20.8833 -- Total Cycles 145372 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7847 (3.597874%) FPSUB: 0 (0.000000%) FPMUL: 31777 (14.569855%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 93279 (42.768717%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5492 (2.518099%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 
(0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71952 (32.990220%) DIV: 7488 (3.433272%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.121962%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3327257 total) ADD%: 7.441 (247591) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.523 (50682) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.553 (18394) FPSUB%: 0.000 (0) FPMUL%: 4.776 (158906) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.150 (171355) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (579) FPINV%: 0.000 (0) FPCONV%: 
0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35409) FPLE%: 0.456 (15167) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.800 (93163) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24807) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.675 (521546) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (38993) ORI%: 1.564 (52025) XORI%: 0.000 (0) MULI%: 3.199 (106428) LW%: 1.130 (37590) LWI%: 13.464 (447996) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9563) SWI%: 4.055 (134926) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.398 (46525) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10320) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1934) bned%: 0.000 (0) bneid%: 13.795 (458989) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23809) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4095) DIV%: 0.012 (406) FPUN%: 1.475 (49076) FPRSUB%: 4.220 (140400) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.947 (98040) FPGE%: 1.019 (33909) SYNC%: 0.000 (0) NOP%: 8.757 (291362) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 39203 INTCONV 0 ATOMIC_INC 
25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 10 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1581 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48615 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 11141 XORI 0 MULI 8974 LW 0 LWI 141736 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 84 DIV 39 FPUN 0 FPRSUB 75 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.8835 --Total thread-cycles: 4651904 --total thread-cycles issued: 3035895 (65.261342%) --iCache conflicts: 109319 (2.349984%) --thread*cycles of FU dependence: 251955 (5.416169%) --thread*cycles of data dependence: 218101 (4.688424%) --iCache cycles*banks: 4651904 (71.525315% used) Issue breakdown: --thread*cycles of issue worked: 3035895 (65.261342%) --thread*cycles of issue failed: 1324647 (28.475373%) --thread*cycles of issue NOP/other: 291362 (6.263285%) Number of thread-cycles not ready: 218101 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3327257 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 7 4: 9 5: 8 6: 7 7: 8 8: 9 9: 7 10: 9 11: 8 12: 7 13: 7 14: 8 15: 7 16: 6 17: 7 18: 7 19: 6 20: 8 21: 7 22: 6 23: 8 24: 7 25: 8 26: 8 27: 6 28: 8 29: 6 30: 7 31: 6 <=== Core 49 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97091 in-flight CPI 1.3208 -- Total Cycles 128264 ---- Thread 01 ---- PC 5: Stalled ----- 98889 in-flight CPI 1.2968 -- Total Cycles 128264 ---- Thread 02 ---- PC 5: Stalled ----- 94368 in-flight CPI 1.3589 -- Total Cycles 128264 
---- Thread 03 ---- PC 5: Stalled ----- 101371 in-flight CPI 1.2651 -- Total Cycles 128264 ---- Thread 04 ---- PC 5: Stalled ----- 96759 in-flight CPI 1.3253 -- Total Cycles 128264 ---- Thread 05 ---- PC 5: Stalled ----- 94881 in-flight CPI 1.3515 -- Total Cycles 128264 ---- Thread 06 ---- PC 5: Stalled ----- 93687 in-flight CPI 1.3688 -- Total Cycles 128264 ---- Thread 07 ---- PC 5: Stalled ----- 98668 in-flight CPI 1.2997 -- Total Cycles 128264 ---- Thread 08 ---- PC 5: Stalled ----- 99805 in-flight CPI 1.2849 -- Total Cycles 128264 ---- Thread 09 ---- PC 5: Stalled ----- 98338 in-flight CPI 1.3041 -- Total Cycles 128264 ---- Thread 10 ---- PC 5: Stalled ----- 98714 in-flight CPI 1.2991 -- Total Cycles 128264 ---- Thread 11 ---- PC 5: Stalled ----- 97669 in-flight CPI 1.3130 -- Total Cycles 128264 ---- Thread 12 ---- PC 5: Stalled ----- 98879 in-flight CPI 1.2969 -- Total Cycles 128264 ---- Thread 13 ---- PC 5: Stalled ----- 99894 in-flight CPI 1.2838 -- Total Cycles 128264 ---- Thread 14 ---- PC 5: Stalled ----- 92835 in-flight CPI 1.3814 -- Total Cycles 128264 ---- Thread 15 ---- PC 5: Stalled ----- 101254 in-flight CPI 1.2665 -- Total Cycles 128264 ---- Thread 16 ---- PC 5: Stalled ----- 95995 in-flight CPI 1.3359 -- Total Cycles 128264 ---- Thread 17 ---- PC 5: Stalled ----- 96516 in-flight CPI 1.3287 -- Total Cycles 128264 ---- Thread 18 ---- PC 5: Stalled ----- 96213 in-flight CPI 1.3329 -- Total Cycles 128264 ---- Thread 19 ---- PC 5: Stalled ----- 95508 in-flight CPI 1.3427 -- Total Cycles 128264 ---- Thread 20 ---- PC 5: Stalled ----- 92300 in-flight CPI 1.3894 -- Total Cycles 128264 ---- Thread 21 ---- PC 5: Stalled ----- 94261 in-flight CPI 1.3605 -- Total Cycles 128264 ---- Thread 22 ---- PC 5: Stalled ----- 90312 in-flight CPI 1.4200 -- Total Cycles 128264 ---- Thread 23 ---- PC 5: Stalled ----- 92690 in-flight CPI 1.3836 -- Total Cycles 128264 ---- Thread 24 ---- PC 5: Stalled ----- 95077 in-flight CPI 1.3488 -- Total Cycles 128264 ---- Thread 25 
---- PC 5: Stalled ----- 92884 in-flight CPI 1.3807 -- Total Cycles 128264 ---- Thread 26 ---- PC 5: Stalled ----- 92206 in-flight CPI 1.3908 -- Total Cycles 128264 ---- Thread 27 ---- PC 5: Stalled ----- 85083 in-flight CPI 1.5073 -- Total Cycles 128264 ---- Thread 28 ---- PC 5: Stalled ----- 97231 in-flight CPI 1.3189 -- Total Cycles 128264 ---- Thread 29 ---- PC 5: Stalled ----- 90222 in-flight CPI 1.4213 -- Total Cycles 128264 ---- Thread 30 ---- PC 5: Stalled ----- 91094 in-flight CPI 1.4077 -- Total Cycles 128264 ---- Thread 31 ---- PC 5: Stalled ----- 91679 in-flight CPI 1.3989 -- Total Cycles 128264 Total CPI 0.0420 , IPC 23.8021 -- Total Cycles 128264 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 6815 (3.938783%) FPSUB: 0 (0.000000%) FPMUL: 29937 (17.302324%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 59282 (34.262497%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5763 (3.330771%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 
(0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 63127 (36.484745%) DIV: 7824 (4.521942%) FPUN: 0 (0.000000%) FPRSUB: 275 (0.158938%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3345396 total) ADD%: 7.560 (252917) SUB%: 0.000 (0) MUL%: 0.006 (212) BITOR%: 1.535 (51365) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.492 (16446) FPSUB%: 0.000 (0) FPMUL%: 4.597 (153773) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (636) FPMAX%: 0.019 (636) LOAD%: 5.057 (169182) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (244) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (603) FPINV%: 0.000 (0) FPCONV%: 0.020 (668) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.038 (34720) FPLE%: 0.460 (15387) EQ%: 0.000 (0) 
NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (636) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.847 (95227) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.724 (24208) CMPU%: 0.000 (0) RSUB%: 0.006 (212) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.724 (526046) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.183 (39577) ORI%: 1.519 (50802) XORI%: 0.000 (0) MULI%: 3.243 (108504) LW%: 1.149 (38430) LWI%: 13.585 (454486) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.292 (9764) SWI%: 4.112 (137574) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.422 (47572) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10423) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.045 (1501) bned%: 0.000 (0) bneid%: 13.849 (463295) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.726 (24302) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.107 (3570) DIV%: 0.013 (424) FPUN%: 1.495 (50006) FPRSUB%: 4.055 (135659) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) FPGT%: 2.966 (99221) FPGE%: 1.035 (34619) SYNC%: 0.000 (0) NOP%: 8.740 (292387) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 414 LOAD 37411 INTCONV 0 ATOMIC_INC 24 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 
0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1540 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49439 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 9621 XORI 0 MULI 10107 LW 0 LWI 143279 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 52 DIV 24 FPUN 0 FPRSUB 44 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8024 --Total thread-cycles: 4104448 --total thread-cycles issued: 3053009 (74.382938%) --iCache conflicts: 111263 (2.710791%) --thread*cycles of FU dependence: 252033 (6.140485%) --thread*cycles of data dependence: 173023 (4.215500%) --iCache cycles*banks: 4104448 (81.507379% used) Issue breakdown: --thread*cycles of issue worked: 3053009 (74.382938%) --thread*cycles of issue failed: 759052 (18.493400%) --thread*cycles of issue NOP/other: 292387 (7.123662%) Number of thread-cycles not ready: 173023 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3345396 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 7 4: 8 5: 9 6: 7 7: 8 8: 9 9: 8 10: 8 11: 7 12: 8 13: 8 14: 7 15: 8 16: 8 17: 8 18: 8 19: 8 20: 7 21: 7 22: 6 23: 7 24: 8 25: 7 26: 8 27: 6 28: 8 29: 8 30: 8 31: 6 <=== Core 50 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103175 in-flight CPI 1.2590 -- Total Cycles 129921 ---- Thread 01 ---- PC 5: Stalled ----- 101627 in-flight CPI 1.2782 -- Total Cycles 129921 ---- Thread 02 ---- PC 5: Stalled ----- 95597 in-flight CPI 1.3588 -- Total Cycles 129921 ---- Thread 03 ---- PC 5: Stalled ----- 101770 in-flight CPI 1.2763 -- Total Cycles 129921 ---- 
Thread 04 ---- PC 5: Stalled ----- 99297 in-flight CPI 1.3082 -- Total Cycles 129921 ---- Thread 05 ---- PC 5: Stalled ----- 99834 in-flight CPI 1.3012 -- Total Cycles 129921 ---- Thread 06 ---- PC 5: Stalled ----- 99509 in-flight CPI 1.3054 -- Total Cycles 129921 ---- Thread 07 ---- PC 5: Stalled ----- 97074 in-flight CPI 1.3381 -- Total Cycles 129921 ---- Thread 08 ---- PC 5: Stalled ----- 96548 in-flight CPI 1.3454 -- Total Cycles 129921 ---- Thread 09 ---- PC 5: Stalled ----- 97609 in-flight CPI 1.3308 -- Total Cycles 129921 ---- Thread 10 ---- PC 5: Stalled ----- 96434 in-flight CPI 1.3470 -- Total Cycles 129921 ---- Thread 11 ---- PC 5: Stalled ----- 98248 in-flight CPI 1.3221 -- Total Cycles 129921 ---- Thread 12 ---- PC 5: Stalled ----- 95517 in-flight CPI 1.3599 -- Total Cycles 129921 ---- Thread 13 ---- PC 5: Stalled ----- 94285 in-flight CPI 1.3777 -- Total Cycles 129921 ---- Thread 14 ---- PC 5: Stalled ----- 98221 in-flight CPI 1.3225 -- Total Cycles 129921 ---- Thread 15 ---- PC 5: Stalled ----- 92810 in-flight CPI 1.3996 -- Total Cycles 129921 ---- Thread 16 ---- PC 5: Stalled ----- 97567 in-flight CPI 1.3313 -- Total Cycles 129921 ---- Thread 17 ---- PC 5: Stalled ----- 95732 in-flight CPI 1.3569 -- Total Cycles 129921 ---- Thread 18 ---- PC 5: Stalled ----- 92764 in-flight CPI 1.4003 -- Total Cycles 129921 ---- Thread 19 ---- PC 5: Stalled ----- 89169 in-flight CPI 1.4568 -- Total Cycles 129921 ---- Thread 20 ---- PC 5: Stalled ----- 89520 in-flight CPI 1.4510 -- Total Cycles 129921 ---- Thread 21 ---- PC 5: Stalled ----- 91062 in-flight CPI 1.4265 -- Total Cycles 129921 ---- Thread 22 ---- PC 5: Stalled ----- 87569 in-flight CPI 1.4834 -- Total Cycles 129921 ---- Thread 23 ---- PC 5: Stalled ----- 95727 in-flight CPI 1.3569 -- Total Cycles 129921 ---- Thread 24 ---- PC 5: Stalled ----- 90707 in-flight CPI 1.4321 -- Total Cycles 129921 ---- Thread 25 ---- PC 5: Stalled ----- 85804 in-flight CPI 1.5139 -- Total Cycles 129921 ---- Thread 26 ---- PC 
5: Stalled ----- 88611 in-flight CPI 1.4659 -- Total Cycles 129921 ---- Thread 27 ---- PC 5: Stalled ----- 95333 in-flight CPI 1.3625 -- Total Cycles 129921 ---- Thread 28 ---- PC 5: Stalled ----- 90107 in-flight CPI 1.4416 -- Total Cycles 129921 ---- Thread 29 ---- PC 5: Stalled ----- 88078 in-flight CPI 1.4748 -- Total Cycles 129921 ---- Thread 30 ---- PC 5: Stalled ----- 93530 in-flight CPI 1.3888 -- Total Cycles 129921 ---- Thread 31 ---- PC 5: Stalled ----- 89996 in-flight CPI 1.4434 -- Total Cycles 129921 Total CPI 0.0429 , IPC 23.3171 -- Total Cycles 129921 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8929 (4.128922%) FPSUB: 0 (0.000000%) FPMUL: 33703 (15.584842%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79922 (36.957296%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5513 (2.549305%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) 
RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 80505 (37.226885%) DIV: 7419 (3.430672%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.122078%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3319580 total) ADD%: 7.371 (244697) SUB%: 0.000 (0) MUL%: 0.006 (201) BITOR%: 1.527 (50698) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.622 (20644) FPSUB%: 0.000 (0) FPMUL%: 4.974 (165127) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (603) FPMAX%: 0.018 (603) LOAD%: 5.226 (173465) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (578) FPINV%: 0.000 (0) FPCONV%: 0.019 (635) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.090 (36171) FPLE%: 0.452 (15021) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 
(603) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.754 (91431) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.771 (25581) CMPU%: 0.000 (0) RSUB%: 0.006 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.606 (518058) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.165 (38660) ORI%: 1.612 (53528) XORI%: 0.000 (0) MULI%: 3.149 (104526) LW%: 1.111 (36894) LWI%: 13.323 (442279) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.282 (9358) SWI%: 4.005 (132944) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.376 (45693) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10232) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.067 (2214) bned%: 0.000 (0) bneid%: 13.731 (455807) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.709 (23537) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.139 (4602) DIV%: 0.012 (402) FPUN%: 1.468 (48731) FPRSUB%: 4.370 (145072) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.908 (96525) FPGE%: 1.015 (33710) SYNC%: 0.000 (0) NOP%: 8.740 (290146) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 12 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 41920 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1591 LOADIMM 0 SPHERE_TEST 0 
TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 47749 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 12714 XORI 0 MULI 8818 LW 0 LWI 140256 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 87 DIV 22 FPUN 0 FPRSUB 54 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.3173 --Total thread-cycles: 4157472 --total thread-cycles issued: 3029434 (72.867214%) --iCache conflicts: 110744 (2.663734%) --thread*cycles of FU dependence: 253714 (6.102603%) --thread*cycles of data dependence: 216255 (5.201598%) --iCache cycles*banks: 4157472 (79.846888% used) Issue breakdown: --thread*cycles of issue worked: 3029434 (72.867214%) --thread*cycles of issue failed: 837892 (20.153882%) --thread*cycles of issue NOP/other: 290146 (6.978904%) Number of thread-cycles not ready: 216255 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3319580 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 9 4: 8 5: 7 6: 8 7: 7 8: 7 9: 7 10: 7 11: 8 12: 8 13: 7 14: 7 15: 7 16: 8 17: 7 18: 7 19: 6 20: 7 21: 7 22: 7 23: 8 24: 7 25: 6 26: 7 27: 8 28: 7 29: 6 30: 9 31: 7 <=== Core 51 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100760 in-flight CPI 1.4632 -- Total Cycles 147456 ---- Thread 01 ---- PC 5: Stalled ----- 93039 in-flight CPI 1.5846 -- Total Cycles 147456 ---- Thread 02 ---- PC 5: Stalled ----- 100705 in-flight CPI 1.4639 -- Total Cycles 147456 ---- Thread 03 ---- PC 5: Stalled ----- 101511 in-flight CPI 1.4523 -- Total Cycles 147456 ---- Thread 04 ---- PC 5: Stalled ----- 102353 in-flight CPI 1.4404 -- Total Cycles 147456 ---- Thread 
05 ---- PC 5: Stalled ----- 102227 in-flight CPI 1.4421 -- Total Cycles 147456 ---- Thread 06 ---- PC 5: Stalled ----- 98222 in-flight CPI 1.5010 -- Total Cycles 147456 ---- Thread 07 ---- PC 5: Stalled ----- 110955 in-flight CPI 1.3288 -- Total Cycles 147456 ---- Thread 08 ---- PC 5: Stalled ----- 95852 in-flight CPI 1.5381 -- Total Cycles 147456 ---- Thread 09 ---- PC 5: Stalled ----- 96891 in-flight CPI 1.5216 -- Total Cycles 147456 ---- Thread 10 ---- PC 5: Stalled ----- 98608 in-flight CPI 1.4951 -- Total Cycles 147456 ---- Thread 11 ---- PC 5: Stalled ----- 101098 in-flight CPI 1.4583 -- Total Cycles 147456 ---- Thread 12 ---- PC 5: Stalled ----- 99702 in-flight CPI 1.4787 -- Total Cycles 147456 ---- Thread 13 ---- PC 5: Stalled ----- 94976 in-flight CPI 1.5522 -- Total Cycles 147456 ---- Thread 14 ---- PC 5: Stalled ----- 93889 in-flight CPI 1.5702 -- Total Cycles 147456 ---- Thread 15 ---- PC 5: Stalled ----- 95341 in-flight CPI 1.5464 -- Total Cycles 147456 ---- Thread 16 ---- PC 5: Stalled ----- 95735 in-flight CPI 1.5399 -- Total Cycles 147456 ---- Thread 17 ---- PC 5: Stalled ----- 88925 in-flight CPI 1.6580 -- Total Cycles 147456 ---- Thread 18 ---- PC 5: Stalled ----- 90432 in-flight CPI 1.6303 -- Total Cycles 147456 ---- Thread 19 ---- PC 5: Stalled ----- 94976 in-flight CPI 1.5522 -- Total Cycles 147456 ---- Thread 20 ---- PC 5: Stalled ----- 90711 in-flight CPI 1.6253 -- Total Cycles 147456 ---- Thread 21 ---- PC 5: Stalled ----- 96697 in-flight CPI 1.5247 -- Total Cycles 147456 ---- Thread 22 ---- PC 5: Stalled ----- 95530 in-flight CPI 1.5433 -- Total Cycles 147456 ---- Thread 23 ---- PC 5: Stalled ----- 94493 in-flight CPI 1.5602 -- Total Cycles 147456 ---- Thread 24 ---- PC 5: Stalled ----- 92756 in-flight CPI 1.5894 -- Total Cycles 147456 ---- Thread 25 ---- PC 5: Stalled ----- 96808 in-flight CPI 1.5229 -- Total Cycles 147456 ---- Thread 26 ---- PC 5: Stalled ----- 87199 in-flight CPI 1.6907 -- Total Cycles 147456 ---- Thread 27 ---- PC 5: 
Stalled ----- 93083 in-flight CPI 1.5838 -- Total Cycles 147456 ---- Thread 28 ---- PC 5: Stalled ----- 89699 in-flight CPI 1.6436 -- Total Cycles 147456 ---- Thread 29 ---- PC 5: Stalled ----- 85013 in-flight CPI 1.7342 -- Total Cycles 147456 ---- Thread 30 ---- PC 5: Stalled ----- 92143 in-flight CPI 1.6000 -- Total Cycles 147456 ---- Thread 31 ---- PC 5: Stalled ----- 85339 in-flight CPI 1.7277 -- Total Cycles 147456 Total CPI 0.0482 , IPC 20.7263 -- Total Cycles 147456 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8131 (3.950021%) FPSUB: 0 (0.000000%) FPMUL: 32392 (15.735959%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 78075 (37.928656%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5654 (2.746700%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 
(0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73876 (35.888791%) DIV: 7455 (3.621622%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.128251%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3349196 total) ADD%: 7.370 (246824) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.520 (50892) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.569 (19045) FPSUB%: 0.000 (0) FPMUL%: 4.823 (161534) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.167 (173047) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (586) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35761) FPLE%: 0.451 (15112) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) 
MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.799 (93747) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (25216) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.674 (524968) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39350) ORI%: 1.576 (52781) XORI%: 0.000 (0) MULI%: 3.193 (106950) LW%: 1.129 (37822) LWI%: 13.456 (450670) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9562) SWI%: 4.062 (136031) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (46885) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10334) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1955) bned%: 0.000 (0) bneid%: 13.778 (461457) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.713 (23889) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4209) DIV%: 0.012 (404) FPUN%: 1.468 (49162) FPRSUB%: 4.250 (142356) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.938 (98414) FPGE%: 1.017 (34050) SYNC%: 0.000 (0) NOP%: 8.746 (292922) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 8 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 40005 INTCONV 0 ATOMIC_INC 23 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1754 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 
ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48721 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 18 ORI 11633 XORI 0 MULI 8937 LW 0 LWI 142387 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 90 DIV 25 FPUN 0 FPRSUB 73 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.7265 --Total thread-cycles: 4718592 --total thread-cycles issued: 3056274 (64.770889%) --iCache conflicts: 111125 (2.355046%) --thread*cycles of FU dependence: 254141 (5.385950%) --thread*cycles of data dependence: 205847 (4.362467%) --iCache cycles*banks: 4718592 (70.979394% used) Issue breakdown: --thread*cycles of issue worked: 3056274 (64.770889%) --thread*cycles of issue failed: 1369396 (29.021284%) --thread*cycles of issue NOP/other: 292922 (6.207826%) Number of thread-cycles not ready: 205847 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3349196 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 9 3: 8 4: 8 5: 9 6: 7 7: 6 8: 8 9: 8 10: 7 11: 8 12: 7 13: 8 14: 8 15: 7 16: 8 17: 5 18: 7 19: 8 20: 6 21: 7 22: 7 23: 7 24: 7 25: 8 26: 8 27: 8 28: 7 29: 6 30: 8 31: 5 <=== Core 52 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97708 in-flight CPI 1.3066 -- Total Cycles 127692 ---- Thread 01 ---- PC 5: Stalled ----- 97350 in-flight CPI 1.3115 -- Total Cycles 127692 ---- Thread 02 ---- PC 5: Stalled ----- 93598 in-flight CPI 1.3640 -- Total Cycles 127692 ---- Thread 03 ---- PC 5: Stalled ----- 103038 in-flight CPI 1.2390 -- Total Cycles 127692 ---- Thread 04 ---- PC 5: Stalled ----- 102903 in-flight CPI 1.2407 -- Total Cycles 127692 ---- Thread 05 ---- PC 5: Stalled ----- 96916 in-flight CPI 1.3173 -- Total Cycles 127692 ---- Thread 
06 ---- PC 5: Stalled ----- 93064 in-flight CPI 1.3718 -- Total Cycles 127692 ---- Thread 07 ---- PC 5: Stalled ----- 100894 in-flight CPI 1.2654 -- Total Cycles 127692 ---- Thread 08 ---- PC 5: Stalled ----- 96774 in-flight CPI 1.3192 -- Total Cycles 127692 ---- Thread 09 ---- PC 5: Stalled ----- 101514 in-flight CPI 1.2576 -- Total Cycles 127692 ---- Thread 10 ---- PC 5: Stalled ----- 96779 in-flight CPI 1.3192 -- Total Cycles 127692 ---- Thread 11 ---- PC 5: Stalled ----- 103091 in-flight CPI 1.2384 -- Total Cycles 127692 ---- Thread 12 ---- PC 5: Stalled ----- 93087 in-flight CPI 1.3715 -- Total Cycles 127692 ---- Thread 13 ---- PC 5: Stalled ----- 99183 in-flight CPI 1.2872 -- Total Cycles 127692 ---- Thread 14 ---- PC 5: Stalled ----- 94371 in-flight CPI 1.3529 -- Total Cycles 127692 ---- Thread 15 ---- PC 5: Stalled ----- 94835 in-flight CPI 1.3462 -- Total Cycles 127692 ---- Thread 16 ---- PC 5: Stalled ----- 99254 in-flight CPI 1.2863 -- Total Cycles 127692 ---- Thread 17 ---- PC 5: Stalled ----- 92981 in-flight CPI 1.3731 -- Total Cycles 127692 ---- Thread 18 ---- PC 5: Stalled ----- 89499 in-flight CPI 1.4265 -- Total Cycles 127692 ---- Thread 19 ---- PC 5: Stalled ----- 95465 in-flight CPI 1.3374 -- Total Cycles 127692 ---- Thread 20 ---- PC 5: Stalled ----- 93483 in-flight CPI 1.3657 -- Total Cycles 127692 ---- Thread 21 ---- PC 5: Stalled ----- 90822 in-flight CPI 1.4058 -- Total Cycles 127692 ---- Thread 22 ---- PC 5: Stalled ----- 93904 in-flight CPI 1.3596 -- Total Cycles 127692 ---- Thread 23 ---- PC 5: Stalled ----- 88208 in-flight CPI 1.4474 -- Total Cycles 127692 ---- Thread 24 ---- PC 5: Stalled ----- 96809 in-flight CPI 1.3187 -- Total Cycles 127692 ---- Thread 25 ---- PC 5: Stalled ----- 95537 in-flight CPI 1.3363 -- Total Cycles 127692 ---- Thread 26 ---- PC 5: Stalled ----- 91024 in-flight CPI 1.4025 -- Total Cycles 127692 ---- Thread 27 ---- PC 5: Stalled ----- 92331 in-flight CPI 1.3827 -- Total Cycles 127692 ---- Thread 28 ---- PC 5: 
Stalled ----- 93284 in-flight CPI 1.3686 -- Total Cycles 127692 ---- Thread 29 ---- PC 5: Stalled ----- 93413 in-flight CPI 1.3667 -- Total Cycles 127692 ---- Thread 30 ---- PC 5: Stalled ----- 85829 in-flight CPI 1.4875 -- Total Cycles 127692 ---- Thread 31 ---- PC 5: Stalled ----- 88472 in-flight CPI 1.4430 -- Total Cycles 127692 Total CPI 0.0419 , IPC 23.8541 -- Total Cycles 127692 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8212 (3.959632%) FPSUB: 0 (0.000000%) FPMUL: 32466 (15.654337%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 78219 (37.715352%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5816 (2.804338%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) 
RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74846 (36.088971%) DIV: 7568 (3.649111%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.128259%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3337629 total) ADD%: 7.392 (246730) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.517 (50640) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.572 (19086) FPSUB%: 0.000 (0) FPMUL%: 4.833 (161318) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.171 (172574) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (598) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.070 (35726) FPLE%: 0.448 (14960) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 
0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.795 (93300) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (25131) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.648 (522261) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39112) ORI%: 1.577 (52645) XORI%: 0.000 (0) MULI%: 3.194 (106620) LW%: 1.128 (37648) LWI%: 13.466 (449446) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9503) SWI%: 4.062 (135559) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46683) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10283) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2065) bned%: 0.000 (0) bneid%: 13.757 (459166) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23852) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4265) DIV%: 0.012 (410) FPUN%: 1.467 (48971) FPRSUB%: 4.265 (142354) FPSQRT%: 0.000 (0) FPNEG%: 0.003 (86) FPGT%: 2.930 (97797) FPGE%: 1.019 (34011) SYNC%: 0.000 (0) NOP%: 8.737 (291594) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 1 FPMAX 400 LOAD 40485 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2467 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 
sra 0 srl 0 ADDI 48489 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11656 XORI 0 MULI 9360 LW 0 LWI 142222 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 89 DIV 32 FPUN 0 FPRSUB 53 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8544 --Total thread-cycles: 4086144 --total thread-cycles issued: 3046035 (74.545464%) --iCache conflicts: 110457 (2.703209%) --thread*cycles of FU dependence: 255338 (6.248874%) --thread*cycles of data dependence: 207393 (5.075519%) --iCache cycles*banks: 4086144 (81.682413% used) Issue breakdown: --thread*cycles of issue worked: 3046035 (74.545464%) --thread*cycles of issue failed: 748515 (18.318371%) --thread*cycles of issue NOP/other: 291594 (7.136166%) Number of thread-cycles not ready: 207393 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3337629 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 8 4: 8 5: 8 6: 7 7: 8 8: 8 9: 9 10: 7 11: 9 12: 7 13: 8 14: 7 15: 7 16: 8 17: 7 18: 6 19: 7 20: 7 21: 6 22: 7 23: 7 24: 9 25: 7 26: 8 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 53 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99988 in-flight CPI 1.2653 -- Total Cycles 126541 ---- Thread 01 ---- PC 5: Stalled ----- 103252 in-flight CPI 1.2253 -- Total Cycles 126541 ---- Thread 02 ---- PC 5: Stalled ----- 94882 in-flight CPI 1.3335 -- Total Cycles 126541 ---- Thread 03 ---- PC 5: Stalled ----- 94498 in-flight CPI 1.3389 -- Total Cycles 126541 ---- Thread 04 ---- PC 5: Stalled ----- 99996 in-flight CPI 1.2652 -- Total Cycles 126541 ---- Thread 05 ---- PC 5: Stalled ----- 98187 in-flight CPI 1.2885 -- Total Cycles 126541 ---- Thread 06 ---- PC 5: Stalled ----- 95113 in-flight CPI 1.3302 -- Total Cycles 126541 ---- Thread 07 ---- PC 
5: Stalled ----- 100919 in-flight CPI 1.2536 -- Total Cycles 126541 ---- Thread 08 ---- PC 5: Stalled ----- 95960 in-flight CPI 1.3185 -- Total Cycles 126541 ---- Thread 09 ---- PC 5: Stalled ----- 98522 in-flight CPI 1.2841 -- Total Cycles 126541 ---- Thread 10 ---- PC 5: Stalled ----- 100397 in-flight CPI 1.2602 -- Total Cycles 126541 ---- Thread 11 ---- PC 5: Stalled ----- 93581 in-flight CPI 1.3519 -- Total Cycles 126541 ---- Thread 12 ---- PC 5: Stalled ----- 98063 in-flight CPI 1.2902 -- Total Cycles 126541 ---- Thread 13 ---- PC 5: Stalled ----- 99859 in-flight CPI 1.2670 -- Total Cycles 126541 ---- Thread 14 ---- PC 5: Stalled ----- 91577 in-flight CPI 1.3816 -- Total Cycles 126541 ---- Thread 15 ---- PC 5: Stalled ----- 98309 in-flight CPI 1.2870 -- Total Cycles 126541 ---- Thread 16 ---- PC 5: Stalled ----- 96178 in-flight CPI 1.3154 -- Total Cycles 126541 ---- Thread 17 ---- PC 5: Stalled ----- 94075 in-flight CPI 1.3449 -- Total Cycles 126541 ---- Thread 18 ---- PC 5: Stalled ----- 94377 in-flight CPI 1.3406 -- Total Cycles 126541 ---- Thread 19 ---- PC 5: Stalled ----- 94038 in-flight CPI 1.3454 -- Total Cycles 126541 ---- Thread 20 ---- PC 5: Stalled ----- 93197 in-flight CPI 1.3575 -- Total Cycles 126541 ---- Thread 21 ---- PC 5: Stalled ----- 92025 in-flight CPI 1.3749 -- Total Cycles 126541 ---- Thread 22 ---- PC 5: Stalled ----- 90242 in-flight CPI 1.4020 -- Total Cycles 126541 ---- Thread 23 ---- PC 5: Stalled ----- 94586 in-flight CPI 1.3376 -- Total Cycles 126541 ---- Thread 24 ---- PC 5: Stalled ----- 92283 in-flight CPI 1.3710 -- Total Cycles 126541 ---- Thread 25 ---- PC 5: Stalled ----- 91197 in-flight CPI 1.3872 -- Total Cycles 126541 ---- Thread 26 ---- PC 5: Stalled ----- 93025 in-flight CPI 1.3600 -- Total Cycles 126541 ---- Thread 27 ---- PC 5: Stalled ----- 89408 in-flight CPI 1.4151 -- Total Cycles 126541 ---- Thread 28 ---- PC 5: Stalled ----- 93990 in-flight CPI 1.3461 -- Total Cycles 126541 ---- Thread 29 ---- PC 5: Stalled ----- 
90810 in-flight CPI 1.3933 -- Total Cycles 126541 ---- Thread 30 ---- PC 5: Stalled ----- 85979 in-flight CPI 1.4715 -- Total Cycles 126541 ---- Thread 31 ---- PC 5: Stalled ----- 84063 in-flight CPI 1.5050 -- Total Cycles 126541 Total CPI 0.0417 , IPC 23.9695 -- Total Cycles 126541 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8169 (3.933512%) FPSUB: 0 (0.000000%) FPMUL: 32474 (15.636782%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79745 (38.398571%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5545 (2.670012%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 
(0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73989 (35.626959%) DIV: 7492 (3.607525%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.126639%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3324138 total) ADD%: 7.388 (245590) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.524 (50661) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.577 (19185) FPSUB%: 0.000 (0) FPMUL%: 4.848 (161165) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.164 (171663) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (582) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.072 (35625) FPLE%: 0.454 (15089) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) 
ADDK%: 2.789 (92698) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (24971) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.662 (520641) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (38902) ORI%: 1.581 (52551) XORI%: 0.000 (0) MULI%: 3.184 (105840) LW%: 1.125 (37404) LWI%: 13.424 (446218) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9489) SWI%: 4.049 (134579) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.394 (46323) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10264) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (1980) bned%: 0.000 (0) bneid%: 13.780 (458068) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23718) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4215) DIV%: 0.012 (406) FPUN%: 1.473 (48960) FPRSUB%: 4.261 (141640) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (65) FPGT%: 2.937 (97617) FPGE%: 1.019 (33871) SYNC%: 0.000 (0) NOP%: 8.753 (290953) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 15 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 390 LOAD 40244 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1389 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 16 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48313 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 
0 ANDI 13 ORI 11677 XORI 0 MULI 9222 LW 0 LWI 141207 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 81 DIV 22 FPUN 0 FPRSUB 54 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9698 --Total thread-cycles: 4049312 --total thread-cycles issued: 3033185 (74.906182%) --iCache conflicts: 109976 (2.715918%) --thread*cycles of FU dependence: 252734 (6.241406%) --thread*cycles of data dependence: 207677 (5.128698%) --iCache cycles*banks: 4049312 (82.092217% used) Issue breakdown: --thread*cycles of issue worked: 3033185 (74.906182%) --thread*cycles of issue failed: 725174 (17.908573%) --thread*cycles of issue NOP/other: 290953 (7.185245%) Number of thread-cycles not ready: 207677 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3324138 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 6 3: 7 4: 8 5: 9 6: 7 7: 8 8: 7 9: 8 10: 8 11: 8 12: 8 13: 8 14: 7 15: 7 16: 8 17: 7 18: 6 19: 7 20: 7 21: 6 22: 7 23: 7 24: 7 25: 9 26: 8 27: 7 28: 7 29: 6 30: 7 31: 7 <=== Core 54 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94207 in-flight CPI 1.3747 -- Total Cycles 129524 ---- Thread 01 ---- PC 5: Stalled ----- 97122 in-flight CPI 1.3334 -- Total Cycles 129524 ---- Thread 02 ---- PC 5: Stalled ----- 91923 in-flight CPI 1.4088 -- Total Cycles 129524 ---- Thread 03 ---- PC 5: Stalled ----- 97054 in-flight CPI 1.3343 -- Total Cycles 129524 ---- Thread 04 ---- PC 5: Stalled ----- 100459 in-flight CPI 1.2891 -- Total Cycles 129524 ---- Thread 05 ---- PC 5: Stalled ----- 100377 in-flight CPI 1.2901 -- Total Cycles 129524 ---- Thread 06 ---- PC 5: Stalled ----- 88796 in-flight CPI 1.4585 -- Total Cycles 129524 ---- Thread 07 ---- PC 5: Stalled ----- 98785 in-flight CPI 1.3109 -- Total Cycles 129524 ---- Thread 08 ---- PC 
5: Stalled ----- 101358 in-flight CPI 1.2776 -- Total Cycles 129524 ---- Thread 09 ---- PC 5: Stalled ----- 100789 in-flight CPI 1.2848 -- Total Cycles 129524 ---- Thread 10 ---- PC 5: Stalled ----- 103103 in-flight CPI 1.2561 -- Total Cycles 129524 ---- Thread 11 ---- PC 5: Stalled ----- 95747 in-flight CPI 1.3525 -- Total Cycles 129524 ---- Thread 12 ---- PC 5: Stalled ----- 96094 in-flight CPI 1.3476 -- Total Cycles 129524 ---- Thread 13 ---- PC 5: Stalled ----- 92254 in-flight CPI 1.4037 -- Total Cycles 129524 ---- Thread 14 ---- PC 5: Stalled ----- 97091 in-flight CPI 1.3337 -- Total Cycles 129524 ---- Thread 15 ---- PC 5: Stalled ----- 98252 in-flight CPI 1.3180 -- Total Cycles 129524 ---- Thread 16 ---- PC 5: Stalled ----- 90216 in-flight CPI 1.4355 -- Total Cycles 129524 ---- Thread 17 ---- PC 5: Stalled ----- 92211 in-flight CPI 1.4044 -- Total Cycles 129524 ---- Thread 18 ---- PC 5: Stalled ----- 95665 in-flight CPI 1.3537 -- Total Cycles 129524 ---- Thread 19 ---- PC 5: Stalled ----- 93019 in-flight CPI 1.3922 -- Total Cycles 129524 ---- Thread 20 ---- PC 5: Stalled ----- 90676 in-flight CPI 1.4282 -- Total Cycles 129524 ---- Thread 21 ---- PC 5: Stalled ----- 94163 in-flight CPI 1.3753 -- Total Cycles 129524 ---- Thread 22 ---- PC 5: Stalled ----- 98665 in-flight CPI 1.3125 -- Total Cycles 129524 ---- Thread 23 ---- PC 5: Stalled ----- 91733 in-flight CPI 1.4117 -- Total Cycles 129524 ---- Thread 24 ---- PC 5: Stalled ----- 90527 in-flight CPI 1.4305 -- Total Cycles 129524 ---- Thread 25 ---- PC 5: Stalled ----- 94018 in-flight CPI 1.3774 -- Total Cycles 129524 ---- Thread 26 ---- PC 5: Stalled ----- 91568 in-flight CPI 1.4142 -- Total Cycles 129524 ---- Thread 27 ---- PC 5: Stalled ----- 89225 in-flight CPI 1.4514 -- Total Cycles 129524 ---- Thread 28 ---- PC 5: Stalled ----- 85201 in-flight CPI 1.5200 -- Total Cycles 129524 ---- Thread 29 ---- PC 5: Stalled ----- 90001 in-flight CPI 1.4389 -- Total Cycles 129524 ---- Thread 30 ---- PC 5: Stalled ----- 
86406 in-flight CPI 1.4987 -- Total Cycles 129524 ---- Thread 31 ---- PC 5: Stalled ----- 83556 in-flight CPI 1.5498 -- Total Cycles 129524 Total CPI 0.0430 , IPC 23.2452 -- Total Cycles 129524 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7773 (3.469718%) FPSUB: 0 (0.000000%) FPMUL: 31539 (14.078402%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 100787 (44.989376%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5385 (2.403760%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 
(0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70790 (31.599293%) DIV: 7486 (3.341606%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.117845%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3299564 total) ADD%: 7.448 (245746) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.533 (50591) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.553 (18250) FPSUB%: 0.000 (0) FPMUL%: 4.779 (157686) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.141 (169618) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (573) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35117) FPLE%: 0.457 (15069) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.801 (92418) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.744 (24565) CMPU%: 
0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.672 (517093) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (38653) ORI%: 1.569 (51772) XORI%: 0.000 (0) MULI%: 3.198 (105532) LW%: 1.130 (37292) LWI%: 13.448 (443736) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9501) SWI%: 4.057 (133867) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.398 (46134) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10271) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1918) bned%: 0.000 (0) bneid%: 13.797 (455254) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23719) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4027) DIV%: 0.012 (406) FPUN%: 1.486 (49027) FPRSUB%: 4.209 (138879) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (65) FPGT%: 2.938 (96928) FPGE%: 1.029 (33958) SYNC%: 0.000 (0) NOP%: 8.749 (288694) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 8 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 395 LOAD 39292 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1469 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48075 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 16 ORI 11047 XORI 0 MULI 8748 LW 0 LWI 140376 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 
bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 82 DIV 21 FPUN 0 FPRSUB 65 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.2455 --Total thread-cycles: 4144768 --total thread-cycles issued: 3010870 (72.642667%) --iCache conflicts: 108148 (2.609265%) --thread*cycles of FU dependence: 249676 (6.023884%) --thread*cycles of data dependence: 224024 (5.404983%) --iCache cycles*banks: 4144768 (79.608702% used) Issue breakdown: --thread*cycles of issue worked: 3010870 (72.642667%) --thread*cycles of issue failed: 845204 (20.392070%) --thread*cycles of issue NOP/other: 288694 (6.965263%) Number of thread-cycles not ready: 224024 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3299564 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 8 2: 7 3: 8 4: 8 5: 8 6: 6 7: 8 8: 8 9: 9 10: 7 11: 8 12: 8 13: 7 14: 9 15: 8 16: 6 17: 7 18: 8 19: 8 20: 6 21: 7 22: 7 23: 7 24: 8 25: 7 26: 8 27: 7 28: 5 29: 7 30: 7 31: 7 <=== Core 55 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102890 in-flight CPI 1.3474 -- Total Cycles 138660 ---- Thread 01 ---- PC 5: Stalled ----- 104082 in-flight CPI 1.3319 -- Total Cycles 138660 ---- Thread 02 ---- PC 5: Stalled ----- 100835 in-flight CPI 1.3749 -- Total Cycles 138660 ---- Thread 03 ---- PC 5: Stalled ----- 100870 in-flight CPI 1.3743 -- Total Cycles 138660 ---- Thread 04 ---- PC 5: Stalled ----- 93827 in-flight CPI 1.4775 -- Total Cycles 138660 ---- Thread 05 ---- PC 5: Stalled ----- 98080 in-flight CPI 1.4135 -- Total Cycles 138660 ---- Thread 06 ---- PC 5: Stalled ----- 97609 in-flight CPI 1.4203 -- Total Cycles 138660 ---- Thread 07 ---- PC 5: Stalled ----- 103821 in-flight CPI 1.3353 -- Total Cycles 138660 ---- Thread 08 ---- PC 5: Stalled ----- 101841 in-flight CPI 1.3614 -- Total Cycles 138660 ---- Thread 09 ---- PC 5: 
Stalled ----- 99487 in-flight CPI 1.3934 -- Total Cycles 138660 ---- Thread 10 ---- PC 5: Stalled ----- 99474 in-flight CPI 1.3937 -- Total Cycles 138660 ---- Thread 11 ---- PC 5: Stalled ----- 101763 in-flight CPI 1.3623 -- Total Cycles 138660 ---- Thread 12 ---- PC 5: Stalled ----- 90945 in-flight CPI 1.5244 -- Total Cycles 138660 ---- Thread 13 ---- PC 5: Stalled ----- 94556 in-flight CPI 1.4662 -- Total Cycles 138660 ---- Thread 14 ---- PC 5: Stalled ----- 90846 in-flight CPI 1.5261 -- Total Cycles 138660 ---- Thread 15 ---- PC 5: Stalled ----- 98620 in-flight CPI 1.4058 -- Total Cycles 138660 ---- Thread 16 ---- PC 5: Stalled ----- 95631 in-flight CPI 1.4497 -- Total Cycles 138660 ---- Thread 17 ---- PC 5: Stalled ----- 95078 in-flight CPI 1.4581 -- Total Cycles 138660 ---- Thread 18 ---- PC 5: Stalled ----- 91528 in-flight CPI 1.5147 -- Total Cycles 138660 ---- Thread 19 ---- PC 5: Stalled ----- 93009 in-flight CPI 1.4906 -- Total Cycles 138660 ---- Thread 20 ---- PC 5: Stalled ----- 92405 in-flight CPI 1.5003 -- Total Cycles 138660 ---- Thread 21 ---- PC 5: Stalled ----- 95236 in-flight CPI 1.4557 -- Total Cycles 138660 ---- Thread 22 ---- PC 5: Stalled ----- 93210 in-flight CPI 1.4873 -- Total Cycles 138660 ---- Thread 23 ---- PC 5: Stalled ----- 89970 in-flight CPI 1.5409 -- Total Cycles 138660 ---- Thread 24 ---- PC 5: Stalled ----- 92348 in-flight CPI 1.5013 -- Total Cycles 138660 ---- Thread 25 ---- PC 5: Stalled ----- 87314 in-flight CPI 1.5877 -- Total Cycles 138660 ---- Thread 26 ---- PC 5: Stalled ----- 92522 in-flight CPI 1.4984 -- Total Cycles 138660 ---- Thread 27 ---- PC 5: Stalled ----- 89058 in-flight CPI 1.5567 -- Total Cycles 138660 ---- Thread 28 ---- PC 5: Stalled ----- 86746 in-flight CPI 1.5982 -- Total Cycles 138660 ---- Thread 29 ---- PC 5: Stalled ----- 92774 in-flight CPI 1.4943 -- Total Cycles 138660 ---- Thread 30 ---- PC 5: Stalled ----- 92233 in-flight CPI 1.5031 -- Total Cycles 138660 ---- Thread 31 ---- PC 5: Stalled ----- 
88210 in-flight CPI 1.5717 -- Total Cycles 138660 Total CPI 0.0455 , IPC 21.9772 -- Total Cycles 138660 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8334 (3.930353%) FPSUB: 0 (0.000000%) FPMUL: 32647 (15.396478%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 82430 (38.874374%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5427 (2.559399%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) 
SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75561 (35.634921%) DIV: 7381 (3.480914%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.123560%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3339271 total) ADD%: 7.446 (248641) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.530 (51086) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.582 (19449) FPSUB%: 0.000 (0) FPMUL%: 4.860 (162297) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.180 (172987) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (572) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.073 (35838) FPLE%: 0.454 (15149) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.783 (92935) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.757 (25278) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) 
MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.647 (522492) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (39055) ORI%: 1.589 (53047) XORI%: 0.000 (0) MULI%: 3.174 (105982) LW%: 1.123 (37494) LWI%: 13.379 (446775) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9528) SWI%: 4.033 (134684) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.390 (46421) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10350) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2079) bned%: 0.000 (0) bneid%: 13.761 (459509) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23837) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.129 (4310) DIV%: 0.012 (400) FPUN%: 1.477 (49325) FPRSUB%: 4.280 (142933) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) FPGT%: 2.922 (97563) FPGE%: 1.023 (34176) SYNC%: 0.000 (0) NOP%: 8.740 (291853) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 8 FPSUB 0 FPMUL 5 FPCMPLT 0 FPMIN 0 FPMAX 391 LOAD 40124 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1718 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48358 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 22 ORI 11923 XORI 0 MULI 9143 LW 0 LWI 141614 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 
0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 40 FPUN 0 FPRSUB 61 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.9775 --Total thread-cycles: 4437120 --total thread-cycles issued: 3047418 (68.680090%) --iCache conflicts: 109456 (2.466825%) --thread*cycles of FU dependence: 253563 (5.714585%) --thread*cycles of data dependence: 212042 (4.778820%) --iCache cycles*banks: 4437120 (75.258343% used) Issue breakdown: --thread*cycles of issue worked: 3047418 (68.680090%) --thread*cycles of issue failed: 1097849 (24.742378%) --thread*cycles of issue NOP/other: 291853 (6.577532%) Number of thread-cycles not ready: 212042 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3339271 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 9 2: 8 3: 9 4: 8 5: 8 6: 7 7: 9 8: 5 9: 9 10: 7 11: 9 12: 6 13: 7 14: 6 15: 7 16: 8 17: 7 18: 6 19: 7 20: 7 21: 7 22: 7 23: 7 24: 6 25: 8 26: 7 27: 6 28: 6 29: 7 30: 7 31: 6 <=== Core 56 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95936 in-flight CPI 1.3290 -- Total Cycles 127524 ---- Thread 01 ---- PC 5: Stalled ----- 99199 in-flight CPI 1.2853 -- Total Cycles 127524 ---- Thread 02 ---- PC 5: Stalled ----- 101636 in-flight CPI 1.2545 -- Total Cycles 127524 ---- Thread 03 ---- PC 5: Stalled ----- 97371 in-flight CPI 1.3095 -- Total Cycles 127524 ---- Thread 04 ---- PC 5: Stalled ----- 99224 in-flight CPI 1.2850 -- Total Cycles 127524 ---- Thread 05 ---- PC 5: Stalled ----- 97290 in-flight CPI 1.3105 -- Total Cycles 127524 ---- Thread 06 ---- PC 5: Stalled ----- 93711 in-flight CPI 1.3606 -- Total Cycles 127524 ---- Thread 07 ---- PC 5: Stalled ----- 95738 in-flight CPI 1.3318 -- Total Cycles 127524 ---- Thread 08 ---- PC 5: Stalled ----- 99468 in-flight CPI 1.2818 -- Total Cycles 127524 ---- Thread 09 ---- PC 5: Stalled ----- 103170 in-flight CPI 1.2358 -- Total Cycles 127524 ---- Thread 10 ---- PC 5: 
Stalled ----- 95868 in-flight CPI 1.3300 -- Total Cycles 127524 ---- Thread 11 ---- PC 5: Stalled ----- 101333 in-flight CPI 1.2582 -- Total Cycles 127524 ---- Thread 12 ---- PC 5: Stalled ----- 96089 in-flight CPI 1.3269 -- Total Cycles 127524 ---- Thread 13 ---- PC 5: Stalled ----- 90309 in-flight CPI 1.4119 -- Total Cycles 127524 ---- Thread 14 ---- PC 5: Stalled ----- 100224 in-flight CPI 1.2721 -- Total Cycles 127524 ---- Thread 15 ---- PC 5: Stalled ----- 97929 in-flight CPI 1.3020 -- Total Cycles 127524 ---- Thread 16 ---- PC 5: Stalled ----- 96801 in-flight CPI 1.3171 -- Total Cycles 127524 ---- Thread 17 ---- PC 5: Stalled ----- 97018 in-flight CPI 1.3142 -- Total Cycles 127524 ---- Thread 18 ---- PC 5: Stalled ----- 90846 in-flight CPI 1.4034 -- Total Cycles 127524 ---- Thread 19 ---- PC 5: Stalled ----- 97638 in-flight CPI 1.3058 -- Total Cycles 127524 ---- Thread 20 ---- PC 5: Stalled ----- 94797 in-flight CPI 1.3450 -- Total Cycles 127524 ---- Thread 21 ---- PC 5: Stalled ----- 97821 in-flight CPI 1.3034 -- Total Cycles 127524 ---- Thread 22 ---- PC 5: Stalled ----- 89033 in-flight CPI 1.4321 -- Total Cycles 127524 ---- Thread 23 ---- PC 5: Stalled ----- 89827 in-flight CPI 1.4194 -- Total Cycles 127524 ---- Thread 24 ---- PC 5: Stalled ----- 88950 in-flight CPI 1.4334 -- Total Cycles 127524 ---- Thread 25 ---- PC 5: Stalled ----- 95840 in-flight CPI 1.3303 -- Total Cycles 127524 ---- Thread 26 ---- PC 5: Stalled ----- 91796 in-flight CPI 1.3890 -- Total Cycles 127524 ---- Thread 27 ---- PC 5: Stalled ----- 87829 in-flight CPI 1.4517 -- Total Cycles 127524 ---- Thread 28 ---- PC 5: Stalled ----- 92995 in-flight CPI 1.3711 -- Total Cycles 127524 ---- Thread 29 ---- PC 5: Stalled ----- 93377 in-flight CPI 1.3654 -- Total Cycles 127524 ---- Thread 30 ---- PC 5: Stalled ----- 85438 in-flight CPI 1.4924 -- Total Cycles 127524 ---- Thread 31 ---- PC 5: Stalled ----- 89384 in-flight CPI 1.4264 -- Total Cycles 127524 Total CPI 0.0419 , IPC 23.8736 -- Total 
Cycles 127524 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8150 (3.813811%) FPSUB: 0 (0.000000%) FPMUL: 32392 (15.157910%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 85797 (40.148902%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5840 (2.732841%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 
(0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73572 (34.428186%) DIV: 7677 (3.592470%) FPUN: 0 (0.000000%) FPRSUB: 269 (0.125879%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3336538 total) ADD%: 7.470 (249230) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.524 (50842) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.570 (19027) FPSUB%: 0.000 (0) FPMUL%: 4.830 (161167) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.138 (171440) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (603) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.071 (35737) FPLE%: 0.452 (15084) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.785 (92938) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (24943) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.637 
(521735) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.167 (38941) ORI%: 1.576 (52593) XORI%: 0.000 (0) MULI%: 3.185 (106276) LW%: 1.124 (37508) LWI%: 13.444 (448549) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9500) SWI%: 4.048 (135076) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.393 (46469) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10285) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (1999) bned%: 0.000 (0) bneid%: 13.779 (459728) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23845) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4188) DIV%: 0.012 (416) FPUN%: 1.475 (49201) FPRSUB%: 4.245 (141642) FPSQRT%: 0.000 (0) FPNEG%: 0.003 (86) FPGT%: 2.939 (98064) FPGE%: 1.023 (34117) SYNC%: 0.000 (0) NOP%: 8.752 (292029) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 8 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 39944 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 9 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1611 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 15 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48648 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11558 XORI 0 MULI 9295 LW 0 LWI 142059 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 92 
DIV 40 FPUN 0 FPRSUB 59 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8738 --Total thread-cycles: 4080768 --total thread-cycles issued: 3044509 (74.606275%) --iCache conflicts: 110528 (2.708510%) --thread*cycles of FU dependence: 253823 (6.219981%) --thread*cycles of data dependence: 213697 (5.236686%) --iCache cycles*banks: 4080768 (81.763286% used) Issue breakdown: --thread*cycles of issue worked: 3044509 (74.606275%) --thread*cycles of issue failed: 744230 (18.237498%) --thread*cycles of issue NOP/other: 292029 (7.156226%) Number of thread-cycles not ready: 213697 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3336538 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 7 4: 8 5: 8 6: 6 7: 7 8: 8 9: 9 10: 7 11: 8 12: 8 13: 5 14: 8 15: 8 16: 8 17: 8 18: 8 19: 8 20: 8 21: 8 22: 7 23: 8 24: 7 25: 8 26: 7 27: 7 28: 7 29: 8 30: 6 31: 8 <=== Core 57 ===> ---- Thread 00 ---- PC 5: Stalled ----- 88642 in-flight CPI 1.4508 -- Total Cycles 128616 ---- Thread 01 ---- PC 5: Stalled ----- 98809 in-flight CPI 1.3014 -- Total Cycles 128616 ---- Thread 02 ---- PC 5: Stalled ----- 104873 in-flight CPI 1.2262 -- Total Cycles 128616 ---- Thread 03 ---- PC 5: Stalled ----- 96245 in-flight CPI 1.3361 -- Total Cycles 128616 ---- Thread 04 ---- PC 5: Stalled ----- 102142 in-flight CPI 1.2590 -- Total Cycles 128616 ---- Thread 05 ---- PC 5: Stalled ----- 97115 in-flight CPI 1.3241 -- Total Cycles 128616 ---- Thread 06 ---- PC 5: Stalled ----- 97676 in-flight CPI 1.3165 -- Total Cycles 128616 ---- Thread 07 ---- PC 5: Stalled ----- 100167 in-flight CPI 1.2838 -- Total Cycles 128616 ---- Thread 08 ---- PC 5: Stalled ----- 96678 in-flight CPI 1.3302 -- Total Cycles 128616 ---- Thread 09 ---- PC 5: Stalled ----- 92782 in-flight CPI 1.3860 -- Total Cycles 128616 ---- Thread 10 ---- PC 5: Stalled ----- 100970 in-flight CPI 1.2736 -- Total Cycles 128616 ---- Thread 11 
---- PC 5: Stalled ----- 94697 in-flight CPI 1.3579 -- Total Cycles 128616 ---- Thread 12 ---- PC 5: Stalled ----- 96990 in-flight CPI 1.3259 -- Total Cycles 128616 ---- Thread 13 ---- PC 5: Stalled ----- 99048 in-flight CPI 1.2983 -- Total Cycles 128616 ---- Thread 14 ---- PC 5: Stalled ----- 97819 in-flight CPI 1.3146 -- Total Cycles 128616 ---- Thread 15 ---- PC 5: Stalled ----- 91964 in-flight CPI 1.3983 -- Total Cycles 128616 ---- Thread 16 ---- PC 5: Stalled ----- 91229 in-flight CPI 1.4096 -- Total Cycles 128616 ---- Thread 17 ---- PC 5: Stalled ----- 93066 in-flight CPI 1.3817 -- Total Cycles 128616 ---- Thread 18 ---- PC 5: Stalled ----- 97951 in-flight CPI 1.3128 -- Total Cycles 128616 ---- Thread 19 ---- PC 5: Stalled ----- 93289 in-flight CPI 1.3785 -- Total Cycles 128616 ---- Thread 20 ---- PC 5: Stalled ----- 96285 in-flight CPI 1.3355 -- Total Cycles 128616 ---- Thread 21 ---- PC 5: Stalled ----- 92940 in-flight CPI 1.3836 -- Total Cycles 128616 ---- Thread 22 ---- PC 5: Stalled ----- 97517 in-flight CPI 1.3187 -- Total Cycles 128616 ---- Thread 23 ---- PC 5: Stalled ----- 91390 in-flight CPI 1.4071 -- Total Cycles 128616 ---- Thread 24 ---- PC 5: Stalled ----- 93641 in-flight CPI 1.3732 -- Total Cycles 128616 ---- Thread 25 ---- PC 5: Stalled ----- 94662 in-flight CPI 1.3584 -- Total Cycles 128616 ---- Thread 26 ---- PC 5: Stalled ----- 93151 in-flight CPI 1.3804 -- Total Cycles 128616 ---- Thread 27 ---- PC 5: Stalled ----- 93893 in-flight CPI 1.3695 -- Total Cycles 128616 ---- Thread 28 ---- PC 5: Stalled ----- 85193 in-flight CPI 1.5096 -- Total Cycles 128616 ---- Thread 29 ---- PC 5: Stalled ----- 93644 in-flight CPI 1.3732 -- Total Cycles 128616 ---- Thread 30 ---- PC 5: Stalled ----- 84330 in-flight CPI 1.5249 -- Total Cycles 128616 ---- Thread 31 ---- PC 5: Stalled ----- 89711 in-flight CPI 1.4334 -- Total Cycles 128616 Total CPI 0.0423 , IPC 23.6289 -- Total Cycles 128616 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 
17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7962 (3.756670%) FPSUB: 0 (0.000000%) FPMUL: 31884 (15.043667%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 86191 (40.667066%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5404 (2.549742%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) 
bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72854 (34.374336%) DIV: 7387 (3.485371%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.123146%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3329994 total) ADD%: 7.440 (247750) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.543 (51378) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.556 (18519) FPSUB%: 0.000 (0) FPMUL%: 4.790 (159503) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.155 (171652) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (571) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (35474) FPLE%: 0.455 (15164) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.797 (93150) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (25004) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.661 (521496) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) 
RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39100) ORI%: 1.572 (52364) XORI%: 0.000 (0) MULI%: 3.193 (106322) LW%: 1.129 (37580) LWI%: 13.435 (447393) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9552) SWI%: 4.043 (134637) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.397 (46525) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10316) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1903) bned%: 0.000 (0) bneid%: 13.792 (459284) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (23994) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4150) DIV%: 0.012 (400) FPUN%: 1.491 (49658) FPRSUB%: 4.238 (141140) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (61) FPGT%: 2.925 (97415) FPGE%: 1.036 (34494) SYNC%: 0.000 (0) NOP%: 8.735 (290885) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 17 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 383 LOAD 40059 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 10 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1318 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48546 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 11275 XORI 0 MULI 9005 LW 0 LWI 141627 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 73 DIV 35 FPUN 0 FPRSUB 62 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average 
#threads Issuing each cycle: 23.6291 --Total thread-cycles: 4115712 --total thread-cycles issued: 3039109 (73.841634%) --iCache conflicts: 109527 (2.661192%) --thread*cycles of FU dependence: 252492 (6.134832%) --thread*cycles of data dependence: 211943 (5.149607%) --iCache cycles*banks: 4115712 (80.910083% used) Issue breakdown: --thread*cycles of issue worked: 3039109 (73.841634%) --thread*cycles of issue failed: 785718 (19.090694%) --thread*cycles of issue NOP/other: 290885 (7.067671%) Number of thread-cycles not ready: 211943 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3329994 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 5 1: 7 2: 8 3: 7 4: 8 5: 8 6: 8 7: 8 8: 6 9: 7 10: 8 11: 8 12: 7 13: 8 14: 7 15: 8 16: 7 17: 8 18: 7 19: 6 20: 8 21: 8 22: 7 23: 6 24: 8 25: 8 26: 8 27: 8 28: 4 29: 8 30: 6 31: 7 <=== Core 58 ===> ---- Thread 00 ---- PC 5: Stalled ----- 105465 in-flight CPI 1.2312 -- Total Cycles 129870 ---- Thread 01 ---- PC 5: Stalled ----- 95124 in-flight CPI 1.3650 -- Total Cycles 129870 ---- Thread 02 ---- PC 5: Stalled ----- 90843 in-flight CPI 1.4294 -- Total Cycles 129870 ---- Thread 03 ---- PC 5: Stalled ----- 103149 in-flight CPI 1.2588 -- Total Cycles 129870 ---- Thread 04 ---- PC 5: Stalled ----- 99422 in-flight CPI 1.3060 -- Total Cycles 129870 ---- Thread 05 ---- PC 5: Stalled ----- 95998 in-flight CPI 1.3526 -- Total Cycles 129870 ---- Thread 06 ---- PC 5: Stalled ----- 103242 in-flight CPI 1.2577 -- Total Cycles 129870 ---- Thread 07 ---- PC 5: Stalled ----- 98127 in-flight CPI 1.3233 -- Total Cycles 129870 ---- Thread 08 ---- PC 5: Stalled ----- 103132 in-flight CPI 1.2590 -- Total Cycles 129870 ---- Thread 09 ---- PC 5: Stalled ----- 96191 in-flight CPI 1.3499 -- Total Cycles 129870 ---- Thread 10 ---- PC 5: Stalled ----- 98392 in-flight CPI 1.3197 -- Total Cycles 129870 ---- Thread 11 ---- PC 5: Stalled ----- 100083 in-flight CPI 1.2974 -- Total Cycles 129870 ---- Thread 12 ---- PC 5: 
Stalled ----- 95444 in-flight CPI 1.3604 -- Total Cycles 129870 ---- Thread 13 ---- PC 5: Stalled ----- 101420 in-flight CPI 1.2802 -- Total Cycles 129870 ---- Thread 14 ---- PC 5: Stalled ----- 94227 in-flight CPI 1.3780 -- Total Cycles 129870 ---- Thread 15 ---- PC 5: Stalled ----- 97343 in-flight CPI 1.3339 -- Total Cycles 129870 ---- Thread 16 ---- PC 5: Stalled ----- 97911 in-flight CPI 1.3261 -- Total Cycles 129870 ---- Thread 17 ---- PC 5: Stalled ----- 97111 in-flight CPI 1.3371 -- Total Cycles 129870 ---- Thread 18 ---- PC 5: Stalled ----- 95773 in-flight CPI 1.3557 -- Total Cycles 129870 ---- Thread 19 ---- PC 5: Stalled ----- 94529 in-flight CPI 1.3736 -- Total Cycles 129870 ---- Thread 20 ---- PC 5: Stalled ----- 87549 in-flight CPI 1.4832 -- Total Cycles 129870 ---- Thread 21 ---- PC 5: Stalled ----- 92124 in-flight CPI 1.4095 -- Total Cycles 129870 ---- Thread 22 ---- PC 5: Stalled ----- 96383 in-flight CPI 1.3472 -- Total Cycles 129870 ---- Thread 23 ---- PC 5: Stalled ----- 92575 in-flight CPI 1.4027 -- Total Cycles 129870 ---- Thread 24 ---- PC 5: Stalled ----- 96433 in-flight CPI 1.3465 -- Total Cycles 129870 ---- Thread 25 ---- PC 5: Stalled ----- 91655 in-flight CPI 1.4167 -- Total Cycles 129870 ---- Thread 26 ---- PC 5: Stalled ----- 85954 in-flight CPI 1.5107 -- Total Cycles 129870 ---- Thread 27 ---- PC 5: Stalled ----- 92400 in-flight CPI 1.4052 -- Total Cycles 129870 ---- Thread 28 ---- PC 5: Stalled ----- 91420 in-flight CPI 1.4203 -- Total Cycles 129870 ---- Thread 29 ---- PC 5: Stalled ----- 95527 in-flight CPI 1.3592 -- Total Cycles 129870 ---- Thread 30 ---- PC 5: Stalled ----- 84639 in-flight CPI 1.5341 -- Total Cycles 129870 ---- Thread 31 ---- PC 5: Stalled ----- 86577 in-flight CPI 1.4998 -- Total Cycles 129870 Total CPI 0.0425 , IPC 23.5367 -- Total Cycles 129870 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) 
SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7701 (3.850943%) FPSUB: 0 (0.000000%) FPMUL: 31627 (15.815319%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76753 (38.380914%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5687 (2.843827%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 
(0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70452 (35.230051%) DIV: 7494 (3.747431%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.131515%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3349619 total) ADD%: 7.429 (248831) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.530 (51240) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.540 (18082) FPSUB%: 0.000 (0) FPMUL%: 4.740 (158776) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.133 (171931) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (589) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.059 (35466) FPLE%: 0.455 (15242) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.815 (94283) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.744 (24934) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.687 (525444) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39461) ORI%: 1.554 
(52050) XORI%: 0.000 (0) MULI%: 3.211 (107564) LW%: 1.136 (38038) LWI%: 13.505 (452354) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9639) SWI%: 4.071 (136356) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.407 (47131) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10372) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1811) bned%: 0.000 (0) bneid%: 13.807 (462479) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (24060) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (4006) DIV%: 0.012 (406) FPUN%: 1.481 (49614) FPRSUB%: 4.190 (140357) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (68) FPGT%: 2.944 (98610) FPGE%: 1.026 (34372) SYNC%: 0.000 (0) NOP%: 8.743 (292848) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 26 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 39938 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2102 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49001 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 26 ORI 10939 XORI 0 MULI 9524 LW 0 LWI 142967 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 61 DIV 39 FPUN 0 FPRSUB 48 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5370 --Total thread-cycles: 4155840 --total thread-cycles 
issued: 3056771 (73.553626%) --iCache conflicts: 109996 (2.646781%) --thread*cycles of FU dependence: 255141 (6.139336%) --thread*cycles of data dependence: 199977 (4.811951%) --iCache cycles*banks: 4155840 (80.601058% used) Issue breakdown: --thread*cycles of issue worked: 3056771 (73.553626%) --thread*cycles of issue failed: 806221 (19.399712%) --thread*cycles of issue NOP/other: 292848 (7.046662%) Number of thread-cycles not ready: 199977 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3349619 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 6 3: 9 4: 8 5: 8 6: 7 7: 7 8: 10 9: 7 10: 7 11: 7 12: 8 13: 9 14: 7 15: 8 16: 8 17: 7 18: 8 19: 7 20: 5 21: 7 22: 7 23: 6 24: 7 25: 6 26: 6 27: 8 28: 8 29: 8 30: 7 31: 7 <=== Core 59 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102522 in-flight CPI 1.2341 -- Total Cycles 126543 ---- Thread 01 ---- PC 5: Stalled ----- 101416 in-flight CPI 1.2475 -- Total Cycles 126543 ---- Thread 02 ---- PC 5: Stalled ----- 98699 in-flight CPI 1.2819 -- Total Cycles 126543 ---- Thread 03 ---- PC 5: Stalled ----- 100978 in-flight CPI 1.2529 -- Total Cycles 126543 ---- Thread 04 ---- PC 5: Stalled ----- 95104 in-flight CPI 1.3303 -- Total Cycles 126543 ---- Thread 05 ---- PC 5: Stalled ----- 95724 in-flight CPI 1.3217 -- Total Cycles 126543 ---- Thread 06 ---- PC 5: Stalled ----- 96905 in-flight CPI 1.3056 -- Total Cycles 126543 ---- Thread 07 ---- PC 5: Stalled ----- 98002 in-flight CPI 1.2909 -- Total Cycles 126543 ---- Thread 08 ---- PC 5: Stalled ----- 95237 in-flight CPI 1.3284 -- Total Cycles 126543 ---- Thread 09 ---- PC 5: Stalled ----- 98832 in-flight CPI 1.2801 -- Total Cycles 126543 ---- Thread 10 ---- PC 5: Stalled ----- 96874 in-flight CPI 1.3060 -- Total Cycles 126543 ---- Thread 11 ---- PC 5: Stalled ----- 98340 in-flight CPI 1.2865 -- Total Cycles 126543 ---- Thread 12 ---- PC 5: Stalled ----- 100896 in-flight CPI 1.2540 -- Total Cycles 126543 ---- Thread 13 ---- PC 5: 
Stalled ----- 94728 in-flight CPI 1.3356 -- Total Cycles 126543 ---- Thread 14 ---- PC 5: Stalled ----- 97439 in-flight CPI 1.2984 -- Total Cycles 126543 ---- Thread 15 ---- PC 5: Stalled ----- 93726 in-flight CPI 1.3499 -- Total Cycles 126543 ---- Thread 16 ---- PC 5: Stalled ----- 98305 in-flight CPI 1.2870 -- Total Cycles 126543 ---- Thread 17 ---- PC 5: Stalled ----- 97126 in-flight CPI 1.3026 -- Total Cycles 126543 ---- Thread 18 ---- PC 5: Stalled ----- 95330 in-flight CPI 1.3271 -- Total Cycles 126543 ---- Thread 19 ---- PC 5: Stalled ----- 93606 in-flight CPI 1.3516 -- Total Cycles 126543 ---- Thread 20 ---- PC 5: Stalled ----- 93637 in-flight CPI 1.3511 -- Total Cycles 126543 ---- Thread 21 ---- PC 5: Stalled ----- 95174 in-flight CPI 1.3293 -- Total Cycles 126543 ---- Thread 22 ---- PC 5: Stalled ----- 90277 in-flight CPI 1.4014 -- Total Cycles 126543 ---- Thread 23 ---- PC 5: Stalled ----- 92680 in-flight CPI 1.3651 -- Total Cycles 126543 ---- Thread 24 ---- PC 5: Stalled ----- 90078 in-flight CPI 1.4045 -- Total Cycles 126543 ---- Thread 25 ---- PC 5: Stalled ----- 89634 in-flight CPI 1.4116 -- Total Cycles 126543 ---- Thread 26 ---- PC 5: Stalled ----- 95491 in-flight CPI 1.3250 -- Total Cycles 126543 ---- Thread 27 ---- PC 5: Stalled ----- 89829 in-flight CPI 1.4085 -- Total Cycles 126543 ---- Thread 28 ---- PC 5: Stalled ----- 91713 in-flight CPI 1.3795 -- Total Cycles 126543 ---- Thread 29 ---- PC 5: Stalled ----- 93515 in-flight CPI 1.3529 -- Total Cycles 126543 ---- Thread 30 ---- PC 5: Stalled ----- 89319 in-flight CPI 1.4165 -- Total Cycles 126543 ---- Thread 31 ---- PC 5: Stalled ----- 86855 in-flight CPI 1.4567 -- Total Cycles 126543 Total CPI 0.0415 , IPC 24.0912 -- Total Cycles 126543 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 
0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7478 (3.879134%) FPSUB: 0 (0.000000%) FPMUL: 31223 (16.196602%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 70773 (36.712748%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5916 (3.068863%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 
(0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69213 (35.903514%) DIV: 7896 (4.095967%) FPUN: 0 (0.000000%) FPRSUB: 276 (0.143172%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3340995 total) ADD%: 7.556 (252449) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.528 (51043) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.531 (17752) FPSUB%: 0.000 (0) FPMUL%: 4.711 (157408) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.100 (170375) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (613) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.057 (35329) FPLE%: 0.458 (15299) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.808 (93809) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.735 (24546) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.674 (523652) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39172) ORI%: 1.539 (51427) XORI%: 0.000 (0) MULI%: 3.213 (107358) LW%: 1.133 (37866) LWI%: 13.505 (451213) lbu%: 0.000 
(0) lbui%: 0.000 (0) SW%: 0.288 (9622) SWI%: 4.073 (136076) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46870) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10333) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.052 (1741) bned%: 0.000 (0) bneid%: 13.816 (461599) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23886) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.118 (3926) DIV%: 0.013 (428) FPUN%: 1.482 (49523) FPRSUB%: 4.156 (138846) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.957 (98807) FPGE%: 1.024 (34224) SYNC%: 0.000 (0) NOP%: 8.751 (292362) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 37 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 415 LOAD 39410 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 10 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1502 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48990 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 18 ORI 10606 XORI 0 MULI 9703 LW 0 LWI 142683 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 25 FPUN 0 FPRSUB 67 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.0915 --Total thread-cycles: 4049376 --total thread-cycles issued: 3048633 (75.286489%) --iCache conflicts: 111032 (2.741953%) --thread*cycles of FU dependence: 
253593 (6.262520%) --thread*cycles of data dependence: 192775 (4.760610%) --iCache cycles*banks: 4049376 (82.507206% used) Issue breakdown: --thread*cycles of issue worked: 3048633 (75.286489%) --thread*cycles of issue failed: 708381 (17.493584%) --thread*cycles of issue NOP/other: 292362 (7.219927%) Number of thread-cycles not ready: 192775 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3340995 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 9 4: 8 5: 7 6: 7 7: 9 8: 8 9: 8 10: 8 11: 8 12: 8 13: 8 14: 9 15: 8 16: 8 17: 8 18: 9 19: 7 20: 8 21: 8 22: 8 23: 7 24: 8 25: 6 26: 7 27: 6 28: 7 29: 8 30: 7 31: 7 <=== Core 60 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99849 in-flight CPI 1.2856 -- Total Cycles 128388 ---- Thread 01 ---- PC 5: Stalled ----- 96042 in-flight CPI 1.3366 -- Total Cycles 128388 ---- Thread 02 ---- PC 5: Stalled ----- 96453 in-flight CPI 1.3308 -- Total Cycles 128388 ---- Thread 03 ---- PC 5: Stalled ----- 100751 in-flight CPI 1.2740 -- Total Cycles 128388 ---- Thread 04 ---- PC 5: Stalled ----- 92433 in-flight CPI 1.3888 -- Total Cycles 128388 ---- Thread 05 ---- PC 5: Stalled ----- 100954 in-flight CPI 1.2715 -- Total Cycles 128388 ---- Thread 06 ---- PC 5: Stalled ----- 101532 in-flight CPI 1.2642 -- Total Cycles 128388 ---- Thread 07 ---- PC 5: Stalled ----- 104138 in-flight CPI 1.2326 -- Total Cycles 128388 ---- Thread 08 ---- PC 5: Stalled ----- 93577 in-flight CPI 1.3718 -- Total Cycles 128388 ---- Thread 09 ---- PC 5: Stalled ----- 100782 in-flight CPI 1.2737 -- Total Cycles 128388 ---- Thread 10 ---- PC 5: Stalled ----- 99985 in-flight CPI 1.2838 -- Total Cycles 128388 ---- Thread 11 ---- PC 5: Stalled ----- 97849 in-flight CPI 1.3118 -- Total Cycles 128388 ---- Thread 12 ---- PC 5: Stalled ----- 101862 in-flight CPI 1.2602 -- Total Cycles 128388 ---- Thread 13 ---- PC 5: Stalled ----- 99723 in-flight CPI 1.2872 -- Total Cycles 128388 ---- Thread 14 ---- PC 5: Stalled 
----- 97291 in-flight CPI 1.3194 -- Total Cycles 128388 ---- Thread 15 ---- PC 5: Stalled ----- 96520 in-flight CPI 1.3299 -- Total Cycles 128388 ---- Thread 16 ---- PC 5: Stalled ----- 99376 in-flight CPI 1.2917 -- Total Cycles 128388 ---- Thread 17 ---- PC 5: Stalled ----- 93527 in-flight CPI 1.3725 -- Total Cycles 128388 ---- Thread 18 ---- PC 5: Stalled ----- 95699 in-flight CPI 1.3414 -- Total Cycles 128388 ---- Thread 19 ---- PC 5: Stalled ----- 96582 in-flight CPI 1.3291 -- Total Cycles 128388 ---- Thread 20 ---- PC 5: Stalled ----- 98726 in-flight CPI 1.3002 -- Total Cycles 128388 ---- Thread 21 ---- PC 5: Stalled ----- 93635 in-flight CPI 1.3709 -- Total Cycles 128388 ---- Thread 22 ---- PC 5: Stalled ----- 89790 in-flight CPI 1.4296 -- Total Cycles 128388 ---- Thread 23 ---- PC 5: Stalled ----- 93718 in-flight CPI 1.3697 -- Total Cycles 128388 ---- Thread 24 ---- PC 5: Stalled ----- 94218 in-flight CPI 1.3624 -- Total Cycles 128388 ---- Thread 25 ---- PC 5: Stalled ----- 97109 in-flight CPI 1.3218 -- Total Cycles 128388 ---- Thread 26 ---- PC 5: Stalled ----- 92095 in-flight CPI 1.3938 -- Total Cycles 128388 ---- Thread 27 ---- PC 5: Stalled ----- 90741 in-flight CPI 1.4146 -- Total Cycles 128388 ---- Thread 28 ---- PC 5: Stalled ----- 91058 in-flight CPI 1.4097 -- Total Cycles 128388 ---- Thread 29 ---- PC 5: Stalled ----- 87344 in-flight CPI 1.4696 -- Total Cycles 128388 ---- Thread 30 ---- PC 5: Stalled ----- 91816 in-flight CPI 1.3980 -- Total Cycles 128388 ---- Thread 31 ---- PC 5: Stalled ----- 92434 in-flight CPI 1.3887 -- Total Cycles 128388 Total CPI 0.0417 , IPC 23.9758 -- Total Cycles 128388 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7860 (4.183544%) FPSUB: 0 (0.000000%) FPMUL: 32089 
(17.079610%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 61369 (32.664108%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5927 (3.154690%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) 
brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72381 (38.525327%) DIV: 7975 (4.244753%) FPUN: 0 (0.000000%) FPRSUB: 278 (0.147968%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3373824 total) ADD%: 7.341 (247661) SUB%: 0.000 (0) MUL%: 0.006 (216) BITOR%: 1.523 (51397) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.550 (18556) FPSUB%: 0.000 (0) FPMUL%: 4.769 (160899) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (648) FPMAX%: 0.019 (648) LOAD%: 5.131 (173109) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (248) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (616) FPINV%: 0.000 (0) FPCONV%: 0.020 (680) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35890) FPLE%: 0.450 (15198) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (648) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.809 (94756) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.740 (24950) CMPU%: 0.000 (0) RSUB%: 0.006 (216) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.686 (529227) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39574) ORI%: 1.567 (52859) XORI%: 0.000 (0) MULI%: 3.211 (108318) LW%: 1.134 (38248) LWI%: 13.509 (455759) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9728) SWI%: 4.082 (137730) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 
0.000 (0) beqid%: 1.403 (47332) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10450) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1900) bned%: 0.000 (0) bneid%: 13.814 (466046) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (24095) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4112) DIV%: 0.013 (432) FPUN%: 1.478 (49855) FPRSUB%: 4.202 (141782) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.954 (99654) FPGE%: 1.027 (34657) SYNC%: 0.000 (0) NOP%: 8.761 (295567) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 19 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 420 LOAD 39481 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1841 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49306 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 11127 XORI 0 MULI 9871 LW 0 LWI 144185 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 74 DIV 22 FPUN 0 FPRSUB 59 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9760 --Total thread-cycles: 4108416 --total thread-cycles issued: 3078257 (74.925640%) --iCache conflicts: 113181 (2.754857%) --thread*cycles of FU dependence: 256504 (6.243379%) --thread*cycles of data dependence: 187879 (4.573028%) --iCache cycles*banks: 
4108416 (82.120603% used) Issue breakdown: --thread*cycles of issue worked: 3078257 (74.925640%) --thread*cycles of issue failed: 734592 (17.880176%) --thread*cycles of issue NOP/other: 295567 (7.194184%) Number of thread-cycles not ready: 187879 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3373824 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 9 3: 9 4: 6 5: 7 6: 9 7: 8 8: 7 9: 8 10: 8 11: 8 12: 8 13: 9 14: 8 15: 8 16: 9 17: 7 18: 7 19: 7 20: 8 21: 8 22: 8 23: 7 24: 7 25: 8 26: 7 27: 8 28: 7 29: 7 30: 8 31: 8 <=== Core 61 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103151 in-flight CPI 1.2353 -- Total Cycles 127445 ---- Thread 01 ---- PC 5: Stalled ----- 98382 in-flight CPI 1.2952 -- Total Cycles 127445 ---- Thread 02 ---- PC 5: Stalled ----- 93752 in-flight CPI 1.3591 -- Total Cycles 127445 ---- Thread 03 ---- PC 5: Stalled ----- 102307 in-flight CPI 1.2455 -- Total Cycles 127445 ---- Thread 04 ---- PC 5: Stalled ----- 97806 in-flight CPI 1.3028 -- Total Cycles 127445 ---- Thread 05 ---- PC 5: Stalled ----- 104160 in-flight CPI 1.2233 -- Total Cycles 127445 ---- Thread 06 ---- PC 5: Stalled ----- 96316 in-flight CPI 1.3229 -- Total Cycles 127445 ---- Thread 07 ---- PC 5: Stalled ----- 94794 in-flight CPI 1.3442 -- Total Cycles 127445 ---- Thread 08 ---- PC 5: Stalled ----- 93987 in-flight CPI 1.3557 -- Total Cycles 127445 ---- Thread 09 ---- PC 5: Stalled ----- 94540 in-flight CPI 1.3478 -- Total Cycles 127445 ---- Thread 10 ---- PC 5: Stalled ----- 99657 in-flight CPI 1.2786 -- Total Cycles 127445 ---- Thread 11 ---- PC 5: Stalled ----- 95371 in-flight CPI 1.3361 -- Total Cycles 127445 ---- Thread 12 ---- PC 5: Stalled ----- 98067 in-flight CPI 1.2994 -- Total Cycles 127445 ---- Thread 13 ---- PC 5: Stalled ----- 97952 in-flight CPI 1.3008 -- Total Cycles 127445 ---- Thread 14 ---- PC 5: Stalled ----- 92623 in-flight CPI 1.3757 -- Total Cycles 127445 ---- Thread 15 ---- PC 5: Stalled ----- 100468 
in-flight CPI 1.2683 -- Total Cycles 127445 ---- Thread 16 ---- PC 5: Stalled ----- 95684 in-flight CPI 1.3316 -- Total Cycles 127445 ---- Thread 17 ---- PC 5: Stalled ----- 90480 in-flight CPI 1.4083 -- Total Cycles 127445 ---- Thread 18 ---- PC 5: Stalled ----- 88859 in-flight CPI 1.4341 -- Total Cycles 127445 ---- Thread 19 ---- PC 5: Stalled ----- 100649 in-flight CPI 1.2660 -- Total Cycles 127445 ---- Thread 20 ---- PC 5: Stalled ----- 91076 in-flight CPI 1.3991 -- Total Cycles 127445 ---- Thread 21 ---- PC 5: Stalled ----- 90070 in-flight CPI 1.4147 -- Total Cycles 127445 ---- Thread 22 ---- PC 5: Stalled ----- 92942 in-flight CPI 1.3710 -- Total Cycles 127445 ---- Thread 23 ---- PC 5: Stalled ----- 93170 in-flight CPI 1.3676 -- Total Cycles 127445 ---- Thread 24 ---- PC 5: Stalled ----- 96201 in-flight CPI 1.3245 -- Total Cycles 127445 ---- Thread 25 ---- PC 5: Stalled ----- 86107 in-flight CPI 1.4798 -- Total Cycles 127445 ---- Thread 26 ---- PC 5: Stalled ----- 92927 in-flight CPI 1.3712 -- Total Cycles 127445 ---- Thread 27 ---- PC 5: Stalled ----- 94865 in-flight CPI 1.3432 -- Total Cycles 127445 ---- Thread 28 ---- PC 5: Stalled ----- 86564 in-flight CPI 1.4720 -- Total Cycles 127445 ---- Thread 29 ---- PC 5: Stalled ----- 95252 in-flight CPI 1.3377 -- Total Cycles 127445 ---- Thread 30 ---- PC 5: Stalled ----- 84590 in-flight CPI 1.5063 -- Total Cycles 127445 ---- Thread 31 ---- PC 5: Stalled ----- 91866 in-flight CPI 1.3870 -- Total Cycles 127445 Total CPI 0.0420 , IPC 23.8157 -- Total Cycles 127445 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7393 (3.712986%) FPSUB: 0 (0.000000%) FPMUL: 30967 (15.552553%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 79533 
(39.943851%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5575 (2.799932%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67898 (34.100406%) DIV: 
7482 (3.757684%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.132589%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3326298 total) ADD%: 7.418 (246735) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.538 (51160) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.528 (17568) FPSUB%: 0.000 (0) FPMUL%: 4.706 (156532) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.111 (170016) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (583) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.051 (34964) FPLE%: 0.457 (15200) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.825 (93953) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.736 (24484) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.703 (522327) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39115) ORI%: 1.562 (51959) XORI%: 0.000 (0) MULI%: 3.220 (107098) LW%: 1.140 (37906) LWI%: 13.509 (449345) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9620) SWI%: 4.078 (135645) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.411 (46946) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) 
bgtid%: 0.311 (10349) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1865) bned%: 0.000 (0) bneid%: 13.820 (459703) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.728 (24227) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.116 (3856) DIV%: 0.012 (406) FPUN%: 1.496 (49748) FPRSUB%: 4.153 (138136) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (76) FPGT%: 2.945 (97969) FPGE%: 1.039 (34548) SYNC%: 0.000 (0) NOP%: 8.750 (291054) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 29 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 391 LOAD 38367 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1525 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48653 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 6 ORI 10556 XORI 0 MULI 9066 LW 0 LWI 141819 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 63 DIV 39 FPUN 0 FPRSUB 59 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8159 --Total thread-cycles: 4078240 --total thread-cycles issued: 3035244 (74.425340%) --iCache conflicts: 109780 (2.691847%) --thread*cycles of FU dependence: 250623 (6.145372%) --thread*cycles of data dependence: 199112 (4.882302%) --iCache cycles*banks: 4078240 (81.562880% used) Issue breakdown: --thread*cycles of issue worked: 3035244 
(74.425340%) --thread*cycles of issue failed: 751942 (18.437905%) --thread*cycles of issue NOP/other: 291054 (7.136755%) Number of thread-cycles not ready: 199112 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3326298 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 8 5: 9 6: 8 7: 7 8: 8 9: 7 10: 7 11: 7 12: 7 13: 9 14: 7 15: 8 16: 9 17: 6 18: 5 19: 8 20: 6 21: 8 22: 7 23: 8 24: 8 25: 6 26: 7 27: 8 28: 6 29: 8 30: 7 31: 8 <=== Core 62 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99332 in-flight CPI 1.2960 -- Total Cycles 128760 ---- Thread 01 ---- PC 5: Stalled ----- 95595 in-flight CPI 1.3467 -- Total Cycles 128760 ---- Thread 02 ---- PC 5: Stalled ----- 101089 in-flight CPI 1.2735 -- Total Cycles 128760 ---- Thread 03 ---- PC 5: Stalled ----- 95250 in-flight CPI 1.3515 -- Total Cycles 128760 ---- Thread 04 ---- PC 5: Stalled ----- 96757 in-flight CPI 1.3305 -- Total Cycles 128760 ---- Thread 05 ---- PC 5: Stalled ----- 100229 in-flight CPI 1.2843 -- Total Cycles 128760 ---- Thread 06 ---- PC 5: Stalled ----- 102477 in-flight CPI 1.2562 -- Total Cycles 128760 ---- Thread 07 ---- PC 5: Stalled ----- 102012 in-flight CPI 1.2620 -- Total Cycles 128760 ---- Thread 08 ---- PC 5: Stalled ----- 100415 in-flight CPI 1.2820 -- Total Cycles 128760 ---- Thread 09 ---- PC 5: Stalled ----- 103570 in-flight CPI 1.2430 -- Total Cycles 128760 ---- Thread 10 ---- PC 5: Stalled ----- 102955 in-flight CPI 1.2504 -- Total Cycles 128760 ---- Thread 11 ---- PC 5: Stalled ----- 101431 in-flight CPI 1.2692 -- Total Cycles 128760 ---- Thread 12 ---- PC 5: Stalled ----- 93154 in-flight CPI 1.3819 -- Total Cycles 128760 ---- Thread 13 ---- PC 5: Stalled ----- 100360 in-flight CPI 1.2827 -- Total Cycles 128760 ---- Thread 14 ---- PC 5: Stalled ----- 92921 in-flight CPI 1.3855 -- Total Cycles 128760 ---- Thread 15 ---- PC 5: Stalled ----- 94601 in-flight CPI 1.3609 -- Total Cycles 128760 ---- Thread 16 ---- PC 5: Stalled 
----- 96153 in-flight CPI 1.3388 -- Total Cycles 128760 ---- Thread 17 ---- PC 5: Stalled ----- 90742 in-flight CPI 1.4187 -- Total Cycles 128760 ---- Thread 18 ---- PC 5: Stalled ----- 94251 in-flight CPI 1.3659 -- Total Cycles 128760 ---- Thread 19 ---- PC 5: Stalled ----- 96282 in-flight CPI 1.3371 -- Total Cycles 128760 ---- Thread 20 ---- PC 5: Stalled ----- 90146 in-flight CPI 1.4281 -- Total Cycles 128760 ---- Thread 21 ---- PC 5: Stalled ----- 97349 in-flight CPI 1.3224 -- Total Cycles 128760 ---- Thread 22 ---- PC 5: Stalled ----- 96392 in-flight CPI 1.3356 -- Total Cycles 128760 ---- Thread 23 ---- PC 5: Stalled ----- 93539 in-flight CPI 1.3763 -- Total Cycles 128760 ---- Thread 24 ---- PC 5: Stalled ----- 92359 in-flight CPI 1.3938 -- Total Cycles 128760 ---- Thread 25 ---- PC 5: Stalled ----- 88430 in-flight CPI 1.4558 -- Total Cycles 128760 ---- Thread 26 ---- PC 5: Stalled ----- 91521 in-flight CPI 1.4066 -- Total Cycles 128760 ---- Thread 27 ---- PC 5: Stalled ----- 94610 in-flight CPI 1.3607 -- Total Cycles 128760 ---- Thread 28 ---- PC 5: Stalled ----- 92932 in-flight CPI 1.3852 -- Total Cycles 128760 ---- Thread 29 ---- PC 5: Stalled ----- 85759 in-flight CPI 1.5011 -- Total Cycles 128760 ---- Thread 30 ---- PC 5: Stalled ----- 84249 in-flight CPI 1.5280 -- Total Cycles 128760 ---- Thread 31 ---- PC 5: Stalled ----- 85687 in-flight CPI 1.5024 -- Total Cycles 128760 Total CPI 0.0422 , IPC 23.7118 -- Total Cycles 128760 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7874 (4.125667%) FPSUB: 0 (0.000000%) FPMUL: 31904 (16.716443%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 64148 (33.611033%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) 
BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5882 (3.081937%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72883 (38.187829%) DIV: 7887 (4.132478%) FPUN: 0 (0.000000%) FPRSUB: 276 (0.144613%) FPSQRT: 0 (0.000000%) 
FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3345786 total) ADD%: 7.470 (249932) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.531 (51223) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.556 (18595) FPSUB%: 0.000 (0) FPMUL%: 4.775 (159767) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.136 (171834) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (612) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35576) FPLE%: 0.453 (15164) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.797 (93579) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (25031) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.654 (523737) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39209) ORI%: 1.576 (52714) XORI%: 0.000 (0) MULI%: 3.196 (106932) LW%: 1.129 (37774) LWI%: 13.459 (450311) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9586) SWI%: 4.069 (136128) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.398 (46767) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10391) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 
(1921) bned%: 0.000 (0) bneid%: 13.786 (461259) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (24006) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4143) DIV%: 0.013 (428) FPUN%: 1.482 (49574) FPRSUB%: 4.209 (140815) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (78) FPGT%: 2.939 (98325) FPGE%: 1.028 (34410) SYNC%: 0.000 (0) NOP%: 8.745 (292595) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 412 LOAD 39251 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1504 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 14 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48787 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11157 XORI 0 MULI 9418 LW 0 LWI 142460 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 76 DIV 41 FPUN 0 FPRSUB 58 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7121 --Total thread-cycles: 4120320 --total thread-cycles issued: 3053191 (74.100822%) --iCache conflicts: 110288 (2.676685%) --thread*cycles of FU dependence: 253258 (6.146561%) --thread*cycles of data dependence: 190854 (4.632019%) --iCache cycles*banks: 4120320 (81.202868% used) Issue breakdown: --thread*cycles of issue worked: 3053191 (74.100822%) --thread*cycles of issue failed: 774534 (18.797909%) --thread*cycles of 
issue NOP/other: 292595 (7.101269%) Number of thread-cycles not ready: 190854 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3345786 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 7 3: 8 4: 8 5: 10 6: 8 7: 7 8: 8 9: 9 10: 9 11: 8 12: 8 13: 8 14: 7 15: 6 16: 8 17: 7 18: 7 19: 8 20: 7 21: 9 22: 7 23: 7 24: 8 25: 7 26: 7 27: 7 28: 8 29: 7 30: 7 31: 7 <=== Core 63 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97793 in-flight CPI 1.2864 -- Total Cycles 125822 ---- Thread 01 ---- PC 5: Stalled ----- 100368 in-flight CPI 1.2534 -- Total Cycles 125822 ---- Thread 02 ---- PC 5: Stalled ----- 98326 in-flight CPI 1.2794 -- Total Cycles 125822 ---- Thread 03 ---- PC 5: Stalled ----- 100655 in-flight CPI 1.2498 -- Total Cycles 125822 ---- Thread 04 ---- PC 5: Stalled ----- 95115 in-flight CPI 1.3225 -- Total Cycles 125822 ---- Thread 05 ---- PC 5: Stalled ----- 98750 in-flight CPI 1.2739 -- Total Cycles 125822 ---- Thread 06 ---- PC 5: Stalled ----- 99570 in-flight CPI 1.2634 -- Total Cycles 125822 ---- Thread 07 ---- PC 5: Stalled ----- 99310 in-flight CPI 1.2668 -- Total Cycles 125822 ---- Thread 08 ---- PC 5: Stalled ----- 94709 in-flight CPI 1.3283 -- Total Cycles 125822 ---- Thread 09 ---- PC 5: Stalled ----- 90528 in-flight CPI 1.3897 -- Total Cycles 125822 ---- Thread 10 ---- PC 5: Stalled ----- 93774 in-flight CPI 1.3415 -- Total Cycles 125822 ---- Thread 11 ---- PC 5: Stalled ----- 100529 in-flight CPI 1.2514 -- Total Cycles 125822 ---- Thread 12 ---- PC 5: Stalled ----- 98582 in-flight CPI 1.2761 -- Total Cycles 125822 ---- Thread 13 ---- PC 5: Stalled ----- 97445 in-flight CPI 1.2909 -- Total Cycles 125822 ---- Thread 14 ---- PC 5: Stalled ----- 95538 in-flight CPI 1.3168 -- Total Cycles 125822 ---- Thread 15 ---- PC 5: Stalled ----- 95584 in-flight CPI 1.3161 -- Total Cycles 125822 ---- Thread 16 ---- PC 5: Stalled ----- 97555 in-flight CPI 1.2895 -- Total Cycles 125822 ---- Thread 17 ---- PC 5: Stalled 
----- 94328 in-flight CPI 1.3336 -- Total Cycles 125822 ---- Thread 18 ---- PC 5: Stalled ----- 92700 in-flight CPI 1.3571 -- Total Cycles 125822 ---- Thread 19 ---- PC 5: Stalled ----- 97506 in-flight CPI 1.2901 -- Total Cycles 125822 ---- Thread 20 ---- PC 5: Stalled ----- 96941 in-flight CPI 1.2977 -- Total Cycles 125822 ---- Thread 21 ---- PC 5: Stalled ----- 87411 in-flight CPI 1.4392 -- Total Cycles 125822 ---- Thread 22 ---- PC 5: Stalled ----- 87501 in-flight CPI 1.4377 -- Total Cycles 125822 ---- Thread 23 ---- PC 5: Stalled ----- 90597 in-flight CPI 1.3885 -- Total Cycles 125822 ---- Thread 24 ---- PC 5: Stalled ----- 88428 in-flight CPI 1.4226 -- Total Cycles 125822 ---- Thread 25 ---- PC 5: Stalled ----- 93108 in-flight CPI 1.3511 -- Total Cycles 125822 ---- Thread 26 ---- PC 5: Stalled ----- 93340 in-flight CPI 1.3477 -- Total Cycles 125822 ---- Thread 27 ---- PC 5: Stalled ----- 95316 in-flight CPI 1.3197 -- Total Cycles 125822 ---- Thread 28 ---- PC 5: Stalled ----- 85217 in-flight CPI 1.4763 -- Total Cycles 125822 ---- Thread 29 ---- PC 5: Stalled ----- 88638 in-flight CPI 1.4192 -- Total Cycles 125822 ---- Thread 30 ---- PC 5: Stalled ----- 84917 in-flight CPI 1.4815 -- Total Cycles 125822 ---- Thread 31 ---- PC 5: Stalled ----- 90761 in-flight CPI 1.3861 -- Total Cycles 125822 Total CPI 0.0416 , IPC 24.0132 -- Total Cycles 125822 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8319 (4.094379%) FPSUB: 0 (0.000000%) FPMUL: 32642 (16.065479%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73260 (36.056521%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 
(0.000000%) FPINVSQRT: 5632 (2.771913%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75617 (37.216570%) DIV: 7448 (3.665697%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.129441%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 
(0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3311542 total) ADD%: 7.366 (243912) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.515 (50163) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.588 (19482) FPSUB%: 0.000 (0) FPMUL%: 4.878 (161549) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.177 (171430) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (585) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.078 (35691) FPLE%: 0.446 (14780) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.781 (92097) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.756 (25041) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.630 (517591) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.165 (38592) ORI%: 1.599 (52959) XORI%: 0.000 (0) MULI%: 3.179 (105264) LW%: 1.122 (37162) LWI%: 13.429 (444706) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9391) SWI%: 4.042 (133866) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.391 (46065) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10239) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.067 (2231) bned%: 0.000 (0) bneid%: 13.758 (455601) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 
0.000 (0) brald%: 0.000 (0) brid%: 0.712 (23576) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4312) DIV%: 0.012 (404) FPUN%: 1.466 (48531) FPRSUB%: 4.291 (142106) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.932 (97110) FPGE%: 1.019 (33751) SYNC%: 0.000 (0) NOP%: 8.760 (290096) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 40160 INTCONV 0 ATOMIC_INC 28 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 8 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1310 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 13 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 47967 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 26 ORI 11892 XORI 0 MULI 9071 LW 0 LWI 140561 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 12 FPUN 0 FPRSUB 56 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.0135 --Total thread-cycles: 4026304 --total thread-cycles issued: 3021446 (75.042669%) --iCache conflicts: 111834 (2.777585%) --thread*cycles of FU dependence: 251614 (6.249255%) --thread*cycles of data dependence: 203181 (5.046340%) --iCache cycles*banks: 4026304 (82.248484% used) Issue breakdown: --thread*cycles of issue worked: 3021446 (75.042669%) --thread*cycles of issue failed: 714762 (17.752311%) --thread*cycles of issue NOP/other: 290096 (7.205020%) Number of thread-cycles not ready: 203181 Number of 
thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3311542 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 7 2: 8 3: 8 4: 9 5: 8 6: 7 7: 7 8: 6 9: 6 10: 7 11: 7 12: 7 13: 9 14: 7 15: 8 16: 8 17: 7 18: 7 19: 8 20: 8 21: 7 22: 7 23: 8 24: 7 25: 7 26: 8 27: 9 28: 6 29: 7 30: 6 31: 7 <=== Core 64 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99182 in-flight CPI 1.3059 -- Total Cycles 129545 ---- Thread 01 ---- PC 5: Stalled ----- 102803 in-flight CPI 1.2599 -- Total Cycles 129545 ---- Thread 02 ---- PC 5: Stalled ----- 94865 in-flight CPI 1.3653 -- Total Cycles 129545 ---- Thread 03 ---- PC 5: Stalled ----- 96130 in-flight CPI 1.3473 -- Total Cycles 129545 ---- Thread 04 ---- PC 5: Stalled ----- 97885 in-flight CPI 1.3232 -- Total Cycles 129545 ---- Thread 05 ---- PC 5: Stalled ----- 98626 in-flight CPI 1.3132 -- Total Cycles 129545 ---- Thread 06 ---- PC 5: Stalled ----- 94492 in-flight CPI 1.3707 -- Total Cycles 129545 ---- Thread 07 ---- PC 5: Stalled ----- 94200 in-flight CPI 1.3750 -- Total Cycles 129545 ---- Thread 08 ---- PC 5: Stalled ----- 92073 in-flight CPI 1.4067 -- Total Cycles 129545 ---- Thread 09 ---- PC 5: Stalled ----- 98876 in-flight CPI 1.3099 -- Total Cycles 129545 ---- Thread 10 ---- PC 5: Stalled ----- 97495 in-flight CPI 1.3285 -- Total Cycles 129545 ---- Thread 11 ---- PC 5: Stalled ----- 99688 in-flight CPI 1.2993 -- Total Cycles 129545 ---- Thread 12 ---- PC 5: Stalled ----- 97710 in-flight CPI 1.3256 -- Total Cycles 129545 ---- Thread 13 ---- PC 5: Stalled ----- 101536 in-flight CPI 1.2756 -- Total Cycles 129545 ---- Thread 14 ---- PC 5: Stalled ----- 97142 in-flight CPI 1.3333 -- Total Cycles 129545 ---- Thread 15 ---- PC 5: Stalled ----- 96111 in-flight CPI 1.3476 -- Total Cycles 129545 ---- Thread 16 ---- PC 5: Stalled ----- 96646 in-flight CPI 1.3402 -- Total Cycles 129545 ---- Thread 17 ---- PC 5: Stalled ----- 98296 in-flight CPI 1.3176 -- Total Cycles 129545 ---- Thread 18 ---- PC 5: Stalled 
----- 95953 in-flight CPI 1.3499 -- Total Cycles 129545 ---- Thread 19 ---- PC 5: Stalled ----- 93462 in-flight CPI 1.3858 -- Total Cycles 129545 ---- Thread 20 ---- PC 5: Stalled ----- 98672 in-flight CPI 1.3126 -- Total Cycles 129545 ---- Thread 21 ---- PC 5: Stalled ----- 89967 in-flight CPI 1.4396 -- Total Cycles 129545 ---- Thread 22 ---- PC 5: Stalled ----- 95056 in-flight CPI 1.3626 -- Total Cycles 129545 ---- Thread 23 ---- PC 5: Stalled ----- 99588 in-flight CPI 1.3006 -- Total Cycles 129545 ---- Thread 24 ---- PC 5: Stalled ----- 92271 in-flight CPI 1.4037 -- Total Cycles 129545 ---- Thread 25 ---- PC 5: Stalled ----- 90598 in-flight CPI 1.4297 -- Total Cycles 129545 ---- Thread 26 ---- PC 5: Stalled ----- 91134 in-flight CPI 1.4212 -- Total Cycles 129545 ---- Thread 27 ---- PC 5: Stalled ----- 86298 in-flight CPI 1.5009 -- Total Cycles 129545 ---- Thread 28 ---- PC 5: Stalled ----- 86608 in-flight CPI 1.4955 -- Total Cycles 129545 ---- Thread 29 ---- PC 5: Stalled ----- 85584 in-flight CPI 1.5134 -- Total Cycles 129545 ---- Thread 30 ---- PC 5: Stalled ----- 87539 in-flight CPI 1.4796 -- Total Cycles 129545 ---- Thread 31 ---- PC 5: Stalled ----- 87529 in-flight CPI 1.4797 -- Total Cycles 129545 Total CPI 0.0427 , IPC 23.4248 -- Total Cycles 129545 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7401 (4.112306%) FPSUB: 0 (0.000000%) FPMUL: 30887 (17.162114%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 60117 (33.403529%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5602 (3.112706%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 
(0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68177 (37.882004%) DIV: 7523 (4.180095%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.147245%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: 
(3325633 total) ADD%: 7.483 (248859) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.528 (50828) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.525 (17469) FPSUB%: 0.000 (0) FPMUL%: 4.702 (156377) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.115 (170114) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (586) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.054 (35047) FPLE%: 0.459 (15273) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.819 (93739) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.736 (24480) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.703 (522233) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39123) ORI%: 1.539 (51171) XORI%: 0.000 (0) MULI%: 3.218 (107026) LW%: 1.137 (37822) LWI%: 13.515 (449448) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9616) SWI%: 4.073 (135437) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.408 (46823) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10312) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1749) bned%: 0.000 (0) bneid%: 13.821 (459645) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23907) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 
0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.116 (3872) DIV%: 0.012 (408) FPUN%: 1.483 (49323) FPRSUB%: 4.159 (138300) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (68) FPGT%: 2.956 (98302) FPGE%: 1.024 (34050) SYNC%: 0.000 (0) NOP%: 8.750 (291006) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 30 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 5 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 39351 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1917 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48854 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 10435 XORI 0 MULI 9604 LW 0 LWI 142101 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 65 DIV 25 FPUN 0 FPRSUB 49 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4251 --Total thread-cycles: 4145440 --total thread-cycles issued: 3034627 (73.203978%) --iCache conflicts: 110973 (2.676990%) --thread*cycles of FU dependence: 252897 (6.100607%) --thread*cycles of data dependence: 179972 (4.341445%) --iCache cycles*banks: 4145440 (80.224656% used) Issue breakdown: --thread*cycles of issue worked: 3034627 (73.203978%) --thread*cycles of issue failed: 819807 (19.776115%) --thread*cycles of issue NOP/other: 291006 (7.019906%) Number of thread-cycles not ready: 179972 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3325633 SIMD fetches beyond 
the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 7 3: 8 4: 8 5: 8 6: 8 7: 7 8: 7 9: 9 10: 7 11: 7 12: 7 13: 7 14: 8 15: 7 16: 7 17: 8 18: 7 19: 7 20: 8 21: 8 22: 8 23: 8 24: 7 25: 6 26: 7 27: 7 28: 6 29: 7 30: 6 31: 7 <=== Core 65 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99878 in-flight CPI 1.2906 -- Total Cycles 128915 ---- Thread 01 ---- PC 5: Stalled ----- 99251 in-flight CPI 1.2986 -- Total Cycles 128915 ---- Thread 02 ---- PC 5: Stalled ----- 93266 in-flight CPI 1.3821 -- Total Cycles 128915 ---- Thread 03 ---- PC 5: Stalled ----- 101085 in-flight CPI 1.2750 -- Total Cycles 128915 ---- Thread 04 ---- PC 5: Stalled ----- 100503 in-flight CPI 1.2825 -- Total Cycles 128915 ---- Thread 05 ---- PC 5: Stalled ----- 94903 in-flight CPI 1.3582 -- Total Cycles 128915 ---- Thread 06 ---- PC 5: Stalled ----- 93865 in-flight CPI 1.3732 -- Total Cycles 128915 ---- Thread 07 ---- PC 5: Stalled ----- 97266 in-flight CPI 1.3251 -- Total Cycles 128915 ---- Thread 08 ---- PC 5: Stalled ----- 97599 in-flight CPI 1.3206 -- Total Cycles 128915 ---- Thread 09 ---- PC 5: Stalled ----- 98691 in-flight CPI 1.3060 -- Total Cycles 128915 ---- Thread 10 ---- PC 5: Stalled ----- 97638 in-flight CPI 1.3201 -- Total Cycles 128915 ---- Thread 11 ---- PC 5: Stalled ----- 103452 in-flight CPI 1.2459 -- Total Cycles 128915 ---- Thread 12 ---- PC 5: Stalled ----- 99923 in-flight CPI 1.2899 -- Total Cycles 128915 ---- Thread 13 ---- PC 5: Stalled ----- 91933 in-flight CPI 1.4020 -- Total Cycles 128915 ---- Thread 14 ---- PC 5: Stalled ----- 94525 in-flight CPI 1.3636 -- Total Cycles 128915 ---- Thread 15 ---- PC 5: Stalled ----- 99798 in-flight CPI 1.2915 -- Total Cycles 128915 ---- Thread 16 ---- PC 5: Stalled ----- 98275 in-flight CPI 1.3115 -- Total Cycles 128915 ---- Thread 17 ---- PC 5: Stalled ----- 95847 in-flight CPI 1.3448 -- Total Cycles 128915 ---- Thread 18 ---- PC 5: Stalled ----- 85243 in-flight CPI 1.5121 -- Total Cycles 128915 ---- Thread 19 ---- PC 5: Stalled ----- 
91133 in-flight CPI 1.4143 -- Total Cycles 128915 ---- Thread 20 ---- PC 5: Stalled ----- 93520 in-flight CPI 1.3782 -- Total Cycles 128915 ---- Thread 21 ---- PC 5: Stalled ----- 91642 in-flight CPI 1.4064 -- Total Cycles 128915 ---- Thread 22 ---- PC 5: Stalled ----- 95028 in-flight CPI 1.3564 -- Total Cycles 128915 ---- Thread 23 ---- PC 5: Stalled ----- 95392 in-flight CPI 1.3512 -- Total Cycles 128915 ---- Thread 24 ---- PC 5: Stalled ----- 90611 in-flight CPI 1.4224 -- Total Cycles 128915 ---- Thread 25 ---- PC 5: Stalled ----- 98098 in-flight CPI 1.3138 -- Total Cycles 128915 ---- Thread 26 ---- PC 5: Stalled ----- 87939 in-flight CPI 1.4656 -- Total Cycles 128915 ---- Thread 27 ---- PC 5: Stalled ----- 85310 in-flight CPI 1.5110 -- Total Cycles 128915 ---- Thread 28 ---- PC 5: Stalled ----- 90642 in-flight CPI 1.4220 -- Total Cycles 128915 ---- Thread 29 ---- PC 5: Stalled ----- 92060 in-flight CPI 1.4000 -- Total Cycles 128915 ---- Thread 30 ---- PC 5: Stalled ----- 92767 in-flight CPI 1.3894 -- Total Cycles 128915 ---- Thread 31 ---- PC 5: Stalled ----- 84906 in-flight CPI 1.5180 -- Total Cycles 128915 Total CPI 0.0425 , IPC 23.5237 -- Total Cycles 128915 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7431 (3.616762%) FPSUB: 0 (0.000000%) FPMUL: 31059 (15.116811%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 85581 (41.653363%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5654 (2.751874%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 
(0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67868 (33.032220%) DIV: 7601 (3.699504%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.129466%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3323448 total) ADD%: 7.477 (248499) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.520 (50520) 
BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.529 (17573) FPSUB%: 0.000 (0) FPMUL%: 4.709 (156508) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.112 (169891) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (590) FPINV%: 0.000 (0) FPCONV%: 0.020 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.054 (35019) FPLE%: 0.458 (15215) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.821 (93746) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.733 (24360) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.694 (521581) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (38990) ORI%: 1.540 (51190) XORI%: 0.000 (0) MULI%: 3.222 (107092) LW%: 1.138 (37828) LWI%: 13.537 (449899) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9598) SWI%: 4.081 (135640) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.410 (46846) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10318) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1876) bned%: 0.000 (0) bneid%: 13.806 (458828) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (24043) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.116 (3852) DIV%: 0.012 
(412) FPUN%: 1.479 (49152) FPRSUB%: 4.153 (138038) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (78) FPGT%: 2.956 (98238) FPGE%: 1.021 (33937) SYNC%: 0.000 (0) NOP%: 8.751 (290841) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 12 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 4 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 39165 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2111 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48717 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 10541 XORI 0 MULI 9485 LW 0 LWI 142094 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 56 DIV 32 FPUN 0 FPRSUB 60 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5239 --Total thread-cycles: 4125280 --total thread-cycles issued: 3032607 (73.512755%) --iCache conflicts: 109705 (2.659335%) --thread*cycles of FU dependence: 252749 (6.126833%) --thread*cycles of data dependence: 205460 (4.980510%) --iCache cycles*banks: 4125280 (80.563744% used) Issue breakdown: --thread*cycles of issue worked: 3032607 (73.512755%) --thread*cycles of issue failed: 801832 (19.437032%) --thread*cycles of issue NOP/other: 290841 (7.050212%) Number of thread-cycles not ready: 205460 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3323448 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 8 2: 5 3: 9 4: 8 5: 5 6: 7 7: 8 8: 8 9: 9 
10: 7 11: 9 12: 8 13: 7 14: 7 15: 9 16: 9 17: 7 18: 5 19: 7 20: 7 21: 8 22: 7 23: 8 24: 8 25: 9 26: 8 27: 5 28: 7 29: 8 30: 8 31: 7 <=== Core 66 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94971 in-flight CPI 1.5605 -- Total Cycles 148231 ---- Thread 01 ---- PC 5: Stalled ----- 97271 in-flight CPI 1.5236 -- Total Cycles 148231 ---- Thread 02 ---- PC 5: Stalled ----- 102913 in-flight CPI 1.4401 -- Total Cycles 148231 ---- Thread 03 ---- PC 5: Stalled ----- 103958 in-flight CPI 1.4255 -- Total Cycles 148231 ---- Thread 04 ---- PC 5: Stalled ----- 103199 in-flight CPI 1.4361 -- Total Cycles 148231 ---- Thread 05 ---- PC 5: Stalled ----- 95749 in-flight CPI 1.5479 -- Total Cycles 148231 ---- Thread 06 ---- PC 5: Stalled ----- 102057 in-flight CPI 1.4522 -- Total Cycles 148231 ---- Thread 07 ---- PC 5: Stalled ----- 103988 in-flight CPI 1.4252 -- Total Cycles 148231 ---- Thread 08 ---- PC 5: Stalled ----- 95123 in-flight CPI 1.5581 -- Total Cycles 148231 ---- Thread 09 ---- PC 5: Stalled ----- 93361 in-flight CPI 1.5874 -- Total Cycles 148231 ---- Thread 10 ---- PC 5: Stalled ----- 101244 in-flight CPI 1.4638 -- Total Cycles 148231 ---- Thread 11 ---- PC 5: Stalled ----- 98833 in-flight CPI 1.4995 -- Total Cycles 148231 ---- Thread 12 ---- PC 5: Stalled ----- 98236 in-flight CPI 1.5086 -- Total Cycles 148231 ---- Thread 13 ---- PC 5: Stalled ----- 96109 in-flight CPI 1.5420 -- Total Cycles 148231 ---- Thread 14 ---- PC 5: Stalled ----- 92117 in-flight CPI 1.6089 -- Total Cycles 148231 ---- Thread 15 ---- PC 5: Stalled ----- 99888 in-flight CPI 1.4837 -- Total Cycles 148231 ---- Thread 16 ---- PC 5: Stalled ----- 98398 in-flight CPI 1.5061 -- Total Cycles 148231 ---- Thread 17 ---- PC 5: Stalled ----- 96530 in-flight CPI 1.5353 -- Total Cycles 148231 ---- Thread 18 ---- PC 5: Stalled ----- 97148 in-flight CPI 1.5256 -- Total Cycles 148231 ---- Thread 19 ---- PC 5: Stalled ----- 96237 in-flight CPI 1.5400 -- Total Cycles 148231 ---- Thread 20 ---- PC 5: Stalled ----- 
90838 in-flight CPI 1.6315 -- Total Cycles 148231 ---- Thread 21 ---- PC 5: Stalled ----- 94323 in-flight CPI 1.5712 -- Total Cycles 148231 ---- Thread 22 ---- PC 5: Stalled ----- 88794 in-flight CPI 1.6691 -- Total Cycles 148231 ---- Thread 23 ---- PC 5: Stalled ----- 106548 in-flight CPI 1.3910 -- Total Cycles 148231 ---- Thread 24 ---- PC 5: Stalled ----- 92829 in-flight CPI 1.5965 -- Total Cycles 148231 ---- Thread 25 ---- PC 5: Stalled ----- 94234 in-flight CPI 1.5727 -- Total Cycles 148231 ---- Thread 26 ---- PC 5: Stalled ----- 93413 in-flight CPI 1.5866 -- Total Cycles 148231 ---- Thread 27 ---- PC 5: Stalled ----- 93558 in-flight CPI 1.5840 -- Total Cycles 148231 ---- Thread 28 ---- PC 5: Stalled ----- 88054 in-flight CPI 1.6832 -- Total Cycles 148231 ---- Thread 29 ---- PC 5: Stalled ----- 92791 in-flight CPI 1.5972 -- Total Cycles 148231 ---- Thread 30 ---- PC 5: Stalled ----- 86934 in-flight CPI 1.7048 -- Total Cycles 148231 ---- Thread 31 ---- PC 5: Stalled ----- 90257 in-flight CPI 1.6420 -- Total Cycles 148231 Total CPI 0.0481 , IPC 20.7816 -- Total Cycles 148231 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7838 (3.988215%) FPSUB: 0 (0.000000%) FPMUL: 32281 (16.425566%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 70791 (36.020638%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5727 (2.914074%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) 
STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71909 (36.589511%) DIV: 7713 (3.924612%) FPUN: 0 (0.000000%) FPRSUB: 270 (0.137384%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3376250 total) ADD%: 7.441 (251222) SUB%: 0.000 (0) MUL%: 0.006 (209) BITOR%: 1.525 (51499) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.553 (18680) FPSUB%: 
0.000 (0) FPMUL%: 4.777 (161296) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (627) FPMAX%: 0.019 (627) LOAD%: 5.130 (173189) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (598) FPINV%: 0.000 (0) FPCONV%: 0.020 (659) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35976) FPLE%: 0.456 (15410) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (627) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.800 (94549) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.744 (25121) CMPU%: 0.000 (0) RSUB%: 0.006 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.676 (529263) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39560) ORI%: 1.561 (52707) XORI%: 0.000 (0) MULI%: 3.201 (108086) LW%: 1.130 (38154) LWI%: 13.472 (454835) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9696) SWI%: 4.059 (137058) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (47233) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10481) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1913) bned%: 0.000 (0) bneid%: 13.807 (466147) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (24175) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4090) DIV%: 0.012 (418) FPUN%: 1.477 (49884) FPRSUB%: 4.202 (141877) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) 
FPGT%: 2.949 (99578) FPGE%: 1.021 (34474) SYNC%: 0.000 (0) NOP%: 8.759 (295720) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 409 LOAD 40181 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1540 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49236 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11232 XORI 0 MULI 9538 LW 0 LWI 143829 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 94 DIV 31 FPUN 0 FPRSUB 55 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.7818 --Total thread-cycles: 4743392 --total thread-cycles issued: 3080530 (64.943610%) --iCache conflicts: 112573 (2.373259%) --thread*cycles of FU dependence: 256233 (5.401894%) --thread*cycles of data dependence: 196529 (4.143216%) --iCache cycles*banks: 4743392 (71.178642% used) Issue breakdown: --thread*cycles of issue worked: 3080530 (64.943610%) --thread*cycles of issue failed: 1367142 (28.822033%) --thread*cycles of issue NOP/other: 295720 (6.234357%) Number of thread-cycles not ready: 196529 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3376250 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 10 4: 8 5: 6 6: 8 7: 8 8: 6 9: 8 10: 9 11: 8 12: 8 13: 8 14: 7 15: 8 16: 8 17: 7 18: 7 19: 7 20: 8 21: 8 22: 7 23: 6 24: 8 
25: 7 26: 7 27: 8 28: 6 29: 7 30: 7 31: 7 <=== Core 67 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99472 in-flight CPI 1.2819 -- Total Cycles 127537 ---- Thread 01 ---- PC 5: Stalled ----- 97616 in-flight CPI 1.3063 -- Total Cycles 127537 ---- Thread 02 ---- PC 5: Stalled ----- 101523 in-flight CPI 1.2560 -- Total Cycles 127537 ---- Thread 03 ---- PC 5: Stalled ----- 99746 in-flight CPI 1.2784 -- Total Cycles 127537 ---- Thread 04 ---- PC 5: Stalled ----- 104875 in-flight CPI 1.2158 -- Total Cycles 127537 ---- Thread 05 ---- PC 5: Stalled ----- 98186 in-flight CPI 1.2988 -- Total Cycles 127537 ---- Thread 06 ---- PC 5: Stalled ----- 102291 in-flight CPI 1.2465 -- Total Cycles 127537 ---- Thread 07 ---- PC 5: Stalled ----- 99326 in-flight CPI 1.2837 -- Total Cycles 127537 ---- Thread 08 ---- PC 5: Stalled ----- 93375 in-flight CPI 1.3656 -- Total Cycles 127537 ---- Thread 09 ---- PC 5: Stalled ----- 94958 in-flight CPI 1.3428 -- Total Cycles 127537 ---- Thread 10 ---- PC 5: Stalled ----- 97243 in-flight CPI 1.3113 -- Total Cycles 127537 ---- Thread 11 ---- PC 5: Stalled ----- 89333 in-flight CPI 1.4274 -- Total Cycles 127537 ---- Thread 12 ---- PC 5: Stalled ----- 95105 in-flight CPI 1.3407 -- Total Cycles 127537 ---- Thread 13 ---- PC 5: Stalled ----- 101332 in-flight CPI 1.2584 -- Total Cycles 127537 ---- Thread 14 ---- PC 5: Stalled ----- 99326 in-flight CPI 1.2837 -- Total Cycles 127537 ---- Thread 15 ---- PC 5: Stalled ----- 96444 in-flight CPI 1.3222 -- Total Cycles 127537 ---- Thread 16 ---- PC 5: Stalled ----- 99899 in-flight CPI 1.2764 -- Total Cycles 127537 ---- Thread 17 ---- PC 5: Stalled ----- 94417 in-flight CPI 1.3506 -- Total Cycles 127537 ---- Thread 18 ---- PC 5: Stalled ----- 98170 in-flight CPI 1.2988 -- Total Cycles 127537 ---- Thread 19 ---- PC 5: Stalled ----- 93504 in-flight CPI 1.3637 -- Total Cycles 127537 ---- Thread 20 ---- PC 5: Stalled ----- 98632 in-flight CPI 1.2928 -- Total Cycles 127537 ---- Thread 21 ---- PC 5: Stalled ----- 93916 
in-flight CPI 1.3578 -- Total Cycles 127537 ---- Thread 22 ---- PC 5: Stalled ----- 96582 in-flight CPI 1.3202 -- Total Cycles 127537 ---- Thread 23 ---- PC 5: Stalled ----- 95469 in-flight CPI 1.3357 -- Total Cycles 127537 ---- Thread 24 ---- PC 5: Stalled ----- 86354 in-flight CPI 1.4768 -- Total Cycles 127537 ---- Thread 25 ---- PC 5: Stalled ----- 95211 in-flight CPI 1.3392 -- Total Cycles 127537 ---- Thread 26 ---- PC 5: Stalled ----- 93684 in-flight CPI 1.3611 -- Total Cycles 127537 ---- Thread 27 ---- PC 5: Stalled ----- 91057 in-flight CPI 1.4004 -- Total Cycles 127537 ---- Thread 28 ---- PC 5: Stalled ----- 86323 in-flight CPI 1.4772 -- Total Cycles 127537 ---- Thread 29 ---- PC 5: Stalled ----- 85782 in-flight CPI 1.4865 -- Total Cycles 127537 ---- Thread 30 ---- PC 5: Stalled ----- 89493 in-flight CPI 1.4248 -- Total Cycles 127537 ---- Thread 31 ---- PC 5: Stalled ----- 88835 in-flight CPI 1.4354 -- Total Cycles 127537 Total CPI 0.0417 , IPC 23.9778 -- Total Cycles 127537 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8091 (3.923841%) FPSUB: 0 (0.000000%) FPMUL: 32277 (15.653173%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 77196 (37.437258%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5915 (2.868560%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) 
MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74657 (36.205935%) DIV: 7792 (3.778837%) FPUN: 0 (0.000000%) FPRSUB: 273 (0.132395%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3350873 total) ADD%: 7.389 (247592) SUB%: 0.000 (0) MUL%: 0.006 (211) BITOR%: 1.525 (51096) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.565 (18941) FPSUB%: 0.000 (0) FPMUL%: 4.813 (161267) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (633) FPMAX%: 0.019 (633) 
LOAD%: 5.165 (173077) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (610) FPINV%: 0.000 (0) FPCONV%: 0.020 (665) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.070 (35847) FPLE%: 0.452 (15155) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (633) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.797 (93716) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.755 (25297) CMPU%: 0.000 (0) RSUB%: 0.006 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.657 (524649) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39425) ORI%: 1.564 (52400) XORI%: 0.000 (0) MULI%: 3.193 (107006) LW%: 1.129 (37824) LWI%: 13.471 (451400) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9562) SWI%: 4.070 (136373) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46876) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10338) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1861) bned%: 0.000 (0) bneid%: 13.782 (461802) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.710 (23779) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4250) DIV%: 0.013 (422) FPUN%: 1.471 (49284) FPRSUB%: 4.249 (142362) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (81) FPGT%: 2.936 (98366) FPGE%: 1.019 (34129) SYNC%: 0.000 (0) NOP%: 8.737 (292761) HALT%: 0.000 (0) 
PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 410 LOAD 40132 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1624 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 3 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48866 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11482 XORI 0 MULI 9021 LW 0 LWI 142697 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 95 DIV 30 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9780 --Total thread-cycles: 4081184 --total thread-cycles issued: 3058112 (74.931981%) --iCache conflicts: 111467 (2.731242%) --thread*cycles of FU dependence: 254503 (6.236009%) --thread*cycles of data dependence: 206201 (5.052480%) --iCache cycles*banks: 4081184 (82.106198% used) Issue breakdown: --thread*cycles of issue worked: 3058112 (74.931981%) --thread*cycles of issue failed: 730311 (17.894586%) --thread*cycles of issue NOP/other: 292761 (7.173433%) Number of thread-cycles not ready: 206201 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3350873 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 9 3: 7 4: 9 5: 6 6: 9 7: 9 8: 7 9: 8 10: 7 11: 6 12: 8 13: 8 14: 9 15: 7 16: 9 17: 6 18: 10 19: 7 20: 9 21: 7 22: 9 23: 7 24: 4 25: 8 26: 8 27: 7 28: 7 29: 6 30: 7 31: 7 <=== Core 68 ===> ---- Thread 00 ---- PC 5: Stalled ----- 
102075 in-flight CPI 1.3350 -- Total Cycles 136300 ---- Thread 01 ---- PC 5: Stalled ----- 98372 in-flight CPI 1.3853 -- Total Cycles 136300 ---- Thread 02 ---- PC 5: Stalled ----- 101585 in-flight CPI 1.3414 -- Total Cycles 136300 ---- Thread 03 ---- PC 5: Stalled ----- 95267 in-flight CPI 1.4305 -- Total Cycles 136300 ---- Thread 04 ---- PC 5: Stalled ----- 103393 in-flight CPI 1.3180 -- Total Cycles 136300 ---- Thread 05 ---- PC 5: Stalled ----- 93722 in-flight CPI 1.4540 -- Total Cycles 136300 ---- Thread 06 ---- PC 5: Stalled ----- 99820 in-flight CPI 1.3653 -- Total Cycles 136300 ---- Thread 07 ---- PC 5: Stalled ----- 100380 in-flight CPI 1.3575 -- Total Cycles 136300 ---- Thread 08 ---- PC 5: Stalled ----- 102222 in-flight CPI 1.3331 -- Total Cycles 136300 ---- Thread 09 ---- PC 5: Stalled ----- 98464 in-flight CPI 1.3840 -- Total Cycles 136300 ---- Thread 10 ---- PC 5: Stalled ----- 99129 in-flight CPI 1.3747 -- Total Cycles 136300 ---- Thread 11 ---- PC 5: Stalled ----- 93817 in-flight CPI 1.4525 -- Total Cycles 136300 ---- Thread 12 ---- PC 5: Stalled ----- 98328 in-flight CPI 1.3859 -- Total Cycles 136300 ---- Thread 13 ---- PC 5: Stalled ----- 99074 in-flight CPI 1.3755 -- Total Cycles 136300 ---- Thread 14 ---- PC 5: Stalled ----- 95032 in-flight CPI 1.4340 -- Total Cycles 136300 ---- Thread 15 ---- PC 5: Stalled ----- 91402 in-flight CPI 1.4910 -- Total Cycles 136300 ---- Thread 16 ---- PC 5: Stalled ----- 96886 in-flight CPI 1.4066 -- Total Cycles 136300 ---- Thread 17 ---- PC 5: Stalled ----- 93289 in-flight CPI 1.4608 -- Total Cycles 136300 ---- Thread 18 ---- PC 5: Stalled ----- 91794 in-flight CPI 1.4846 -- Total Cycles 136300 ---- Thread 19 ---- PC 5: Stalled ----- 91099 in-flight CPI 1.4959 -- Total Cycles 136300 ---- Thread 20 ---- PC 5: Stalled ----- 86571 in-flight CPI 1.5742 -- Total Cycles 136300 ---- Thread 21 ---- PC 5: Stalled ----- 98136 in-flight CPI 1.3886 -- Total Cycles 136300 ---- Thread 22 ---- PC 5: Stalled ----- 99800 
in-flight CPI 1.3656 -- Total Cycles 136300 ---- Thread 23 ---- PC 5: Stalled ----- 93173 in-flight CPI 1.4626 -- Total Cycles 136300 ---- Thread 24 ---- PC 5: Stalled ----- 91729 in-flight CPI 1.4857 -- Total Cycles 136300 ---- Thread 25 ---- PC 5: Stalled ----- 86928 in-flight CPI 1.5677 -- Total Cycles 136300 ---- Thread 26 ---- PC 5: Stalled ----- 93362 in-flight CPI 1.4596 -- Total Cycles 136300 ---- Thread 27 ---- PC 5: Stalled ----- 92729 in-flight CPI 1.4696 -- Total Cycles 136300 ---- Thread 28 ---- PC 5: Stalled ----- 93713 in-flight CPI 1.4541 -- Total Cycles 136300 ---- Thread 29 ---- PC 5: Stalled ----- 91251 in-flight CPI 1.4934 -- Total Cycles 136300 ---- Thread 30 ---- PC 5: Stalled ----- 88298 in-flight CPI 1.5434 -- Total Cycles 136300 ---- Thread 31 ---- PC 5: Stalled ----- 91825 in-flight CPI 1.4840 -- Total Cycles 136300 Total CPI 0.0446 , IPC 22.4008 -- Total Cycles 136300 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8184 (3.753336%) FPSUB: 0 (0.000000%) FPMUL: 32517 (14.912908%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 88915 (40.778093%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5640 (2.586610%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 
(0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74888 (34.345046%) DIV: 7633 (3.500637%) FPUN: 0 (0.000000%) FPRSUB: 269 (0.123368%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3345587 total) ADD%: 7.426 (248453) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.540 (51527) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.572 (19144) FPSUB%: 0.000 (0) FPMUL%: 4.830 (161589) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.157 (172535) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) 
BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (591) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.071 (35828) FPLE%: 0.456 (15253) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.782 (93067) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.757 (25322) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.660 (523933) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39254) ORI%: 1.580 (52852) XORI%: 0.000 (0) MULI%: 3.182 (106458) LW%: 1.123 (37558) LWI%: 13.410 (448647) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9530) SWI%: 4.049 (135454) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.390 (46506) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10329) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1825) bned%: 0.000 (0) bneid%: 13.788 (461281) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23933) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4265) DIV%: 0.012 (414) FPUN%: 1.483 (49624) FPRSUB%: 4.257 (142413) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.929 (97994) FPGE%: 1.027 (34371) SYNC%: 0.000 (0) NOP%: 8.737 (292301) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: 
ADD 0 SUB 0 MUL 41 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 40399 INTCONV 0 ATOMIC_INC 29 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1666 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48579 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 11638 XORI 0 MULI 8926 LW 0 LWI 142184 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 65 DIV 39 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.4011 --Total thread-cycles: 4361600 --total thread-cycles issued: 3053286 (70.003806%) --iCache conflicts: 111116 (2.547597%) --thread*cycles of FU dependence: 254073 (5.825225%) --thread*cycles of data dependence: 218046 (4.999220%) --iCache cycles*banks: 4361600 (76.706232% used) Issue breakdown: --thread*cycles of issue worked: 3053286 (70.003806%) --thread*cycles of issue failed: 1016013 (23.294502%) --thread*cycles of issue NOP/other: 292301 (6.701692%) Number of thread-cycles not ready: 218046 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3345587 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 9 3: 7 4: 9 5: 8 6: 6 7: 9 8: 9 9: 8 10: 9 11: 8 12: 9 13: 8 14: 7 15: 6 16: 6 17: 7 18: 7 19: 7 20: 6 21: 8 22: 5 23: 8 24: 6 25: 6 26: 8 27: 7 28: 8 29: 7 30: 7 31: 8 <=== Core 69 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100792 in-flight CPI 1.3005 -- Total Cycles 131092 ---- Thread 01 ---- PC 5: Stalled 
----- 104822 in-flight CPI 1.2504 -- Total Cycles 131092 ---- Thread 02 ---- PC 5: Stalled ----- 101786 in-flight CPI 1.2876 -- Total Cycles 131092 ---- Thread 03 ---- PC 5: Stalled ----- 101067 in-flight CPI 1.2968 -- Total Cycles 131092 ---- Thread 04 ---- PC 5: Stalled ----- 104449 in-flight CPI 1.2549 -- Total Cycles 131092 ---- Thread 05 ---- PC 5: Stalled ----- 97665 in-flight CPI 1.3420 -- Total Cycles 131092 ---- Thread 06 ---- PC 5: Stalled ----- 102251 in-flight CPI 1.2818 -- Total Cycles 131092 ---- Thread 07 ---- PC 5: Stalled ----- 98964 in-flight CPI 1.3244 -- Total Cycles 131092 ---- Thread 08 ---- PC 5: Stalled ----- 92465 in-flight CPI 1.4175 -- Total Cycles 131092 ---- Thread 09 ---- PC 5: Stalled ----- 101392 in-flight CPI 1.2927 -- Total Cycles 131092 ---- Thread 10 ---- PC 5: Stalled ----- 96636 in-flight CPI 1.3563 -- Total Cycles 131092 ---- Thread 11 ---- PC 5: Stalled ----- 93386 in-flight CPI 1.4035 -- Total Cycles 131092 ---- Thread 12 ---- PC 5: Stalled ----- 94746 in-flight CPI 1.3833 -- Total Cycles 131092 ---- Thread 13 ---- PC 5: Stalled ----- 101278 in-flight CPI 1.2942 -- Total Cycles 131092 ---- Thread 14 ---- PC 5: Stalled ----- 96590 in-flight CPI 1.3569 -- Total Cycles 131092 ---- Thread 15 ---- PC 5: Stalled ----- 96465 in-flight CPI 1.3587 -- Total Cycles 131092 ---- Thread 16 ---- PC 5: Stalled ----- 96047 in-flight CPI 1.3646 -- Total Cycles 131092 ---- Thread 17 ---- PC 5: Stalled ----- 93792 in-flight CPI 1.3975 -- Total Cycles 131092 ---- Thread 18 ---- PC 5: Stalled ----- 98515 in-flight CPI 1.3305 -- Total Cycles 131092 ---- Thread 19 ---- PC 5: Stalled ----- 92191 in-flight CPI 1.4217 -- Total Cycles 131092 ---- Thread 20 ---- PC 5: Stalled ----- 93668 in-flight CPI 1.3993 -- Total Cycles 131092 ---- Thread 21 ---- PC 5: Stalled ----- 90727 in-flight CPI 1.4446 -- Total Cycles 131092 ---- Thread 22 ---- PC 5: Stalled ----- 93690 in-flight CPI 1.3989 -- Total Cycles 131092 ---- Thread 23 ---- PC 5: Stalled ----- 92792 
in-flight CPI 1.4125 -- Total Cycles 131092 ---- Thread 24 ---- PC 5: Stalled ----- 95674 in-flight CPI 1.3700 -- Total Cycles 131092 ---- Thread 25 ---- PC 5: Stalled ----- 92583 in-flight CPI 1.4157 -- Total Cycles 131092 ---- Thread 26 ---- PC 5: Stalled ----- 95851 in-flight CPI 1.3674 -- Total Cycles 131092 ---- Thread 27 ---- PC 5: Stalled ----- 93944 in-flight CPI 1.3951 -- Total Cycles 131092 ---- Thread 28 ---- PC 5: Stalled ----- 87592 in-flight CPI 1.4963 -- Total Cycles 131092 ---- Thread 29 ---- PC 5: Stalled ----- 89135 in-flight CPI 1.4705 -- Total Cycles 131092 ---- Thread 30 ---- PC 5: Stalled ----- 91117 in-flight CPI 1.4385 -- Total Cycles 131092 ---- Thread 31 ---- PC 5: Stalled ----- 89006 in-flight CPI 1.4726 -- Total Cycles 131092 Total CPI 0.0427 , IPC 23.4311 -- Total Cycles 131092 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7916 (3.796898%) FPSUB: 0 (0.000000%) FPMUL: 32195 (15.442284%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 82646 (39.641031%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5567 (2.670203%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 
(0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72438 (34.744779%) DIV: 7459 (3.577698%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.127107%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3365905 total) ADD%: 7.432 (250165) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.540 (51833) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.554 (18650) FPSUB%: 0.000 (0) FPMUL%: 4.780 (160893) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.136 (172885) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 
0.000 (0) FPINVSQRT%: 0.017 (581) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35830) FPLE%: 0.455 (15308) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.798 (94187) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (25238) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.666 (527297) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39521) ORI%: 1.574 (52982) XORI%: 0.000 (0) MULI%: 3.197 (107614) LW%: 1.129 (37998) LWI%: 13.459 (453003) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9638) SWI%: 4.054 (136466) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.398 (47068) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10427) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1887) bned%: 0.000 (0) bneid%: 13.800 (464495) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.722 (24306) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4124) DIV%: 0.012 (404) FPUN%: 1.489 (50108) FPRSUB%: 4.219 (142017) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) FPGT%: 2.932 (98703) FPGE%: 1.034 (34800) SYNC%: 0.000 (0) NOP%: 8.741 (294221) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 6 FPSUB 0 FPMUL 0 
FPCMPLT 0 FPMIN 0 FPMAX 389 LOAD 39668 INTCONV 0 ATOMIC_INC 24 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 10 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1675 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48953 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11321 XORI 0 MULI 9231 LW 0 LWI 143171 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 22 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4313 --Total thread-cycles: 4194944 --total thread-cycles issued: 3071684 (73.223480%) --iCache conflicts: 111185 (2.650453%) --thread*cycles of FU dependence: 254657 (6.070570%) --thread*cycles of data dependence: 208486 (4.969935%) --iCache cycles*banks: 4194944 (80.237948% used) Issue breakdown: --thread*cycles of issue worked: 3071684 (73.223480%) --thread*cycles of issue failed: 829039 (19.762814%) --thread*cycles of issue NOP/other: 294221 (7.013705%) Number of thread-cycles not ready: 208486 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3365905 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 5 1: 8 2: 9 3: 8 4: 8 5: 7 6: 8 7: 8 8: 6 9: 8 10: 8 11: 8 12: 8 13: 7 14: 8 15: 8 16: 8 17: 6 18: 7 19: 7 20: 7 21: 7 22: 8 23: 6 24: 7 25: 7 26: 8 27: 8 28: 7 29: 6 30: 6 31: 7 <=== Core 70 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95709 in-flight CPI 1.3297 -- Total Cycles 127286 ---- Thread 01 ---- PC 5: Stalled ----- 98781 in-flight CPI 1.2883 -- Total Cycles 127286 ---- Thread 02 ---- PC 5: 
Stalled ----- 100707 in-flight CPI 1.2637 -- Total Cycles 127286 ---- Thread 03 ---- PC 5: Stalled ----- 99930 in-flight CPI 1.2735 -- Total Cycles 127286 ---- Thread 04 ---- PC 5: Stalled ----- 97409 in-flight CPI 1.3065 -- Total Cycles 127286 ---- Thread 05 ---- PC 5: Stalled ----- 97615 in-flight CPI 1.3037 -- Total Cycles 127286 ---- Thread 06 ---- PC 5: Stalled ----- 98697 in-flight CPI 1.2894 -- Total Cycles 127286 ---- Thread 07 ---- PC 5: Stalled ----- 91591 in-flight CPI 1.3895 -- Total Cycles 127286 ---- Thread 08 ---- PC 5: Stalled ----- 98535 in-flight CPI 1.2916 -- Total Cycles 127286 ---- Thread 09 ---- PC 5: Stalled ----- 99629 in-flight CPI 1.2774 -- Total Cycles 127286 ---- Thread 10 ---- PC 5: Stalled ----- 88890 in-flight CPI 1.4317 -- Total Cycles 127286 ---- Thread 11 ---- PC 5: Stalled ----- 97024 in-flight CPI 1.3116 -- Total Cycles 127286 ---- Thread 12 ---- PC 5: Stalled ----- 95338 in-flight CPI 1.3348 -- Total Cycles 127286 ---- Thread 13 ---- PC 5: Stalled ----- 91921 in-flight CPI 1.3845 -- Total Cycles 127286 ---- Thread 14 ---- PC 5: Stalled ----- 91571 in-flight CPI 1.3898 -- Total Cycles 127286 ---- Thread 15 ---- PC 5: Stalled ----- 97027 in-flight CPI 1.3116 -- Total Cycles 127286 ---- Thread 16 ---- PC 5: Stalled ----- 97038 in-flight CPI 1.3114 -- Total Cycles 127286 ---- Thread 17 ---- PC 5: Stalled ----- 95683 in-flight CPI 1.3301 -- Total Cycles 127286 ---- Thread 18 ---- PC 5: Stalled ----- 95605 in-flight CPI 1.3311 -- Total Cycles 127286 ---- Thread 19 ---- PC 5: Stalled ----- 91241 in-flight CPI 1.3948 -- Total Cycles 127286 ---- Thread 20 ---- PC 5: Stalled ----- 93718 in-flight CPI 1.3579 -- Total Cycles 127286 ---- Thread 21 ---- PC 5: Stalled ----- 93996 in-flight CPI 1.3539 -- Total Cycles 127286 ---- Thread 22 ---- PC 5: Stalled ----- 94241 in-flight CPI 1.3504 -- Total Cycles 127286 ---- Thread 23 ---- PC 5: Stalled ----- 92889 in-flight CPI 1.3701 -- Total Cycles 127286 ---- Thread 24 ---- PC 5: Stalled ----- 
93041 in-flight CPI 1.3678 -- Total Cycles 127286 ---- Thread 25 ---- PC 5: Stalled ----- 96110 in-flight CPI 1.3241 -- Total Cycles 127286 ---- Thread 26 ---- PC 5: Stalled ----- 93783 in-flight CPI 1.3570 -- Total Cycles 127286 ---- Thread 27 ---- PC 5: Stalled ----- 92009 in-flight CPI 1.3832 -- Total Cycles 127286 ---- Thread 28 ---- PC 5: Stalled ----- 90785 in-flight CPI 1.4018 -- Total Cycles 127286 ---- Thread 29 ---- PC 5: Stalled ----- 91550 in-flight CPI 1.3901 -- Total Cycles 127286 ---- Thread 30 ---- PC 5: Stalled ----- 90452 in-flight CPI 1.4070 -- Total Cycles 127286 ---- Thread 31 ---- PC 5: Stalled ----- 88008 in-flight CPI 1.4460 -- Total Cycles 127286 Total CPI 0.0420 , IPC 23.8130 -- Total Cycles 127286 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8208 (3.849870%) FPSUB: 0 (0.000000%) FPMUL: 32471 (15.230157%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 84902 (39.822328%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5522 (2.590032%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) 
ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74467 (34.927909%) DIV: 7371 (3.457285%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.122419%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3321678 total) ADD%: 7.434 (246925) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.508 (50089) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.578 (19201) FPSUB%: 0.000 (0) FPMUL%: 4.857 (161318) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.168 (171668) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (577) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) 
FPNE%: 0.000 (0) FPLT%: 1.073 (35654) FPLE%: 0.447 (14863) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.786 (92540) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.754 (25039) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.639 (519470) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (38867) ORI%: 1.577 (52379) XORI%: 0.000 (0) MULI%: 3.185 (105812) LW%: 1.124 (37336) LWI%: 13.456 (446964) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9454) SWI%: 4.048 (134453) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.393 (46263) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10238) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2012) bned%: 0.000 (0) bneid%: 13.757 (456972) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.709 (23548) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4246) DIV%: 0.012 (400) FPUN%: 1.456 (48358) FPRSUB%: 4.278 (142085) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (64) FPGT%: 2.940 (97673) FPGE%: 1.008 (33495) SYNC%: 0.000 (0) NOP%: 8.747 (290555) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 4 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 40393 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 
GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1391 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48282 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 11679 XORI 0 MULI 9019 LW 0 LWI 141339 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 79 DIV 24 FPUN 0 FPRSUB 61 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8133 --Total thread-cycles: 4073152 --total thread-cycles issued: 3031123 (74.417134%) --iCache conflicts: 111044 (2.726242%) --thread*cycles of FU dependence: 252751 (6.205293%) --thread*cycles of data dependence: 213202 (5.234325%) --iCache cycles*banks: 4073152 (81.551339% used) Issue breakdown: --thread*cycles of issue worked: 3031123 (74.417134%) --thread*cycles of issue failed: 751474 (18.449447%) --thread*cycles of issue NOP/other: 290555 (7.133419%) Number of thread-cycles not ready: 213202 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3321678 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 8 4: 7 5: 7 6: 8 7: 6 8: 7 9: 8 10: 6 11: 8 12: 8 13: 7 14: 7 15: 8 16: 9 17: 6 18: 8 19: 7 20: 7 21: 7 22: 7 23: 7 24: 8 25: 8 26: 7 27: 7 28: 7 29: 6 30: 6 31: 7 <=== Core 71 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102200 in-flight CPI 1.2493 -- Total Cycles 127706 ---- Thread 01 ---- PC 5: Stalled ----- 98554 in-flight CPI 1.2955 -- Total Cycles 127706 ---- Thread 02 ---- PC 5: Stalled ----- 97527 in-flight CPI 1.3092 -- Total Cycles 127706 ---- Thread 03 ---- PC 5: 
Stalled ----- 95274 in-flight CPI 1.3401 -- Total Cycles 127706 ---- Thread 04 ---- PC 5: Stalled ----- 103808 in-flight CPI 1.2300 -- Total Cycles 127706 ---- Thread 05 ---- PC 5: Stalled ----- 103592 in-flight CPI 1.2325 -- Total Cycles 127706 ---- Thread 06 ---- PC 5: Stalled ----- 91256 in-flight CPI 1.3992 -- Total Cycles 127706 ---- Thread 07 ---- PC 5: Stalled ----- 96113 in-flight CPI 1.3284 -- Total Cycles 127706 ---- Thread 08 ---- PC 5: Stalled ----- 102057 in-flight CPI 1.2510 -- Total Cycles 127706 ---- Thread 09 ---- PC 5: Stalled ----- 99051 in-flight CPI 1.2890 -- Total Cycles 127706 ---- Thread 10 ---- PC 5: Stalled ----- 98386 in-flight CPI 1.2978 -- Total Cycles 127706 ---- Thread 11 ---- PC 5: Stalled ----- 96321 in-flight CPI 1.3256 -- Total Cycles 127706 ---- Thread 12 ---- PC 5: Stalled ----- 95435 in-flight CPI 1.3379 -- Total Cycles 127706 ---- Thread 13 ---- PC 5: Stalled ----- 92003 in-flight CPI 1.3879 -- Total Cycles 127706 ---- Thread 14 ---- PC 5: Stalled ----- 95340 in-flight CPI 1.3393 -- Total Cycles 127706 ---- Thread 15 ---- PC 5: Stalled ----- 102928 in-flight CPI 1.2405 -- Total Cycles 127706 ---- Thread 16 ---- PC 5: Stalled ----- 95644 in-flight CPI 1.3350 -- Total Cycles 127706 ---- Thread 17 ---- PC 5: Stalled ----- 97550 in-flight CPI 1.3089 -- Total Cycles 127706 ---- Thread 18 ---- PC 5: Stalled ----- 89701 in-flight CPI 1.4235 -- Total Cycles 127706 ---- Thread 19 ---- PC 5: Stalled ----- 94027 in-flight CPI 1.3579 -- Total Cycles 127706 ---- Thread 20 ---- PC 5: Stalled ----- 94924 in-flight CPI 1.3451 -- Total Cycles 127706 ---- Thread 21 ---- PC 5: Stalled ----- 89578 in-flight CPI 1.4254 -- Total Cycles 127706 ---- Thread 22 ---- PC 5: Stalled ----- 93749 in-flight CPI 1.3620 -- Total Cycles 127706 ---- Thread 23 ---- PC 5: Stalled ----- 91130 in-flight CPI 1.4011 -- Total Cycles 127706 ---- Thread 24 ---- PC 5: Stalled ----- 89964 in-flight CPI 1.4192 -- Total Cycles 127706 ---- Thread 25 ---- PC 5: Stalled ----- 
90756 in-flight CPI 1.4069 -- Total Cycles 127706 ---- Thread 26 ---- PC 5: Stalled ----- 86308 in-flight CPI 1.4795 -- Total Cycles 127706 ---- Thread 27 ---- PC 5: Stalled ----- 86976 in-flight CPI 1.4681 -- Total Cycles 127706 ---- Thread 28 ---- PC 5: Stalled ----- 94047 in-flight CPI 1.3576 -- Total Cycles 127706 ---- Thread 29 ---- PC 5: Stalled ----- 90373 in-flight CPI 1.4129 -- Total Cycles 127706 ---- Thread 30 ---- PC 5: Stalled ----- 92772 in-flight CPI 1.3763 -- Total Cycles 127706 ---- Thread 31 ---- PC 5: Stalled ----- 90856 in-flight CPI 1.4053 -- Total Cycles 127706 Total CPI 0.0420 , IPC 23.7949 -- Total Cycles 127706 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7865 (4.102101%) FPSUB: 0 (0.000000%) FPMUL: 31752 (16.560702%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 67067 (34.979737%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5581 (2.910849%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 
(0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71752 (37.423265%) DIV: 7449 (3.885131%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.138214%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3329990 total) ADD%: 7.451 (248124) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.520 (50621) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.554 (18464) FPSUB%: 0.000 (0) FPMUL%: 4.783 (159267) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.152 (171546) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (583) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.065 (35459) FPLE%: 0.455 (15143) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 
0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.803 (93332) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (24933) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.674 (521959) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39128) ORI%: 1.557 (51846) XORI%: 0.000 (0) MULI%: 3.199 (106530) LW%: 1.131 (37656) LWI%: 13.472 (448607) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9566) SWI%: 4.063 (135294) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (46629) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10331) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1877) bned%: 0.000 (0) bneid%: 13.787 (459108) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23771) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4085) DIV%: 0.012 (404) FPUN%: 1.470 (48947) FPRSUB%: 4.219 (140507) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.944 (98020) FPGE%: 1.015 (33804) SYNC%: 0.000 (0) NOP%: 8.744 (291184) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 6 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 39662 INTCONV 0 ATOMIC_INC 29 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 
0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1130 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48613 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 11121 XORI 0 MULI 9808 LW 0 LWI 141902 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 79 DIV 24 FPUN 0 FPRSUB 62 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7951 --Total thread-cycles: 4086592 --total thread-cycles issued: 3038806 (74.360396%) --iCache conflicts: 110588 (2.706118%) --thread*cycles of FU dependence: 252896 (6.188433%) --thread*cycles of data dependence: 191731 (4.691709%) --iCache cycles*banks: 4086592 (81.486530% used) Issue breakdown: --thread*cycles of issue worked: 3038806 (74.360396%) --thread*cycles of issue failed: 756602 (18.514253%) --thread*cycles of issue NOP/other: 291184 (7.125350%) Number of thread-cycles not ready: 191731 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3329990 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 9 2: 7 3: 8 4: 9 5: 9 6: 7 7: 8 8: 9 9: 8 10: 7 11: 7 12: 8 13: 6 14: 7 15: 8 16: 8 17: 7 18: 6 19: 7 20: 7 21: 6 22: 7 23: 7 24: 8 25: 7 26: 5 27: 6 28: 8 29: 5 30: 7 31: 7 <=== Core 72 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94714 in-flight CPI 1.5736 -- Total Cycles 149067 ---- Thread 01 ---- PC 5: Stalled ----- 105026 in-flight CPI 1.4191 -- Total Cycles 149067 ---- Thread 02 ---- PC 5: Stalled ----- 92625 in-flight CPI 1.6091 -- Total Cycles 149067 ---- Thread 03 ---- PC 5: Stalled ----- 99307 in-flight CPI 1.5008 -- Total Cycles 149067 ---- Thread 04 ---- PC 5: Stalled 
----- 99736 in-flight CPI 1.4944 -- Total Cycles 149067 ---- Thread 05 ---- PC 5: Stalled ----- 98594 in-flight CPI 1.5116 -- Total Cycles 149067 ---- Thread 06 ---- PC 5: Stalled ----- 97228 in-flight CPI 1.5329 -- Total Cycles 149067 ---- Thread 07 ---- PC 5: Stalled ----- 98804 in-flight CPI 1.5084 -- Total Cycles 149067 ---- Thread 08 ---- PC 5: Stalled ----- 99357 in-flight CPI 1.5000 -- Total Cycles 149067 ---- Thread 09 ---- PC 5: Stalled ----- 98809 in-flight CPI 1.5083 -- Total Cycles 149067 ---- Thread 10 ---- PC 5: Stalled ----- 101502 in-flight CPI 1.4683 -- Total Cycles 149067 ---- Thread 11 ---- PC 5: Stalled ----- 98243 in-flight CPI 1.5170 -- Total Cycles 149067 ---- Thread 12 ---- PC 5: Stalled ----- 95221 in-flight CPI 1.5651 -- Total Cycles 149067 ---- Thread 13 ---- PC 5: Stalled ----- 96864 in-flight CPI 1.5386 -- Total Cycles 149067 ---- Thread 14 ---- PC 5: Stalled ----- 89562 in-flight CPI 1.6642 -- Total Cycles 149067 ---- Thread 15 ---- PC 5: Stalled ----- 97823 in-flight CPI 1.5234 -- Total Cycles 149067 ---- Thread 16 ---- PC 5: Stalled ----- 92486 in-flight CPI 1.6115 -- Total Cycles 149067 ---- Thread 17 ---- PC 5: Stalled ----- 94518 in-flight CPI 1.5769 -- Total Cycles 149067 ---- Thread 18 ---- PC 5: Stalled ----- 90713 in-flight CPI 1.6430 -- Total Cycles 149067 ---- Thread 19 ---- PC 5: Stalled ----- 93836 in-flight CPI 1.5883 -- Total Cycles 149067 ---- Thread 20 ---- PC 5: Stalled ----- 92874 in-flight CPI 1.6047 -- Total Cycles 149067 ---- Thread 21 ---- PC 5: Stalled ----- 90966 in-flight CPI 1.6385 -- Total Cycles 149067 ---- Thread 22 ---- PC 5: Stalled ----- 92394 in-flight CPI 1.6130 -- Total Cycles 149067 ---- Thread 23 ---- PC 5: Stalled ----- 92591 in-flight CPI 1.6097 -- Total Cycles 149067 ---- Thread 24 ---- PC 5: Stalled ----- 92386 in-flight CPI 1.6133 -- Total Cycles 149067 ---- Thread 25 ---- PC 5: Stalled ----- 90870 in-flight CPI 1.6401 -- Total Cycles 149067 ---- Thread 26 ---- PC 5: Stalled ----- 91794 
in-flight CPI 1.6236 -- Total Cycles 149067 ---- Thread 27 ---- PC 5: Stalled ----- 89238 in-flight CPI 1.6701 -- Total Cycles 149067 ---- Thread 28 ---- PC 5: Stalled ----- 90601 in-flight CPI 1.6450 -- Total Cycles 149067 ---- Thread 29 ---- PC 5: Stalled ----- 94212 in-flight CPI 1.5820 -- Total Cycles 149067 ---- Thread 30 ---- PC 5: Stalled ----- 89780 in-flight CPI 1.6600 -- Total Cycles 149067 ---- Thread 31 ---- PC 5: Stalled ----- 98450 in-flight CPI 1.5139 -- Total Cycles 149067 Total CPI 0.0490 , IPC 20.4048 -- Total Cycles 149067 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7973 (3.623267%) FPSUB: 0 (0.000000%) FPMUL: 31907 (14.499886%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 93439 (42.462622%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5675 (2.578959%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) 
MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73256 (33.290616%) DIV: 7533 (3.423313%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.121336%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3332938 total) ADD%: 7.443 (248081) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.514 (50459) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.559 (18620) FPSUB%: 0.000 (0) FPMUL%: 4.798 (159927) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.169 (172269) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (590) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35459) FPLE%: 0.453 (15093) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 
(32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.802 (93374) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (24997) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.666 (522125) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39105) ORI%: 1.559 (51977) XORI%: 0.000 (0) MULI%: 3.197 (106570) LW%: 1.130 (37676) LWI%: 13.478 (449226) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9544) SWI%: 4.066 (135521) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.401 (46680) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10317) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (2002) bned%: 0.000 (0) bneid%: 13.759 (458584) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23812) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4172) DIV%: 0.012 (408) FPUN%: 1.465 (48826) FPRSUB%: 4.242 (141388) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.938 (97911) FPGE%: 1.012 (33733) SYNC%: 0.000 (0) NOP%: 8.737 (291202) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 7 FPSUB 0 FPMUL 5 FPCMPLT 0 FPMIN 0 FPMAX 392 LOAD 39970 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1765 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 
0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48550 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 11280 XORI 0 MULI 8782 LW 0 LWI 142170 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 64 DIV 34 FPUN 0 FPRSUB 55 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.4050 --Total thread-cycles: 4770144 --total thread-cycles issued: 3041736 (63.766125%) --iCache conflicts: 109376 (2.292929%) --thread*cycles of FU dependence: 253143 (5.306821%) --thread*cycles of data dependence: 220050 (4.613068%) --iCache cycles*banks: 4770144 (69.871476% used) Issue breakdown: --thread*cycles of issue worked: 3041736 (63.766125%) --thread*cycles of issue failed: 1437206 (30.129195%) --thread*cycles of issue NOP/other: 291202 (6.104679%) Number of thread-cycles not ready: 220050 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3332938 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 8 2: 7 3: 8 4: 6 5: 8 6: 7 7: 9 8: 8 9: 8 10: 8 11: 9 12: 9 13: 8 14: 5 15: 10 16: 6 17: 6 18: 7 19: 8 20: 8 21: 6 22: 8 23: 7 24: 6 25: 8 26: 7 27: 7 28: 7 29: 7 30: 8 31: 6 <=== Core 73 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95879 in-flight CPI 1.3500 -- Total Cycles 129454 ---- Thread 01 ---- PC 5: Stalled ----- 102906 in-flight CPI 1.2577 -- Total Cycles 129454 ---- Thread 02 ---- PC 5: Stalled ----- 103261 in-flight CPI 1.2534 -- Total Cycles 129454 ---- Thread 03 ---- PC 5: Stalled ----- 103428 in-flight CPI 1.2514 -- Total Cycles 129454 ---- Thread 04 ---- PC 5: Stalled ----- 103956 in-flight CPI 1.2450 -- Total Cycles 129454 ---- Thread 05 ---- PC 5: Stalled 
----- 102186 in-flight CPI 1.2666 -- Total Cycles 129454 ---- Thread 06 ---- PC 5: Stalled ----- 96184 in-flight CPI 1.3457 -- Total Cycles 129454 ---- Thread 07 ---- PC 5: Stalled ----- 100122 in-flight CPI 1.2928 -- Total Cycles 129454 ---- Thread 08 ---- PC 5: Stalled ----- 103976 in-flight CPI 1.2448 -- Total Cycles 129454 ---- Thread 09 ---- PC 5: Stalled ----- 98994 in-flight CPI 1.3074 -- Total Cycles 129454 ---- Thread 10 ---- PC 5: Stalled ----- 95801 in-flight CPI 1.3511 -- Total Cycles 129454 ---- Thread 11 ---- PC 5: Stalled ----- 99009 in-flight CPI 1.3072 -- Total Cycles 129454 ---- Thread 12 ---- PC 5: Stalled ----- 101467 in-flight CPI 1.2755 -- Total Cycles 129454 ---- Thread 13 ---- PC 5: Stalled ----- 95904 in-flight CPI 1.3495 -- Total Cycles 129454 ---- Thread 14 ---- PC 5: Stalled ----- 88945 in-flight CPI 1.4552 -- Total Cycles 129454 ---- Thread 15 ---- PC 5: Stalled ----- 94408 in-flight CPI 1.3709 -- Total Cycles 129454 ---- Thread 16 ---- PC 5: Stalled ----- 98078 in-flight CPI 1.3196 -- Total Cycles 129454 ---- Thread 17 ---- PC 5: Stalled ----- 101924 in-flight CPI 1.2699 -- Total Cycles 129454 ---- Thread 18 ---- PC 5: Stalled ----- 100972 in-flight CPI 1.2819 -- Total Cycles 129454 ---- Thread 19 ---- PC 5: Stalled ----- 91513 in-flight CPI 1.4144 -- Total Cycles 129454 ---- Thread 20 ---- PC 5: Stalled ----- 98800 in-flight CPI 1.3100 -- Total Cycles 129454 ---- Thread 21 ---- PC 5: Stalled ----- 91798 in-flight CPI 1.4100 -- Total Cycles 129454 ---- Thread 22 ---- PC 5: Stalled ----- 89760 in-flight CPI 1.4420 -- Total Cycles 129454 ---- Thread 23 ---- PC 5: Stalled ----- 94673 in-flight CPI 1.3671 -- Total Cycles 129454 ---- Thread 24 ---- PC 5: Stalled ----- 89392 in-flight CPI 1.4479 -- Total Cycles 129454 ---- Thread 25 ---- PC 5: Stalled ----- 89172 in-flight CPI 1.4515 -- Total Cycles 129454 ---- Thread 26 ---- PC 5: Stalled ----- 95240 in-flight CPI 1.3590 -- Total Cycles 129454 ---- Thread 27 ---- PC 5: Stalled ----- 88547 
in-flight CPI 1.4618 -- Total Cycles 129454 ---- Thread 28 ---- PC 5: Stalled ----- 90101 in-flight CPI 1.4365 -- Total Cycles 129454 ---- Thread 29 ---- PC 5: Stalled ----- 82897 in-flight CPI 1.5613 -- Total Cycles 129454 ---- Thread 30 ---- PC 5: Stalled ----- 90571 in-flight CPI 1.4290 -- Total Cycles 129454 ---- Thread 31 ---- PC 5: Stalled ----- 83560 in-flight CPI 1.5489 -- Total Cycles 129454 Total CPI 0.0423 , IPC 23.6686 -- Total Cycles 129454 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7819 (3.921067%) FPSUB: 0 (0.000000%) FPMUL: 31901 (15.997693%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 74554 (37.387293%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5729 (2.872975%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 
(0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71433 (35.822175%) DIV: 7703 (3.862896%) FPUN: 0 (0.000000%) FPRSUB: 271 (0.135901%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3357970 total) ADD%: 7.452 (250224) SUB%: 0.000 (0) MUL%: 0.006 (209) BITOR%: 1.528 (51317) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.548 (18392) FPSUB%: 0.000 (0) FPMUL%: 4.763 (159934) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (627) FPMAX%: 0.019 (627) LOAD%: 5.127 (172179) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (599) FPINV%: 0.000 (0) FPCONV%: 0.020 (659) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35736) FPLE%: 0.457 (15334) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (627) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) 
MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.802 (94084) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.744 (24993) CMPU%: 0.000 (0) RSUB%: 0.006 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.675 (526365) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39429) ORI%: 1.553 (52161) XORI%: 0.000 (0) MULI%: 3.204 (107590) LW%: 1.131 (37968) LWI%: 13.486 (452849) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9644) SWI%: 4.063 (136450) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (47007) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10397) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1799) bned%: 0.000 (0) bneid%: 13.811 (463772) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23978) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4060) DIV%: 0.012 (418) FPUN%: 1.478 (49647) FPRSUB%: 4.199 (141007) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (65) FPGT%: 2.950 (99045) FPGE%: 1.022 (34313) SYNC%: 0.000 (0) NOP%: 8.753 (293919) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 16 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 411 LOAD 39886 INTCONV 0 ATOMIC_INC 28 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 20 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1542 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 
0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49181 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11056 XORI 0 MULI 9395 LW 0 LWI 143408 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 24 FPUN 0 FPRSUB 64 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6689 --Total thread-cycles: 4142528 --total thread-cycles issued: 3064051 (73.965728%) --iCache conflicts: 110302 (2.662674%) --thread*cycles of FU dependence: 255144 (6.159138%) --thread*cycles of data dependence: 199410 (4.813727%) --iCache cycles*banks: 4142528 (81.061661% used) Issue breakdown: --thread*cycles of issue worked: 3064051 (73.965728%) --thread*cycles of issue failed: 784558 (18.939112%) --thread*cycles of issue NOP/other: 293919 (7.095160%) Number of thread-cycles not ready: 199410 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3357970 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 9 4: 8 5: 8 6: 7 7: 7 8: 8 9: 8 10: 7 11: 8 12: 9 13: 9 14: 6 15: 9 16: 8 17: 8 18: 7 19: 5 20: 9 21: 6 22: 7 23: 7 24: 7 25: 7 26: 8 27: 6 28: 7 29: 7 30: 8 31: 7 <=== Core 74 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101978 in-flight CPI 1.2444 -- Total Cycles 126923 ---- Thread 01 ---- PC 5: Stalled ----- 95785 in-flight CPI 1.3249 -- Total Cycles 126923 ---- Thread 02 ---- PC 5: Stalled ----- 100966 in-flight CPI 1.2568 -- Total Cycles 126923 ---- Thread 03 ---- PC 5: Stalled ----- 102031 in-flight CPI 1.2437 -- Total Cycles 126923 ---- Thread 04 ---- PC 5: Stalled ----- 103774 in-flight CPI 1.2228 -- Total Cycles 126923 ---- Thread 05 ---- PC 5: Stalled ----- 94239 in-flight CPI 1.3465 -- Total Cycles 126923 ---- Thread 06 ---- PC 5: Stalled 
----- 95802 in-flight CPI 1.3246 -- Total Cycles 126923 ---- Thread 07 ---- PC 5: Stalled ----- 96155 in-flight CPI 1.3197 -- Total Cycles 126923 ---- Thread 08 ---- PC 5: Stalled ----- 98107 in-flight CPI 1.2935 -- Total Cycles 126923 ---- Thread 09 ---- PC 5: Stalled ----- 95519 in-flight CPI 1.3286 -- Total Cycles 126923 ---- Thread 10 ---- PC 5: Stalled ----- 100308 in-flight CPI 1.2651 -- Total Cycles 126923 ---- Thread 11 ---- PC 5: Stalled ----- 100745 in-flight CPI 1.2596 -- Total Cycles 126923 ---- Thread 12 ---- PC 5: Stalled ----- 96128 in-flight CPI 1.3202 -- Total Cycles 126923 ---- Thread 13 ---- PC 5: Stalled ----- 101005 in-flight CPI 1.2563 -- Total Cycles 126923 ---- Thread 14 ---- PC 5: Stalled ----- 95152 in-flight CPI 1.3337 -- Total Cycles 126923 ---- Thread 15 ---- PC 5: Stalled ----- 97608 in-flight CPI 1.3001 -- Total Cycles 126923 ---- Thread 16 ---- PC 5: Stalled ----- 91360 in-flight CPI 1.3890 -- Total Cycles 126923 ---- Thread 17 ---- PC 5: Stalled ----- 101072 in-flight CPI 1.2555 -- Total Cycles 126923 ---- Thread 18 ---- PC 5: Stalled ----- 95600 in-flight CPI 1.3274 -- Total Cycles 126923 ---- Thread 19 ---- PC 5: Stalled ----- 94989 in-flight CPI 1.3359 -- Total Cycles 126923 ---- Thread 20 ---- PC 5: Stalled ----- 92038 in-flight CPI 1.3787 -- Total Cycles 126923 ---- Thread 21 ---- PC 5: Stalled ----- 96517 in-flight CPI 1.3148 -- Total Cycles 126923 ---- Thread 22 ---- PC 5: Stalled ----- 91095 in-flight CPI 1.3931 -- Total Cycles 126923 ---- Thread 23 ---- PC 5: Stalled ----- 95821 in-flight CPI 1.3243 -- Total Cycles 126923 ---- Thread 24 ---- PC 5: Stalled ----- 97457 in-flight CPI 1.3021 -- Total Cycles 126923 ---- Thread 25 ---- PC 5: Stalled ----- 95851 in-flight CPI 1.3239 -- Total Cycles 126923 ---- Thread 26 ---- PC 5: Stalled ----- 85367 in-flight CPI 1.4866 -- Total Cycles 126923 ---- Thread 27 ---- PC 5: Stalled ----- 94574 in-flight CPI 1.3418 -- Total Cycles 126923 ---- Thread 28 ---- PC 5: Stalled ----- 92983 
in-flight CPI 1.3648 -- Total Cycles 126923 ---- Thread 29 ---- PC 5: Stalled ----- 88893 in-flight CPI 1.4276 -- Total Cycles 126923 ---- Thread 30 ---- PC 5: Stalled ----- 88670 in-flight CPI 1.4311 -- Total Cycles 126923 ---- Thread 31 ---- PC 5: Stalled ----- 90469 in-flight CPI 1.4027 -- Total Cycles 126923 Total CPI 0.0414 , IPC 24.1771 -- Total Cycles 126923 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7241 (3.942225%) FPSUB: 0 (0.000000%) FPMUL: 30814 (16.776097%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 64870 (35.317240%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5804 (3.159878%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) 
RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 66886 (36.414813%) DIV: 7789 (4.240573%) FPUN: 0 (0.000000%) FPRSUB: 274 (0.149174%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3362594 total) ADD%: 7.492 (251942) SUB%: 0.000 (0) MUL%: 0.006 (211) BITOR%: 1.537 (51667) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.511 (17174) FPSUB%: 0.000 (0) FPMUL%: 4.655 (156530) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (633) FPMAX%: 0.019 (633) LOAD%: 5.084 (170955) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (604) FPINV%: 0.000 (0) FPCONV%: 0.020 (665) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.050 (35304) FPLE%: 0.457 (15365) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (633) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 
0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.828 (95096) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.736 (24744) CMPU%: 0.000 (0) RSUB%: 0.006 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.705 (528095) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.182 (39743) ORI%: 1.530 (51456) XORI%: 0.000 (0) MULI%: 3.232 (108678) LW%: 1.141 (38376) LWI%: 13.561 (456018) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9744) SWI%: 4.094 (137674) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.413 (47516) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10458) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.047 (1568) bned%: 0.000 (0) bneid%: 13.841 (465430) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24259) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.113 (3791) DIV%: 0.013 (422) FPUN%: 1.490 (50089) FPRSUB%: 4.116 (138413) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.958 (99462) FPGE%: 1.033 (34724) SYNC%: 0.000 (0) NOP%: 8.740 (293903) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 34 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 38723 INTCONV 0 ATOMIC_INC 29 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1489 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 
ADDI 49473 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 10190 XORI 0 MULI 9926 LW 0 LWI 143880 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 60 DIV 25 FPUN 0 FPRSUB 50 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.1774 --Total thread-cycles: 4061536 --total thread-cycles issued: 3068691 (75.554938%) --iCache conflicts: 112789 (2.777004%) --thread*cycles of FU dependence: 254343 (6.262237%) --thread*cycles of data dependence: 183678 (4.522378%) --iCache cycles*banks: 4061536 (82.791978% used) Issue breakdown: --thread*cycles of issue worked: 3068691 (75.554938%) --thread*cycles of issue failed: 698942 (17.208810%) --thread*cycles of issue NOP/other: 293903 (7.236252%) Number of thread-cycles not ready: 183678 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3362594 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 8 4: 9 5: 8 6: 7 7: 8 8: 8 9: 6 10: 9 11: 9 12: 6 13: 9 14: 7 15: 7 16: 7 17: 8 18: 8 19: 8 20: 8 21: 8 22: 7 23: 9 24: 8 25: 7 26: 6 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 75 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94904 in-flight CPI 1.3436 -- Total Cycles 127547 ---- Thread 01 ---- PC 5: Stalled ----- 95884 in-flight CPI 1.3300 -- Total Cycles 127547 ---- Thread 02 ---- PC 5: Stalled ----- 103681 in-flight CPI 1.2299 -- Total Cycles 127547 ---- Thread 03 ---- PC 5: Stalled ----- 97145 in-flight CPI 1.3127 -- Total Cycles 127547 ---- Thread 04 ---- PC 5: Stalled ----- 98058 in-flight CPI 1.3004 -- Total Cycles 127547 ---- Thread 05 ---- PC 5: Stalled ----- 97183 in-flight CPI 1.3122 -- Total Cycles 127547 ---- Thread 06 ---- PC 5: Stalled ----- 93466 in-flight CPI 1.3644 -- Total Cycles 127547 ---- Thread 07 ---- PC 5: Stalled 
----- 98747 in-flight CPI 1.2914 -- Total Cycles 127547 ---- Thread 08 ---- PC 5: Stalled ----- 99261 in-flight CPI 1.2848 -- Total Cycles 127547 ---- Thread 09 ---- PC 5: Stalled ----- 86885 in-flight CPI 1.4678 -- Total Cycles 127547 ---- Thread 10 ---- PC 5: Stalled ----- 100089 in-flight CPI 1.2741 -- Total Cycles 127547 ---- Thread 11 ---- PC 5: Stalled ----- 102259 in-flight CPI 1.2471 -- Total Cycles 127547 ---- Thread 12 ---- PC 5: Stalled ----- 89984 in-flight CPI 1.4172 -- Total Cycles 127547 ---- Thread 13 ---- PC 5: Stalled ----- 95013 in-flight CPI 1.3421 -- Total Cycles 127547 ---- Thread 14 ---- PC 5: Stalled ----- 94991 in-flight CPI 1.3425 -- Total Cycles 127547 ---- Thread 15 ---- PC 5: Stalled ----- 92502 in-flight CPI 1.3786 -- Total Cycles 127547 ---- Thread 16 ---- PC 5: Stalled ----- 97928 in-flight CPI 1.3022 -- Total Cycles 127547 ---- Thread 17 ---- PC 5: Stalled ----- 101058 in-flight CPI 1.2619 -- Total Cycles 127547 ---- Thread 18 ---- PC 5: Stalled ----- 94561 in-flight CPI 1.3486 -- Total Cycles 127547 ---- Thread 19 ---- PC 5: Stalled ----- 92114 in-flight CPI 1.3844 -- Total Cycles 127547 ---- Thread 20 ---- PC 5: Stalled ----- 92860 in-flight CPI 1.3733 -- Total Cycles 127547 ---- Thread 21 ---- PC 5: Stalled ----- 88625 in-flight CPI 1.4389 -- Total Cycles 127547 ---- Thread 22 ---- PC 5: Stalled ----- 90228 in-flight CPI 1.4133 -- Total Cycles 127547 ---- Thread 23 ---- PC 5: Stalled ----- 93695 in-flight CPI 1.3611 -- Total Cycles 127547 ---- Thread 24 ---- PC 5: Stalled ----- 89269 in-flight CPI 1.4285 -- Total Cycles 127547 ---- Thread 25 ---- PC 5: Stalled ----- 91695 in-flight CPI 1.3908 -- Total Cycles 127547 ---- Thread 26 ---- PC 5: Stalled ----- 90562 in-flight CPI 1.4081 -- Total Cycles 127547 ---- Thread 27 ---- PC 5: Stalled ----- 92524 in-flight CPI 1.3783 -- Total Cycles 127547 ---- Thread 28 ---- PC 5: Stalled ----- 94213 in-flight CPI 1.3535 -- Total Cycles 127547 ---- Thread 29 ---- PC 5: Stalled ----- 85904 
in-flight CPI 1.4846 -- Total Cycles 127547 ---- Thread 30 ---- PC 5: Stalled ----- 90888 in-flight CPI 1.4031 -- Total Cycles 127547 ---- Thread 31 ---- PC 5: Stalled ----- 90076 in-flight CPI 1.4157 -- Total Cycles 127547 Total CPI 0.0423 , IPC 23.6525 -- Total Cycles 127547 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7620 (3.850994%) FPSUB: 0 (0.000000%) FPMUL: 31279 (15.807774%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76059 (38.438680%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5488 (2.773524%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) 
ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69739 (35.244680%) DIV: 7422 (3.750929%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.133420%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3305753 total) ADD%: 7.463 (246719) SUB%: 0.000 (0) MUL%: 0.006 (201) BITOR%: 1.533 (50673) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.545 (18007) FPSUB%: 0.000 (0) FPMUL%: 4.753 (157138) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (603) FPMAX%: 0.018 (603) LOAD%: 5.135 (169738) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (577) FPINV%: 0.000 (0) FPCONV%: 0.019 (635) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (35033) FPLE%: 0.459 (15174) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (603) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.807 
(92801) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24653) CMPU%: 0.000 (0) RSUB%: 0.006 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.686 (518538) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (38914) ORI%: 1.553 (51346) XORI%: 0.000 (0) MULI%: 3.204 (105904) LW%: 1.133 (37442) LWI%: 13.479 (445570) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9515) SWI%: 4.064 (134357) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.402 (46357) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10253) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1737) bned%: 0.000 (0) bneid%: 13.803 (456301) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23779) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (3968) DIV%: 0.012 (402) FPUN%: 1.482 (49000) FPRSUB%: 4.194 (138640) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (68) FPGT%: 2.942 (97250) FPGE%: 1.023 (33826) SYNC%: 0.000 (0) NOP%: 8.739 (288898) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 12 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 11 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 39171 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1677 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48343 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 
10847 XORI 0 MULI 9616 LW 0 LWI 140975 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 93 DIV 30 FPUN 0 FPRSUB 69 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6527 --Total thread-cycles: 4081504 --total thread-cycles issued: 3016855 (73.915277%) --iCache conflicts: 108789 (2.665415%) --thread*cycles of FU dependence: 251321 (6.157559%) --thread*cycles of data dependence: 197871 (4.847992%) --iCache cycles*banks: 4081504 (80.994285% used) Issue breakdown: --thread*cycles of issue worked: 3016855 (73.915277%) --thread*cycles of issue failed: 775751 (19.006499%) --thread*cycles of issue NOP/other: 288898 (7.078224%) Number of thread-cycles not ready: 197871 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3305753 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 9 3: 8 4: 9 5: 7 6: 6 7: 9 8: 7 9: 6 10: 7 11: 8 12: 6 13: 8 14: 6 15: 7 16: 7 17: 8 18: 8 19: 7 20: 7 21: 7 22: 8 23: 7 24: 7 25: 6 26: 8 27: 7 28: 8 29: 5 30: 7 31: 7 <=== Core 76 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98745 in-flight CPI 1.3003 -- Total Cycles 128413 ---- Thread 01 ---- PC 5: Stalled ----- 100672 in-flight CPI 1.2754 -- Total Cycles 128413 ---- Thread 02 ---- PC 5: Stalled ----- 102636 in-flight CPI 1.2509 -- Total Cycles 128413 ---- Thread 03 ---- PC 5: Stalled ----- 98182 in-flight CPI 1.3077 -- Total Cycles 128413 ---- Thread 04 ---- PC 5: Stalled ----- 100138 in-flight CPI 1.2821 -- Total Cycles 128413 ---- Thread 05 ---- PC 5: Stalled ----- 99728 in-flight CPI 1.2874 -- Total Cycles 128413 ---- Thread 06 ---- PC 5: Stalled ----- 96316 in-flight CPI 1.3330 -- Total Cycles 128413 ---- Thread 07 ---- PC 5: Stalled ----- 98506 in-flight CPI 1.3034 -- Total Cycles 128413 ---- Thread 08 ---- PC 5: Stalled 
----- 102046 in-flight CPI 1.2581 -- Total Cycles 128413 ---- Thread 09 ---- PC 5: Stalled ----- 94764 in-flight CPI 1.3548 -- Total Cycles 128413 ---- Thread 10 ---- PC 5: Stalled ----- 95260 in-flight CPI 1.3478 -- Total Cycles 128413 ---- Thread 11 ---- PC 5: Stalled ----- 93659 in-flight CPI 1.3708 -- Total Cycles 128413 ---- Thread 12 ---- PC 5: Stalled ----- 96272 in-flight CPI 1.3335 -- Total Cycles 128413 ---- Thread 13 ---- PC 5: Stalled ----- 96953 in-flight CPI 1.3243 -- Total Cycles 128413 ---- Thread 14 ---- PC 5: Stalled ----- 96373 in-flight CPI 1.3322 -- Total Cycles 128413 ---- Thread 15 ---- PC 5: Stalled ----- 96515 in-flight CPI 1.3303 -- Total Cycles 128413 ---- Thread 16 ---- PC 5: Stalled ----- 96134 in-flight CPI 1.3355 -- Total Cycles 128413 ---- Thread 17 ---- PC 5: Stalled ----- 95889 in-flight CPI 1.3390 -- Total Cycles 128413 ---- Thread 18 ---- PC 5: Stalled ----- 94164 in-flight CPI 1.3635 -- Total Cycles 128413 ---- Thread 19 ---- PC 5: Stalled ----- 96146 in-flight CPI 1.3353 -- Total Cycles 128413 ---- Thread 20 ---- PC 5: Stalled ----- 98848 in-flight CPI 1.2988 -- Total Cycles 128413 ---- Thread 21 ---- PC 5: Stalled ----- 95763 in-flight CPI 1.3407 -- Total Cycles 128413 ---- Thread 22 ---- PC 5: Stalled ----- 99734 in-flight CPI 1.2873 -- Total Cycles 128413 ---- Thread 23 ---- PC 5: Stalled ----- 93985 in-flight CPI 1.3661 -- Total Cycles 128413 ---- Thread 24 ---- PC 5: Stalled ----- 96057 in-flight CPI 1.3366 -- Total Cycles 128413 ---- Thread 25 ---- PC 5: Stalled ----- 96062 in-flight CPI 1.3365 -- Total Cycles 128413 ---- Thread 26 ---- PC 5: Stalled ----- 92034 in-flight CPI 1.3951 -- Total Cycles 128413 ---- Thread 27 ---- PC 5: Stalled ----- 93182 in-flight CPI 1.3778 -- Total Cycles 128413 ---- Thread 28 ---- PC 5: Stalled ----- 88715 in-flight CPI 1.4472 -- Total Cycles 128413 ---- Thread 29 ---- PC 5: Stalled ----- 92842 in-flight CPI 1.3828 -- Total Cycles 128413 ---- Thread 30 ---- PC 5: Stalled ----- 87315 
in-flight CPI 1.4704 -- Total Cycles 128413 ---- Thread 31 ---- PC 5: Stalled ----- 89905 in-flight CPI 1.4281 -- Total Cycles 128413 Total CPI 0.0418 , IPC 23.9392 -- Total Cycles 128413 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7457 (3.998713%) FPSUB: 0 (0.000000%) FPMUL: 31132 (16.694104%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 65534 (35.141700%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5708 (3.060836%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 
(0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68713 (36.846395%) DIV: 7671 (4.113468%) FPUN: 0 (0.000000%) FPRSUB: 270 (0.144784%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3368616 total) ADD%: 7.497 (252542) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.534 (51675) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.522 (17580) FPSUB%: 0.000 (0) FPMUL%: 4.689 (157959) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.104 (171939) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (596) FPINV%: 0.000 (0) FPCONV%: 0.019 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.051 (35409) FPLE%: 0.455 (15342) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.823 (95093) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.736 (24787) CMPU%: 
0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.685 (528360) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39640) ORI%: 1.548 (52132) XORI%: 0.000 (0) MULI%: 3.219 (108442) LW%: 1.139 (38370) LWI%: 13.530 (455784) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9737) SWI%: 4.080 (137435) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.411 (47518) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10461) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1806) bned%: 0.000 (0) bneid%: 13.815 (465367) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.724 (24385) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.116 (3900) DIV%: 0.012 (416) FPUN%: 1.490 (50197) FPRSUB%: 4.149 (139772) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (84) FPGT%: 2.948 (99301) FPGE%: 1.035 (34855) SYNC%: 0.000 (0) NOP%: 8.741 (294452) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 10 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 407 LOAD 39287 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 11 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1889 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49427 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 10523 XORI 0 MULI 9673 LW 0 LWI 144072 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 
bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 62 DIV 34 FPUN 0 FPRSUB 53 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9395 --Total thread-cycles: 4109216 --total thread-cycles issued: 3074164 (74.811448%) --iCache conflicts: 113162 (2.753859%) --thread*cycles of FU dependence: 255520 (6.218218%) --thread*cycles of data dependence: 186485 (4.538214%) --iCache cycles*banks: 4109216 (81.977876% used) Issue breakdown: --thread*cycles of issue worked: 3074164 (74.811448%) --thread*cycles of issue failed: 740600 (18.022903%) --thread*cycles of issue NOP/other: 294452 (7.165649%) Number of thread-cycles not ready: 186485 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3368616 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 7 2: 8 3: 6 4: 8 5: 8 6: 8 7: 8 8: 8 9: 8 10: 6 11: 7 12: 9 13: 7 14: 8 15: 7 16: 7 17: 7 18: 7 19: 8 20: 9 21: 7 22: 8 23: 7 24: 7 25: 9 26: 6 27: 8 28: 8 29: 9 30: 7 31: 7 <=== Core 77 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95474 in-flight CPI 1.3402 -- Total Cycles 127975 ---- Thread 01 ---- PC 5: Stalled ----- 103821 in-flight CPI 1.2324 -- Total Cycles 127975 ---- Thread 02 ---- PC 5: Stalled ----- 98876 in-flight CPI 1.2940 -- Total Cycles 127975 ---- Thread 03 ---- PC 5: Stalled ----- 96288 in-flight CPI 1.3289 -- Total Cycles 127975 ---- Thread 04 ---- PC 5: Stalled ----- 97299 in-flight CPI 1.3150 -- Total Cycles 127975 ---- Thread 05 ---- PC 5: Stalled ----- 99265 in-flight CPI 1.2890 -- Total Cycles 127975 ---- Thread 06 ---- PC 5: Stalled ----- 100210 in-flight CPI 1.2769 -- Total Cycles 127975 ---- Thread 07 ---- PC 5: Stalled ----- 96098 in-flight CPI 1.3315 -- Total Cycles 127975 ---- Thread 08 ---- PC 5: Stalled ----- 96430 in-flight CPI 1.3269 -- Total Cycles 127975 ---- Thread 09 ---- PC 5: Stalled 
----- 99532 in-flight CPI 1.2856 -- Total Cycles 127975 ---- Thread 10 ---- PC 5: Stalled ----- 99964 in-flight CPI 1.2800 -- Total Cycles 127975 ---- Thread 11 ---- PC 5: Stalled ----- 97791 in-flight CPI 1.3084 -- Total Cycles 127975 ---- Thread 12 ---- PC 5: Stalled ----- 100078 in-flight CPI 1.2785 -- Total Cycles 127975 ---- Thread 13 ---- PC 5: Stalled ----- 92584 in-flight CPI 1.3820 -- Total Cycles 127975 ---- Thread 14 ---- PC 5: Stalled ----- 97498 in-flight CPI 1.3123 -- Total Cycles 127975 ---- Thread 15 ---- PC 5: Stalled ----- 95198 in-flight CPI 1.3441 -- Total Cycles 127975 ---- Thread 16 ---- PC 5: Stalled ----- 95391 in-flight CPI 1.3414 -- Total Cycles 127975 ---- Thread 17 ---- PC 5: Stalled ----- 92042 in-flight CPI 1.3902 -- Total Cycles 127975 ---- Thread 18 ---- PC 5: Stalled ----- 93693 in-flight CPI 1.3656 -- Total Cycles 127975 ---- Thread 19 ---- PC 5: Stalled ----- 95317 in-flight CPI 1.3424 -- Total Cycles 127975 ---- Thread 20 ---- PC 5: Stalled ----- 89709 in-flight CPI 1.4263 -- Total Cycles 127975 ---- Thread 21 ---- PC 5: Stalled ----- 92126 in-flight CPI 1.3888 -- Total Cycles 127975 ---- Thread 22 ---- PC 5: Stalled ----- 96811 in-flight CPI 1.3216 -- Total Cycles 127975 ---- Thread 23 ---- PC 5: Stalled ----- 99074 in-flight CPI 1.2914 -- Total Cycles 127975 ---- Thread 24 ---- PC 5: Stalled ----- 90429 in-flight CPI 1.4149 -- Total Cycles 127975 ---- Thread 25 ---- PC 5: Stalled ----- 98573 in-flight CPI 1.2980 -- Total Cycles 127975 ---- Thread 26 ---- PC 5: Stalled ----- 90484 in-flight CPI 1.4141 -- Total Cycles 127975 ---- Thread 27 ---- PC 5: Stalled ----- 88872 in-flight CPI 1.4398 -- Total Cycles 127975 ---- Thread 28 ---- PC 5: Stalled ----- 93419 in-flight CPI 1.3697 -- Total Cycles 127975 ---- Thread 29 ---- PC 5: Stalled ----- 86163 in-flight CPI 1.4850 -- Total Cycles 127975 ---- Thread 30 ---- PC 5: Stalled ----- 89372 in-flight CPI 1.4317 -- Total Cycles 127975 ---- Thread 31 ---- PC 5: Stalled ----- 88196 
in-flight CPI 1.4508 -- Total Cycles 127975 Total CPI 0.0420 , IPC 23.8065 -- Total Cycles 127975 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7800 (3.978780%) FPSUB: 0 (0.000000%) FPMUL: 31672 (16.155887%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 71638 (36.542542%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5633 (2.873393%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 
(0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71426 (36.434401%) DIV: 7604 (3.878800%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.136197%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3338465 total) ADD%: 7.489 (250026) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.525 (50912) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.548 (18304) FPSUB%: 0.000 (0) FPMUL%: 4.765 (159074) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.141 (171628) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (590) FPINV%: 0.000 (0) FPCONV%: 0.019 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (35445) FPLE%: 0.455 (15206) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.804 (93616) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24924) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 
0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.672 (523194) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39242) ORI%: 1.552 (51807) XORI%: 0.000 (0) MULI%: 3.202 (106882) LW%: 1.132 (37776) LWI%: 13.475 (449844) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9605) SWI%: 4.065 (135708) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.401 (46761) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10363) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1820) bned%: 0.000 (0) bneid%: 13.791 (460404) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.713 (23817) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4062) DIV%: 0.012 (412) FPUN%: 1.475 (49240) FPRSUB%: 4.206 (140408) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (68) FPGT%: 2.944 (98273) FPGE%: 1.019 (34034) SYNC%: 0.000 (0) NOP%: 8.740 (291770) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 31 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 9 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 39577 INTCONV 0 ATOMIC_INC 29 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 11 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1609 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48763 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 11060 XORI 0 MULI 9224 LW 0 LWI 142148 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 
brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 30 FPUN 0 FPRSUB 52 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8068 --Total thread-cycles: 4095200 --total thread-cycles issued: 3046695 (74.396733%) --iCache conflicts: 110844 (2.706681%) --thread*cycles of FU dependence: 253041 (6.178966%) --thread*cycles of data dependence: 196040 (4.787068%) --iCache cycles*banks: 4095200 (81.522197% used) Issue breakdown: --thread*cycles of issue worked: 3046695 (74.396733%) --thread*cycles of issue failed: 756735 (18.478585%) --thread*cycles of issue NOP/other: 291770 (7.124683%) Number of thread-cycles not ready: 196040 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3338465 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 8 2: 8 3: 7 4: 8 5: 7 6: 7 7: 7 8: 8 9: 7 10: 8 11: 8 12: 9 13: 7 14: 8 15: 7 16: 7 17: 7 18: 8 19: 8 20: 7 21: 8 22: 8 23: 9 24: 7 25: 9 26: 7 27: 6 28: 7 29: 7 30: 7 31: 6 <=== Core 78 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103241 in-flight CPI 1.2576 -- Total Cycles 129862 ---- Thread 01 ---- PC 5: Stalled ----- 93404 in-flight CPI 1.3901 -- Total Cycles 129862 ---- Thread 02 ---- PC 5: Stalled ----- 102055 in-flight CPI 1.2722 -- Total Cycles 129862 ---- Thread 03 ---- PC 5: Stalled ----- 97639 in-flight CPI 1.3298 -- Total Cycles 129862 ---- Thread 04 ---- PC 5: Stalled ----- 101403 in-flight CPI 1.2804 -- Total Cycles 129862 ---- Thread 05 ---- PC 5: Stalled ----- 103367 in-flight CPI 1.2560 -- Total Cycles 129862 ---- Thread 06 ---- PC 5: Stalled ----- 101866 in-flight CPI 1.2746 -- Total Cycles 129862 ---- Thread 07 ---- PC 5: Stalled ----- 98675 in-flight CPI 1.3158 -- Total Cycles 129862 ---- Thread 08 ---- PC 5: Stalled ----- 101930 in-flight CPI 1.2738 -- Total Cycles 129862 ---- Thread 09 ---- PC 5: Stalled ----- 96977 in-flight CPI 1.3389 -- Total Cycles 129862 ---- Thread 10 ---- PC 5: 
Stalled ----- 95464 in-flight CPI 1.3601 -- Total Cycles 129862 ---- Thread 11 ---- PC 5: Stalled ----- 97808 in-flight CPI 1.3275 -- Total Cycles 129862 ---- Thread 12 ---- PC 5: Stalled ----- 97927 in-flight CPI 1.3258 -- Total Cycles 129862 ---- Thread 13 ---- PC 5: Stalled ----- 96370 in-flight CPI 1.3473 -- Total Cycles 129862 ---- Thread 14 ---- PC 5: Stalled ----- 98781 in-flight CPI 1.3144 -- Total Cycles 129862 ---- Thread 15 ---- PC 5: Stalled ----- 95414 in-flight CPI 1.3608 -- Total Cycles 129862 ---- Thread 16 ---- PC 5: Stalled ----- 96419 in-flight CPI 1.3466 -- Total Cycles 129862 ---- Thread 17 ---- PC 5: Stalled ----- 91550 in-flight CPI 1.4183 -- Total Cycles 129862 ---- Thread 18 ---- PC 5: Stalled ----- 98497 in-flight CPI 1.3182 -- Total Cycles 129862 ---- Thread 19 ---- PC 5: Stalled ----- 94080 in-flight CPI 1.3801 -- Total Cycles 129862 ---- Thread 20 ---- PC 5: Stalled ----- 96826 in-flight CPI 1.3410 -- Total Cycles 129862 ---- Thread 21 ---- PC 5: Stalled ----- 94802 in-flight CPI 1.3695 -- Total Cycles 129862 ---- Thread 22 ---- PC 5: Stalled ----- 98637 in-flight CPI 1.3163 -- Total Cycles 129862 ---- Thread 23 ---- PC 5: Stalled ----- 95678 in-flight CPI 1.3570 -- Total Cycles 129862 ---- Thread 24 ---- PC 5: Stalled ----- 93486 in-flight CPI 1.3889 -- Total Cycles 129862 ---- Thread 25 ---- PC 5: Stalled ----- 92470 in-flight CPI 1.4041 -- Total Cycles 129862 ---- Thread 26 ---- PC 5: Stalled ----- 92521 in-flight CPI 1.4033 -- Total Cycles 129862 ---- Thread 27 ---- PC 5: Stalled ----- 87540 in-flight CPI 1.4832 -- Total Cycles 129862 ---- Thread 28 ---- PC 5: Stalled ----- 88483 in-flight CPI 1.4674 -- Total Cycles 129862 ---- Thread 29 ---- PC 5: Stalled ----- 88964 in-flight CPI 1.4594 -- Total Cycles 129862 ---- Thread 30 ---- PC 5: Stalled ----- 91966 in-flight CPI 1.4119 -- Total Cycles 129862 ---- Thread 31 ---- PC 5: Stalled ----- 84915 in-flight CPI 1.5290 -- Total Cycles 129862 Total CPI 0.0423 , IPC 23.6382 -- Total 
Cycles 129862 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7295 (3.726369%) FPSUB: 0 (0.000000%) FPMUL: 30915 (15.791732%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 77143 (39.405518%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5487 (2.802822%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 
(0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67179 (34.315794%) DIV: 7484 (3.822912%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.134854%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3364092 total) ADD%: 7.498 (252236) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.525 (51296) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.513 (17248) FPSUB%: 0.000 (0) FPMUL%: 4.667 (156999) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.109 (171886) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (579) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.048 (35239) FPLE%: 0.458 (15398) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.832 (95263) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.734 (24678) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.716 
(528698) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (39673) ORI%: 1.531 (51519) XORI%: 0.000 (0) MULI%: 3.228 (108594) LW%: 1.142 (38430) LWI%: 13.539 (455477) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.291 (9777) SWI%: 4.082 (137328) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.414 (47571) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10480) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.052 (1759) bned%: 0.000 (0) bneid%: 13.823 (465009) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (24209) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.113 (3814) DIV%: 0.012 (406) FPUN%: 1.482 (49850) FPRSUB%: 4.134 (139070) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (65) FPGT%: 2.959 (99556) FPGE%: 1.024 (34452) SYNC%: 0.000 (0) NOP%: 8.749 (294328) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 36 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 12 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 39608 INTCONV 0 ATOMIC_INC 25 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1340 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49480 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10336 XORI 0 MULI 9687 LW 0 LWI 143934 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 72 
DIV 20 FPUN 0 FPRSUB 57 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6385 --Total thread-cycles: 4155584 --total thread-cycles issued: 3069764 (73.870821%) --iCache conflicts: 111756 (2.689297%) --thread*cycles of FU dependence: 255051 (6.137549%) --thread*cycles of data dependence: 195767 (4.710938%) --iCache cycles*banks: 4155584 (80.954301% used) Issue breakdown: --thread*cycles of issue worked: 3069764 (73.870821%) --thread*cycles of issue failed: 791492 (19.046469%) --thread*cycles of issue NOP/other: 294328 (7.082711%) Number of thread-cycles not ready: 195767 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3364092 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 8 4: 7 5: 9 6: 9 7: 8 8: 8 9: 7 10: 7 11: 7 12: 8 13: 8 14: 7 15: 8 16: 7 17: 6 18: 7 19: 7 20: 7 21: 8 22: 7 23: 8 24: 7 25: 7 26: 7 27: 6 28: 6 29: 8 30: 6 31: 7 <=== Core 79 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99346 in-flight CPI 1.6654 -- Total Cycles 165491 ---- Thread 01 ---- PC 5: Stalled ----- 107487 in-flight CPI 1.5395 -- Total Cycles 165491 ---- Thread 02 ---- PC 5: Stalled ----- 94939 in-flight CPI 1.7428 -- Total Cycles 165491 ---- Thread 03 ---- PC 5: Stalled ----- 90661 in-flight CPI 1.8251 -- Total Cycles 165491 ---- Thread 04 ---- PC 5: Stalled ----- 96435 in-flight CPI 1.7157 -- Total Cycles 165491 ---- Thread 05 ---- PC 5: Stalled ----- 100815 in-flight CPI 1.6413 -- Total Cycles 165491 ---- Thread 06 ---- PC 5: Stalled ----- 95091 in-flight CPI 1.7400 -- Total Cycles 165491 ---- Thread 07 ---- PC 5: Stalled ----- 95887 in-flight CPI 1.7257 -- Total Cycles 165491 ---- Thread 08 ---- PC 5: Stalled ----- 96994 in-flight CPI 1.7060 -- Total Cycles 165491 ---- Thread 09 ---- PC 5: Stalled ----- 93816 in-flight CPI 1.7636 -- Total Cycles 165491 ---- Thread 10 ---- PC 5: Stalled ----- 96117 in-flight CPI 1.7214 -- Total Cycles 165491 ---- Thread 11 ---- 
PC 5: Stalled ----- 98955 in-flight CPI 1.6720 -- Total Cycles 165491 ---- Thread 12 ---- PC 5: Stalled ----- 100262 in-flight CPI 1.6503 -- Total Cycles 165491 ---- Thread 13 ---- PC 5: Stalled ----- 92191 in-flight CPI 1.7948 -- Total Cycles 165491 ---- Thread 14 ---- PC 5: Stalled ----- 95455 in-flight CPI 1.7334 -- Total Cycles 165491 ---- Thread 15 ---- PC 5: Stalled ----- 94759 in-flight CPI 1.7461 -- Total Cycles 165491 ---- Thread 16 ---- PC 5: Stalled ----- 96869 in-flight CPI 1.7081 -- Total Cycles 165491 ---- Thread 17 ---- PC 5: Stalled ----- 91122 in-flight CPI 1.8158 -- Total Cycles 165491 ---- Thread 18 ---- PC 5: Stalled ----- 89815 in-flight CPI 1.8423 -- Total Cycles 165491 ---- Thread 19 ---- PC 5: Stalled ----- 91926 in-flight CPI 1.7999 -- Total Cycles 165491 ---- Thread 20 ---- PC 5: Stalled ----- 92287 in-flight CPI 1.7928 -- Total Cycles 165491 ---- Thread 21 ---- PC 5: Stalled ----- 93661 in-flight CPI 1.7665 -- Total Cycles 165491 ---- Thread 22 ---- PC 5: Stalled ----- 92973 in-flight CPI 1.7797 -- Total Cycles 165491 ---- Thread 23 ---- PC 5: Stalled ----- 92888 in-flight CPI 1.7812 -- Total Cycles 165491 ---- Thread 24 ---- PC 5: Stalled ----- 91737 in-flight CPI 1.8036 -- Total Cycles 165491 ---- Thread 25 ---- PC 5: Stalled ----- 92343 in-flight CPI 1.7918 -- Total Cycles 165491 ---- Thread 26 ---- PC 5: Stalled ----- 92597 in-flight CPI 1.7869 -- Total Cycles 165491 ---- Thread 27 ---- PC 5: Stalled ----- 115227 in-flight CPI 1.4361 -- Total Cycles 165491 ---- Thread 28 ---- PC 5: Stalled ----- 89090 in-flight CPI 1.8574 -- Total Cycles 165491 ---- Thread 29 ---- PC 5: Stalled ----- 91766 in-flight CPI 1.8030 -- Total Cycles 165491 ---- Thread 30 ---- PC 5: Stalled ----- 90898 in-flight CPI 1.8202 -- Total Cycles 165491 ---- Thread 31 ---- PC 5: Stalled ----- 87085 in-flight CPI 1.9000 -- Total Cycles 165491 Total CPI 0.0544 , IPC 18.3819 -- Total Cycles 165491 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 
18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 9208 (3.887528%) FPSUB: 0 (0.000000%) FPMUL: 34344 (14.499704%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 97700 (41.247995%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 5314 (2.243519%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) 
bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 82683 (34.907963%) DIV: 7349 (3.102677%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.110614%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3333554 total) ADD%: 7.364 (245487) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.512 (50415) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.634 (21126) FPSUB%: 0.000 (0) FPMUL%: 5.015 (167169) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.246 (174868) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (565) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.095 (36487) FPLE%: 0.449 (14965) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.744 (91469) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.770 (25680) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.602 (520092) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) 
RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.161 (38699) ORI%: 1.618 (53950) XORI%: 0.000 (0) MULI%: 3.140 (104662) LW%: 1.107 (36906) LWI%: 13.313 (443791) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.282 (9402) SWI%: 4.005 (133512) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.370 (45662) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.307 (10248) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.069 (2294) bned%: 0.000 (0) bneid%: 13.707 (456927) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.706 (23521) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.142 (4730) DIV%: 0.012 (398) FPUN%: 1.453 (48423) FPRSUB%: 4.409 (146961) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (58) FPGT%: 2.910 (97021) FPGE%: 1.004 (33458) SYNC%: 0.000 (0) NOP%: 8.743 (291463) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 12 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 13 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 388 LOAD 41303 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 10 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1371 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 47881 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 13184 XORI 0 MULI 8512 LW 0 LWI 140941 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 85 DIV 26 FPUN 0 FPRSUB 61 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average 
#threads Issuing each cycle: 18.3821 --Total thread-cycles: 5295712 --total thread-cycles issued: 3042091 (57.444419%) --iCache conflicts: 108689 (2.052396%) --thread*cycles of FU dependence: 253838 (4.793274%) --thread*cycles of data dependence: 236860 (4.472675%) --iCache cycles*banks: 5295712 (62.948778% used) Issue breakdown: --thread*cycles of issue worked: 3042091 (57.444419%) --thread*cycles of issue failed: 1962158 (37.051826%) --thread*cycles of issue NOP/other: 291463 (5.503755%) Number of thread-cycles not ready: 236860 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3333554 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 5 2: 8 3: 6 4: 8 5: 7 6: 7 7: 6 8: 6 9: 8 10: 8 11: 9 12: 8 13: 7 14: 7 15: 7 16: 7 17: 7 18: 6 19: 8 20: 8 21: 8 22: 7 23: 8 24: 8 25: 7 26: 7 27: 6 28: 5 29: 8 30: 8 31: 7 ## Core 0 ## Module Utilization FP AddSub: 15.58 FP MinMax: 0.03 FP Compare: 5.72 Int AddSub: 21.88 FP Mul: 15.61 Int Mul: 42.22 FP InvSqrt: 0.47 FP Div: 3.52 Conversion Unit: 0.02 ## Core 1 ## Module Utilization FP AddSub: 15.45 FP MinMax: 0.03 FP Compare: 5.56 Int AddSub: 21.30 FP Mul: 15.44 Int Mul: 40.95 FP InvSqrt: 0.45 FP Div: 3.55 Conversion Unit: 0.02 ## Core 2 ## Module Utilization FP AddSub: 14.12 FP MinMax: 0.03 FP Compare: 5.02 Int AddSub: 19.21 FP Mul: 14.10 Int Mul: 36.83 FP InvSqrt: 0.41 FP Div: 3.32 Conversion Unit: 0.01 ## Core 3 ## Module Utilization FP AddSub: 15.52 FP MinMax: 0.03 FP Compare: 5.66 Int AddSub: 21.68 FP Mul: 15.57 Int Mul: 41.58 FP InvSqrt: 0.46 FP Div: 3.52 Conversion Unit: 0.02 ## Core 4 ## Module Utilization FP AddSub: 12.87 FP MinMax: 0.02 FP Compare: 4.65 Int AddSub: 17.82 FP Mul: 12.89 Int Mul: 34.26 FP InvSqrt: 0.38 FP Div: 2.95 Conversion Unit: 0.01 ## Core 5 ## Module Utilization FP AddSub: 15.28 FP MinMax: 0.03 FP Compare: 5.69 Int AddSub: 21.84 FP Mul: 15.34 Int Mul: 42.16 FP InvSqrt: 0.48 FP Div: 3.39 Conversion Unit: 0.02 ## Core 6 ## Module Utilization FP AddSub: 
13.29 FP MinMax: 0.02 FP Compare: 4.77 Int AddSub: 18.28 FP Mul: 13.25 Int Mul: 35.15 FP InvSqrt: 0.37 FP Div: 3.04 Conversion Unit: 0.01 ## Core 7 ## Module Utilization FP AddSub: 15.76 FP MinMax: 0.03 FP Compare: 5.73 Int AddSub: 21.92 FP Mul: 15.77 Int Mul: 42.10 FP InvSqrt: 0.47 FP Div: 3.62 Conversion Unit: 0.02 ## Core 8 ## Module Utilization FP AddSub: 14.99 FP MinMax: 0.03 FP Compare: 5.28 Int AddSub: 20.25 FP Mul: 14.93 Int Mul: 38.46 FP InvSqrt: 0.40 FP Div: 3.52 Conversion Unit: 0.01 ## Core 9 ## Module Utilization FP AddSub: 15.36 FP MinMax: 0.03 FP Compare: 5.68 Int AddSub: 21.63 FP Mul: 15.45 Int Mul: 41.74 FP InvSqrt: 0.48 FP Div: 3.44 Conversion Unit: 0.02 ## Core 10 ## Module Utilization FP AddSub: 11.89 FP MinMax: 0.02 FP Compare: 4.22 Int AddSub: 16.16 FP Mul: 11.86 Int Mul: 31.00 FP InvSqrt: 0.34 FP Div: 2.79 Conversion Unit: 0.01 ## Core 11 ## Module Utilization FP AddSub: 14.41 FP MinMax: 0.03 FP Compare: 5.20 Int AddSub: 19.98 FP Mul: 14.42 Int Mul: 38.40 FP InvSqrt: 0.42 FP Div: 3.29 Conversion Unit: 0.01 ## Core 12 ## Module Utilization FP AddSub: 12.77 FP MinMax: 0.02 FP Compare: 4.50 Int AddSub: 17.26 FP Mul: 12.74 Int Mul: 32.99 FP InvSqrt: 0.36 FP Div: 3.00 Conversion Unit: 0.01 ## Core 13 ## Module Utilization FP AddSub: 15.29 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.51 FP Mul: 15.33 Int Mul: 41.51 FP InvSqrt: 0.46 FP Div: 3.42 Conversion Unit: 0.02 ## Core 14 ## Module Utilization FP AddSub: 15.55 FP MinMax: 0.03 FP Compare: 5.68 Int AddSub: 21.67 FP Mul: 15.58 Int Mul: 41.65 FP InvSqrt: 0.45 FP Div: 3.53 Conversion Unit: 0.02 ## Core 15 ## Module Utilization FP AddSub: 15.87 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.50 FP Mul: 15.82 Int Mul: 41.16 FP InvSqrt: 0.43 FP Div: 3.69 Conversion Unit: 0.01 ## Core 16 ## Module Utilization FP AddSub: 15.24 FP MinMax: 0.03 FP Compare: 5.54 Int AddSub: 21.20 FP Mul: 15.27 Int Mul: 40.79 FP InvSqrt: 0.45 FP Div: 3.45 Conversion Unit: 0.02 ## Core 17 ## Module Utilization FP AddSub: 
13.63 FP MinMax: 0.03 FP Compare: 5.11 Int AddSub: 19.60 FP Mul: 13.69 Int Mul: 37.89 FP InvSqrt: 0.41 FP Div: 2.97 Conversion Unit: 0.01 ## Core 18 ## Module Utilization FP AddSub: 13.97 FP MinMax: 0.03 FP Compare: 5.14 Int AddSub: 19.62 FP Mul: 14.02 Int Mul: 37.79 FP InvSqrt: 0.42 FP Div: 3.14 Conversion Unit: 0.01 ## Core 19 ## Module Utilization FP AddSub: 14.38 FP MinMax: 0.03 FP Compare: 5.12 Int AddSub: 19.58 FP Mul: 14.36 Int Mul: 37.58 FP InvSqrt: 0.40 FP Div: 3.34 Conversion Unit: 0.01 ## Core 20 ## Module Utilization FP AddSub: 15.40 FP MinMax: 0.03 FP Compare: 5.78 Int AddSub: 22.19 FP Mul: 15.50 Int Mul: 42.95 FP InvSqrt: 0.48 FP Div: 3.33 Conversion Unit: 0.02 ## Core 21 ## Module Utilization FP AddSub: 14.51 FP MinMax: 0.03 FP Compare: 5.42 Int AddSub: 20.72 FP Mul: 14.59 Int Mul: 39.98 FP InvSqrt: 0.45 FP Div: 3.22 Conversion Unit: 0.02 ## Core 22 ## Module Utilization FP AddSub: 15.08 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.71 FP Mul: 15.18 Int Mul: 41.84 FP InvSqrt: 0.46 FP Div: 3.30 Conversion Unit: 0.02 ## Core 23 ## Module Utilization FP AddSub: 15.41 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.46 FP Mul: 15.44 Int Mul: 41.26 FP InvSqrt: 0.45 FP Div: 3.50 Conversion Unit: 0.02 ## Core 24 ## Module Utilization FP AddSub: 15.20 FP MinMax: 0.03 FP Compare: 5.59 Int AddSub: 21.43 FP Mul: 15.24 Int Mul: 41.33 FP InvSqrt: 0.45 FP Div: 3.38 Conversion Unit: 0.02 ## Core 25 ## Module Utilization FP AddSub: 15.09 FP MinMax: 0.03 FP Compare: 5.66 Int AddSub: 21.67 FP Mul: 15.18 Int Mul: 41.91 FP InvSqrt: 0.47 FP Div: 3.29 Conversion Unit: 0.02 ## Core 26 ## Module Utilization FP AddSub: 15.33 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.74 FP Mul: 15.38 Int Mul: 41.91 FP InvSqrt: 0.46 FP Div: 3.40 Conversion Unit: 0.02 ## Core 27 ## Module Utilization FP AddSub: 13.35 FP MinMax: 0.02 FP Compare: 4.63 Int AddSub: 17.77 FP Mul: 13.27 Int Mul: 33.77 FP InvSqrt: 0.36 FP Div: 3.21 Conversion Unit: 0.01 ## Core 28 ## Module Utilization FP 
AddSub: 15.90 FP MinMax: 0.03 FP Compare: 5.75 Int AddSub: 21.93 FP Mul: 15.91 Int Mul: 42.05 FP InvSqrt: 0.47 FP Div: 3.67 Conversion Unit: 0.02 ## Core 29 ## Module Utilization FP AddSub: 13.21 FP MinMax: 0.03 FP Compare: 4.81 Int AddSub: 18.53 FP Mul: 13.24 Int Mul: 35.73 FP InvSqrt: 0.40 FP Div: 2.96 Conversion Unit: 0.01 ## Core 30 ## Module Utilization FP AddSub: 15.18 FP MinMax: 0.03 FP Compare: 5.59 Int AddSub: 21.42 FP Mul: 15.22 Int Mul: 41.34 FP InvSqrt: 0.46 FP Div: 3.40 Conversion Unit: 0.02 ## Core 31 ## Module Utilization FP AddSub: 13.13 FP MinMax: 0.03 FP Compare: 4.69 Int AddSub: 18.02 FP Mul: 13.12 Int Mul: 34.45 FP InvSqrt: 0.39 FP Div: 3.07 Conversion Unit: 0.01 ## Core 32 ## Module Utilization FP AddSub: 15.88 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.65 FP Mul: 15.87 Int Mul: 41.37 FP InvSqrt: 0.44 FP Div: 3.68 Conversion Unit: 0.02 ## Core 33 ## Module Utilization FP AddSub: 16.00 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.51 FP Mul: 15.96 Int Mul: 41.14 FP InvSqrt: 0.44 FP Div: 3.77 Conversion Unit: 0.02 ## Core 34 ## Module Utilization FP AddSub: 15.56 FP MinMax: 0.03 FP Compare: 5.58 Int AddSub: 21.50 FP Mul: 15.56 Int Mul: 41.27 FP InvSqrt: 0.46 FP Div: 3.58 Conversion Unit: 0.02 ## Core 35 ## Module Utilization FP AddSub: 15.68 FP MinMax: 0.03 FP Compare: 5.70 Int AddSub: 21.79 FP Mul: 15.71 Int Mul: 41.93 FP InvSqrt: 0.46 FP Div: 3.58 Conversion Unit: 0.02 ## Core 36 ## Module Utilization FP AddSub: 15.65 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.55 FP Mul: 15.67 Int Mul: 41.27 FP InvSqrt: 0.46 FP Div: 3.61 Conversion Unit: 0.02 ## Core 37 ## Module Utilization FP AddSub: 12.39 FP MinMax: 0.02 FP Compare: 4.42 Int AddSub: 16.94 FP Mul: 12.37 Int Mul: 32.45 FP InvSqrt: 0.35 FP Div: 2.88 Conversion Unit: 0.01 ## Core 38 ## Module Utilization FP AddSub: 15.86 FP MinMax: 0.03 FP Compare: 5.68 Int AddSub: 21.75 FP Mul: 15.87 Int Mul: 41.55 FP InvSqrt: 0.46 FP Div: 3.70 Conversion Unit: 0.02 ## Core 39 ## Module Utilization 
FP AddSub: 15.65 FP MinMax: 0.03 FP Compare: 5.69 Int AddSub: 21.79 FP Mul: 15.67 Int Mul: 41.87 FP InvSqrt: 0.47 FP Div: 3.56 Conversion Unit: 0.02 ## Core 40 ## Module Utilization FP AddSub: 15.30 FP MinMax: 0.03 FP Compare: 5.68 Int AddSub: 21.76 FP Mul: 15.36 Int Mul: 42.09 FP InvSqrt: 0.47 FP Div: 3.39 Conversion Unit: 0.02 ## Core 41 ## Module Utilization FP AddSub: 14.22 FP MinMax: 0.03 FP Compare: 5.20 Int AddSub: 19.98 FP Mul: 14.23 Int Mul: 38.36 FP InvSqrt: 0.43 FP Div: 3.24 Conversion Unit: 0.01 ## Core 42 ## Module Utilization FP AddSub: 14.87 FP MinMax: 0.03 FP Compare: 5.14 Int AddSub: 19.70 FP Mul: 14.78 Int Mul: 37.50 FP InvSqrt: 0.40 FP Div: 3.59 Conversion Unit: 0.01 ## Core 43 ## Module Utilization FP AddSub: 15.73 FP MinMax: 0.03 FP Compare: 5.71 Int AddSub: 21.73 FP Mul: 15.77 Int Mul: 41.74 FP InvSqrt: 0.48 FP Div: 3.61 Conversion Unit: 0.02 ## Core 44 ## Module Utilization FP AddSub: 14.81 FP MinMax: 0.03 FP Compare: 5.39 Int AddSub: 20.66 FP Mul: 14.83 Int Mul: 39.70 FP InvSqrt: 0.43 FP Div: 3.36 Conversion Unit: 0.01 ## Core 45 ## Module Utilization FP AddSub: 15.07 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.75 FP Mul: 15.14 Int Mul: 42.03 FP InvSqrt: 0.45 FP Div: 3.25 Conversion Unit: 0.02 ## Core 46 ## Module Utilization FP AddSub: 15.43 FP MinMax: 0.03 FP Compare: 5.74 Int AddSub: 22.02 FP Mul: 15.49 Int Mul: 42.37 FP InvSqrt: 0.47 FP Div: 3.43 Conversion Unit: 0.02 ## Core 47 ## Module Utilization FP AddSub: 15.45 FP MinMax: 0.03 FP Compare: 5.57 Int AddSub: 21.31 FP Mul: 15.47 Int Mul: 41.01 FP InvSqrt: 0.46 FP Div: 3.54 Conversion Unit: 0.02 ## Core 48 ## Module Utilization FP AddSub: 13.65 FP MinMax: 0.03 FP Compare: 4.98 Int AddSub: 19.07 FP Mul: 13.66 Int Mul: 36.68 FP InvSqrt: 0.40 FP Div: 3.10 Conversion Unit: 0.01 ## Core 49 ## Module Utilization FP AddSub: 14.82 FP MinMax: 0.03 FP Compare: 5.70 Int AddSub: 21.89 FP Mul: 14.99 Int Mul: 42.38 FP InvSqrt: 0.47 FP Div: 3.11 Conversion Unit: 0.02 ## Core 50 ## Module 
Utilization FP AddSub: 15.94 FP MinMax: 0.03 FP Compare: 5.54 Int AddSub: 21.17 FP Mul: 15.89 Int Mul: 40.30 FP InvSqrt: 0.44 FP Div: 3.85 Conversion Unit: 0.02 ## Core 51 ## Module Utilization FP AddSub: 13.68 FP MinMax: 0.03 FP Compare: 4.93 Int AddSub: 18.88 FP Mul: 13.69 Int Mul: 36.33 FP InvSqrt: 0.40 FP Div: 3.13 Conversion Unit: 0.01 ## Core 52 ## Module Utilization FP AddSub: 15.80 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.72 FP Mul: 15.79 Int Mul: 41.83 FP InvSqrt: 0.47 FP Div: 3.66 Conversion Unit: 0.02 ## Core 53 ## Module Utilization FP AddSub: 15.89 FP MinMax: 0.03 FP Compare: 5.71 Int AddSub: 21.83 FP Mul: 15.92 Int Mul: 41.90 FP InvSqrt: 0.46 FP Div: 3.65 Conversion Unit: 0.02 ## Core 54 ## Module Utilization FP AddSub: 15.16 FP MinMax: 0.03 FP Compare: 5.55 Int AddSub: 21.23 FP Mul: 15.22 Int Mul: 40.82 FP InvSqrt: 0.44 FP Div: 3.42 Conversion Unit: 0.02 ## Core 55 ## Module Utilization FP AddSub: 14.64 FP MinMax: 0.03 FP Compare: 5.23 Int AddSub: 20.05 FP Mul: 14.63 Int Mul: 38.29 FP InvSqrt: 0.41 FP Div: 3.40 Conversion Unit: 0.01 ## Core 56 ## Module Utilization FP AddSub: 15.75 FP MinMax: 0.03 FP Compare: 5.69 Int AddSub: 21.79 FP Mul: 15.80 Int Mul: 41.75 FP InvSqrt: 0.47 FP Div: 3.61 Conversion Unit: 0.02 ## Core 57 ## Module Utilization FP AddSub: 15.52 FP MinMax: 0.03 FP Compare: 5.64 Int AddSub: 21.57 FP Mul: 15.50 Int Mul: 41.41 FP InvSqrt: 0.44 FP Div: 3.54 Conversion Unit: 0.02 ## Core 58 ## Module Utilization FP AddSub: 15.25 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.50 FP Mul: 15.28 Int Mul: 41.49 FP InvSqrt: 0.45 FP Div: 3.40 Conversion Unit: 0.02 ## Core 59 ## Module Utilization FP AddSub: 15.47 FP MinMax: 0.03 FP Compare: 5.76 Int AddSub: 22.09 FP Mul: 15.55 Int Mul: 42.50 FP InvSqrt: 0.48 FP Div: 3.44 Conversion Unit: 0.02 ## Core 60 ## Module Utilization FP AddSub: 15.61 FP MinMax: 0.03 FP Compare: 5.73 Int AddSub: 21.83 FP Mul: 15.67 Int Mul: 42.27 FP InvSqrt: 0.48 FP Div: 3.54 Conversion Unit: 0.02 ## Core 61 ## 
Module Utilization FP AddSub: 15.27 FP MinMax: 0.03 FP Compare: 5.70 Int AddSub: 21.77 FP Mul: 15.35 Int Mul: 42.10 FP InvSqrt: 0.46 FP Div: 3.34 Conversion Unit: 0.02 ## Core 62 ## Module Utilization FP AddSub: 15.48 FP MinMax: 0.03 FP Compare: 5.66 Int AddSub: 21.66 FP Mul: 15.51 Int Mul: 41.61 FP InvSqrt: 0.48 FP Div: 3.55 Conversion Unit: 0.02 ## Core 63 ## Module Utilization FP AddSub: 16.05 FP MinMax: 0.03 FP Compare: 5.71 Int AddSub: 21.83 FP Mul: 16.05 Int Mul: 41.91 FP InvSqrt: 0.46 FP Div: 3.75 Conversion Unit: 0.02 ## Core 64 ## Module Utilization FP AddSub: 15.03 FP MinMax: 0.03 FP Compare: 5.60 Int AddSub: 21.46 FP Mul: 15.09 Int Mul: 41.39 FP InvSqrt: 0.45 FP Div: 3.30 Conversion Unit: 0.02 ## Core 65 ## Module Utilization FP AddSub: 15.09 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.54 FP Mul: 15.18 Int Mul: 41.62 FP InvSqrt: 0.46 FP Div: 3.31 Conversion Unit: 0.02 ## Core 66 ## Module Utilization FP AddSub: 13.54 FP MinMax: 0.03 FP Compare: 4.96 Int AddSub: 18.98 FP Mul: 13.60 Int Mul: 36.53 FP InvSqrt: 0.40 FP Div: 3.04 Conversion Unit: 0.01 ## Core 67 ## Module Utilization FP AddSub: 15.81 FP MinMax: 0.03 FP Compare: 5.71 Int AddSub: 21.84 FP Mul: 15.81 Int Mul: 42.03 FP InvSqrt: 0.48 FP Div: 3.66 Conversion Unit: 0.02 ## Core 68 ## Module Utilization FP AddSub: 14.82 FP MinMax: 0.03 FP Compare: 5.35 Int AddSub: 20.43 FP Mul: 14.82 Int Mul: 39.13 FP InvSqrt: 0.43 FP Div: 3.43 Conversion Unit: 0.01 ## Core 69 ## Module Utilization FP AddSub: 15.32 FP MinMax: 0.03 FP Compare: 5.60 Int AddSub: 21.39 FP Mul: 15.34 Int Mul: 41.12 FP InvSqrt: 0.44 FP Div: 3.45 Conversion Unit: 0.02 ## Core 70 ## Module Utilization FP AddSub: 15.84 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.71 FP Mul: 15.84 Int Mul: 41.64 FP InvSqrt: 0.45 FP Div: 3.65 Conversion Unit: 0.02 ## Core 71 ## Module Utilization FP AddSub: 15.56 FP MinMax: 0.03 FP Compare: 5.66 Int AddSub: 21.74 FP Mul: 15.59 Int Mul: 41.79 FP InvSqrt: 0.46 FP Div: 3.52 Conversion Unit: 0.02 ## Core 72 
## Module Utilization FP AddSub: 13.42 FP MinMax: 0.03 FP Compare: 4.84 Int AddSub: 18.63 FP Mul: 13.41 Int Mul: 35.81 FP InvSqrt: 0.40 FP Div: 3.07 Conversion Unit: 0.01 ## Core 73 ## Module Utilization FP AddSub: 15.39 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.63 FP Mul: 15.44 Int Mul: 41.64 FP InvSqrt: 0.46 FP Div: 3.46 Conversion Unit: 0.02 ## Core 74 ## Module Utilization FP AddSub: 15.32 FP MinMax: 0.03 FP Compare: 5.79 Int AddSub: 22.16 FP Mul: 15.42 Int Mul: 42.90 FP InvSqrt: 0.48 FP Div: 3.32 Conversion Unit: 0.02 ## Core 75 ## Module Utilization FP AddSub: 15.35 FP MinMax: 0.03 FP Compare: 5.64 Int AddSub: 21.63 FP Mul: 15.40 Int Mul: 41.59 FP InvSqrt: 0.45 FP Div: 3.43 Conversion Unit: 0.02 ## Core 76 ## Module Utilization FP AddSub: 15.32 FP MinMax: 0.03 FP Compare: 5.72 Int AddSub: 21.93 FP Mul: 15.38 Int Mul: 42.30 FP InvSqrt: 0.46 FP Div: 3.36 Conversion Unit: 0.02 ## Core 77 ## Module Utilization FP AddSub: 15.50 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.78 FP Mul: 15.54 Int Mul: 41.84 FP InvSqrt: 0.46 FP Div: 3.50 Conversion Unit: 0.02 ## Core 78 ## Module Utilization FP AddSub: 15.05 FP MinMax: 0.03 FP Compare: 5.64 Int AddSub: 21.68 FP Mul: 15.11 Int Mul: 41.89 FP InvSqrt: 0.45 FP Div: 3.25 Conversion Unit: 0.02 ## Core 79 ## Module Utilization FP AddSub: 12.70 FP MinMax: 0.02 FP Compare: 4.35 Int AddSub: 16.67 FP Mul: 12.63 Int Mul: 31.68 FP InvSqrt: 0.34 FP Div: 3.10 Conversion Unit: 0.01 L1 accesses: 13905006 L1 hits: 13147537 L1 misses: 757469 L1 bank conflicts: 2973081 L1 stores: 49152 L1 near hit: 0 L1 hit rate: 0.945525 -= L2 #0 =- L2 accesses: 191714 L2 hits: 165182 L2 misses: 26532 L2 stores: 12219 L2 bank conflicts: 23992 L2 hit rate: 0.861606 L2 memory faults: 545 L2 bandwidth limited stalls: 42281 -= L2 #1 =- L2 accesses: 190365 L2 hits: 164508 L2 misses: 25857 L2 stores: 12285 L2 bank conflicts: 24113 L2 hit rate: 0.864171 L2 memory faults: 471 L2 bandwidth limited stalls: 39293 -= L2 #2 =- L2 accesses: 187546 L2 hits: 
161741
L2 misses: 25805
L2 stores: 12297
L2 bank conflicts: 23577
L2 hit rate: 0.862407
L2 memory faults: 522
L2 bandwidth limited stalls: 40051
-= L2 #3 =-
L2 accesses: 185831
L2 hits: 160937
L2 misses: 24894
L2 stores: 12351
L2 bank conflicts: 23030
L2 hit rate: 0.866040
L2 memory faults: 475
L2 bandwidth limited stalls: 37882
Bandwidth numbers for 1000MHz clock:
register to L1 bandwidth: 315784231234.351320
L1 to L2 bandwidth: 549007670340.027160
L2 to memory bandwidth: 74916477888.868073
Core size: 0.9818
L2 size: 0.0000
4-L2 size: 0.0000
80-core chip size: 78.5458
FPS Statistics:
FPS assuming 1000MHz clock: 5677.5278
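The reported cache hit rates appear to be hits divided by accesses, printed to six decimal places. The sketch below is a hypothetical post-processing script, not part of the simulator. It re-derives each hit rate from the L1 and L2 counters in this log and compares it with the printed figure. The names `LOG_EXCERPT`, `cache_sections`, and `check_hit_rates` are made up for illustration, and the excerpt is copied from the cache summary of this run.

```python
import re

# Cache summary copied from the log above (flattened, as it appears here).
LOG_EXCERPT = (
    "L1 accesses: 13905006 L1 hits: 13147537 L1 misses: 757469 "
    "L1 bank conflicts: 2973081 L1 stores: 49152 L1 near hit: 0 "
    "L1 hit rate: 0.945525 "
    "-= L2 #0 =- L2 accesses: 191714 L2 hits: 165182 L2 misses: 26532 "
    "L2 stores: 12219 L2 bank conflicts: 23992 L2 hit rate: 0.861606 "
    "L2 memory faults: 545 L2 bandwidth limited stalls: 42281 "
    "-= L2 #1 =- L2 accesses: 190365 L2 hits: 164508 L2 misses: 25857 "
    "L2 stores: 12285 L2 bank conflicts: 24113 L2 hit rate: 0.864171 "
    "L2 memory faults: 471 L2 bandwidth limited stalls: 39293 "
    "-= L2 #2 =- L2 accesses: 187546 L2 hits: 161741 L2 misses: 25805 "
    "L2 stores: 12297 L2 bank conflicts: 23577 L2 hit rate: 0.862407 "
    "L2 memory faults: 522 L2 bandwidth limited stalls: 40051 "
    "-= L2 #3 =- L2 accesses: 185831 L2 hits: 160937 L2 misses: 24894 "
    "L2 stores: 12351 L2 bank conflicts: 23030 L2 hit rate: 0.866040 "
    "L2 memory faults: 475 L2 bandwidth limited stalls: 37882"
)


def cache_sections(text):
    """Yield (label, fields) for the L1 block and each '-= L2 #n =-' block."""
    # re.split with a capture group keeps the banner labels at odd indices.
    pieces = re.split(r"-= (L2 #\d+) =-", text)
    labels = ["L1"] + pieces[1::2]
    bodies = [pieces[0]] + pieces[2::2]
    for label, body in zip(labels, bodies):
        # Field names look like "L1 hits" or "L2 bank conflicts".
        pairs = re.findall(r"(L[12] [a-z ]+): ([\d.]+)", body)
        # Drop the "L1 "/"L2 " prefix so both levels share the same keys.
        yield label, {name[3:]: float(value) for name, value in pairs}


def check_hit_rates(text):
    """Return (label, recomputed, reported, matches) for every cache block."""
    rows = []
    for label, f in cache_sections(text):
        recomputed = round(f["hits"] / f["accesses"], 6)
        reported = f["hit rate"]
        rows.append((label, recomputed, reported, abs(recomputed - reported) < 1e-9))
    return rows


report = check_hit_rates(LOG_EXCERPT)
for label, recomputed, reported, ok in report:
    print(f"{label}: recomputed {recomputed:.6f}, reported {reported:.6f}, match={ok}")
```

For this run, every block's hits and misses also sum to its access count, so the miss rate is the complement of the hit rate.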