--load-assembly ../LLVM_Trax/examples/project4_noInh/rt-llvm.s --model test_models/conference.obj --view-file views/conference.view --light-file lights/conference.light --num-cores 20 --num-thread-procs 32 --num-l2s 4 --num-icaches 2 --num-icache-banks 16 No configuration sepcified, using default Loading core 0. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 1. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 2. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 3. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 4. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 5. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 6. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 7. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 8. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 9. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 10. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 11. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 12. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 13. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 14. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 15. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 16. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 17. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 18. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 19. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 20. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 21. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 22. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 23. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 24. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 25. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 26. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 27. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 28. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 29. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 30. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 31. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 32. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 33. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 34. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 35. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 36. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 37. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 38. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 39. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 40. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 41. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 42. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 43. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 44. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 45. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 46. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 47. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 48. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 49. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 50. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 51. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 52. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 53. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 54. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 55. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 56. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 57. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 58. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 59. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 60. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 61. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 62. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 63. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 64. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 65. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 66. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 67. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 68. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 69. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 70. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 71. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 72. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 73. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 74. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 75. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 76. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 77. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 78. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Loading core 79. Found Unit FPADD with latency 2 and issue width 8 Found Unit FPMIN with latency 1 and issue width 32 Found Unit FPCMP with latency 1 and issue width 32 Found Unit INTADD with latency 1 and issue width 32 Found Unit FPMUL with latency 2 and issue width 8 Found Unit INTMUL with latency 1 and issue width 2 Found Unit FPINV with latency 20 and issue width 1 Found Unit CONV with latency 1 and issue width 32 Found Unit BLT with latency 1 and issue width 32 Found Unit BITWISE with latency 1 and issue width 32 Found Unit SPHERE with latency 40 and issue width 4 Found Unit DEBUG with latency 1 and issue width 100 Size estimate (HW config): 0.3588 Center is 1.5233 1.618 1.7711 Corner is 1.9101 3.20911 0.248412 Across is 1.25037 -1.56095 0 Up is 0.523201 0.419102 1.88431 U is 0.360963 -0.450621 0 V is 0.15104 0.120988 0.543969 radius is 0 Work queue starts at 30 (0x0000001e) FB starts at 32 (0x00000020) FB ends at 49183 (0x0000c01f) loading model test_models/conference.obj MTL file: "test_models/conference.mtl" loading material file test_models/conference.mtl Found 43 total materials Found 282664 total triangles vertex min/max = x: (-0.177790, 11.125200) y: (-0.164592, 7.010400) z: (-0.005078, 2.712720) Materials start at 49184 (0x0000c020) Materials end at 50284 (0x0000c46c) Starting BVH build. BVH build complete with 266189 nodes. Scene starts at 50285 (0x0000c46d) BVH bounds [-0.177790 -0.164592 -0.005078] [11.125200 7.010400 2.712720] Triangles start at 2179800 (0x002142d8) Scene ends at 11326495 (0x00acd41f) Starting camera at 11326496 (0x00acd420) Camera ended at 11326518 (0x00acd436) Background Color 0x00acd437 to 0x00acd439 Light at 0x00acd43a to 0x00acd43c Permutation table from 0x00acd43d to 0x00acd63c Hammersley table from 0x00acd63d to 0x00acd83c Memory used: 11327549 (0x00acd83d) Image size: 49152 start_wq: 30 start_fb: 32 start_scene: 50288 start_camera: 11326496 start_matls: 49184 start_bg_color: 11326519 start_light: 11326522 start_permutation: 11326525 Loading assembly file ../LLVM_Trax/examples/project4_noInh/rt-llvm.s using 36 registers Number of instructions: 1261 Creating thread 0... Creating thread 1... Creating thread 2... Creating thread 3... Creating thread 4... Creating thread 5... Creating thread 6... Core 0 running... Core 1 running... Core 2 running... Core 3 running... Core 4 running... Core 5 running... Core 6 running... Creating thread 7... Creating thread 8... Creating thread 9... Creating thread 10... Creating thread 11... Creating thread 12... Creating thread 13... Core 7 running... Core 8 running... Core 9 running... Core 10 running... Core 11 running... Core 12 running... Core 13 running... Creating thread 14... Creating thread 15... Creating thread 16... Creating thread 17... Creating thread 18... Core 16 running... Core 15 running... Core 14 running... Core 17 running... Creating thread 19... Creating thread 20... Creating thread 21... Creating thread 22... Creating thread 23... Creating thread 24... Creating thread 25... Creating thread 26... Creating thread 27... Core 18 running... Core 19 running... Core 20 running... Core 21 running... Core 22 running... Core 23 running... Core 24 running... Core 25 running... Core 26 running... Core 27 running... Creating thread 28... Creating thread 29... Creating thread 30... Creating thread 31... Creating thread 32... Creating thread 33... Creating thread 34... Core 28 running... Core 29 running... Core 30 running... Core 31 running... Core 32 running... Core 33 running... Core 34 running... Creating thread 35... Creating thread 36... Creating thread 37... Creating thread 38... Creating thread 39... Creating thread 40... Creating thread 41... Core 35 running... Core 36 running... Core 37 running... Core 38 running... Core 39 running... Core 40 running... Core 41 running... Creating thread 42... Creating thread 43... Creating thread 44... Creating thread 45... Creating thread 46... Creating thread 47... Creating thread 48... Core 42 running... Core 43 running... Core 44 running... Core 45 running... Core 46 running... Core 47 running... Core 48 running... Creating thread 49... Creating thread 50... Creating thread 51... Creating thread 52... Creating thread 53... Creating thread 54... Creating thread 55... Core 49 running... Core 50 running... Core 51 running... Core 52 running... Core 53 running... Core 54 running... Core 55 running... Creating thread 56... Creating thread 57... Creating thread 58... Creating thread 59... Creating thread 60... Creating thread 61... Creating thread 62... Core 56 running... Core 57 running... Core 58 running... Core 59 running... Core 60 running... Core 61 running... Core 62 running... Creating thread 63... Creating thread 64... Creating thread 65... Creating thread 66... Creating thread 67... Creating thread 68... Creating thread 69... Core 63 running... Core 64 running... Core 65 running... Core 66 running... Core 67 running... Core 68 running... Core 69 running... Creating thread 70... Creating thread 71... Creating thread 72... Creating thread 73... Creating thread 74... Creating thread 75... Creating thread 76... Core 70 running... Core 71 running... Core 76 running... Core 72 running... Core 75 running... Core 73 running... Creating thread 77... Core 74 running... Creating thread 78... Creating thread 79... Core 77 running... Core 78 running... Core 79 running... <=== Core 0 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99916 in-flight CPI 1.3004 -- Total Cycles 129951 ---- Thread 01 ---- PC 5: Stalled ----- 97660 in-flight CPI 1.3304 -- Total Cycles 129951 ---- Thread 02 ---- PC 5: Stalled ----- 92294 in-flight CPI 1.4078 -- Total Cycles 129951 ---- Thread 03 ---- PC 5: Stalled ----- 100301 in-flight CPI 1.2954 -- Total Cycles 129951 ---- Thread 04 ---- PC 5: Stalled ----- 95877 in-flight CPI 1.3552 -- Total Cycles 129951 ---- Thread 05 ---- PC 5: Stalled ----- 100102 in-flight CPI 1.2979 -- Total Cycles 129951 ---- Thread 06 ---- PC 5: Stalled ----- 100765 in-flight CPI 1.2894 -- Total Cycles 129951 ---- Thread 07 ---- PC 5: Stalled ----- 93271 in-flight CPI 1.3931 -- Total Cycles 129951 ---- Thread 08 ---- PC 5: Stalled ----- 101653 in-flight CPI 1.2781 -- Total Cycles 129951 ---- Thread 09 ---- PC 5: Stalled ----- 95815 in-flight CPI 1.3560 -- Total Cycles 129951 ---- Thread 10 ---- PC 5: Stalled ----- 94795 in-flight CPI 1.3706 -- Total Cycles 129951 ---- Thread 11 ---- PC 5: Stalled ----- 92421 in-flight CPI 1.4058 -- Total Cycles 129951 ---- Thread 12 ---- PC 5: Stalled ----- 99484 in-flight CPI 1.3060 -- Total Cycles 129951 ---- Thread 13 ---- PC 5: Stalled ----- 96404 in-flight CPI 1.3478 -- Total Cycles 129951 ---- Thread 14 ---- PC 5: Stalled ----- 100362 in-flight CPI 1.2945 -- Total Cycles 129951 ---- Thread 15 ---- PC 5: Stalled ----- 102165 in-flight CPI 1.2717 -- Total Cycles 129951 ---- Thread 16 ---- PC 5: Stalled ----- 96298 in-flight CPI 1.3492 -- Total Cycles 129951 ---- Thread 17 ---- PC 5: Stalled ----- 96608 in-flight CPI 1.3449 -- Total Cycles 129951 ---- Thread 18 ---- PC 5: Stalled ----- 95451 in-flight CPI 1.3612 -- Total Cycles 129951 ---- Thread 19 ---- PC 5: Stalled ----- 90633 in-flight CPI 1.4335 -- Total Cycles 129951 ---- Thread 20 ---- PC 5: Stalled ----- 91727 in-flight CPI 1.4165 -- Total Cycles 129951 ---- Thread 21 ---- PC 5: Stalled ----- 86258 in-flight CPI 1.5063 -- Total Cycles 129951 ---- Thread 22 ---- PC 5: Stalled ----- 97500 in-flight CPI 1.3326 -- Total Cycles 129951 ---- Thread 23 ---- PC 5: Stalled ----- 92703 in-flight CPI 1.4016 -- Total Cycles 129951 ---- Thread 24 ---- PC 5: Stalled ----- 92663 in-flight CPI 1.4021 -- Total Cycles 129951 ---- Thread 25 ---- PC 5: Stalled ----- 91625 in-flight CPI 1.4180 -- Total Cycles 129951 ---- Thread 26 ---- PC 5: Stalled ----- 91816 in-flight CPI 1.4151 -- Total Cycles 129951 ---- Thread 27 ---- PC 5: Stalled ----- 91837 in-flight CPI 1.4147 -- Total Cycles 129951 ---- Thread 28 ---- PC 5: Stalled ----- 85743 in-flight CPI 1.5154 -- Total Cycles 129951 ---- Thread 29 ---- PC 5: Stalled ----- 91219 in-flight CPI 1.4244 -- Total Cycles 129951 ---- Thread 30 ---- PC 5: Stalled ----- 93076 in-flight CPI 1.3959 -- Total Cycles 129951 ---- Thread 31 ---- PC 5: Stalled ----- 90921 in-flight CPI 1.4290 -- Total Cycles 129951 Total CPI 0.0427 , IPC 23.3928 -- Total Cycles 129951 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7875 (3.981838%) FPSUB: 0 (0.000000%) FPMUL: 31953 (16.156403%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 74517 (37.678043%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4051 (2.048308%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71620 (36.213234%) DIV: 7497 (3.790709%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.131464%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3341455 total) ADD%: 7.184 (240037) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.537 (51374) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.555 (18532) FPSUB%: 0.000 (0) FPMUL%: 4.787 (159952) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.134 (171549) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (579) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35611) FPLE%: 0.457 (15282) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.797 (93473) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.745 (24907) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.672 (523683) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (39117) ORI%: 1.572 (52525) XORI%: 0.000 (0) MULI%: 3.196 (106808) LW%: 1.394 (46576) LWI%: 13.078 (436990) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9589) SWI%: 4.122 (137741) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.397 (46685) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10374) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1944) bned%: 0.000 (0) bneid%: 13.809 (461435) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24086) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4077) DIV%: 0.012 (406) FPUN%: 1.489 (49768) FPRSUB%: 4.217 (140898) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.939 (98214) FPGE%: 1.032 (34486) SYNC%: 0.000 (0) NOP%: 9.023 (301483) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 42 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 159 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 39878 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1180 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49261 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11254 XORI 0 MULI 9402 LW 0 LWI 142371 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 74 DIV 21 FPUN 0 FPRSUB 56 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.3930 --Total thread-cycles: 4158432 --total thread-cycles issued: 3039972 (73.103806%) --iCache conflicts: 114247 (2.747358%) --thread*cycles of FU dependence: 254173 (6.112232%) --thread*cycles of data dependence: 197773 (4.755951%) --iCache cycles*banks: 4158432 (80.354492% used) Issue breakdown: --thread*cycles of issue worked: 3039972 (73.103806%) --thread*cycles of issue failed: 816977 (19.646275%) --thread*cycles of issue NOP/other: 986549693455085 (23724079104.000000%) Number of thread-cycles not ready: 197773 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3341455 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 6 3: 8 4: 7 5: 8 6: 8 7: 6 8: 8 9: 7 10: 8 11: 8 12: 8 13: 7 14: 9 15: 8 16: 7 17: 7 18: 8 19: 8 20: 6 21: 6 22: 7 23: 6 24: 8 25: 8 26: 7 27: 8 28: 6 29: 7 30: 8 31: 7 <=== Core 1 ===> ---- Thread 00 ---- PC 5: Stalled ----- 93812 in-flight CPI 1.6170 -- Total Cycles 151716 ---- Thread 01 ---- PC 5: Stalled ----- 97144 in-flight CPI 1.5615 -- Total Cycles 151716 ---- Thread 02 ---- PC 5: Stalled ----- 99423 in-flight CPI 1.5256 -- Total Cycles 151716 ---- Thread 03 ---- PC 5: Stalled ----- 99710 in-flight CPI 1.5213 -- Total Cycles 151716 ---- Thread 04 ---- PC 5: Stalled ----- 97003 in-flight CPI 1.5638 -- Total Cycles 151716 ---- Thread 05 ---- PC 5: Stalled ----- 92108 in-flight CPI 1.6469 -- Total Cycles 151716 ---- Thread 06 ---- PC 5: Stalled ----- 99151 in-flight CPI 1.5299 -- Total Cycles 151716 ---- Thread 07 ---- PC 5: Stalled ----- 99078 in-flight CPI 1.5310 -- Total Cycles 151716 ---- Thread 08 ---- PC 5: Stalled ----- 101796 in-flight CPI 1.4901 -- Total Cycles 151716 ---- Thread 09 ---- PC 5: Stalled ----- 97428 in-flight CPI 1.5569 -- Total Cycles 151716 ---- Thread 10 ---- PC 5: Stalled ----- 96512 in-flight CPI 1.5717 -- Total Cycles 151716 ---- Thread 11 ---- PC 5: Stalled ----- 97375 in-flight CPI 1.5577 -- Total Cycles 151716 ---- Thread 12 ---- PC 5: Stalled ----- 91238 in-flight CPI 1.6626 -- Total Cycles 151716 ---- Thread 13 ---- PC 5: Stalled ----- 112466 in-flight CPI 1.3488 -- Total Cycles 151716 ---- Thread 14 ---- PC 5: Stalled ----- 97348 in-flight CPI 1.5582 -- Total Cycles 151716 ---- Thread 15 ---- PC 5: Stalled ----- 94552 in-flight CPI 1.6042 -- Total Cycles 151716 ---- Thread 16 ---- PC 5: Stalled ----- 98281 in-flight CPI 1.5434 -- Total Cycles 151716 ---- Thread 17 ---- PC 5: Stalled ----- 99046 in-flight CPI 1.5315 -- Total Cycles 151716 ---- Thread 18 ---- PC 5: Stalled ----- 99231 in-flight CPI 1.5286 -- Total Cycles 151716 ---- Thread 19 ---- PC 5: Stalled ----- 98209 in-flight CPI 1.5445 -- Total Cycles 151716 ---- Thread 20 ---- PC 5: Stalled ----- 91317 in-flight CPI 1.6611 -- Total Cycles 151716 ---- Thread 21 ---- PC 5: Stalled ----- 97516 in-flight CPI 1.5555 -- Total Cycles 151716 ---- Thread 22 ---- PC 5: Stalled ----- 98878 in-flight CPI 1.5340 -- Total Cycles 151716 ---- Thread 23 ---- PC 5: Stalled ----- 97772 in-flight CPI 1.5514 -- Total Cycles 151716 ---- Thread 24 ---- PC 5: Stalled ----- 92444 in-flight CPI 1.6409 -- Total Cycles 151716 ---- Thread 25 ---- PC 5: Stalled ----- 87797 in-flight CPI 1.7278 -- Total Cycles 151716 ---- Thread 26 ---- PC 5: Stalled ----- 104076 in-flight CPI 1.4576 -- Total Cycles 151716 ---- Thread 27 ---- PC 5: Stalled ----- 87095 in-flight CPI 1.7417 -- Total Cycles 151716 ---- Thread 28 ---- PC 5: Stalled ----- 92916 in-flight CPI 1.6325 -- Total Cycles 151716 ---- Thread 29 ---- PC 5: Stalled ----- 91131 in-flight CPI 1.6645 -- Total Cycles 151716 ---- Thread 30 ---- PC 5: Stalled ----- 93215 in-flight CPI 1.6273 -- Total Cycles 151716 ---- Thread 31 ---- PC 5: Stalled ----- 89261 in-flight CPI 1.6994 -- Total Cycles 151716 Total CPI 0.0492 , IPC 20.3333 -- Total Cycles 151716 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8891 (4.012963%) FPSUB: 0 (0.000000%) FPMUL: 33926 (15.312537%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 86246 (38.927227%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4079 (1.841061%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 80660 (36.405979%) DIV: 7495 (3.382876%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.117351%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3390384 total) ADD%: 7.111 (241079) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.520 (51529) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.607 (20565) FPSUB%: 0.000 (0) FPMUL%: 4.934 (167282) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.218 (176897) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (581) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.084 (36737) FPLE%: 0.450 (15243) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.767 (93803) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.765 (25941) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.639 (530206) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.168 (39611) ORI%: 1.604 (54370) XORI%: 0.000 (0) MULI%: 3.161 (107172) LW%: 1.379 (46744) LWI%: 12.992 (440488) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9619) SWI%: 4.099 (138970) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.382 (46855) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10439) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.063 (2151) bned%: 0.000 (0) bneid%: 13.745 (466021) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.706 (23950) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.136 (4609) DIV%: 0.012 (406) FPUN%: 1.462 (49561) FPRSUB%: 4.349 (147463) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (62) FPGT%: 2.922 (99061) FPGE%: 1.012 (34318) SYNC%: 0.000 (0) NOP%: 9.009 (305446) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 28 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 154 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 41678 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1259 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49471 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 12663 XORI 0 MULI 8853 LW 0 LWI 143895 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 73 DIV 32 FPUN 0 FPRSUB 67 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.3335 --Total thread-cycles: 4854912 --total thread-cycles issued: 3084938 (63.542618%) --iCache conflicts: 112812 (2.323667%) --thread*cycles of FU dependence: 258638 (5.327347%) --thread*cycles of data dependence: 221557 (4.563564%) --iCache cycles*banks: 4854912 (69.834755% used) Issue breakdown: --thread*cycles of issue worked: 3084938 (63.542618%) --thread*cycles of issue failed: 1464528 (30.165901%) --thread*cycles of issue NOP/other: 864040046279382 (17797232640.000000%) Number of thread-cycles not ready: 221557 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3390384 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 6 2: 9 3: 8 4: 7 5: 7 6: 7 7: 8 8: 8 9: 8 10: 8 11: 8 12: 7 13: 6 14: 7 15: 8 16: 8 17: 8 18: 8 19: 8 20: 8 21: 8 22: 9 23: 8 24: 7 25: 6 26: 4 27: 6 28: 7 29: 7 30: 8 31: 7 <=== Core 2 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102215 in-flight CPI 1.2844 -- Total Cycles 131310 ---- Thread 01 ---- PC 5: Stalled ----- 99406 in-flight CPI 1.3207 -- Total Cycles 131310 ---- Thread 02 ---- PC 5: Stalled ----- 102176 in-flight CPI 1.2849 -- Total Cycles 131310 ---- Thread 03 ---- PC 5: Stalled ----- 91570 in-flight CPI 1.4338 -- Total Cycles 131310 ---- Thread 04 ---- PC 5: Stalled ----- 102695 in-flight CPI 1.2785 -- Total Cycles 131310 ---- Thread 05 ---- PC 5: Stalled ----- 100416 in-flight CPI 1.3074 -- Total Cycles 131310 ---- Thread 06 ---- PC 5: Stalled ----- 98319 in-flight CPI 1.3353 -- Total Cycles 131310 ---- Thread 07 ---- PC 5: Stalled ----- 97241 in-flight CPI 1.3501 -- Total Cycles 131310 ---- Thread 08 ---- PC 5: Stalled ----- 98123 in-flight CPI 1.3380 -- Total Cycles 131310 ---- Thread 09 ---- PC 5: Stalled ----- 104363 in-flight CPI 1.2579 -- Total Cycles 131310 ---- Thread 10 ---- PC 5: Stalled ----- 101126 in-flight CPI 1.2982 -- Total Cycles 131310 ---- Thread 11 ---- PC 5: Stalled ----- 93486 in-flight CPI 1.4043 -- Total Cycles 131310 ---- Thread 12 ---- PC 5: Stalled ----- 100716 in-flight CPI 1.3035 -- Total Cycles 131310 ---- Thread 13 ---- PC 5: Stalled ----- 94450 in-flight CPI 1.3900 -- Total Cycles 131310 ---- Thread 14 ---- PC 5: Stalled ----- 97757 in-flight CPI 1.3429 -- Total Cycles 131310 ---- Thread 15 ---- PC 5: Stalled ----- 89278 in-flight CPI 1.4706 -- Total Cycles 131310 ---- Thread 16 ---- PC 5: Stalled ----- 92774 in-flight CPI 1.4151 -- Total Cycles 131310 ---- Thread 17 ---- PC 5: Stalled ----- 97616 in-flight CPI 1.3449 -- Total Cycles 131310 ---- Thread 18 ---- PC 5: Stalled ----- 90415 in-flight CPI 1.4520 -- Total Cycles 131310 ---- Thread 19 ---- PC 5: Stalled ----- 99132 in-flight CPI 1.3243 -- Total Cycles 131310 ---- Thread 20 ---- PC 5: Stalled ----- 91890 in-flight CPI 1.4287 -- Total Cycles 131310 ---- Thread 21 ---- PC 5: Stalled ----- 92234 in-flight CPI 1.4234 -- Total Cycles 131310 ---- Thread 22 ---- PC 5: Stalled ----- 93605 in-flight CPI 1.4026 -- Total Cycles 131310 ---- Thread 23 ---- PC 5: Stalled ----- 97892 in-flight CPI 1.3411 -- Total Cycles 131310 ---- Thread 24 ---- PC 5: Stalled ----- 91122 in-flight CPI 1.4408 -- Total Cycles 131310 ---- Thread 25 ---- PC 5: Stalled ----- 93397 in-flight CPI 1.4056 -- Total Cycles 131310 ---- Thread 26 ---- PC 5: Stalled ----- 90056 in-flight CPI 1.4578 -- Total Cycles 131310 ---- Thread 27 ---- PC 5: Stalled ----- 90151 in-flight CPI 1.4563 -- Total Cycles 131310 ---- Thread 28 ---- PC 5: Stalled ----- 88333 in-flight CPI 1.4862 -- Total Cycles 131310 ---- Thread 29 ---- PC 5: Stalled ----- 88710 in-flight CPI 1.4799 -- Total Cycles 131310 ---- Thread 30 ---- PC 5: Stalled ----- 90890 in-flight CPI 1.4445 -- Total Cycles 131310 ---- Thread 31 ---- PC 5: Stalled ----- 85039 in-flight CPI 1.5438 -- Total Cycles 131310 Total CPI 0.0431 , IPC 23.2059 -- Total Cycles 131310 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7769 (3.841058%) FPSUB: 0 (0.000000%) FPMUL: 31629 (15.637638%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 78865 (38.991505%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4320 (2.135844%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71613 (35.406055%) DIV: 7801 (3.856879%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.131018%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3349308 total) ADD%: 7.194 (240954) SUB%: 0.000 (0) MUL%: 0.006 (211) BITOR%: 1.535 (51423) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.545 (18253) FPSUB%: 0.000 (0) FPMUL%: 4.751 (159141) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (633) FPMAX%: 0.019 (633) LOAD%: 5.124 (171629) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (605) FPINV%: 0.000 (0) FPCONV%: 0.020 (665) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35607) FPLE%: 0.456 (15283) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (633) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.800 (93786) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24977) CMPU%: 0.000 (0) RSUB%: 0.006 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.677 (525056) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39339) ORI%: 1.560 (52265) XORI%: 0.000 (0) MULI%: 3.204 (107318) LW%: 1.396 (46742) LWI%: 13.112 (439177) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9614) SWI%: 4.145 (138817) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46861) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10375) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1775) bned%: 0.000 (0) bneid%: 13.819 (462832) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (24016) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4070) DIV%: 0.013 (422) FPUN%: 1.485 (49725) FPRSUB%: 4.197 (140565) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.949 (98759) FPGE%: 1.028 (34442) SYNC%: 0.000 (0) NOP%: 9.019 (302082) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 161 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 39574 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1322 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49460 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 11034 XORI 0 MULI 9514 LW 0 LWI 143119 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 55 DIV 20 FPUN 0 FPRSUB 55 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.2062 --Total thread-cycles: 4201920 --total thread-cycles issued: 3047226 (72.519852%) --iCache conflicts: 113210 (2.694244%) --thread*cycles of FU dependence: 254813 (6.064204%) --thread*cycles of data dependence: 202262 (4.813561%) --iCache cycles*banks: 4201920 (79.709747% used) Issue breakdown: --thread*cycles of issue worked: 3047226 (72.519852%) --thread*cycles of issue failed: 852612 (20.291010%) --thread*cycles of issue NOP/other: 664637599578237 (15817474048.000000%) Number of thread-cycles not ready: 202262 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3349308 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 6 4: 6 5: 8 6: 8 7: 8 8: 7 9: 10 10: 8 11: 8 12: 9 13: 7 14: 9 15: 5 16: 8 17: 8 18: 7 19: 8 20: 7 21: 8 22: 7 23: 9 24: 6 25: 8 26: 8 27: 7 28: 8 29: 7 30: 7 31: 7 <=== Core 3 ===> ---- Thread 00 ---- PC 5: Stalled ----- 93510 in-flight CPI 1.5369 -- Total Cycles 143740 ---- Thread 01 ---- PC 5: Stalled ----- 99669 in-flight CPI 1.4419 -- Total Cycles 143740 ---- Thread 02 ---- PC 5: Stalled ----- 94164 in-flight CPI 1.5262 -- Total Cycles 143740 ---- Thread 03 ---- PC 5: Stalled ----- 95056 in-flight CPI 1.5119 -- Total Cycles 143740 ---- Thread 04 ---- PC 5: Stalled ----- 96655 in-flight CPI 1.4868 -- Total Cycles 143740 ---- Thread 05 ---- PC 5: Stalled ----- 104622 in-flight CPI 1.3736 -- Total Cycles 143740 ---- Thread 06 ---- PC 5: Stalled ----- 92002 in-flight CPI 1.5621 -- Total Cycles 143740 ---- Thread 07 ---- PC 5: Stalled ----- 95334 in-flight CPI 1.5075 -- Total Cycles 143740 ---- Thread 08 ---- PC 5: Stalled ----- 98010 in-flight CPI 1.4663 -- Total Cycles 143740 ---- Thread 09 ---- PC 5: Stalled ----- 99947 in-flight CPI 1.4379 -- Total Cycles 143740 ---- Thread 10 ---- PC 5: Stalled ----- 107623 in-flight CPI 1.3355 -- Total Cycles 143740 ---- Thread 11 ---- PC 5: Stalled ----- 98743 in-flight CPI 1.4554 -- Total Cycles 143740 ---- Thread 12 ---- PC 5: Stalled ----- 100846 in-flight CPI 1.4251 -- Total Cycles 143740 ---- Thread 13 ---- PC 5: Stalled ----- 100158 in-flight CPI 1.4349 -- Total Cycles 143740 ---- Thread 14 ---- PC 5: Stalled ----- 97212 in-flight CPI 1.4784 -- Total Cycles 143740 ---- Thread 15 ---- PC 5: Stalled ----- 97519 in-flight CPI 1.4737 -- Total Cycles 143740 ---- Thread 16 ---- PC 5: Stalled ----- 97315 in-flight CPI 1.4768 -- Total Cycles 143740 ---- Thread 17 ---- PC 5: Stalled ----- 96918 in-flight CPI 1.4829 -- Total Cycles 143740 ---- Thread 18 ---- PC 5: Stalled ----- 94987 in-flight CPI 1.5130 -- Total Cycles 143740 ---- Thread 19 ---- PC 5: Stalled ----- 94234 in-flight CPI 1.5251 -- Total Cycles 143740 ---- Thread 20 ---- PC 5: Stalled ----- 93304 in-flight CPI 1.5403 -- Total Cycles 143740 ---- Thread 21 ---- PC 5: Stalled ----- 98401 in-flight CPI 1.4604 -- Total Cycles 143740 ---- Thread 22 ---- PC 5: Stalled ----- 88667 in-flight CPI 1.6209 -- Total Cycles 143740 ---- Thread 23 ---- PC 5: Stalled ----- 88833 in-flight CPI 1.6178 -- Total Cycles 143740 ---- Thread 24 ---- PC 5: Stalled ----- 93475 in-flight CPI 1.5374 -- Total Cycles 143740 ---- Thread 25 ---- PC 5: Stalled ----- 90621 in-flight CPI 1.5859 -- Total Cycles 143740 ---- Thread 26 ---- PC 5: Stalled ----- 88070 in-flight CPI 1.6319 -- Total Cycles 143740 ---- Thread 27 ---- PC 5: Stalled ----- 93579 in-flight CPI 1.5358 -- Total Cycles 143740 ---- Thread 28 ---- PC 5: Stalled ----- 88374 in-flight CPI 1.6262 -- Total Cycles 143740 ---- Thread 29 ---- PC 5: Stalled ----- 85890 in-flight CPI 1.6732 -- Total Cycles 143740 ---- Thread 30 ---- PC 5: Stalled ----- 84479 in-flight CPI 1.7012 -- Total Cycles 143740 ---- Thread 31 ---- PC 5: Stalled ----- 84727 in-flight CPI 1.6962 -- Total Cycles 143740 Total CPI 0.0474 , IPC 21.1040 -- Total Cycles 143740 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8575 (4.195986%) FPSUB: 0 (0.000000%) FPMUL: 32836 (16.067566%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 74328 (36.370754%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3820 (1.869232%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 77267 (37.808887%) DIV: 7280 (3.562306%) FPUN: 0 (0.000000%) FPRSUB: 256 (0.125268%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3333053 total) ADD%: 7.188 (239579) SUB%: 0.000 (0) MUL%: 0.006 (197) BITOR%: 1.532 (51068) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.590 (19665) FPSUB%: 0.000 (0) FPMUL%: 4.889 (162969) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (591) FPMAX%: 0.018 (591) LOAD%: 5.212 (173729) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (229) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (559) FPINV%: 0.000 (0) FPCONV%: 0.019 (623) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.074 (35784) FPLE%: 0.456 (15208) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (591) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.780 (92662) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.760 (25346) CMPU%: 0.000 (0) RSUB%: 0.006 (197) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.654 (521746) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39050) ORI%: 1.588 (52944) XORI%: 0.000 (0) MULI%: 3.167 (105574) LW%: 1.385 (46168) LWI%: 12.990 (432948) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9508) SWI%: 4.096 (136538) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.388 (46269) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10284) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2045) bned%: 0.000 (0) bneid%: 13.741 (458006) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23817) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.132 (4414) DIV%: 0.012 (394) FPUN%: 1.476 (49201) FPRSUB%: 4.317 (143887) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (63) FPGT%: 2.910 (97002) FPGE%: 1.020 (33993) SYNC%: 0.000 (0) NOP%: 8.986 (299518) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 147 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 384 LOAD 40191 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1270 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48680 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 12166 XORI 0 MULI 9703 LW 0 LWI 141614 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 22 FPUN 0 FPRSUB 55 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.1042 --Total thread-cycles: 4599680 --total thread-cycles issued: 3033535 (65.951004%) --iCache conflicts: 112641 (2.448888%) --thread*cycles of FU dependence: 254398 (5.530776%) --thread*cycles of data dependence: 204362 (4.442961%) --iCache cycles*banks: 4599680 (72.463409% used) Issue breakdown: --thread*cycles of issue worked: 3033535 (65.951004%) --thread*cycles of issue failed: 1266627 (27.537287%) --thread*cycles of issue NOP/other: 4619245006249824766 (100425361522688.000000%) Number of thread-cycles not ready: 204362 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3333053 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 7 4: 8 5: 9 6: 6 7: 7 8: 7 9: 7 10: 5 11: 8 12: 8 13: 8 14: 7 15: 7 16: 7 17: 7 18: 7 19: 7 20: 7 21: 9 22: 6 23: 6 24: 8 25: 7 26: 6 27: 7 28: 7 29: 7 30: 7 31: 7 <=== Core 4 ===> ---- Thread 00 ---- PC 5: Stalled ----- 104190 in-flight CPI 1.2158 -- Total Cycles 126706 ---- Thread 01 ---- PC 5: Stalled ----- 96871 in-flight CPI 1.3078 -- Total Cycles 126706 ---- Thread 02 ---- PC 5: Stalled ----- 96839 in-flight CPI 1.3082 -- Total Cycles 126706 ---- Thread 03 ---- PC 5: Stalled ----- 102196 in-flight CPI 1.2396 -- Total Cycles 126706 ---- Thread 04 ---- PC 5: Stalled ----- 97762 in-flight CPI 1.2958 -- Total Cycles 126706 ---- Thread 05 ---- PC 5: Stalled ----- 100871 in-flight CPI 1.2559 -- Total Cycles 126706 ---- Thread 06 ---- PC 5: Stalled ----- 97513 in-flight CPI 1.2991 -- Total Cycles 126706 ---- Thread 07 ---- PC 5: Stalled ----- 90150 in-flight CPI 1.4053 -- Total Cycles 126706 ---- Thread 08 ---- PC 5: Stalled ----- 96235 in-flight CPI 1.3164 -- Total Cycles 126706 ---- Thread 09 ---- PC 5: Stalled ----- 98263 in-flight CPI 1.2892 -- Total Cycles 126706 ---- Thread 10 ---- PC 5: Stalled ----- 94191 in-flight CPI 1.3450 -- Total Cycles 126706 ---- Thread 11 ---- PC 5: Stalled ----- 99314 in-flight CPI 1.2755 -- Total Cycles 126706 ---- Thread 12 ---- PC 5: Stalled ----- 92769 in-flight CPI 1.3656 -- Total Cycles 126706 ---- Thread 13 ---- PC 5: Stalled ----- 92029 in-flight CPI 1.3766 -- Total Cycles 126706 ---- Thread 14 ---- PC 5: Stalled ----- 93757 in-flight CPI 1.3512 -- Total Cycles 126706 ---- Thread 15 ---- PC 5: Stalled ----- 100197 in-flight CPI 1.2643 -- Total Cycles 126706 ---- Thread 16 ---- PC 5: Stalled ----- 99497 in-flight CPI 1.2732 -- Total Cycles 126706 ---- Thread 17 ---- PC 5: Stalled ----- 92717 in-flight CPI 1.3663 -- Total Cycles 126706 ---- Thread 18 ---- PC 5: Stalled ----- 93633 in-flight CPI 1.3530 -- Total Cycles 126706 ---- Thread 19 ---- PC 5: Stalled ----- 91268 in-flight CPI 1.3880 -- Total Cycles 126706 ---- Thread 20 ---- PC 5: Stalled ----- 90899 in-flight CPI 1.3937 -- Total Cycles 126706 ---- Thread 21 ---- PC 5: Stalled ----- 94486 in-flight CPI 1.3407 -- Total Cycles 126706 ---- Thread 22 ---- PC 5: Stalled ----- 89889 in-flight CPI 1.4093 -- Total Cycles 126706 ---- Thread 23 ---- PC 5: Stalled ----- 95878 in-flight CPI 1.3213 -- Total Cycles 126706 ---- Thread 24 ---- PC 5: Stalled ----- 89537 in-flight CPI 1.4149 -- Total Cycles 126706 ---- Thread 25 ---- PC 5: Stalled ----- 92372 in-flight CPI 1.3715 -- Total Cycles 126706 ---- Thread 26 ---- PC 5: Stalled ----- 88504 in-flight CPI 1.4314 -- Total Cycles 126706 ---- Thread 27 ---- PC 5: Stalled ----- 89987 in-flight CPI 1.4078 -- Total Cycles 126706 ---- Thread 28 ---- PC 5: Stalled ----- 85842 in-flight CPI 1.4758 -- Total Cycles 126706 ---- Thread 29 ---- PC 5: Stalled ----- 88479 in-flight CPI 1.4318 -- Total Cycles 126706 ---- Thread 30 ---- PC 5: Stalled ----- 90558 in-flight CPI 1.3989 -- Total Cycles 126706 ---- Thread 31 ---- PC 5: Stalled ----- 89764 in-flight CPI 1.4112 -- Total Cycles 126706 Total CPI 0.0420 , IPC 23.8112 -- Total Cycles 126706 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7762 (4.033612%) FPSUB: 0 (0.000000%) FPMUL: 31648 (16.446243%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 70363 (36.564934%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4238 (2.202325%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70518 (36.645481%) DIV: 7644 (3.972291%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.135112%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3316369 total) ADD%: 7.130 (236458) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.528 (50666) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.552 (18297) FPSUB%: 0.000 (0) FPMUL%: 4.774 (158328) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.133 (170217) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (595) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35258) FPLE%: 0.453 (15038) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.808 (93137) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.744 (24665) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.686 (520211) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (38937) ORI%: 1.568 (52015) XORI%: 0.000 (0) MULI%: 3.208 (106376) LW%: 1.400 (46418) LWI%: 13.113 (434880) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9547) SWI%: 4.145 (137468) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46535) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10307) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1879) bned%: 0.000 (0) bneid%: 13.812 (458061) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23815) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4009) DIV%: 0.012 (414) FPUN%: 1.481 (49108) FPRSUB%: 4.199 (139248) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (76) FPGT%: 2.948 (97780) FPGE%: 1.027 (34070) SYNC%: 0.000 (0) NOP%: 9.025 (299291) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 155 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 402 LOAD 39731 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 12 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1098 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48977 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 6 ORI 11102 XORI 0 MULI 9242 LW 0 LWI 141440 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 71 DIV 28 FPUN 0 FPRSUB 54 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8115 --Total thread-cycles: 4054592 --total thread-cycles issued: 3017078 (74.411385%) --iCache conflicts: 114189 (2.816288%) --thread*cycles of FU dependence: 252376 (6.224449%) --thread*cycles of data dependence: 192433 (4.746051%) --iCache cycles*banks: 4054592 (81.793701% used) Issue breakdown: --thread*cycles of issue worked: 3017078 (74.411385%) --thread*cycles of issue failed: 738223 (18.207085%) --thread*cycles of issue NOP/other: 673804955 (16618.316406%) Number of thread-cycles not ready: 192433 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3316369 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 7 3: 8 4: 8 5: 8 6: 9 7: 6 8: 7 9: 8 10: 7 11: 9 12: 7 13: 7 14: 7 15: 9 16: 8 17: 7 18: 7 19: 8 20: 7 21: 8 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 8 <=== Core 5 ===> ---- Thread 00 ---- PC 5: Stalled ----- 93874 in-flight CPI 1.3833 -- Total Cycles 129886 ---- Thread 01 ---- PC 5: Stalled ----- 102790 in-flight CPI 1.2634 -- Total Cycles 129886 ---- Thread 02 ---- PC 5: Stalled ----- 98832 in-flight CPI 1.3139 -- Total Cycles 129886 ---- Thread 03 ---- PC 5: Stalled ----- 103106 in-flight CPI 1.2595 -- Total Cycles 129886 ---- Thread 04 ---- PC 5: Stalled ----- 96820 in-flight CPI 1.3413 -- Total Cycles 129886 ---- Thread 05 ---- PC 5: Stalled ----- 103822 in-flight CPI 1.2508 -- Total Cycles 129886 ---- Thread 06 ---- PC 5: Stalled ----- 98251 in-flight CPI 1.3217 -- Total Cycles 129886 ---- Thread 07 ---- PC 5: Stalled ----- 95375 in-flight CPI 1.3616 -- Total Cycles 129886 ---- Thread 08 ---- PC 5: Stalled ----- 98644 in-flight CPI 1.3165 -- Total Cycles 129886 ---- Thread 09 ---- PC 5: Stalled ----- 104192 in-flight CPI 1.2464 -- Total Cycles 129886 ---- Thread 10 ---- PC 5: Stalled ----- 97546 in-flight CPI 1.3313 -- Total Cycles 129886 ---- Thread 11 ---- PC 5: Stalled ----- 97766 in-flight CPI 1.3283 -- Total Cycles 129886 ---- Thread 12 ---- PC 5: Stalled ----- 97682 in-flight CPI 1.3294 -- Total Cycles 129886 ---- Thread 13 ---- PC 5: Stalled ----- 90232 in-flight CPI 1.4393 -- Total Cycles 129886 ---- Thread 14 ---- PC 5: Stalled ----- 94576 in-flight CPI 1.3731 -- Total Cycles 129886 ---- Thread 15 ---- PC 5: Stalled ----- 90014 in-flight CPI 1.4427 -- Total Cycles 129886 ---- Thread 16 ---- PC 5: Stalled ----- 92346 in-flight CPI 1.4063 -- Total Cycles 129886 ---- Thread 17 ---- PC 5: Stalled ----- 101041 in-flight CPI 1.2852 -- Total Cycles 129886 ---- Thread 18 ---- PC 5: Stalled ----- 98657 in-flight CPI 1.3162 -- Total Cycles 129886 ---- Thread 19 ---- PC 5: Stalled ----- 96292 in-flight CPI 1.3486 -- Total Cycles 129886 ---- Thread 20 ---- PC 5: Stalled ----- 94963 in-flight CPI 1.3675 -- Total Cycles 129886 ---- Thread 21 ---- PC 5: Stalled ----- 94630 in-flight CPI 1.3723 -- Total Cycles 129886 ---- Thread 22 ---- PC 5: Stalled ----- 89995 in-flight CPI 1.4430 -- Total Cycles 129886 ---- Thread 23 ---- PC 5: Stalled ----- 89377 in-flight CPI 1.4529 -- Total Cycles 129886 ---- Thread 24 ---- PC 5: Stalled ----- 91105 in-flight CPI 1.4254 -- Total Cycles 129886 ---- Thread 25 ---- PC 5: Stalled ----- 90426 in-flight CPI 1.4361 -- Total Cycles 129886 ---- Thread 26 ---- PC 5: Stalled ----- 93286 in-flight CPI 1.3921 -- Total Cycles 129886 ---- Thread 27 ---- PC 5: Stalled ----- 92883 in-flight CPI 1.3981 -- Total Cycles 129886 ---- Thread 28 ---- PC 5: Stalled ----- 94108 in-flight CPI 1.3799 -- Total Cycles 129886 ---- Thread 29 ---- PC 5: Stalled ----- 91189 in-flight CPI 1.4241 -- Total Cycles 129886 ---- Thread 30 ---- PC 5: Stalled ----- 85234 in-flight CPI 1.5235 -- Total Cycles 129886 ---- Thread 31 ---- PC 5: Stalled ----- 93492 in-flight CPI 1.3890 -- Total Cycles 129886 Total CPI 0.0425 , IPC 23.5062 -- Total Cycles 129886 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7440 (4.083044%) FPSUB: 0 (0.000000%) FPMUL: 31170 (17.105978%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 62550 (34.327202%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4303 (2.361470%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68735 (37.721508%) DIV: 7756 (4.256464%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.144333%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3355723 total) ADD%: 7.232 (242686) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.533 (51437) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.526 (17662) FPSUB%: 0.000 (0) FPMUL%: 4.701 (157739) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.104 (171287) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (604) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.054 (35368) FPLE%: 0.456 (15312) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.818 (94565) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.738 (24766) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.699 (526806) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39533) ORI%: 1.545 (51849) XORI%: 0.000 (0) MULI%: 3.219 (108016) LW%: 1.404 (47120) LWI%: 13.144 (441062) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9703) SWI%: 4.161 (139635) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.408 (47240) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10419) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.050 (1687) bned%: 0.000 (0) bneid%: 13.827 (464011) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (24104) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.116 (3902) DIV%: 0.013 (420) FPUN%: 1.486 (49856) FPRSUB%: 4.151 (139282) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.955 (99178) FPGE%: 1.029 (34544) SYNC%: 0.000 (0) NOP%: 9.016 (302547) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 153 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 38758 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1210 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49640 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 10555 XORI 0 MULI 9778 LW 0 LWI 143620 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 71 DIV 27 FPUN 0 FPRSUB 46 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5064 --Total thread-cycles: 4156352 --total thread-cycles issued: 3053176 (73.458069%) --iCache conflicts: 114097 (2.745124%) --thread*cycles of FU dependence: 254356 (6.119693%) --thread*cycles of data dependence: 182217 (4.384061%) --iCache cycles*banks: 4156352 (80.737991% used) Issue breakdown: --thread*cycles of issue worked: 3053176 (73.458069%) --thread*cycles of issue failed: 800629 (19.262781%) --thread*cycles of issue NOP/other: 4613544946845392339 (110999856218112.000000%) Number of thread-cycles not ready: 182217 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3355723 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 8 4: 8 5: 9 6: 9 7: 7 8: 7 9: 8 10: 8 11: 7 12: 8 13: 5 14: 7 15: 6 16: 6 17: 8 18: 9 19: 7 20: 8 21: 7 22: 6 23: 8 24: 7 25: 7 26: 8 27: 8 28: 8 29: 7 30: 8 31: 8 <=== Core 6 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103754 in-flight CPI 1.2426 -- Total Cycles 128948 ---- Thread 01 ---- PC 5: Stalled ----- 97213 in-flight CPI 1.3262 -- Total Cycles 128948 ---- Thread 02 ---- PC 5: Stalled ----- 101863 in-flight CPI 1.2657 -- Total Cycles 128948 ---- Thread 03 ---- PC 5: Stalled ----- 97952 in-flight CPI 1.3161 -- Total Cycles 128948 ---- Thread 04 ---- PC 5: Stalled ----- 90993 in-flight CPI 1.4169 -- Total Cycles 128948 ---- Thread 05 ---- PC 5: Stalled ----- 97270 in-flight CPI 1.3254 -- Total Cycles 128948 ---- Thread 06 ---- PC 5: Stalled ----- 97326 in-flight CPI 1.3246 -- Total Cycles 128948 ---- Thread 07 ---- PC 5: Stalled ----- 100209 in-flight CPI 1.2865 -- Total Cycles 128948 ---- Thread 08 ---- PC 5: Stalled ----- 100069 in-flight CPI 1.2883 -- Total Cycles 128948 ---- Thread 09 ---- PC 5: Stalled ----- 95614 in-flight CPI 1.3484 -- Total Cycles 128948 ---- Thread 10 ---- PC 5: Stalled ----- 100672 in-flight CPI 1.2806 -- Total Cycles 128948 ---- Thread 11 ---- PC 5: Stalled ----- 102017 in-flight CPI 1.2637 -- Total Cycles 128948 ---- Thread 12 ---- PC 5: Stalled ----- 98631 in-flight CPI 1.3071 -- Total Cycles 128948 ---- Thread 13 ---- PC 5: Stalled ----- 93818 in-flight CPI 1.3742 -- Total Cycles 128948 ---- Thread 14 ---- PC 5: Stalled ----- 95633 in-flight CPI 1.3481 -- Total Cycles 128948 ---- Thread 15 ---- PC 5: Stalled ----- 100796 in-flight CPI 1.2791 -- Total Cycles 128948 ---- Thread 16 ---- PC 5: Stalled ----- 95848 in-flight CPI 1.3451 -- Total Cycles 128948 ---- Thread 17 ---- PC 5: Stalled ----- 91874 in-flight CPI 1.4033 -- Total Cycles 128948 ---- Thread 18 ---- PC 5: Stalled ----- 94911 in-flight CPI 1.3584 -- Total Cycles 128948 ---- Thread 19 ---- PC 5: Stalled ----- 92644 in-flight CPI 1.3916 -- Total Cycles 128948 ---- Thread 20 ---- PC 5: Stalled ----- 91809 in-flight CPI 1.4043 -- Total Cycles 128948 ---- Thread 21 ---- PC 5: Stalled ----- 95214 in-flight CPI 1.3541 -- Total Cycles 128948 ---- Thread 22 ---- PC 5: Stalled ----- 90354 in-flight CPI 1.4268 -- Total Cycles 128948 ---- Thread 23 ---- PC 5: Stalled ----- 92589 in-flight CPI 1.3924 -- Total Cycles 128948 ---- Thread 24 ---- PC 5: Stalled ----- 93683 in-flight CPI 1.3761 -- Total Cycles 128948 ---- Thread 25 ---- PC 5: Stalled ----- 88764 in-flight CPI 1.4524 -- Total Cycles 128948 ---- Thread 26 ---- PC 5: Stalled ----- 86964 in-flight CPI 1.4824 -- Total Cycles 128948 ---- Thread 27 ---- PC 5: Stalled ----- 90149 in-flight CPI 1.4302 -- Total Cycles 128948 ---- Thread 28 ---- PC 5: Stalled ----- 88789 in-flight CPI 1.4520 -- Total Cycles 128948 ---- Thread 29 ---- PC 5: Stalled ----- 87112 in-flight CPI 1.4800 -- Total Cycles 128948 ---- Thread 30 ---- PC 5: Stalled ----- 95016 in-flight CPI 1.3568 -- Total Cycles 128948 ---- Thread 31 ---- PC 5: Stalled ----- 88503 in-flight CPI 1.4567 -- Total Cycles 128948 Total CPI 0.0424 , IPC 23.5648 -- Total Cycles 128948 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7285 (3.931950%) FPSUB: 0 (0.000000%) FPMUL: 30800 (16.623758%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 66858 (36.085430%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4321 (2.332184%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67876 (36.634876%) DIV: 7869 (4.247154%) FPUN: 0 (0.000000%) FPRSUB: 268 (0.144648%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3340113 total) ADD%: 7.186 (240006) SUB%: 0.000 (0) MUL%: 0.006 (213) BITOR%: 1.530 (51106) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.519 (17327) FPSUB%: 0.000 (0) FPMUL%: 4.680 (156327) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (639) FPMAX%: 0.019 (639) LOAD%: 5.097 (170257) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (245) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (609) FPINV%: 0.000 (0) FPCONV%: 0.020 (671) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.052 (35146) FPLE%: 0.455 (15181) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (639) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.825 (94358) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.734 (24511) CMPU%: 0.000 (0) RSUB%: 0.006 (213) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.701 (524446) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39338) ORI%: 1.543 (51553) XORI%: 0.000 (0) MULI%: 3.228 (107804) LW%: 1.408 (47021) LWI%: 13.180 (440239) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9679) SWI%: 4.176 (139497) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.411 (47140) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10382) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.051 (1713) bned%: 0.000 (0) bneid%: 13.835 (462108) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (24059) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.115 (3849) DIV%: 0.013 (426) FPUN%: 1.486 (49639) FPRSUB%: 4.137 (138191) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) FPGT%: 2.961 (98901) FPGE%: 1.032 (34458) SYNC%: 0.000 (0) NOP%: 9.024 (301421) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 151 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 38987 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1164 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49629 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 10340 XORI 0 MULI 9747 LW 0 LWI 143250 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 69 DIV 30 FPUN 0 FPRSUB 43 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5651 --Total thread-cycles: 4126336 --total thread-cycles issued: 3038692 (73.641411%) --iCache conflicts: 113635 (2.753896%) --thread*cycles of FU dependence: 253886 (6.152820%) --thread*cycles of data dependence: 185277 (4.490109%) --iCache cycles*banks: 4126336 (80.946991% used) Issue breakdown: --thread*cycles of issue worked: 3038692 (73.641411%) --thread*cycles of issue failed: 786223 (19.053782%) --thread*cycles of issue NOP/other: 302285 (7.325748%) Number of thread-cycles not ready: 185277 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3340113 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 8 3: 9 4: 7 5: 7 6: 8 7: 8 8: 8 9: 8 10: 9 11: 8 12: 8 13: 8 14: 8 15: 8 16: 7 17: 7 18: 7 19: 8 20: 6 21: 7 22: 8 23: 8 24: 8 25: 8 26: 8 27: 5 28: 7 29: 7 30: 8 31: 8 <=== Core 7 ===> ---- Thread 00 ---- PC 5: Stalled ----- 104161 in-flight CPI 1.2316 -- Total Cycles 128315 ---- Thread 01 ---- PC 5: Stalled ----- 97477 in-flight CPI 1.3161 -- Total Cycles 128315 ---- Thread 02 ---- PC 5: Stalled ----- 94029 in-flight CPI 1.3644 -- Total Cycles 128315 ---- Thread 03 ---- PC 5: Stalled ----- 95422 in-flight CPI 1.3445 -- Total Cycles 128315 ---- Thread 04 ---- PC 5: Stalled ----- 97976 in-flight CPI 1.3094 -- Total Cycles 128315 ---- Thread 05 ---- PC 5: Stalled ----- 95605 in-flight CPI 1.3419 -- Total Cycles 128315 ---- Thread 06 ---- PC 5: Stalled ----- 97807 in-flight CPI 1.3117 -- Total Cycles 128315 ---- Thread 07 ---- PC 5: Stalled ----- 94555 in-flight CPI 1.3567 -- Total Cycles 128315 ---- Thread 08 ---- PC 5: Stalled ----- 99866 in-flight CPI 1.2846 -- Total Cycles 128315 ---- Thread 09 ---- PC 5: Stalled ----- 96242 in-flight CPI 1.3330 -- Total Cycles 128315 ---- Thread 10 ---- PC 5: Stalled ----- 96416 in-flight CPI 1.3306 -- Total Cycles 128315 ---- Thread 11 ---- PC 5: Stalled ----- 96236 in-flight CPI 1.3331 -- Total Cycles 128315 ---- Thread 12 ---- PC 5: Stalled ----- 99916 in-flight CPI 1.2840 -- Total Cycles 128315 ---- Thread 13 ---- PC 5: Stalled ----- 94465 in-flight CPI 1.3581 -- Total Cycles 128315 ---- Thread 14 ---- PC 5: Stalled ----- 96260 in-flight CPI 1.3328 -- Total Cycles 128315 ---- Thread 15 ---- PC 5: Stalled ----- 96096 in-flight CPI 1.3351 -- Total Cycles 128315 ---- Thread 16 ---- PC 5: Stalled ----- 99190 in-flight CPI 1.2934 -- Total Cycles 128315 ---- Thread 17 ---- PC 5: Stalled ----- 90934 in-flight CPI 1.4108 -- Total Cycles 128315 ---- Thread 18 ---- PC 5: Stalled ----- 93168 in-flight CPI 1.3770 -- Total Cycles 128315 ---- Thread 19 ---- PC 5: Stalled ----- 96121 in-flight CPI 1.3347 -- Total Cycles 128315 ---- Thread 20 ---- PC 5: Stalled ----- 91775 in-flight CPI 1.3979 -- Total Cycles 128315 ---- Thread 21 ---- PC 5: Stalled ----- 95922 in-flight CPI 1.3374 -- Total Cycles 128315 ---- Thread 22 ---- PC 5: Stalled ----- 90999 in-flight CPI 1.4098 -- Total Cycles 128315 ---- Thread 23 ---- PC 5: Stalled ----- 88901 in-flight CPI 1.4431 -- Total Cycles 128315 ---- Thread 24 ---- PC 5: Stalled ----- 93309 in-flight CPI 1.3749 -- Total Cycles 128315 ---- Thread 25 ---- PC 5: Stalled ----- 93807 in-flight CPI 1.3676 -- Total Cycles 128315 ---- Thread 26 ---- PC 5: Stalled ----- 92690 in-flight CPI 1.3841 -- Total Cycles 128315 ---- Thread 27 ---- PC 5: Stalled ----- 94925 in-flight CPI 1.3515 -- Total Cycles 128315 ---- Thread 28 ---- PC 5: Stalled ----- 88364 in-flight CPI 1.4518 -- Total Cycles 128315 ---- Thread 29 ---- PC 5: Stalled ----- 87496 in-flight CPI 1.4663 -- Total Cycles 128315 ---- Thread 30 ---- PC 5: Stalled ----- 86112 in-flight CPI 1.4899 -- Total Cycles 128315 ---- Thread 31 ---- PC 5: Stalled ----- 89459 in-flight CPI 1.4341 -- Total Cycles 128315 Total CPI 0.0424 , IPC 23.5846 -- Total Cycles 128315 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8032 (3.990481%) FPSUB: 0 (0.000000%) FPMUL: 32117 (15.956458%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76170 (37.842995%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4174 (2.073738%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72990 (36.263096%) DIV: 7538 (3.745050%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.128180%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3325953 total) ADD%: 7.181 (238835) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.526 (50769) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.567 (18866) FPSUB%: 0.000 (0) FPMUL%: 4.819 (160271) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.156 (171492) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (587) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35514) FPLE%: 0.455 (15149) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.796 (92984) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (24914) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.663 (520930) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (38977) ORI%: 1.573 (52328) XORI%: 0.000 (0) MULI%: 3.191 (106124) LW%: 1.394 (46361) LWI%: 13.073 (434818) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9511) SWI%: 4.131 (137409) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.397 (46476) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10278) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1950) bned%: 0.000 (0) bneid%: 13.780 (458306) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23875) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4157) DIV%: 0.012 (408) FPUN%: 1.477 (49108) FPRSUB%: 4.242 (141071) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (64) FPGT%: 2.935 (97602) FPGE%: 1.021 (33959) SYNC%: 0.000 (0) NOP%: 9.009 (299640) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 158 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 39677 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1328 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48803 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 11482 XORI 0 MULI 9472 LW 0 LWI 141736 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 74 DIV 24 FPUN 0 FPRSUB 54 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5848 --Total thread-cycles: 4106080 --total thread-cycles issued: 3026313 (73.703217%) --iCache conflicts: 112700 (2.744710%) --thread*cycles of FU dependence: 253301 (6.168925%) --thread*cycles of data dependence: 201279 (4.901975%) --iCache cycles*banks: 4106080 (81.001465% used) Issue breakdown: --thread*cycles of issue worked: 3026313 (73.703217%) --thread*cycles of issue failed: 780127 (18.999313%) --thread*cycles of issue NOP/other: 299640 (7.297471%) Number of thread-cycles not ready: 201279 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3325953 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 7 3: 6 4: 8 5: 8 6: 8 7: 9 8: 8 9: 8 10: 8 11: 8 12: 8 13: 6 14: 7 15: 5 16: 7 17: 7 18: 7 19: 7 20: 7 21: 8 22: 8 23: 7 24: 7 25: 8 26: 7 27: 8 28: 8 29: 6 30: 6 31: 7 <=== Core 8 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99171 in-flight CPI 1.4521 -- Total Cycles 144033 ---- Thread 01 ---- PC 5: Stalled ----- 99076 in-flight CPI 1.4535 -- Total Cycles 144033 ---- Thread 02 ---- PC 5: Stalled ----- 97180 in-flight CPI 1.4818 -- Total Cycles 144033 ---- Thread 03 ---- PC 5: Stalled ----- 95880 in-flight CPI 1.5019 -- Total Cycles 144033 ---- Thread 04 ---- PC 5: Stalled ----- 98633 in-flight CPI 1.4601 -- Total Cycles 144033 ---- Thread 05 ---- PC 5: Stalled ----- 101389 in-flight CPI 1.4203 -- Total Cycles 144033 ---- Thread 06 ---- PC 5: Stalled ----- 109021 in-flight CPI 1.3210 -- Total Cycles 144033 ---- Thread 07 ---- PC 5: Stalled ----- 100095 in-flight CPI 1.4387 -- Total Cycles 144033 ---- Thread 08 ---- PC 5: Stalled ----- 94486 in-flight CPI 1.5241 -- Total Cycles 144033 ---- Thread 09 ---- PC 5: Stalled ----- 96975 in-flight CPI 1.4850 -- Total Cycles 144033 ---- Thread 10 ---- PC 5: Stalled ----- 102129 in-flight CPI 1.4100 -- Total Cycles 144033 ---- Thread 11 ---- PC 5: Stalled ----- 97774 in-flight CPI 1.4728 -- Total Cycles 144033 ---- Thread 12 ---- PC 5: Stalled ----- 94742 in-flight CPI 1.5200 -- Total Cycles 144033 ---- Thread 13 ---- PC 5: Stalled ----- 97548 in-flight CPI 1.4762 -- Total Cycles 144033 ---- Thread 14 ---- PC 5: Stalled ----- 91158 in-flight CPI 1.5798 -- Total Cycles 144033 ---- Thread 15 ---- PC 5: Stalled ----- 100553 in-flight CPI 1.4321 -- Total Cycles 144033 ---- Thread 16 ---- PC 5: Stalled ----- 98012 in-flight CPI 1.4693 -- Total Cycles 144033 ---- Thread 17 ---- PC 5: Stalled ----- 98488 in-flight CPI 1.4622 -- Total Cycles 144033 ---- Thread 18 ---- PC 5: Stalled ----- 93548 in-flight CPI 1.5394 -- Total Cycles 144033 ---- Thread 19 ---- PC 5: Stalled ----- 97318 in-flight CPI 1.4797 -- Total Cycles 144033 ---- Thread 20 ---- PC 5: Stalled ----- 99832 in-flight CPI 1.4425 -- Total Cycles 144033 ---- Thread 21 ---- PC 5: Stalled ----- 91436 in-flight CPI 1.5750 -- Total Cycles 144033 ---- Thread 22 ---- PC 5: Stalled ----- 87792 in-flight CPI 1.6404 -- Total Cycles 144033 ---- Thread 23 ---- PC 5: Stalled ----- 91192 in-flight CPI 1.5791 -- Total Cycles 144033 ---- Thread 24 ---- PC 5: Stalled ----- 92767 in-flight CPI 1.5524 -- Total Cycles 144033 ---- Thread 25 ---- PC 5: Stalled ----- 90052 in-flight CPI 1.5992 -- Total Cycles 144033 ---- Thread 26 ---- PC 5: Stalled ----- 86127 in-flight CPI 1.6720 -- Total Cycles 144033 ---- Thread 27 ---- PC 5: Stalled ----- 92186 in-flight CPI 1.5622 -- Total Cycles 144033 ---- Thread 28 ---- PC 5: Stalled ----- 85119 in-flight CPI 1.6919 -- Total Cycles 144033 ---- Thread 29 ---- PC 5: Stalled ----- 91197 in-flight CPI 1.5790 -- Total Cycles 144033 ---- Thread 30 ---- PC 5: Stalled ----- 90082 in-flight CPI 1.5986 -- Total Cycles 144033 ---- Thread 31 ---- PC 5: Stalled ----- 91287 in-flight CPI 1.5775 -- Total Cycles 144033 Total CPI 0.0472 , IPC 21.1952 -- Total Cycles 144033 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8027 (4.039657%) FPSUB: 0 (0.000000%) FPMUL: 32336 (16.273371%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73145 (36.810852%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4110 (2.068393%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73213 (36.845074%) DIV: 7614 (3.831811%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.130847%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3355706 total) ADD%: 7.178 (240871) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.527 (51231) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.562 (18859) FPSUB%: 0.000 (0) FPMUL%: 4.801 (161115) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (618) FPMAX%: 0.018 (618) LOAD%: 5.149 (172789) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (587) FPINV%: 0.000 (0) FPCONV%: 0.019 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35845) FPLE%: 0.456 (15308) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.795 (93786) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (25043) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.671 (525864) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.169 (39223) ORI%: 1.577 (52905) XORI%: 0.000 (0) MULI%: 3.194 (107180) LW%: 1.393 (46742) LWI%: 13.071 (438640) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9612) SWI%: 4.127 (138504) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.396 (46853) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10406) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2046) bned%: 0.000 (0) bneid%: 13.793 (462848) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (24095) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4168) DIV%: 0.012 (412) FPUN%: 1.479 (49646) FPRSUB%: 4.230 (141943) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (63) FPGT%: 2.941 (98691) FPGE%: 1.023 (34338) SYNC%: 0.000 (0) NOP%: 9.025 (302843) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 153 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 39802 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1283 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49327 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11412 XORI 0 MULI 9257 LW 0 LWI 143029 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 68 DIV 20 FPUN 0 FPRSUB 48 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.1954 --Total thread-cycles: 4609056 --total thread-cycles issued: 3052863 (66.236183%) --iCache conflicts: 112504 (2.440934%) --thread*cycles of FU dependence: 254896 (5.530330%) --thread*cycles of data dependence: 198705 (4.311186%) --iCache cycles*banks: 4609056 (72.807487% used) Issue breakdown: --thread*cycles of issue worked: 3052863 (66.236183%) --thread*cycles of issue failed: 1253350 (27.193203%) --thread*cycles of issue NOP/other: 4605152142125473531 (99915300601856.000000%) Number of thread-cycles not ready: 198705 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3355706 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 8 4: 7 5: 9 6: 5 7: 8 8: 7 9: 7 10: 8 11: 8 12: 8 13: 8 14: 7 15: 9 16: 8 17: 8 18: 8 19: 8 20: 8 21: 7 22: 6 23: 8 24: 7 25: 7 26: 7 27: 6 28: 6 29: 8 30: 7 31: 7 <=== Core 9 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103144 in-flight CPI 1.2487 -- Total Cycles 128822 ---- Thread 01 ---- PC 5: Stalled ----- 97684 in-flight CPI 1.3185 -- Total Cycles 128822 ---- Thread 02 ---- PC 5: Stalled ----- 100569 in-flight CPI 1.2807 -- Total Cycles 128822 ---- Thread 03 ---- PC 5: Stalled ----- 99511 in-flight CPI 1.2943 -- Total Cycles 128822 ---- Thread 04 ---- PC 5: Stalled ----- 96445 in-flight CPI 1.3354 -- Total Cycles 128822 ---- Thread 05 ---- PC 5: Stalled ----- 98837 in-flight CPI 1.3031 -- Total Cycles 128822 ---- Thread 06 ---- PC 5: Stalled ----- 101008 in-flight CPI 1.2751 -- Total Cycles 128822 ---- Thread 07 ---- PC 5: Stalled ----- 100860 in-flight CPI 1.2770 -- Total Cycles 128822 ---- Thread 08 ---- PC 5: Stalled ----- 94280 in-flight CPI 1.3661 -- Total Cycles 128822 ---- Thread 09 ---- PC 5: Stalled ----- 95322 in-flight CPI 1.3512 -- Total Cycles 128822 ---- Thread 10 ---- PC 5: Stalled ----- 96896 in-flight CPI 1.3293 -- Total Cycles 128822 ---- Thread 11 ---- PC 5: Stalled ----- 92243 in-flight CPI 1.3963 -- Total Cycles 128822 ---- Thread 12 ---- PC 5: Stalled ----- 97949 in-flight CPI 1.3149 -- Total Cycles 128822 ---- Thread 13 ---- PC 5: Stalled ----- 91676 in-flight CPI 1.4050 -- Total Cycles 128822 ---- Thread 14 ---- PC 5: Stalled ----- 96639 in-flight CPI 1.3328 -- Total Cycles 128822 ---- Thread 15 ---- PC 5: Stalled ----- 97930 in-flight CPI 1.3151 -- Total Cycles 128822 ---- Thread 16 ---- PC 5: Stalled ----- 92031 in-flight CPI 1.3996 -- Total Cycles 128822 ---- Thread 17 ---- PC 5: Stalled ----- 87987 in-flight CPI 1.4639 -- Total Cycles 128822 ---- Thread 18 ---- PC 5: Stalled ----- 96843 in-flight CPI 1.3300 -- Total Cycles 128822 ---- Thread 19 ---- PC 5: Stalled ----- 96248 in-flight CPI 1.3382 -- Total Cycles 128822 ---- Thread 20 ---- PC 5: Stalled ----- 91035 in-flight CPI 1.4149 -- Total Cycles 128822 ---- Thread 21 ---- PC 5: Stalled ----- 97592 in-flight CPI 1.3197 -- Total Cycles 128822 ---- Thread 22 ---- PC 5: Stalled ----- 88564 in-flight CPI 1.4543 -- Total Cycles 128822 ---- Thread 23 ---- PC 5: Stalled ----- 96521 in-flight CPI 1.3344 -- Total Cycles 128822 ---- Thread 24 ---- PC 5: Stalled ----- 90575 in-flight CPI 1.4220 -- Total Cycles 128822 ---- Thread 25 ---- PC 5: Stalled ----- 87606 in-flight CPI 1.4702 -- Total Cycles 128822 ---- Thread 26 ---- PC 5: Stalled ----- 91240 in-flight CPI 1.4117 -- Total Cycles 128822 ---- Thread 27 ---- PC 5: Stalled ----- 87329 in-flight CPI 1.4749 -- Total Cycles 128822 ---- Thread 28 ---- PC 5: Stalled ----- 92600 in-flight CPI 1.3909 -- Total Cycles 128822 ---- Thread 29 ---- PC 5: Stalled ----- 90769 in-flight CPI 1.4189 -- Total Cycles 128822 ---- Thread 30 ---- PC 5: Stalled ----- 90208 in-flight CPI 1.4278 -- Total Cycles 128822 ---- Thread 31 ---- PC 5: Stalled ----- 85657 in-flight CPI 1.5036 -- Total Cycles 128822 Total CPI 0.0426 , IPC 23.4770 -- Total Cycles 128822 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8025 (3.698634%) FPSUB: 0 (0.000000%) FPMUL: 32006 (14.751212%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 92323 (42.550652%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4006 (1.846321%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72884 (33.591431%) DIV: 7465 (3.440536%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.121214%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3324069 total) ADD%: 7.185 (238847) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.539 (51149) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.565 (18775) FPSUB%: 0.000 (0) FPMUL%: 4.810 (159877) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.151 (171234) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (575) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.069 (35529) FPLE%: 0.458 (15218) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.787 (92657) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.754 (25060) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.676 (521071) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (38993) ORI%: 1.574 (52337) XORI%: 0.000 (0) MULI%: 3.185 (105878) LW%: 1.389 (46157) LWI%: 13.045 (433635) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9518) SWI%: 4.124 (137081) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.392 (46264) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10321) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1839) bned%: 0.000 (0) bneid%: 13.802 (458795) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23828) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4152) DIV%: 0.012 (404) FPUN%: 1.484 (49340) FPRSUB%: 4.238 (140885) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.936 (97603) FPGE%: 1.027 (34122) SYNC%: 0.000 (0) NOP%: 9.015 (299665) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 148 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 39913 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 20 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1062 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48924 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 11378 XORI 0 MULI 9167 LW 0 LWI 141513 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 19 FPUN 0 FPRSUB 43 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4772 --Total thread-cycles: 4122304 --total thread-cycles issued: 3024404 (73.366837%) --iCache conflicts: 112218 (2.722215%) --thread*cycles of FU dependence: 252720 (6.130552%) --thread*cycles of data dependence: 216972 (5.263367%) --iCache cycles*banks: 4122304 (80.636963% used) Issue breakdown: --thread*cycles of issue worked: 3024404 (73.366837%) --thread*cycles of issue failed: 798235 (19.363808%) --thread*cycles of issue NOP/other: 4606178863721255569 (111737978224640.000000%) Number of thread-cycles not ready: 216972 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3324069 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 7 4: 9 5: 8 6: 8 7: 8 8: 8 9: 7 10: 7 11: 6 12: 8 13: 6 14: 7 15: 10 16: 6 17: 6 18: 8 19: 8 20: 6 21: 8 22: 7 23: 7 24: 7 25: 7 26: 5 27: 7 28: 7 29: 9 30: 7 31: 7 <=== Core 10 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97835 in-flight CPI 1.4447 -- Total Cycles 141365 ---- Thread 01 ---- PC 5: Stalled ----- 98971 in-flight CPI 1.4281 -- Total Cycles 141365 ---- Thread 02 ---- PC 5: Stalled ----- 101169 in-flight CPI 1.3971 -- Total Cycles 141365 ---- Thread 03 ---- PC 5: Stalled ----- 98220 in-flight CPI 1.4389 -- Total Cycles 141365 ---- Thread 04 ---- PC 5: Stalled ----- 99022 in-flight CPI 1.4273 -- Total Cycles 141365 ---- Thread 05 ---- PC 5: Stalled ----- 93330 in-flight CPI 1.5144 -- Total Cycles 141365 ---- Thread 06 ---- PC 5: Stalled ----- 98978 in-flight CPI 1.4280 -- Total Cycles 141365 ---- Thread 07 ---- PC 5: Stalled ----- 105804 in-flight CPI 1.3360 -- Total Cycles 141365 ---- Thread 08 ---- PC 5: Stalled ----- 101537 in-flight CPI 1.3920 -- Total Cycles 141365 ---- Thread 09 ---- PC 5: Stalled ----- 95189 in-flight CPI 1.4848 -- Total Cycles 141365 ---- Thread 10 ---- PC 5: Stalled ----- 95871 in-flight CPI 1.4743 -- Total Cycles 141365 ---- Thread 11 ---- PC 5: Stalled ----- 97465 in-flight CPI 1.4501 -- Total Cycles 141365 ---- Thread 12 ---- PC 5: Stalled ----- 99144 in-flight CPI 1.4256 -- Total Cycles 141365 ---- Thread 13 ---- PC 5: Stalled ----- 99071 in-flight CPI 1.4267 -- Total Cycles 141365 ---- Thread 14 ---- PC 5: Stalled ----- 93552 in-flight CPI 1.5109 -- Total Cycles 141365 ---- Thread 15 ---- PC 5: Stalled ----- 96813 in-flight CPI 1.4599 -- Total Cycles 141365 ---- Thread 16 ---- PC 5: Stalled ----- 85507 in-flight CPI 1.6530 -- Total Cycles 141365 ---- Thread 17 ---- PC 5: Stalled ----- 93061 in-flight CPI 1.5188 -- Total Cycles 141365 ---- Thread 18 ---- PC 5: Stalled ----- 92875 in-flight CPI 1.5218 -- Total Cycles 141365 ---- Thread 19 ---- PC 5: Stalled ----- 97511 in-flight CPI 1.4494 -- Total Cycles 141365 ---- Thread 20 ---- PC 5: Stalled ----- 98515 in-flight CPI 1.4347 -- Total Cycles 141365 ---- Thread 21 ---- PC 5: Stalled ----- 91410 in-flight CPI 1.5462 -- Total Cycles 141365 ---- Thread 22 ---- PC 5: Stalled ----- 91290 in-flight CPI 1.5482 -- Total Cycles 141365 ---- Thread 23 ---- PC 5: Stalled ----- 95240 in-flight CPI 1.4841 -- Total Cycles 141365 ---- Thread 24 ---- PC 5: Stalled ----- 98167 in-flight CPI 1.4398 -- Total Cycles 141365 ---- Thread 25 ---- PC 5: Stalled ----- 95825 in-flight CPI 1.4749 -- Total Cycles 141365 ---- Thread 26 ---- PC 5: Stalled ----- 86349 in-flight CPI 1.6369 -- Total Cycles 141365 ---- Thread 27 ---- PC 5: Stalled ----- 93632 in-flight CPI 1.5095 -- Total Cycles 141365 ---- Thread 28 ---- PC 5: Stalled ----- 84644 in-flight CPI 1.6698 -- Total Cycles 141365 ---- Thread 29 ---- PC 5: Stalled ----- 84702 in-flight CPI 1.6686 -- Total Cycles 141365 ---- Thread 30 ---- PC 5: Stalled ----- 84119 in-flight CPI 1.6802 -- Total Cycles 141365 ---- Thread 31 ---- PC 5: Stalled ----- 91686 in-flight CPI 1.5416 -- Total Cycles 141365 Total CPI 0.0465 , IPC 21.4837 -- Total Cycles 141365 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8462 (3.929545%) FPSUB: 0 (0.000000%) FPMUL: 33008 (15.328105%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 85725 (39.808582%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4081 (1.895116%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 76455 (35.503826%) DIV: 7357 (3.416410%) FPUN: 0 (0.000000%) FPRSUB: 255 (0.118416%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3338591 total) ADD%: 7.147 (238597) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.510 (50407) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.589 (19660) FPSUB%: 0.000 (0) FPMUL%: 4.886 (163117) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.190 (173260) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (575) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.080 (36043) FPLE%: 0.448 (14955) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.780 (92799) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.754 (25187) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.643 (522258) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.165 (38882) ORI%: 1.592 (53165) XORI%: 0.000 (0) MULI%: 3.177 (106070) LW%: 1.385 (46252) LWI%: 13.033 (435109) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9507) SWI%: 4.103 (136986) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.389 (46365) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10321) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.068 (2255) bned%: 0.000 (0) bneid%: 13.761 (459407) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.708 (23639) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.131 (4364) DIV%: 0.012 (398) FPUN%: 1.461 (48764) FPRSUB%: 4.304 (143681) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (76) FPGT%: 2.937 (98048) FPGE%: 1.013 (33809) SYNC%: 0.000 (0) NOP%: 9.030 (301490) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 153 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 382 LOAD 41156 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1387 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48874 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 12141 XORI 0 MULI 9143 LW 0 LWI 141878 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 68 DIV 17 FPUN 0 FPRSUB 53 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.4840 --Total thread-cycles: 4523680 --total thread-cycles issued: 3037101 (67.137840%) --iCache conflicts: 112449 (2.485786%) --thread*cycles of FU dependence: 255346 (5.644652%) --thread*cycles of data dependence: 215343 (4.760350%) --iCache cycles*banks: 4523680 (73.803253% used) Issue breakdown: --thread*cycles of issue worked: 3037101 (67.137840%) --thread*cycles of issue failed: 1185089 (26.197454%) --thread*cycles of issue NOP/other: 4614811651886324146 (102014541365248.000000%) Number of thread-cycles not ready: 215343 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3338591 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 9 4: 8 5: 7 6: 8 7: 5 8: 8 9: 8 10: 7 11: 8 12: 7 13: 7 14: 6 15: 7 16: 6 17: 7 18: 7 19: 9 20: 8 21: 8 22: 7 23: 6 24: 8 25: 8 26: 6 27: 7 28: 6 29: 7 30: 7 31: 7 <=== Core 11 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98967 in-flight CPI 1.3339 -- Total Cycles 132036 ---- Thread 01 ---- PC 5: Stalled ----- 92177 in-flight CPI 1.4322 -- Total Cycles 132036 ---- Thread 02 ---- PC 5: Stalled ----- 100502 in-flight CPI 1.3135 -- Total Cycles 132036 ---- Thread 03 ---- PC 5: Stalled ----- 94621 in-flight CPI 1.3952 -- Total Cycles 132036 ---- Thread 04 ---- PC 5: Stalled ----- 93150 in-flight CPI 1.4172 -- Total Cycles 132036 ---- Thread 05 ---- PC 5: Stalled ----- 94317 in-flight CPI 1.3997 -- Total Cycles 132036 ---- Thread 06 ---- PC 5: Stalled ----- 97794 in-flight CPI 1.3499 -- Total Cycles 132036 ---- Thread 07 ---- PC 5: Stalled ----- 99632 in-flight CPI 1.3251 -- Total Cycles 132036 ---- Thread 08 ---- PC 5: Stalled ----- 93145 in-flight CPI 1.4173 -- Total Cycles 132036 ---- Thread 09 ---- PC 5: Stalled ----- 94907 in-flight CPI 1.3910 -- Total Cycles 132036 ---- Thread 10 ---- PC 5: Stalled ----- 98404 in-flight CPI 1.3415 -- Total Cycles 132036 ---- Thread 11 ---- PC 5: Stalled ----- 97116 in-flight CPI 1.3593 -- Total Cycles 132036 ---- Thread 12 ---- PC 5: Stalled ----- 100716 in-flight CPI 1.3107 -- Total Cycles 132036 ---- Thread 13 ---- PC 5: Stalled ----- 96777 in-flight CPI 1.3641 -- Total Cycles 132036 ---- Thread 14 ---- PC 5: Stalled ----- 100930 in-flight CPI 1.3079 -- Total Cycles 132036 ---- Thread 15 ---- PC 5: Stalled ----- 91822 in-flight CPI 1.4377 -- Total Cycles 132036 ---- Thread 16 ---- PC 5: Stalled ----- 96316 in-flight CPI 1.3707 -- Total Cycles 132036 ---- Thread 17 ---- PC 5: Stalled ----- 92905 in-flight CPI 1.4209 -- Total Cycles 132036 ---- Thread 18 ---- PC 5: Stalled ----- 87393 in-flight CPI 1.5106 -- Total Cycles 132036 ---- Thread 19 ---- PC 5: Stalled ----- 95989 in-flight CPI 1.3753 -- Total Cycles 132036 ---- Thread 20 ---- PC 5: Stalled ----- 92448 in-flight CPI 1.4279 -- Total Cycles 132036 ---- Thread 21 ---- PC 5: Stalled ----- 99310 in-flight CPI 1.3293 -- Total Cycles 132036 ---- Thread 22 ---- PC 5: Stalled ----- 96018 in-flight CPI 1.3748 -- Total Cycles 132036 ---- Thread 23 ---- PC 5: Stalled ----- 94519 in-flight CPI 1.3967 -- Total Cycles 132036 ---- Thread 24 ---- PC 5: Stalled ----- 93705 in-flight CPI 1.4088 -- Total Cycles 132036 ---- Thread 25 ---- PC 5: Stalled ----- 91029 in-flight CPI 1.4503 -- Total Cycles 132036 ---- Thread 26 ---- PC 5: Stalled ----- 89684 in-flight CPI 1.4719 -- Total Cycles 132036 ---- Thread 27 ---- PC 5: Stalled ----- 95346 in-flight CPI 1.3845 -- Total Cycles 132036 ---- Thread 28 ---- PC 5: Stalled ----- 85338 in-flight CPI 1.5469 -- Total Cycles 132036 ---- Thread 29 ---- PC 5: Stalled ----- 80204 in-flight CPI 1.6460 -- Total Cycles 132036 ---- Thread 30 ---- PC 5: Stalled ----- 88358 in-flight CPI 1.4941 -- Total Cycles 132036 ---- Thread 31 ---- PC 5: Stalled ----- 93155 in-flight CPI 1.4171 -- Total Cycles 132036 Total CPI 0.0438 , IPC 22.8516 -- Total Cycles 132036 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8272 (4.065165%) FPSUB: 0 (0.000000%) FPMUL: 32430 (15.937292%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76421 (37.556084%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3893 (1.913163%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74856 (36.786987%) DIV: 7353 (3.613534%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.127774%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3315869 total) ADD%: 7.219 (239376) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.531 (50767) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.581 (19253) FPSUB%: 0.000 (0) FPMUL%: 4.852 (160898) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.178 (171697) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (566) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.073 (35581) FPLE%: 0.456 (15104) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.781 (92219) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.756 (25061) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.652 (519012) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (38816) ORI%: 1.587 (52627) XORI%: 0.000 (0) MULI%: 3.175 (105294) LW%: 1.386 (45945) LWI%: 13.012 (431452) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9466) SWI%: 4.106 (136139) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.389 (46049) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10248) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1954) bned%: 0.000 (0) bneid%: 13.770 (456605) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23683) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.129 (4270) DIV%: 0.012 (398) FPUN%: 1.476 (48958) FPRSUB%: 4.275 (141749) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) FPGT%: 2.927 (97039) FPGE%: 1.021 (33854) SYNC%: 0.000 (0) NOP%: 9.005 (298578) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 33 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 143 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 384 LOAD 39935 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 19 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1536 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48492 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 11875 XORI 0 MULI 8908 LW 0 LWI 140864 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 69 DIV 29 FPUN 0 FPRSUB 49 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.8519 --Total thread-cycles: 4225152 --total thread-cycles issued: 3017291 (71.412605%) --iCache conflicts: 110404 (2.613019%) --thread*cycles of FU dependence: 252398 (5.973702%) --thread*cycles of data dependence: 203485 (4.816040%) --iCache cycles*banks: 4225152 (78.480042% used) Issue breakdown: --thread*cycles of issue worked: 3017291 (71.412605%) --thread*cycles of issue failed: 909283 (21.520718%) --thread*cycles of issue NOP/other: 4699042 (111.215927%) Number of thread-cycles not ready: 203485 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3315869 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 7 4: 7 5: 6 6: 8 7: 5 8: 7 9: 7 10: 9 11: 7 12: 9 13: 7 14: 8 15: 6 16: 6 17: 8 18: 6 19: 6 20: 8 21: 8 22: 9 23: 7 24: 8 25: 6 26: 8 27: 8 28: 7 29: 5 30: 7 31: 8 <=== Core 12 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98000 in-flight CPI 1.4579 -- Total Cycles 142902 ---- Thread 01 ---- PC 5: Stalled ----- 95472 in-flight CPI 1.4965 -- Total Cycles 142902 ---- Thread 02 ---- PC 5: Stalled ----- 98309 in-flight CPI 1.4534 -- Total Cycles 142902 ---- Thread 03 ---- PC 5: Stalled ----- 93601 in-flight CPI 1.5264 -- Total Cycles 142902 ---- Thread 04 ---- PC 5: Stalled ----- 96400 in-flight CPI 1.4821 -- Total Cycles 142902 ---- Thread 05 ---- PC 5: Stalled ----- 97366 in-flight CPI 1.4674 -- Total Cycles 142902 ---- Thread 06 ---- PC 5: Stalled ----- 100543 in-flight CPI 1.4210 -- Total Cycles 142902 ---- Thread 07 ---- PC 5: Stalled ----- 102990 in-flight CPI 1.3873 -- Total Cycles 142902 ---- Thread 08 ---- PC 5: Stalled ----- 93526 in-flight CPI 1.5276 -- Total Cycles 142902 ---- Thread 09 ---- PC 5: Stalled ----- 97667 in-flight CPI 1.4629 -- Total Cycles 142902 ---- Thread 10 ---- PC 5: Stalled ----- 99823 in-flight CPI 1.4313 -- Total Cycles 142902 ---- Thread 11 ---- PC 5: Stalled ----- 96016 in-flight CPI 1.4880 -- Total Cycles 142902 ---- Thread 12 ---- PC 5: Stalled ----- 98089 in-flight CPI 1.4566 -- Total Cycles 142902 ---- Thread 13 ---- PC 5: Stalled ----- 94137 in-flight CPI 1.5177 -- Total Cycles 142902 ---- Thread 14 ---- PC 5: Stalled ----- 97430 in-flight CPI 1.4664 -- Total Cycles 142902 ---- Thread 15 ---- PC 5: Stalled ----- 96731 in-flight CPI 1.4770 -- Total Cycles 142902 ---- Thread 16 ---- PC 5: Stalled ----- 104413 in-flight CPI 1.3685 -- Total Cycles 142902 ---- Thread 17 ---- PC 5: Stalled ----- 90924 in-flight CPI 1.5714 -- Total Cycles 142902 ---- Thread 18 ---- PC 5: Stalled ----- 94910 in-flight CPI 1.5054 -- Total Cycles 142902 ---- Thread 19 ---- PC 5: Stalled ----- 90215 in-flight CPI 1.5837 -- Total Cycles 142902 ---- Thread 20 ---- PC 5: Stalled ----- 90978 in-flight CPI 1.5704 -- Total Cycles 142902 ---- Thread 21 ---- PC 5: Stalled ----- 99225 in-flight CPI 1.4399 -- Total Cycles 142902 ---- Thread 22 ---- PC 5: Stalled ----- 90764 in-flight CPI 1.5742 -- Total Cycles 142902 ---- Thread 23 ---- PC 5: Stalled ----- 95165 in-flight CPI 1.5013 -- Total Cycles 142902 ---- Thread 24 ---- PC 5: Stalled ----- 89415 in-flight CPI 1.5979 -- Total Cycles 142902 ---- Thread 25 ---- PC 5: Stalled ----- 90669 in-flight CPI 1.5759 -- Total Cycles 142902 ---- Thread 26 ---- PC 5: Stalled ----- 95967 in-flight CPI 1.4888 -- Total Cycles 142902 ---- Thread 27 ---- PC 5: Stalled ----- 91248 in-flight CPI 1.5657 -- Total Cycles 142902 ---- Thread 28 ---- PC 5: Stalled ----- 91255 in-flight CPI 1.5657 -- Total Cycles 142902 ---- Thread 29 ---- PC 5: Stalled ----- 91395 in-flight CPI 1.5632 -- Total Cycles 142902 ---- Thread 30 ---- PC 5: Stalled ----- 91994 in-flight CPI 1.5532 -- Total Cycles 142902 ---- Thread 31 ---- PC 5: Stalled ----- 87136 in-flight CPI 1.6397 -- Total Cycles 142902 Total CPI 0.0470 , IPC 21.2896 -- Total Cycles 142902 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8250 (3.887017%) FPSUB: 0 (0.000000%) FPMUL: 32731 (15.421329%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 84064 (39.607056%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4202 (1.979787%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75199 (35.430283%) DIV: 7536 (3.550614%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.123913%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3344280 total) ADD%: 7.167 (239674) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.516 (50687) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.579 (19379) FPSUB%: 0.000 (0) FPMUL%: 4.860 (162522) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.167 (172812) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (589) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.076 (35980) FPLE%: 0.453 (15140) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.782 (93029) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (25163) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.647 (523268) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.166 (38989) ORI%: 1.577 (52744) XORI%: 0.000 (0) MULI%: 3.183 (106450) LW%: 1.387 (46399) LWI%: 13.062 (436833) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9500) SWI%: 4.114 (137572) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.391 (46516) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10323) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.064 (2144) bned%: 0.000 (0) bneid%: 13.771 (460547) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.713 (23841) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4287) DIV%: 0.012 (408) FPUN%: 1.466 (49032) FPRSUB%: 4.278 (143061) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (68) FPGT%: 2.940 (98316) FPGE%: 1.013 (33892) SYNC%: 0.000 (0) NOP%: 9.027 (301895) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 24 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 157 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 40743 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 19 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1517 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49119 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 11827 XORI 0 MULI 9086 LW 0 LWI 142678 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 93 DIV 23 FPUN 0 FPRSUB 58 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.2899 --Total thread-cycles: 4572864 --total thread-cycles issued: 3042385 (66.531281%) --iCache conflicts: 112865 (2.468147%) --thread*cycles of FU dependence: 255799 (5.593847%) --thread*cycles of data dependence: 212245 (4.641402%) --iCache cycles*banks: 4572864 (73.133865% used) Issue breakdown: --thread*cycles of issue worked: 3042385 (66.531281%) --thread*cycles of issue failed: 1228584 (26.866838%) --thread*cycles of issue NOP/other: 4619085657594436423 (101010777309184.000000%) Number of thread-cycles not ready: 212245 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3344280 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 7 3: 7 4: 8 5: 8 6: 9 7: 8 8: 8 9: 8 10: 8 11: 8 12: 8 13: 8 14: 8 15: 8 16: 5 17: 6 18: 8 19: 8 20: 7 21: 8 22: 7 23: 8 24: 6 25: 6 26: 7 27: 9 28: 7 29: 8 30: 5 31: 6 <=== Core 13 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98508 in-flight CPI 1.3108 -- Total Cycles 129150 ---- Thread 01 ---- PC 5: Stalled ----- 95876 in-flight CPI 1.3468 -- Total Cycles 129150 ---- Thread 02 ---- PC 5: Stalled ----- 100046 in-flight CPI 1.2906 -- Total Cycles 129150 ---- Thread 03 ---- PC 5: Stalled ----- 96142 in-flight CPI 1.3431 -- Total Cycles 129150 ---- Thread 04 ---- PC 5: Stalled ----- 96599 in-flight CPI 1.3367 -- Total Cycles 129150 ---- Thread 05 ---- PC 5: Stalled ----- 96616 in-flight CPI 1.3365 -- Total Cycles 129150 ---- Thread 06 ---- PC 5: Stalled ----- 99204 in-flight CPI 1.3016 -- Total Cycles 129150 ---- Thread 07 ---- PC 5: Stalled ----- 98398 in-flight CPI 1.3123 -- Total Cycles 129150 ---- Thread 08 ---- PC 5: Stalled ----- 96034 in-flight CPI 1.3446 -- Total Cycles 129150 ---- Thread 09 ---- PC 5: Stalled ----- 97159 in-flight CPI 1.3290 -- Total Cycles 129150 ---- Thread 10 ---- PC 5: Stalled ----- 93656 in-flight CPI 1.3788 -- Total Cycles 129150 ---- Thread 11 ---- PC 5: Stalled ----- 101611 in-flight CPI 1.2707 -- Total Cycles 129150 ---- Thread 12 ---- PC 5: Stalled ----- 97343 in-flight CPI 1.3265 -- Total Cycles 129150 ---- Thread 13 ---- PC 5: Stalled ----- 103029 in-flight CPI 1.2533 -- Total Cycles 129150 ---- Thread 14 ---- PC 5: Stalled ----- 99494 in-flight CPI 1.2978 -- Total Cycles 129150 ---- Thread 15 ---- PC 5: Stalled ----- 95316 in-flight CPI 1.3547 -- Total Cycles 129150 ---- Thread 16 ---- PC 5: Stalled ----- 93835 in-flight CPI 1.3761 -- Total Cycles 129150 ---- Thread 17 ---- PC 5: Stalled ----- 96313 in-flight CPI 1.3406 -- Total Cycles 129150 ---- Thread 18 ---- PC 5: Stalled ----- 90851 in-flight CPI 1.4213 -- Total Cycles 129150 ---- Thread 19 ---- PC 5: Stalled ----- 90839 in-flight CPI 1.4214 -- Total Cycles 129150 ---- Thread 20 ---- PC 5: Stalled ----- 99190 in-flight CPI 1.3018 -- Total Cycles 129150 ---- Thread 21 ---- PC 5: Stalled ----- 90067 in-flight CPI 1.4337 -- Total Cycles 129150 ---- Thread 22 ---- PC 5: Stalled ----- 87364 in-flight CPI 1.4780 -- Total Cycles 129150 ---- Thread 23 ---- PC 5: Stalled ----- 96047 in-flight CPI 1.3443 -- Total Cycles 129150 ---- Thread 24 ---- PC 5: Stalled ----- 93970 in-flight CPI 1.3741 -- Total Cycles 129150 ---- Thread 25 ---- PC 5: Stalled ----- 85590 in-flight CPI 1.5087 -- Total Cycles 129150 ---- Thread 26 ---- PC 5: Stalled ----- 92865 in-flight CPI 1.3904 -- Total Cycles 129150 ---- Thread 27 ---- PC 5: Stalled ----- 91416 in-flight CPI 1.4125 -- Total Cycles 129150 ---- Thread 28 ---- PC 5: Stalled ----- 86983 in-flight CPI 1.4844 -- Total Cycles 129150 ---- Thread 29 ---- PC 5: Stalled ----- 89998 in-flight CPI 1.4348 -- Total Cycles 129150 ---- Thread 30 ---- PC 5: Stalled ----- 88062 in-flight CPI 1.4663 -- Total Cycles 129150 ---- Thread 31 ---- PC 5: Stalled ----- 91421 in-flight CPI 1.4124 -- Total Cycles 129150 Total CPI 0.0426 , IPC 23.4645 -- Total Cycles 129150 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 6993 (4.150395%) FPSUB: 0 (0.000000%) FPMUL: 30125 (17.879400%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 53007 (31.460026%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4461 (2.647635%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 65580 (38.922192%) DIV: 8046 (4.775358%) FPUN: 0 (0.000000%) FPRSUB: 278 (0.164995%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3330731 total) ADD%: 7.236 (241024) SUB%: 0.000 (0) MUL%: 0.007 (218) BITOR%: 1.537 (51197) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.504 (16771) FPSUB%: 0.000 (0) FPMUL%: 4.626 (154069) FPCMPLT%: 0.000 (0) FPMIN%: 0.020 (654) FPMAX%: 0.020 (654) LOAD%: 5.073 (168972) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.008 (250) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (624) FPINV%: 0.000 (0) FPCONV%: 0.021 (686) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.046 (34823) FPLE%: 0.458 (15255) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.020 (654) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.836 (94463) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.727 (24215) CMPU%: 0.000 (0) RSUB%: 0.007 (218) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.711 (523296) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.180 (39303) ORI%: 1.533 (51054) XORI%: 0.000 (0) MULI%: 3.239 (107888) LW%: 1.413 (47075) LWI%: 13.202 (439723) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.291 (9690) SWI%: 4.195 (139718) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.417 (47199) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10354) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.048 (1586) bned%: 0.000 (0) bneid%: 13.848 (461225) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.725 (24140) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.111 (3711) DIV%: 0.013 (436) FPUN%: 1.495 (49810) FPRSUB%: 4.088 (136145) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (76) FPGT%: 2.962 (98669) FPGE%: 1.037 (34555) SYNC%: 0.000 (0) NOP%: 9.014 (300235) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 24 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 166 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 424 LOAD 37381 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1530 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49599 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 9891 XORI 0 MULI 9780 LW 0 LWI 142890 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 74 DIV 20 FPUN 0 FPRSUB 33 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4648 --Total thread-cycles: 4132800 --total thread-cycles issued: 3030496 (73.327911%) --iCache conflicts: 114399 (2.768075%) --thread*cycles of FU dependence: 251875 (6.094537%) --thread*cycles of data dependence: 168490 (4.076897%) --iCache cycles*banks: 4132800 (80.593376% used) Issue breakdown: --thread*cycles of issue worked: 3030496 (73.327911%) --thread*cycles of issue failed: 802069 (19.407398%) --thread*cycles of issue NOP/other: 4616440254629713099 (111702477635584.000000%) Number of thread-cycles not ready: 168490 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3330731 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 8 4: 8 5: 7 6: 8 7: 8 8: 7 9: 8 10: 6 11: 9 12: 7 13: 8 14: 9 15: 8 16: 8 17: 9 18: 8 19: 8 20: 8 21: 7 22: 7 23: 9 24: 8 25: 6 26: 8 27: 8 28: 8 29: 7 30: 7 31: 8 <=== Core 14 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96033 in-flight CPI 1.3153 -- Total Cycles 126336 ---- Thread 01 ---- PC 5: Stalled ----- 100912 in-flight CPI 1.2517 -- Total Cycles 126336 ---- Thread 02 ---- PC 5: Stalled ----- 100444 in-flight CPI 1.2575 -- Total Cycles 126336 ---- Thread 03 ---- PC 5: Stalled ----- 98652 in-flight CPI 1.2804 -- Total Cycles 126336 ---- Thread 04 ---- PC 5: Stalled ----- 96807 in-flight CPI 1.3048 -- Total Cycles 126336 ---- Thread 05 ---- PC 5: Stalled ----- 102527 in-flight CPI 1.2320 -- Total Cycles 126336 ---- Thread 06 ---- PC 5: Stalled ----- 97745 in-flight CPI 1.2922 -- Total Cycles 126336 ---- Thread 07 ---- PC 5: Stalled ----- 94207 in-flight CPI 1.3408 -- Total Cycles 126336 ---- Thread 08 ---- PC 5: Stalled ----- 99078 in-flight CPI 1.2749 -- Total Cycles 126336 ---- Thread 09 ---- PC 5: Stalled ----- 100255 in-flight CPI 1.2600 -- Total Cycles 126336 ---- Thread 10 ---- PC 5: Stalled ----- 96884 in-flight CPI 1.3038 -- Total Cycles 126336 ---- Thread 11 ---- PC 5: Stalled ----- 100331 in-flight CPI 1.2589 -- Total Cycles 126336 ---- Thread 12 ---- PC 5: Stalled ----- 94804 in-flight CPI 1.3324 -- Total Cycles 126336 ---- Thread 13 ---- PC 5: Stalled ----- 97255 in-flight CPI 1.2988 -- Total Cycles 126336 ---- Thread 14 ---- PC 5: Stalled ----- 96826 in-flight CPI 1.3046 -- Total Cycles 126336 ---- Thread 15 ---- PC 5: Stalled ----- 97804 in-flight CPI 1.2915 -- Total Cycles 126336 ---- Thread 16 ---- PC 5: Stalled ----- 95314 in-flight CPI 1.3252 -- Total Cycles 126336 ---- Thread 17 ---- PC 5: Stalled ----- 93796 in-flight CPI 1.3467 -- Total Cycles 126336 ---- Thread 18 ---- PC 5: Stalled ----- 90620 in-flight CPI 1.3939 -- Total Cycles 126336 ---- Thread 19 ---- PC 5: Stalled ----- 99584 in-flight CPI 1.2684 -- Total Cycles 126336 ---- Thread 20 ---- PC 5: Stalled ----- 92989 in-flight CPI 1.3583 -- Total Cycles 126336 ---- Thread 21 ---- PC 5: Stalled ----- 92515 in-flight CPI 1.3652 -- Total Cycles 126336 ---- Thread 22 ---- PC 5: Stalled ----- 90552 in-flight CPI 1.3949 -- Total Cycles 126336 ---- Thread 23 ---- PC 5: Stalled ----- 93628 in-flight CPI 1.3491 -- Total Cycles 126336 ---- Thread 24 ---- PC 5: Stalled ----- 90461 in-flight CPI 1.3963 -- Total Cycles 126336 ---- Thread 25 ---- PC 5: Stalled ----- 91985 in-flight CPI 1.3732 -- Total Cycles 126336 ---- Thread 26 ---- PC 5: Stalled ----- 90869 in-flight CPI 1.3901 -- Total Cycles 126336 ---- Thread 27 ---- PC 5: Stalled ----- 90284 in-flight CPI 1.3991 -- Total Cycles 126336 ---- Thread 28 ---- PC 5: Stalled ----- 86824 in-flight CPI 1.4548 -- Total Cycles 126336 ---- Thread 29 ---- PC 5: Stalled ----- 90787 in-flight CPI 1.3913 -- Total Cycles 126336 ---- Thread 30 ---- PC 5: Stalled ----- 81466 in-flight CPI 1.5505 -- Total Cycles 126336 ---- Thread 31 ---- PC 5: Stalled ----- 90113 in-flight CPI 1.4017 -- Total Cycles 126336 Total CPI 0.0417 , IPC 24.0067 -- Total Cycles 126336 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7196 (3.887964%) FPSUB: 0 (0.000000%) FPMUL: 30633 (16.550863%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 68947 (37.251736%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4127 (2.229798%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 66361 (35.854530%) DIV: 7559 (4.084092%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.141017%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3333864 total) ADD%: 7.258 (241966) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.523 (50766) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.514 (17145) FPSUB%: 0.000 (0) FPMUL%: 4.667 (155601) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.098 (169957) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (586) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.050 (34993) FPLE%: 0.458 (15259) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.830 (94365) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.730 (24350) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.707 (523635) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39197) ORI%: 1.535 (51179) XORI%: 0.000 (0) MULI%: 3.231 (107718) LW%: 1.411 (47038) LWI%: 13.173 (439171) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9663) SWI%: 4.161 (138712) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.414 (47150) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10378) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1830) bned%: 0.000 (0) bneid%: 13.824 (460885) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.722 (24073) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.113 (3766) DIV%: 0.012 (410) FPUN%: 1.483 (49438) FPRSUB%: 4.125 (137524) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) FPGT%: 2.961 (98731) FPGE%: 1.025 (34179) SYNC%: 0.000 (0) NOP%: 9.026 (300898) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 152 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 38344 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1090 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 3 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49466 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 10285 XORI 0 MULI 9679 LW 0 LWI 142763 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 64 DIV 22 FPUN 0 FPRSUB 38 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.0069 --Total thread-cycles: 4042752 --total thread-cycles issued: 3032966 (75.022308%) --iCache conflicts: 114248 (2.825996%) --thread*cycles of FU dependence: 252392 (6.243074%) --thread*cycles of data dependence: 185084 (4.578169%) --iCache cycles*banks: 4042752 (82.466003% used) Issue breakdown: --thread*cycles of issue worked: 3032966 (75.022308%) --thread*cycles of issue failed: 708888 (17.534788%) --thread*cycles of issue NOP/other: 4701362 (116.291130%) Number of thread-cycles not ready: 185084 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3333864 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 8 3: 7 4: 7 5: 9 6: 9 7: 7 8: 7 9: 6 10: 7 11: 9 12: 7 13: 8 14: 7 15: 8 16: 7 17: 6 18: 6 19: 8 20: 8 21: 9 22: 7 23: 8 24: 7 25: 8 26: 6 27: 7 28: 7 29: 7 30: 6 31: 8 <=== Core 15 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101691 in-flight CPI 1.6826 -- Total Cycles 171144 ---- Thread 01 ---- PC 5: Stalled ----- 95119 in-flight CPI 1.7989 -- Total Cycles 171144 ---- Thread 02 ---- PC 5: Stalled ----- 95186 in-flight CPI 1.7976 -- Total Cycles 171144 ---- Thread 03 ---- PC 5: Stalled ----- 96876 in-flight CPI 1.7663 -- Total Cycles 171144 ---- Thread 04 ---- PC 5: Stalled ----- 96515 in-flight CPI 1.7729 -- Total Cycles 171144 ---- Thread 05 ---- PC 5: Stalled ----- 93158 in-flight CPI 1.8369 -- Total Cycles 171144 ---- Thread 06 ---- PC 5: Stalled ----- 99535 in-flight CPI 1.7190 -- Total Cycles 171144 ---- Thread 07 ---- PC 5: Stalled ----- 97174 in-flight CPI 1.7609 -- Total Cycles 171144 ---- Thread 08 ---- PC 5: Stalled ----- 96537 in-flight CPI 1.7725 -- Total Cycles 171144 ---- Thread 09 ---- PC 5: Stalled ----- 99685 in-flight CPI 1.7165 -- Total Cycles 171144 ---- Thread 10 ---- PC 5: Stalled ----- 97949 in-flight CPI 1.7469 -- Total Cycles 171144 ---- Thread 11 ---- PC 5: Stalled ----- 97568 in-flight CPI 1.7538 -- Total Cycles 171144 ---- Thread 12 ---- PC 5: Stalled ----- 99632 in-flight CPI 1.7174 -- Total Cycles 171144 ---- Thread 13 ---- PC 5: Stalled ----- 97491 in-flight CPI 1.7552 -- Total Cycles 171144 ---- Thread 14 ---- PC 5: Stalled ----- 96303 in-flight CPI 1.7767 -- Total Cycles 171144 ---- Thread 15 ---- PC 5: Stalled ----- 98083 in-flight CPI 1.7447 -- Total Cycles 171144 ---- Thread 16 ---- PC 5: Stalled ----- 96909 in-flight CPI 1.7657 -- Total Cycles 171144 ---- Thread 17 ---- PC 5: Stalled ----- 96859 in-flight CPI 1.7666 -- Total Cycles 171144 ---- Thread 18 ---- PC 5: Stalled ----- 98678 in-flight CPI 1.7340 -- Total Cycles 171144 ---- Thread 19 ---- PC 5: Stalled ----- 94568 in-flight CPI 1.8094 -- Total Cycles 171144 ---- Thread 20 ---- PC 5: Stalled ----- 94861 in-flight CPI 1.8038 -- Total Cycles 171144 ---- Thread 21 ---- PC 5: Stalled ----- 92517 in-flight CPI 1.8495 -- Total Cycles 171144 ---- Thread 22 ---- PC 5: Stalled ----- 93860 in-flight CPI 1.8231 -- Total Cycles 171144 ---- Thread 23 ---- PC 5: Stalled ----- 87267 in-flight CPI 1.9608 -- Total Cycles 171144 ---- Thread 24 ---- PC 5: Stalled ----- 93655 in-flight CPI 1.8270 -- Total Cycles 171144 ---- Thread 25 ---- PC 5: Stalled ----- 93075 in-flight CPI 1.8385 -- Total Cycles 171144 ---- Thread 26 ---- PC 5: Stalled ----- 94201 in-flight CPI 1.8165 -- Total Cycles 171144 ---- Thread 27 ---- PC 5: Stalled ----- 86593 in-flight CPI 1.9760 -- Total Cycles 171144 ---- Thread 28 ---- PC 5: Stalled ----- 93961 in-flight CPI 1.8211 -- Total Cycles 171144 ---- Thread 29 ---- PC 5: Stalled ----- 88675 in-flight CPI 1.9297 -- Total Cycles 171144 ---- Thread 30 ---- PC 5: Stalled ----- 118876 in-flight CPI 1.4396 -- Total Cycles 171144 ---- Thread 31 ---- PC 5: Stalled ----- 87994 in-flight CPI 1.9446 -- Total Cycles 171144 Total CPI 0.0557 , IPC 17.9476 -- Total Cycles 171144 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7935 (3.802582%) FPSUB: 0 (0.000000%) FPMUL: 32198 (15.429810%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 83373 (39.953709%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4481 (2.147369%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72701 (34.839512%) DIV: 7718 (3.698592%) FPUN: 0 (0.000000%) FPRSUB: 268 (0.128430%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3375824 total) ADD%: 7.213 (243515) SUB%: 0.000 (0) MUL%: 0.006 (209) BITOR%: 1.527 (51564) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.553 (18679) FPSUB%: 0.000 (0) FPMUL%: 4.777 (161247) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (627) FPMAX%: 0.019 (627) LOAD%: 5.136 (173374) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (612) FPINV%: 0.000 (0) FPCONV%: 0.020 (659) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35924) FPLE%: 0.453 (15309) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (627) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.800 (94524) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (25329) CMPU%: 0.000 (0) RSUB%: 0.006 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.666 (528859) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39674) ORI%: 1.560 (52649) XORI%: 0.000 (0) MULI%: 3.201 (108056) LW%: 1.397 (47148) LWI%: 13.108 (442505) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9650) SWI%: 4.143 (139844) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (47278) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10442) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1857) bned%: 0.000 (0) bneid%: 13.794 (465656) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (24134) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4137) DIV%: 0.012 (418) FPUN%: 1.476 (49825) FPRSUB%: 4.213 (142237) FPSQRT%: 0.000 (0) FPNEG%: 0.003 (89) FPGT%: 2.942 (99332) FPGE%: 1.022 (34516) SYNC%: 0.000 (0) NOP%: 9.010 (304146) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 157 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 40125 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1975 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49888 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 11257 XORI 0 MULI 9163 LW 0 LWI 144312 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 73 DIV 30 FPUN 0 FPRSUB 41 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 17.9478 --Total thread-cycles: 5476608 --total thread-cycles issued: 3071678 (56.087238%) --iCache conflicts: 112637 (2.056693%) --thread*cycles of FU dependence: 257508 (4.701962%) --thread*cycles of data dependence: 208674 (3.810278%) --iCache cycles*banks: 5476608 (61.641365% used) Issue breakdown: --thread*cycles of issue worked: 3071678 (56.087238%) --thread*cycles of issue failed: 2100784 (38.359219%) --thread*cycles of issue NOP/other: 4619523643863770130 (84350095851520.000000%) Number of thread-cycles not ready: 208674 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3375824 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 8 3: 8 4: 8 5: 6 6: 9 7: 8 8: 8 9: 8 10: 8 11: 7 12: 8 13: 6 14: 9 15: 6 16: 7 17: 8 18: 8 19: 8 20: 8 21: 8 22: 7 23: 7 24: 8 25: 7 26: 7 27: 7 28: 8 29: 7 30: 5 31: 7 <=== Core 16 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101774 in-flight CPI 1.2778 -- Total Cycles 130073 ---- Thread 01 ---- PC 5: Stalled ----- 100726 in-flight CPI 1.2911 -- Total Cycles 130073 ---- Thread 02 ---- PC 5: Stalled ----- 95398 in-flight CPI 1.3632 -- Total Cycles 130073 ---- Thread 03 ---- PC 5: Stalled ----- 98023 in-flight CPI 1.3267 -- Total Cycles 130073 ---- Thread 04 ---- PC 5: Stalled ----- 98315 in-flight CPI 1.3228 -- Total Cycles 130073 ---- Thread 05 ---- PC 5: Stalled ----- 100569 in-flight CPI 1.2931 -- Total Cycles 130073 ---- Thread 06 ---- PC 5: Stalled ----- 99911 in-flight CPI 1.3016 -- Total Cycles 130073 ---- Thread 07 ---- PC 5: Stalled ----- 100752 in-flight CPI 1.2908 -- Total Cycles 130073 ---- Thread 08 ---- PC 5: Stalled ----- 99144 in-flight CPI 1.3117 -- Total Cycles 130073 ---- Thread 09 ---- PC 5: Stalled ----- 97296 in-flight CPI 1.3367 -- Total Cycles 130073 ---- Thread 10 ---- PC 5: Stalled ----- 97422 in-flight CPI 1.3349 -- Total Cycles 130073 ---- Thread 11 ---- PC 5: Stalled ----- 98877 in-flight CPI 1.3152 -- Total Cycles 130073 ---- Thread 12 ---- PC 5: Stalled ----- 94971 in-flight CPI 1.3694 -- Total Cycles 130073 ---- Thread 13 ---- PC 5: Stalled ----- 98653 in-flight CPI 1.3182 -- Total Cycles 130073 ---- Thread 14 ---- PC 5: Stalled ----- 100739 in-flight CPI 1.2909 -- Total Cycles 130073 ---- Thread 15 ---- PC 5: Stalled ----- 91703 in-flight CPI 1.4181 -- Total Cycles 130073 ---- Thread 16 ---- PC 5: Stalled ----- 99390 in-flight CPI 1.3085 -- Total Cycles 130073 ---- Thread 17 ---- PC 5: Stalled ----- 92400 in-flight CPI 1.4075 -- Total Cycles 130073 ---- Thread 18 ---- PC 5: Stalled ----- 94971 in-flight CPI 1.3694 -- Total Cycles 130073 ---- Thread 19 ---- PC 5: Stalled ----- 92685 in-flight CPI 1.4031 -- Total Cycles 130073 ---- Thread 20 ---- PC 5: Stalled ----- 94289 in-flight CPI 1.3793 -- Total Cycles 130073 ---- Thread 21 ---- PC 5: Stalled ----- 92496 in-flight CPI 1.4061 -- Total Cycles 130073 ---- Thread 22 ---- PC 5: Stalled ----- 92595 in-flight CPI 1.4045 -- Total Cycles 130073 ---- Thread 23 ---- PC 5: Stalled ----- 91482 in-flight CPI 1.4216 -- Total Cycles 130073 ---- Thread 24 ---- PC 5: Stalled ----- 92364 in-flight CPI 1.4080 -- Total Cycles 130073 ---- Thread 25 ---- PC 5: Stalled ----- 88342 in-flight CPI 1.4721 -- Total Cycles 130073 ---- Thread 26 ---- PC 5: Stalled ----- 93078 in-flight CPI 1.3972 -- Total Cycles 130073 ---- Thread 27 ---- PC 5: Stalled ----- 90851 in-flight CPI 1.4315 -- Total Cycles 130073 ---- Thread 28 ---- PC 5: Stalled ----- 89454 in-flight CPI 1.4539 -- Total Cycles 130073 ---- Thread 29 ---- PC 5: Stalled ----- 90177 in-flight CPI 1.4421 -- Total Cycles 130073 ---- Thread 30 ---- PC 5: Stalled ----- 90690 in-flight CPI 1.4340 -- Total Cycles 130073 ---- Thread 31 ---- PC 5: Stalled ----- 90716 in-flight CPI 1.4336 -- Total Cycles 130073 Total CPI 0.0426 , IPC 23.4546 -- Total Cycles 130073 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8569 (4.273048%) FPSUB: 0 (0.000000%) FPMUL: 33190 (16.550644%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 68473 (34.144993%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4220 (2.104360%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 78291 (39.040871%) DIV: 7531 (3.755435%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.130650%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3353225 total) ADD%: 7.122 (238804) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.510 (50645) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.593 (19881) FPSUB%: 0.000 (0) FPMUL%: 4.902 (164377) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.199 (174321) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (590) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.080 (36227) FPLE%: 0.446 (14961) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.776 (93099) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.759 (25457) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.629 (524074) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.167 (39127) ORI%: 1.593 (53421) XORI%: 0.000 (0) MULI%: 3.175 (106460) LW%: 1.385 (46426) LWI%: 13.048 (437518) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9515) SWI%: 4.113 (137909) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.388 (46544) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10331) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.066 (2210) bned%: 0.000 (0) bneid%: 13.746 (460947) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.708 (23734) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.133 (4469) DIV%: 0.012 (408) FPUN%: 1.458 (48880) FPRSUB%: 4.326 (145045) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.931 (98276) FPGE%: 1.012 (33919) SYNC%: 0.000 (0) NOP%: 9.017 (302360) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 155 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 41398 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1125 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 14 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49141 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 12219 XORI 0 MULI 9038 LW 0 LWI 142803 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 90 DIV 16 FPUN 0 FPRSUB 50 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4548 --Total thread-cycles: 4162336 --total thread-cycles issued: 3050865 (73.296944%) --iCache conflicts: 114511 (2.751123%) --thread*cycles of FU dependence: 256533 (6.163198%) --thread*cycles of data dependence: 200536 (4.817872%) --iCache cycles*banks: 4162336 (80.561897% used) Issue breakdown: --thread*cycles of issue worked: 3050865 (73.296944%) --thread*cycles of issue failed: 809111 (19.438868%) --thread*cycles of issue NOP/other: 302648 (7.271110%) Number of thread-cycles not ready: 200536 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3353225 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 7 4: 6 5: 9 6: 9 7: 8 8: 7 9: 7 10: 7 11: 9 12: 7 13: 10 14: 9 15: 8 16: 6 17: 7 18: 6 19: 7 20: 7 21: 6 22: 7 23: 7 24: 7 25: 7 26: 7 27: 7 28: 6 29: 8 30: 7 31: 7 <=== Core 17 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101021 in-flight CPI 1.6133 -- Total Cycles 163000 ---- Thread 01 ---- PC 5: Stalled ----- 102295 in-flight CPI 1.5931 -- Total Cycles 163000 ---- Thread 02 ---- PC 5: Stalled ----- 98650 in-flight CPI 1.6519 -- Total Cycles 163000 ---- Thread 03 ---- PC 5: Stalled ----- 100175 in-flight CPI 1.6268 -- Total Cycles 163000 ---- Thread 04 ---- PC 5: Stalled ----- 106888 in-flight CPI 1.5246 -- Total Cycles 163000 ---- Thread 05 ---- PC 5: Stalled ----- 96089 in-flight CPI 1.6960 -- Total Cycles 163000 ---- Thread 06 ---- PC 5: Stalled ----- 99370 in-flight CPI 1.6400 -- Total Cycles 163000 ---- Thread 07 ---- PC 5: Stalled ----- 98417 in-flight CPI 1.6559 -- Total Cycles 163000 ---- Thread 08 ---- PC 5: Stalled ----- 93386 in-flight CPI 1.7451 -- Total Cycles 163000 ---- Thread 09 ---- PC 5: Stalled ----- 98287 in-flight CPI 1.6581 -- Total Cycles 163000 ---- Thread 10 ---- PC 5: Stalled ----- 100600 in-flight CPI 1.6201 -- Total Cycles 163000 ---- Thread 11 ---- PC 5: Stalled ----- 99194 in-flight CPI 1.6429 -- Total Cycles 163000 ---- Thread 12 ---- PC 5: Stalled ----- 98512 in-flight CPI 1.6543 -- Total Cycles 163000 ---- Thread 13 ---- PC 5: Stalled ----- 98336 in-flight CPI 1.6572 -- Total Cycles 163000 ---- Thread 14 ---- PC 5: Stalled ----- 95503 in-flight CPI 1.7064 -- Total Cycles 163000 ---- Thread 15 ---- PC 5: Stalled ----- 97891 in-flight CPI 1.6648 -- Total Cycles 163000 ---- Thread 16 ---- PC 5: Stalled ----- 92934 in-flight CPI 1.7536 -- Total Cycles 163000 ---- Thread 17 ---- PC 5: Stalled ----- 96703 in-flight CPI 1.6853 -- Total Cycles 163000 ---- Thread 18 ---- PC 5: Stalled ----- 116436 in-flight CPI 1.3998 -- Total Cycles 163000 ---- Thread 19 ---- PC 5: Stalled ----- 91503 in-flight CPI 1.7810 -- Total Cycles 163000 ---- Thread 20 ---- PC 5: Stalled ----- 96220 in-flight CPI 1.6937 -- Total Cycles 163000 ---- Thread 21 ---- PC 5: Stalled ----- 90612 in-flight CPI 1.7986 -- Total Cycles 163000 ---- Thread 22 ---- PC 5: Stalled ----- 96254 in-flight CPI 1.6932 -- Total Cycles 163000 ---- Thread 23 ---- PC 5: Stalled ----- 92834 in-flight CPI 1.7556 -- Total Cycles 163000 ---- Thread 24 ---- PC 5: Stalled ----- 94114 in-flight CPI 1.7316 -- Total Cycles 163000 ---- Thread 25 ---- PC 5: Stalled ----- 94249 in-flight CPI 1.7290 -- Total Cycles 163000 ---- Thread 26 ---- PC 5: Stalled ----- 88615 in-flight CPI 1.8390 -- Total Cycles 163000 ---- Thread 27 ---- PC 5: Stalled ----- 87115 in-flight CPI 1.8708 -- Total Cycles 163000 ---- Thread 28 ---- PC 5: Stalled ----- 89068 in-flight CPI 1.8297 -- Total Cycles 163000 ---- Thread 29 ---- PC 5: Stalled ----- 90711 in-flight CPI 1.7966 -- Total Cycles 163000 ---- Thread 30 ---- PC 5: Stalled ----- 85719 in-flight CPI 1.9011 -- Total Cycles 163000 ---- Thread 31 ---- PC 5: Stalled ----- 87674 in-flight CPI 1.8588 -- Total Cycles 163000 Total CPI 0.0530 , IPC 18.8708 -- Total Cycles 163000 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7995 (3.745590%) FPSUB: 0 (0.000000%) FPMUL: 32304 (15.134152%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 88165 (41.304562%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4178 (1.957358%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72943 (34.173183%) DIV: 7605 (3.562878%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.122276%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3380326 total) ADD%: 7.196 (243265) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.531 (51749) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.555 (18768) FPSUB%: 0.000 (0) FPMUL%: 4.784 (161729) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (618) FPMAX%: 0.018 (618) LOAD%: 5.149 (174053) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (590) FPINV%: 0.000 (0) FPCONV%: 0.019 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35925) FPLE%: 0.456 (15425) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.803 (94766) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (25278) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.684 (530185) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39762) ORI%: 1.562 (52815) XORI%: 0.000 (0) MULI%: 3.199 (108130) LW%: 1.397 (47229) LWI%: 13.084 (442270) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9713) SWI%: 4.136 (139800) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.401 (47343) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10458) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1837) bned%: 0.000 (0) bneid%: 13.792 (466214) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (24247) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4152) DIV%: 0.012 (412) FPUN%: 1.480 (50017) FPRSUB%: 4.220 (142663) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.937 (99286) FPGE%: 1.023 (34592) SYNC%: 0.000 (0) NOP%: 9.003 (304333) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 10 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 158 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 39311 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1217 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49765 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 11420 XORI 0 MULI 9796 LW 0 LWI 144293 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 78 DIV 22 FPUN 0 FPRSUB 49 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 18.8710 --Total thread-cycles: 5216000 --total thread-cycles issued: 3075993 (58.972256%) --iCache conflicts: 111925 (2.145801%) --thread*cycles of FU dependence: 256579 (4.919076%) --thread*cycles of data dependence: 213451 (4.092236%) --iCache cycles*banks: 5216000 (64.807472% used) Issue breakdown: --thread*cycles of issue worked: 3075993 (58.972256%) --thread*cycles of issue failed: 1835674 (35.193138%) --thread*cycles of issue NOP/other: 4618514477409936589 (88545138049024.000000%) Number of thread-cycles not ready: 213451 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3380326 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 9 3: 8 4: 9 5: 8 6: 8 7: 8 8: 7 9: 8 10: 5 11: 8 12: 8 13: 9 14: 8 15: 8 16: 7 17: 7 18: 5 19: 7 20: 7 21: 6 22: 6 23: 6 24: 8 25: 9 26: 8 27: 6 28: 7 29: 7 30: 8 31: 7 <=== Core 18 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103504 in-flight CPI 1.2301 -- Total Cycles 127349 ---- Thread 01 ---- PC 5: Stalled ----- 98800 in-flight CPI 1.2887 -- Total Cycles 127349 ---- Thread 02 ---- PC 5: Stalled ----- 95252 in-flight CPI 1.3367 -- Total Cycles 127349 ---- Thread 03 ---- PC 5: Stalled ----- 96598 in-flight CPI 1.3181 -- Total Cycles 127349 ---- Thread 04 ---- PC 5: Stalled ----- 104390 in-flight CPI 1.2197 -- Total Cycles 127349 ---- Thread 05 ---- PC 5: Stalled ----- 95433 in-flight CPI 1.3342 -- Total Cycles 127349 ---- Thread 06 ---- PC 5: Stalled ----- 97683 in-flight CPI 1.3035 -- Total Cycles 127349 ---- Thread 07 ---- PC 5: Stalled ----- 97703 in-flight CPI 1.3032 -- Total Cycles 127349 ---- Thread 08 ---- PC 5: Stalled ----- 99895 in-flight CPI 1.2746 -- Total Cycles 127349 ---- Thread 09 ---- PC 5: Stalled ----- 94283 in-flight CPI 1.3504 -- Total Cycles 127349 ---- Thread 10 ---- PC 5: Stalled ----- 97233 in-flight CPI 1.3095 -- Total Cycles 127349 ---- Thread 11 ---- PC 5: Stalled ----- 99932 in-flight CPI 1.2740 -- Total Cycles 127349 ---- Thread 12 ---- PC 5: Stalled ----- 98173 in-flight CPI 1.2969 -- Total Cycles 127349 ---- Thread 13 ---- PC 5: Stalled ----- 100244 in-flight CPI 1.2701 -- Total Cycles 127349 ---- Thread 14 ---- PC 5: Stalled ----- 95914 in-flight CPI 1.3275 -- Total Cycles 127349 ---- Thread 15 ---- PC 5: Stalled ----- 93007 in-flight CPI 1.3690 -- Total Cycles 127349 ---- Thread 16 ---- PC 5: Stalled ----- 99364 in-flight CPI 1.2814 -- Total Cycles 127349 ---- Thread 17 ---- PC 5: Stalled ----- 90403 in-flight CPI 1.4085 -- Total Cycles 127349 ---- Thread 18 ---- PC 5: Stalled ----- 93613 in-flight CPI 1.3601 -- Total Cycles 127349 ---- Thread 19 ---- PC 5: Stalled ----- 94027 in-flight CPI 1.3541 -- Total Cycles 127349 ---- Thread 20 ---- PC 5: Stalled ----- 89647 in-flight CPI 1.4203 -- Total Cycles 127349 ---- Thread 21 ---- PC 5: Stalled ----- 91417 in-flight CPI 1.3928 -- Total Cycles 127349 ---- Thread 22 ---- PC 5: Stalled ----- 97132 in-flight CPI 1.3109 -- Total Cycles 127349 ---- Thread 23 ---- PC 5: Stalled ----- 90097 in-flight CPI 1.4133 -- Total Cycles 127349 ---- Thread 24 ---- PC 5: Stalled ----- 92950 in-flight CPI 1.3698 -- Total Cycles 127349 ---- Thread 25 ---- PC 5: Stalled ----- 89430 in-flight CPI 1.4237 -- Total Cycles 127349 ---- Thread 26 ---- PC 5: Stalled ----- 89073 in-flight CPI 1.4295 -- Total Cycles 127349 ---- Thread 27 ---- PC 5: Stalled ----- 90951 in-flight CPI 1.3999 -- Total Cycles 127349 ---- Thread 28 ---- PC 5: Stalled ----- 91021 in-flight CPI 1.3988 -- Total Cycles 127349 ---- Thread 29 ---- PC 5: Stalled ----- 92633 in-flight CPI 1.3746 -- Total Cycles 127349 ---- Thread 30 ---- PC 5: Stalled ----- 90162 in-flight CPI 1.4121 -- Total Cycles 127349 ---- Thread 31 ---- PC 5: Stalled ----- 85660 in-flight CPI 1.4864 -- Total Cycles 127349 Total CPI 0.0419 , IPC 23.8416 -- Total Cycles 127349 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7015 (4.041690%) FPSUB: 0 (0.000000%) FPMUL: 30138 (17.364000%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 58598 (33.761223%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4298 (2.476291%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 65498 (37.736656%) DIV: 7759 (4.470346%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.149799%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3336939 total) ADD%: 7.227 (241148) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.543 (51479) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.501 (16718) FPSUB%: 0.000 (0) FPMUL%: 4.624 (154311) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.079 (169498) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (603) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.044 (34823) FPLE%: 0.458 (15271) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.839 (94745) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.729 (24342) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.711 (524269) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.181 (39399) ORI%: 1.537 (51304) XORI%: 0.000 (0) MULI%: 3.239 (108082) LW%: 1.416 (47246) LWI%: 13.197 (440381) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9685) SWI%: 4.182 (139567) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.419 (47365) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10370) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.050 (1671) bned%: 0.000 (0) bneid%: 13.842 (461905) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.730 (24376) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.111 (3712) DIV%: 0.013 (420) FPUN%: 1.501 (50104) FPRSUB%: 4.097 (136707) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.953 (98538) FPGE%: 1.044 (34833) SYNC%: 0.000 (0) NOP%: 9.011 (300685) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 160 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 402 LOAD 37929 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1410 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49611 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 9872 XORI 0 MULI 9747 LW 0 LWI 143068 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 56 DIV 20 FPUN 0 FPRSUB 32 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8418 --Total thread-cycles: 4075168 --total thread-cycles issued: 3036254 (74.506226%) --iCache conflicts: 113890 (2.794731%) --thread*cycles of FU dependence: 252403 (6.193683%) --thread*cycles of data dependence: 173566 (4.259113%) --iCache cycles*banks: 4075168 (81.885483% used) Issue breakdown: --thread*cycles of issue worked: 3036254 (74.506226%) --thread*cycles of issue failed: 738229 (18.115303%) --thread*cycles of issue NOP/other: 4602298175011985037 (112935175192576.000000%) Number of thread-cycles not ready: 173566 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3336939 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 8 3: 7 4: 9 5: 7 6: 7 7: 8 8: 8 9: 8 10: 8 11: 10 12: 8 13: 9 14: 7 15: 7 16: 8 17: 6 18: 7 19: 8 20: 6 21: 7 22: 7 23: 6 24: 9 25: 7 26: 7 27: 8 28: 8 29: 6 30: 8 31: 7 <=== Core 19 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100947 in-flight CPI 1.2705 -- Total Cycles 128274 ---- Thread 01 ---- PC 5: Stalled ----- 101785 in-flight CPI 1.2600 -- Total Cycles 128274 ---- Thread 02 ---- PC 5: Stalled ----- 99604 in-flight CPI 1.2876 -- Total Cycles 128274 ---- Thread 03 ---- PC 5: Stalled ----- 96655 in-flight CPI 1.3269 -- Total Cycles 128274 ---- Thread 04 ---- PC 5: Stalled ----- 100144 in-flight CPI 1.2806 -- Total Cycles 128274 ---- Thread 05 ---- PC 5: Stalled ----- 96007 in-flight CPI 1.3359 -- Total Cycles 128274 ---- Thread 06 ---- PC 5: Stalled ----- 93592 in-flight CPI 1.3703 -- Total Cycles 128274 ---- Thread 07 ---- PC 5: Stalled ----- 104190 in-flight CPI 1.2309 -- Total Cycles 128274 ---- Thread 08 ---- PC 5: Stalled ----- 99763 in-flight CPI 1.2855 -- Total Cycles 128274 ---- Thread 09 ---- PC 5: Stalled ----- 99919 in-flight CPI 1.2836 -- Total Cycles 128274 ---- Thread 10 ---- PC 5: Stalled ----- 96817 in-flight CPI 1.3246 -- Total Cycles 128274 ---- Thread 11 ---- PC 5: Stalled ----- 94905 in-flight CPI 1.3514 -- Total Cycles 128274 ---- Thread 12 ---- PC 5: Stalled ----- 96174 in-flight CPI 1.3335 -- Total Cycles 128274 ---- Thread 13 ---- PC 5: Stalled ----- 93239 in-flight CPI 1.3756 -- Total Cycles 128274 ---- Thread 14 ---- PC 5: Stalled ----- 95739 in-flight CPI 1.3396 -- Total Cycles 128274 ---- Thread 15 ---- PC 5: Stalled ----- 94719 in-flight CPI 1.3540 -- Total Cycles 128274 ---- Thread 16 ---- PC 5: Stalled ----- 96861 in-flight CPI 1.3240 -- Total Cycles 128274 ---- Thread 17 ---- PC 5: Stalled ----- 94240 in-flight CPI 1.3609 -- Total Cycles 128274 ---- Thread 18 ---- PC 5: Stalled ----- 99901 in-flight CPI 1.2838 -- Total Cycles 128274 ---- Thread 19 ---- PC 5: Stalled ----- 96837 in-flight CPI 1.3244 -- Total Cycles 128274 ---- Thread 20 ---- PC 5: Stalled ----- 90459 in-flight CPI 1.4178 -- Total Cycles 128274 ---- Thread 21 ---- PC 5: Stalled ----- 92718 in-flight CPI 1.3832 -- Total Cycles 128274 ---- Thread 22 ---- PC 5: Stalled ----- 91491 in-flight CPI 1.4018 -- Total Cycles 128274 ---- Thread 23 ---- PC 5: Stalled ----- 93213 in-flight CPI 1.3759 -- Total Cycles 128274 ---- Thread 24 ---- PC 5: Stalled ----- 96958 in-flight CPI 1.3228 -- Total Cycles 128274 ---- Thread 25 ---- PC 5: Stalled ----- 87343 in-flight CPI 1.4683 -- Total Cycles 128274 ---- Thread 26 ---- PC 5: Stalled ----- 92759 in-flight CPI 1.3826 -- Total Cycles 128274 ---- Thread 27 ---- PC 5: Stalled ----- 89388 in-flight CPI 1.4348 -- Total Cycles 128274 ---- Thread 28 ---- PC 5: Stalled ----- 89554 in-flight CPI 1.4321 -- Total Cycles 128274 ---- Thread 29 ---- PC 5: Stalled ----- 89950 in-flight CPI 1.4258 -- Total Cycles 128274 ---- Thread 30 ---- PC 5: Stalled ----- 94085 in-flight CPI 1.3631 -- Total Cycles 128274 ---- Thread 31 ---- PC 5: Stalled ----- 92014 in-flight CPI 1.3938 -- Total Cycles 128274 Total CPI 0.0420 , IPC 23.7970 -- Total Cycles 128274 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8179 (4.507926%) FPSUB: 0 (0.000000%) FPMUL: 32627 (17.982651%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 53900 (29.707443%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4365 (2.405807%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74499 (41.060760%) DIV: 7604 (4.191010%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.144404%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3354969 total) ADD%: 7.178 (240834) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.525 (51157) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.572 (19189) FPSUB%: 0.000 (0) FPMUL%: 4.835 (162222) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (618) FPMAX%: 0.018 (618) LOAD%: 5.156 (172994) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (602) FPINV%: 0.000 (0) FPCONV%: 0.019 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.072 (35977) FPLE%: 0.454 (15241) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.788 (93531) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (25258) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.661 (525412) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39337) ORI%: 1.565 (52495) XORI%: 0.000 (0) MULI%: 3.189 (106994) LW%: 1.391 (46665) LWI%: 13.071 (438542) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9536) SWI%: 4.121 (138257) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.395 (46791) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.307 (10302) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1893) bned%: 0.000 (0) bneid%: 13.787 (462559) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.712 (23874) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4244) DIV%: 0.012 (412) FPUN%: 1.471 (49356) FPRSUB%: 4.258 (142852) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (81) FPGT%: 2.939 (98616) FPGE%: 1.017 (34115) SYNC%: 0.000 (0) NOP%: 9.013 (302381) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 12 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 159 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 397 LOAD 40132 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 20 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1129 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49383 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 11681 XORI 0 MULI 9444 LW 0 LWI 143089 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 68 DIV 20 FPUN 0 FPRSUB 58 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7972 --Total thread-cycles: 4104768 --total thread-cycles issued: 3052588 (74.366882%) --iCache conflicts: 115583 (2.815823%) --thread*cycles of FU dependence: 255629 (6.227612%) --thread*cycles of data dependence: 181436 (4.420128%) --iCache cycles*banks: 4104768 (81.734238% used) Issue breakdown: --thread*cycles of issue worked: 3052588 (74.366882%) --thread*cycles of issue failed: 749799 (18.266539%) --thread*cycles of issue NOP/other: 158914289760 (3871456.250000%) Number of thread-cycles not ready: 181436 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3354969 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 9 5: 7 6: 7 7: 9 8: 9 9: 7 10: 8 11: 6 12: 7 13: 6 14: 8 15: 7 16: 8 17: 7 18: 8 19: 7 20: 7 21: 8 22: 7 23: 6 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 8 31: 8 <=== Core 20 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101645 in-flight CPI 1.2698 -- Total Cycles 129094 ---- Thread 01 ---- PC 5: Stalled ----- 96405 in-flight CPI 1.3389 -- Total Cycles 129094 ---- Thread 02 ---- PC 5: Stalled ----- 101254 in-flight CPI 1.2747 -- Total Cycles 129094 ---- Thread 03 ---- PC 5: Stalled ----- 98930 in-flight CPI 1.3046 -- Total Cycles 129094 ---- Thread 04 ---- PC 5: Stalled ----- 101819 in-flight CPI 1.2676 -- Total Cycles 129094 ---- Thread 05 ---- PC 5: Stalled ----- 94741 in-flight CPI 1.3623 -- Total Cycles 129094 ---- Thread 06 ---- PC 5: Stalled ----- 96007 in-flight CPI 1.3444 -- Total Cycles 129094 ---- Thread 07 ---- PC 5: Stalled ----- 104154 in-flight CPI 1.2392 -- Total Cycles 129094 ---- Thread 08 ---- PC 5: Stalled ----- 98131 in-flight CPI 1.3153 -- Total Cycles 129094 ---- Thread 09 ---- PC 5: Stalled ----- 100236 in-flight CPI 1.2876 -- Total Cycles 129094 ---- Thread 10 ---- PC 5: Stalled ----- 93577 in-flight CPI 1.3793 -- Total Cycles 129094 ---- Thread 11 ---- PC 5: Stalled ----- 97532 in-flight CPI 1.3233 -- Total Cycles 129094 ---- Thread 12 ---- PC 5: Stalled ----- 102350 in-flight CPI 1.2611 -- Total Cycles 129094 ---- Thread 13 ---- PC 5: Stalled ----- 96283 in-flight CPI 1.3406 -- Total Cycles 129094 ---- Thread 14 ---- PC 5: Stalled ----- 92701 in-flight CPI 1.3924 -- Total Cycles 129094 ---- Thread 15 ---- PC 5: Stalled ----- 96218 in-flight CPI 1.3414 -- Total Cycles 129094 ---- Thread 16 ---- PC 5: Stalled ----- 100498 in-flight CPI 1.2843 -- Total Cycles 129094 ---- Thread 17 ---- PC 5: Stalled ----- 96489 in-flight CPI 1.3377 -- Total Cycles 129094 ---- Thread 18 ---- PC 5: Stalled ----- 92340 in-flight CPI 1.3978 -- Total Cycles 129094 ---- Thread 19 ---- PC 5: Stalled ----- 98033 in-flight CPI 1.3166 -- Total Cycles 129094 ---- Thread 20 ---- PC 5: Stalled ----- 95934 in-flight CPI 1.3454 -- Total Cycles 129094 ---- Thread 21 ---- PC 5: Stalled ----- 93889 in-flight CPI 1.3748 -- Total Cycles 129094 ---- Thread 22 ---- PC 5: Stalled ----- 93457 in-flight CPI 1.3811 -- Total Cycles 129094 ---- Thread 23 ---- PC 5: Stalled ----- 94491 in-flight CPI 1.3660 -- Total Cycles 129094 ---- Thread 24 ---- PC 5: Stalled ----- 94412 in-flight CPI 1.3671 -- Total Cycles 129094 ---- Thread 25 ---- PC 5: Stalled ----- 92156 in-flight CPI 1.4006 -- Total Cycles 129094 ---- Thread 26 ---- PC 5: Stalled ----- 95008 in-flight CPI 1.3585 -- Total Cycles 129094 ---- Thread 27 ---- PC 5: Stalled ----- 90044 in-flight CPI 1.4335 -- Total Cycles 129094 ---- Thread 28 ---- PC 5: Stalled ----- 91360 in-flight CPI 1.4128 -- Total Cycles 129094 ---- Thread 29 ---- PC 5: Stalled ----- 86104 in-flight CPI 1.4989 -- Total Cycles 129094 ---- Thread 30 ---- PC 5: Stalled ----- 85856 in-flight CPI 1.5034 -- Total Cycles 129094 ---- Thread 31 ---- PC 5: Stalled ----- 88950 in-flight CPI 1.4510 -- Total Cycles 129094 Total CPI 0.0422 , IPC 23.7158 -- Total Cycles 129094 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7666 (4.190285%) FPSUB: 0 (0.000000%) FPMUL: 31513 (17.225208%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 61443 (33.585136%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4288 (2.343848%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70172 (38.356464%) DIV: 7604 (4.156395%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.142664%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3365300 total) ADD%: 7.125 (239783) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.530 (51475) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.537 (18056) FPSUB%: 0.000 (0) FPMUL%: 4.735 (159344) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (618) FPMAX%: 0.018 (618) LOAD%: 5.130 (172645) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (596) FPINV%: 0.000 (0) FPCONV%: 0.019 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.057 (35556) FPLE%: 0.457 (15374) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.819 (94861) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.742 (24973) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.710 (528686) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39651) ORI%: 1.554 (52308) XORI%: 0.000 (0) MULI%: 3.215 (108204) LW%: 1.405 (47299) LWI%: 13.137 (442096) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9700) SWI%: 4.154 (139782) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.409 (47419) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10433) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1817) bned%: 0.000 (0) bneid%: 13.821 (465131) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (24217) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.119 (3989) DIV%: 0.012 (412) FPUN%: 1.483 (49892) FPRSUB%: 4.182 (140738) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (77) FPGT%: 2.952 (99340) FPGE%: 1.026 (34518) SYNC%: 0.000 (0) NOP%: 9.024 (303678) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 148 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 38830 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1370 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49819 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 10860 XORI 0 MULI 9970 LW 0 LWI 144076 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 20 FPUN 0 FPRSUB 50 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7160 --Total thread-cycles: 4131008 --total thread-cycles issued: 3061622 (74.113197%) --iCache conflicts: 115180 (2.788181%) --thread*cycles of FU dependence: 255718 (6.190208%) --thread*cycles of data dependence: 182947 (4.428628%) --iCache cycles*banks: 4131008 (81.465157% used) Issue breakdown: --thread*cycles of issue worked: 3061622 (74.113197%) --thread*cycles of issue failed: 765708 (18.535622%) --thread*cycles of issue NOP/other: 4618584260964819518 (111802847330304.000000%) Number of thread-cycles not ready: 182947 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3365300 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 9 4: 9 5: 8 6: 7 7: 8 8: 8 9: 10 10: 7 11: 8 12: 8 13: 7 14: 6 15: 8 16: 7 17: 7 18: 6 19: 7 20: 8 21: 6 22: 7 23: 7 24: 8 25: 7 26: 8 27: 6 28: 6 29: 8 30: 6 31: 8 <=== Core 21 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98882 in-flight CPI 1.4441 -- Total Cycles 142824 ---- Thread 01 ---- PC 5: Stalled ----- 99533 in-flight CPI 1.4347 -- Total Cycles 142824 ---- Thread 02 ---- PC 5: Stalled ----- 98399 in-flight CPI 1.4512 -- Total Cycles 142824 ---- Thread 03 ---- PC 5: Stalled ----- 95454 in-flight CPI 1.4960 -- Total Cycles 142824 ---- Thread 04 ---- PC 5: Stalled ----- 96370 in-flight CPI 1.4818 -- Total Cycles 142824 ---- Thread 05 ---- PC 5: Stalled ----- 98421 in-flight CPI 1.4509 -- Total Cycles 142824 ---- Thread 06 ---- PC 5: Stalled ----- 101438 in-flight CPI 1.4077 -- Total Cycles 142824 ---- Thread 07 ---- PC 5: Stalled ----- 94699 in-flight CPI 1.5079 -- Total Cycles 142824 ---- Thread 08 ---- PC 5: Stalled ----- 93337 in-flight CPI 1.5299 -- Total Cycles 142824 ---- Thread 09 ---- PC 5: Stalled ----- 97796 in-flight CPI 1.4602 -- Total Cycles 142824 ---- Thread 10 ---- PC 5: Stalled ----- 104472 in-flight CPI 1.3669 -- Total Cycles 142824 ---- Thread 11 ---- PC 5: Stalled ----- 96130 in-flight CPI 1.4854 -- Total Cycles 142824 ---- Thread 12 ---- PC 5: Stalled ----- 100136 in-flight CPI 1.4260 -- Total Cycles 142824 ---- Thread 13 ---- PC 5: Stalled ----- 90886 in-flight CPI 1.5712 -- Total Cycles 142824 ---- Thread 14 ---- PC 5: Stalled ----- 91637 in-flight CPI 1.5584 -- Total Cycles 142824 ---- Thread 15 ---- PC 5: Stalled ----- 90514 in-flight CPI 1.5777 -- Total Cycles 142824 ---- Thread 16 ---- PC 5: Stalled ----- 96996 in-flight CPI 1.4722 -- Total Cycles 142824 ---- Thread 17 ---- PC 5: Stalled ----- 96008 in-flight CPI 1.4873 -- Total Cycles 142824 ---- Thread 18 ---- PC 5: Stalled ----- 94455 in-flight CPI 1.5118 -- Total Cycles 142824 ---- Thread 19 ---- PC 5: Stalled ----- 97834 in-flight CPI 1.4596 -- Total Cycles 142824 ---- Thread 20 ---- PC 5: Stalled ----- 95025 in-flight CPI 1.5028 -- Total Cycles 142824 ---- Thread 21 ---- PC 5: Stalled ----- 93275 in-flight CPI 1.5309 -- Total Cycles 142824 ---- Thread 22 ---- PC 5: Stalled ----- 96353 in-flight CPI 1.4820 -- Total Cycles 142824 ---- Thread 23 ---- PC 5: Stalled ----- 91434 in-flight CPI 1.5618 -- Total Cycles 142824 ---- Thread 24 ---- PC 5: Stalled ----- 91524 in-flight CPI 1.5602 -- Total Cycles 142824 ---- Thread 25 ---- PC 5: Stalled ----- 86967 in-flight CPI 1.6420 -- Total Cycles 142824 ---- Thread 26 ---- PC 5: Stalled ----- 86117 in-flight CPI 1.6582 -- Total Cycles 142824 ---- Thread 27 ---- PC 5: Stalled ----- 93107 in-flight CPI 1.5338 -- Total Cycles 142824 ---- Thread 28 ---- PC 5: Stalled ----- 90849 in-flight CPI 1.5718 -- Total Cycles 142824 ---- Thread 29 ---- PC 5: Stalled ----- 91320 in-flight CPI 1.5637 -- Total Cycles 142824 ---- Thread 30 ---- PC 5: Stalled ----- 85669 in-flight CPI 1.6669 -- Total Cycles 142824 ---- Thread 31 ---- PC 5: Stalled ----- 87097 in-flight CPI 1.6395 -- Total Cycles 142824 Total CPI 0.0473 , IPC 21.1637 -- Total Cycles 142824 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8271 (4.140904%) FPSUB: 0 (0.000000%) FPMUL: 32624 (16.333315%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 72255 (36.174709%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4312 (2.158817%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74642 (37.369766%) DIV: 7379 (3.694321%) FPUN: 0 (0.000000%) FPRSUB: 256 (0.128167%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3322227 total) ADD%: 7.150 (237540) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.534 (50951) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.581 (19308) FPSUB%: 0.000 (0) FPMUL%: 4.862 (161515) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.162 (171500) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (589) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.075 (35728) FPLE%: 0.456 (15163) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.780 (92345) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.755 (25087) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.650 (519927) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.169 (38851) ORI%: 1.585 (52647) XORI%: 0.000 (0) MULI%: 3.181 (105666) LW%: 1.387 (46065) LWI%: 13.044 (433367) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9422) SWI%: 4.103 (136312) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.390 (46190) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.307 (10201) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1968) bned%: 0.000 (0) bneid%: 13.786 (458007) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23850) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4257) DIV%: 0.012 (400) FPUN%: 1.480 (49158) FPRSUB%: 4.278 (142136) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (77) FPGT%: 2.930 (97352) FPGE%: 1.023 (33995) SYNC%: 0.000 (0) NOP%: 9.015 (299493) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 46 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 154 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 395 LOAD 40111 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1473 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48719 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 4 ORI 11852 XORI 0 MULI 9133 LW 0 LWI 141321 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 73 DIV 22 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.1639 --Total thread-cycles: 4570368 --total thread-cycles issued: 3022734 (66.137650%) --iCache conflicts: 112453 (2.460480%) --thread*cycles of FU dependence: 253411 (5.544652%) --thread*cycles of data dependence: 199739 (4.370305%) --iCache cycles*banks: 4570368 (72.691277% used) Issue breakdown: --thread*cycles of issue worked: 3022734 (66.137650%) --thread*cycles of issue failed: 1248141 (27.309422%) --thread*cycles of issue NOP/other: 4601448759866724837 (100680048050176.000000%) Number of thread-cycles not ready: 199739 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3322227 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 8 4: 7 5: 8 6: 8 7: 7 8: 8 9: 7 10: 6 11: 8 12: 8 13: 6 14: 6 15: 6 16: 8 17: 8 18: 8 19: 7 20: 7 21: 8 22: 8 23: 7 24: 7 25: 7 26: 7 27: 6 28: 7 29: 8 30: 6 31: 7 <=== Core 22 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94996 in-flight CPI 1.6333 -- Total Cycles 155186 ---- Thread 01 ---- PC 5: Stalled ----- 95299 in-flight CPI 1.6281 -- Total Cycles 155186 ---- Thread 02 ---- PC 5: Stalled ----- 93657 in-flight CPI 1.6566 -- Total Cycles 155186 ---- Thread 03 ---- PC 5: Stalled ----- 101817 in-flight CPI 1.5238 -- Total Cycles 155186 ---- Thread 04 ---- PC 5: Stalled ----- 97071 in-flight CPI 1.5984 -- Total Cycles 155186 ---- Thread 05 ---- PC 5: Stalled ----- 100010 in-flight CPI 1.5514 -- Total Cycles 155186 ---- Thread 06 ---- PC 5: Stalled ----- 96677 in-flight CPI 1.6048 -- Total Cycles 155186 ---- Thread 07 ---- PC 5: Stalled ----- 98670 in-flight CPI 1.5725 -- Total Cycles 155186 ---- Thread 08 ---- PC 5: Stalled ----- 96724 in-flight CPI 1.6041 -- Total Cycles 155186 ---- Thread 09 ---- PC 5: Stalled ----- 117792 in-flight CPI 1.3173 -- Total Cycles 155186 ---- Thread 10 ---- PC 5: Stalled ----- 95993 in-flight CPI 1.6164 -- Total Cycles 155186 ---- Thread 11 ---- PC 5: Stalled ----- 97665 in-flight CPI 1.5887 -- Total Cycles 155186 ---- Thread 12 ---- PC 5: Stalled ----- 101325 in-flight CPI 1.5313 -- Total Cycles 155186 ---- Thread 13 ---- PC 5: Stalled ----- 97233 in-flight CPI 1.5957 -- Total Cycles 155186 ---- Thread 14 ---- PC 5: Stalled ----- 99147 in-flight CPI 1.5649 -- Total Cycles 155186 ---- Thread 15 ---- PC 5: Stalled ----- 97147 in-flight CPI 1.5971 -- Total Cycles 155186 ---- Thread 16 ---- PC 5: Stalled ----- 94385 in-flight CPI 1.6438 -- Total Cycles 155186 ---- Thread 17 ---- PC 5: Stalled ----- 96693 in-flight CPI 1.6046 -- Total Cycles 155186 ---- Thread 18 ---- PC 5: Stalled ----- 91800 in-flight CPI 1.6902 -- Total Cycles 155186 ---- Thread 19 ---- PC 5: Stalled ----- 97997 in-flight CPI 1.5833 -- Total Cycles 155186 ---- Thread 20 ---- PC 5: Stalled ----- 90302 in-flight CPI 1.7182 -- Total Cycles 155186 ---- Thread 21 ---- PC 5: Stalled ----- 91084 in-flight CPI 1.7034 -- Total Cycles 155186 ---- Thread 22 ---- PC 5: Stalled ----- 89895 in-flight CPI 1.7260 -- Total Cycles 155186 ---- Thread 23 ---- PC 5: Stalled ----- 89073 in-flight CPI 1.7419 -- Total Cycles 155186 ---- Thread 24 ---- PC 5: Stalled ----- 89706 in-flight CPI 1.7297 -- Total Cycles 155186 ---- Thread 25 ---- PC 5: Stalled ----- 93015 in-flight CPI 1.6680 -- Total Cycles 155186 ---- Thread 26 ---- PC 5: Stalled ----- 91767 in-flight CPI 1.6908 -- Total Cycles 155186 ---- Thread 27 ---- PC 5: Stalled ----- 86159 in-flight CPI 1.8008 -- Total Cycles 155186 ---- Thread 28 ---- PC 5: Stalled ----- 92050 in-flight CPI 1.6855 -- Total Cycles 155186 ---- Thread 29 ---- PC 5: Stalled ----- 90205 in-flight CPI 1.7200 -- Total Cycles 155186 ---- Thread 30 ---- PC 5: Stalled ----- 92522 in-flight CPI 1.6770 -- Total Cycles 155186 ---- Thread 31 ---- PC 5: Stalled ----- 87444 in-flight CPI 1.7744 -- Total Cycles 155186 Total CPI 0.0509 , IPC 19.6274 -- Total Cycles 155186 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7871 (4.100248%) FPSUB: 0 (0.000000%) FPMUL: 31787 (16.558834%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 67682 (35.257652%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4168 (2.171240%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72430 (37.731033%) DIV: 7760 (4.042425%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.138568%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3347911 total) ADD%: 7.237 (242295) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.523 (50995) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (18395) FPSUB%: 0.000 (0) FPMUL%: 4.768 (159615) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.141 (172106) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (596) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35615) FPLE%: 0.454 (15207) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.801 (93760) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.740 (24791) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.663 (524368) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.168 (39096) ORI%: 1.564 (52346) XORI%: 0.000 (0) MULI%: 3.202 (107192) LW%: 1.395 (46712) LWI%: 13.099 (438545) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9628) SWI%: 4.135 (138436) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46824) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10394) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2061) bned%: 0.000 (0) bneid%: 13.787 (461560) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (24039) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4119) DIV%: 0.013 (420) FPUN%: 1.479 (49529) FPRSUB%: 4.213 (141049) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.944 (98550) FPGE%: 1.025 (34322) SYNC%: 0.000 (0) NOP%: 9.019 (301961) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 159 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 408 LOAD 39391 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1437 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49409 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 6 ORI 11150 XORI 0 MULI 9327 LW 0 LWI 143035 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 26 FPUN 0 FPRSUB 50 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 19.6276 --Total thread-cycles: 4965952 --total thread-cycles issued: 3045950 (61.336678%) --iCache conflicts: 112655 (2.268548%) --thread*cycles of FU dependence: 254562 (5.126147%) --thread*cycles of data dependence: 191964 (3.865603%) --iCache cycles*banks: 4965952 (67.417946% used) Issue breakdown: --thread*cycles of issue worked: 3045950 (61.336678%) --thread*cycles of issue failed: 1618041 (32.582695%) --thread*cycles of issue NOP/other: 158914309021 (3200077.500000%) Number of thread-cycles not ready: 191964 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3347911 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 10 4: 7 5: 9 6: 9 7: 7 8: 8 9: 6 10: 7 11: 7 12: 8 13: 8 14: 8 15: 8 16: 8 17: 8 18: 7 19: 8 20: 7 21: 8 22: 6 23: 7 24: 6 25: 8 26: 7 27: 7 28: 8 29: 8 30: 7 31: 7 <=== Core 23 ===> ---- Thread 00 ---- PC 5: Stalled ----- 89987 in-flight CPI 1.5352 -- Total Cycles 138161 ---- Thread 01 ---- PC 5: Stalled ----- 96280 in-flight CPI 1.4347 -- Total Cycles 138161 ---- Thread 02 ---- PC 5: Stalled ----- 96197 in-flight CPI 1.4359 -- Total Cycles 138161 ---- Thread 03 ---- PC 5: Stalled ----- 93924 in-flight CPI 1.4707 -- Total Cycles 138161 ---- Thread 04 ---- PC 5: Stalled ----- 99495 in-flight CPI 1.3884 -- Total Cycles 138161 ---- Thread 05 ---- PC 5: Stalled ----- 99575 in-flight CPI 1.3872 -- Total Cycles 138161 ---- Thread 06 ---- PC 5: Stalled ----- 93141 in-flight CPI 1.4831 -- Total Cycles 138161 ---- Thread 07 ---- PC 5: Stalled ----- 97475 in-flight CPI 1.4172 -- Total Cycles 138161 ---- Thread 08 ---- PC 5: Stalled ----- 89682 in-flight CPI 1.5403 -- Total Cycles 138161 ---- Thread 09 ---- PC 5: Stalled ----- 97960 in-flight CPI 1.4102 -- Total Cycles 138161 ---- Thread 10 ---- PC 5: Stalled ----- 96799 in-flight CPI 1.4270 -- Total Cycles 138161 ---- Thread 11 ---- PC 5: Stalled ----- 95788 in-flight CPI 1.4421 -- Total Cycles 138161 ---- Thread 12 ---- PC 5: Stalled ----- 101097 in-flight CPI 1.3664 -- Total Cycles 138161 ---- Thread 13 ---- PC 5: Stalled ----- 93679 in-flight CPI 1.4745 -- Total Cycles 138161 ---- Thread 14 ---- PC 5: Stalled ----- 99591 in-flight CPI 1.3870 -- Total Cycles 138161 ---- Thread 15 ---- PC 5: Stalled ----- 92473 in-flight CPI 1.4939 -- Total Cycles 138161 ---- Thread 16 ---- PC 5: Stalled ----- 91382 in-flight CPI 1.5116 -- Total Cycles 138161 ---- Thread 17 ---- PC 5: Stalled ----- 101204 in-flight CPI 1.3650 -- Total Cycles 138161 ---- Thread 18 ---- PC 5: Stalled ----- 94727 in-flight CPI 1.4583 -- Total Cycles 138161 ---- Thread 19 ---- PC 5: Stalled ----- 97246 in-flight CPI 1.4205 -- Total Cycles 138161 ---- Thread 20 ---- PC 5: Stalled ----- 91714 in-flight CPI 1.5062 -- Total Cycles 138161 ---- Thread 21 ---- PC 5: Stalled ----- 89442 in-flight CPI 1.5445 -- Total Cycles 138161 ---- Thread 22 ---- PC 5: Stalled ----- 92247 in-flight CPI 1.4975 -- Total Cycles 138161 ---- Thread 23 ---- PC 5: Stalled ----- 88454 in-flight CPI 1.5617 -- Total Cycles 138161 ---- Thread 24 ---- PC 5: Stalled ----- 89311 in-flight CPI 1.5467 -- Total Cycles 138161 ---- Thread 25 ---- PC 5: Stalled ----- 91842 in-flight CPI 1.5041 -- Total Cycles 138161 ---- Thread 26 ---- PC 5: Stalled ----- 92154 in-flight CPI 1.4990 -- Total Cycles 138161 ---- Thread 27 ---- PC 5: Stalled ----- 90262 in-flight CPI 1.5304 -- Total Cycles 138161 ---- Thread 28 ---- PC 5: Stalled ----- 89722 in-flight CPI 1.5395 -- Total Cycles 138161 ---- Thread 29 ---- PC 5: Stalled ----- 90432 in-flight CPI 1.5275 -- Total Cycles 138161 ---- Thread 30 ---- PC 5: Stalled ----- 86344 in-flight CPI 1.5999 -- Total Cycles 138161 ---- Thread 31 ---- PC 5: Stalled ----- 90701 in-flight CPI 1.5229 -- Total Cycles 138161 Total CPI 0.0460 , IPC 21.7199 -- Total Cycles 138161 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8220 (3.693569%) FPSUB: 0 (0.000000%) FPMUL: 32065 (14.408062%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 97352 (43.744076%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3712 (1.667947%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73897 (33.204823%) DIV: 7055 (3.170088%) FPUN: 0 (0.000000%) FPRSUB: 248 (0.111436%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3298148 total) ADD%: 7.125 (234995) SUB%: 0.000 (0) MUL%: 0.006 (191) BITOR%: 1.526 (50321) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.575 (18957) FPSUB%: 0.000 (0) FPMUL%: 4.849 (159919) FPCMPLT%: 0.000 (0) FPMIN%: 0.017 (573) FPMAX%: 0.017 (573) LOAD%: 5.206 (171691) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (223) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.016 (544) FPINV%: 0.000 (0) FPCONV%: 0.018 (605) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.069 (35260) FPLE%: 0.459 (15125) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.017 (573) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.797 (92246) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (24827) CMPU%: 0.000 (0) RSUB%: 0.006 (191) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.686 (517349) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (38598) ORI%: 1.585 (52262) XORI%: 0.000 (0) MULI%: 3.180 (104880) LW%: 1.393 (45953) LWI%: 13.017 (429315) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9471) SWI%: 4.111 (135603) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.396 (46051) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10251) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.066 (2177) bned%: 0.000 (0) bneid%: 13.760 (453828) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (23754) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4219) DIV%: 0.012 (382) FPUN%: 1.478 (48736) FPRSUB%: 4.279 (141135) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (59) FPGT%: 2.922 (96356) FPGE%: 1.019 (33611) SYNC%: 0.000 (0) NOP%: 9.013 (297248) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 32 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 142 FPSUB 0 FPMUL 7 FPCMPLT 0 FPMIN 0 FPMAX 372 LOAD 40462 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 19 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1600 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48285 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 11637 XORI 0 MULI 8818 LW 0 LWI 140086 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 89 DIV 19 FPUN 0 FPRSUB 53 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7202 --Total thread-cycles: 4421152 --total thread-cycles issued: 3000900 (67.875977%) --iCache conflicts: 110226 (2.493151%) --thread*cycles of FU dependence: 251673 (5.692476%) --thread*cycles of data dependence: 222549 (5.033733%) --iCache cycles*banks: 4421152 (74.600014% used) Issue breakdown: --thread*cycles of issue worked: 3000900 (67.875977%) --thread*cycles of issue failed: 1123004 (25.400711%) --thread*cycles of issue NOP/other: 4618884290737113376 (104472420286464.000000%) Number of thread-cycles not ready: 222549 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3298148 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 5 1: 7 2: 8 3: 8 4: 8 5: 8 6: 7 7: 7 8: 7 9: 5 10: 8 11: 8 12: 8 13: 8 14: 8 15: 6 16: 8 17: 5 18: 6 19: 8 20: 6 21: 6 22: 7 23: 7 24: 6 25: 7 26: 7 27: 6 28: 8 29: 7 30: 5 31: 8 <=== Core 24 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96292 in-flight CPI 1.6969 -- Total Cycles 163426 ---- Thread 01 ---- PC 5: Stalled ----- 103221 in-flight CPI 1.5829 -- Total Cycles 163426 ---- Thread 02 ---- PC 5: Stalled ----- 98415 in-flight CPI 1.6603 -- Total Cycles 163426 ---- Thread 03 ---- PC 5: Stalled ----- 97691 in-flight CPI 1.6725 -- Total Cycles 163426 ---- Thread 04 ---- PC 5: Stalled ----- 95862 in-flight CPI 1.7046 -- Total Cycles 163426 ---- Thread 05 ---- PC 5: Stalled ----- 97432 in-flight CPI 1.6769 -- Total Cycles 163426 ---- Thread 06 ---- PC 5: Stalled ----- 97463 in-flight CPI 1.6765 -- Total Cycles 163426 ---- Thread 07 ---- PC 5: Stalled ----- 94184 in-flight CPI 1.7348 -- Total Cycles 163426 ---- Thread 08 ---- PC 5: Stalled ----- 96475 in-flight CPI 1.6937 -- Total Cycles 163426 ---- Thread 09 ---- PC 5: Stalled ----- 96070 in-flight CPI 1.7008 -- Total Cycles 163426 ---- Thread 10 ---- PC 5: Stalled ----- 98556 in-flight CPI 1.6578 -- Total Cycles 163426 ---- Thread 11 ---- PC 5: Stalled ----- 95226 in-flight CPI 1.7158 -- Total Cycles 163426 ---- Thread 12 ---- PC 5: Stalled ----- 96506 in-flight CPI 1.6931 -- Total Cycles 163426 ---- Thread 13 ---- PC 5: Stalled ----- 96009 in-flight CPI 1.7019 -- Total Cycles 163426 ---- Thread 14 ---- PC 5: Stalled ----- 96636 in-flight CPI 1.6908 -- Total Cycles 163426 ---- Thread 15 ---- PC 5: Stalled ----- 95040 in-flight CPI 1.7193 -- Total Cycles 163426 ---- Thread 16 ---- PC 5: Stalled ----- 96405 in-flight CPI 1.6950 -- Total Cycles 163426 ---- Thread 17 ---- PC 5: Stalled ----- 120203 in-flight CPI 1.3594 -- Total Cycles 163426 ---- Thread 18 ---- PC 5: Stalled ----- 100536 in-flight CPI 1.6252 -- Total Cycles 163426 ---- Thread 19 ---- PC 5: Stalled ----- 98501 in-flight CPI 1.6588 -- Total Cycles 163426 ---- Thread 20 ---- PC 5: Stalled ----- 92802 in-flight CPI 1.7607 -- Total Cycles 163426 ---- Thread 21 ---- PC 5: Stalled ----- 93569 in-flight CPI 1.7462 -- Total Cycles 163426 ---- Thread 22 ---- PC 5: Stalled ----- 89655 in-flight CPI 1.8225 -- Total Cycles 163426 ---- Thread 23 ---- PC 5: Stalled ----- 92074 in-flight CPI 1.7746 -- Total Cycles 163426 ---- Thread 24 ---- PC 5: Stalled ----- 91494 in-flight CPI 1.7859 -- Total Cycles 163426 ---- Thread 25 ---- PC 5: Stalled ----- 92581 in-flight CPI 1.7649 -- Total Cycles 163426 ---- Thread 26 ---- PC 5: Stalled ----- 95684 in-flight CPI 1.7077 -- Total Cycles 163426 ---- Thread 27 ---- PC 5: Stalled ----- 87640 in-flight CPI 1.8645 -- Total Cycles 163426 ---- Thread 28 ---- PC 5: Stalled ----- 87013 in-flight CPI 1.8780 -- Total Cycles 163426 ---- Thread 29 ---- PC 5: Stalled ----- 90294 in-flight CPI 1.8096 -- Total Cycles 163426 ---- Thread 30 ---- PC 5: Stalled ----- 92813 in-flight CPI 1.7604 -- Total Cycles 163426 ---- Thread 31 ---- PC 5: Stalled ----- 89245 in-flight CPI 1.8309 -- Total Cycles 163426 Total CPI 0.0534 , IPC 18.7372 -- Total Cycles 163426 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8094 (3.920771%) FPSUB: 0 (0.000000%) FPMUL: 32478 (15.732492%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 80707 (39.094841%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4001 (1.938103%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73473 (35.590656%) DIV: 7431 (3.599611%) FPUN: 0 (0.000000%) FPRSUB: 255 (0.123523%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3365820 total) ADD%: 7.161 (241010) SUB%: 0.000 (0) MUL%: 0.006 (201) BITOR%: 1.527 (51411) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.563 (18962) FPSUB%: 0.000 (0) FPMUL%: 4.814 (162042) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (603) FPMAX%: 0.018 (603) LOAD%: 5.160 (173692) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (574) FPINV%: 0.000 (0) FPCONV%: 0.019 (635) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35956) FPLE%: 0.457 (15367) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (603) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.795 (94081) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (25258) CMPU%: 0.000 (0) RSUB%: 0.006 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.680 (527774) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39451) ORI%: 1.571 (52878) XORI%: 0.000 (0) MULI%: 3.192 (107440) LW%: 1.393 (46885) LWI%: 13.066 (439768) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9644) SWI%: 4.115 (138502) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.396 (46993) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10436) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1985) bned%: 0.000 (0) bneid%: 13.794 (464277) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (24108) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4189) DIV%: 0.012 (402) FPUN%: 1.477 (49713) FPRSUB%: 4.245 (142871) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (63) FPGT%: 2.939 (98937) FPGE%: 1.020 (34346) SYNC%: 0.000 (0) NOP%: 9.021 (303630) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 154 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 391 LOAD 39749 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1039 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49489 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 19 ORI 11536 XORI 0 MULI 9535 LW 0 LWI 143517 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 72 DIV 20 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 18.7374 --Total thread-cycles: 5229632 --total thread-cycles issued: 3062190 (58.554596%) --iCache conflicts: 113639 (2.172983%) --thread*cycles of FU dependence: 255651 (4.888508%) --thread*cycles of data dependence: 206439 (3.947486%) --iCache cycles*banks: 5229632 (64.361160% used) Issue breakdown: --thread*cycles of issue worked: 3062190 (58.554596%) --thread*cycles of issue failed: 1863812 (35.639446%) --thread*cycles of issue NOP/other: 4603840387709444622 (88033734950912.000000%) Number of thread-cycles not ready: 206439 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3365820 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 7 3: 9 4: 6 5: 9 6: 7 7: 8 8: 7 9: 8 10: 9 11: 8 12: 7 13: 7 14: 9 15: 6 16: 6 17: 6 18: 8 19: 8 20: 7 21: 8 22: 7 23: 7 24: 7 25: 7 26: 7 27: 6 28: 5 29: 7 30: 8 31: 6 <=== Core 25 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100709 in-flight CPI 1.2512 -- Total Cycles 126033 ---- Thread 01 ---- PC 5: Stalled ----- 98519 in-flight CPI 1.2790 -- Total Cycles 126033 ---- Thread 02 ---- PC 5: Stalled ----- 96926 in-flight CPI 1.3000 -- Total Cycles 126033 ---- Thread 03 ---- PC 5: Stalled ----- 93510 in-flight CPI 1.3475 -- Total Cycles 126033 ---- Thread 04 ---- PC 5: Stalled ----- 93986 in-flight CPI 1.3407 -- Total Cycles 126033 ---- Thread 05 ---- PC 5: Stalled ----- 96287 in-flight CPI 1.3087 -- Total Cycles 126033 ---- Thread 06 ---- PC 5: Stalled ----- 98069 in-flight CPI 1.2849 -- Total Cycles 126033 ---- Thread 07 ---- PC 5: Stalled ----- 96218 in-flight CPI 1.3096 -- Total Cycles 126033 ---- Thread 08 ---- PC 5: Stalled ----- 91740 in-flight CPI 1.3736 -- Total Cycles 126033 ---- Thread 09 ---- PC 5: Stalled ----- 98850 in-flight CPI 1.2747 -- Total Cycles 126033 ---- Thread 10 ---- PC 5: Stalled ----- 99571 in-flight CPI 1.2655 -- Total Cycles 126033 ---- Thread 11 ---- PC 5: Stalled ----- 99325 in-flight CPI 1.2686 -- Total Cycles 126033 ---- Thread 12 ---- PC 5: Stalled ----- 99029 in-flight CPI 1.2724 -- Total Cycles 126033 ---- Thread 13 ---- PC 5: Stalled ----- 98234 in-flight CPI 1.2827 -- Total Cycles 126033 ---- Thread 14 ---- PC 5: Stalled ----- 95542 in-flight CPI 1.3189 -- Total Cycles 126033 ---- Thread 15 ---- PC 5: Stalled ----- 97139 in-flight CPI 1.2972 -- Total Cycles 126033 ---- Thread 16 ---- PC 5: Stalled ----- 90187 in-flight CPI 1.3972 -- Total Cycles 126033 ---- Thread 17 ---- PC 5: Stalled ----- 95480 in-flight CPI 1.3198 -- Total Cycles 126033 ---- Thread 18 ---- PC 5: Stalled ----- 95436 in-flight CPI 1.3203 -- Total Cycles 126033 ---- Thread 19 ---- PC 5: Stalled ----- 96563 in-flight CPI 1.3049 -- Total Cycles 126033 ---- Thread 20 ---- PC 5: Stalled ----- 87539 in-flight CPI 1.4395 -- Total Cycles 126033 ---- Thread 21 ---- PC 5: Stalled ----- 97053 in-flight CPI 1.2983 -- Total Cycles 126033 ---- Thread 22 ---- PC 5: Stalled ----- 95015 in-flight CPI 1.3262 -- Total Cycles 126033 ---- Thread 23 ---- PC 5: Stalled ----- 95602 in-flight CPI 1.3181 -- Total Cycles 126033 ---- Thread 24 ---- PC 5: Stalled ----- 87071 in-flight CPI 1.4472 -- Total Cycles 126033 ---- Thread 25 ---- PC 5: Stalled ----- 87640 in-flight CPI 1.4379 -- Total Cycles 126033 ---- Thread 26 ---- PC 5: Stalled ----- 93214 in-flight CPI 1.3518 -- Total Cycles 126033 ---- Thread 27 ---- PC 5: Stalled ----- 92172 in-flight CPI 1.3671 -- Total Cycles 126033 ---- Thread 28 ---- PC 5: Stalled ----- 90729 in-flight CPI 1.3888 -- Total Cycles 126033 ---- Thread 29 ---- PC 5: Stalled ----- 91715 in-flight CPI 1.3739 -- Total Cycles 126033 ---- Thread 30 ---- PC 5: Stalled ----- 89060 in-flight CPI 1.4149 -- Total Cycles 126033 ---- Thread 31 ---- PC 5: Stalled ----- 87113 in-flight CPI 1.4466 -- Total Cycles 126033 Total CPI 0.0417 , IPC 24.0081 -- Total Cycles 126033 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8035 (4.272982%) FPSUB: 0 (0.000000%) FPMUL: 32195 (17.121176%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 62492 (33.233002%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4340 (2.307995%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72961 (38.800373%) DIV: 7759 (4.126205%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.138267%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3326218 total) ADD%: 7.190 (239149) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.528 (50824) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.569 (18916) FPSUB%: 0.000 (0) FPMUL%: 4.819 (160287) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.127 (170528) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (606) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.072 (35642) FPLE%: 0.452 (15029) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.789 (92760) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.742 (24681) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.640 (520235) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.166 (38770) ORI%: 1.586 (52743) XORI%: 0.000 (0) MULI%: 3.192 (106176) LW%: 1.391 (46278) LWI%: 13.078 (434989) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9462) SWI%: 4.124 (137181) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.395 (46400) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.307 (10210) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (2011) bned%: 0.000 (0) bneid%: 13.799 (458976) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23868) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4151) DIV%: 0.013 (420) FPUN%: 1.482 (49284) FPRSUB%: 4.232 (140763) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.943 (97902) FPGE%: 1.030 (34255) SYNC%: 0.000 (0) NOP%: 9.030 (300345) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 13 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 157 FPSUB 0 FPMUL 4 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 39177 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1311 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48964 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 11471 XORI 0 MULI 9342 LW 0 LWI 141809 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 71 DIV 23 FPUN 0 FPRSUB 44 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.0084 --Total thread-cycles: 4033056 --total thread-cycles issued: 3025873 (75.026802%) --iCache conflicts: 112768 (2.796093%) --thread*cycles of FU dependence: 252857 (6.269613%) --thread*cycles of data dependence: 188042 (4.662519%) --iCache cycles*banks: 4033056 (82.474678% used) Issue breakdown: --thread*cycles of issue worked: 3025873 (75.026802%) --thread*cycles of issue failed: 706838 (17.526114%) --thread*cycles of issue NOP/other: 416505 (10.327280%) Number of thread-cycles not ready: 188042 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3326218 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 8 3: 8 4: 7 5: 8 6: 7 7: 9 8: 6 9: 9 10: 8 11: 8 12: 9 13: 8 14: 7 15: 8 16: 7 17: 7 18: 8 19: 9 20: 6 21: 8 22: 7 23: 7 24: 7 25: 6 26: 8 27: 7 28: 8 29: 8 30: 6 31: 6 <=== Core 26 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97727 in-flight CPI 1.3049 -- Total Cycles 127541 ---- Thread 01 ---- PC 5: Stalled ----- 97917 in-flight CPI 1.3023 -- Total Cycles 127541 ---- Thread 02 ---- PC 5: Stalled ----- 95743 in-flight CPI 1.3319 -- Total Cycles 127541 ---- Thread 03 ---- PC 5: Stalled ----- 98185 in-flight CPI 1.2988 -- Total Cycles 127541 ---- Thread 04 ---- PC 5: Stalled ----- 95562 in-flight CPI 1.3345 -- Total Cycles 127541 ---- Thread 05 ---- PC 5: Stalled ----- 96917 in-flight CPI 1.3158 -- Total Cycles 127541 ---- Thread 06 ---- PC 5: Stalled ----- 97637 in-flight CPI 1.3060 -- Total Cycles 127541 ---- Thread 07 ---- PC 5: Stalled ----- 96160 in-flight CPI 1.3261 -- Total Cycles 127541 ---- Thread 08 ---- PC 5: Stalled ----- 94670 in-flight CPI 1.3470 -- Total Cycles 127541 ---- Thread 09 ---- PC 5: Stalled ----- 97583 in-flight CPI 1.3068 -- Total Cycles 127541 ---- Thread 10 ---- PC 5: Stalled ----- 94917 in-flight CPI 1.3434 -- Total Cycles 127541 ---- Thread 11 ---- PC 5: Stalled ----- 98778 in-flight CPI 1.2910 -- Total Cycles 127541 ---- Thread 12 ---- PC 5: Stalled ----- 95714 in-flight CPI 1.3323 -- Total Cycles 127541 ---- Thread 13 ---- PC 5: Stalled ----- 100273 in-flight CPI 1.2717 -- Total Cycles 127541 ---- Thread 14 ---- PC 5: Stalled ----- 99219 in-flight CPI 1.2852 -- Total Cycles 127541 ---- Thread 15 ---- PC 5: Stalled ----- 94382 in-flight CPI 1.3510 -- Total Cycles 127541 ---- Thread 16 ---- PC 5: Stalled ----- 92150 in-flight CPI 1.3838 -- Total Cycles 127541 ---- Thread 17 ---- PC 5: Stalled ----- 96822 in-flight CPI 1.3170 -- Total Cycles 127541 ---- Thread 18 ---- PC 5: Stalled ----- 93283 in-flight CPI 1.3669 -- Total Cycles 127541 ---- Thread 19 ---- PC 5: Stalled ----- 96033 in-flight CPI 1.3278 -- Total Cycles 127541 ---- Thread 20 ---- PC 5: Stalled ----- 90170 in-flight CPI 1.4141 -- Total Cycles 127541 ---- Thread 21 ---- PC 5: Stalled ----- 96618 in-flight CPI 1.3198 -- Total Cycles 127541 ---- Thread 22 ---- PC 5: Stalled ----- 95361 in-flight CPI 1.3372 -- Total Cycles 127541 ---- Thread 23 ---- PC 5: Stalled ----- 93036 in-flight CPI 1.3706 -- Total Cycles 127541 ---- Thread 24 ---- PC 5: Stalled ----- 87819 in-flight CPI 1.4521 -- Total Cycles 127541 ---- Thread 25 ---- PC 5: Stalled ----- 88659 in-flight CPI 1.4383 -- Total Cycles 127541 ---- Thread 26 ---- PC 5: Stalled ----- 95937 in-flight CPI 1.3292 -- Total Cycles 127541 ---- Thread 27 ---- PC 5: Stalled ----- 95368 in-flight CPI 1.3371 -- Total Cycles 127541 ---- Thread 28 ---- PC 5: Stalled ----- 90295 in-flight CPI 1.4122 -- Total Cycles 127541 ---- Thread 29 ---- PC 5: Stalled ----- 88989 in-flight CPI 1.4329 -- Total Cycles 127541 ---- Thread 30 ---- PC 5: Stalled ----- 87572 in-flight CPI 1.4562 -- Total Cycles 127541 ---- Thread 31 ---- PC 5: Stalled ----- 85810 in-flight CPI 1.4860 -- Total Cycles 127541 Total CPI 0.0422 , IPC 23.7247 -- Total Cycles 127541 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7518 (4.014482%) FPSUB: 0 (0.000000%) FPMUL: 31082 (16.597248%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 67734 (36.168781%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4279 (2.284912%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68752 (36.712376%) DIV: 7642 (4.080695%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.141505%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3326017 total) ADD%: 7.174 (238599) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.523 (50662) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.531 (17674) FPSUB%: 0.000 (0) FPMUL%: 4.718 (156929) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.128 (170564) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (597) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.054 (35047) FPLE%: 0.457 (15210) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.824 (93917) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.738 (24542) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.708 (522459) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39112) ORI%: 1.547 (51465) XORI%: 0.000 (0) MULI%: 3.219 (107052) LW%: 1.408 (46823) LWI%: 13.142 (437090) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9610) SWI%: 4.163 (138469) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.411 (46942) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10349) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1893) bned%: 0.000 (0) bneid%: 13.809 (459301) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (23969) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.117 (3905) DIV%: 0.012 (414) FPUN%: 1.480 (49241) FPRSUB%: 4.164 (138497) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.953 (98225) FPGE%: 1.023 (34031) SYNC%: 0.000 (0) NOP%: 9.023 (300090) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 34 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 157 FPSUB 0 FPMUL 5 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 39368 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 19 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1606 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 14 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49181 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 5 ORI 10668 XORI 0 MULI 9941 LW 0 LWI 142231 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 61 DIV 20 FPUN 0 FPRSUB 44 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7249 --Total thread-cycles: 4081312 --total thread-cycles issued: 3025927 (74.141037%) --iCache conflicts: 113090 (2.770923%) --thread*cycles of FU dependence: 253777 (6.218025%) --thread*cycles of data dependence: 187272 (4.588525%) --iCache cycles*banks: 4081312 (81.494606% used) Issue breakdown: --thread*cycles of issue worked: 3025927 (74.141037%) --thread*cycles of issue failed: 755295 (18.506182%) --thread*cycles of issue NOP/other: 4598300095182836794 (112667201110016.000000%) Number of thread-cycles not ready: 187272 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3326017 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 7 4: 6 5: 6 6: 8 7: 8 8: 7 9: 7 10: 8 11: 7 12: 8 13: 9 14: 7 15: 9 16: 7 17: 8 18: 9 19: 8 20: 8 21: 8 22: 7 23: 7 24: 6 25: 7 26: 8 27: 8 28: 7 29: 8 30: 6 31: 8 <=== Core 27 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100265 in-flight CPI 1.6316 -- Total Cycles 163619 ---- Thread 01 ---- PC 5: Stalled ----- 99578 in-flight CPI 1.6429 -- Total Cycles 163619 ---- Thread 02 ---- PC 5: Stalled ----- 97572 in-flight CPI 1.6766 -- Total Cycles 163619 ---- Thread 03 ---- PC 5: Stalled ----- 103928 in-flight CPI 1.5740 -- Total Cycles 163619 ---- Thread 04 ---- PC 5: Stalled ----- 91818 in-flight CPI 1.7817 -- Total Cycles 163619 ---- Thread 05 ---- PC 5: Stalled ----- 96137 in-flight CPI 1.7016 -- Total Cycles 163619 ---- Thread 06 ---- PC 5: Stalled ----- 94386 in-flight CPI 1.7333 -- Total Cycles 163619 ---- Thread 07 ---- PC 5: Stalled ----- 95577 in-flight CPI 1.7116 -- Total Cycles 163619 ---- Thread 08 ---- PC 5: Stalled ----- 103044 in-flight CPI 1.5876 -- Total Cycles 163619 ---- Thread 09 ---- PC 5: Stalled ----- 96371 in-flight CPI 1.6975 -- Total Cycles 163619 ---- Thread 10 ---- PC 5: Stalled ----- 96455 in-flight CPI 1.6960 -- Total Cycles 163619 ---- Thread 11 ---- PC 5: Stalled ----- 95607 in-flight CPI 1.7111 -- Total Cycles 163619 ---- Thread 12 ---- PC 5: Stalled ----- 120746 in-flight CPI 1.3549 -- Total Cycles 163619 ---- Thread 13 ---- PC 5: Stalled ----- 100415 in-flight CPI 1.6291 -- Total Cycles 163619 ---- Thread 14 ---- PC 5: Stalled ----- 91321 in-flight CPI 1.7914 -- Total Cycles 163619 ---- Thread 15 ---- PC 5: Stalled ----- 99783 in-flight CPI 1.6396 -- Total Cycles 163619 ---- Thread 16 ---- PC 5: Stalled ----- 93176 in-flight CPI 1.7558 -- Total Cycles 163619 ---- Thread 17 ---- PC 5: Stalled ----- 94873 in-flight CPI 1.7243 -- Total Cycles 163619 ---- Thread 18 ---- PC 5: Stalled ----- 98945 in-flight CPI 1.6534 -- Total Cycles 163619 ---- Thread 19 ---- PC 5: Stalled ----- 92755 in-flight CPI 1.7637 -- Total Cycles 163619 ---- Thread 20 ---- PC 5: Stalled ----- 93906 in-flight CPI 1.7421 -- Total Cycles 163619 ---- Thread 21 ---- PC 5: Stalled ----- 98186 in-flight CPI 1.6662 -- Total Cycles 163619 ---- Thread 22 ---- PC 5: Stalled ----- 96757 in-flight CPI 1.6907 -- Total Cycles 163619 ---- Thread 23 ---- PC 5: Stalled ----- 93722 in-flight CPI 1.7454 -- Total Cycles 163619 ---- Thread 24 ---- PC 5: Stalled ----- 90575 in-flight CPI 1.8062 -- Total Cycles 163619 ---- Thread 25 ---- PC 5: Stalled ----- 90616 in-flight CPI 1.8053 -- Total Cycles 163619 ---- Thread 26 ---- PC 5: Stalled ----- 88051 in-flight CPI 1.8579 -- Total Cycles 163619 ---- Thread 27 ---- PC 5: Stalled ----- 91668 in-flight CPI 1.7845 -- Total Cycles 163619 ---- Thread 28 ---- PC 5: Stalled ----- 92441 in-flight CPI 1.7697 -- Total Cycles 163619 ---- Thread 29 ---- PC 5: Stalled ----- 89019 in-flight CPI 1.8377 -- Total Cycles 163619 ---- Thread 30 ---- PC 5: Stalled ----- 87854 in-flight CPI 1.8620 -- Total Cycles 163619 ---- Thread 31 ---- PC 5: Stalled ----- 88783 in-flight CPI 1.8426 -- Total Cycles 163619 Total CPI 0.0534 , IPC 18.7318 -- Total Cycles 163619 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8978 (4.061213%) FPSUB: 0 (0.000000%) FPMUL: 34070 (15.411617%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 85115 (38.501904%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3971 (1.796288%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 81401 (36.821869%) DIV: 7278 (3.292214%) FPUN: 0 (0.000000%) FPRSUB: 254 (0.114897%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3368352 total) ADD%: 7.155 (241015) SUB%: 0.000 (0) MUL%: 0.006 (197) BITOR%: 1.505 (50679) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.612 (20620) FPSUB%: 0.000 (0) FPMUL%: 4.955 (166913) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (591) FPMAX%: 0.018 (591) LOAD%: 5.241 (176527) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (229) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (567) FPINV%: 0.000 (0) FPCONV%: 0.018 (623) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.087 (36601) FPLE%: 0.449 (15126) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (591) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.763 (93072) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.767 (25832) CMPU%: 0.000 (0) RSUB%: 0.006 (197) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.616 (525995) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.164 (39196) ORI%: 1.600 (53898) XORI%: 0.000 (0) MULI%: 3.159 (106420) LW%: 1.378 (46416) LWI%: 12.993 (437649) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.282 (9506) SWI%: 4.081 (137460) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.381 (46525) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10364) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.071 (2395) bned%: 0.000 (0) bneid%: 13.710 (461810) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.708 (23842) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.138 (4656) DIV%: 0.012 (394) FPUN%: 1.450 (48830) FPRSUB%: 4.378 (147459) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (64) FPGT%: 2.917 (98271) FPGE%: 1.001 (33704) SYNC%: 0.000 (0) NOP%: 9.008 (303431) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 26 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 153 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 386 LOAD 41780 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1283 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48917 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 12873 XORI 0 MULI 9297 LW 0 LWI 142977 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 94 DIV 29 FPUN 0 FPRSUB 59 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 18.7319 --Total thread-cycles: 5235808 --total thread-cycles issued: 3064921 (58.537693%) --iCache conflicts: 112576 (2.150117%) --thread*cycles of FU dependence: 257945 (4.926556%) --thread*cycles of data dependence: 221067 (4.222213%) --iCache cycles*banks: 5235808 (64.333603% used) Issue breakdown: --thread*cycles of issue worked: 3064921 (58.537693%) --thread*cycles of issue failed: 1867456 (35.667007%) --thread*cycles of issue NOP/other: 4603904275884843335 (87931108720640.000000%) Number of thread-cycles not ready: 221067 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3368352 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 9 4: 6 5: 7 6: 6 7: 8 8: 8 9: 8 10: 7 11: 7 12: 7 13: 8 14: 7 15: 5 16: 6 17: 8 18: 7 19: 7 20: 7 21: 6 22: 7 23: 9 24: 6 25: 7 26: 7 27: 8 28: 7 29: 7 30: 8 31: 7 <=== Core 28 ===> ---- Thread 00 ---- PC 5: Stalled ----- 93878 in-flight CPI 1.3600 -- Total Cycles 127698 ---- Thread 01 ---- PC 5: Stalled ----- 94416 in-flight CPI 1.3523 -- Total Cycles 127698 ---- Thread 02 ---- PC 5: Stalled ----- 103651 in-flight CPI 1.2318 -- Total Cycles 127698 ---- Thread 03 ---- PC 5: Stalled ----- 100943 in-flight CPI 1.2648 -- Total Cycles 127698 ---- Thread 04 ---- PC 5: Stalled ----- 97599 in-flight CPI 1.3081 -- Total Cycles 127698 ---- Thread 05 ---- PC 5: Stalled ----- 98907 in-flight CPI 1.2908 -- Total Cycles 127698 ---- Thread 06 ---- PC 5: Stalled ----- 94762 in-flight CPI 1.3473 -- Total Cycles 127698 ---- Thread 07 ---- PC 5: Stalled ----- 101439 in-flight CPI 1.2586 -- Total Cycles 127698 ---- Thread 08 ---- PC 5: Stalled ----- 93850 in-flight CPI 1.3604 -- Total Cycles 127698 ---- Thread 09 ---- PC 5: Stalled ----- 98957 in-flight CPI 1.2901 -- Total Cycles 127698 ---- Thread 10 ---- PC 5: Stalled ----- 102996 in-flight CPI 1.2396 -- Total Cycles 127698 ---- Thread 11 ---- PC 5: Stalled ----- 99435 in-flight CPI 1.2841 -- Total Cycles 127698 ---- Thread 12 ---- PC 5: Stalled ----- 98084 in-flight CPI 1.3017 -- Total Cycles 127698 ---- Thread 13 ---- PC 5: Stalled ----- 99131 in-flight CPI 1.2879 -- Total Cycles 127698 ---- Thread 14 ---- PC 5: Stalled ----- 97883 in-flight CPI 1.3043 -- Total Cycles 127698 ---- Thread 15 ---- PC 5: Stalled ----- 94976 in-flight CPI 1.3443 -- Total Cycles 127698 ---- Thread 16 ---- PC 5: Stalled ----- 95223 in-flight CPI 1.3408 -- Total Cycles 127698 ---- Thread 17 ---- PC 5: Stalled ----- 96855 in-flight CPI 1.3182 -- Total Cycles 127698 ---- Thread 18 ---- PC 5: Stalled ----- 89864 in-flight CPI 1.4208 -- Total Cycles 127698 ---- Thread 19 ---- PC 5: Stalled ----- 94169 in-flight CPI 1.3559 -- Total Cycles 127698 ---- Thread 20 ---- PC 5: Stalled ----- 88452 in-flight CPI 1.4435 -- Total Cycles 127698 ---- Thread 21 ---- PC 5: Stalled ----- 98866 in-flight CPI 1.2914 -- Total Cycles 127698 ---- Thread 22 ---- PC 5: Stalled ----- 88196 in-flight CPI 1.4476 -- Total Cycles 127698 ---- Thread 23 ---- PC 5: Stalled ----- 89085 in-flight CPI 1.4332 -- Total Cycles 127698 ---- Thread 24 ---- PC 5: Stalled ----- 94823 in-flight CPI 1.3464 -- Total Cycles 127698 ---- Thread 25 ---- PC 5: Stalled ----- 92931 in-flight CPI 1.3739 -- Total Cycles 127698 ---- Thread 26 ---- PC 5: Stalled ----- 89611 in-flight CPI 1.4248 -- Total Cycles 127698 ---- Thread 27 ---- PC 5: Stalled ----- 92145 in-flight CPI 1.3856 -- Total Cycles 127698 ---- Thread 28 ---- PC 5: Stalled ----- 87143 in-flight CPI 1.4651 -- Total Cycles 127698 ---- Thread 29 ---- PC 5: Stalled ----- 89786 in-flight CPI 1.4220 -- Total Cycles 127698 ---- Thread 30 ---- PC 5: Stalled ----- 87542 in-flight CPI 1.4584 -- Total Cycles 127698 ---- Thread 31 ---- PC 5: Stalled ----- 89774 in-flight CPI 1.4221 -- Total Cycles 127698 Total CPI 0.0421 , IPC 23.7744 -- Total Cycles 127698 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7680 (4.173754%) FPSUB: 0 (0.000000%) FPMUL: 31559 (17.150978%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 62366 (33.893276%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4441 (2.413495%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70055 (38.071922%) DIV: 7644 (4.154190%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.142386%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3337033 total) ADD%: 7.168 (239198) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.540 (51374) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.544 (18166) FPSUB%: 0.000 (0) FPMUL%: 4.751 (158557) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.105 (170364) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (606) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (35450) FPLE%: 0.454 (15134) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.809 (93722) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.741 (24722) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.671 (522953) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39162) ORI%: 1.569 (52371) XORI%: 0.000 (0) MULI%: 3.210 (107132) LW%: 1.402 (46779) LWI%: 13.121 (437857) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9537) SWI%: 4.144 (138300) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.406 (46907) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10274) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1849) bned%: 0.000 (0) bneid%: 13.824 (461324) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.724 (24162) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.119 (3982) DIV%: 0.012 (414) FPUN%: 1.493 (49825) FPRSUB%: 4.184 (139625) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.943 (98219) FPGE%: 1.040 (34691) SYNC%: 0.000 (0) NOP%: 9.021 (301040) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 25 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 156 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 39514 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1447 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49351 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 10884 XORI 0 MULI 9962 LW 0 LWI 142547 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 75 DIV 26 FPUN 0 FPRSUB 55 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7746 --Total thread-cycles: 4086336 --total thread-cycles issued: 3035993 (74.296219%) --iCache conflicts: 114658 (2.805888%) --thread*cycles of FU dependence: 254505 (6.228196%) --thread*cycles of data dependence: 184007 (4.502983%) --iCache cycles*banks: 4086336 (81.663986% used) Issue breakdown: --thread*cycles of issue worked: 3035993 (74.296219%) --thread*cycles of issue failed: 749303 (18.336794%) --thread*cycles of issue NOP/other: 163209076262 (3994020.000000%) Number of thread-cycles not ready: 184007 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3337033 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 8 4: 9 5: 8 6: 8 7: 9 8: 7 9: 9 10: 9 11: 6 12: 8 13: 8 14: 8 15: 8 16: 7 17: 7 18: 7 19: 6 20: 6 21: 8 22: 7 23: 6 24: 8 25: 7 26: 7 27: 7 28: 7 29: 7 30: 7 31: 8 <=== Core 29 ===> ---- Thread 00 ---- PC 5: Stalled ----- 92483 in-flight CPI 1.3948 -- Total Cycles 129017 ---- Thread 01 ---- PC 5: Stalled ----- 97905 in-flight CPI 1.3175 -- Total Cycles 129017 ---- Thread 02 ---- PC 5: Stalled ----- 99191 in-flight CPI 1.3004 -- Total Cycles 129017 ---- Thread 03 ---- PC 5: Stalled ----- 95328 in-flight CPI 1.3532 -- Total Cycles 129017 ---- Thread 04 ---- PC 5: Stalled ----- 99288 in-flight CPI 1.2992 -- Total Cycles 129017 ---- Thread 05 ---- PC 5: Stalled ----- 101745 in-flight CPI 1.2678 -- Total Cycles 129017 ---- Thread 06 ---- PC 5: Stalled ----- 97303 in-flight CPI 1.3257 -- Total Cycles 129017 ---- Thread 07 ---- PC 5: Stalled ----- 94331 in-flight CPI 1.3674 -- Total Cycles 129017 ---- Thread 08 ---- PC 5: Stalled ----- 95298 in-flight CPI 1.3536 -- Total Cycles 129017 ---- Thread 09 ---- PC 5: Stalled ----- 96333 in-flight CPI 1.3390 -- Total Cycles 129017 ---- Thread 10 ---- PC 5: Stalled ----- 97675 in-flight CPI 1.3206 -- Total Cycles 129017 ---- Thread 11 ---- PC 5: Stalled ----- 104287 in-flight CPI 1.2369 -- Total Cycles 129017 ---- Thread 12 ---- PC 5: Stalled ----- 94144 in-flight CPI 1.3701 -- Total Cycles 129017 ---- Thread 13 ---- PC 5: Stalled ----- 92282 in-flight CPI 1.3979 -- Total Cycles 129017 ---- Thread 14 ---- PC 5: Stalled ----- 92864 in-flight CPI 1.3891 -- Total Cycles 129017 ---- Thread 15 ---- PC 5: Stalled ----- 101831 in-flight CPI 1.2667 -- Total Cycles 129017 ---- Thread 16 ---- PC 5: Stalled ----- 94856 in-flight CPI 1.3599 -- Total Cycles 129017 ---- Thread 17 ---- PC 5: Stalled ----- 89762 in-flight CPI 1.4371 -- Total Cycles 129017 ---- Thread 18 ---- PC 5: Stalled ----- 97571 in-flight CPI 1.3220 -- Total Cycles 129017 ---- Thread 19 ---- PC 5: Stalled ----- 97654 in-flight CPI 1.3209 -- Total Cycles 129017 ---- Thread 20 ---- PC 5: Stalled ----- 95759 in-flight CPI 1.3471 -- Total Cycles 129017 ---- Thread 21 ---- PC 5: Stalled ----- 95390 in-flight CPI 1.3523 -- Total Cycles 129017 ---- Thread 22 ---- PC 5: Stalled ----- 94240 in-flight CPI 1.3688 -- Total Cycles 129017 ---- Thread 23 ---- PC 5: Stalled ----- 90367 in-flight CPI 1.4274 -- Total Cycles 129017 ---- Thread 24 ---- PC 5: Stalled ----- 86219 in-flight CPI 1.4962 -- Total Cycles 129017 ---- Thread 25 ---- PC 5: Stalled ----- 93867 in-flight CPI 1.3742 -- Total Cycles 129017 ---- Thread 26 ---- PC 5: Stalled ----- 92761 in-flight CPI 1.3906 -- Total Cycles 129017 ---- Thread 27 ---- PC 5: Stalled ----- 93210 in-flight CPI 1.3840 -- Total Cycles 129017 ---- Thread 28 ---- PC 5: Stalled ----- 87424 in-flight CPI 1.4755 -- Total Cycles 129017 ---- Thread 29 ---- PC 5: Stalled ----- 94053 in-flight CPI 1.3715 -- Total Cycles 129017 ---- Thread 30 ---- PC 5: Stalled ----- 89387 in-flight CPI 1.4432 -- Total Cycles 129017 ---- Thread 31 ---- PC 5: Stalled ----- 85093 in-flight CPI 1.5159 -- Total Cycles 129017 Total CPI 0.0426 , IPC 23.4889 -- Total Cycles 129017 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8113 (4.238789%) FPSUB: 0 (0.000000%) FPMUL: 32415 (16.935825%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 65137 (34.032047%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4201 (2.194891%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73666 (38.488186%) DIV: 7604 (3.972853%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.137409%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3331430 total) ADD%: 7.199 (239819) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.516 (50521) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.573 (19090) FPSUB%: 0.000 (0) FPMUL%: 4.839 (161211) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.150 (171572) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (591) FPINV%: 0.000 (0) FPCONV%: 0.020 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.074 (35780) FPLE%: 0.454 (15119) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.782 (92696) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24890) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.652 (521422) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.167 (38865) ORI%: 1.572 (52385) XORI%: 0.000 (0) MULI%: 3.187 (106158) LW%: 1.386 (46185) LWI%: 13.070 (435413) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9515) SWI%: 4.115 (137085) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.390 (46300) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10285) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1975) bned%: 0.000 (0) bneid%: 13.792 (459478) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.710 (23637) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4195) DIV%: 0.012 (412) FPUN%: 1.467 (48858) FPRSUB%: 4.253 (141673) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.951 (98326) FPGE%: 1.013 (33739) SYNC%: 0.000 (0) NOP%: 9.032 (300911) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 152 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 39549 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1262 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49033 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 21 ORI 11545 XORI 0 MULI 9227 LW 0 LWI 142054 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 24 FPUN 0 FPRSUB 45 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4891 --Total thread-cycles: 4128544 --total thread-cycles issued: 3030519 (73.404060%) --iCache conflicts: 113016 (2.737430%) --thread*cycles of FU dependence: 253467 (6.139380%) --thread*cycles of data dependence: 191399 (4.635993%) --iCache cycles*banks: 4128544 (80.693390% used) Issue breakdown: --thread*cycles of issue worked: 3030519 (73.404060%) --thread*cycles of issue failed: 797114 (19.307388%) --thread*cycles of issue NOP/other: 4618546002469885807 (111868639182848.000000%) Number of thread-cycles not ready: 191399 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3331430 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 7 4: 8 5: 9 6: 7 7: 9 8: 8 9: 8 10: 9 11: 8 12: 8 13: 6 14: 6 15: 9 16: 7 17: 6 18: 9 19: 8 20: 7 21: 7 22: 7 23: 8 24: 5 25: 8 26: 8 27: 6 28: 7 29: 8 30: 5 31: 7 <=== Core 30 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94093 in-flight CPI 1.3673 -- Total Cycles 128677 ---- Thread 01 ---- PC 5: Stalled ----- 97277 in-flight CPI 1.3225 -- Total Cycles 128677 ---- Thread 02 ---- PC 5: Stalled ----- 94993 in-flight CPI 1.3544 -- Total Cycles 128677 ---- Thread 03 ---- PC 5: Stalled ----- 93499 in-flight CPI 1.3760 -- Total Cycles 128677 ---- Thread 04 ---- PC 5: Stalled ----- 101177 in-flight CPI 1.2716 -- Total Cycles 128677 ---- Thread 05 ---- PC 5: Stalled ----- 99588 in-flight CPI 1.2918 -- Total Cycles 128677 ---- Thread 06 ---- PC 5: Stalled ----- 100473 in-flight CPI 1.2805 -- Total Cycles 128677 ---- Thread 07 ---- PC 5: Stalled ----- 100770 in-flight CPI 1.2767 -- Total Cycles 128677 ---- Thread 08 ---- PC 5: Stalled ----- 96624 in-flight CPI 1.3315 -- Total Cycles 128677 ---- Thread 09 ---- PC 5: Stalled ----- 96192 in-flight CPI 1.3375 -- Total Cycles 128677 ---- Thread 10 ---- PC 5: Stalled ----- 95469 in-flight CPI 1.3476 -- Total Cycles 128677 ---- Thread 11 ---- PC 5: Stalled ----- 97034 in-flight CPI 1.3259 -- Total Cycles 128677 ---- Thread 12 ---- PC 5: Stalled ----- 92449 in-flight CPI 1.3916 -- Total Cycles 128677 ---- Thread 13 ---- PC 5: Stalled ----- 102223 in-flight CPI 1.2586 -- Total Cycles 128677 ---- Thread 14 ---- PC 5: Stalled ----- 94265 in-flight CPI 1.3648 -- Total Cycles 128677 ---- Thread 15 ---- PC 5: Stalled ----- 89541 in-flight CPI 1.4369 -- Total Cycles 128677 ---- Thread 16 ---- PC 5: Stalled ----- 92617 in-flight CPI 1.3891 -- Total Cycles 128677 ---- Thread 17 ---- PC 5: Stalled ----- 92115 in-flight CPI 1.3967 -- Total Cycles 128677 ---- Thread 18 ---- PC 5: Stalled ----- 95050 in-flight CPI 1.3536 -- Total Cycles 128677 ---- Thread 19 ---- PC 5: Stalled ----- 96399 in-flight CPI 1.3346 -- Total Cycles 128677 ---- Thread 20 ---- PC 5: Stalled ----- 98191 in-flight CPI 1.3102 -- Total Cycles 128677 ---- Thread 21 ---- PC 5: Stalled ----- 92431 in-flight CPI 1.3919 -- Total Cycles 128677 ---- Thread 22 ---- PC 5: Stalled ----- 95761 in-flight CPI 1.3434 -- Total Cycles 128677 ---- Thread 23 ---- PC 5: Stalled ----- 95320 in-flight CPI 1.3497 -- Total Cycles 128677 ---- Thread 24 ---- PC 5: Stalled ----- 96824 in-flight CPI 1.3287 -- Total Cycles 128677 ---- Thread 25 ---- PC 5: Stalled ----- 94638 in-flight CPI 1.3594 -- Total Cycles 128677 ---- Thread 26 ---- PC 5: Stalled ----- 91036 in-flight CPI 1.4132 -- Total Cycles 128677 ---- Thread 27 ---- PC 5: Stalled ----- 93925 in-flight CPI 1.3698 -- Total Cycles 128677 ---- Thread 28 ---- PC 5: Stalled ----- 87275 in-flight CPI 1.4742 -- Total Cycles 128677 ---- Thread 29 ---- PC 5: Stalled ----- 86474 in-flight CPI 1.4878 -- Total Cycles 128677 ---- Thread 30 ---- PC 5: Stalled ----- 94238 in-flight CPI 1.3652 -- Total Cycles 128677 ---- Thread 31 ---- PC 5: Stalled ----- 85936 in-flight CPI 1.4971 -- Total Cycles 128677 Total CPI 0.0424 , IPC 23.5819 -- Total Cycles 128677 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8068 (4.067209%) FPSUB: 0 (0.000000%) FPMUL: 32214 (16.239597%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73105 (36.853409%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4120 (2.076958%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73171 (36.886681%) DIV: 7427 (3.744070%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.132078%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3335019 total) ADD%: 7.194 (239936) SUB%: 0.000 (0) MUL%: 0.006 (201) BITOR%: 1.535 (51185) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.564 (18826) FPSUB%: 0.000 (0) FPMUL%: 4.812 (160486) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (603) FPMAX%: 0.018 (603) LOAD%: 5.149 (171711) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (233) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (581) FPINV%: 0.000 (0) FPCONV%: 0.019 (635) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.070 (35681) FPLE%: 0.453 (15097) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (603) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.791 (93091) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (25127) CMPU%: 0.000 (0) RSUB%: 0.006 (201) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.660 (522260) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39131) ORI%: 1.578 (52616) XORI%: 0.000 (0) MULI%: 3.190 (106394) LW%: 1.391 (46406) LWI%: 13.062 (435636) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9529) SWI%: 4.116 (137257) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.395 (46521) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10320) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1894) bned%: 0.000 (0) bneid%: 13.795 (460081) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23841) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4169) DIV%: 0.012 (402) FPUN%: 1.482 (49411) FPRSUB%: 4.244 (141524) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (63) FPGT%: 2.934 (97835) FPGE%: 1.029 (34314) SYNC%: 0.000 (0) NOP%: 9.011 (300519) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 30 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 153 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 39922 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 19 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1425 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49085 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 11452 XORI 0 MULI 9677 LW 0 LWI 142019 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 52 DIV 29 FPUN 0 FPRSUB 57 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5821 --Total thread-cycles: 4117664 --total thread-cycles issued: 3034500 (73.694695%) --iCache conflicts: 112434 (2.730529%) --thread*cycles of FU dependence: 254347 (6.176973%) --thread*cycles of data dependence: 198367 (4.817464%) --iCache cycles*banks: 4117664 (80.993759% used) Issue breakdown: --thread*cycles of issue worked: 3034500 (73.694695%) --thread*cycles of issue failed: 782645 (19.007015%) --thread*cycles of issue NOP/other: 689187463 (16737.341797%) Number of thread-cycles not ready: 198367 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3335019 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 7 4: 7 5: 8 6: 8 7: 8 8: 8 9: 6 10: 8 11: 7 12: 8 13: 8 14: 7 15: 6 16: 7 17: 7 18: 7 19: 8 20: 8 21: 6 22: 9 23: 7 24: 8 25: 7 26: 7 27: 7 28: 6 29: 7 30: 7 31: 7 <=== Core 31 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103325 in-flight CPI 1.2352 -- Total Cycles 127654 ---- Thread 01 ---- PC 5: Stalled ----- 98172 in-flight CPI 1.3001 -- Total Cycles 127654 ---- Thread 02 ---- PC 5: Stalled ----- 96552 in-flight CPI 1.3219 -- Total Cycles 127654 ---- Thread 03 ---- PC 5: Stalled ----- 99389 in-flight CPI 1.2841 -- Total Cycles 127654 ---- Thread 04 ---- PC 5: Stalled ----- 99810 in-flight CPI 1.2788 -- Total Cycles 127654 ---- Thread 05 ---- PC 5: Stalled ----- 102229 in-flight CPI 1.2484 -- Total Cycles 127654 ---- Thread 06 ---- PC 5: Stalled ----- 95959 in-flight CPI 1.3300 -- Total Cycles 127654 ---- Thread 07 ---- PC 5: Stalled ----- 97130 in-flight CPI 1.3140 -- Total Cycles 127654 ---- Thread 08 ---- PC 5: Stalled ----- 95717 in-flight CPI 1.3335 -- Total Cycles 127654 ---- Thread 09 ---- PC 5: Stalled ----- 96721 in-flight CPI 1.3196 -- Total Cycles 127654 ---- Thread 10 ---- PC 5: Stalled ----- 99118 in-flight CPI 1.2876 -- Total Cycles 127654 ---- Thread 11 ---- PC 5: Stalled ----- 96043 in-flight CPI 1.3289 -- Total Cycles 127654 ---- Thread 12 ---- PC 5: Stalled ----- 96489 in-flight CPI 1.3228 -- Total Cycles 127654 ---- Thread 13 ---- PC 5: Stalled ----- 96751 in-flight CPI 1.3191 -- Total Cycles 127654 ---- Thread 14 ---- PC 5: Stalled ----- 99805 in-flight CPI 1.2788 -- Total Cycles 127654 ---- Thread 15 ---- PC 5: Stalled ----- 94944 in-flight CPI 1.3443 -- Total Cycles 127654 ---- Thread 16 ---- PC 5: Stalled ----- 94066 in-flight CPI 1.3568 -- Total Cycles 127654 ---- Thread 17 ---- PC 5: Stalled ----- 90975 in-flight CPI 1.4030 -- Total Cycles 127654 ---- Thread 18 ---- PC 5: Stalled ----- 97613 in-flight CPI 1.3075 -- Total Cycles 127654 ---- Thread 19 ---- PC 5: Stalled ----- 94525 in-flight CPI 1.3502 -- Total Cycles 127654 ---- Thread 20 ---- PC 5: Stalled ----- 93848 in-flight CPI 1.3599 -- Total Cycles 127654 ---- Thread 21 ---- PC 5: Stalled ----- 93806 in-flight CPI 1.3606 -- Total Cycles 127654 ---- Thread 22 ---- PC 5: Stalled ----- 95034 in-flight CPI 1.3430 -- Total Cycles 127654 ---- Thread 23 ---- PC 5: Stalled ----- 96807 in-flight CPI 1.3183 -- Total Cycles 127654 ---- Thread 24 ---- PC 5: Stalled ----- 85710 in-flight CPI 1.4891 -- Total Cycles 127654 ---- Thread 25 ---- PC 5: Stalled ----- 91552 in-flight CPI 1.3940 -- Total Cycles 127654 ---- Thread 26 ---- PC 5: Stalled ----- 95879 in-flight CPI 1.3311 -- Total Cycles 127654 ---- Thread 27 ---- PC 5: Stalled ----- 92381 in-flight CPI 1.3815 -- Total Cycles 127654 ---- Thread 28 ---- PC 5: Stalled ----- 84758 in-flight CPI 1.5059 -- Total Cycles 127654 ---- Thread 29 ---- PC 5: Stalled ----- 89222 in-flight CPI 1.4305 -- Total Cycles 127654 ---- Thread 30 ---- PC 5: Stalled ----- 94360 in-flight CPI 1.3526 -- Total Cycles 127654 ---- Thread 31 ---- PC 5: Stalled ----- 85896 in-flight CPI 1.4859 -- Total Cycles 127654 Total CPI 0.0419 , IPC 23.8548 -- Total Cycles 127654 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7709 (4.068654%) FPSUB: 0 (0.000000%) FPMUL: 31587 (16.670977%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 67211 (35.472603%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4372 (2.307453%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70579 (37.250160%) DIV: 7748 (4.089237%) FPUN: 0 (0.000000%) FPRSUB: 267 (0.140917%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3347033 total) ADD%: 7.200 (240999) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.539 (51498) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (18187) FPSUB%: 0.000 (0) FPMUL%: 4.746 (158862) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.118 (171309) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (606) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (35488) FPLE%: 0.458 (15318) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.808 (93970) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.740 (24781) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.682 (524873) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39296) ORI%: 1.563 (52304) XORI%: 0.000 (0) MULI%: 3.208 (107364) LW%: 1.400 (46843) LWI%: 13.103 (438574) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9623) SWI%: 4.142 (138632) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46965) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10349) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1807) bned%: 0.000 (0) bneid%: 13.820 (462560) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24136) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (4011) DIV%: 0.013 (420) FPUN%: 1.491 (49916) FPRSUB%: 4.182 (139970) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (78) FPGT%: 2.945 (98569) FPGE%: 1.034 (34598) SYNC%: 0.000 (0) NOP%: 9.017 (301817) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 12 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 162 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 38884 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1555 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49349 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 10958 XORI 0 MULI 9754 LW 0 LWI 142875 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 65 DIV 25 FPUN 0 FPRSUB 36 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8550 --Total thread-cycles: 4084928 --total thread-cycles issued: 3045216 (74.547607%) --iCache conflicts: 113735 (2.784260%) --thread*cycles of FU dependence: 254141 (6.221432%) --thread*cycles of data dependence: 189473 (4.638344%) --iCache cycles*banks: 4084928 (81.936935% used) Issue breakdown: --thread*cycles of issue worked: 3045216 (74.547607%) --thread*cycles of issue failed: 737895 (18.063843%) --thread*cycles of issue NOP/other: 4621614879546317561 (113138221449216.000000%) Number of thread-cycles not ready: 189473 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3347033 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 10 1: 7 2: 7 3: 9 4: 7 5: 9 6: 8 7: 8 8: 6 9: 7 10: 9 11: 8 12: 7 13: 8 14: 8 15: 7 16: 7 17: 6 18: 8 19: 8 20: 8 21: 7 22: 7 23: 9 24: 6 25: 8 26: 8 27: 9 28: 6 29: 6 30: 8 31: 6 <=== Core 32 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97776 in-flight CPI 1.2957 -- Total Cycles 126709 ---- Thread 01 ---- PC 5: Stalled ----- 94833 in-flight CPI 1.3359 -- Total Cycles 126709 ---- Thread 02 ---- PC 5: Stalled ----- 101472 in-flight CPI 1.2484 -- Total Cycles 126709 ---- Thread 03 ---- PC 5: Stalled ----- 97509 in-flight CPI 1.2992 -- Total Cycles 126709 ---- Thread 04 ---- PC 5: Stalled ----- 93633 in-flight CPI 1.3530 -- Total Cycles 126709 ---- Thread 05 ---- PC 5: Stalled ----- 100595 in-flight CPI 1.2593 -- Total Cycles 126709 ---- Thread 06 ---- PC 5: Stalled ----- 97397 in-flight CPI 1.3007 -- Total Cycles 126709 ---- Thread 07 ---- PC 5: Stalled ----- 94003 in-flight CPI 1.3477 -- Total Cycles 126709 ---- Thread 08 ---- PC 5: Stalled ----- 97388 in-flight CPI 1.3008 -- Total Cycles 126709 ---- Thread 09 ---- PC 5: Stalled ----- 98962 in-flight CPI 1.2801 -- Total Cycles 126709 ---- Thread 10 ---- PC 5: Stalled ----- 96821 in-flight CPI 1.3085 -- Total Cycles 126709 ---- Thread 11 ---- PC 5: Stalled ----- 92852 in-flight CPI 1.3644 -- Total Cycles 126709 ---- Thread 12 ---- PC 5: Stalled ----- 97603 in-flight CPI 1.2980 -- Total Cycles 126709 ---- Thread 13 ---- PC 5: Stalled ----- 97864 in-flight CPI 1.2945 -- Total Cycles 126709 ---- Thread 14 ---- PC 5: Stalled ----- 94598 in-flight CPI 1.3392 -- Total Cycles 126709 ---- Thread 15 ---- PC 5: Stalled ----- 94284 in-flight CPI 1.3436 -- Total Cycles 126709 ---- Thread 16 ---- PC 5: Stalled ----- 94640 in-flight CPI 1.3386 -- Total Cycles 126709 ---- Thread 17 ---- PC 5: Stalled ----- 100475 in-flight CPI 1.2609 -- Total Cycles 126709 ---- Thread 18 ---- PC 5: Stalled ----- 92961 in-flight CPI 1.3628 -- Total Cycles 126709 ---- Thread 19 ---- PC 5: Stalled ----- 95368 in-flight CPI 1.3284 -- Total Cycles 126709 ---- Thread 20 ---- PC 5: Stalled ----- 92111 in-flight CPI 1.3753 -- Total Cycles 126709 ---- Thread 21 ---- PC 5: Stalled ----- 95885 in-flight CPI 1.3212 -- Total Cycles 126709 ---- Thread 22 ---- PC 5: Stalled ----- 91167 in-flight CPI 1.3896 -- Total Cycles 126709 ---- Thread 23 ---- PC 5: Stalled ----- 93975 in-flight CPI 1.3481 -- Total Cycles 126709 ---- Thread 24 ---- PC 5: Stalled ----- 88783 in-flight CPI 1.4269 -- Total Cycles 126709 ---- Thread 25 ---- PC 5: Stalled ----- 93802 in-flight CPI 1.3506 -- Total Cycles 126709 ---- Thread 26 ---- PC 5: Stalled ----- 85714 in-flight CPI 1.4780 -- Total Cycles 126709 ---- Thread 27 ---- PC 5: Stalled ----- 88632 in-flight CPI 1.4294 -- Total Cycles 126709 ---- Thread 28 ---- PC 5: Stalled ----- 88477 in-flight CPI 1.4318 -- Total Cycles 126709 ---- Thread 29 ---- PC 5: Stalled ----- 82679 in-flight CPI 1.5323 -- Total Cycles 126709 ---- Thread 30 ---- PC 5: Stalled ----- 88916 in-flight CPI 1.4247 -- Total Cycles 126709 ---- Thread 31 ---- PC 5: Stalled ----- 90140 in-flight CPI 1.4054 -- Total Cycles 126709 Total CPI 0.0421 , IPC 23.7700 -- Total Cycles 126709 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7618 (4.083821%) FPSUB: 0 (0.000000%) FPMUL: 31271 (16.763607%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 65964 (35.361664%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4165 (2.232753%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69687 (37.357471%) DIV: 7574 (4.060234%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.140452%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3310315 total) ADD%: 7.187 (237897) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.529 (50610) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.544 (18019) FPSUB%: 0.000 (0) FPMUL%: 4.749 (157204) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (615) FPMAX%: 0.019 (615) LOAD%: 5.130 (169825) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (588) FPINV%: 0.000 (0) FPCONV%: 0.020 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (35093) FPLE%: 0.455 (15059) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.813 (93125) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.743 (24587) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.687 (519293) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (38920) ORI%: 1.560 (51656) XORI%: 0.000 (0) MULI%: 3.209 (106232) LW%: 1.403 (46428) LWI%: 13.115 (434142) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9529) SWI%: 4.154 (137527) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.406 (46542) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10276) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1843) bned%: 0.000 (0) bneid%: 13.805 (456979) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (23838) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (3962) DIV%: 0.012 (410) FPUN%: 1.482 (49063) FPRSUB%: 4.185 (138526) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (64) FPGT%: 2.944 (97454) FPGE%: 1.027 (34004) SYNC%: 0.000 (0) NOP%: 9.014 (298385) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 30 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 154 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 406 LOAD 38489 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1180 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48862 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 15 ORI 10844 XORI 0 MULI 9510 LW 0 LWI 141358 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 74 DIV 21 FPUN 0 FPRSUB 48 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7703 --Total thread-cycles: 4054688 --total thread-cycles issued: 3011930 (74.282661%) --iCache conflicts: 112266 (2.768795%) --thread*cycles of FU dependence: 251050 (6.191598%) --thread*cycles of data dependence: 186541 (4.600625%) --iCache cycles*banks: 4054688 (81.642464% used) Issue breakdown: --thread*cycles of issue worked: 3011930 (74.282661%) --thread*cycles of issue failed: 744373 (18.358330%) --thread*cycles of issue NOP/other: 4618941655394061713 (113916080291840.000000%) Number of thread-cycles not ready: 186541 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3310315 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 9 3: 7 4: 7 5: 9 6: 8 7: 7 8: 8 9: 8 10: 7 11: 7 12: 7 13: 8 14: 7 15: 8 16: 8 17: 8 18: 8 19: 7 20: 8 21: 8 22: 7 23: 8 24: 8 25: 6 26: 6 27: 6 28: 7 29: 6 30: 8 31: 7 <=== Core 33 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98321 in-flight CPI 1.3031 -- Total Cycles 128143 ---- Thread 01 ---- PC 5: Stalled ----- 101151 in-flight CPI 1.2666 -- Total Cycles 128143 ---- Thread 02 ---- PC 5: Stalled ----- 93995 in-flight CPI 1.3631 -- Total Cycles 128143 ---- Thread 03 ---- PC 5: Stalled ----- 98056 in-flight CPI 1.3066 -- Total Cycles 128143 ---- Thread 04 ---- PC 5: Stalled ----- 101856 in-flight CPI 1.2578 -- Total Cycles 128143 ---- Thread 05 ---- PC 5: Stalled ----- 102248 in-flight CPI 1.2530 -- Total Cycles 128143 ---- Thread 06 ---- PC 5: Stalled ----- 96191 in-flight CPI 1.3319 -- Total Cycles 128143 ---- Thread 07 ---- PC 5: Stalled ----- 95830 in-flight CPI 1.3370 -- Total Cycles 128143 ---- Thread 08 ---- PC 5: Stalled ----- 95702 in-flight CPI 1.3388 -- Total Cycles 128143 ---- Thread 09 ---- PC 5: Stalled ----- 92862 in-flight CPI 1.3797 -- Total Cycles 128143 ---- Thread 10 ---- PC 5: Stalled ----- 93491 in-flight CPI 1.3705 -- Total Cycles 128143 ---- Thread 11 ---- PC 5: Stalled ----- 96117 in-flight CPI 1.3329 -- Total Cycles 128143 ---- Thread 12 ---- PC 5: Stalled ----- 96641 in-flight CPI 1.3257 -- Total Cycles 128143 ---- Thread 13 ---- PC 5: Stalled ----- 96547 in-flight CPI 1.3270 -- Total Cycles 128143 ---- Thread 14 ---- PC 5: Stalled ----- 99261 in-flight CPI 1.2907 -- Total Cycles 128143 ---- Thread 15 ---- PC 5: Stalled ----- 96511 in-flight CPI 1.3275 -- Total Cycles 128143 ---- Thread 16 ---- PC 5: Stalled ----- 97218 in-flight CPI 1.3178 -- Total Cycles 128143 ---- Thread 17 ---- PC 5: Stalled ----- 95871 in-flight CPI 1.3364 -- Total Cycles 128143 ---- Thread 18 ---- PC 5: Stalled ----- 97267 in-flight CPI 1.3172 -- Total Cycles 128143 ---- Thread 19 ---- PC 5: Stalled ----- 93328 in-flight CPI 1.3728 -- Total Cycles 128143 ---- Thread 20 ---- PC 5: Stalled ----- 92502 in-flight CPI 1.3851 -- Total Cycles 128143 ---- Thread 21 ---- PC 5: Stalled ----- 96277 in-flight CPI 1.3307 -- Total Cycles 128143 ---- Thread 22 ---- PC 5: Stalled ----- 93823 in-flight CPI 1.3655 -- Total Cycles 128143 ---- Thread 23 ---- PC 5: Stalled ----- 94648 in-flight CPI 1.3537 -- Total Cycles 128143 ---- Thread 24 ---- PC 5: Stalled ----- 91815 in-flight CPI 1.3954 -- Total Cycles 128143 ---- Thread 25 ---- PC 5: Stalled ----- 90489 in-flight CPI 1.4158 -- Total Cycles 128143 ---- Thread 26 ---- PC 5: Stalled ----- 87644 in-flight CPI 1.4619 -- Total Cycles 128143 ---- Thread 27 ---- PC 5: Stalled ----- 83835 in-flight CPI 1.5282 -- Total Cycles 128143 ---- Thread 28 ---- PC 5: Stalled ----- 91502 in-flight CPI 1.4002 -- Total Cycles 128143 ---- Thread 29 ---- PC 5: Stalled ----- 91523 in-flight CPI 1.3999 -- Total Cycles 128143 ---- Thread 30 ---- PC 5: Stalled ----- 94526 in-flight CPI 1.3554 -- Total Cycles 128143 ---- Thread 31 ---- PC 5: Stalled ----- 84232 in-flight CPI 1.5210 -- Total Cycles 128143 Total CPI 0.0423 , IPC 23.6598 -- Total Cycles 128143 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7922 (4.156414%) FPSUB: 0 (0.000000%) FPMUL: 31917 (16.745804%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 66458 (34.868336%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4260 (2.235082%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72290 (37.928192%) DIV: 7496 (3.932906%) FPUN: 0 (0.000000%) FPRSUB: 254 (0.133265%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3332560 total) ADD%: 7.139 (237907) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.532 (51049) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.559 (18621) FPSUB%: 0.000 (0) FPMUL%: 4.791 (159651) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.150 (171615) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (590) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35582) FPLE%: 0.458 (15247) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.800 (93328) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24883) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.679 (522518) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39083) ORI%: 1.570 (52307) XORI%: 0.000 (0) MULI%: 3.198 (106582) LW%: 1.396 (46535) LWI%: 13.086 (436111) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9543) SWI%: 4.129 (137592) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (46655) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10288) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1930) bned%: 0.000 (0) bneid%: 13.807 (460124) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23962) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4116) DIV%: 0.012 (406) FPUN%: 1.483 (49419) FPRSUB%: 4.225 (140817) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) FPGT%: 2.940 (97985) FPGE%: 1.025 (34172) SYNC%: 0.000 (0) NOP%: 9.022 (300671) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 153 FPSUB 0 FPMUL 5 FPCMPLT 0 FPMIN 0 FPMAX 393 LOAD 40448 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1860 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49110 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11255 XORI 0 MULI 9366 LW 0 LWI 142091 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 92 DIV 23 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6600 --Total thread-cycles: 4100576 --total thread-cycles issued: 3031889 (73.938126%) --iCache conflicts: 113714 (2.773123%) --thread*cycles of FU dependence: 254916 (6.216590%) --thread*cycles of data dependence: 190597 (4.648055%) --iCache cycles*banks: 4100576 (81.271317% used) Issue breakdown: --thread*cycles of issue worked: 3031889 (73.938126%) --thread*cycles of issue failed: 768016 (18.729465%) --thread*cycles of issue NOP/other: 690743199 (16845.027344%) Number of thread-cycles not ready: 190597 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3332560 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 8 4: 9 5: 8 6: 8 7: 7 8: 7 9: 7 10: 6 11: 8 12: 7 13: 7 14: 8 15: 7 16: 8 17: 7 18: 7 19: 6 20: 6 21: 8 22: 9 23: 7 24: 8 25: 8 26: 6 27: 7 28: 7 29: 7 30: 8 31: 7 <=== Core 34 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96476 in-flight CPI 1.3395 -- Total Cycles 129252 ---- Thread 01 ---- PC 5: Stalled ----- 100545 in-flight CPI 1.2852 -- Total Cycles 129252 ---- Thread 02 ---- PC 5: Stalled ----- 95322 in-flight CPI 1.3558 -- Total Cycles 129252 ---- Thread 03 ---- PC 5: Stalled ----- 97266 in-flight CPI 1.3286 -- Total Cycles 129252 ---- Thread 04 ---- PC 5: Stalled ----- 97436 in-flight CPI 1.3263 -- Total Cycles 129252 ---- Thread 05 ---- PC 5: Stalled ----- 95558 in-flight CPI 1.3524 -- Total Cycles 129252 ---- Thread 06 ---- PC 5: Stalled ----- 102504 in-flight CPI 1.2607 -- Total Cycles 129252 ---- Thread 07 ---- PC 5: Stalled ----- 94132 in-flight CPI 1.3728 -- Total Cycles 129252 ---- Thread 08 ---- PC 5: Stalled ----- 103645 in-flight CPI 1.2468 -- Total Cycles 129252 ---- Thread 09 ---- PC 5: Stalled ----- 96371 in-flight CPI 1.3409 -- Total Cycles 129252 ---- Thread 10 ---- PC 5: Stalled ----- 99603 in-flight CPI 1.2974 -- Total Cycles 129252 ---- Thread 11 ---- PC 5: Stalled ----- 97735 in-flight CPI 1.3223 -- Total Cycles 129252 ---- Thread 12 ---- PC 5: Stalled ----- 103013 in-flight CPI 1.2545 -- Total Cycles 129252 ---- Thread 13 ---- PC 5: Stalled ----- 98200 in-flight CPI 1.3160 -- Total Cycles 129252 ---- Thread 14 ---- PC 5: Stalled ----- 92494 in-flight CPI 1.3971 -- Total Cycles 129252 ---- Thread 15 ---- PC 5: Stalled ----- 99316 in-flight CPI 1.3012 -- Total Cycles 129252 ---- Thread 16 ---- PC 5: Stalled ----- 91133 in-flight CPI 1.4180 -- Total Cycles 129252 ---- Thread 17 ---- PC 5: Stalled ----- 96561 in-flight CPI 1.3383 -- Total Cycles 129252 ---- Thread 18 ---- PC 5: Stalled ----- 93199 in-flight CPI 1.3866 -- Total Cycles 129252 ---- Thread 19 ---- PC 5: Stalled ----- 99534 in-flight CPI 1.2983 -- Total Cycles 129252 ---- Thread 20 ---- PC 5: Stalled ----- 95075 in-flight CPI 1.3592 -- Total Cycles 129252 ---- Thread 21 ---- PC 5: Stalled ----- 95329 in-flight CPI 1.3555 -- Total Cycles 129252 ---- Thread 22 ---- PC 5: Stalled ----- 94633 in-flight CPI 1.3656 -- Total Cycles 129252 ---- Thread 23 ---- PC 5: Stalled ----- 85130 in-flight CPI 1.5181 -- Total Cycles 129252 ---- Thread 24 ---- PC 5: Stalled ----- 97192 in-flight CPI 1.3296 -- Total Cycles 129252 ---- Thread 25 ---- PC 5: Stalled ----- 88052 in-flight CPI 1.4676 -- Total Cycles 129252 ---- Thread 26 ---- PC 5: Stalled ----- 94140 in-flight CPI 1.3727 -- Total Cycles 129252 ---- Thread 27 ---- PC 5: Stalled ----- 85783 in-flight CPI 1.5064 -- Total Cycles 129252 ---- Thread 28 ---- PC 5: Stalled ----- 89445 in-flight CPI 1.4448 -- Total Cycles 129252 ---- Thread 29 ---- PC 5: Stalled ----- 86720 in-flight CPI 1.4902 -- Total Cycles 129252 ---- Thread 30 ---- PC 5: Stalled ----- 92468 in-flight CPI 1.3975 -- Total Cycles 129252 ---- Thread 31 ---- PC 5: Stalled ----- 88705 in-flight CPI 1.4569 -- Total Cycles 129252 Total CPI 0.0425 , IPC 23.5454 -- Total Cycles 129252 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7809 (4.338647%) FPSUB: 0 (0.000000%) FPMUL: 31821 (17.679611%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 56268 (31.262259%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4392 (2.440176%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71639 (39.802319%) DIV: 7795 (4.330868%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.146122%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3345342 total) ADD%: 7.171 (239890) SUB%: 0.000 (0) MUL%: 0.006 (211) BITOR%: 1.527 (51096) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (18381) FPSUB%: 0.000 (0) FPMUL%: 4.768 (159511) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (633) FPMAX%: 0.019 (633) LOAD%: 5.122 (171354) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (610) FPINV%: 0.000 (0) FPCONV%: 0.020 (665) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35658) FPLE%: 0.454 (15184) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (633) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.801 (93701) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.740 (24762) CMPU%: 0.000 (0) RSUB%: 0.006 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.668 (524146) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (39158) ORI%: 1.563 (52274) XORI%: 0.000 (0) MULI%: 3.208 (107312) LW%: 1.397 (46718) LWI%: 13.126 (439101) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9587) SWI%: 4.141 (138544) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (46842) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10317) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1897) bned%: 0.000 (0) bneid%: 13.815 (462143) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23999) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4071) DIV%: 0.013 (422) FPUN%: 1.481 (49545) FPRSUB%: 4.204 (140628) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (78) FPGT%: 2.951 (98733) FPGE%: 1.027 (34361) SYNC%: 0.000 (0) NOP%: 9.027 (301994) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 152 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 413 LOAD 39076 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1242 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49433 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11070 XORI 0 MULI 9424 LW 0 LWI 143074 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 80 DIV 30 FPUN 0 FPRSUB 52 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5457 --Total thread-cycles: 4136064 --total thread-cycles issued: 3043348 (73.580780%) --iCache conflicts: 115138 (2.783758%) --thread*cycles of FU dependence: 254138 (6.144441%) --thread*cycles of data dependence: 179987 (4.351649%) --iCache cycles*banks: 4136064 (80.883034% used) Issue breakdown: --thread*cycles of issue worked: 3043348 (73.580780%) --thread*cycles of issue failed: 790722 (19.117741%) --thread*cycles of issue NOP/other: 4622071656834440106 (111750477250560.000000%) Number of thread-cycles not ready: 179987 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3345342 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 6 3: 7 4: 7 5: 6 6: 9 7: 8 8: 10 9: 8 10: 9 11: 6 12: 7 13: 8 14: 8 15: 8 16: 8 17: 8 18: 7 19: 9 20: 8 21: 9 22: 7 23: 5 24: 8 25: 8 26: 7 27: 7 28: 7 29: 7 30: 8 31: 6 <=== Core 35 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98915 in-flight CPI 1.3142 -- Total Cycles 130017 ---- Thread 01 ---- PC 5: Stalled ----- 98483 in-flight CPI 1.3199 -- Total Cycles 130017 ---- Thread 02 ---- PC 5: Stalled ----- 99017 in-flight CPI 1.3128 -- Total Cycles 130017 ---- Thread 03 ---- PC 5: Stalled ----- 92501 in-flight CPI 1.4054 -- Total Cycles 130017 ---- Thread 04 ---- PC 5: Stalled ----- 96454 in-flight CPI 1.3477 -- Total Cycles 130017 ---- Thread 05 ---- PC 5: Stalled ----- 99771 in-flight CPI 1.3029 -- Total Cycles 130017 ---- Thread 06 ---- PC 5: Stalled ----- 93255 in-flight CPI 1.3939 -- Total Cycles 130017 ---- Thread 07 ---- PC 5: Stalled ----- 99718 in-flight CPI 1.3036 -- Total Cycles 130017 ---- Thread 08 ---- PC 5: Stalled ----- 100944 in-flight CPI 1.2878 -- Total Cycles 130017 ---- Thread 09 ---- PC 5: Stalled ----- 94119 in-flight CPI 1.3811 -- Total Cycles 130017 ---- Thread 10 ---- PC 5: Stalled ----- 96505 in-flight CPI 1.3470 -- Total Cycles 130017 ---- Thread 11 ---- PC 5: Stalled ----- 100241 in-flight CPI 1.2968 -- Total Cycles 130017 ---- Thread 12 ---- PC 5: Stalled ----- 96483 in-flight CPI 1.3473 -- Total Cycles 130017 ---- Thread 13 ---- PC 5: Stalled ----- 91762 in-flight CPI 1.4166 -- Total Cycles 130017 ---- Thread 14 ---- PC 5: Stalled ----- 98749 in-flight CPI 1.3164 -- Total Cycles 130017 ---- Thread 15 ---- PC 5: Stalled ----- 104785 in-flight CPI 1.2405 -- Total Cycles 130017 ---- Thread 16 ---- PC 5: Stalled ----- 95919 in-flight CPI 1.3553 -- Total Cycles 130017 ---- Thread 17 ---- PC 5: Stalled ----- 93629 in-flight CPI 1.3884 -- Total Cycles 130017 ---- Thread 18 ---- PC 5: Stalled ----- 95436 in-flight CPI 1.3621 -- Total Cycles 130017 ---- Thread 19 ---- PC 5: Stalled ----- 98360 in-flight CPI 1.3216 -- Total Cycles 130017 ---- Thread 20 ---- PC 5: Stalled ----- 95903 in-flight CPI 1.3555 -- Total Cycles 130017 ---- Thread 21 ---- PC 5: Stalled ----- 93905 in-flight CPI 1.3843 -- Total Cycles 130017 ---- Thread 22 ---- PC 5: Stalled ----- 89572 in-flight CPI 1.4513 -- Total Cycles 130017 ---- Thread 23 ---- PC 5: Stalled ----- 92815 in-flight CPI 1.4006 -- Total Cycles 130017 ---- Thread 24 ---- PC 5: Stalled ----- 87743 in-flight CPI 1.4815 -- Total Cycles 130017 ---- Thread 25 ---- PC 5: Stalled ----- 90228 in-flight CPI 1.4407 -- Total Cycles 130017 ---- Thread 26 ---- PC 5: Stalled ----- 89886 in-flight CPI 1.4462 -- Total Cycles 130017 ---- Thread 27 ---- PC 5: Stalled ----- 92537 in-flight CPI 1.4048 -- Total Cycles 130017 ---- Thread 28 ---- PC 5: Stalled ----- 89448 in-flight CPI 1.4533 -- Total Cycles 130017 ---- Thread 29 ---- PC 5: Stalled ----- 87385 in-flight CPI 1.4876 -- Total Cycles 130017 ---- Thread 30 ---- PC 5: Stalled ----- 84560 in-flight CPI 1.5374 -- Total Cycles 130017 ---- Thread 31 ---- PC 5: Stalled ----- 92675 in-flight CPI 1.4026 -- Total Cycles 130017 Total CPI 0.0429 , IPC 23.3220 -- Total Cycles 130017 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8062 (4.115134%) FPSUB: 0 (0.000000%) FPMUL: 32233 (16.452879%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 70381 (35.924988%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4068 (2.076453%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73450 (37.491516%) DIV: 7457 (3.806320%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.132713%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3332757 total) ADD%: 7.184 (239432) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.532 (51052) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.566 (18880) FPSUB%: 0.000 (0) FPMUL%: 4.820 (160634) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.152 (171700) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (580) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.072 (35712) FPLE%: 0.455 (15174) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.789 (92947) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.751 (25015) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.656 (521771) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (39010) ORI%: 1.573 (52416) XORI%: 0.000 (0) MULI%: 3.191 (106354) LW%: 1.391 (46342) LWI%: 13.070 (435604) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9507) SWI%: 4.113 (137085) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.394 (46454) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10286) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1953) bned%: 0.000 (0) bneid%: 13.793 (459679) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23879) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4186) DIV%: 0.012 (404) FPUN%: 1.481 (49342) FPRSUB%: 4.251 (141667) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.935 (97814) FPGE%: 1.025 (34168) SYNC%: 0.000 (0) NOP%: 9.015 (300448) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 31 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 163 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 387 LOAD 39808 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1452 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48933 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11479 XORI 0 MULI 9333 LW 0 LWI 142066 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 24 FPUN 0 FPRSUB 55 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.3222 --Total thread-cycles: 4160544 --total thread-cycles issued: 3032309 (72.882515%) --iCache conflicts: 112787 (2.710871%) --thread*cycles of FU dependence: 253879 (6.102063%) --thread*cycles of data dependence: 195911 (4.708783%) --iCache cycles*banks: 4160544 (80.104645% used) Issue breakdown: --thread*cycles of issue worked: 3032309 (72.882515%) --thread*cycles of issue failed: 827787 (19.896124%) --thread*cycles of issue NOP/other: 4605239007302161824 (110688387203072.000000%) Number of thread-cycles not ready: 195911 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3332757 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 5 4: 8 5: 8 6: 8 7: 8 8: 8 9: 8 10: 7 11: 7 12: 7 13: 7 14: 7 15: 9 16: 6 17: 8 18: 8 19: 8 20: 7 21: 8 22: 6 23: 7 24: 7 25: 7 26: 7 27: 7 28: 7 29: 7 30: 5 31: 8 <=== Core 36 ===> ---- Thread 00 ---- PC 5: Stalled ----- 90526 in-flight CPI 1.4045 -- Total Cycles 127154 ---- Thread 01 ---- PC 5: Stalled ----- 94533 in-flight CPI 1.3448 -- Total Cycles 127154 ---- Thread 02 ---- PC 5: Stalled ----- 95417 in-flight CPI 1.3324 -- Total Cycles 127154 ---- Thread 03 ---- PC 5: Stalled ----- 94004 in-flight CPI 1.3524 -- Total Cycles 127154 ---- Thread 04 ---- PC 5: Stalled ----- 97650 in-flight CPI 1.3019 -- Total Cycles 127154 ---- Thread 05 ---- PC 5: Stalled ----- 96641 in-flight CPI 1.3155 -- Total Cycles 127154 ---- Thread 06 ---- PC 5: Stalled ----- 97490 in-flight CPI 1.3040 -- Total Cycles 127154 ---- Thread 07 ---- PC 5: Stalled ----- 96586 in-flight CPI 1.3163 -- Total Cycles 127154 ---- Thread 08 ---- PC 5: Stalled ----- 93895 in-flight CPI 1.3539 -- Total Cycles 127154 ---- Thread 09 ---- PC 5: Stalled ----- 100747 in-flight CPI 1.2619 -- Total Cycles 127154 ---- Thread 10 ---- PC 5: Stalled ----- 95866 in-flight CPI 1.3261 -- Total Cycles 127154 ---- Thread 11 ---- PC 5: Stalled ----- 102488 in-flight CPI 1.2405 -- Total Cycles 127154 ---- Thread 12 ---- PC 5: Stalled ----- 89765 in-flight CPI 1.4163 -- Total Cycles 127154 ---- Thread 13 ---- PC 5: Stalled ----- 98057 in-flight CPI 1.2965 -- Total Cycles 127154 ---- Thread 14 ---- PC 5: Stalled ----- 94771 in-flight CPI 1.3414 -- Total Cycles 127154 ---- Thread 15 ---- PC 5: Stalled ----- 98108 in-flight CPI 1.2958 -- Total Cycles 127154 ---- Thread 16 ---- PC 5: Stalled ----- 98399 in-flight CPI 1.2919 -- Total Cycles 127154 ---- Thread 17 ---- PC 5: Stalled ----- 93081 in-flight CPI 1.3658 -- Total Cycles 127154 ---- Thread 18 ---- PC 5: Stalled ----- 95260 in-flight CPI 1.3345 -- Total Cycles 127154 ---- Thread 19 ---- PC 5: Stalled ----- 95276 in-flight CPI 1.3344 -- Total Cycles 127154 ---- Thread 20 ---- PC 5: Stalled ----- 91399 in-flight CPI 1.3909 -- Total Cycles 127154 ---- Thread 21 ---- PC 5: Stalled ----- 91668 in-flight CPI 1.3869 -- Total Cycles 127154 ---- Thread 22 ---- PC 5: Stalled ----- 96110 in-flight CPI 1.3227 -- Total Cycles 127154 ---- Thread 23 ---- PC 5: Stalled ----- 87758 in-flight CPI 1.4486 -- Total Cycles 127154 ---- Thread 24 ---- PC 5: Stalled ----- 89786 in-flight CPI 1.4160 -- Total Cycles 127154 ---- Thread 25 ---- PC 5: Stalled ----- 91339 in-flight CPI 1.3918 -- Total Cycles 127154 ---- Thread 26 ---- PC 5: Stalled ----- 90316 in-flight CPI 1.4076 -- Total Cycles 127154 ---- Thread 27 ---- PC 5: Stalled ----- 86651 in-flight CPI 1.4672 -- Total Cycles 127154 ---- Thread 28 ---- PC 5: Stalled ----- 89987 in-flight CPI 1.4128 -- Total Cycles 127154 ---- Thread 29 ---- PC 5: Stalled ----- 83230 in-flight CPI 1.5274 -- Total Cycles 127154 ---- Thread 30 ---- PC 5: Stalled ----- 87155 in-flight CPI 1.4587 -- Total Cycles 127154 ---- Thread 31 ---- PC 5: Stalled ----- 84667 in-flight CPI 1.5016 -- Total Cycles 127154 Total CPI 0.0425 , IPC 23.5082 -- Total Cycles 127154 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8285 (4.079593%) FPSUB: 0 (0.000000%) FPMUL: 32263 (15.886530%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76161 (37.502216%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3840 (1.890843%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75053 (36.956631%) DIV: 7233 (3.561581%) FPUN: 0 (0.000000%) FPRSUB: 249 (0.122609%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3285348 total) ADD%: 7.123 (234000) SUB%: 0.000 (0) MUL%: 0.006 (196) BITOR%: 1.521 (49959) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.583 (19169) FPSUB%: 0.000 (0) FPMUL%: 4.870 (160001) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (588) FPMAX%: 0.018 (588) LOAD%: 5.199 (170803) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (228) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (558) FPINV%: 0.000 (0) FPCONV%: 0.019 (620) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.073 (35246) FPLE%: 0.453 (14880) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (588) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.787 (91571) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (24745) CMPU%: 0.000 (0) RSUB%: 0.006 (196) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.655 (514333) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.166 (38317) ORI%: 1.597 (52454) XORI%: 0.000 (0) MULI%: 3.181 (104510) LW%: 1.389 (45634) LWI%: 13.033 (428186) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9387) SWI%: 4.107 (134920) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.392 (45736) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10181) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.068 (2236) bned%: 0.000 (0) bneid%: 13.751 (451773) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23623) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4283) DIV%: 0.012 (392) FPUN%: 1.473 (48388) FPRSUB%: 4.297 (141181) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (68) FPGT%: 2.924 (96072) FPGE%: 1.020 (33508) SYNC%: 0.000 (0) NOP%: 9.014 (296134) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 139 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 374 LOAD 40133 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1633 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 47929 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11818 XORI 0 MULI 8956 LW 0 LWI 139769 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 83 DIV 17 FPUN 0 FPRSUB 50 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5084 --Total thread-cycles: 4068928 --total thread-cycles issued: 2989214 (73.464409%) --iCache conflicts: 110672 (2.719930%) --thread*cycles of FU dependence: 250985 (6.168332%) --thread*cycles of data dependence: 203084 (4.991094%) --iCache cycles*banks: 4068928 (80.743134% used) Issue breakdown: --thread*cycles of issue worked: 2989214 (73.464409%) --thread*cycles of issue failed: 783580 (19.257652%) --thread*cycles of issue NOP/other: 692727398 (17024.814453%) Number of thread-cycles not ready: 203084 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3285348 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 5 1: 7 2: 7 3: 7 4: 7 5: 7 6: 8 7: 6 8: 8 9: 7 10: 8 11: 7 12: 6 13: 8 14: 8 15: 7 16: 9 17: 8 18: 8 19: 7 20: 7 21: 7 22: 8 23: 7 24: 6 25: 8 26: 8 27: 6 28: 6 29: 7 30: 7 31: 6 <=== Core 37 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103458 in-flight CPI 1.2235 -- Total Cycles 126604 ---- Thread 01 ---- PC 5: Stalled ----- 96434 in-flight CPI 1.3126 -- Total Cycles 126604 ---- Thread 02 ---- PC 5: Stalled ----- 99416 in-flight CPI 1.2733 -- Total Cycles 126604 ---- Thread 03 ---- PC 5: Stalled ----- 99029 in-flight CPI 1.2782 -- Total Cycles 126604 ---- Thread 04 ---- PC 5: Stalled ----- 98297 in-flight CPI 1.2877 -- Total Cycles 126604 ---- Thread 05 ---- PC 5: Stalled ----- 99820 in-flight CPI 1.2680 -- Total Cycles 126604 ---- Thread 06 ---- PC 5: Stalled ----- 97319 in-flight CPI 1.3007 -- Total Cycles 126604 ---- Thread 07 ---- PC 5: Stalled ----- 96894 in-flight CPI 1.3064 -- Total Cycles 126604 ---- Thread 08 ---- PC 5: Stalled ----- 99365 in-flight CPI 1.2739 -- Total Cycles 126604 ---- Thread 09 ---- PC 5: Stalled ----- 92315 in-flight CPI 1.3712 -- Total Cycles 126604 ---- Thread 10 ---- PC 5: Stalled ----- 95124 in-flight CPI 1.3307 -- Total Cycles 126604 ---- Thread 11 ---- PC 5: Stalled ----- 91929 in-flight CPI 1.3770 -- Total Cycles 126604 ---- Thread 12 ---- PC 5: Stalled ----- 94314 in-flight CPI 1.3421 -- Total Cycles 126604 ---- Thread 13 ---- PC 5: Stalled ----- 100301 in-flight CPI 1.2620 -- Total Cycles 126604 ---- Thread 14 ---- PC 5: Stalled ----- 98864 in-flight CPI 1.2803 -- Total Cycles 126604 ---- Thread 15 ---- PC 5: Stalled ----- 92674 in-flight CPI 1.3659 -- Total Cycles 126604 ---- Thread 16 ---- PC 5: Stalled ----- 94452 in-flight CPI 1.3402 -- Total Cycles 126604 ---- Thread 17 ---- PC 5: Stalled ----- 99615 in-flight CPI 1.2706 -- Total Cycles 126604 ---- Thread 18 ---- PC 5: Stalled ----- 92289 in-flight CPI 1.3715 -- Total Cycles 126604 ---- Thread 19 ---- PC 5: Stalled ----- 98203 in-flight CPI 1.2890 -- Total Cycles 126604 ---- Thread 20 ---- PC 5: Stalled ----- 94365 in-flight CPI 1.3415 -- Total Cycles 126604 ---- Thread 21 ---- PC 5: Stalled ----- 89371 in-flight CPI 1.4164 -- Total Cycles 126604 ---- Thread 22 ---- PC 5: Stalled ----- 95398 in-flight CPI 1.3268 -- Total Cycles 126604 ---- Thread 23 ---- PC 5: Stalled ----- 93139 in-flight CPI 1.3591 -- Total Cycles 126604 ---- Thread 24 ---- PC 5: Stalled ----- 86104 in-flight CPI 1.4701 -- Total Cycles 126604 ---- Thread 25 ---- PC 5: Stalled ----- 86721 in-flight CPI 1.4596 -- Total Cycles 126604 ---- Thread 26 ---- PC 5: Stalled ----- 89142 in-flight CPI 1.4200 -- Total Cycles 126604 ---- Thread 27 ---- PC 5: Stalled ----- 88383 in-flight CPI 1.4322 -- Total Cycles 126604 ---- Thread 28 ---- PC 5: Stalled ----- 87145 in-flight CPI 1.4525 -- Total Cycles 126604 ---- Thread 29 ---- PC 5: Stalled ----- 89257 in-flight CPI 1.4182 -- Total Cycles 126604 ---- Thread 30 ---- PC 5: Stalled ----- 83936 in-flight CPI 1.5081 -- Total Cycles 126604 ---- Thread 31 ---- PC 5: Stalled ----- 82914 in-flight CPI 1.5267 -- Total Cycles 126604 Total CPI 0.0421 , IPC 23.7476 -- Total Cycles 126604 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7803 (3.961396%) FPSUB: 0 (0.000000%) FPMUL: 31560 (16.022257%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73721 (37.426388%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4156 (2.109902%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72014 (36.559784%) DIV: 7464 (3.789294%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.130980%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3304297 total) ADD%: 7.182 (237321) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.530 (50549) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.558 (18425) FPSUB%: 0.000 (0) FPMUL%: 4.787 (158184) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.157 (170399) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (584) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35150) FPLE%: 0.456 (15083) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.801 (92547) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (24785) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.676 (517970) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (38810) ORI%: 1.572 (51937) XORI%: 0.000 (0) MULI%: 3.196 (105610) LW%: 1.396 (46140) LWI%: 13.079 (432155) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9469) SWI%: 4.136 (136669) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (46256) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10231) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1896) bned%: 0.000 (0) bneid%: 13.787 (455549) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23742) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4102) DIV%: 0.012 (404) FPUN%: 1.479 (48879) FPRSUB%: 4.227 (139676) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.936 (97014) FPGE%: 1.023 (33796) SYNC%: 0.000 (0) NOP%: 9.010 (297704) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 37 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 144 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 392 LOAD 39851 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1468 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 5 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48544 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11153 XORI 0 MULI 9094 LW 0 LWI 140898 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 70 DIV 25 FPUN 0 FPRSUB 48 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7478 --Total thread-cycles: 4051328 --total thread-cycles issued: 3006593 (74.212524%) --iCache conflicts: 112245 (2.770573%) --thread*cycles of FU dependence: 251809 (6.215468%) --thread*cycles of data dependence: 196976 (4.862011%) --iCache cycles*banks: 4051328 (81.561623% used) Issue breakdown: --thread*cycles of issue worked: 3006593 (74.212524%) --thread*cycles of issue failed: 747031 (18.439163%) --thread*cycles of issue NOP/other: 4621706060091394792 (114078794121216.000000%) Number of thread-cycles not ready: 196976 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3304297 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 7 3: 7 4: 8 5: 9 6: 8 7: 7 8: 8 9: 7 10: 8 11: 6 12: 8 13: 8 14: 9 15: 6 16: 7 17: 9 18: 8 19: 8 20: 6 21: 6 22: 8 23: 7 24: 6 25: 8 26: 6 27: 7 28: 8 29: 7 30: 6 31: 5 <=== Core 38 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96236 in-flight CPI 1.3395 -- Total Cycles 128938 ---- Thread 01 ---- PC 5: Stalled ----- 101801 in-flight CPI 1.2663 -- Total Cycles 128938 ---- Thread 02 ---- PC 5: Stalled ----- 104190 in-flight CPI 1.2373 -- Total Cycles 128938 ---- Thread 03 ---- PC 5: Stalled ----- 100467 in-flight CPI 1.2831 -- Total Cycles 128938 ---- Thread 04 ---- PC 5: Stalled ----- 99956 in-flight CPI 1.2897 -- Total Cycles 128938 ---- Thread 05 ---- PC 5: Stalled ----- 101745 in-flight CPI 1.2670 -- Total Cycles 128938 ---- Thread 06 ---- PC 5: Stalled ----- 99846 in-flight CPI 1.2911 -- Total Cycles 128938 ---- Thread 07 ---- PC 5: Stalled ----- 99640 in-flight CPI 1.2938 -- Total Cycles 128938 ---- Thread 08 ---- PC 5: Stalled ----- 101941 in-flight CPI 1.2646 -- Total Cycles 128938 ---- Thread 09 ---- PC 5: Stalled ----- 92933 in-flight CPI 1.3871 -- Total Cycles 128938 ---- Thread 10 ---- PC 5: Stalled ----- 96823 in-flight CPI 1.3314 -- Total Cycles 128938 ---- Thread 11 ---- PC 5: Stalled ----- 98306 in-flight CPI 1.3113 -- Total Cycles 128938 ---- Thread 12 ---- PC 5: Stalled ----- 93730 in-flight CPI 1.3753 -- Total Cycles 128938 ---- Thread 13 ---- PC 5: Stalled ----- 100427 in-flight CPI 1.2836 -- Total Cycles 128938 ---- Thread 14 ---- PC 5: Stalled ----- 93466 in-flight CPI 1.3793 -- Total Cycles 128938 ---- Thread 15 ---- PC 5: Stalled ----- 95104 in-flight CPI 1.3555 -- Total Cycles 128938 ---- Thread 16 ---- PC 5: Stalled ----- 95154 in-flight CPI 1.3548 -- Total Cycles 128938 ---- Thread 17 ---- PC 5: Stalled ----- 93549 in-flight CPI 1.3781 -- Total Cycles 128938 ---- Thread 18 ---- PC 5: Stalled ----- 96849 in-flight CPI 1.3311 -- Total Cycles 128938 ---- Thread 19 ---- PC 5: Stalled ----- 90166 in-flight CPI 1.4297 -- Total Cycles 128938 ---- Thread 20 ---- PC 5: Stalled ----- 92892 in-flight CPI 1.3878 -- Total Cycles 128938 ---- Thread 21 ---- PC 5: Stalled ----- 95360 in-flight CPI 1.3519 -- Total Cycles 128938 ---- Thread 22 ---- PC 5: Stalled ----- 87891 in-flight CPI 1.4668 -- Total Cycles 128938 ---- Thread 23 ---- PC 5: Stalled ----- 91899 in-flight CPI 1.4028 -- Total Cycles 128938 ---- Thread 24 ---- PC 5: Stalled ----- 93970 in-flight CPI 1.3719 -- Total Cycles 128938 ---- Thread 25 ---- PC 5: Stalled ----- 91882 in-flight CPI 1.4030 -- Total Cycles 128938 ---- Thread 26 ---- PC 5: Stalled ----- 96097 in-flight CPI 1.3415 -- Total Cycles 128938 ---- Thread 27 ---- PC 5: Stalled ----- 89310 in-flight CPI 1.4435 -- Total Cycles 128938 ---- Thread 28 ---- PC 5: Stalled ----- 94207 in-flight CPI 1.3684 -- Total Cycles 128938 ---- Thread 29 ---- PC 5: Stalled ----- 93765 in-flight CPI 1.3748 -- Total Cycles 128938 ---- Thread 30 ---- PC 5: Stalled ----- 82578 in-flight CPI 1.5611 -- Total Cycles 128938 ---- Thread 31 ---- PC 5: Stalled ----- 86182 in-flight CPI 1.4958 -- Total Cycles 128938 Total CPI 0.0423 , IPC 23.6466 -- Total Cycles 128938 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7786 (4.086110%) FPSUB: 0 (0.000000%) FPMUL: 31790 (16.683460%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 66456 (34.876251%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4465 (2.343242%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71879 (37.722252%) DIV: 7899 (4.145412%) FPUN: 0 (0.000000%) FPRSUB: 273 (0.143271%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3351263 total) ADD%: 7.204 (241430) SUB%: 0.000 (0) MUL%: 0.006 (214) BITOR%: 1.515 (50779) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.549 (18408) FPSUB%: 0.000 (0) FPMUL%: 4.763 (159615) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (642) FPMAX%: 0.019 (642) LOAD%: 5.135 (172080) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (246) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (619) FPINV%: 0.000 (0) FPCONV%: 0.020 (674) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35638) FPLE%: 0.452 (15135) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (642) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.808 (94089) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.742 (24862) CMPU%: 0.000 (0) RSUB%: 0.006 (214) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.670 (525130) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39303) ORI%: 1.557 (52179) XORI%: 0.000 (0) MULI%: 3.209 (107538) LW%: 1.400 (46906) LWI%: 13.128 (439964) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9633) SWI%: 4.160 (139420) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (47033) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10380) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1921) bned%: 0.000 (0) bneid%: 13.792 (462191) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.712 (23865) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4084) DIV%: 0.013 (428) FPUN%: 1.469 (49240) FPRSUB%: 4.197 (140666) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.952 (98918) FPGE%: 1.018 (34105) SYNC%: 0.000 (0) NOP%: 9.019 (302259) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 163 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 407 LOAD 39817 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1465 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49513 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 11019 XORI 0 MULI 9712 LW 0 LWI 143307 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 22 FPUN 0 FPRSUB 43 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6469 --Total thread-cycles: 4126016 --total thread-cycles issued: 3049004 (73.897049%) --iCache conflicts: 113628 (2.753940%) --thread*cycles of FU dependence: 255640 (6.195807%) --thread*cycles of data dependence: 190548 (4.618208%) --iCache cycles*banks: 4126016 (81.223511% used) Issue breakdown: --thread*cycles of issue worked: 3049004 (73.897049%) --thread*cycles of issue failed: 774753 (18.777266%) --thread*cycles of issue NOP/other: 4602400586896546995 (111545879101440.000000%) Number of thread-cycles not ready: 190548 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3351263 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 9 3: 8 4: 9 5: 8 6: 8 7: 8 8: 9 9: 9 10: 8 11: 8 12: 8 13: 9 14: 7 15: 8 16: 8 17: 7 18: 7 19: 7 20: 8 21: 7 22: 6 23: 6 24: 6 25: 8 26: 8 27: 6 28: 7 29: 8 30: 7 31: 7 <=== Core 39 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97831 in-flight CPI 1.4295 -- Total Cycles 139868 ---- Thread 01 ---- PC 5: Stalled ----- 102200 in-flight CPI 1.3683 -- Total Cycles 139868 ---- Thread 02 ---- PC 5: Stalled ----- 88994 in-flight CPI 1.5715 -- Total Cycles 139868 ---- Thread 03 ---- PC 5: Stalled ----- 96426 in-flight CPI 1.4502 -- Total Cycles 139868 ---- Thread 04 ---- PC 5: Stalled ----- 98424 in-flight CPI 1.4208 -- Total Cycles 139868 ---- Thread 05 ---- PC 5: Stalled ----- 98388 in-flight CPI 1.4213 -- Total Cycles 139868 ---- Thread 06 ---- PC 5: Stalled ----- 87209 in-flight CPI 1.6036 -- Total Cycles 139868 ---- Thread 07 ---- PC 5: Stalled ----- 93741 in-flight CPI 1.4918 -- Total Cycles 139868 ---- Thread 08 ---- PC 5: Stalled ----- 103226 in-flight CPI 1.3547 -- Total Cycles 139868 ---- Thread 09 ---- PC 5: Stalled ----- 96728 in-flight CPI 1.4457 -- Total Cycles 139868 ---- Thread 10 ---- PC 5: Stalled ----- 96252 in-flight CPI 1.4529 -- Total Cycles 139868 ---- Thread 11 ---- PC 5: Stalled ----- 98798 in-flight CPI 1.4155 -- Total Cycles 139868 ---- Thread 12 ---- PC 5: Stalled ----- 101154 in-flight CPI 1.3824 -- Total Cycles 139868 ---- Thread 13 ---- PC 5: Stalled ----- 96857 in-flight CPI 1.4438 -- Total Cycles 139868 ---- Thread 14 ---- PC 5: Stalled ----- 94985 in-flight CPI 1.4722 -- Total Cycles 139868 ---- Thread 15 ---- PC 5: Stalled ----- 95078 in-flight CPI 1.4708 -- Total Cycles 139868 ---- Thread 16 ---- PC 5: Stalled ----- 91723 in-flight CPI 1.5246 -- Total Cycles 139868 ---- Thread 17 ---- PC 5: Stalled ----- 101030 in-flight CPI 1.3842 -- Total Cycles 139868 ---- Thread 18 ---- PC 5: Stalled ----- 93553 in-flight CPI 1.4949 -- Total Cycles 139868 ---- Thread 19 ---- PC 5: Stalled ----- 86750 in-flight CPI 1.6121 -- Total Cycles 139868 ---- Thread 20 ---- PC 5: Stalled ----- 87155 in-flight CPI 1.6046 -- Total Cycles 139868 ---- Thread 21 ---- PC 5: Stalled ----- 97910 in-flight CPI 1.4282 -- Total Cycles 139868 ---- Thread 22 ---- PC 5: Stalled ----- 90993 in-flight CPI 1.5369 -- Total Cycles 139868 ---- Thread 23 ---- PC 5: Stalled ----- 93732 in-flight CPI 1.4920 -- Total Cycles 139868 ---- Thread 24 ---- PC 5: Stalled ----- 87972 in-flight CPI 1.5896 -- Total Cycles 139868 ---- Thread 25 ---- PC 5: Stalled ----- 87783 in-flight CPI 1.5930 -- Total Cycles 139868 ---- Thread 26 ---- PC 5: Stalled ----- 92507 in-flight CPI 1.5117 -- Total Cycles 139868 ---- Thread 27 ---- PC 5: Stalled ----- 89908 in-flight CPI 1.5554 -- Total Cycles 139868 ---- Thread 28 ---- PC 5: Stalled ----- 86650 in-flight CPI 1.6139 -- Total Cycles 139868 ---- Thread 29 ---- PC 5: Stalled ----- 83383 in-flight CPI 1.6771 -- Total Cycles 139868 ---- Thread 30 ---- PC 5: Stalled ----- 94013 in-flight CPI 1.4874 -- Total Cycles 139868 ---- Thread 31 ---- PC 5: Stalled ----- 89607 in-flight CPI 1.5606 -- Total Cycles 139868 Total CPI 0.0466 , IPC 21.4595 -- Total Cycles 139868 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7628 (3.534722%) FPSUB: 0 (0.000000%) FPMUL: 31143 (14.431284%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 95634 (44.315620%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3919 (1.816017%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69947 (32.412582%) DIV: 7279 (3.372999%) FPUN: 0 (0.000000%) FPRSUB: 252 (0.116774%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3298652 total) ADD%: 7.227 (238378) SUB%: 0.000 (0) MUL%: 0.006 (197) BITOR%: 1.529 (50437) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (17926) FPSUB%: 0.000 (0) FPMUL%: 4.754 (156805) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (591) FPMAX%: 0.018 (591) LOAD%: 5.145 (169713) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (229) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (563) FPINV%: 0.000 (0) FPCONV%: 0.019 (623) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.058 (34896) FPLE%: 0.456 (15042) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (591) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.810 (92677) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24622) CMPU%: 0.000 (0) RSUB%: 0.006 (197) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.688 (517490) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (38822) ORI%: 1.562 (51539) XORI%: 0.000 (0) MULI%: 3.201 (105588) LW%: 1.400 (46181) LWI%: 13.094 (431939) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9504) SWI%: 4.143 (136657) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46286) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10246) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1816) bned%: 0.000 (0) bneid%: 13.792 (454935) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (23763) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (3981) DIV%: 0.012 (394) FPUN%: 1.480 (48804) FPRSUB%: 4.202 (138625) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.940 (96978) FPGE%: 1.024 (33762) SYNC%: 0.000 (0) NOP%: 9.007 (297101) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 19 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 148 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 381 LOAD 39190 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1332 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48647 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 10877 XORI 0 MULI 9021 LW 0 LWI 140760 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 66 DIV 22 FPUN 0 FPRSUB 42 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.4597 --Total thread-cycles: 4475776 --total thread-cycles issued: 3001551 (67.062134%) --iCache conflicts: 109288 (2.441767%) --thread*cycles of FU dependence: 250564 (5.598225%) --thread*cycles of data dependence: 215802 (4.821555%) --iCache cycles*banks: 4475776 (73.700829% used) Issue breakdown: --thread*cycles of issue worked: 3001551 (67.062134%) --thread*cycles of issue failed: 1177124 (26.299885%) --thread*cycles of issue NOP/other: 4697565 (104.955315%) Number of thread-cycles not ready: 215802 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3298652 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 5 3: 8 4: 8 5: 9 6: 5 7: 7 8: 8 9: 8 10: 7 11: 7 12: 9 13: 7 14: 8 15: 7 16: 8 17: 6 18: 6 19: 5 20: 6 21: 9 22: 7 23: 7 24: 7 25: 8 26: 7 27: 7 28: 7 29: 6 30: 8 31: 7 <=== Core 40 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96258 in-flight CPI 1.3399 -- Total Cycles 128995 ---- Thread 01 ---- PC 5: Stalled ----- 98350 in-flight CPI 1.3113 -- Total Cycles 128995 ---- Thread 02 ---- PC 5: Stalled ----- 101957 in-flight CPI 1.2650 -- Total Cycles 128995 ---- Thread 03 ---- PC 5: Stalled ----- 100277 in-flight CPI 1.2861 -- Total Cycles 128995 ---- Thread 04 ---- PC 5: Stalled ----- 95760 in-flight CPI 1.3468 -- Total Cycles 128995 ---- Thread 05 ---- PC 5: Stalled ----- 101428 in-flight CPI 1.2715 -- Total Cycles 128995 ---- Thread 06 ---- PC 5: Stalled ----- 96512 in-flight CPI 1.3363 -- Total Cycles 128995 ---- Thread 07 ---- PC 5: Stalled ----- 96974 in-flight CPI 1.3300 -- Total Cycles 128995 ---- Thread 08 ---- PC 5: Stalled ----- 100339 in-flight CPI 1.2853 -- Total Cycles 128995 ---- Thread 09 ---- PC 5: Stalled ----- 101230 in-flight CPI 1.2740 -- Total Cycles 128995 ---- Thread 10 ---- PC 5: Stalled ----- 97908 in-flight CPI 1.3173 -- Total Cycles 128995 ---- Thread 11 ---- PC 5: Stalled ----- 95120 in-flight CPI 1.3559 -- Total Cycles 128995 ---- Thread 12 ---- PC 5: Stalled ----- 95905 in-flight CPI 1.3448 -- Total Cycles 128995 ---- Thread 13 ---- PC 5: Stalled ----- 93227 in-flight CPI 1.3834 -- Total Cycles 128995 ---- Thread 14 ---- PC 5: Stalled ----- 95297 in-flight CPI 1.3534 -- Total Cycles 128995 ---- Thread 15 ---- PC 5: Stalled ----- 98913 in-flight CPI 1.3038 -- Total Cycles 128995 ---- Thread 16 ---- PC 5: Stalled ----- 98257 in-flight CPI 1.3126 -- Total Cycles 128995 ---- Thread 17 ---- PC 5: Stalled ----- 95485 in-flight CPI 1.3507 -- Total Cycles 128995 ---- Thread 18 ---- PC 5: Stalled ----- 100255 in-flight CPI 1.2864 -- Total Cycles 128995 ---- Thread 19 ---- PC 5: Stalled ----- 93870 in-flight CPI 1.3740 -- Total Cycles 128995 ---- Thread 20 ---- PC 5: Stalled ----- 89252 in-flight CPI 1.4451 -- Total Cycles 128995 ---- Thread 21 ---- PC 5: Stalled ----- 89478 in-flight CPI 1.4413 -- Total Cycles 128995 ---- Thread 22 ---- PC 5: Stalled ----- 92168 in-flight CPI 1.3993 -- Total Cycles 128995 ---- Thread 23 ---- PC 5: Stalled ----- 91662 in-flight CPI 1.4070 -- Total Cycles 128995 ---- Thread 24 ---- PC 5: Stalled ----- 92078 in-flight CPI 1.4007 -- Total Cycles 128995 ---- Thread 25 ---- PC 5: Stalled ----- 92474 in-flight CPI 1.3946 -- Total Cycles 128995 ---- Thread 26 ---- PC 5: Stalled ----- 87213 in-flight CPI 1.4789 -- Total Cycles 128995 ---- Thread 27 ---- PC 5: Stalled ----- 83872 in-flight CPI 1.5378 -- Total Cycles 128995 ---- Thread 28 ---- PC 5: Stalled ----- 93119 in-flight CPI 1.3850 -- Total Cycles 128995 ---- Thread 29 ---- PC 5: Stalled ----- 92950 in-flight CPI 1.3875 -- Total Cycles 128995 ---- Thread 30 ---- PC 5: Stalled ----- 85459 in-flight CPI 1.5093 -- Total Cycles 128995 ---- Thread 31 ---- PC 5: Stalled ----- 84236 in-flight CPI 1.5311 -- Total Cycles 128995 Total CPI 0.0426 , IPC 23.4724 -- Total Cycles 128995 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8071 (3.928832%) FPSUB: 0 (0.000000%) FPMUL: 32133 (15.641825%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 80021 (38.952930%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4166 (2.027941%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73433 (35.745995%) DIV: 7350 (3.577861%) FPUN: 0 (0.000000%) FPRSUB: 256 (0.124617%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3327798 total) ADD%: 7.204 (239738) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.525 (50762) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.566 (18843) FPSUB%: 0.000 (0) FPMUL%: 4.822 (160473) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.158 (171657) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (579) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.070 (35596) FPLE%: 0.453 (15059) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.790 (92844) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.754 (25107) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.647 (520707) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (38994) ORI%: 1.574 (52389) XORI%: 0.000 (0) MULI%: 3.188 (106092) LW%: 1.391 (46306) LWI%: 13.076 (435150) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9480) SWI%: 4.116 (136975) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.395 (46423) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10287) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (1981) bned%: 0.000 (0) bneid%: 13.777 (458462) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23901) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4187) DIV%: 0.012 (398) FPUN%: 1.473 (49025) FPRSUB%: 4.256 (141632) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (77) FPGT%: 2.934 (97645) FPGE%: 1.021 (33966) SYNC%: 0.000 (0) NOP%: 9.013 (299918) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 153 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 385 LOAD 40490 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 19 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1214 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48939 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 11528 XORI 0 MULI 9059 LW 0 LWI 141899 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 100 DIV 22 FPUN 0 FPRSUB 62 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4727 --Total thread-cycles: 4127840 --total thread-cycles issued: 3027880 (73.352646%) --iCache conflicts: 112753 (2.731525%) --thread*cycles of FU dependence: 253939 (6.151861%) --thread*cycles of data dependence: 205430 (4.976695%) --iCache cycles*banks: 4127840 (80.619164% used) Issue breakdown: --thread*cycles of issue worked: 3027880 (73.352646%) --thread*cycles of issue failed: 800042 (19.381615%) --thread*cycles of issue NOP/other: 4620716698268636046 (111940311449600.000000%) Number of thread-cycles not ready: 205430 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3327798 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 8 3: 8 4: 7 5: 8 6: 7 7: 7 8: 8 9: 8 10: 8 11: 7 12: 7 13: 7 14: 6 15: 9 16: 8 17: 7 18: 8 19: 5 20: 6 21: 8 22: 8 23: 7 24: 7 25: 8 26: 6 27: 5 28: 8 29: 9 30: 5 31: 6 <=== Core 41 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99745 in-flight CPI 1.2612 -- Total Cycles 125824 ---- Thread 01 ---- PC 5: Stalled ----- 94182 in-flight CPI 1.3357 -- Total Cycles 125824 ---- Thread 02 ---- PC 5: Stalled ----- 96544 in-flight CPI 1.3031 -- Total Cycles 125824 ---- Thread 03 ---- PC 5: Stalled ----- 102905 in-flight CPI 1.2225 -- Total Cycles 125824 ---- Thread 04 ---- PC 5: Stalled ----- 100681 in-flight CPI 1.2495 -- Total Cycles 125824 ---- Thread 05 ---- PC 5: Stalled ----- 95572 in-flight CPI 1.3163 -- Total Cycles 125824 ---- Thread 06 ---- PC 5: Stalled ----- 93490 in-flight CPI 1.3456 -- Total Cycles 125824 ---- Thread 07 ---- PC 5: Stalled ----- 95144 in-flight CPI 1.3222 -- Total Cycles 125824 ---- Thread 08 ---- PC 5: Stalled ----- 98801 in-flight CPI 1.2733 -- Total Cycles 125824 ---- Thread 09 ---- PC 5: Stalled ----- 101672 in-flight CPI 1.2374 -- Total Cycles 125824 ---- Thread 10 ---- PC 5: Stalled ----- 97888 in-flight CPI 1.2852 -- Total Cycles 125824 ---- Thread 11 ---- PC 5: Stalled ----- 98367 in-flight CPI 1.2788 -- Total Cycles 125824 ---- Thread 12 ---- PC 5: Stalled ----- 100554 in-flight CPI 1.2510 -- Total Cycles 125824 ---- Thread 13 ---- PC 5: Stalled ----- 98462 in-flight CPI 1.2777 -- Total Cycles 125824 ---- Thread 14 ---- PC 5: Stalled ----- 95581 in-flight CPI 1.3162 -- Total Cycles 125824 ---- Thread 15 ---- PC 5: Stalled ----- 96438 in-flight CPI 1.3045 -- Total Cycles 125824 ---- Thread 16 ---- PC 5: Stalled ----- 98854 in-flight CPI 1.2725 -- Total Cycles 125824 ---- Thread 17 ---- PC 5: Stalled ----- 91519 in-flight CPI 1.3746 -- Total Cycles 125824 ---- Thread 18 ---- PC 5: Stalled ----- 98498 in-flight CPI 1.2772 -- Total Cycles 125824 ---- Thread 19 ---- PC 5: Stalled ----- 95789 in-flight CPI 1.3133 -- Total Cycles 125824 ---- Thread 20 ---- PC 5: Stalled ----- 91306 in-flight CPI 1.3778 -- Total Cycles 125824 ---- Thread 21 ---- PC 5: Stalled ----- 96694 in-flight CPI 1.3010 -- Total Cycles 125824 ---- Thread 22 ---- PC 5: Stalled ----- 90068 in-flight CPI 1.3967 -- Total Cycles 125824 ---- Thread 23 ---- PC 5: Stalled ----- 90032 in-flight CPI 1.3973 -- Total Cycles 125824 ---- Thread 24 ---- PC 5: Stalled ----- 91501 in-flight CPI 1.3749 -- Total Cycles 125824 ---- Thread 25 ---- PC 5: Stalled ----- 94105 in-flight CPI 1.3368 -- Total Cycles 125824 ---- Thread 26 ---- PC 5: Stalled ----- 92324 in-flight CPI 1.3626 -- Total Cycles 125824 ---- Thread 27 ---- PC 5: Stalled ----- 88849 in-flight CPI 1.4158 -- Total Cycles 125824 ---- Thread 28 ---- PC 5: Stalled ----- 87058 in-flight CPI 1.4451 -- Total Cycles 125824 ---- Thread 29 ---- PC 5: Stalled ----- 90120 in-flight CPI 1.3959 -- Total Cycles 125824 ---- Thread 30 ---- PC 5: Stalled ----- 90371 in-flight CPI 1.3921 -- Total Cycles 125824 ---- Thread 31 ---- PC 5: Stalled ----- 88208 in-flight CPI 1.4262 -- Total Cycles 125824 Total CPI 0.0414 , IPC 24.1757 -- Total Cycles 125824 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7650 (4.173304%) FPSUB: 0 (0.000000%) FPMUL: 31418 (17.139458%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 61853 (33.742664%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4240 (2.313047%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70313 (38.357845%) DIV: 7571 (4.130207%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.143474%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3343037 total) ADD%: 7.167 (239596) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.536 (51358) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (18009) FPSUB%: 0.000 (0) FPMUL%: 4.735 (158278) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.133 (171585) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (593) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.057 (35333) FPLE%: 0.458 (15299) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.814 (94080) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (24983) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.699 (524826) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.180 (39450) ORI%: 1.557 (52050) XORI%: 0.000 (0) MULI%: 3.213 (107406) LW%: 1.403 (46914) LWI%: 13.124 (438748) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9616) SWI%: 4.149 (138703) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.407 (47033) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10369) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.052 (1741) bned%: 0.000 (0) bneid%: 13.813 (461768) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (24162) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (3999) DIV%: 0.012 (410) FPUN%: 1.486 (49663) FPRSUB%: 4.185 (139912) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.943 (98383) FPGE%: 1.028 (34364) SYNC%: 0.000 (0) NOP%: 9.007 (301100) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 13 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 151 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 39260 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1276 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49368 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 10845 XORI 0 MULI 9594 LW 0 LWI 142833 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 78 DIV 29 FPUN 0 FPRSUB 64 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.1759 --Total thread-cycles: 4026368 --total thread-cycles issued: 3041937 (75.550392%) --iCache conflicts: 114938 (2.854632%) --thread*cycles of FU dependence: 253970 (6.307670%) --thread*cycles of data dependence: 183308 (4.552689%) --iCache cycles*banks: 4026368 (83.029396% used) Issue breakdown: --thread*cycles of issue worked: 3041937 (75.550392%) --thread*cycles of issue failed: 683331 (16.971399%) --thread*cycles of issue NOP/other: 4601473331911497772 (114283476156416.000000%) Number of thread-cycles not ready: 183308 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3343037 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 7 3: 9 4: 8 5: 7 6: 7 7: 7 8: 6 9: 7 10: 7 11: 9 12: 9 13: 7 14: 7 15: 7 16: 10 17: 7 18: 7 19: 8 20: 6 21: 9 22: 7 23: 7 24: 6 25: 8 26: 7 27: 8 28: 5 29: 8 30: 7 31: 7 <=== Core 42 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97382 in-flight CPI 1.3558 -- Total Cycles 132057 ---- Thread 01 ---- PC 5: Stalled ----- 94589 in-flight CPI 1.3959 -- Total Cycles 132057 ---- Thread 02 ---- PC 5: Stalled ----- 96200 in-flight CPI 1.3725 -- Total Cycles 132057 ---- Thread 03 ---- PC 5: Stalled ----- 95235 in-flight CPI 1.3864 -- Total Cycles 132057 ---- Thread 04 ---- PC 5: Stalled ----- 97201 in-flight CPI 1.3584 -- Total Cycles 132057 ---- Thread 05 ---- PC 5: Stalled ----- 94709 in-flight CPI 1.3941 -- Total Cycles 132057 ---- Thread 06 ---- PC 5: Stalled ----- 98955 in-flight CPI 1.3343 -- Total Cycles 132057 ---- Thread 07 ---- PC 5: Stalled ----- 97302 in-flight CPI 1.3569 -- Total Cycles 132057 ---- Thread 08 ---- PC 5: Stalled ----- 99622 in-flight CPI 1.3254 -- Total Cycles 132057 ---- Thread 09 ---- PC 5: Stalled ----- 101817 in-flight CPI 1.2968 -- Total Cycles 132057 ---- Thread 10 ---- PC 5: Stalled ----- 95302 in-flight CPI 1.3855 -- Total Cycles 132057 ---- Thread 11 ---- PC 5: Stalled ----- 98791 in-flight CPI 1.3365 -- Total Cycles 132057 ---- Thread 12 ---- PC 5: Stalled ----- 93615 in-flight CPI 1.4103 -- Total Cycles 132057 ---- Thread 13 ---- PC 5: Stalled ----- 93954 in-flight CPI 1.4053 -- Total Cycles 132057 ---- Thread 14 ---- PC 5: Stalled ----- 93492 in-flight CPI 1.4123 -- Total Cycles 132057 ---- Thread 15 ---- PC 5: Stalled ----- 92082 in-flight CPI 1.4338 -- Total Cycles 132057 ---- Thread 16 ---- PC 5: Stalled ----- 94987 in-flight CPI 1.3900 -- Total Cycles 132057 ---- Thread 17 ---- PC 5: Stalled ----- 100453 in-flight CPI 1.3144 -- Total Cycles 132057 ---- Thread 18 ---- PC 5: Stalled ----- 87579 in-flight CPI 1.5076 -- Total Cycles 132057 ---- Thread 19 ---- PC 5: Stalled ----- 90472 in-flight CPI 1.4593 -- Total Cycles 132057 ---- Thread 20 ---- PC 5: Stalled ----- 88524 in-flight CPI 1.4915 -- Total Cycles 132057 ---- Thread 21 ---- PC 5: Stalled ----- 92698 in-flight CPI 1.4243 -- Total Cycles 132057 ---- Thread 22 ---- PC 5: Stalled ----- 96363 in-flight CPI 1.3701 -- Total Cycles 132057 ---- Thread 23 ---- PC 5: Stalled ----- 81188 in-flight CPI 1.6264 -- Total Cycles 132057 ---- Thread 24 ---- PC 5: Stalled ----- 90110 in-flight CPI 1.4652 -- Total Cycles 132057 ---- Thread 25 ---- PC 5: Stalled ----- 92448 in-flight CPI 1.4282 -- Total Cycles 132057 ---- Thread 26 ---- PC 5: Stalled ----- 91940 in-flight CPI 1.4361 -- Total Cycles 132057 ---- Thread 27 ---- PC 5: Stalled ----- 87902 in-flight CPI 1.5020 -- Total Cycles 132057 ---- Thread 28 ---- PC 5: Stalled ----- 86280 in-flight CPI 1.5304 -- Total Cycles 132057 ---- Thread 29 ---- PC 5: Stalled ----- 89968 in-flight CPI 1.4676 -- Total Cycles 132057 ---- Thread 30 ---- PC 5: Stalled ----- 91141 in-flight CPI 1.4486 -- Total Cycles 132057 ---- Thread 31 ---- PC 5: Stalled ----- 91321 in-flight CPI 1.4458 -- Total Cycles 132057 Total CPI 0.0441 , IPC 22.6732 -- Total Cycles 132057 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8710 (4.177578%) FPSUB: 0 (0.000000%) FPMUL: 33032 (15.843142%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76654 (36.765568%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3893 (1.867200%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 78802 (37.795811%) DIV: 7162 (3.435111%) FPUN: 0 (0.000000%) FPRSUB: 241 (0.115591%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3289827 total) ADD%: 7.120 (234227) SUB%: 0.000 (0) MUL%: 0.006 (194) BITOR%: 1.527 (50233) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.612 (20120) FPSUB%: 0.000 (0) FPMUL%: 4.947 (162753) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (582) FPMAX%: 0.018 (582) LOAD%: 5.241 (172420) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (226) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (558) FPINV%: 0.000 (0) FPCONV%: 0.019 (614) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.082 (35587) FPLE%: 0.456 (14996) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (582) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.767 (91034) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.772 (25395) CMPU%: 0.000 (0) RSUB%: 0.006 (194) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.652 (514927) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (38585) ORI%: 1.600 (52630) XORI%: 0.000 (0) MULI%: 3.156 (103822) LW%: 1.379 (45353) LWI%: 12.961 (426387) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9345) SWI%: 4.093 (134663) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.382 (45459) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10154) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2008) bned%: 0.000 (0) bneid%: 13.730 (451700) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.710 (23356) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.137 (4506) DIV%: 0.012 (388) FPUN%: 1.464 (48178) FPRSUB%: 4.360 (143424) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.908 (95677) FPGE%: 1.009 (33182) SYNC%: 0.000 (0) NOP%: 8.986 (295623) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 24 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 146 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 373 LOAD 40950 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1330 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 47824 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 12398 XORI 0 MULI 8593 LW 0 LWI 139425 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 96 DIV 40 FPUN 0 FPRSUB 53 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.6734 --Total thread-cycles: 4225824 --total thread-cycles issued: 2994204 (70.854912%) --iCache conflicts: 111004 (2.626801%) --thread*cycles of FU dependence: 251336 (5.947621%) --thread*cycles of data dependence: 208494 (4.933807%) --iCache cycles*banks: 4225824 (77.851303% used) Issue breakdown: --thread*cycles of issue worked: 2994204 (70.854912%) --thread*cycles of issue failed: 935997 (22.149456%) --thread*cycles of issue NOP/other: 4696087 (111.128319%) Number of thread-cycles not ready: 208494 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3289827 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 8 4: 7 5: 6 6: 6 7: 8 8: 7 9: 8 10: 6 11: 8 12: 8 13: 7 14: 6 15: 8 16: 7 17: 7 18: 7 19: 8 20: 6 21: 7 22: 9 23: 4 24: 7 25: 7 26: 6 27: 7 28: 5 29: 7 30: 9 31: 8 <=== Core 43 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99852 in-flight CPI 1.4427 -- Total Cycles 144089 ---- Thread 01 ---- PC 5: Stalled ----- 101621 in-flight CPI 1.4176 -- Total Cycles 144089 ---- Thread 02 ---- PC 5: Stalled ----- 96047 in-flight CPI 1.4999 -- Total Cycles 144089 ---- Thread 03 ---- PC 5: Stalled ----- 100378 in-flight CPI 1.4352 -- Total Cycles 144089 ---- Thread 04 ---- PC 5: Stalled ----- 100082 in-flight CPI 1.4394 -- Total Cycles 144089 ---- Thread 05 ---- PC 5: Stalled ----- 100292 in-flight CPI 1.4365 -- Total Cycles 144089 ---- Thread 06 ---- PC 5: Stalled ----- 96862 in-flight CPI 1.4873 -- Total Cycles 144089 ---- Thread 07 ---- PC 5: Stalled ----- 102027 in-flight CPI 1.4120 -- Total Cycles 144089 ---- Thread 08 ---- PC 5: Stalled ----- 102126 in-flight CPI 1.4106 -- Total Cycles 144089 ---- Thread 09 ---- PC 5: Stalled ----- 103222 in-flight CPI 1.3958 -- Total Cycles 144089 ---- Thread 10 ---- PC 5: Stalled ----- 97953 in-flight CPI 1.4707 -- Total Cycles 144089 ---- Thread 11 ---- PC 5: Stalled ----- 99419 in-flight CPI 1.4490 -- Total Cycles 144089 ---- Thread 12 ---- PC 5: Stalled ----- 94443 in-flight CPI 1.5254 -- Total Cycles 144089 ---- Thread 13 ---- PC 5: Stalled ----- 97501 in-flight CPI 1.4775 -- Total Cycles 144089 ---- Thread 14 ---- PC 5: Stalled ----- 90876 in-flight CPI 1.5853 -- Total Cycles 144089 ---- Thread 15 ---- PC 5: Stalled ----- 98829 in-flight CPI 1.4576 -- Total Cycles 144089 ---- Thread 16 ---- PC 5: Stalled ----- 95272 in-flight CPI 1.5121 -- Total Cycles 144089 ---- Thread 17 ---- PC 5: Stalled ----- 101544 in-flight CPI 1.4187 -- Total Cycles 144089 ---- Thread 18 ---- PC 5: Stalled ----- 100669 in-flight CPI 1.4310 -- Total Cycles 144089 ---- Thread 19 ---- PC 5: Stalled ----- 97118 in-flight CPI 1.4834 -- Total Cycles 144089 ---- Thread 20 ---- PC 5: Stalled ----- 99285 in-flight CPI 1.4510 -- Total Cycles 144089 ---- Thread 21 ---- PC 5: Stalled ----- 91729 in-flight CPI 1.5705 -- Total Cycles 144089 ---- Thread 22 ---- PC 5: Stalled ----- 94728 in-flight CPI 1.5208 -- Total Cycles 144089 ---- Thread 23 ---- PC 5: Stalled ----- 90213 in-flight CPI 1.5969 -- Total Cycles 144089 ---- Thread 24 ---- PC 5: Stalled ----- 88086 in-flight CPI 1.6355 -- Total Cycles 144089 ---- Thread 25 ---- PC 5: Stalled ----- 92659 in-flight CPI 1.5548 -- Total Cycles 144089 ---- Thread 26 ---- PC 5: Stalled ----- 98126 in-flight CPI 1.4681 -- Total Cycles 144089 ---- Thread 27 ---- PC 5: Stalled ----- 86081 in-flight CPI 1.6736 -- Total Cycles 144089 ---- Thread 28 ---- PC 5: Stalled ----- 90993 in-flight CPI 1.5833 -- Total Cycles 144089 ---- Thread 29 ---- PC 5: Stalled ----- 88770 in-flight CPI 1.6229 -- Total Cycles 144089 ---- Thread 30 ---- PC 5: Stalled ----- 87790 in-flight CPI 1.6410 -- Total Cycles 144089 ---- Thread 31 ---- PC 5: Stalled ----- 94845 in-flight CPI 1.5189 -- Total Cycles 144089 Total CPI 0.0468 , IPC 21.3758 -- Total Cycles 144089 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7743 (3.946403%) FPSUB: 0 (0.000000%) FPMUL: 31947 (16.282541%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73004 (37.208210%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4227 (2.154390%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71225 (36.301502%) DIV: 7788 (3.969338%) FPUN: 0 (0.000000%) FPRSUB: 270 (0.137612%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3385751 total) ADD%: 7.244 (245250) SUB%: 0.000 (0) MUL%: 0.006 (211) BITOR%: 1.519 (51427) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.542 (18345) FPSUB%: 0.000 (0) FPMUL%: 4.744 (160616) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (633) FPMAX%: 0.019 (633) LOAD%: 5.121 (173374) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (243) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (601) FPINV%: 0.000 (0) FPCONV%: 0.020 (665) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.061 (35935) FPLE%: 0.453 (15327) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (633) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.808 (95081) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.738 (24994) CMPU%: 0.000 (0) RSUB%: 0.006 (211) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.673 (530653) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (39632) ORI%: 1.556 (52693) XORI%: 0.000 (0) MULI%: 3.209 (108640) LW%: 1.400 (47388) LWI%: 13.123 (444300) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9745) SWI%: 4.146 (140379) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (47503) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10512) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1981) bned%: 0.000 (0) bneid%: 13.803 (467335) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (24234) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (4049) DIV%: 0.012 (422) FPUN%: 1.475 (49954) FPRSUB%: 4.184 (141647) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.954 (100031) FPGE%: 1.023 (34627) SYNC%: 0.000 (0) NOP%: 9.028 (305680) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 182 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 415 LOAD 39180 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1306 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 50032 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 18 ORI 11054 XORI 0 MULI 9967 LW 0 LWI 144572 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 23 FPUN 0 FPRSUB 40 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.3760 --Total thread-cycles: 4610848 --total thread-cycles issued: 3080071 (66.800537%) --iCache conflicts: 114652 (2.486571%) --thread*cycles of FU dependence: 256936 (5.572424%) --thread*cycles of data dependence: 196204 (4.255270%) --iCache cycles*banks: 4610848 (73.430809% used) Issue breakdown: --thread*cycles of issue worked: 3080071 (66.800537%) --thread*cycles of issue failed: 1225097 (26.569885%) --thread*cycles of issue NOP/other: 4619342666826820112 (100184230985728.000000%) Number of thread-cycles not ready: 196204 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3385751 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 10 2: 7 3: 7 4: 9 5: 7 6: 8 7: 8 8: 8 9: 5 10: 8 11: 9 12: 7 13: 8 14: 7 15: 9 16: 8 17: 8 18: 8 19: 8 20: 8 21: 8 22: 7 23: 7 24: 6 25: 7 26: 9 27: 7 28: 6 29: 6 30: 7 31: 8 <=== Core 44 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98317 in-flight CPI 1.2944 -- Total Cycles 127281 ---- Thread 01 ---- PC 5: Stalled ----- 97218 in-flight CPI 1.3090 -- Total Cycles 127281 ---- Thread 02 ---- PC 5: Stalled ----- 94231 in-flight CPI 1.3505 -- Total Cycles 127281 ---- Thread 03 ---- PC 5: Stalled ----- 99637 in-flight CPI 1.2772 -- Total Cycles 127281 ---- Thread 04 ---- PC 5: Stalled ----- 96272 in-flight CPI 1.3218 -- Total Cycles 127281 ---- Thread 05 ---- PC 5: Stalled ----- 94766 in-flight CPI 1.3429 -- Total Cycles 127281 ---- Thread 06 ---- PC 5: Stalled ----- 100106 in-flight CPI 1.2713 -- Total Cycles 127281 ---- Thread 07 ---- PC 5: Stalled ----- 101281 in-flight CPI 1.2565 -- Total Cycles 127281 ---- Thread 08 ---- PC 5: Stalled ----- 97959 in-flight CPI 1.2991 -- Total Cycles 127281 ---- Thread 09 ---- PC 5: Stalled ----- 101116 in-flight CPI 1.2585 -- Total Cycles 127281 ---- Thread 10 ---- PC 5: Stalled ----- 97829 in-flight CPI 1.3008 -- Total Cycles 127281 ---- Thread 11 ---- PC 5: Stalled ----- 101992 in-flight CPI 1.2477 -- Total Cycles 127281 ---- Thread 12 ---- PC 5: Stalled ----- 99418 in-flight CPI 1.2800 -- Total Cycles 127281 ---- Thread 13 ---- PC 5: Stalled ----- 95619 in-flight CPI 1.3309 -- Total Cycles 127281 ---- Thread 14 ---- PC 5: Stalled ----- 94065 in-flight CPI 1.3529 -- Total Cycles 127281 ---- Thread 15 ---- PC 5: Stalled ----- 95971 in-flight CPI 1.3260 -- Total Cycles 127281 ---- Thread 16 ---- PC 5: Stalled ----- 91888 in-flight CPI 1.3849 -- Total Cycles 127281 ---- Thread 17 ---- PC 5: Stalled ----- 96580 in-flight CPI 1.3177 -- Total Cycles 127281 ---- Thread 18 ---- PC 5: Stalled ----- 92168 in-flight CPI 1.3807 -- Total Cycles 127281 ---- Thread 19 ---- PC 5: Stalled ----- 91816 in-flight CPI 1.3860 -- Total Cycles 127281 ---- Thread 20 ---- PC 5: Stalled ----- 93532 in-flight CPI 1.3606 -- Total Cycles 127281 ---- Thread 21 ---- PC 5: Stalled ----- 89794 in-flight CPI 1.4172 -- Total Cycles 127281 ---- Thread 22 ---- PC 5: Stalled ----- 94732 in-flight CPI 1.3434 -- Total Cycles 127281 ---- Thread 23 ---- PC 5: Stalled ----- 93742 in-flight CPI 1.3575 -- Total Cycles 127281 ---- Thread 24 ---- PC 5: Stalled ----- 94523 in-flight CPI 1.3463 -- Total Cycles 127281 ---- Thread 25 ---- PC 5: Stalled ----- 92230 in-flight CPI 1.3798 -- Total Cycles 127281 ---- Thread 26 ---- PC 5: Stalled ----- 83432 in-flight CPI 1.5254 -- Total Cycles 127281 ---- Thread 27 ---- PC 5: Stalled ----- 89934 in-flight CPI 1.4150 -- Total Cycles 127281 ---- Thread 28 ---- PC 5: Stalled ----- 83826 in-flight CPI 1.5181 -- Total Cycles 127281 ---- Thread 29 ---- PC 5: Stalled ----- 89219 in-flight CPI 1.4264 -- Total Cycles 127281 ---- Thread 30 ---- PC 5: Stalled ----- 91022 in-flight CPI 1.3981 -- Total Cycles 127281 ---- Thread 31 ---- PC 5: Stalled ----- 87101 in-flight CPI 1.4611 -- Total Cycles 127281 Total CPI 0.0421 , IPC 23.7418 -- Total Cycles 127281 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7725 (4.049952%) FPSUB: 0 (0.000000%) FPMUL: 31455 (16.490776%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 69297 (36.330036%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4015 (2.104927%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70646 (37.037270%) DIV: 7349 (3.852828%) FPUN: 0 (0.000000%) FPRSUB: 256 (0.134212%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3321682 total) ADD%: 7.144 (237286) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.536 (51034) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.548 (18190) FPSUB%: 0.000 (0) FPMUL%: 4.765 (158279) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.146 (170919) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (572) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35293) FPLE%: 0.459 (15235) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.808 (93269) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (24855) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.697 (521403) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39076) ORI%: 1.563 (51921) XORI%: 0.000 (0) MULI%: 3.201 (106312) LW%: 1.400 (46506) LWI%: 13.086 (434688) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9535) SWI%: 4.132 (137237) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46616) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10307) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1862) bned%: 0.000 (0) bneid%: 13.818 (458989) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (23905) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4022) DIV%: 0.012 (398) FPUN%: 1.487 (49385) FPRSUB%: 4.207 (139746) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.942 (97727) FPGE%: 1.028 (34150) SYNC%: 0.000 (0) NOP%: 9.024 (299749) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 144 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 387 LOAD 39742 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1304 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48994 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 11038 XORI 0 MULI 9583 LW 0 LWI 141707 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 68 DIV 27 FPUN 0 FPRSUB 53 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7420 --Total thread-cycles: 4072992 --total thread-cycles issued: 3021933 (74.194427%) --iCache conflicts: 113182 (2.778842%) --thread*cycles of FU dependence: 253153 (6.215406%) --thread*cycles of data dependence: 190743 (4.683117%) --iCache cycles*banks: 4072992 (81.554642% used) Issue breakdown: --thread*cycles of issue worked: 3021933 (74.194427%) --thread*cycles of issue failed: 751310 (18.446144%) --thread*cycles of issue NOP/other: 4604186417118352101 (113041878286336.000000%) Number of thread-cycles not ready: 190743 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3321682 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 8 4: 8 5: 7 6: 7 7: 8 8: 7 9: 8 10: 8 11: 8 12: 8 13: 7 14: 6 15: 7 16: 7 17: 7 18: 8 19: 7 20: 6 21: 7 22: 7 23: 8 24: 8 25: 7 26: 5 27: 8 28: 7 29: 7 30: 7 31: 6 <=== Core 45 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98537 in-flight CPI 1.4283 -- Total Cycles 140771 ---- Thread 01 ---- PC 5: Stalled ----- 97950 in-flight CPI 1.4369 -- Total Cycles 140771 ---- Thread 02 ---- PC 5: Stalled ----- 99219 in-flight CPI 1.4185 -- Total Cycles 140771 ---- Thread 03 ---- PC 5: Stalled ----- 95086 in-flight CPI 1.4803 -- Total Cycles 140771 ---- Thread 04 ---- PC 5: Stalled ----- 96026 in-flight CPI 1.4657 -- Total Cycles 140771 ---- Thread 05 ---- PC 5: Stalled ----- 100722 in-flight CPI 1.3974 -- Total Cycles 140771 ---- Thread 06 ---- PC 5: Stalled ----- 98994 in-flight CPI 1.4218 -- Total Cycles 140771 ---- Thread 07 ---- PC 5: Stalled ----- 98676 in-flight CPI 1.4263 -- Total Cycles 140771 ---- Thread 08 ---- PC 5: Stalled ----- 99736 in-flight CPI 1.4112 -- Total Cycles 140771 ---- Thread 09 ---- PC 5: Stalled ----- 95725 in-flight CPI 1.4703 -- Total Cycles 140771 ---- Thread 10 ---- PC 5: Stalled ----- 93098 in-flight CPI 1.5118 -- Total Cycles 140771 ---- Thread 11 ---- PC 5: Stalled ----- 102281 in-flight CPI 1.3761 -- Total Cycles 140771 ---- Thread 12 ---- PC 5: Stalled ----- 100053 in-flight CPI 1.4067 -- Total Cycles 140771 ---- Thread 13 ---- PC 5: Stalled ----- 100021 in-flight CPI 1.4071 -- Total Cycles 140771 ---- Thread 14 ---- PC 5: Stalled ----- 92227 in-flight CPI 1.5261 -- Total Cycles 140771 ---- Thread 15 ---- PC 5: Stalled ----- 94555 in-flight CPI 1.4885 -- Total Cycles 140771 ---- Thread 16 ---- PC 5: Stalled ----- 91472 in-flight CPI 1.5387 -- Total Cycles 140771 ---- Thread 17 ---- PC 5: Stalled ----- 100108 in-flight CPI 1.4059 -- Total Cycles 140771 ---- Thread 18 ---- PC 5: Stalled ----- 104082 in-flight CPI 1.3523 -- Total Cycles 140771 ---- Thread 19 ---- PC 5: Stalled ----- 98269 in-flight CPI 1.4322 -- Total Cycles 140771 ---- Thread 20 ---- PC 5: Stalled ----- 93798 in-flight CPI 1.5005 -- Total Cycles 140771 ---- Thread 21 ---- PC 5: Stalled ----- 95560 in-flight CPI 1.4729 -- Total Cycles 140771 ---- Thread 22 ---- PC 5: Stalled ----- 90216 in-flight CPI 1.5601 -- Total Cycles 140771 ---- Thread 23 ---- PC 5: Stalled ----- 86317 in-flight CPI 1.6307 -- Total Cycles 140771 ---- Thread 24 ---- PC 5: Stalled ----- 96598 in-flight CPI 1.4570 -- Total Cycles 140771 ---- Thread 25 ---- PC 5: Stalled ----- 91047 in-flight CPI 1.5458 -- Total Cycles 140771 ---- Thread 26 ---- PC 5: Stalled ----- 93521 in-flight CPI 1.5050 -- Total Cycles 140771 ---- Thread 27 ---- PC 5: Stalled ----- 91678 in-flight CPI 1.5352 -- Total Cycles 140771 ---- Thread 28 ---- PC 5: Stalled ----- 85582 in-flight CPI 1.6446 -- Total Cycles 140771 ---- Thread 29 ---- PC 5: Stalled ----- 92556 in-flight CPI 1.5206 -- Total Cycles 140771 ---- Thread 30 ---- PC 5: Stalled ----- 85214 in-flight CPI 1.6516 -- Total Cycles 140771 ---- Thread 31 ---- PC 5: Stalled ----- 89035 in-flight CPI 1.5808 -- Total Cycles 140771 Total CPI 0.0462 , IPC 21.6559 -- Total Cycles 140771 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7920 (3.955530%) FPSUB: 0 (0.000000%) FPMUL: 31904 (15.933995%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 75547 (37.730865%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4216 (2.105621%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72809 (36.363407%) DIV: 7568 (3.779729%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.130852%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3350491 total) ADD%: 7.173 (240327) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.525 (51104) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.552 (18497) FPSUB%: 0.000 (0) FPMUL%: 4.779 (160114) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.160 (172873) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (591) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35610) FPLE%: 0.456 (15263) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.806 (94000) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.747 (25022) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.676 (525236) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39331) ORI%: 1.560 (52260) XORI%: 0.000 (0) MULI%: 3.202 (107274) LW%: 1.399 (46866) LWI%: 13.104 (439051) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9616) SWI%: 4.138 (138635) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.402 (46983) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10369) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1972) bned%: 0.000 (0) bneid%: 13.785 (461850) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (24050) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4145) DIV%: 0.012 (410) FPUN%: 1.477 (49496) FPRSUB%: 4.228 (141649) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.938 (98441) FPGE%: 1.022 (34233) SYNC%: 0.000 (0) NOP%: 9.011 (301917) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 163 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 40144 INTCONV 0 ATOMIC_INC 20 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1252 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49308 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 11251 XORI 0 MULI 9207 LW 0 LWI 143269 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 90 DIV 24 FPUN 0 FPRSUB 50 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.6561 --Total thread-cycles: 4504672 --total thread-cycles issued: 3048574 (67.675827%) --iCache conflicts: 113081 (2.510305%) --thread*cycles of FU dependence: 255247 (5.666273%) --thread*cycles of data dependence: 200226 (4.444852%) --iCache cycles*banks: 4504672 (74.378845% used) Issue breakdown: --thread*cycles of issue worked: 3048574 (67.675827%) --thread*cycles of issue failed: 1154181 (25.621864%) --thread*cycles of issue NOP/other: 698068285 (15496.539062%) Number of thread-cycles not ready: 200226 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3350491 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 6 4: 7 5: 8 6: 7 7: 8 8: 8 9: 7 10: 7 11: 8 12: 8 13: 8 14: 7 15: 8 16: 7 17: 8 18: 6 19: 8 20: 7 21: 7 22: 7 23: 5 24: 8 25: 8 26: 7 27: 7 28: 7 29: 8 30: 8 31: 7 <=== Core 46 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99226 in-flight CPI 1.2864 -- Total Cycles 127665 ---- Thread 01 ---- PC 5: Stalled ----- 92330 in-flight CPI 1.3825 -- Total Cycles 127665 ---- Thread 02 ---- PC 5: Stalled ----- 96343 in-flight CPI 1.3248 -- Total Cycles 127665 ---- Thread 03 ---- PC 5: Stalled ----- 96619 in-flight CPI 1.3211 -- Total Cycles 127665 ---- Thread 04 ---- PC 5: Stalled ----- 95732 in-flight CPI 1.3333 -- Total Cycles 127665 ---- Thread 05 ---- PC 5: Stalled ----- 99786 in-flight CPI 1.2791 -- Total Cycles 127665 ---- Thread 06 ---- PC 5: Stalled ----- 99801 in-flight CPI 1.2789 -- Total Cycles 127665 ---- Thread 07 ---- PC 5: Stalled ----- 102686 in-flight CPI 1.2430 -- Total Cycles 127665 ---- Thread 08 ---- PC 5: Stalled ----- 89985 in-flight CPI 1.4185 -- Total Cycles 127665 ---- Thread 09 ---- PC 5: Stalled ----- 100884 in-flight CPI 1.2652 -- Total Cycles 127665 ---- Thread 10 ---- PC 5: Stalled ----- 101021 in-flight CPI 1.2635 -- Total Cycles 127665 ---- Thread 11 ---- PC 5: Stalled ----- 98896 in-flight CPI 1.2907 -- Total Cycles 127665 ---- Thread 12 ---- PC 5: Stalled ----- 95195 in-flight CPI 1.3409 -- Total Cycles 127665 ---- Thread 13 ---- PC 5: Stalled ----- 92919 in-flight CPI 1.3737 -- Total Cycles 127665 ---- Thread 14 ---- PC 5: Stalled ----- 98553 in-flight CPI 1.2952 -- Total Cycles 127665 ---- Thread 15 ---- PC 5: Stalled ----- 93220 in-flight CPI 1.3692 -- Total Cycles 127665 ---- Thread 16 ---- PC 5: Stalled ----- 93739 in-flight CPI 1.3617 -- Total Cycles 127665 ---- Thread 17 ---- PC 5: Stalled ----- 91889 in-flight CPI 1.3891 -- Total Cycles 127665 ---- Thread 18 ---- PC 5: Stalled ----- 89255 in-flight CPI 1.4301 -- Total Cycles 127665 ---- Thread 19 ---- PC 5: Stalled ----- 98384 in-flight CPI 1.2974 -- Total Cycles 127665 ---- Thread 20 ---- PC 5: Stalled ----- 92297 in-flight CPI 1.3829 -- Total Cycles 127665 ---- Thread 21 ---- PC 5: Stalled ----- 98119 in-flight CPI 1.3009 -- Total Cycles 127665 ---- Thread 22 ---- PC 5: Stalled ----- 92970 in-flight CPI 1.3729 -- Total Cycles 127665 ---- Thread 23 ---- PC 5: Stalled ----- 94421 in-flight CPI 1.3518 -- Total Cycles 127665 ---- Thread 24 ---- PC 5: Stalled ----- 93612 in-flight CPI 1.3635 -- Total Cycles 127665 ---- Thread 25 ---- PC 5: Stalled ----- 86351 in-flight CPI 1.4782 -- Total Cycles 127665 ---- Thread 26 ---- PC 5: Stalled ----- 89195 in-flight CPI 1.4310 -- Total Cycles 127665 ---- Thread 27 ---- PC 5: Stalled ----- 87111 in-flight CPI 1.4653 -- Total Cycles 127665 ---- Thread 28 ---- PC 5: Stalled ----- 92670 in-flight CPI 1.3773 -- Total Cycles 127665 ---- Thread 29 ---- PC 5: Stalled ----- 90311 in-flight CPI 1.4134 -- Total Cycles 127665 ---- Thread 30 ---- PC 5: Stalled ----- 87929 in-flight CPI 1.4516 -- Total Cycles 127665 ---- Thread 31 ---- PC 5: Stalled ----- 86135 in-flight CPI 1.4819 -- Total Cycles 127665 Total CPI 0.0423 , IPC 23.6411 -- Total Cycles 127665 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7855 (3.978343%) FPSUB: 0 (0.000000%) FPMUL: 31736 (16.073418%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73751 (37.352867%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4245 (2.149977%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72073 (36.503010%) DIV: 7530 (3.813740%) FPUN: 0 (0.000000%) FPRSUB: 254 (0.128644%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3317716 total) ADD%: 7.189 (238512) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.512 (50172) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.556 (18458) FPSUB%: 0.000 (0) FPMUL%: 4.788 (158859) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.153 (170957) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (590) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.066 (35383) FPLE%: 0.452 (14981) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.803 (92999) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.741 (24594) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.665 (519715) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.168 (38754) ORI%: 1.564 (51884) XORI%: 0.000 (0) MULI%: 3.202 (106222) LW%: 1.398 (46367) LWI%: 13.108 (434887) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9514) SWI%: 4.139 (137307) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.401 (46485) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10258) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.063 (2105) bned%: 0.000 (0) bneid%: 13.780 (457187) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23685) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4103) DIV%: 0.012 (408) FPUN%: 1.469 (48740) FPRSUB%: 4.228 (140259) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.947 (97763) FPGE%: 1.018 (33759) SYNC%: 0.000 (0) NOP%: 9.028 (299520) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 11 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 155 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 390 LOAD 39379 INTCONV 0 ATOMIC_INC 12 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 21 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1671 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 14 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48854 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 16 ORI 11234 XORI 0 MULI 9093 LW 0 LWI 141730 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 63 DIV 24 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6413 --Total thread-cycles: 4085280 --total thread-cycles issued: 3018196 (73.879784%) --iCache conflicts: 112959 (2.765025%) --thread*cycles of FU dependence: 252728 (6.186308%) --thread*cycles of data dependence: 197444 (4.833059%) --iCache cycles*banks: 4085280 (81.212250% used) Issue breakdown: --thread*cycles of issue worked: 3018196 (73.879784%) --thread*cycles of issue failed: 767564 (18.788528%) --thread*cycles of issue NOP/other: 4618331090896327168 (113048094244864.000000%) Number of thread-cycles not ready: 197444 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3317716 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 5 2: 8 3: 8 4: 8 5: 8 6: 9 7: 8 8: 6 9: 8 10: 9 11: 7 12: 7 13: 6 14: 7 15: 8 16: 7 17: 7 18: 6 19: 7 20: 8 21: 8 22: 8 23: 8 24: 7 25: 6 26: 8 27: 7 28: 8 29: 7 30: 7 31: 7 <=== Core 47 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103307 in-flight CPI 1.2390 -- Total Cycles 128016 ---- Thread 01 ---- PC 5: Stalled ----- 100964 in-flight CPI 1.2677 -- Total Cycles 128016 ---- Thread 02 ---- PC 5: Stalled ----- 100164 in-flight CPI 1.2778 -- Total Cycles 128016 ---- Thread 03 ---- PC 5: Stalled ----- 96975 in-flight CPI 1.3199 -- Total Cycles 128016 ---- Thread 04 ---- PC 5: Stalled ----- 94304 in-flight CPI 1.3572 -- Total Cycles 128016 ---- Thread 05 ---- PC 5: Stalled ----- 101673 in-flight CPI 1.2589 -- Total Cycles 128016 ---- Thread 06 ---- PC 5: Stalled ----- 101616 in-flight CPI 1.2596 -- Total Cycles 128016 ---- Thread 07 ---- PC 5: Stalled ----- 93293 in-flight CPI 1.3720 -- Total Cycles 128016 ---- Thread 08 ---- PC 5: Stalled ----- 94580 in-flight CPI 1.3533 -- Total Cycles 128016 ---- Thread 09 ---- PC 5: Stalled ----- 99732 in-flight CPI 1.2834 -- Total Cycles 128016 ---- Thread 10 ---- PC 5: Stalled ----- 97253 in-flight CPI 1.3161 -- Total Cycles 128016 ---- Thread 11 ---- PC 5: Stalled ----- 100179 in-flight CPI 1.2777 -- Total Cycles 128016 ---- Thread 12 ---- PC 5: Stalled ----- 95606 in-flight CPI 1.3388 -- Total Cycles 128016 ---- Thread 13 ---- PC 5: Stalled ----- 101791 in-flight CPI 1.2574 -- Total Cycles 128016 ---- Thread 14 ---- PC 5: Stalled ----- 94549 in-flight CPI 1.3537 -- Total Cycles 128016 ---- Thread 15 ---- PC 5: Stalled ----- 99851 in-flight CPI 1.2819 -- Total Cycles 128016 ---- Thread 16 ---- PC 5: Stalled ----- 94971 in-flight CPI 1.3477 -- Total Cycles 128016 ---- Thread 17 ---- PC 5: Stalled ----- 97272 in-flight CPI 1.3158 -- Total Cycles 128016 ---- Thread 18 ---- PC 5: Stalled ----- 91702 in-flight CPI 1.3958 -- Total Cycles 128016 ---- Thread 19 ---- PC 5: Stalled ----- 94854 in-flight CPI 1.3494 -- Total Cycles 128016 ---- Thread 20 ---- PC 5: Stalled ----- 92882 in-flight CPI 1.3781 -- Total Cycles 128016 ---- Thread 21 ---- PC 5: Stalled ----- 89837 in-flight CPI 1.4248 -- Total Cycles 128016 ---- Thread 22 ---- PC 5: Stalled ----- 96804 in-flight CPI 1.3222 -- Total Cycles 128016 ---- Thread 23 ---- PC 5: Stalled ----- 91622 in-flight CPI 1.3970 -- Total Cycles 128016 ---- Thread 24 ---- PC 5: Stalled ----- 93192 in-flight CPI 1.3734 -- Total Cycles 128016 ---- Thread 25 ---- PC 5: Stalled ----- 93166 in-flight CPI 1.3738 -- Total Cycles 128016 ---- Thread 26 ---- PC 5: Stalled ----- 87209 in-flight CPI 1.4676 -- Total Cycles 128016 ---- Thread 27 ---- PC 5: Stalled ----- 87342 in-flight CPI 1.4654 -- Total Cycles 128016 ---- Thread 28 ---- PC 5: Stalled ----- 89377 in-flight CPI 1.4321 -- Total Cycles 128016 ---- Thread 29 ---- PC 5: Stalled ----- 86934 in-flight CPI 1.4723 -- Total Cycles 128016 ---- Thread 30 ---- PC 5: Stalled ----- 83575 in-flight CPI 1.5315 -- Total Cycles 128016 ---- Thread 31 ---- PC 5: Stalled ----- 89003 in-flight CPI 1.4381 -- Total Cycles 128016 Total CPI 0.0422 , IPC 23.7167 -- Total Cycles 128016 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7408 (3.959084%) FPSUB: 0 (0.000000%) FPMUL: 30752 (16.434900%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 69856 (37.333389%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3997 (2.136131%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67605 (36.130379%) DIV: 7242 (3.870368%) FPUN: 0 (0.000000%) FPRSUB: 254 (0.135746%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3336468 total) ADD%: 7.220 (240878) SUB%: 0.000 (0) MUL%: 0.006 (196) BITOR%: 1.532 (51128) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.521 (17383) FPSUB%: 0.000 (0) FPMUL%: 4.689 (156442) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (588) FPMAX%: 0.018 (588) LOAD%: 5.132 (171223) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (228) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (566) FPINV%: 0.000 (0) FPCONV%: 0.019 (620) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.047 (34919) FPLE%: 0.458 (15280) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (588) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.834 (94541) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.743 (24780) CMPU%: 0.000 (0) RSUB%: 0.006 (196) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.726 (524701) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.184 (39494) ORI%: 1.544 (51499) XORI%: 0.000 (0) MULI%: 3.222 (107506) LW%: 1.413 (47135) LWI%: 13.131 (438100) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9668) SWI%: 4.156 (138677) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.416 (47245) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10399) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1754) bned%: 0.000 (0) bneid%: 13.807 (460667) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (24133) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.115 (3845) DIV%: 0.012 (392) FPUN%: 1.486 (49568) FPRSUB%: 4.154 (138593) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.943 (98192) FPGE%: 1.028 (34288) SYNC%: 0.000 (0) NOP%: 9.001 (300301) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 151 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 383 LOAD 39267 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1922 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49334 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10479 XORI 0 MULI 9864 LW 0 LWI 142572 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 78 DIV 20 FPUN 0 FPRSUB 38 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7169 --Total thread-cycles: 4096512 --total thread-cycles issued: 3036167 (74.115906%) --iCache conflicts: 113711 (2.775800%) --thread*cycles of FU dependence: 254220 (6.205768%) --thread*cycles of data dependence: 187114 (4.567642%) --iCache cycles*banks: 4096512 (81.447342% used) Issue breakdown: --thread*cycles of issue worked: 3036167 (74.115906%) --thread*cycles of issue failed: 760044 (18.553444%) --thread*cycles of issue NOP/other: 4598611744452285709 (112256754909184.000000%) Number of thread-cycles not ready: 187114 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3336468 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 7 4: 7 5: 8 6: 7 7: 7 8: 7 9: 8 10: 7 11: 7 12: 7 13: 8 14: 8 15: 7 16: 7 17: 7 18: 6 19: 7 20: 6 21: 6 22: 7 23: 7 24: 9 25: 7 26: 7 27: 7 28: 7 29: 6 30: 6 31: 7 <=== Core 48 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99445 in-flight CPI 1.3054 -- Total Cycles 129841 ---- Thread 01 ---- PC 5: Stalled ----- 102343 in-flight CPI 1.2684 -- Total Cycles 129841 ---- Thread 02 ---- PC 5: Stalled ----- 95672 in-flight CPI 1.3570 -- Total Cycles 129841 ---- Thread 03 ---- PC 5: Stalled ----- 105681 in-flight CPI 1.2284 -- Total Cycles 129841 ---- Thread 04 ---- PC 5: Stalled ----- 92996 in-flight CPI 1.3960 -- Total Cycles 129841 ---- Thread 05 ---- PC 5: Stalled ----- 96743 in-flight CPI 1.3419 -- Total Cycles 129841 ---- Thread 06 ---- PC 5: Stalled ----- 102645 in-flight CPI 1.2647 -- Total Cycles 129841 ---- Thread 07 ---- PC 5: Stalled ----- 99932 in-flight CPI 1.2990 -- Total Cycles 129841 ---- Thread 08 ---- PC 5: Stalled ----- 101914 in-flight CPI 1.2738 -- Total Cycles 129841 ---- Thread 09 ---- PC 5: Stalled ----- 99189 in-flight CPI 1.3088 -- Total Cycles 129841 ---- Thread 10 ---- PC 5: Stalled ----- 95446 in-flight CPI 1.3602 -- Total Cycles 129841 ---- Thread 11 ---- PC 5: Stalled ----- 94648 in-flight CPI 1.3717 -- Total Cycles 129841 ---- Thread 12 ---- PC 5: Stalled ----- 90537 in-flight CPI 1.4339 -- Total Cycles 129841 ---- Thread 13 ---- PC 5: Stalled ----- 96710 in-flight CPI 1.3423 -- Total Cycles 129841 ---- Thread 14 ---- PC 5: Stalled ----- 98730 in-flight CPI 1.3149 -- Total Cycles 129841 ---- Thread 15 ---- PC 5: Stalled ----- 92771 in-flight CPI 1.3993 -- Total Cycles 129841 ---- Thread 16 ---- PC 5: Stalled ----- 100808 in-flight CPI 1.2878 -- Total Cycles 129841 ---- Thread 17 ---- PC 5: Stalled ----- 96814 in-flight CPI 1.3409 -- Total Cycles 129841 ---- Thread 18 ---- PC 5: Stalled ----- 99903 in-flight CPI 1.2994 -- Total Cycles 129841 ---- Thread 19 ---- PC 5: Stalled ----- 91124 in-flight CPI 1.4246 -- Total Cycles 129841 ---- Thread 20 ---- PC 5: Stalled ----- 91511 in-flight CPI 1.4186 -- Total Cycles 129841 ---- Thread 21 ---- PC 5: Stalled ----- 92913 in-flight CPI 1.3972 -- Total Cycles 129841 ---- Thread 22 ---- PC 5: Stalled ----- 89294 in-flight CPI 1.4538 -- Total Cycles 129841 ---- Thread 23 ---- PC 5: Stalled ----- 91025 in-flight CPI 1.4262 -- Total Cycles 129841 ---- Thread 24 ---- PC 5: Stalled ----- 87344 in-flight CPI 1.4863 -- Total Cycles 129841 ---- Thread 25 ---- PC 5: Stalled ----- 86827 in-flight CPI 1.4951 -- Total Cycles 129841 ---- Thread 26 ---- PC 5: Stalled ----- 94074 in-flight CPI 1.3799 -- Total Cycles 129841 ---- Thread 27 ---- PC 5: Stalled ----- 90981 in-flight CPI 1.4268 -- Total Cycles 129841 ---- Thread 28 ---- PC 5: Stalled ----- 91879 in-flight CPI 1.4129 -- Total Cycles 129841 ---- Thread 29 ---- PC 5: Stalled ----- 88146 in-flight CPI 1.4728 -- Total Cycles 129841 ---- Thread 30 ---- PC 5: Stalled ----- 85292 in-flight CPI 1.5220 -- Total Cycles 129841 ---- Thread 31 ---- PC 5: Stalled ----- 86608 in-flight CPI 1.4989 -- Total Cycles 129841 Total CPI 0.0428 , IPC 23.3402 -- Total Cycles 129841 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7824 (3.876376%) FPSUB: 0 (0.000000%) FPMUL: 31719 (15.715079%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 78393 (38.839565%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4182 (2.071959%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71809 (35.577541%) DIV: 7653 (3.791655%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.127825%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3331045 total) ADD%: 7.217 (240416) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.508 (50230) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.552 (18387) FPSUB%: 0.000 (0) FPMUL%: 4.778 (159152) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.154 (171672) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (592) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35407) FPLE%: 0.451 (15037) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.808 (93547) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.743 (24756) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.675 (522149) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39043) ORI%: 1.554 (51769) XORI%: 0.000 (0) MULI%: 3.205 (106746) LW%: 1.400 (46619) LWI%: 13.110 (436712) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9592) SWI%: 4.147 (138152) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (46733) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10351) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (2008) bned%: 0.000 (0) bneid%: 13.777 (458907) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.709 (23629) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4084) DIV%: 0.012 (414) FPUN%: 1.463 (48730) FPRSUB%: 4.215 (140413) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (76) FPGT%: 2.951 (98285) FPGE%: 1.011 (33693) SYNC%: 0.000 (0) NOP%: 9.021 (300479) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 156 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 405 LOAD 40016 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1065 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 15 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49136 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 11150 XORI 0 MULI 9532 LW 0 LWI 142257 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 72 DIV 23 FPUN 0 FPRSUB 41 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.3404 --Total thread-cycles: 4154912 --total thread-cycles issued: 3030566 (72.939354%) --iCache conflicts: 111596 (2.685881%) --thread*cycles of FU dependence: 253959 (6.112259%) --thread*cycles of data dependence: 201838 (4.857817%) --iCache cycles*banks: 4154912 (80.172020% used) Issue breakdown: --thread*cycles of issue worked: 3030566 (72.939354%) --thread*cycles of issue failed: 823867 (19.828747%) --thread*cycles of issue NOP/other: 4700943 (113.141823%) Number of thread-cycles not ready: 201838 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3331045 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 6 3: 8 4: 7 5: 8 6: 9 7: 8 8: 8 9: 8 10: 6 11: 5 12: 6 13: 9 14: 8 15: 7 16: 8 17: 8 18: 9 19: 8 20: 7 21: 8 22: 7 23: 6 24: 7 25: 7 26: 8 27: 8 28: 7 29: 6 30: 8 31: 7 <=== Core 49 ===> ---- Thread 00 ---- PC 5: Stalled ----- 93889 in-flight CPI 1.3750 -- Total Cycles 129122 ---- Thread 01 ---- PC 5: Stalled ----- 102113 in-flight CPI 1.2643 -- Total Cycles 129122 ---- Thread 02 ---- PC 5: Stalled ----- 102383 in-flight CPI 1.2609 -- Total Cycles 129122 ---- Thread 03 ---- PC 5: Stalled ----- 97618 in-flight CPI 1.3225 -- Total Cycles 129122 ---- Thread 04 ---- PC 5: Stalled ----- 101427 in-flight CPI 1.2728 -- Total Cycles 129122 ---- Thread 05 ---- PC 5: Stalled ----- 96438 in-flight CPI 1.3386 -- Total Cycles 129122 ---- Thread 06 ---- PC 5: Stalled ----- 98459 in-flight CPI 1.3111 -- Total Cycles 129122 ---- Thread 07 ---- PC 5: Stalled ----- 104006 in-flight CPI 1.2413 -- Total Cycles 129122 ---- Thread 08 ---- PC 5: Stalled ----- 91704 in-flight CPI 1.4078 -- Total Cycles 129122 ---- Thread 09 ---- PC 5: Stalled ----- 93036 in-flight CPI 1.3876 -- Total Cycles 129122 ---- Thread 10 ---- PC 5: Stalled ----- 95548 in-flight CPI 1.3512 -- Total Cycles 129122 ---- Thread 11 ---- PC 5: Stalled ----- 93932 in-flight CPI 1.3744 -- Total Cycles 129122 ---- Thread 12 ---- PC 5: Stalled ----- 100518 in-flight CPI 1.2843 -- Total Cycles 129122 ---- Thread 13 ---- PC 5: Stalled ----- 88828 in-flight CPI 1.4534 -- Total Cycles 129122 ---- Thread 14 ---- PC 5: Stalled ----- 102055 in-flight CPI 1.2649 -- Total Cycles 129122 ---- Thread 15 ---- PC 5: Stalled ----- 94388 in-flight CPI 1.3677 -- Total Cycles 129122 ---- Thread 16 ---- PC 5: Stalled ----- 93152 in-flight CPI 1.3859 -- Total Cycles 129122 ---- Thread 17 ---- PC 5: Stalled ----- 95264 in-flight CPI 1.3551 -- Total Cycles 129122 ---- Thread 18 ---- PC 5: Stalled ----- 98261 in-flight CPI 1.3138 -- Total Cycles 129122 ---- Thread 19 ---- PC 5: Stalled ----- 93578 in-flight CPI 1.3796 -- Total Cycles 129122 ---- Thread 20 ---- PC 5: Stalled ----- 99480 in-flight CPI 1.2977 -- Total Cycles 129122 ---- Thread 21 ---- PC 5: Stalled ----- 90304 in-flight CPI 1.4296 -- Total Cycles 129122 ---- Thread 22 ---- PC 5: Stalled ----- 95304 in-flight CPI 1.3546 -- Total Cycles 129122 ---- Thread 23 ---- PC 5: Stalled ----- 91671 in-flight CPI 1.4083 -- Total Cycles 129122 ---- Thread 24 ---- PC 5: Stalled ----- 97523 in-flight CPI 1.3238 -- Total Cycles 129122 ---- Thread 25 ---- PC 5: Stalled ----- 96230 in-flight CPI 1.3415 -- Total Cycles 129122 ---- Thread 26 ---- PC 5: Stalled ----- 88136 in-flight CPI 1.4647 -- Total Cycles 129122 ---- Thread 27 ---- PC 5: Stalled ----- 94905 in-flight CPI 1.3603 -- Total Cycles 129122 ---- Thread 28 ---- PC 5: Stalled ----- 94411 in-flight CPI 1.3674 -- Total Cycles 129122 ---- Thread 29 ---- PC 5: Stalled ----- 91315 in-flight CPI 1.4137 -- Total Cycles 129122 ---- Thread 30 ---- PC 5: Stalled ----- 84900 in-flight CPI 1.5206 -- Total Cycles 129122 ---- Thread 31 ---- PC 5: Stalled ----- 83316 in-flight CPI 1.5495 -- Total Cycles 129122 Total CPI 0.0424 , IPC 23.5800 -- Total Cycles 129122 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7318 (4.078562%) FPSUB: 0 (0.000000%) FPMUL: 30862 (17.200405%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 60382 (33.652870%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4524 (2.521374%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67976 (37.885258%) DIV: 8088 (4.507708%) FPUN: 0 (0.000000%) FPRSUB: 276 (0.153824%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3346422 total) ADD%: 7.236 (242150) SUB%: 0.000 (0) MUL%: 0.007 (219) BITOR%: 1.529 (51154) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.521 (17447) FPSUB%: 0.000 (0) FPMUL%: 4.679 (156564) FPCMPLT%: 0.000 (0) FPMIN%: 0.020 (657) FPMAX%: 0.020 (657) LOAD%: 5.090 (170333) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.008 (251) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.019 (629) FPINV%: 0.000 (0) FPCONV%: 0.021 (689) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.054 (35255) FPLE%: 0.455 (15228) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.020 (657) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.824 (94504) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.734 (24568) CMPU%: 0.000 (0) RSUB%: 0.007 (219) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.691 (525092) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.178 (39416) ORI%: 1.539 (51488) XORI%: 0.000 (0) MULI%: 3.227 (107986) LW%: 1.408 (47124) LWI%: 13.177 (440956) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9666) SWI%: 4.188 (140141) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.412 (47251) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10384) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.050 (1681) bned%: 0.000 (0) bneid%: 13.827 (462720) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (24058) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.115 (3852) DIV%: 0.013 (438) FPUN%: 1.484 (49671) FPRSUB%: 4.124 (138019) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (73) FPGT%: 2.959 (99013) FPGE%: 1.029 (34443) SYNC%: 0.000 (0) NOP%: 9.015 (301673) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 170 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 421 LOAD 38926 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1404 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49613 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 10 ORI 10282 XORI 0 MULI 9674 LW 0 LWI 143416 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 70 DIV 20 FPUN 0 FPRSUB 38 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5802 --Total thread-cycles: 4131904 --total thread-cycles issued: 3044749 (73.688766%) --iCache conflicts: 113413 (2.744812%) --thread*cycles of FU dependence: 254133 (6.150506%) --thread*cycles of data dependence: 179426 (4.342453%) --iCache cycles*banks: 4131904 (80.990608% used) Issue breakdown: --thread*cycles of issue worked: 3044749 (73.688766%) --thread*cycles of issue failed: 785482 (19.010172%) --thread*cycles of issue NOP/other: 281474977012329 (6812234240.000000%) Number of thread-cycles not ready: 179426 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3346422 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 8 4: 9 5: 8 6: 9 7: 8 8: 6 9: 8 10: 7 11: 8 12: 9 13: 6 14: 9 15: 8 16: 8 17: 8 18: 8 19: 7 20: 9 21: 8 22: 7 23: 7 24: 8 25: 8 26: 8 27: 8 28: 8 29: 8 30: 6 31: 7 <=== Core 50 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102608 in-flight CPI 1.2661 -- Total Cycles 129937 ---- Thread 01 ---- PC 5: Stalled ----- 100753 in-flight CPI 1.2895 -- Total Cycles 129937 ---- Thread 02 ---- PC 5: Stalled ----- 102082 in-flight CPI 1.2726 -- Total Cycles 129937 ---- Thread 03 ---- PC 5: Stalled ----- 94036 in-flight CPI 1.3815 -- Total Cycles 129937 ---- Thread 04 ---- PC 5: Stalled ----- 92300 in-flight CPI 1.4076 -- Total Cycles 129937 ---- Thread 05 ---- PC 5: Stalled ----- 96481 in-flight CPI 1.3465 -- Total Cycles 129937 ---- Thread 06 ---- PC 5: Stalled ----- 94381 in-flight CPI 1.3765 -- Total Cycles 129937 ---- Thread 07 ---- PC 5: Stalled ----- 96361 in-flight CPI 1.3482 -- Total Cycles 129937 ---- Thread 08 ---- PC 5: Stalled ----- 90595 in-flight CPI 1.4341 -- Total Cycles 129937 ---- Thread 09 ---- PC 5: Stalled ----- 101343 in-flight CPI 1.2819 -- Total Cycles 129937 ---- Thread 10 ---- PC 5: Stalled ----- 93689 in-flight CPI 1.3867 -- Total Cycles 129937 ---- Thread 11 ---- PC 5: Stalled ----- 99683 in-flight CPI 1.3033 -- Total Cycles 129937 ---- Thread 12 ---- PC 5: Stalled ----- 96765 in-flight CPI 1.3426 -- Total Cycles 129937 ---- Thread 13 ---- PC 5: Stalled ----- 95112 in-flight CPI 1.3659 -- Total Cycles 129937 ---- Thread 14 ---- PC 5: Stalled ----- 102075 in-flight CPI 1.2727 -- Total Cycles 129937 ---- Thread 15 ---- PC 5: Stalled ----- 94488 in-flight CPI 1.3749 -- Total Cycles 129937 ---- Thread 16 ---- PC 5: Stalled ----- 97057 in-flight CPI 1.3385 -- Total Cycles 129937 ---- Thread 17 ---- PC 5: Stalled ----- 95941 in-flight CPI 1.3541 -- Total Cycles 129937 ---- Thread 18 ---- PC 5: Stalled ----- 95300 in-flight CPI 1.3632 -- Total Cycles 129937 ---- Thread 19 ---- PC 5: Stalled ----- 100463 in-flight CPI 1.2931 -- Total Cycles 129937 ---- Thread 20 ---- PC 5: Stalled ----- 91442 in-flight CPI 1.4208 -- Total Cycles 129937 ---- Thread 21 ---- PC 5: Stalled ----- 93362 in-flight CPI 1.3916 -- Total Cycles 129937 ---- Thread 22 ---- PC 5: Stalled ----- 97530 in-flight CPI 1.3321 -- Total Cycles 129937 ---- Thread 23 ---- PC 5: Stalled ----- 95179 in-flight CPI 1.3649 -- Total Cycles 129937 ---- Thread 24 ---- PC 5: Stalled ----- 91000 in-flight CPI 1.4276 -- Total Cycles 129937 ---- Thread 25 ---- PC 5: Stalled ----- 95987 in-flight CPI 1.3534 -- Total Cycles 129937 ---- Thread 26 ---- PC 5: Stalled ----- 92012 in-flight CPI 1.4119 -- Total Cycles 129937 ---- Thread 27 ---- PC 5: Stalled ----- 91406 in-flight CPI 1.4212 -- Total Cycles 129937 ---- Thread 28 ---- PC 5: Stalled ----- 90886 in-flight CPI 1.4294 -- Total Cycles 129937 ---- Thread 29 ---- PC 5: Stalled ----- 87859 in-flight CPI 1.4786 -- Total Cycles 129937 ---- Thread 30 ---- PC 5: Stalled ----- 82597 in-flight CPI 1.5729 -- Total Cycles 129937 ---- Thread 31 ---- PC 5: Stalled ----- 89111 in-flight CPI 1.4578 -- Total Cycles 129937 Total CPI 0.0427 , IPC 23.3993 -- Total Cycles 129937 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7480 (3.895205%) FPSUB: 0 (0.000000%) FPMUL: 31128 (16.209883%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73296 (38.168839%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4107 (2.138717%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68297 (35.565613%) DIV: 7461 (3.885310%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.136436%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3342100 total) ADD%: 7.182 (240032) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.524 (50924) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.527 (17614) FPSUB%: 0.000 (0) FPMUL%: 4.710 (157414) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.114 (170917) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (581) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.054 (35215) FPLE%: 0.453 (15130) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.824 (94367) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.738 (24666) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.700 (524699) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39330) ORI%: 1.553 (51893) XORI%: 0.000 (0) MULI%: 3.222 (107688) LW%: 1.408 (47046) LWI%: 13.160 (439834) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9655) SWI%: 4.157 (138923) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.411 (47159) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10393) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1856) bned%: 0.000 (0) bneid%: 13.816 (461753) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.722 (24136) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.116 (3881) DIV%: 0.012 (404) FPUN%: 1.480 (49462) FPRSUB%: 4.164 (139155) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.955 (98766) FPGE%: 1.027 (34332) SYNC%: 0.000 (0) NOP%: 9.025 (301610) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 151 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 390 LOAD 39206 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1208 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49430 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 10661 XORI 0 MULI 9480 LW 0 LWI 143051 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 72 DIV 21 FPUN 0 FPRSUB 35 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.3995 --Total thread-cycles: 4157984 --total thread-cycles issued: 3040490 (73.124138%) --iCache conflicts: 113214 (2.722810%) --thread*cycles of FU dependence: 253795 (6.103799%) --thread*cycles of data dependence: 192031 (4.618368%) --iCache cycles*banks: 4157984 (80.378662% used) Issue breakdown: --thread*cycles of issue worked: 3040490 (73.124138%) --thread*cycles of issue failed: 815884 (19.622105%) --thread*cycles of issue NOP/other: 3971626 (95.518074%) Number of thread-cycles not ready: 192031 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3342100 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 8 3: 7 4: 6 5: 9 6: 7 7: 7 8: 6 9: 7 10: 7 11: 8 12: 7 13: 8 14: 8 15: 7 16: 7 17: 8 18: 7 19: 8 20: 6 21: 6 22: 7 23: 8 24: 7 25: 8 26: 7 27: 8 28: 7 29: 8 30: 6 31: 8 <=== Core 51 ===> ---- Thread 00 ---- PC 5: Stalled ----- 95424 in-flight CPI 1.3349 -- Total Cycles 127406 ---- Thread 01 ---- PC 5: Stalled ----- 94594 in-flight CPI 1.3466 -- Total Cycles 127406 ---- Thread 02 ---- PC 5: Stalled ----- 99632 in-flight CPI 1.2785 -- Total Cycles 127406 ---- Thread 03 ---- PC 5: Stalled ----- 98822 in-flight CPI 1.2890 -- Total Cycles 127406 ---- Thread 04 ---- PC 5: Stalled ----- 97897 in-flight CPI 1.3012 -- Total Cycles 127406 ---- Thread 05 ---- PC 5: Stalled ----- 100295 in-flight CPI 1.2701 -- Total Cycles 127406 ---- Thread 06 ---- PC 5: Stalled ----- 102585 in-flight CPI 1.2417 -- Total Cycles 127406 ---- Thread 07 ---- PC 5: Stalled ----- 97180 in-flight CPI 1.3109 -- Total Cycles 127406 ---- Thread 08 ---- PC 5: Stalled ----- 99315 in-flight CPI 1.2826 -- Total Cycles 127406 ---- Thread 09 ---- PC 5: Stalled ----- 93844 in-flight CPI 1.3574 -- Total Cycles 127406 ---- Thread 10 ---- PC 5: Stalled ----- 101247 in-flight CPI 1.2581 -- Total Cycles 127406 ---- Thread 11 ---- PC 5: Stalled ----- 96518 in-flight CPI 1.3198 -- Total Cycles 127406 ---- Thread 12 ---- PC 5: Stalled ----- 97209 in-flight CPI 1.3104 -- Total Cycles 127406 ---- Thread 13 ---- PC 5: Stalled ----- 94425 in-flight CPI 1.3490 -- Total Cycles 127406 ---- Thread 14 ---- PC 5: Stalled ----- 92890 in-flight CPI 1.3714 -- Total Cycles 127406 ---- Thread 15 ---- PC 5: Stalled ----- 94090 in-flight CPI 1.3539 -- Total Cycles 127406 ---- Thread 16 ---- PC 5: Stalled ----- 90691 in-flight CPI 1.4045 -- Total Cycles 127406 ---- Thread 17 ---- PC 5: Stalled ----- 97923 in-flight CPI 1.3008 -- Total Cycles 127406 ---- Thread 18 ---- PC 5: Stalled ----- 95675 in-flight CPI 1.3314 -- Total Cycles 127406 ---- Thread 19 ---- PC 5: Stalled ----- 90909 in-flight CPI 1.4012 -- Total Cycles 127406 ---- Thread 20 ---- PC 5: Stalled ----- 93198 in-flight CPI 1.3668 -- Total Cycles 127406 ---- Thread 21 ---- PC 5: Stalled ----- 95148 in-flight CPI 1.3388 -- Total Cycles 127406 ---- Thread 22 ---- PC 5: Stalled ----- 94338 in-flight CPI 1.3503 -- Total Cycles 127406 ---- Thread 23 ---- PC 5: Stalled ----- 96407 in-flight CPI 1.3213 -- Total Cycles 127406 ---- Thread 24 ---- PC 5: Stalled ----- 94706 in-flight CPI 1.3450 -- Total Cycles 127406 ---- Thread 25 ---- PC 5: Stalled ----- 95938 in-flight CPI 1.3277 -- Total Cycles 127406 ---- Thread 26 ---- PC 5: Stalled ----- 89656 in-flight CPI 1.4207 -- Total Cycles 127406 ---- Thread 27 ---- PC 5: Stalled ----- 89435 in-flight CPI 1.4243 -- Total Cycles 127406 ---- Thread 28 ---- PC 5: Stalled ----- 89744 in-flight CPI 1.4194 -- Total Cycles 127406 ---- Thread 29 ---- PC 5: Stalled ----- 89005 in-flight CPI 1.4312 -- Total Cycles 127406 ---- Thread 30 ---- PC 5: Stalled ----- 83405 in-flight CPI 1.5274 -- Total Cycles 127406 ---- Thread 31 ---- PC 5: Stalled ----- 83099 in-flight CPI 1.5330 -- Total Cycles 127406 Total CPI 0.0421 , IPC 23.7493 -- Total Cycles 127406 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7623 (3.909311%) FPSUB: 0 (0.000000%) FPMUL: 31340 (16.072124%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 74293 (38.099758%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4067 (2.085684%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69802 (35.796631%) DIV: 7611 (3.903157%) FPUN: 0 (0.000000%) FPRSUB: 260 (0.133336%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3325618 total) ADD%: 7.232 (240494) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.540 (51210) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.540 (17974) FPSUB%: 0.000 (0) FPMUL%: 4.741 (157655) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.108 (169877) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (585) FPINV%: 0.000 (0) FPCONV%: 0.020 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.062 (35307) FPLE%: 0.455 (15143) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.803 (93231) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.743 (24699) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.673 (521223) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39074) ORI%: 1.560 (51879) XORI%: 0.000 (0) MULI%: 3.206 (106632) LW%: 1.397 (46474) LWI%: 13.114 (436112) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9547) SWI%: 4.142 (137762) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.401 (46583) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10290) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.052 (1720) bned%: 0.000 (0) bneid%: 13.823 (459711) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (23959) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.119 (3968) DIV%: 0.012 (412) FPUN%: 1.490 (49552) FPRSUB%: 4.183 (139107) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (64) FPGT%: 2.946 (97959) FPGE%: 1.035 (34409) SYNC%: 0.000 (0) NOP%: 9.014 (299756) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 152 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 39094 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1525 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49191 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 10841 XORI 0 MULI 9582 LW 0 LWI 142124 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 76 DIV 23 FPUN 0 FPRSUB 52 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7496 --Total thread-cycles: 4076992 --total thread-cycles issued: 3025862 (74.218002%) --iCache conflicts: 112969 (2.770891%) --thread*cycles of FU dependence: 253150 (6.209235%) --thread*cycles of data dependence: 194996 (4.782840%) --iCache cycles*banks: 4076992 (81.571167% used) Issue breakdown: --thread*cycles of issue worked: 3025862 (74.218002%) --thread*cycles of issue failed: 751374 (18.429617%) --thread*cycles of issue NOP/other: 4563349568505425627 (111929330761728.000000%) Number of thread-cycles not ready: 194996 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3325618 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 7 2: 8 3: 7 4: 8 5: 8 6: 9 7: 6 8: 8 9: 7 10: 9 11: 8 12: 7 13: 8 14: 6 15: 6 16: 8 17: 9 18: 8 19: 8 20: 7 21: 8 22: 7 23: 8 24: 8 25: 8 26: 8 27: 7 28: 8 29: 7 30: 5 31: 5 <=== Core 52 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101289 in-flight CPI 1.2571 -- Total Cycles 127359 ---- Thread 01 ---- PC 5: Stalled ----- 98038 in-flight CPI 1.2989 -- Total Cycles 127359 ---- Thread 02 ---- PC 5: Stalled ----- 94954 in-flight CPI 1.3410 -- Total Cycles 127359 ---- Thread 03 ---- PC 5: Stalled ----- 101511 in-flight CPI 1.2544 -- Total Cycles 127359 ---- Thread 04 ---- PC 5: Stalled ----- 95182 in-flight CPI 1.3378 -- Total Cycles 127359 ---- Thread 05 ---- PC 5: Stalled ----- 96289 in-flight CPI 1.3225 -- Total Cycles 127359 ---- Thread 06 ---- PC 5: Stalled ----- 95256 in-flight CPI 1.3367 -- Total Cycles 127359 ---- Thread 07 ---- PC 5: Stalled ----- 97719 in-flight CPI 1.3031 -- Total Cycles 127359 ---- Thread 08 ---- PC 5: Stalled ----- 101387 in-flight CPI 1.2559 -- Total Cycles 127359 ---- Thread 09 ---- PC 5: Stalled ----- 102364 in-flight CPI 1.2439 -- Total Cycles 127359 ---- Thread 10 ---- PC 5: Stalled ----- 99107 in-flight CPI 1.2848 -- Total Cycles 127359 ---- Thread 11 ---- PC 5: Stalled ----- 94430 in-flight CPI 1.3484 -- Total Cycles 127359 ---- Thread 12 ---- PC 5: Stalled ----- 99751 in-flight CPI 1.2765 -- Total Cycles 127359 ---- Thread 13 ---- PC 5: Stalled ----- 91591 in-flight CPI 1.3903 -- Total Cycles 127359 ---- Thread 14 ---- PC 5: Stalled ----- 94477 in-flight CPI 1.3479 -- Total Cycles 127359 ---- Thread 15 ---- PC 5: Stalled ----- 90568 in-flight CPI 1.4061 -- Total Cycles 127359 ---- Thread 16 ---- PC 5: Stalled ----- 94759 in-flight CPI 1.3438 -- Total Cycles 127359 ---- Thread 17 ---- PC 5: Stalled ----- 97521 in-flight CPI 1.3057 -- Total Cycles 127359 ---- Thread 18 ---- PC 5: Stalled ----- 99888 in-flight CPI 1.2747 -- Total Cycles 127359 ---- Thread 19 ---- PC 5: Stalled ----- 95903 in-flight CPI 1.3278 -- Total Cycles 127359 ---- Thread 20 ---- PC 5: Stalled ----- 97747 in-flight CPI 1.3027 -- Total Cycles 127359 ---- Thread 21 ---- PC 5: Stalled ----- 95350 in-flight CPI 1.3354 -- Total Cycles 127359 ---- Thread 22 ---- PC 5: Stalled ----- 93417 in-flight CPI 1.3631 -- Total Cycles 127359 ---- Thread 23 ---- PC 5: Stalled ----- 90832 in-flight CPI 1.4019 -- Total Cycles 127359 ---- Thread 24 ---- PC 5: Stalled ----- 94904 in-flight CPI 1.3417 -- Total Cycles 127359 ---- Thread 25 ---- PC 5: Stalled ----- 94441 in-flight CPI 1.3483 -- Total Cycles 127359 ---- Thread 26 ---- PC 5: Stalled ----- 91402 in-flight CPI 1.3931 -- Total Cycles 127359 ---- Thread 27 ---- PC 5: Stalled ----- 91644 in-flight CPI 1.3895 -- Total Cycles 127359 ---- Thread 28 ---- PC 5: Stalled ----- 88799 in-flight CPI 1.4339 -- Total Cycles 127359 ---- Thread 29 ---- PC 5: Stalled ----- 92410 in-flight CPI 1.3779 -- Total Cycles 127359 ---- Thread 30 ---- PC 5: Stalled ----- 91678 in-flight CPI 1.3889 -- Total Cycles 127359 ---- Thread 31 ---- PC 5: Stalled ----- 91931 in-flight CPI 1.3851 -- Total Cycles 127359 Total CPI 0.0417 , IPC 24.0038 -- Total Cycles 127359 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8171 (3.979758%) FPSUB: 0 (0.000000%) FPMUL: 32490 (15.824541%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 78329 (38.150833%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4115 (2.004247%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74329 (36.202599%) DIV: 7615 (3.708953%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.129071%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3359543 total) ADD%: 7.237 (243143) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.528 (51339) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.568 (19073) FPSUB%: 0.000 (0) FPMUL%: 4.818 (161862) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (618) FPMAX%: 0.018 (618) LOAD%: 5.156 (173223) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (587) FPINV%: 0.000 (0) FPCONV%: 0.019 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.069 (35898) FPLE%: 0.454 (15267) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.788 (93651) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (25272) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.663 (526222) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39404) ORI%: 1.573 (52854) XORI%: 0.000 (0) MULI%: 3.187 (107062) LW%: 1.389 (46649) LWI%: 13.051 (438463) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9624) SWI%: 4.126 (138599) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.392 (46760) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10395) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1855) bned%: 0.000 (0) bneid%: 13.780 (462943) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.712 (23932) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4235) DIV%: 0.012 (412) FPUN%: 1.474 (49519) FPRSUB%: 4.247 (142696) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.936 (98645) FPGE%: 1.020 (34252) SYNC%: 0.000 (0) NOP%: 9.001 (302386) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 34 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 152 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 39764 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1261 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49247 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 7 ORI 11643 XORI 0 MULI 9263 LW 0 LWI 142975 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 85 DIV 25 FPUN 0 FPRSUB 58 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 24.0041 --Total thread-cycles: 4075488 --total thread-cycles issued: 3057157 (75.013275%) --iCache conflicts: 113192 (2.777385%) --thread*cycles of FU dependence: 254983 (6.256502%) --thread*cycles of data dependence: 205314 (5.037777%) --iCache cycles*banks: 4075488 (82.433685% used) Issue breakdown: --thread*cycles of issue worked: 3057157 (75.013275%) --thread*cycles of issue failed: 715945 (17.567099%) --thread*cycles of issue NOP/other: 31525197392099239 (773531828224.000000%) Number of thread-cycles not ready: 205314 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3359543 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 6 2: 8 3: 8 4: 7 5: 7 6: 8 7: 8 8: 8 9: 8 10: 8 11: 8 12: 8 13: 7 14: 6 15: 5 16: 6 17: 7 18: 9 19: 6 20: 8 21: 8 22: 6 23: 7 24: 8 25: 8 26: 7 27: 7 28: 8 29: 8 30: 8 31: 8 <=== Core 53 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101939 in-flight CPI 1.2641 -- Total Cycles 128889 ---- Thread 01 ---- PC 5: Stalled ----- 103437 in-flight CPI 1.2458 -- Total Cycles 128889 ---- Thread 02 ---- PC 5: Stalled ----- 97286 in-flight CPI 1.3246 -- Total Cycles 128889 ---- Thread 03 ---- PC 5: Stalled ----- 97605 in-flight CPI 1.3203 -- Total Cycles 128889 ---- Thread 04 ---- PC 5: Stalled ----- 100725 in-flight CPI 1.2794 -- Total Cycles 128889 ---- Thread 05 ---- PC 5: Stalled ----- 99986 in-flight CPI 1.2888 -- Total Cycles 128889 ---- Thread 06 ---- PC 5: Stalled ----- 99704 in-flight CPI 1.2925 -- Total Cycles 128889 ---- Thread 07 ---- PC 5: Stalled ----- 98935 in-flight CPI 1.3026 -- Total Cycles 128889 ---- Thread 08 ---- PC 5: Stalled ----- 94256 in-flight CPI 1.3672 -- Total Cycles 128889 ---- Thread 09 ---- PC 5: Stalled ----- 96093 in-flight CPI 1.3411 -- Total Cycles 128889 ---- Thread 10 ---- PC 5: Stalled ----- 103377 in-flight CPI 1.2466 -- Total Cycles 128889 ---- Thread 11 ---- PC 5: Stalled ----- 97995 in-flight CPI 1.3150 -- Total Cycles 128889 ---- Thread 12 ---- PC 5: Stalled ----- 97279 in-flight CPI 1.3247 -- Total Cycles 128889 ---- Thread 13 ---- PC 5: Stalled ----- 98105 in-flight CPI 1.3136 -- Total Cycles 128889 ---- Thread 14 ---- PC 5: Stalled ----- 97757 in-flight CPI 1.3183 -- Total Cycles 128889 ---- Thread 15 ---- PC 5: Stalled ----- 93183 in-flight CPI 1.3829 -- Total Cycles 128889 ---- Thread 16 ---- PC 5: Stalled ----- 90636 in-flight CPI 1.4218 -- Total Cycles 128889 ---- Thread 17 ---- PC 5: Stalled ----- 92469 in-flight CPI 1.3936 -- Total Cycles 128889 ---- Thread 18 ---- PC 5: Stalled ----- 97248 in-flight CPI 1.3251 -- Total Cycles 128889 ---- Thread 19 ---- PC 5: Stalled ----- 92422 in-flight CPI 1.3943 -- Total Cycles 128889 ---- Thread 20 ---- PC 5: Stalled ----- 97005 in-flight CPI 1.3284 -- Total Cycles 128889 ---- Thread 21 ---- PC 5: Stalled ----- 96139 in-flight CPI 1.3404 -- Total Cycles 128889 ---- Thread 22 ---- PC 5: Stalled ----- 89565 in-flight CPI 1.4388 -- Total Cycles 128889 ---- Thread 23 ---- PC 5: Stalled ----- 88411 in-flight CPI 1.4576 -- Total Cycles 128889 ---- Thread 24 ---- PC 5: Stalled ----- 94956 in-flight CPI 1.3570 -- Total Cycles 128889 ---- Thread 25 ---- PC 5: Stalled ----- 92111 in-flight CPI 1.3990 -- Total Cycles 128889 ---- Thread 26 ---- PC 5: Stalled ----- 86280 in-flight CPI 1.4936 -- Total Cycles 128889 ---- Thread 27 ---- PC 5: Stalled ----- 92947 in-flight CPI 1.3865 -- Total Cycles 128889 ---- Thread 28 ---- PC 5: Stalled ----- 95442 in-flight CPI 1.3502 -- Total Cycles 128889 ---- Thread 29 ---- PC 5: Stalled ----- 90531 in-flight CPI 1.4234 -- Total Cycles 128889 ---- Thread 30 ---- PC 5: Stalled ----- 87780 in-flight CPI 1.4681 -- Total Cycles 128889 ---- Thread 31 ---- PC 5: Stalled ----- 85734 in-flight CPI 1.5031 -- Total Cycles 128889 Total CPI 0.0423 , IPC 23.6475 -- Total Cycles 128889 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7524 (4.168352%) FPSUB: 0 (0.000000%) FPMUL: 31280 (17.329351%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 60697 (33.626591%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4242 (2.350099%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 68922 (38.183300%) DIV: 7575 (4.196606%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.145704%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3350046 total) ADD%: 7.236 (242422) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.535 (51415) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.530 (17767) FPSUB%: 0.000 (0) FPMUL%: 4.713 (157882) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (615) FPMAX%: 0.018 (615) LOAD%: 5.108 (171129) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (593) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.055 (35354) FPLE%: 0.457 (15296) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.815 (94310) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.740 (24782) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.689 (525573) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (39391) ORI%: 1.553 (52010) XORI%: 0.000 (0) MULI%: 3.215 (107718) LW%: 1.404 (47022) LWI%: 13.132 (439925) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9646) SWI%: 4.144 (138819) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.407 (47141) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10392) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1803) bned%: 0.000 (0) bneid%: 13.820 (462992) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.723 (24222) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.117 (3917) DIV%: 0.012 (410) FPUN%: 1.489 (49871) FPRSUB%: 4.163 (139456) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (83) FPGT%: 2.949 (98802) FPGE%: 1.032 (34575) SYNC%: 0.000 (0) NOP%: 9.018 (302093) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 150 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 401 LOAD 39248 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1513 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49564 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 10650 XORI 0 MULI 9970 LW 0 LWI 143275 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 63 DIV 22 FPUN 0 FPRSUB 56 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6477 --Total thread-cycles: 4124448 --total thread-cycles issued: 3047953 (73.899658%) --iCache conflicts: 113505 (2.752005%) --thread*cycles of FU dependence: 254998 (6.182597%) --thread*cycles of data dependence: 180503 (4.376416%) --iCache cycles*banks: 4124448 (81.224884% used) Issue breakdown: --thread*cycles of issue worked: 3047953 (73.899658%) --thread*cycles of issue failed: 774402 (18.775894%) --thread*cycles of issue NOP/other: 4536538595914736206 (109991411318784.000000%) Number of thread-cycles not ready: 180503 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3350046 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 8 3: 7 4: 8 5: 8 6: 8 7: 7 8: 8 9: 7 10: 8 11: 8 12: 8 13: 7 14: 6 15: 7 16: 7 17: 7 18: 8 19: 7 20: 8 21: 7 22: 7 23: 7 24: 9 25: 7 26: 7 27: 7 28: 8 29: 7 30: 6 31: 6 <=== Core 54 ===> ---- Thread 00 ---- PC 5: Stalled ----- 94632 in-flight CPI 1.4891 -- Total Cycles 140941 ---- Thread 01 ---- PC 5: Stalled ----- 88793 in-flight CPI 1.5871 -- Total Cycles 140941 ---- Thread 02 ---- PC 5: Stalled ----- 101597 in-flight CPI 1.3870 -- Total Cycles 140941 ---- Thread 03 ---- PC 5: Stalled ----- 96423 in-flight CPI 1.4615 -- Total Cycles 140941 ---- Thread 04 ---- PC 5: Stalled ----- 98933 in-flight CPI 1.4243 -- Total Cycles 140941 ---- Thread 05 ---- PC 5: Stalled ----- 94890 in-flight CPI 1.4850 -- Total Cycles 140941 ---- Thread 06 ---- PC 5: Stalled ----- 99083 in-flight CPI 1.4222 -- Total Cycles 140941 ---- Thread 07 ---- PC 5: Stalled ----- 96262 in-flight CPI 1.4639 -- Total Cycles 140941 ---- Thread 08 ---- PC 5: Stalled ----- 98693 in-flight CPI 1.4278 -- Total Cycles 140941 ---- Thread 09 ---- PC 5: Stalled ----- 95479 in-flight CPI 1.4759 -- Total Cycles 140941 ---- Thread 10 ---- PC 5: Stalled ----- 98682 in-flight CPI 1.4280 -- Total Cycles 140941 ---- Thread 11 ---- PC 5: Stalled ----- 98047 in-flight CPI 1.4372 -- Total Cycles 140941 ---- Thread 12 ---- PC 5: Stalled ----- 99272 in-flight CPI 1.4195 -- Total Cycles 140941 ---- Thread 13 ---- PC 5: Stalled ----- 101031 in-flight CPI 1.3948 -- Total Cycles 140941 ---- Thread 14 ---- PC 5: Stalled ----- 92519 in-flight CPI 1.5231 -- Total Cycles 140941 ---- Thread 15 ---- PC 5: Stalled ----- 97119 in-flight CPI 1.4509 -- Total Cycles 140941 ---- Thread 16 ---- PC 5: Stalled ----- 105642 in-flight CPI 1.3340 -- Total Cycles 140941 ---- Thread 17 ---- PC 5: Stalled ----- 95863 in-flight CPI 1.4699 -- Total Cycles 140941 ---- Thread 18 ---- PC 5: Stalled ----- 98235 in-flight CPI 1.4345 -- Total Cycles 140941 ---- Thread 19 ---- PC 5: Stalled ----- 94200 in-flight CPI 1.4959 -- Total Cycles 140941 ---- Thread 20 ---- PC 5: Stalled ----- 90976 in-flight CPI 1.5489 -- Total Cycles 140941 ---- Thread 21 ---- PC 5: Stalled ----- 94875 in-flight CPI 1.4852 -- Total Cycles 140941 ---- Thread 22 ---- PC 5: Stalled ----- 89051 in-flight CPI 1.5824 -- Total Cycles 140941 ---- Thread 23 ---- PC 5: Stalled ----- 91183 in-flight CPI 1.5454 -- Total Cycles 140941 ---- Thread 24 ---- PC 5: Stalled ----- 92059 in-flight CPI 1.5307 -- Total Cycles 140941 ---- Thread 25 ---- PC 5: Stalled ----- 94175 in-flight CPI 1.4963 -- Total Cycles 140941 ---- Thread 26 ---- PC 5: Stalled ----- 92190 in-flight CPI 1.5285 -- Total Cycles 140941 ---- Thread 27 ---- PC 5: Stalled ----- 90792 in-flight CPI 1.5521 -- Total Cycles 140941 ---- Thread 28 ---- PC 5: Stalled ----- 89100 in-flight CPI 1.5815 -- Total Cycles 140941 ---- Thread 29 ---- PC 5: Stalled ----- 90714 in-flight CPI 1.5534 -- Total Cycles 140941 ---- Thread 30 ---- PC 5: Stalled ----- 91745 in-flight CPI 1.5359 -- Total Cycles 140941 ---- Thread 31 ---- PC 5: Stalled ----- 89643 in-flight CPI 1.5719 -- Total Cycles 140941 Total CPI 0.0463 , IPC 21.5868 -- Total Cycles 140941 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7755 (4.065829%) FPSUB: 0 (0.000000%) FPMUL: 31615 (16.575266%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 68361 (35.840641%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4265 (2.236075%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70836 (37.138245%) DIV: 7645 (4.008158%) FPUN: 0 (0.000000%) FPRSUB: 259 (0.135790%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3344085 total) ADD%: 7.206 (240963) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.520 (50826) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.546 (18255) FPSUB%: 0.000 (0) FPMUL%: 4.757 (159073) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.143 (171970) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (597) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (35440) FPLE%: 0.457 (15295) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.813 (94057) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.742 (24818) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.687 (524578) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39267) ORI%: 1.552 (51898) XORI%: 0.000 (0) MULI%: 3.207 (107248) LW%: 1.402 (46886) LWI%: 13.114 (438544) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9631) SWI%: 4.150 (138774) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.406 (47005) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10383) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1930) bned%: 0.000 (0) bneid%: 13.791 (461199) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23990) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (4028) DIV%: 0.012 (414) FPUN%: 1.474 (49306) FPRSUB%: 4.194 (140256) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (65) FPGT%: 2.947 (98547) FPGE%: 1.017 (34011) SYNC%: 0.000 (0) NOP%: 9.018 (301566) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 23 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 148 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 39060 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 15 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1541 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 3 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49295 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 11027 XORI 0 MULI 9620 LW 0 LWI 142746 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 63 DIV 31 FPUN 0 FPRSUB 45 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.5870 --Total thread-cycles: 4510112 --total thread-cycles issued: 3042519 (67.459938%) --iCache conflicts: 113192 (2.509738%) --thread*cycles of FU dependence: 254063 (5.633186%) --thread*cycles of data dependence: 190736 (4.229074%) --iCache cycles*banks: 4510112 (74.147095% used) Issue breakdown: --thread*cycles of issue worked: 3042519 (67.459938%) --thread*cycles of issue failed: 1166027 (25.853617%) --thread*cycles of issue NOP/other: 7641598 (169.432556%) Number of thread-cycles not ready: 190736 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3344085 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 6 2: 8 3: 6 4: 9 5: 9 6: 7 7: 7 8: 7 9: 7 10: 8 11: 7 12: 7 13: 7 14: 7 15: 8 16: 6 17: 8 18: 8 19: 8 20: 7 21: 8 22: 7 23: 8 24: 8 25: 8 26: 7 27: 7 28: 7 29: 8 30: 8 31: 8 <=== Core 55 ===> ---- Thread 00 ---- PC 5: Stalled ----- 97845 in-flight CPI 1.3153 -- Total Cycles 128717 ---- Thread 01 ---- PC 5: Stalled ----- 98420 in-flight CPI 1.3076 -- Total Cycles 128717 ---- Thread 02 ---- PC 5: Stalled ----- 96251 in-flight CPI 1.3371 -- Total Cycles 128717 ---- Thread 03 ---- PC 5: Stalled ----- 103490 in-flight CPI 1.2435 -- Total Cycles 128717 ---- Thread 04 ---- PC 5: Stalled ----- 101370 in-flight CPI 1.2695 -- Total Cycles 128717 ---- Thread 05 ---- PC 5: Stalled ----- 96375 in-flight CPI 1.3353 -- Total Cycles 128717 ---- Thread 06 ---- PC 5: Stalled ----- 103998 in-flight CPI 1.2374 -- Total Cycles 128717 ---- Thread 07 ---- PC 5: Stalled ----- 96311 in-flight CPI 1.3362 -- Total Cycles 128717 ---- Thread 08 ---- PC 5: Stalled ----- 97132 in-flight CPI 1.3250 -- Total Cycles 128717 ---- Thread 09 ---- PC 5: Stalled ----- 96434 in-flight CPI 1.3345 -- Total Cycles 128717 ---- Thread 10 ---- PC 5: Stalled ----- 100004 in-flight CPI 1.2869 -- Total Cycles 128717 ---- Thread 11 ---- PC 5: Stalled ----- 87471 in-flight CPI 1.4714 -- Total Cycles 128717 ---- Thread 12 ---- PC 5: Stalled ----- 101974 in-flight CPI 1.2620 -- Total Cycles 128717 ---- Thread 13 ---- PC 5: Stalled ----- 89277 in-flight CPI 1.4416 -- Total Cycles 128717 ---- Thread 14 ---- PC 5: Stalled ----- 90185 in-flight CPI 1.4271 -- Total Cycles 128717 ---- Thread 15 ---- PC 5: Stalled ----- 92142 in-flight CPI 1.3967 -- Total Cycles 128717 ---- Thread 16 ---- PC 5: Stalled ----- 97370 in-flight CPI 1.3217 -- Total Cycles 128717 ---- Thread 17 ---- PC 5: Stalled ----- 97764 in-flight CPI 1.3164 -- Total Cycles 128717 ---- Thread 18 ---- PC 5: Stalled ----- 95697 in-flight CPI 1.3448 -- Total Cycles 128717 ---- Thread 19 ---- PC 5: Stalled ----- 92308 in-flight CPI 1.3941 -- Total Cycles 128717 ---- Thread 20 ---- PC 5: Stalled ----- 92620 in-flight CPI 1.3895 -- Total Cycles 128717 ---- Thread 21 ---- PC 5: Stalled ----- 95107 in-flight CPI 1.3531 -- Total Cycles 128717 ---- Thread 22 ---- PC 5: Stalled ----- 93671 in-flight CPI 1.3739 -- Total Cycles 128717 ---- Thread 23 ---- PC 5: Stalled ----- 96838 in-flight CPI 1.3289 -- Total Cycles 128717 ---- Thread 24 ---- PC 5: Stalled ----- 86916 in-flight CPI 1.4807 -- Total Cycles 128717 ---- Thread 25 ---- PC 5: Stalled ----- 93516 in-flight CPI 1.3762 -- Total Cycles 128717 ---- Thread 26 ---- PC 5: Stalled ----- 96288 in-flight CPI 1.3365 -- Total Cycles 128717 ---- Thread 27 ---- PC 5: Stalled ----- 92940 in-flight CPI 1.3847 -- Total Cycles 128717 ---- Thread 28 ---- PC 5: Stalled ----- 87824 in-flight CPI 1.4654 -- Total Cycles 128717 ---- Thread 29 ---- PC 5: Stalled ----- 86727 in-flight CPI 1.4839 -- Total Cycles 128717 ---- Thread 30 ---- PC 5: Stalled ----- 84100 in-flight CPI 1.5303 -- Total Cycles 128717 ---- Thread 31 ---- PC 5: Stalled ----- 83055 in-flight CPI 1.5495 -- Total Cycles 128717 Total CPI 0.0426 , IPC 23.4776 -- Total Cycles 128717 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8110 (3.837709%) FPSUB: 0 (0.000000%) FPMUL: 32140 (15.208873%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 85604 (40.508415%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3960 (1.873900%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73905 (34.972363%) DIV: 7347 (3.476652%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.122087%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3321289 total) ADD%: 7.171 (238178) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.519 (50450) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.568 (18861) FPSUB%: 0.000 (0) FPMUL%: 4.824 (160222) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.184 (172172) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (569) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.069 (35489) FPLE%: 0.454 (15065) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.797 (92889) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.754 (25058) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.669 (520425) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (38957) ORI%: 1.572 (52217) XORI%: 0.000 (0) MULI%: 3.189 (105906) LW%: 1.394 (46305) LWI%: 13.062 (433840) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9508) SWI%: 4.126 (137027) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.397 (46412) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10309) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2049) bned%: 0.000 (0) bneid%: 13.765 (457171) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.713 (23681) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4213) DIV%: 0.012 (398) FPUN%: 1.468 (48768) FPRSUB%: 4.262 (141565) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.932 (97396) FPGE%: 1.015 (33703) SYNC%: 0.000 (0) NOP%: 9.011 (299272) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 39 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 150 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 383 LOAD 40829 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1185 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48748 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 11545 XORI 0 MULI 9308 LW 0 LWI 141604 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 71 DIV 19 FPUN 0 FPRSUB 53 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4778 --Total thread-cycles: 4118944 --total thread-cycles issued: 3022017 (73.368736%) --iCache conflicts: 111173 (2.699066%) --thread*cycles of FU dependence: 253990 (6.166386%) --thread*cycles of data dependence: 211324 (5.130538%) --iCache cycles*banks: 4118944 (80.635254% used) Issue breakdown: --thread*cycles of issue worked: 3022017 (73.368736%) --thread*cycles of issue failed: 797655 (19.365522%) --thread*cycles of issue NOP/other: 4670985185535825625 (113402496155648.000000%) Number of thread-cycles not ready: 211324 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3321289 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 6 3: 9 4: 9 5: 8 6: 9 7: 8 8: 7 9: 7 10: 8 11: 5 12: 8 13: 6 14: 5 15: 7 16: 8 17: 7 18: 7 19: 8 20: 7 21: 8 22: 7 23: 8 24: 6 25: 7 26: 8 27: 8 28: 7 29: 6 30: 6 31: 6 <=== Core 56 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100798 in-flight CPI 1.2784 -- Total Cycles 128883 ---- Thread 01 ---- PC 5: Stalled ----- 99690 in-flight CPI 1.2926 -- Total Cycles 128883 ---- Thread 02 ---- PC 5: Stalled ----- 104084 in-flight CPI 1.2380 -- Total Cycles 128883 ---- Thread 03 ---- PC 5: Stalled ----- 102961 in-flight CPI 1.2515 -- Total Cycles 128883 ---- Thread 04 ---- PC 5: Stalled ----- 92625 in-flight CPI 1.3912 -- Total Cycles 128883 ---- Thread 05 ---- PC 5: Stalled ----- 98450 in-flight CPI 1.3089 -- Total Cycles 128883 ---- Thread 06 ---- PC 5: Stalled ----- 103477 in-flight CPI 1.2453 -- Total Cycles 128883 ---- Thread 07 ---- PC 5: Stalled ----- 99131 in-flight CPI 1.2999 -- Total Cycles 128883 ---- Thread 08 ---- PC 5: Stalled ----- 103119 in-flight CPI 1.2496 -- Total Cycles 128883 ---- Thread 09 ---- PC 5: Stalled ----- 95031 in-flight CPI 1.3560 -- Total Cycles 128883 ---- Thread 10 ---- PC 5: Stalled ----- 92533 in-flight CPI 1.3926 -- Total Cycles 128883 ---- Thread 11 ---- PC 5: Stalled ----- 95988 in-flight CPI 1.3424 -- Total Cycles 128883 ---- Thread 12 ---- PC 5: Stalled ----- 93624 in-flight CPI 1.3764 -- Total Cycles 128883 ---- Thread 13 ---- PC 5: Stalled ----- 91513 in-flight CPI 1.4081 -- Total Cycles 128883 ---- Thread 14 ---- PC 5: Stalled ----- 96517 in-flight CPI 1.3350 -- Total Cycles 128883 ---- Thread 15 ---- PC 5: Stalled ----- 95506 in-flight CPI 1.3492 -- Total Cycles 128883 ---- Thread 16 ---- PC 5: Stalled ----- 97919 in-flight CPI 1.3159 -- Total Cycles 128883 ---- Thread 17 ---- PC 5: Stalled ----- 98408 in-flight CPI 1.3094 -- Total Cycles 128883 ---- Thread 18 ---- PC 5: Stalled ----- 94294 in-flight CPI 1.3666 -- Total Cycles 128883 ---- Thread 19 ---- PC 5: Stalled ----- 91606 in-flight CPI 1.4067 -- Total Cycles 128883 ---- Thread 20 ---- PC 5: Stalled ----- 94496 in-flight CPI 1.3636 -- Total Cycles 128883 ---- Thread 21 ---- PC 5: Stalled ----- 89309 in-flight CPI 1.4429 -- Total Cycles 128883 ---- Thread 22 ---- PC 5: Stalled ----- 89093 in-flight CPI 1.4464 -- Total Cycles 128883 ---- Thread 23 ---- PC 5: Stalled ----- 89005 in-flight CPI 1.4477 -- Total Cycles 128883 ---- Thread 24 ---- PC 5: Stalled ----- 92205 in-flight CPI 1.3975 -- Total Cycles 128883 ---- Thread 25 ---- PC 5: Stalled ----- 93391 in-flight CPI 1.3798 -- Total Cycles 128883 ---- Thread 26 ---- PC 5: Stalled ----- 94174 in-flight CPI 1.3683 -- Total Cycles 128883 ---- Thread 27 ---- PC 5: Stalled ----- 88901 in-flight CPI 1.4496 -- Total Cycles 128883 ---- Thread 28 ---- PC 5: Stalled ----- 96229 in-flight CPI 1.3391 -- Total Cycles 128883 ---- Thread 29 ---- PC 5: Stalled ----- 88866 in-flight CPI 1.4500 -- Total Cycles 128883 ---- Thread 30 ---- PC 5: Stalled ----- 84660 in-flight CPI 1.5221 -- Total Cycles 128883 ---- Thread 31 ---- PC 5: Stalled ----- 85010 in-flight CPI 1.5157 -- Total Cycles 128883 Total CPI 0.0425 , IPC 23.5343 -- Total Cycles 128883 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8146 (4.148038%) FPSUB: 0 (0.000000%) FPMUL: 32399 (16.497948%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 69795 (35.540424%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4229 (2.153456%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73940 (37.651108%) DIV: 7615 (3.877646%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.131377%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3333902 total) ADD%: 7.155 (238543) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.535 (51192) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.574 (19136) FPSUB%: 0.000 (0) FPMUL%: 4.835 (161197) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.149 (171671) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (594) FPINV%: 0.000 (0) FPCONV%: 0.019 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.071 (35693) FPLE%: 0.454 (15148) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.787 (92906) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (25061) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.658 (522020) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (39023) ORI%: 1.589 (52973) XORI%: 0.000 (0) MULI%: 3.184 (106154) LW%: 1.390 (46336) LWI%: 13.049 (435032) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9490) SWI%: 4.120 (137354) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.393 (46454) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10276) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1937) bned%: 0.000 (0) bneid%: 13.794 (459872) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.718 (23941) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.126 (4212) DIV%: 0.012 (412) FPUN%: 1.483 (49455) FPRSUB%: 4.250 (141689) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.935 (97834) FPGE%: 1.029 (34307) SYNC%: 0.000 (0) NOP%: 9.019 (300671) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 148 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 39953 INTCONV 0 ATOMIC_INC 21 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 16 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1544 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48901 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 11639 XORI 0 MULI 8927 LW 0 LWI 141737 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 86 DIV 24 FPUN 0 FPRSUB 52 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5346 --Total thread-cycles: 4124256 --total thread-cycles issued: 3033231 (73.546135%) --iCache conflicts: 113847 (2.760425%) --thread*cycles of FU dependence: 253526 (6.147193%) --thread*cycles of data dependence: 196382 (4.761635%) --iCache cycles*banks: 4124256 (80.837219% used) Issue breakdown: --thread*cycles of issue worked: 3033231 (73.546135%) --thread*cycles of issue failed: 790354 (19.163553%) --thread*cycles of issue NOP/other: 4500924 (109.132996%) Number of thread-cycles not ready: 196382 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3333902 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 8 4: 7 5: 7 6: 8 7: 8 8: 9 9: 6 10: 7 11: 8 12: 7 13: 7 14: 9 15: 9 16: 9 17: 8 18: 6 19: 6 20: 9 21: 6 22: 6 23: 8 24: 7 25: 8 26: 7 27: 5 28: 8 29: 7 30: 6 31: 8 <=== Core 57 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98385 in-flight CPI 1.3101 -- Total Cycles 128918 ---- Thread 01 ---- PC 5: Stalled ----- 96304 in-flight CPI 1.3384 -- Total Cycles 128918 ---- Thread 02 ---- PC 5: Stalled ----- 99477 in-flight CPI 1.2958 -- Total Cycles 128918 ---- Thread 03 ---- PC 5: Stalled ----- 100842 in-flight CPI 1.2782 -- Total Cycles 128918 ---- Thread 04 ---- PC 5: Stalled ----- 100673 in-flight CPI 1.2803 -- Total Cycles 128918 ---- Thread 05 ---- PC 5: Stalled ----- 93820 in-flight CPI 1.3739 -- Total Cycles 128918 ---- Thread 06 ---- PC 5: Stalled ----- 97067 in-flight CPI 1.3279 -- Total Cycles 128918 ---- Thread 07 ---- PC 5: Stalled ----- 97048 in-flight CPI 1.3281 -- Total Cycles 128918 ---- Thread 08 ---- PC 5: Stalled ----- 95298 in-flight CPI 1.3526 -- Total Cycles 128918 ---- Thread 09 ---- PC 5: Stalled ----- 97303 in-flight CPI 1.3247 -- Total Cycles 128918 ---- Thread 10 ---- PC 5: Stalled ----- 94361 in-flight CPI 1.3659 -- Total Cycles 128918 ---- Thread 11 ---- PC 5: Stalled ----- 97778 in-flight CPI 1.3183 -- Total Cycles 128918 ---- Thread 12 ---- PC 5: Stalled ----- 102471 in-flight CPI 1.2579 -- Total Cycles 128918 ---- Thread 13 ---- PC 5: Stalled ----- 99404 in-flight CPI 1.2966 -- Total Cycles 128918 ---- Thread 14 ---- PC 5: Stalled ----- 97867 in-flight CPI 1.3171 -- Total Cycles 128918 ---- Thread 15 ---- PC 5: Stalled ----- 96214 in-flight CPI 1.3396 -- Total Cycles 128918 ---- Thread 16 ---- PC 5: Stalled ----- 90423 in-flight CPI 1.4255 -- Total Cycles 128918 ---- Thread 17 ---- PC 5: Stalled ----- 95909 in-flight CPI 1.3439 -- Total Cycles 128918 ---- Thread 18 ---- PC 5: Stalled ----- 94914 in-flight CPI 1.3580 -- Total Cycles 128918 ---- Thread 19 ---- PC 5: Stalled ----- 96634 in-flight CPI 1.3338 -- Total Cycles 128918 ---- Thread 20 ---- PC 5: Stalled ----- 89396 in-flight CPI 1.4418 -- Total Cycles 128918 ---- Thread 21 ---- PC 5: Stalled ----- 91777 in-flight CPI 1.4044 -- Total Cycles 128918 ---- Thread 22 ---- PC 5: Stalled ----- 91742 in-flight CPI 1.4050 -- Total Cycles 128918 ---- Thread 23 ---- PC 5: Stalled ----- 92350 in-flight CPI 1.3957 -- Total Cycles 128918 ---- Thread 24 ---- PC 5: Stalled ----- 88520 in-flight CPI 1.4562 -- Total Cycles 128918 ---- Thread 25 ---- PC 5: Stalled ----- 92504 in-flight CPI 1.3934 -- Total Cycles 128918 ---- Thread 26 ---- PC 5: Stalled ----- 92349 in-flight CPI 1.3957 -- Total Cycles 128918 ---- Thread 27 ---- PC 5: Stalled ----- 84766 in-flight CPI 1.5206 -- Total Cycles 128918 ---- Thread 28 ---- PC 5: Stalled ----- 92781 in-flight CPI 1.3892 -- Total Cycles 128918 ---- Thread 29 ---- PC 5: Stalled ----- 92051 in-flight CPI 1.4002 -- Total Cycles 128918 ---- Thread 30 ---- PC 5: Stalled ----- 89913 in-flight CPI 1.4336 -- Total Cycles 128918 ---- Thread 31 ---- PC 5: Stalled ----- 94084 in-flight CPI 1.3700 -- Total Cycles 128918 Total CPI 0.0425 , IPC 23.5419 -- Total Cycles 128918 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7823 (4.005591%) FPSUB: 0 (0.000000%) FPMUL: 31749 (16.256363%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73027 (37.391834%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4085 (2.091632%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70898 (36.301727%) DIV: 7456 (3.817677%) FPUN: 0 (0.000000%) FPRSUB: 264 (0.135175%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3335604 total) ADD%: 7.230 (241171) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.534 (51154) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.551 (18364) FPSUB%: 0.000 (0) FPMUL%: 4.771 (159144) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.128 (171034) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (580) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.063 (35449) FPLE%: 0.456 (15202) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.801 (93432) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24895) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.672 (522741) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39127) ORI%: 1.567 (52279) XORI%: 0.000 (0) MULI%: 3.201 (106758) LW%: 1.397 (46587) LWI%: 13.091 (436648) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9553) SWI%: 4.130 (137772) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (46699) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10338) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1884) bned%: 0.000 (0) bneid%: 13.800 (460306) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.722 (24071) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4034) DIV%: 0.012 (404) FPUN%: 1.485 (49521) FPRSUB%: 4.205 (140257) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.939 (98046) FPGE%: 1.029 (34319) SYNC%: 0.000 (0) NOP%: 9.011 (300573) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 29 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 154 FPSUB 0 FPMUL 4 FPCMPLT 0 FPMIN 0 FPMAX 399 LOAD 39707 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1092 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49144 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11125 XORI 0 MULI 9824 LW 0 LWI 142192 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 51 DIV 18 FPUN 0 FPRSUB 53 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5422 --Total thread-cycles: 4125376 --total thread-cycles issued: 3035031 (73.569801%) --iCache conflicts: 113439 (2.749786%) --thread*cycles of FU dependence: 253844 (6.153233%) --thread*cycles of data dependence: 195302 (4.734162%) --iCache cycles*banks: 4125376 (80.856529% used) Issue breakdown: --thread*cycles of issue worked: 3035031 (73.569801%) --thread*cycles of issue failed: 789772 (19.144243%) --thread*cycles of issue NOP/other: 4510238074874995405 (109329139105792.000000%) Number of thread-cycles not ready: 195302 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3335604 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 6 3: 7 4: 8 5: 7 6: 8 7: 8 8: 7 9: 7 10: 8 11: 7 12: 8 13: 9 14: 7 15: 9 16: 7 17: 7 18: 7 19: 8 20: 7 21: 8 22: 6 23: 7 24: 5 25: 8 26: 7 27: 6 28: 8 29: 8 30: 6 31: 7 <=== Core 58 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103952 in-flight CPI 1.2280 -- Total Cycles 127680 ---- Thread 01 ---- PC 5: Stalled ----- 99105 in-flight CPI 1.2881 -- Total Cycles 127680 ---- Thread 02 ---- PC 5: Stalled ----- 99456 in-flight CPI 1.2835 -- Total Cycles 127680 ---- Thread 03 ---- PC 5: Stalled ----- 103594 in-flight CPI 1.2322 -- Total Cycles 127680 ---- Thread 04 ---- PC 5: Stalled ----- 101282 in-flight CPI 1.2604 -- Total Cycles 127680 ---- Thread 05 ---- PC 5: Stalled ----- 102005 in-flight CPI 1.2514 -- Total Cycles 127680 ---- Thread 06 ---- PC 5: Stalled ----- 98224 in-flight CPI 1.2996 -- Total Cycles 127680 ---- Thread 07 ---- PC 5: Stalled ----- 97139 in-flight CPI 1.3142 -- Total Cycles 127680 ---- Thread 08 ---- PC 5: Stalled ----- 100632 in-flight CPI 1.2685 -- Total Cycles 127680 ---- Thread 09 ---- PC 5: Stalled ----- 96599 in-flight CPI 1.3215 -- Total Cycles 127680 ---- Thread 10 ---- PC 5: Stalled ----- 96552 in-flight CPI 1.3221 -- Total Cycles 127680 ---- Thread 11 ---- PC 5: Stalled ----- 91052 in-flight CPI 1.4020 -- Total Cycles 127680 ---- Thread 12 ---- PC 5: Stalled ----- 96035 in-flight CPI 1.3292 -- Total Cycles 127680 ---- Thread 13 ---- PC 5: Stalled ----- 93003 in-flight CPI 1.3726 -- Total Cycles 127680 ---- Thread 14 ---- PC 5: Stalled ----- 94903 in-flight CPI 1.3451 -- Total Cycles 127680 ---- Thread 15 ---- PC 5: Stalled ----- 91527 in-flight CPI 1.3947 -- Total Cycles 127680 ---- Thread 16 ---- PC 5: Stalled ----- 95723 in-flight CPI 1.3336 -- Total Cycles 127680 ---- Thread 17 ---- PC 5: Stalled ----- 94837 in-flight CPI 1.3460 -- Total Cycles 127680 ---- Thread 18 ---- PC 5: Stalled ----- 91728 in-flight CPI 1.3916 -- Total Cycles 127680 ---- Thread 19 ---- PC 5: Stalled ----- 90765 in-flight CPI 1.4065 -- Total Cycles 127680 ---- Thread 20 ---- PC 5: Stalled ----- 97076 in-flight CPI 1.3150 -- Total Cycles 127680 ---- Thread 21 ---- PC 5: Stalled ----- 95277 in-flight CPI 1.3399 -- Total Cycles 127680 ---- Thread 22 ---- PC 5: Stalled ----- 88852 in-flight CPI 1.4368 -- Total Cycles 127680 ---- Thread 23 ---- PC 5: Stalled ----- 91942 in-flight CPI 1.3884 -- Total Cycles 127680 ---- Thread 24 ---- PC 5: Stalled ----- 95334 in-flight CPI 1.3391 -- Total Cycles 127680 ---- Thread 25 ---- PC 5: Stalled ----- 96674 in-flight CPI 1.3204 -- Total Cycles 127680 ---- Thread 26 ---- PC 5: Stalled ----- 90101 in-flight CPI 1.4168 -- Total Cycles 127680 ---- Thread 27 ---- PC 5: Stalled ----- 92017 in-flight CPI 1.3873 -- Total Cycles 127680 ---- Thread 28 ---- PC 5: Stalled ----- 91883 in-flight CPI 1.3893 -- Total Cycles 127680 ---- Thread 29 ---- PC 5: Stalled ----- 88454 in-flight CPI 1.4432 -- Total Cycles 127680 ---- Thread 30 ---- PC 5: Stalled ----- 93064 in-flight CPI 1.3717 -- Total Cycles 127680 ---- Thread 31 ---- PC 5: Stalled ----- 92607 in-flight CPI 1.3785 -- Total Cycles 127680 Total CPI 0.0418 , IPC 23.9034 -- Total Cycles 127680 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7654 (4.095523%) FPSUB: 0 (0.000000%) FPMUL: 31507 (16.858850%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 64299 (34.405281%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4465 (2.389144%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70751 (37.857635%) DIV: 7946 (4.251767%) FPUN: 0 (0.000000%) FPRSUB: 265 (0.141797%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3354573 total) ADD%: 7.216 (242081) SUB%: 0.000 (0) MUL%: 0.006 (215) BITOR%: 1.519 (50959) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.541 (18141) FPSUB%: 0.000 (0) FPMUL%: 4.740 (159022) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (645) FPMAX%: 0.019 (645) LOAD%: 5.124 (171878) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (247) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (619) FPINV%: 0.000 (0) FPCONV%: 0.020 (677) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (35542) FPLE%: 0.454 (15226) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (645) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.813 (94365) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.741 (24859) CMPU%: 0.000 (0) RSUB%: 0.006 (215) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.680 (525996) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39415) ORI%: 1.547 (51883) XORI%: 0.000 (0) MULI%: 3.212 (107758) LW%: 1.403 (47069) LWI%: 13.139 (440743) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9636) SWI%: 4.164 (139675) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.407 (47194) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10392) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1853) bned%: 0.000 (0) bneid%: 13.802 (462991) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.714 (23954) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (4017) DIV%: 0.013 (430) FPUN%: 1.473 (49419) FPRSUB%: 4.178 (140162) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (80) FPGT%: 2.954 (99102) FPGE%: 1.019 (34193) SYNC%: 0.000 (0) NOP%: 9.019 (302534) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 163 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 415 LOAD 39284 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1497 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49637 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 10882 XORI 0 MULI 9984 LW 0 LWI 143375 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 21 FPUN 0 FPRSUB 30 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9036 --Total thread-cycles: 4085760 --total thread-cycles issued: 3052039 (74.699417%) --iCache conflicts: 114111 (2.792895%) --thread*cycles of FU dependence: 255434 (6.251811%) --thread*cycles of data dependence: 186887 (4.574106%) --iCache cycles*banks: 4085760 (82.104797% used) Issue breakdown: --thread*cycles of issue worked: 3052039 (74.699417%) --thread*cycles of issue failed: 731187 (17.895985%) --thread*cycles of issue NOP/other: 4611686018427690438 (112872168357888.000000%) Number of thread-cycles not ready: 186887 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3354573 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 9 4: 9 5: 9 6: 8 7: 7 8: 8 9: 7 10: 8 11: 7 12: 8 13: 8 14: 7 15: 8 16: 8 17: 8 18: 8 19: 7 20: 8 21: 7 22: 6 23: 8 24: 7 25: 9 26: 8 27: 8 28: 7 29: 7 30: 7 31: 7 <=== Core 59 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102187 in-flight CPI 1.4803 -- Total Cycles 151295 ---- Thread 01 ---- PC 5: Stalled ----- 98188 in-flight CPI 1.5406 -- Total Cycles 151295 ---- Thread 02 ---- PC 5: Stalled ----- 96238 in-flight CPI 1.5717 -- Total Cycles 151295 ---- Thread 03 ---- PC 5: Stalled ----- 101862 in-flight CPI 1.4850 -- Total Cycles 151295 ---- Thread 04 ---- PC 5: Stalled ----- 96962 in-flight CPI 1.5600 -- Total Cycles 151295 ---- Thread 05 ---- PC 5: Stalled ----- 99267 in-flight CPI 1.5239 -- Total Cycles 151295 ---- Thread 06 ---- PC 5: Stalled ----- 99197 in-flight CPI 1.5249 -- Total Cycles 151295 ---- Thread 07 ---- PC 5: Stalled ----- 97035 in-flight CPI 1.5589 -- Total Cycles 151295 ---- Thread 08 ---- PC 5: Stalled ----- 101281 in-flight CPI 1.4935 -- Total Cycles 151295 ---- Thread 09 ---- PC 5: Stalled ----- 97707 in-flight CPI 1.5482 -- Total Cycles 151295 ---- Thread 10 ---- PC 5: Stalled ----- 92011 in-flight CPI 1.6440 -- Total Cycles 151295 ---- Thread 11 ---- PC 5: Stalled ----- 95575 in-flight CPI 1.5827 -- Total Cycles 151295 ---- Thread 12 ---- PC 5: Stalled ----- 97772 in-flight CPI 1.5471 -- Total Cycles 151295 ---- Thread 13 ---- PC 5: Stalled ----- 99481 in-flight CPI 1.5205 -- Total Cycles 151295 ---- Thread 14 ---- PC 5: Stalled ----- 98327 in-flight CPI 1.5383 -- Total Cycles 151295 ---- Thread 15 ---- PC 5: Stalled ----- 97080 in-flight CPI 1.5581 -- Total Cycles 151295 ---- Thread 16 ---- PC 5: Stalled ----- 88831 in-flight CPI 1.7029 -- Total Cycles 151295 ---- Thread 17 ---- PC 5: Stalled ----- 88237 in-flight CPI 1.7144 -- Total Cycles 151295 ---- Thread 18 ---- PC 5: Stalled ----- 110975 in-flight CPI 1.3632 -- Total Cycles 151295 ---- Thread 19 ---- PC 5: Stalled ----- 91798 in-flight CPI 1.6479 -- Total Cycles 151295 ---- Thread 20 ---- PC 5: Stalled ----- 90547 in-flight CPI 1.6706 -- Total Cycles 151295 ---- Thread 21 ---- PC 5: Stalled ----- 85650 in-flight CPI 1.7662 -- Total Cycles 151295 ---- Thread 22 ---- PC 5: Stalled ----- 91866 in-flight CPI 1.6466 -- Total Cycles 151295 ---- Thread 23 ---- PC 5: Stalled ----- 95692 in-flight CPI 1.5807 -- Total Cycles 151295 ---- Thread 24 ---- PC 5: Stalled ----- 87992 in-flight CPI 1.7191 -- Total Cycles 151295 ---- Thread 25 ---- PC 5: Stalled ----- 93265 in-flight CPI 1.6219 -- Total Cycles 151295 ---- Thread 26 ---- PC 5: Stalled ----- 92210 in-flight CPI 1.6405 -- Total Cycles 151295 ---- Thread 27 ---- PC 5: Stalled ----- 93495 in-flight CPI 1.6179 -- Total Cycles 151295 ---- Thread 28 ---- PC 5: Stalled ----- 91042 in-flight CPI 1.6615 -- Total Cycles 151295 ---- Thread 29 ---- PC 5: Stalled ----- 85149 in-flight CPI 1.7765 -- Total Cycles 151295 ---- Thread 30 ---- PC 5: Stalled ----- 92507 in-flight CPI 1.6352 -- Total Cycles 151295 ---- Thread 31 ---- PC 5: Stalled ----- 91583 in-flight CPI 1.6516 -- Total Cycles 151295 Total CPI 0.0497 , IPC 20.1036 -- Total Cycles 151295 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8516 (4.012420%) FPSUB: 0 (0.000000%) FPMUL: 33120 (15.604901%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 80904 (38.118931%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4005 (1.887006%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 77896 (36.701672%) DIV: 7539 (3.552094%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.122973%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3342766 total) ADD%: 7.168 (239609) SUB%: 0.000 (0) MUL%: 0.006 (204) BITOR%: 1.522 (50865) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.593 (19837) FPSUB%: 0.000 (0) FPMUL%: 4.894 (163593) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (612) FPMAX%: 0.018 (612) LOAD%: 5.196 (173681) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (236) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (579) FPINV%: 0.000 (0) FPCONV%: 0.019 (644) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.080 (36086) FPLE%: 0.451 (15068) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (612) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.770 (92579) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.764 (25544) CMPU%: 0.000 (0) RSUB%: 0.006 (204) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.637 (522712) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (39099) ORI%: 1.594 (53283) XORI%: 0.000 (0) MULI%: 3.167 (105872) LW%: 1.380 (46122) LWI%: 13.013 (434990) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.284 (9507) SWI%: 4.115 (137540) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.383 (46229) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10351) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2024) bned%: 0.000 (0) bneid%: 13.755 (459799) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.708 (23668) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.133 (4446) DIV%: 0.012 (408) FPUN%: 1.463 (48920) FPRSUB%: 4.313 (144190) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (65) FPGT%: 2.928 (97883) FPGE%: 1.013 (33852) SYNC%: 0.000 (0) NOP%: 9.009 (301145) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 20 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 151 FPSUB 0 FPMUL 5 FPCMPLT 0 FPMIN 0 FPMAX 395 LOAD 40526 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 20 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1641 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48860 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 12179 XORI 0 MULI 8959 LW 0 LWI 141985 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 83 DIV 20 FPUN 0 FPRSUB 46 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.1038 --Total thread-cycles: 4841440 --total thread-cycles issued: 3041621 (62.824718%) --iCache conflicts: 111296 (2.298820%) --thread*cycles of FU dependence: 254944 (5.265872%) --thread*cycles of data dependence: 212241 (4.383841%) --iCache cycles*banks: 4841440 (69.045532% used) Issue breakdown: --thread*cycles of issue worked: 3041621 (62.824718%) --thread*cycles of issue failed: 1498674 (30.955130%) --thread*cycles of issue NOP/other: 4611686018427689049 (95254422224896.000000%) Number of thread-cycles not ready: 212241 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3342766 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 9 3: 8 4: 9 5: 7 6: 8 7: 7 8: 8 9: 7 10: 7 11: 8 12: 8 13: 8 14: 9 15: 8 16: 6 17: 6 18: 5 19: 6 20: 7 21: 6 22: 7 23: 8 24: 7 25: 7 26: 7 27: 8 28: 8 29: 6 30: 8 31: 8 <=== Core 60 ===> ---- Thread 00 ---- PC 5: Stalled ----- 105152 in-flight CPI 1.2331 -- Total Cycles 129689 ---- Thread 01 ---- PC 5: Stalled ----- 97704 in-flight CPI 1.3271 -- Total Cycles 129689 ---- Thread 02 ---- PC 5: Stalled ----- 94762 in-flight CPI 1.3683 -- Total Cycles 129689 ---- Thread 03 ---- PC 5: Stalled ----- 94540 in-flight CPI 1.3716 -- Total Cycles 129689 ---- Thread 04 ---- PC 5: Stalled ----- 100300 in-flight CPI 1.2928 -- Total Cycles 129689 ---- Thread 05 ---- PC 5: Stalled ----- 96568 in-flight CPI 1.3427 -- Total Cycles 129689 ---- Thread 06 ---- PC 5: Stalled ----- 97546 in-flight CPI 1.3293 -- Total Cycles 129689 ---- Thread 07 ---- PC 5: Stalled ----- 96232 in-flight CPI 1.3474 -- Total Cycles 129689 ---- Thread 08 ---- PC 5: Stalled ----- 97447 in-flight CPI 1.3306 -- Total Cycles 129689 ---- Thread 09 ---- PC 5: Stalled ----- 102933 in-flight CPI 1.2597 -- Total Cycles 129689 ---- Thread 10 ---- PC 5: Stalled ----- 95926 in-flight CPI 1.3517 -- Total Cycles 129689 ---- Thread 11 ---- PC 5: Stalled ----- 101987 in-flight CPI 1.2713 -- Total Cycles 129689 ---- Thread 12 ---- PC 5: Stalled ----- 95057 in-flight CPI 1.3641 -- Total Cycles 129689 ---- Thread 13 ---- PC 5: Stalled ----- 96686 in-flight CPI 1.3410 -- Total Cycles 129689 ---- Thread 14 ---- PC 5: Stalled ----- 93242 in-flight CPI 1.3906 -- Total Cycles 129689 ---- Thread 15 ---- PC 5: Stalled ----- 97714 in-flight CPI 1.3271 -- Total Cycles 129689 ---- Thread 16 ---- PC 5: Stalled ----- 97088 in-flight CPI 1.3356 -- Total Cycles 129689 ---- Thread 17 ---- PC 5: Stalled ----- 95800 in-flight CPI 1.3535 -- Total Cycles 129689 ---- Thread 18 ---- PC 5: Stalled ----- 94647 in-flight CPI 1.3700 -- Total Cycles 129689 ---- Thread 19 ---- PC 5: Stalled ----- 93164 in-flight CPI 1.3919 -- Total Cycles 129689 ---- Thread 20 ---- PC 5: Stalled ----- 89046 in-flight CPI 1.4562 -- Total Cycles 129689 ---- Thread 21 ---- PC 5: Stalled ----- 94913 in-flight CPI 1.3662 -- Total Cycles 129689 ---- Thread 22 ---- PC 5: Stalled ----- 95169 in-flight CPI 1.3625 -- Total Cycles 129689 ---- Thread 23 ---- PC 5: Stalled ----- 97459 in-flight CPI 1.3305 -- Total Cycles 129689 ---- Thread 24 ---- PC 5: Stalled ----- 93506 in-flight CPI 1.3867 -- Total Cycles 129689 ---- Thread 25 ---- PC 5: Stalled ----- 87941 in-flight CPI 1.4745 -- Total Cycles 129689 ---- Thread 26 ---- PC 5: Stalled ----- 95829 in-flight CPI 1.3531 -- Total Cycles 129689 ---- Thread 27 ---- PC 5: Stalled ----- 84921 in-flight CPI 1.5269 -- Total Cycles 129689 ---- Thread 28 ---- PC 5: Stalled ----- 94751 in-flight CPI 1.3685 -- Total Cycles 129689 ---- Thread 29 ---- PC 5: Stalled ----- 91388 in-flight CPI 1.4189 -- Total Cycles 129689 ---- Thread 30 ---- PC 5: Stalled ----- 90448 in-flight CPI 1.4336 -- Total Cycles 129689 ---- Thread 31 ---- PC 5: Stalled ----- 84784 in-flight CPI 1.5293 -- Total Cycles 129689 Total CPI 0.0426 , IPC 23.4809 -- Total Cycles 129689 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7693 (4.208125%) FPSUB: 0 (0.000000%) FPMUL: 31538 (17.251509%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 60823 (33.270611%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4305 (2.354865%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70545 (38.588612%) DIV: 7646 (4.182416%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.143863%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3347044 total) ADD%: 7.154 (239445) SUB%: 0.000 (0) MUL%: 0.006 (207) BITOR%: 1.526 (51071) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (18160) FPSUB%: 0.000 (0) FPMUL%: 4.746 (158857) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (621) FPMAX%: 0.019 (621) LOAD%: 5.131 (171733) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (239) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (600) FPINV%: 0.000 (0) FPCONV%: 0.020 (653) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.058 (35426) FPLE%: 0.453 (15164) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (621) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.817 (94272) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.741 (24805) CMPU%: 0.000 (0) RSUB%: 0.006 (207) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.688 (525089) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.175 (39328) ORI%: 1.563 (52317) XORI%: 0.000 (0) MULI%: 3.214 (107560) LW%: 1.404 (47009) LWI%: 13.134 (439617) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9637) SWI%: 4.157 (139149) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.408 (47131) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10379) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1931) bned%: 0.000 (0) bneid%: 13.801 (461934) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24136) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (4011) DIV%: 0.012 (414) FPUN%: 1.481 (49576) FPRSUB%: 4.188 (140190) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (77) FPGT%: 2.945 (98576) FPGE%: 1.028 (34412) SYNC%: 0.000 (0) NOP%: 9.016 (301773) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 30 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 151 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 38945 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1299 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49403 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 10953 XORI 0 MULI 9431 LW 0 LWI 143053 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 65 DIV 24 FPUN 0 FPRSUB 40 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.4812 --Total thread-cycles: 4150048 --total thread-cycles issued: 3045271 (73.379173%) --iCache conflicts: 115597 (2.785438%) --thread*cycles of FU dependence: 253880 (6.117520%) --thread*cycles of data dependence: 182813 (4.405082%) --iCache cycles*banks: 4150048 (80.651505% used) Issue breakdown: --thread*cycles of issue worked: 3045271 (73.379173%) --thread*cycles of issue failed: 803004 (19.349270%) --thread*cycles of issue NOP/other: 1006724204347694 (24258134016.000000%) Number of thread-cycles not ready: 182813 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3347044 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 7 4: 7 5: 8 6: 8 7: 8 8: 7 9: 9 10: 7 11: 10 12: 7 13: 9 14: 8 15: 6 16: 7 17: 8 18: 7 19: 6 20: 7 21: 7 22: 6 23: 7 24: 8 25: 7 26: 8 27: 7 28: 8 29: 7 30: 7 31: 7 <=== Core 61 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103581 in-flight CPI 1.2536 -- Total Cycles 129871 ---- Thread 01 ---- PC 5: Stalled ----- 98491 in-flight CPI 1.3184 -- Total Cycles 129871 ---- Thread 02 ---- PC 5: Stalled ----- 94446 in-flight CPI 1.3748 -- Total Cycles 129871 ---- Thread 03 ---- PC 5: Stalled ----- 103925 in-flight CPI 1.2494 -- Total Cycles 129871 ---- Thread 04 ---- PC 5: Stalled ----- 96555 in-flight CPI 1.3448 -- Total Cycles 129871 ---- Thread 05 ---- PC 5: Stalled ----- 98440 in-flight CPI 1.3190 -- Total Cycles 129871 ---- Thread 06 ---- PC 5: Stalled ----- 95683 in-flight CPI 1.3571 -- Total Cycles 129871 ---- Thread 07 ---- PC 5: Stalled ----- 101274 in-flight CPI 1.2821 -- Total Cycles 129871 ---- Thread 08 ---- PC 5: Stalled ----- 98978 in-flight CPI 1.3118 -- Total Cycles 129871 ---- Thread 09 ---- PC 5: Stalled ----- 103990 in-flight CPI 1.2486 -- Total Cycles 129871 ---- Thread 10 ---- PC 5: Stalled ----- 97072 in-flight CPI 1.3376 -- Total Cycles 129871 ---- Thread 11 ---- PC 5: Stalled ----- 93178 in-flight CPI 1.3936 -- Total Cycles 129871 ---- Thread 12 ---- PC 5: Stalled ----- 88486 in-flight CPI 1.4675 -- Total Cycles 129871 ---- Thread 13 ---- PC 5: Stalled ----- 98731 in-flight CPI 1.3151 -- Total Cycles 129871 ---- Thread 14 ---- PC 5: Stalled ----- 94411 in-flight CPI 1.3754 -- Total Cycles 129871 ---- Thread 15 ---- PC 5: Stalled ----- 96673 in-flight CPI 1.3431 -- Total Cycles 129871 ---- Thread 16 ---- PC 5: Stalled ----- 96687 in-flight CPI 1.3429 -- Total Cycles 129871 ---- Thread 17 ---- PC 5: Stalled ----- 95834 in-flight CPI 1.3549 -- Total Cycles 129871 ---- Thread 18 ---- PC 5: Stalled ----- 98179 in-flight CPI 1.3225 -- Total Cycles 129871 ---- Thread 19 ---- PC 5: Stalled ----- 91976 in-flight CPI 1.4118 -- Total Cycles 129871 ---- Thread 20 ---- PC 5: Stalled ----- 90564 in-flight CPI 1.4338 -- Total Cycles 129871 ---- Thread 21 ---- PC 5: Stalled ----- 95265 in-flight CPI 1.3630 -- Total Cycles 129871 ---- Thread 22 ---- PC 5: Stalled ----- 93629 in-flight CPI 1.3868 -- Total Cycles 129871 ---- Thread 23 ---- PC 5: Stalled ----- 90275 in-flight CPI 1.4384 -- Total Cycles 129871 ---- Thread 24 ---- PC 5: Stalled ----- 93302 in-flight CPI 1.3917 -- Total Cycles 129871 ---- Thread 25 ---- PC 5: Stalled ----- 87821 in-flight CPI 1.4785 -- Total Cycles 129871 ---- Thread 26 ---- PC 5: Stalled ----- 91419 in-flight CPI 1.4203 -- Total Cycles 129871 ---- Thread 27 ---- PC 5: Stalled ----- 87909 in-flight CPI 1.4770 -- Total Cycles 129871 ---- Thread 28 ---- PC 5: Stalled ----- 85027 in-flight CPI 1.5272 -- Total Cycles 129871 ---- Thread 29 ---- PC 5: Stalled ----- 88632 in-flight CPI 1.4650 -- Total Cycles 129871 ---- Thread 30 ---- PC 5: Stalled ----- 88290 in-flight CPI 1.4706 -- Total Cycles 129871 ---- Thread 31 ---- PC 5: Stalled ----- 91655 in-flight CPI 1.4167 -- Total Cycles 129871 Total CPI 0.0428 , IPC 23.3381 -- Total Cycles 129871 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8198 (4.295835%) FPSUB: 0 (0.000000%) FPMUL: 32395 (16.975309%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 63155 (33.093861%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4246 (2.224947%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 74896 (39.246265%) DIV: 7683 (4.025970%) FPUN: 0 (0.000000%) FPRSUB: 263 (0.137815%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3331360 total) ADD%: 7.144 (238009) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.518 (50582) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.573 (19091) FPSUB%: 0.000 (0) FPMUL%: 4.839 (161191) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.173 (172341) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (597) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.074 (35769) FPLE%: 0.453 (15090) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.788 (92893) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.753 (25082) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.657 (521588) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (39017) ORI%: 1.569 (52285) XORI%: 0.000 (0) MULI%: 3.189 (106232) LW%: 1.390 (46298) LWI%: 13.082 (435797) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9521) SWI%: 4.128 (137526) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.393 (46415) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10300) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1971) bned%: 0.000 (0) bneid%: 13.779 (459031) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.710 (23636) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.128 (4266) DIV%: 0.012 (416) FPUN%: 1.466 (48847) FPRSUB%: 4.267 (142140) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (68) FPGT%: 2.941 (97966) FPGE%: 1.013 (33757) SYNC%: 0.000 (0) NOP%: 9.016 (300358) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 155 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 40725 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1326 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 6 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48953 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 9 ORI 11632 XORI 0 MULI 9022 LW 0 LWI 142208 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 88 DIV 27 FPUN 0 FPRSUB 49 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.3384 --Total thread-cycles: 4155872 --total thread-cycles issued: 3031002 (72.932991%) --iCache conflicts: 113674 (2.735262%) --thread*cycles of FU dependence: 254683 (6.128269%) --thread*cycles of data dependence: 190836 (4.591960%) --iCache cycles*banks: 4155872 (80.161079% used) Issue breakdown: --thread*cycles of issue worked: 3031002 (72.932991%) --thread*cycles of issue failed: 824512 (19.839687%) --thread*cycles of issue NOP/other: 4622452549029762374 (111227019722752.000000%) Number of thread-cycles not ready: 190836 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3331360 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 7 3: 8 4: 7 5: 9 6: 7 7: 8 8: 9 9: 8 10: 8 11: 7 12: 6 13: 8 14: 7 15: 8 16: 8 17: 7 18: 8 19: 7 20: 7 21: 8 22: 7 23: 7 24: 8 25: 7 26: 8 27: 8 28: 6 29: 7 30: 8 31: 7 <=== Core 62 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98136 in-flight CPI 1.2972 -- Total Cycles 127329 ---- Thread 01 ---- PC 5: Stalled ----- 93616 in-flight CPI 1.3598 -- Total Cycles 127329 ---- Thread 02 ---- PC 5: Stalled ----- 101465 in-flight CPI 1.2546 -- Total Cycles 127329 ---- Thread 03 ---- PC 5: Stalled ----- 103907 in-flight CPI 1.2252 -- Total Cycles 127329 ---- Thread 04 ---- PC 5: Stalled ----- 98238 in-flight CPI 1.2959 -- Total Cycles 127329 ---- Thread 05 ---- PC 5: Stalled ----- 99765 in-flight CPI 1.2760 -- Total Cycles 127329 ---- Thread 06 ---- PC 5: Stalled ----- 97374 in-flight CPI 1.3074 -- Total Cycles 127329 ---- Thread 07 ---- PC 5: Stalled ----- 93925 in-flight CPI 1.3555 -- Total Cycles 127329 ---- Thread 08 ---- PC 5: Stalled ----- 97816 in-flight CPI 1.3015 -- Total Cycles 127329 ---- Thread 09 ---- PC 5: Stalled ----- 101098 in-flight CPI 1.2593 -- Total Cycles 127329 ---- Thread 10 ---- PC 5: Stalled ----- 97107 in-flight CPI 1.3110 -- Total Cycles 127329 ---- Thread 11 ---- PC 5: Stalled ----- 97087 in-flight CPI 1.3112 -- Total Cycles 127329 ---- Thread 12 ---- PC 5: Stalled ----- 97365 in-flight CPI 1.3076 -- Total Cycles 127329 ---- Thread 13 ---- PC 5: Stalled ----- 95396 in-flight CPI 1.3345 -- Total Cycles 127329 ---- Thread 14 ---- PC 5: Stalled ----- 90348 in-flight CPI 1.4091 -- Total Cycles 127329 ---- Thread 15 ---- PC 5: Stalled ----- 99473 in-flight CPI 1.2798 -- Total Cycles 127329 ---- Thread 16 ---- PC 5: Stalled ----- 96641 in-flight CPI 1.3173 -- Total Cycles 127329 ---- Thread 17 ---- PC 5: Stalled ----- 95855 in-flight CPI 1.3281 -- Total Cycles 127329 ---- Thread 18 ---- PC 5: Stalled ----- 88772 in-flight CPI 1.4341 -- Total Cycles 127329 ---- Thread 19 ---- PC 5: Stalled ----- 93755 in-flight CPI 1.3579 -- Total Cycles 127329 ---- Thread 20 ---- PC 5: Stalled ----- 89576 in-flight CPI 1.4212 -- Total Cycles 127329 ---- Thread 21 ---- PC 5: Stalled ----- 89344 in-flight CPI 1.4249 -- Total Cycles 127329 ---- Thread 22 ---- PC 5: Stalled ----- 91469 in-flight CPI 1.3918 -- Total Cycles 127329 ---- Thread 23 ---- PC 5: Stalled ----- 89397 in-flight CPI 1.4241 -- Total Cycles 127329 ---- Thread 24 ---- PC 5: Stalled ----- 93324 in-flight CPI 1.3641 -- Total Cycles 127329 ---- Thread 25 ---- PC 5: Stalled ----- 93409 in-flight CPI 1.3629 -- Total Cycles 127329 ---- Thread 26 ---- PC 5: Stalled ----- 89713 in-flight CPI 1.4190 -- Total Cycles 127329 ---- Thread 27 ---- PC 5: Stalled ----- 89977 in-flight CPI 1.4148 -- Total Cycles 127329 ---- Thread 28 ---- PC 5: Stalled ----- 91908 in-flight CPI 1.3851 -- Total Cycles 127329 ---- Thread 29 ---- PC 5: Stalled ----- 85025 in-flight CPI 1.4973 -- Total Cycles 127329 ---- Thread 30 ---- PC 5: Stalled ----- 86440 in-flight CPI 1.4728 -- Total Cycles 127329 ---- Thread 31 ---- PC 5: Stalled ----- 82148 in-flight CPI 1.5497 -- Total Cycles 127329 Total CPI 0.0423 , IPC 23.6351 -- Total Cycles 127329 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7394 (3.800585%) FPSUB: 0 (0.000000%) FPMUL: 30896 (15.880832%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 76403 (39.271854%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4181 (2.149073%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67807 (34.853432%) DIV: 7610 (3.911611%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.132614%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3308076 total) ADD%: 7.229 (239128) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.521 (50303) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.531 (17553) FPSUB%: 0.000 (0) FPMUL%: 4.712 (155875) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.112 (169125) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (590) FPINV%: 0.000 (0) FPCONV%: 0.020 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.056 (34919) FPLE%: 0.456 (15069) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.819 (93241) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.738 (24422) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.697 (519269) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.176 (38895) ORI%: 1.545 (51125) XORI%: 0.000 (0) MULI%: 3.217 (106424) LW%: 1.405 (46469) LWI%: 13.146 (434887) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9558) SWI%: 4.161 (137658) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.408 (46583) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10307) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.054 (1796) bned%: 0.000 (0) bneid%: 13.816 (457055) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (23709) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.116 (3850) DIV%: 0.012 (412) FPUN%: 1.476 (48828) FPRSUB%: 4.155 (137435) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.960 (97921) FPGE%: 1.021 (33759) SYNC%: 0.000 (0) NOP%: 9.026 (298589) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 156 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 38762 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1473 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 10 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49002 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10498 XORI 0 MULI 9772 LW 0 LWI 141550 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 93 DIV 25 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6353 --Total thread-cycles: 4074528 --total thread-cycles issued: 3009487 (73.861000%) --iCache conflicts: 112188 (2.753399%) --thread*cycles of FU dependence: 251877 (6.181746%) --thread*cycles of data dependence: 194549 (4.774762%) --iCache cycles*banks: 4074528 (81.189964% used) Issue breakdown: --thread*cycles of issue worked: 3009487 (73.861000%) --thread*cycles of issue failed: 766452 (18.810818%) --thread*cycles of issue NOP/other: 18039949270467061 (442749452288.000000%) Number of thread-cycles not ready: 194549 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3308076 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 9 4: 8 5: 8 6: 7 7: 6 8: 8 9: 7 10: 8 11: 9 12: 6 13: 8 14: 7 15: 8 16: 8 17: 8 18: 6 19: 7 20: 7 21: 7 22: 7 23: 6 24: 8 25: 6 26: 8 27: 8 28: 8 29: 7 30: 7 31: 6 <=== Core 63 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102138 in-flight CPI 1.3679 -- Total Cycles 139742 ---- Thread 01 ---- PC 5: Stalled ----- 99518 in-flight CPI 1.4040 -- Total Cycles 139742 ---- Thread 02 ---- PC 5: Stalled ----- 92539 in-flight CPI 1.5098 -- Total Cycles 139742 ---- Thread 03 ---- PC 5: Stalled ----- 98900 in-flight CPI 1.4127 -- Total Cycles 139742 ---- Thread 04 ---- PC 5: Stalled ----- 98931 in-flight CPI 1.4122 -- Total Cycles 139742 ---- Thread 05 ---- PC 5: Stalled ----- 103306 in-flight CPI 1.3524 -- Total Cycles 139742 ---- Thread 06 ---- PC 5: Stalled ----- 101954 in-flight CPI 1.3704 -- Total Cycles 139742 ---- Thread 07 ---- PC 5: Stalled ----- 94748 in-flight CPI 1.4747 -- Total Cycles 139742 ---- Thread 08 ---- PC 5: Stalled ----- 100044 in-flight CPI 1.3965 -- Total Cycles 139742 ---- Thread 09 ---- PC 5: Stalled ----- 98657 in-flight CPI 1.4162 -- Total Cycles 139742 ---- Thread 10 ---- PC 5: Stalled ----- 91753 in-flight CPI 1.5228 -- Total Cycles 139742 ---- Thread 11 ---- PC 5: Stalled ----- 108367 in-flight CPI 1.2893 -- Total Cycles 139742 ---- Thread 12 ---- PC 5: Stalled ----- 104499 in-flight CPI 1.3370 -- Total Cycles 139742 ---- Thread 13 ---- PC 5: Stalled ----- 96127 in-flight CPI 1.4534 -- Total Cycles 139742 ---- Thread 14 ---- PC 5: Stalled ----- 92266 in-flight CPI 1.5143 -- Total Cycles 139742 ---- Thread 15 ---- PC 5: Stalled ----- 101226 in-flight CPI 1.3802 -- Total Cycles 139742 ---- Thread 16 ---- PC 5: Stalled ----- 94050 in-flight CPI 1.4855 -- Total Cycles 139742 ---- Thread 17 ---- PC 5: Stalled ----- 97897 in-flight CPI 1.4272 -- Total Cycles 139742 ---- Thread 18 ---- PC 5: Stalled ----- 94297 in-flight CPI 1.4816 -- Total Cycles 139742 ---- Thread 19 ---- PC 5: Stalled ----- 94977 in-flight CPI 1.4710 -- Total Cycles 139742 ---- Thread 20 ---- PC 5: Stalled ----- 94782 in-flight CPI 1.4741 -- Total Cycles 139742 ---- Thread 21 ---- PC 5: Stalled ----- 89679 in-flight CPI 1.5579 -- Total Cycles 139742 ---- Thread 22 ---- PC 5: Stalled ----- 93001 in-flight CPI 1.5022 -- Total Cycles 139742 ---- Thread 23 ---- PC 5: Stalled ----- 86654 in-flight CPI 1.6124 -- Total Cycles 139742 ---- Thread 24 ---- PC 5: Stalled ----- 92067 in-flight CPI 1.5176 -- Total Cycles 139742 ---- Thread 25 ---- PC 5: Stalled ----- 88203 in-flight CPI 1.5840 -- Total Cycles 139742 ---- Thread 26 ---- PC 5: Stalled ----- 92239 in-flight CPI 1.5147 -- Total Cycles 139742 ---- Thread 27 ---- PC 5: Stalled ----- 91472 in-flight CPI 1.5274 -- Total Cycles 139742 ---- Thread 28 ---- PC 5: Stalled ----- 91768 in-flight CPI 1.5225 -- Total Cycles 139742 ---- Thread 29 ---- PC 5: Stalled ----- 89627 in-flight CPI 1.5589 -- Total Cycles 139742 ---- Thread 30 ---- PC 5: Stalled ----- 92574 in-flight CPI 1.5093 -- Total Cycles 139742 ---- Thread 31 ---- PC 5: Stalled ----- 88583 in-flight CPI 1.5772 -- Total Cycles 139742 Total CPI 0.0457 , IPC 21.8790 -- Total Cycles 139742 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7731 (3.923529%) FPSUB: 0 (0.000000%) FPMUL: 31680 (16.077791%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73973 (37.541740%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4223 (2.143198%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71459 (36.265869%) DIV: 7715 (3.915409%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.132459%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3360679 total) ADD%: 7.178 (241233) SUB%: 0.000 (0) MUL%: 0.006 (209) BITOR%: 1.520 (51099) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.543 (18260) FPSUB%: 0.000 (0) FPMUL%: 4.752 (159709) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (627) FPMAX%: 0.019 (627) LOAD%: 5.142 (172812) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (241) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (597) FPINV%: 0.000 (0) FPCONV%: 0.020 (659) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.060 (35607) FPLE%: 0.453 (15235) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (627) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.814 (94579) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.740 (24871) CMPU%: 0.000 (0) RSUB%: 0.006 (209) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.681 (526998) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (39369) ORI%: 1.560 (52420) XORI%: 0.000 (0) MULI%: 3.211 (107898) LW%: 1.403 (47146) LWI%: 13.124 (441048) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9685) SWI%: 4.147 (139375) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.406 (47261) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10457) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2077) bned%: 0.000 (0) bneid%: 13.792 (463503) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (24159) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.121 (4064) DIV%: 0.012 (418) FPUN%: 1.478 (49687) FPRSUB%: 4.198 (141074) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.946 (99010) FPGE%: 1.025 (34452) SYNC%: 0.000 (0) NOP%: 9.022 (303209) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 14 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 159 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 407 LOAD 40514 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1618 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49664 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 11017 XORI 0 MULI 9803 LW 0 LWI 143676 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 80 DIV 32 FPUN 0 FPRSUB 45 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.8792 --Total thread-cycles: 4471744 --total thread-cycles issued: 3057470 (68.373100%) --iCache conflicts: 113001 (2.527001%) --thread*cycles of FU dependence: 257104 (5.749524%) --thread*cycles of data dependence: 197042 (4.406379%) --iCache cycles*banks: 4471744 (75.154373% used) Issue breakdown: --thread*cycles of issue worked: 3057470 (68.373100%) --thread*cycles of issue failed: 1111065 (24.846346%) --thread*cycles of issue NOP/other: 97122818095505539 (2171922677760.000000%) Number of thread-cycles not ready: 197042 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3360679 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 7 2: 7 3: 8 4: 8 5: 8 6: 8 7: 6 8: 9 9: 8 10: 7 11: 7 12: 9 13: 8 14: 7 15: 8 16: 8 17: 8 18: 8 19: 8 20: 7 21: 8 22: 9 23: 6 24: 6 25: 7 26: 8 27: 7 28: 7 29: 6 30: 7 31: 7 <=== Core 64 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96411 in-flight CPI 1.3653 -- Total Cycles 131648 ---- Thread 01 ---- PC 5: Stalled ----- 98030 in-flight CPI 1.3427 -- Total Cycles 131648 ---- Thread 02 ---- PC 5: Stalled ----- 99019 in-flight CPI 1.3292 -- Total Cycles 131648 ---- Thread 03 ---- PC 5: Stalled ----- 99677 in-flight CPI 1.3205 -- Total Cycles 131648 ---- Thread 04 ---- PC 5: Stalled ----- 97494 in-flight CPI 1.3501 -- Total Cycles 131648 ---- Thread 05 ---- PC 5: Stalled ----- 93447 in-flight CPI 1.4086 -- Total Cycles 131648 ---- Thread 06 ---- PC 5: Stalled ----- 96327 in-flight CPI 1.3664 -- Total Cycles 131648 ---- Thread 07 ---- PC 5: Stalled ----- 94683 in-flight CPI 1.3902 -- Total Cycles 131648 ---- Thread 08 ---- PC 5: Stalled ----- 96348 in-flight CPI 1.3661 -- Total Cycles 131648 ---- Thread 09 ---- PC 5: Stalled ----- 91972 in-flight CPI 1.4312 -- Total Cycles 131648 ---- Thread 10 ---- PC 5: Stalled ----- 99361 in-flight CPI 1.3247 -- Total Cycles 131648 ---- Thread 11 ---- PC 5: Stalled ----- 93726 in-flight CPI 1.4044 -- Total Cycles 131648 ---- Thread 12 ---- PC 5: Stalled ----- 94033 in-flight CPI 1.3998 -- Total Cycles 131648 ---- Thread 13 ---- PC 5: Stalled ----- 93243 in-flight CPI 1.4115 -- Total Cycles 131648 ---- Thread 14 ---- PC 5: Stalled ----- 94804 in-flight CPI 1.3884 -- Total Cycles 131648 ---- Thread 15 ---- PC 5: Stalled ----- 97100 in-flight CPI 1.3556 -- Total Cycles 131648 ---- Thread 16 ---- PC 5: Stalled ----- 103067 in-flight CPI 1.2770 -- Total Cycles 131648 ---- Thread 17 ---- PC 5: Stalled ----- 96109 in-flight CPI 1.3695 -- Total Cycles 131648 ---- Thread 18 ---- PC 5: Stalled ----- 96430 in-flight CPI 1.3649 -- Total Cycles 131648 ---- Thread 19 ---- PC 5: Stalled ----- 100267 in-flight CPI 1.3127 -- Total Cycles 131648 ---- Thread 20 ---- PC 5: Stalled ----- 93520 in-flight CPI 1.4075 -- Total Cycles 131648 ---- Thread 21 ---- PC 5: Stalled ----- 91522 in-flight CPI 1.4382 -- Total Cycles 131648 ---- Thread 22 ---- PC 5: Stalled ----- 92885 in-flight CPI 1.4171 -- Total Cycles 131648 ---- Thread 23 ---- PC 5: Stalled ----- 94688 in-flight CPI 1.3902 -- Total Cycles 131648 ---- Thread 24 ---- PC 5: Stalled ----- 93160 in-flight CPI 1.4129 -- Total Cycles 131648 ---- Thread 25 ---- PC 5: Stalled ----- 89470 in-flight CPI 1.4712 -- Total Cycles 131648 ---- Thread 26 ---- PC 5: Stalled ----- 91375 in-flight CPI 1.4404 -- Total Cycles 131648 ---- Thread 27 ---- PC 5: Stalled ----- 95505 in-flight CPI 1.3781 -- Total Cycles 131648 ---- Thread 28 ---- PC 5: Stalled ----- 87691 in-flight CPI 1.5010 -- Total Cycles 131648 ---- Thread 29 ---- PC 5: Stalled ----- 94069 in-flight CPI 1.3992 -- Total Cycles 131648 ---- Thread 30 ---- PC 5: Stalled ----- 83311 in-flight CPI 1.5799 -- Total Cycles 131648 ---- Thread 31 ---- PC 5: Stalled ----- 83697 in-flight CPI 1.5727 -- Total Cycles 131648 Total CPI 0.0435 , IPC 22.9626 -- Total Cycles 131648 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8305 (3.923894%) FPSUB: 0 (0.000000%) FPMUL: 32544 (15.376184%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 83338 (39.375011%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4135 (1.953679%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75687 (35.760117%) DIV: 7384 (3.488745%) FPUN: 0 (0.000000%) FPRSUB: 259 (0.122371%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3322401 total) ADD%: 7.118 (236488) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.519 (50475) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.582 (19322) FPSUB%: 0.000 (0) FPMUL%: 4.864 (161585) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.199 (172734) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (579) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.073 (35641) FPLE%: 0.454 (15071) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.789 (92670) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.758 (25170) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.662 (520351) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (38881) ORI%: 1.583 (52580) XORI%: 0.000 (0) MULI%: 3.183 (105736) LW%: 1.391 (46204) LWI%: 13.047 (433470) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9478) SWI%: 4.120 (136868) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.394 (46319) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10295) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.065 (2158) bned%: 0.000 (0) bneid%: 13.754 (456948) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (23767) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4319) DIV%: 0.012 (400) FPUN%: 1.468 (48782) FPRSUB%: 4.292 (142589) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (67) FPGT%: 2.926 (97223) FPGE%: 1.015 (33711) SYNC%: 0.000 (0) NOP%: 9.010 (299360) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 24 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 152 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 389 LOAD 40948 INTCONV 0 ATOMIC_INC 22 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1693 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48606 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11865 XORI 0 MULI 9308 LW 0 LWI 141492 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 83 DIV 20 FPUN 0 FPRSUB 64 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.9629 --Total thread-cycles: 4212736 --total thread-cycles issued: 3023041 (71.759567%) --iCache conflicts: 112232 (2.664112%) --thread*cycles of FU dependence: 254716 (6.046332%) --thread*cycles of data dependence: 211652 (5.024098%) --iCache cycles*banks: 4212736 (78.866394% used) Issue breakdown: --thread*cycles of issue worked: 3023041 (71.759567%) --thread*cycles of issue failed: 890335 (21.134365%) --thread*cycles of issue NOP/other: 4499613 (106.809761%) Number of thread-cycles not ready: 211652 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3322401 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 5 1: 8 2: 9 3: 9 4: 8 5: 6 6: 7 7: 7 8: 9 9: 6 10: 7 11: 6 12: 7 13: 9 14: 7 15: 7 16: 9 17: 7 18: 8 19: 8 20: 7 21: 7 22: 6 23: 5 24: 7 25: 7 26: 8 27: 9 28: 6 29: 8 30: 7 31: 6 <=== Core 65 ===> ---- Thread 00 ---- PC 5: Stalled ----- 100284 in-flight CPI 1.4226 -- Total Cycles 142700 ---- Thread 01 ---- PC 5: Stalled ----- 94988 in-flight CPI 1.5020 -- Total Cycles 142700 ---- Thread 02 ---- PC 5: Stalled ----- 96295 in-flight CPI 1.4817 -- Total Cycles 142700 ---- Thread 03 ---- PC 5: Stalled ----- 95637 in-flight CPI 1.4918 -- Total Cycles 142700 ---- Thread 04 ---- PC 5: Stalled ----- 97779 in-flight CPI 1.4591 -- Total Cycles 142700 ---- Thread 05 ---- PC 5: Stalled ----- 95324 in-flight CPI 1.4967 -- Total Cycles 142700 ---- Thread 06 ---- PC 5: Stalled ----- 100964 in-flight CPI 1.4131 -- Total Cycles 142700 ---- Thread 07 ---- PC 5: Stalled ----- 95905 in-flight CPI 1.4876 -- Total Cycles 142700 ---- Thread 08 ---- PC 5: Stalled ----- 98249 in-flight CPI 1.4521 -- Total Cycles 142700 ---- Thread 09 ---- PC 5: Stalled ----- 97346 in-flight CPI 1.4657 -- Total Cycles 142700 ---- Thread 10 ---- PC 5: Stalled ----- 100920 in-flight CPI 1.4137 -- Total Cycles 142700 ---- Thread 11 ---- PC 5: Stalled ----- 97541 in-flight CPI 1.4628 -- Total Cycles 142700 ---- Thread 12 ---- PC 5: Stalled ----- 100501 in-flight CPI 1.4196 -- Total Cycles 142700 ---- Thread 13 ---- PC 5: Stalled ----- 96958 in-flight CPI 1.4714 -- Total Cycles 142700 ---- Thread 14 ---- PC 5: Stalled ----- 97238 in-flight CPI 1.4672 -- Total Cycles 142700 ---- Thread 15 ---- PC 5: Stalled ----- 100957 in-flight CPI 1.4132 -- Total Cycles 142700 ---- Thread 16 ---- PC 5: Stalled ----- 93974 in-flight CPI 1.5182 -- Total Cycles 142700 ---- Thread 17 ---- PC 5: Stalled ----- 89620 in-flight CPI 1.5921 -- Total Cycles 142700 ---- Thread 18 ---- PC 5: Stalled ----- 95660 in-flight CPI 1.4914 -- Total Cycles 142700 ---- Thread 19 ---- PC 5: Stalled ----- 98287 in-flight CPI 1.4515 -- Total Cycles 142700 ---- Thread 20 ---- PC 5: Stalled ----- 87968 in-flight CPI 1.6220 -- Total Cycles 142700 ---- Thread 21 ---- PC 5: Stalled ----- 93603 in-flight CPI 1.5243 -- Total Cycles 142700 ---- Thread 22 ---- PC 5: Stalled ----- 91979 in-flight CPI 1.5512 -- Total Cycles 142700 ---- Thread 23 ---- PC 5: Stalled ----- 90843 in-flight CPI 1.5706 -- Total Cycles 142700 ---- Thread 24 ---- PC 5: Stalled ----- 88528 in-flight CPI 1.6116 -- Total Cycles 142700 ---- Thread 25 ---- PC 5: Stalled ----- 95774 in-flight CPI 1.4897 -- Total Cycles 142700 ---- Thread 26 ---- PC 5: Stalled ----- 91329 in-flight CPI 1.5622 -- Total Cycles 142700 ---- Thread 27 ---- PC 5: Stalled ----- 89849 in-flight CPI 1.5879 -- Total Cycles 142700 ---- Thread 28 ---- PC 5: Stalled ----- 91298 in-flight CPI 1.5628 -- Total Cycles 142700 ---- Thread 29 ---- PC 5: Stalled ----- 93956 in-flight CPI 1.5185 -- Total Cycles 142700 ---- Thread 30 ---- PC 5: Stalled ----- 84425 in-flight CPI 1.6900 -- Total Cycles 142700 ---- Thread 31 ---- PC 5: Stalled ----- 97022 in-flight CPI 1.4707 -- Total Cycles 142700 Total CPI 0.0469 , IPC 21.3143 -- Total Cycles 142700 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8292 (4.064845%) FPSUB: 0 (0.000000%) FPMUL: 32671 (16.015745%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 75618 (37.068920%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4186 (2.052031%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 75505 (37.013523%) DIV: 7468 (3.660910%) FPUN: 0 (0.000000%) FPRSUB: 253 (0.124024%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3342524 total) ADD%: 7.182 (240071) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.513 (50574) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.583 (19492) FPSUB%: 0.000 (0) FPMUL%: 4.869 (162734) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.188 (173423) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (585) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.071 (35815) FPLE%: 0.453 (15157) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.789 (93212) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.756 (25263) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.658 (523388) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39160) ORI%: 1.573 (52590) XORI%: 0.000 (0) MULI%: 3.180 (106294) LW%: 1.391 (46485) LWI%: 13.049 (436181) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9523) SWI%: 4.122 (137781) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.394 (46602) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10312) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2045) bned%: 0.000 (0) bneid%: 13.747 (459504) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.711 (23773) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.129 (4307) DIV%: 0.012 (404) FPUN%: 1.461 (48820) FPRSUB%: 4.285 (143215) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.931 (97969) FPGE%: 1.007 (33663) SYNC%: 0.000 (0) NOP%: 9.003 (300917) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 34 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 150 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 390 LOAD 40173 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 19 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1424 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 7 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49123 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 11790 XORI 0 MULI 9201 LW 0 LWI 142465 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 77 DIV 16 FPUN 0 FPRSUB 44 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.3145 --Total thread-cycles: 4566400 --total thread-cycles issued: 3041607 (66.608421%) --iCache conflicts: 113168 (2.478276%) --thread*cycles of FU dependence: 254945 (5.583063%) --thread*cycles of data dependence: 203993 (4.467261%) --iCache cycles*banks: 4566400 (73.198929% used) Issue breakdown: --thread*cycles of issue worked: 3041607 (66.608421%) --thread*cycles of issue failed: 1223876 (26.801771%) --thread*cycles of issue NOP/other: 4632260264712840229 (101442278916096.000000%) Number of thread-cycles not ready: 203993 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3342524 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 7 3: 8 4: 8 5: 7 6: 8 7: 8 8: 9 9: 7 10: 8 11: 5 12: 9 13: 9 14: 8 15: 9 16: 8 17: 5 18: 8 19: 9 20: 5 21: 7 22: 6 23: 6 24: 8 25: 8 26: 7 27: 7 28: 6 29: 7 30: 6 31: 4 <=== Core 66 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99129 in-flight CPI 1.2989 -- Total Cycles 128787 ---- Thread 01 ---- PC 5: Stalled ----- 95997 in-flight CPI 1.3413 -- Total Cycles 128787 ---- Thread 02 ---- PC 5: Stalled ----- 94958 in-flight CPI 1.3560 -- Total Cycles 128787 ---- Thread 03 ---- PC 5: Stalled ----- 95966 in-flight CPI 1.3417 -- Total Cycles 128787 ---- Thread 04 ---- PC 5: Stalled ----- 100659 in-flight CPI 1.2792 -- Total Cycles 128787 ---- Thread 05 ---- PC 5: Stalled ----- 94933 in-flight CPI 1.3563 -- Total Cycles 128787 ---- Thread 06 ---- PC 5: Stalled ----- 98173 in-flight CPI 1.3116 -- Total Cycles 128787 ---- Thread 07 ---- PC 5: Stalled ----- 97392 in-flight CPI 1.3221 -- Total Cycles 128787 ---- Thread 08 ---- PC 5: Stalled ----- 96447 in-flight CPI 1.3350 -- Total Cycles 128787 ---- Thread 09 ---- PC 5: Stalled ----- 96106 in-flight CPI 1.3398 -- Total Cycles 128787 ---- Thread 10 ---- PC 5: Stalled ----- 98038 in-flight CPI 1.3134 -- Total Cycles 128787 ---- Thread 11 ---- PC 5: Stalled ----- 98287 in-flight CPI 1.3101 -- Total Cycles 128787 ---- Thread 12 ---- PC 5: Stalled ----- 99391 in-flight CPI 1.2955 -- Total Cycles 128787 ---- Thread 13 ---- PC 5: Stalled ----- 95724 in-flight CPI 1.3451 -- Total Cycles 128787 ---- Thread 14 ---- PC 5: Stalled ----- 99588 in-flight CPI 1.2930 -- Total Cycles 128787 ---- Thread 15 ---- PC 5: Stalled ----- 96028 in-flight CPI 1.3408 -- Total Cycles 128787 ---- Thread 16 ---- PC 5: Stalled ----- 91521 in-flight CPI 1.4070 -- Total Cycles 128787 ---- Thread 17 ---- PC 5: Stalled ----- 97351 in-flight CPI 1.3226 -- Total Cycles 128787 ---- Thread 18 ---- PC 5: Stalled ----- 92388 in-flight CPI 1.3937 -- Total Cycles 128787 ---- Thread 19 ---- PC 5: Stalled ----- 92499 in-flight CPI 1.3921 -- Total Cycles 128787 ---- Thread 20 ---- PC 5: Stalled ----- 95307 in-flight CPI 1.3511 -- Total Cycles 128787 ---- Thread 21 ---- PC 5: Stalled ----- 92912 in-flight CPI 1.3859 -- Total Cycles 128787 ---- Thread 22 ---- PC 5: Stalled ----- 91878 in-flight CPI 1.4015 -- Total Cycles 128787 ---- Thread 23 ---- PC 5: Stalled ----- 95002 in-flight CPI 1.3554 -- Total Cycles 128787 ---- Thread 24 ---- PC 5: Stalled ----- 96154 in-flight CPI 1.3391 -- Total Cycles 128787 ---- Thread 25 ---- PC 5: Stalled ----- 87346 in-flight CPI 1.4741 -- Total Cycles 128787 ---- Thread 26 ---- PC 5: Stalled ----- 93437 in-flight CPI 1.3781 -- Total Cycles 128787 ---- Thread 27 ---- PC 5: Stalled ----- 90300 in-flight CPI 1.4259 -- Total Cycles 128787 ---- Thread 28 ---- PC 5: Stalled ----- 89684 in-flight CPI 1.4357 -- Total Cycles 128787 ---- Thread 29 ---- PC 5: Stalled ----- 95005 in-flight CPI 1.3553 -- Total Cycles 128787 ---- Thread 30 ---- PC 5: Stalled ----- 85920 in-flight CPI 1.4987 -- Total Cycles 128787 ---- Thread 31 ---- PC 5: Stalled ----- 90057 in-flight CPI 1.4298 -- Total Cycles 128787 Total CPI 0.0424 , IPC 23.5595 -- Total Cycles 128787 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7362 (4.074517%) FPSUB: 0 (0.000000%) FPMUL: 30803 (17.047995%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 62517 (34.600185%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4295 (2.377078%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 67686 (37.460983%) DIV: 7759 (4.294238%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.145005%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3334884 total) ADD%: 7.194 (239901) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.534 (51152) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.522 (17406) FPSUB%: 0.000 (0) FPMUL%: 4.686 (156287) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.100 (170065) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (604) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.053 (35100) FPLE%: 0.456 (15206) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.825 (94200) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.738 (24602) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.702 (523636) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.181 (39371) ORI%: 1.541 (51407) XORI%: 0.000 (0) MULI%: 3.225 (107552) LW%: 1.408 (46961) LWI%: 13.172 (439270) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9643) SWI%: 4.170 (139067) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.412 (47081) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10352) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.049 (1635) bned%: 0.000 (0) bneid%: 13.835 (461386) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24053) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.115 (3841) DIV%: 0.013 (420) FPUN%: 1.487 (49587) FPRSUB%: 4.140 (138054) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (76) FPGT%: 2.957 (98601) FPGE%: 1.031 (34381) SYNC%: 0.000 (0) NOP%: 9.016 (300677) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 32 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 158 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 404 LOAD 39189 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1763 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49562 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10370 XORI 0 MULI 9909 LW 0 LWI 142892 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 64 DIV 24 FPUN 0 FPRSUB 44 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5597 --Total thread-cycles: 4121184 --total thread-cycles issued: 3034207 (73.624641%) --iCache conflicts: 113712 (2.759207%) --thread*cycles of FU dependence: 254469 (6.174657%) --thread*cycles of data dependence: 180684 (4.384274%) --iCache cycles*banks: 4121184 (80.921310% used) Issue breakdown: --thread*cycles of issue worked: 3034207 (73.624641%) --thread*cycles of issue failed: 786300 (19.079470%) --thread*cycles of issue NOP/other: 4611686018427688581 (111901975511040.000000%) Number of thread-cycles not ready: 180684 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3334884 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 8 4: 8 5: 8 6: 8 7: 7 8: 8 9: 8 10: 8 11: 8 12: 8 13: 8 14: 7 15: 9 16: 6 17: 9 18: 7 19: 7 20: 7 21: 7 22: 7 23: 7 24: 8 25: 8 26: 7 27: 8 28: 7 29: 8 30: 6 31: 7 <=== Core 67 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101858 in-flight CPI 1.2675 -- Total Cycles 129142 ---- Thread 01 ---- PC 5: Stalled ----- 99567 in-flight CPI 1.2968 -- Total Cycles 129142 ---- Thread 02 ---- PC 5: Stalled ----- 105176 in-flight CPI 1.2276 -- Total Cycles 129142 ---- Thread 03 ---- PC 5: Stalled ----- 97821 in-flight CPI 1.3200 -- Total Cycles 129142 ---- Thread 04 ---- PC 5: Stalled ----- 95154 in-flight CPI 1.3570 -- Total Cycles 129142 ---- Thread 05 ---- PC 5: Stalled ----- 98944 in-flight CPI 1.3049 -- Total Cycles 129142 ---- Thread 06 ---- PC 5: Stalled ----- 102898 in-flight CPI 1.2548 -- Total Cycles 129142 ---- Thread 07 ---- PC 5: Stalled ----- 96568 in-flight CPI 1.3370 -- Total Cycles 129142 ---- Thread 08 ---- PC 5: Stalled ----- 105263 in-flight CPI 1.2266 -- Total Cycles 129142 ---- Thread 09 ---- PC 5: Stalled ----- 93817 in-flight CPI 1.3764 -- Total Cycles 129142 ---- Thread 10 ---- PC 5: Stalled ----- 100478 in-flight CPI 1.2850 -- Total Cycles 129142 ---- Thread 11 ---- PC 5: Stalled ----- 100621 in-flight CPI 1.2832 -- Total Cycles 129142 ---- Thread 12 ---- PC 5: Stalled ----- 102568 in-flight CPI 1.2589 -- Total Cycles 129142 ---- Thread 13 ---- PC 5: Stalled ----- 98572 in-flight CPI 1.3099 -- Total Cycles 129142 ---- Thread 14 ---- PC 5: Stalled ----- 97857 in-flight CPI 1.3194 -- Total Cycles 129142 ---- Thread 15 ---- PC 5: Stalled ----- 98592 in-flight CPI 1.3096 -- Total Cycles 129142 ---- Thread 16 ---- PC 5: Stalled ----- 91276 in-flight CPI 1.4146 -- Total Cycles 129142 ---- Thread 17 ---- PC 5: Stalled ----- 88147 in-flight CPI 1.4649 -- Total Cycles 129142 ---- Thread 18 ---- PC 5: Stalled ----- 96668 in-flight CPI 1.3357 -- Total Cycles 129142 ---- Thread 19 ---- PC 5: Stalled ----- 90796 in-flight CPI 1.4221 -- Total Cycles 129142 ---- Thread 20 ---- PC 5: Stalled ----- 93040 in-flight CPI 1.3878 -- Total Cycles 129142 ---- Thread 21 ---- PC 5: Stalled ----- 92665 in-flight CPI 1.3934 -- Total Cycles 129142 ---- Thread 22 ---- PC 5: Stalled ----- 91627 in-flight CPI 1.4092 -- Total Cycles 129142 ---- Thread 23 ---- PC 5: Stalled ----- 89804 in-flight CPI 1.4377 -- Total Cycles 129142 ---- Thread 24 ---- PC 5: Stalled ----- 92845 in-flight CPI 1.3907 -- Total Cycles 129142 ---- Thread 25 ---- PC 5: Stalled ----- 93479 in-flight CPI 1.3813 -- Total Cycles 129142 ---- Thread 26 ---- PC 5: Stalled ----- 94976 in-flight CPI 1.3595 -- Total Cycles 129142 ---- Thread 27 ---- PC 5: Stalled ----- 91743 in-flight CPI 1.4075 -- Total Cycles 129142 ---- Thread 28 ---- PC 5: Stalled ----- 90364 in-flight CPI 1.4289 -- Total Cycles 129142 ---- Thread 29 ---- PC 5: Stalled ----- 93770 in-flight CPI 1.3770 -- Total Cycles 129142 ---- Thread 30 ---- PC 5: Stalled ----- 84668 in-flight CPI 1.5250 -- Total Cycles 129142 ---- Thread 31 ---- PC 5: Stalled ----- 89063 in-flight CPI 1.4497 -- Total Cycles 129142 Total CPI 0.0422 , IPC 23.7045 -- Total Cycles 129142 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8012 (4.112008%) FPSUB: 0 (0.000000%) FPMUL: 32234 (16.543491%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 68773 (35.296444%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4242 (2.177126%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73707 (37.828724%) DIV: 7610 (3.905689%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.136519%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3364500 total) ADD%: 7.185 (241731) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.534 (51604) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.559 (18810) FPSUB%: 0.000 (0) FPMUL%: 4.796 (161356) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (618) FPMAX%: 0.018 (618) LOAD%: 5.149 (173240) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (594) FPINV%: 0.000 (0) FPCONV%: 0.019 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.068 (35917) FPLE%: 0.454 (15283) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.794 (93996) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.752 (25305) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.666 (527066) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39471) ORI%: 1.573 (52922) XORI%: 0.000 (0) MULI%: 3.193 (107442) LW%: 1.393 (46855) LWI%: 13.077 (439973) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9625) SWI%: 4.130 (138941) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.396 (46973) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10425) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.057 (1925) bned%: 0.000 (0) bneid%: 13.795 (464128) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.715 (24072) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.125 (4198) DIV%: 0.012 (412) FPUN%: 1.482 (49855) FPRSUB%: 4.236 (142507) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (74) FPGT%: 2.936 (98781) FPGE%: 1.028 (34572) SYNC%: 0.000 (0) NOP%: 9.012 (303197) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 24 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 152 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 400 LOAD 40283 INTCONV 0 ATOMIC_INC 13 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 11 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2031 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49529 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11379 XORI 0 MULI 9551 LW 0 LWI 143569 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 67 DIV 23 FPUN 0 FPRSUB 52 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.7048 --Total thread-cycles: 4132544 --total thread-cycles issued: 3061303 (74.077927%) --iCache conflicts: 113821 (2.754260%) --thread*cycles of FU dependence: 257125 (6.221954%) --thread*cycles of data dependence: 194844 (4.714868%) --iCache cycles*banks: 4132544 (81.415520% used) Issue breakdown: --thread*cycles of issue worked: 3061303 (74.077927%) --thread*cycles of issue failed: 768044 (18.585258%) --thread*cycles of issue NOP/other: 625120691801389 (15126775808.000000%) Number of thread-cycles not ready: 194844 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3364500 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 10 1: 7 2: 10 3: 7 4: 7 5: 8 6: 8 7: 9 8: 9 9: 5 10: 9 11: 8 12: 7 13: 8 14: 8 15: 8 16: 7 17: 5 18: 8 19: 6 20: 7 21: 8 22: 6 23: 8 24: 7 25: 7 26: 7 27: 5 28: 7 29: 7 30: 7 31: 8 <=== Core 68 ===> ---- Thread 00 ---- PC 5: Stalled ----- 107641 in-flight CPI 1.4094 -- Total Cycles 151723 ---- Thread 01 ---- PC 5: Stalled ----- 91397 in-flight CPI 1.6597 -- Total Cycles 151723 ---- Thread 02 ---- PC 5: Stalled ----- 90134 in-flight CPI 1.6831 -- Total Cycles 151723 ---- Thread 03 ---- PC 5: Stalled ----- 94660 in-flight CPI 1.6025 -- Total Cycles 151723 ---- Thread 04 ---- PC 5: Stalled ----- 102565 in-flight CPI 1.4790 -- Total Cycles 151723 ---- Thread 05 ---- PC 5: Stalled ----- 98203 in-flight CPI 1.5447 -- Total Cycles 151723 ---- Thread 06 ---- PC 5: Stalled ----- 100768 in-flight CPI 1.5054 -- Total Cycles 151723 ---- Thread 07 ---- PC 5: Stalled ----- 92454 in-flight CPI 1.6407 -- Total Cycles 151723 ---- Thread 08 ---- PC 5: Stalled ----- 94109 in-flight CPI 1.6119 -- Total Cycles 151723 ---- Thread 09 ---- PC 5: Stalled ----- 97309 in-flight CPI 1.5589 -- Total Cycles 151723 ---- Thread 10 ---- PC 5: Stalled ----- 98276 in-flight CPI 1.5435 -- Total Cycles 151723 ---- Thread 11 ---- PC 5: Stalled ----- 92333 in-flight CPI 1.6429 -- Total Cycles 151723 ---- Thread 12 ---- PC 5: Stalled ----- 93212 in-flight CPI 1.6274 -- Total Cycles 151723 ---- Thread 13 ---- PC 5: Stalled ----- 93997 in-flight CPI 1.6138 -- Total Cycles 151723 ---- Thread 14 ---- PC 5: Stalled ----- 93881 in-flight CPI 1.6158 -- Total Cycles 151723 ---- Thread 15 ---- PC 5: Stalled ----- 97304 in-flight CPI 1.5590 -- Total Cycles 151723 ---- Thread 16 ---- PC 5: Stalled ----- 97917 in-flight CPI 1.5492 -- Total Cycles 151723 ---- Thread 17 ---- PC 5: Stalled ----- 92224 in-flight CPI 1.6449 -- Total Cycles 151723 ---- Thread 18 ---- PC 5: Stalled ----- 89032 in-flight CPI 1.7038 -- Total Cycles 151723 ---- Thread 19 ---- PC 5: Stalled ----- 90669 in-flight CPI 1.6731 -- Total Cycles 151723 ---- Thread 20 ---- PC 5: Stalled ----- 90926 in-flight CPI 1.6684 -- Total Cycles 151723 ---- Thread 21 ---- PC 5: Stalled ----- 96294 in-flight CPI 1.5754 -- Total Cycles 151723 ---- Thread 22 ---- PC 5: Stalled ----- 92054 in-flight CPI 1.6479 -- Total Cycles 151723 ---- Thread 23 ---- PC 5: Stalled ----- 91620 in-flight CPI 1.6557 -- Total Cycles 151723 ---- Thread 24 ---- PC 5: Stalled ----- 88525 in-flight CPI 1.7135 -- Total Cycles 151723 ---- Thread 25 ---- PC 5: Stalled ----- 92896 in-flight CPI 1.6330 -- Total Cycles 151723 ---- Thread 26 ---- PC 5: Stalled ----- 91701 in-flight CPI 1.6542 -- Total Cycles 151723 ---- Thread 27 ---- PC 5: Stalled ----- 89955 in-flight CPI 1.6864 -- Total Cycles 151723 ---- Thread 28 ---- PC 5: Stalled ----- 91874 in-flight CPI 1.6511 -- Total Cycles 151723 ---- Thread 29 ---- PC 5: Stalled ----- 89166 in-flight CPI 1.7012 -- Total Cycles 151723 ---- Thread 30 ---- PC 5: Stalled ----- 86493 in-flight CPI 1.7538 -- Total Cycles 151723 ---- Thread 31 ---- PC 5: Stalled ----- 89895 in-flight CPI 1.6874 -- Total Cycles 151723 Total CPI 0.0506 , IPC 19.7731 -- Total Cycles 151723 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8586 (3.851243%) FPSUB: 0 (0.000000%) FPMUL: 32870 (14.743811%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 91740 (41.149899%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4194 (1.881215%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 77829 (34.910133%) DIV: 7464 (3.347971%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.115726%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3296915 total) ADD%: 7.174 (236509) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.523 (50197) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.602 (19842) FPSUB%: 0.000 (0) FPMUL%: 4.917 (162120) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.197 (171348) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (585) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.082 (35686) FPLE%: 0.449 (14804) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.766 (91197) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.763 (25170) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.615 (514811) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.168 (38503) ORI%: 1.601 (52793) XORI%: 0.000 (0) MULI%: 3.162 (104256) LW%: 1.379 (45477) LWI%: 13.009 (428882) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.283 (9322) SWI%: 4.105 (135325) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.383 (45594) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.307 (10124) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.062 (2044) bned%: 0.000 (0) bneid%: 13.745 (453149) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.709 (23375) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.135 (4442) DIV%: 0.012 (404) FPUN%: 1.465 (48294) FPRSUB%: 4.331 (142783) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (69) FPGT%: 2.921 (96305) FPGE%: 1.016 (33490) SYNC%: 0.000 (0) NOP%: 9.003 (296825) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 15 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 149 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 394 LOAD 40473 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1351 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 4 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48172 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 12245 XORI 0 MULI 8845 LW 0 LWI 140005 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 79 DIV 21 FPUN 0 FPRSUB 52 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 19.7733 --Total thread-cycles: 4855136 --total thread-cycles issued: 3000090 (61.792088%) --iCache conflicts: 110006 (2.265765%) --thread*cycles of FU dependence: 251884 (5.187991%) --thread*cycles of data dependence: 222941 (4.591859%) --iCache cycles*banks: 4855136 (67.906380% used) Issue breakdown: --thread*cycles of issue worked: 3000090 (61.792088%) --thread*cycles of issue failed: 1558221 (32.094280%) --thread*cycles of issue NOP/other: 4622452549029758841 (95207479574528.000000%) Number of thread-cycles not ready: 222941 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3296915 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 6 1: 7 2: 6 3: 8 4: 8 5: 7 6: 8 7: 8 8: 7 9: 8 10: 9 11: 7 12: 8 13: 7 14: 7 15: 8 16: 7 17: 7 18: 7 19: 7 20: 6 21: 7 22: 7 23: 8 24: 8 25: 7 26: 7 27: 6 28: 7 29: 8 30: 8 31: 8 <=== Core 69 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99344 in-flight CPI 1.3686 -- Total Cycles 135980 ---- Thread 01 ---- PC 5: Stalled ----- 99187 in-flight CPI 1.3707 -- Total Cycles 135980 ---- Thread 02 ---- PC 5: Stalled ----- 100572 in-flight CPI 1.3518 -- Total Cycles 135980 ---- Thread 03 ---- PC 5: Stalled ----- 103691 in-flight CPI 1.3112 -- Total Cycles 135980 ---- Thread 04 ---- PC 5: Stalled ----- 96820 in-flight CPI 1.4043 -- Total Cycles 135980 ---- Thread 05 ---- PC 5: Stalled ----- 100638 in-flight CPI 1.3509 -- Total Cycles 135980 ---- Thread 06 ---- PC 5: Stalled ----- 93197 in-flight CPI 1.4588 -- Total Cycles 135980 ---- Thread 07 ---- PC 5: Stalled ----- 96840 in-flight CPI 1.4039 -- Total Cycles 135980 ---- Thread 08 ---- PC 5: Stalled ----- 99638 in-flight CPI 1.3644 -- Total Cycles 135980 ---- Thread 09 ---- PC 5: Stalled ----- 101154 in-flight CPI 1.3440 -- Total Cycles 135980 ---- Thread 10 ---- PC 5: Stalled ----- 99933 in-flight CPI 1.3604 -- Total Cycles 135980 ---- Thread 11 ---- PC 5: Stalled ----- 96567 in-flight CPI 1.4079 -- Total Cycles 135980 ---- Thread 12 ---- PC 5: Stalled ----- 97211 in-flight CPI 1.3986 -- Total Cycles 135980 ---- Thread 13 ---- PC 5: Stalled ----- 96127 in-flight CPI 1.4143 -- Total Cycles 135980 ---- Thread 14 ---- PC 5: Stalled ----- 99838 in-flight CPI 1.3617 -- Total Cycles 135980 ---- Thread 15 ---- PC 5: Stalled ----- 92738 in-flight CPI 1.4660 -- Total Cycles 135980 ---- Thread 16 ---- PC 5: Stalled ----- 88890 in-flight CPI 1.5295 -- Total Cycles 135980 ---- Thread 17 ---- PC 5: Stalled ----- 96417 in-flight CPI 1.4101 -- Total Cycles 135980 ---- Thread 18 ---- PC 5: Stalled ----- 92992 in-flight CPI 1.4620 -- Total Cycles 135980 ---- Thread 19 ---- PC 5: Stalled ----- 96134 in-flight CPI 1.4142 -- Total Cycles 135980 ---- Thread 20 ---- PC 5: Stalled ----- 89613 in-flight CPI 1.5172 -- Total Cycles 135980 ---- Thread 21 ---- PC 5: Stalled ----- 99094 in-flight CPI 1.3720 -- Total Cycles 135980 ---- Thread 22 ---- PC 5: Stalled ----- 88045 in-flight CPI 1.5442 -- Total Cycles 135980 ---- Thread 23 ---- PC 5: Stalled ----- 91581 in-flight CPI 1.4845 -- Total Cycles 135980 ---- Thread 24 ---- PC 5: Stalled ----- 90693 in-flight CPI 1.4991 -- Total Cycles 135980 ---- Thread 25 ---- PC 5: Stalled ----- 96136 in-flight CPI 1.4142 -- Total Cycles 135980 ---- Thread 26 ---- PC 5: Stalled ----- 99432 in-flight CPI 1.3674 -- Total Cycles 135980 ---- Thread 27 ---- PC 5: Stalled ----- 86260 in-flight CPI 1.5762 -- Total Cycles 135980 ---- Thread 28 ---- PC 5: Stalled ----- 89048 in-flight CPI 1.5267 -- Total Cycles 135980 ---- Thread 29 ---- PC 5: Stalled ----- 88553 in-flight CPI 1.5352 -- Total Cycles 135980 ---- Thread 30 ---- PC 5: Stalled ----- 90483 in-flight CPI 1.5026 -- Total Cycles 135980 ---- Thread 31 ---- PC 5: Stalled ----- 83628 in-flight CPI 1.6258 -- Total Cycles 135980 Total CPI 0.0447 , IPC 22.3639 -- Total Cycles 135980 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8056 (4.059338%) FPSUB: 0 (0.000000%) FPMUL: 32238 (16.244408%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 73620 (37.096386%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4094 (2.062926%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 72795 (36.680676%) DIV: 7392 (3.724755%) FPUN: 0 (0.000000%) FPRSUB: 261 (0.131515%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3342675 total) ADD%: 7.142 (238741) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.526 (51000) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.562 (18798) FPSUB%: 0.000 (0) FPMUL%: 4.812 (160858) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.153 (172259) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (577) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.070 (35750) FPLE%: 0.452 (15110) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.800 (93580) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (25062) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.669 (523756) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.172 (39181) ORI%: 1.573 (52579) XORI%: 0.000 (0) MULI%: 3.197 (106878) LW%: 1.396 (46664) LWI%: 13.084 (437371) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9564) SWI%: 4.121 (137765) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.399 (46777) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10355) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2024) bned%: 0.000 (0) bneid%: 13.794 (461100) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (23924) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4149) DIV%: 0.012 (400) FPUN%: 1.477 (49371) FPRSUB%: 4.243 (141813) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (57) FPGT%: 2.938 (98210) FPGE%: 1.025 (34261) SYNC%: 0.000 (0) NOP%: 9.022 (301581) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 27 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 150 FPSUB 0 FPMUL 5 FPCMPLT 0 FPMIN 0 FPMAX 391 LOAD 40533 INTCONV 0 ATOMIC_INC 15 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 13 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1484 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49261 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 11 ORI 11432 XORI 0 MULI 9796 LW 0 LWI 142623 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 69 DIV 19 FPUN 0 FPRSUB 48 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 22.3641 --Total thread-cycles: 4351360 --total thread-cycles issued: 3041094 (69.888359%) --iCache conflicts: 113134 (2.599969%) --thread*cycles of FU dependence: 255894 (5.880782%) --thread*cycles of data dependence: 198456 (4.560781%) --iCache cycles*banks: 4351360 (76.819817% used) Issue breakdown: --thread*cycles of issue worked: 3041094 (69.888359%) --thread*cycles of issue failed: 1008685 (23.180914%) --thread*cycles of issue NOP/other: 18039949270309725 (414581850112.000000%) Number of thread-cycles not ready: 198456 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3342675 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 8 2: 7 3: 8 4: 6 5: 8 6: 7 7: 8 8: 9 9: 8 10: 8 11: 6 12: 7 13: 7 14: 9 15: 7 16: 6 17: 7 18: 7 19: 8 20: 6 21: 8 22: 7 23: 7 24: 7 25: 8 26: 6 27: 6 28: 8 29: 8 30: 7 31: 6 <=== Core 70 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98038 in-flight CPI 1.3094 -- Total Cycles 128401 ---- Thread 01 ---- PC 5: Stalled ----- 102782 in-flight CPI 1.2490 -- Total Cycles 128401 ---- Thread 02 ---- PC 5: Stalled ----- 105235 in-flight CPI 1.2199 -- Total Cycles 128401 ---- Thread 03 ---- PC 5: Stalled ----- 97584 in-flight CPI 1.3155 -- Total Cycles 128401 ---- Thread 04 ---- PC 5: Stalled ----- 100298 in-flight CPI 1.2800 -- Total Cycles 128401 ---- Thread 05 ---- PC 5: Stalled ----- 96844 in-flight CPI 1.3257 -- Total Cycles 128401 ---- Thread 06 ---- PC 5: Stalled ----- 94738 in-flight CPI 1.3550 -- Total Cycles 128401 ---- Thread 07 ---- PC 5: Stalled ----- 93918 in-flight CPI 1.3669 -- Total Cycles 128401 ---- Thread 08 ---- PC 5: Stalled ----- 101282 in-flight CPI 1.2675 -- Total Cycles 128401 ---- Thread 09 ---- PC 5: Stalled ----- 99586 in-flight CPI 1.2891 -- Total Cycles 128401 ---- Thread 10 ---- PC 5: Stalled ----- 95635 in-flight CPI 1.3423 -- Total Cycles 128401 ---- Thread 11 ---- PC 5: Stalled ----- 94002 in-flight CPI 1.3657 -- Total Cycles 128401 ---- Thread 12 ---- PC 5: Stalled ----- 91922 in-flight CPI 1.3966 -- Total Cycles 128401 ---- Thread 13 ---- PC 5: Stalled ----- 100768 in-flight CPI 1.2740 -- Total Cycles 128401 ---- Thread 14 ---- PC 5: Stalled ----- 94462 in-flight CPI 1.3591 -- Total Cycles 128401 ---- Thread 15 ---- PC 5: Stalled ----- 89001 in-flight CPI 1.4425 -- Total Cycles 128401 ---- Thread 16 ---- PC 5: Stalled ----- 94595 in-flight CPI 1.3571 -- Total Cycles 128401 ---- Thread 17 ---- PC 5: Stalled ----- 93410 in-flight CPI 1.3743 -- Total Cycles 128401 ---- Thread 18 ---- PC 5: Stalled ----- 93700 in-flight CPI 1.3701 -- Total Cycles 128401 ---- Thread 19 ---- PC 5: Stalled ----- 94087 in-flight CPI 1.3645 -- Total Cycles 128401 ---- Thread 20 ---- PC 5: Stalled ----- 90507 in-flight CPI 1.4184 -- Total Cycles 128401 ---- Thread 21 ---- PC 5: Stalled ----- 93368 in-flight CPI 1.3749 -- Total Cycles 128401 ---- Thread 22 ---- PC 5: Stalled ----- 90788 in-flight CPI 1.4141 -- Total Cycles 128401 ---- Thread 23 ---- PC 5: Stalled ----- 92797 in-flight CPI 1.3833 -- Total Cycles 128401 ---- Thread 24 ---- PC 5: Stalled ----- 96951 in-flight CPI 1.3242 -- Total Cycles 128401 ---- Thread 25 ---- PC 5: Stalled ----- 92810 in-flight CPI 1.3832 -- Total Cycles 128401 ---- Thread 26 ---- PC 5: Stalled ----- 88212 in-flight CPI 1.4554 -- Total Cycles 128401 ---- Thread 27 ---- PC 5: Stalled ----- 94187 in-flight CPI 1.3630 -- Total Cycles 128401 ---- Thread 28 ---- PC 5: Stalled ----- 89425 in-flight CPI 1.4355 -- Total Cycles 128401 ---- Thread 29 ---- PC 5: Stalled ----- 87993 in-flight CPI 1.4589 -- Total Cycles 128401 ---- Thread 30 ---- PC 5: Stalled ----- 87045 in-flight CPI 1.4748 -- Total Cycles 128401 ---- Thread 31 ---- PC 5: Stalled ----- 83311 in-flight CPI 1.5409 -- Total Cycles 128401 Total CPI 0.0425 , IPC 23.5188 -- Total Cycles 128401 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8139 (4.185824%) FPSUB: 0 (0.000000%) FPMUL: 32471 (16.699581%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 67791 (34.864380%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4183 (2.151284%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73987 (38.050934%) DIV: 7612 (3.914792%) FPUN: 0 (0.000000%) FPRSUB: 259 (0.133202%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3319434 total) ADD%: 7.182 (238400) SUB%: 0.000 (0) MUL%: 0.006 (206) BITOR%: 1.512 (50185) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.577 (19145) FPSUB%: 0.000 (0) FPMUL%: 4.850 (160989) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (618) FPMAX%: 0.019 (618) LOAD%: 5.154 (171079) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (238) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (591) FPINV%: 0.000 (0) FPCONV%: 0.020 (650) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.075 (35698) FPLE%: 0.449 (14909) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (618) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.784 (92411) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.750 (24899) CMPU%: 0.000 (0) RSUB%: 0.006 (206) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.648 (519419) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.169 (38796) ORI%: 1.573 (52225) XORI%: 0.000 (0) MULI%: 3.189 (105848) LW%: 1.387 (46057) LWI%: 13.081 (434211) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9472) SWI%: 4.126 (136951) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.391 (46172) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10243) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.059 (1950) bned%: 0.000 (0) bneid%: 13.780 (457419) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.707 (23463) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.127 (4214) DIV%: 0.012 (412) FPUN%: 1.460 (48477) FPRSUB%: 4.263 (141519) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (70) FPGT%: 2.948 (97857) FPGE%: 1.011 (33568) SYNC%: 0.000 (0) NOP%: 9.024 (299535) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 151 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 39645 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1428 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 11 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48763 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 8 ORI 11640 XORI 0 MULI 9370 LW 0 LWI 141681 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 76 DIV 23 FPUN 0 FPRSUB 51 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.5191 --Total thread-cycles: 4108832 --total thread-cycles issued: 3019899 (73.497749%) --iCache conflicts: 112999 (2.750149%) --thread*cycles of FU dependence: 253325 (6.165378%) --thread*cycles of data dependence: 194442 (4.732294%) --iCache cycles*banks: 4108832 (80.788551% used) Issue breakdown: --thread*cycles of issue worked: 3019899 (73.497749%) --thread*cycles of issue failed: 789398 (19.212223%) --thread*cycles of issue NOP/other: 97122818095501865 (2363757428736.000000%) Number of thread-cycles not ready: 194442 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3319434 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 9 3: 8 4: 7 5: 6 6: 9 7: 7 8: 8 9: 8 10: 8 11: 8 12: 7 13: 8 14: 6 15: 5 16: 7 17: 8 18: 8 19: 7 20: 7 21: 8 22: 6 23: 9 24: 7 25: 7 26: 6 27: 8 28: 8 29: 8 30: 7 31: 7 <=== Core 71 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98679 in-flight CPI 1.2897 -- Total Cycles 127290 ---- Thread 01 ---- PC 5: Stalled ----- 100024 in-flight CPI 1.2723 -- Total Cycles 127290 ---- Thread 02 ---- PC 5: Stalled ----- 98764 in-flight CPI 1.2885 -- Total Cycles 127290 ---- Thread 03 ---- PC 5: Stalled ----- 95830 in-flight CPI 1.3281 -- Total Cycles 127290 ---- Thread 04 ---- PC 5: Stalled ----- 98623 in-flight CPI 1.2904 -- Total Cycles 127290 ---- Thread 05 ---- PC 5: Stalled ----- 101213 in-flight CPI 1.2574 -- Total Cycles 127290 ---- Thread 06 ---- PC 5: Stalled ----- 101114 in-flight CPI 1.2586 -- Total Cycles 127290 ---- Thread 07 ---- PC 5: Stalled ----- 97068 in-flight CPI 1.3111 -- Total Cycles 127290 ---- Thread 08 ---- PC 5: Stalled ----- 96455 in-flight CPI 1.3195 -- Total Cycles 127290 ---- Thread 09 ---- PC 5: Stalled ----- 100209 in-flight CPI 1.2700 -- Total Cycles 127290 ---- Thread 10 ---- PC 5: Stalled ----- 99508 in-flight CPI 1.2790 -- Total Cycles 127290 ---- Thread 11 ---- PC 5: Stalled ----- 92997 in-flight CPI 1.3686 -- Total Cycles 127290 ---- Thread 12 ---- PC 5: Stalled ----- 94847 in-flight CPI 1.3419 -- Total Cycles 127290 ---- Thread 13 ---- PC 5: Stalled ----- 94898 in-flight CPI 1.3411 -- Total Cycles 127290 ---- Thread 14 ---- PC 5: Stalled ----- 98682 in-flight CPI 1.2897 -- Total Cycles 127290 ---- Thread 15 ---- PC 5: Stalled ----- 98393 in-flight CPI 1.2934 -- Total Cycles 127290 ---- Thread 16 ---- PC 5: Stalled ----- 96788 in-flight CPI 1.3149 -- Total Cycles 127290 ---- Thread 17 ---- PC 5: Stalled ----- 95078 in-flight CPI 1.3386 -- Total Cycles 127290 ---- Thread 18 ---- PC 5: Stalled ----- 97677 in-flight CPI 1.3030 -- Total Cycles 127290 ---- Thread 19 ---- PC 5: Stalled ----- 91795 in-flight CPI 1.3864 -- Total Cycles 127290 ---- Thread 20 ---- PC 5: Stalled ----- 96707 in-flight CPI 1.3160 -- Total Cycles 127290 ---- Thread 21 ---- PC 5: Stalled ----- 91234 in-flight CPI 1.3950 -- Total Cycles 127290 ---- Thread 22 ---- PC 5: Stalled ----- 91989 in-flight CPI 1.3836 -- Total Cycles 127290 ---- Thread 23 ---- PC 5: Stalled ----- 93755 in-flight CPI 1.3575 -- Total Cycles 127290 ---- Thread 24 ---- PC 5: Stalled ----- 96216 in-flight CPI 1.3227 -- Total Cycles 127290 ---- Thread 25 ---- PC 5: Stalled ----- 92381 in-flight CPI 1.3776 -- Total Cycles 127290 ---- Thread 26 ---- PC 5: Stalled ----- 92637 in-flight CPI 1.3738 -- Total Cycles 127290 ---- Thread 27 ---- PC 5: Stalled ----- 93673 in-flight CPI 1.3586 -- Total Cycles 127290 ---- Thread 28 ---- PC 5: Stalled ----- 88007 in-flight CPI 1.4460 -- Total Cycles 127290 ---- Thread 29 ---- PC 5: Stalled ----- 92158 in-flight CPI 1.3809 -- Total Cycles 127290 ---- Thread 30 ---- PC 5: Stalled ----- 90293 in-flight CPI 1.4095 -- Total Cycles 127290 ---- Thread 31 ---- PC 5: Stalled ----- 86369 in-flight CPI 1.4736 -- Total Cycles 127290 Total CPI 0.0417 , IPC 23.9973 -- Total Cycles 127290 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8549 (3.984619%) FPSUB: 0 (0.000000%) FPMUL: 33233 (15.489629%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 83434 (38.887905%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3953 (1.842461%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 77624 (36.179913%) DIV: 7501 (3.496155%) FPUN: 0 (0.000000%) FPRSUB: 256 (0.119320%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3357102 total) ADD%: 7.095 (238177) SUB%: 0.000 (0) MUL%: 0.006 (203) BITOR%: 1.528 (51297) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.591 (19847) FPSUB%: 0.000 (0) FPMUL%: 4.894 (164281) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (609) FPMAX%: 0.018 (609) LOAD%: 5.200 (174564) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (235) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (575) FPINV%: 0.000 (0) FPCONV%: 0.019 (641) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.079 (36220) FPLE%: 0.452 (15180) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (609) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.779 (93293) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.760 (25505) CMPU%: 0.000 (0) RSUB%: 0.006 (203) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.655 (525564) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (39300) ORI%: 1.593 (53471) XORI%: 0.000 (0) MULI%: 3.174 (106542) LW%: 1.384 (46464) LWI%: 13.020 (437087) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9593) SWI%: 4.108 (137917) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.387 (46569) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10395) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.061 (2055) bned%: 0.000 (0) bneid%: 13.769 (462243) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.710 (23852) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.132 (4430) DIV%: 0.012 (406) FPUN%: 1.473 (49436) FPRSUB%: 4.311 (144717) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (53) FPGT%: 2.924 (98176) FPGE%: 1.020 (34256) SYNC%: 0.000 (0) NOP%: 9.009 (302432) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 22 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 155 FPSUB 0 FPMUL 0 FPCMPLT 0 FPMIN 0 FPMAX 390 LOAD 42513 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 2276 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49022 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 24 ORI 12212 XORI 0 MULI 9569 LW 0 LWI 142627 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 71 DIV 19 FPUN 0 FPRSUB 58 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9975 --Total thread-cycles: 4073280 --total thread-cycles issued: 3054670 (74.992882%) --iCache conflicts: 113962 (2.797794%) --thread*cycles of FU dependence: 259010 (6.358757%) --thread*cycles of data dependence: 214550 (5.267254%) --iCache cycles*banks: 4073280 (82.418442% used) Issue breakdown: --thread*cycles of issue worked: 3054670 (74.992882%) --thread*cycles of issue failed: 716178 (17.582342%) --thread*cycles of issue NOP/other: 4502685 (110.542000%) Number of thread-cycles not ready: 214550 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3357102 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 9 3: 7 4: 8 5: 8 6: 8 7: 7 8: 6 9: 8 10: 7 11: 6 12: 6 13: 8 14: 7 15: 8 16: 8 17: 7 18: 7 19: 8 20: 7 21: 7 22: 5 23: 7 24: 7 25: 8 26: 7 27: 8 28: 8 29: 8 30: 7 31: 6 <=== Core 72 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96906 in-flight CPI 1.3083 -- Total Cycles 126810 ---- Thread 01 ---- PC 5: Stalled ----- 95056 in-flight CPI 1.3338 -- Total Cycles 126810 ---- Thread 02 ---- PC 5: Stalled ----- 101655 in-flight CPI 1.2472 -- Total Cycles 126810 ---- Thread 03 ---- PC 5: Stalled ----- 93457 in-flight CPI 1.3566 -- Total Cycles 126810 ---- Thread 04 ---- PC 5: Stalled ----- 95118 in-flight CPI 1.3330 -- Total Cycles 126810 ---- Thread 05 ---- PC 5: Stalled ----- 98685 in-flight CPI 1.2847 -- Total Cycles 126810 ---- Thread 06 ---- PC 5: Stalled ----- 101814 in-flight CPI 1.2452 -- Total Cycles 126810 ---- Thread 07 ---- PC 5: Stalled ----- 95595 in-flight CPI 1.3264 -- Total Cycles 126810 ---- Thread 08 ---- PC 5: Stalled ----- 99535 in-flight CPI 1.2738 -- Total Cycles 126810 ---- Thread 09 ---- PC 5: Stalled ----- 98125 in-flight CPI 1.2921 -- Total Cycles 126810 ---- Thread 10 ---- PC 5: Stalled ----- 97480 in-flight CPI 1.3007 -- Total Cycles 126810 ---- Thread 11 ---- PC 5: Stalled ----- 94715 in-flight CPI 1.3386 -- Total Cycles 126810 ---- Thread 12 ---- PC 5: Stalled ----- 98858 in-flight CPI 1.2825 -- Total Cycles 126810 ---- Thread 13 ---- PC 5: Stalled ----- 96549 in-flight CPI 1.3132 -- Total Cycles 126810 ---- Thread 14 ---- PC 5: Stalled ----- 93985 in-flight CPI 1.3490 -- Total Cycles 126810 ---- Thread 15 ---- PC 5: Stalled ----- 95738 in-flight CPI 1.3243 -- Total Cycles 126810 ---- Thread 16 ---- PC 5: Stalled ----- 99152 in-flight CPI 1.2787 -- Total Cycles 126810 ---- Thread 17 ---- PC 5: Stalled ----- 89043 in-flight CPI 1.4238 -- Total Cycles 126810 ---- Thread 18 ---- PC 5: Stalled ----- 94899 in-flight CPI 1.3360 -- Total Cycles 126810 ---- Thread 19 ---- PC 5: Stalled ----- 97578 in-flight CPI 1.2994 -- Total Cycles 126810 ---- Thread 20 ---- PC 5: Stalled ----- 90670 in-flight CPI 1.3983 -- Total Cycles 126810 ---- Thread 21 ---- PC 5: Stalled ----- 97162 in-flight CPI 1.3048 -- Total Cycles 126810 ---- Thread 22 ---- PC 5: Stalled ----- 95497 in-flight CPI 1.3276 -- Total Cycles 126810 ---- Thread 23 ---- PC 5: Stalled ----- 93117 in-flight CPI 1.3616 -- Total Cycles 126810 ---- Thread 24 ---- PC 5: Stalled ----- 94782 in-flight CPI 1.3376 -- Total Cycles 126810 ---- Thread 25 ---- PC 5: Stalled ----- 88507 in-flight CPI 1.4326 -- Total Cycles 126810 ---- Thread 26 ---- PC 5: Stalled ----- 94248 in-flight CPI 1.3453 -- Total Cycles 126810 ---- Thread 27 ---- PC 5: Stalled ----- 92416 in-flight CPI 1.3720 -- Total Cycles 126810 ---- Thread 28 ---- PC 5: Stalled ----- 84804 in-flight CPI 1.4951 -- Total Cycles 126810 ---- Thread 29 ---- PC 5: Stalled ----- 90165 in-flight CPI 1.4062 -- Total Cycles 126810 ---- Thread 30 ---- PC 5: Stalled ----- 84067 in-flight CPI 1.5082 -- Total Cycles 126810 ---- Thread 31 ---- PC 5: Stalled ----- 83408 in-flight CPI 1.5201 -- Total Cycles 126810 Total CPI 0.0419 , IPC 23.8415 -- Total Cycles 126810 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7883 (4.121809%) FPSUB: 0 (0.000000%) FPMUL: 31784 (16.618998%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 67811 (35.456547%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4136 (2.162603%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71805 (37.544903%) DIV: 7570 (3.958149%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.136993%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3323463 total) ADD%: 7.157 (237847) SUB%: 0.000 (0) MUL%: 0.006 (205) BITOR%: 1.535 (51003) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.557 (18513) FPSUB%: 0.000 (0) FPMUL%: 4.787 (159105) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (615) FPMAX%: 0.019 (615) LOAD%: 5.145 (170992) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (237) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (587) FPINV%: 0.000 (0) FPCONV%: 0.019 (647) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35450) FPLE%: 0.458 (15234) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (615) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.798 (92990) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (24874) CMPU%: 0.000 (0) RSUB%: 0.006 (205) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.677 (521017) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (38914) ORI%: 1.575 (52331) XORI%: 0.000 (0) MULI%: 3.195 (106178) LW%: 1.394 (46337) LWI%: 13.071 (434404) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9539) SWI%: 4.129 (137221) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.398 (46450) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10359) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.060 (1989) bned%: 0.000 (0) bneid%: 13.808 (458898) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (23956) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.123 (4086) DIV%: 0.012 (410) FPUN%: 1.487 (49408) FPRSUB%: 4.215 (140097) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (72) FPGT%: 2.941 (97731) FPGE%: 1.028 (34174) SYNC%: 0.000 (0) NOP%: 9.029 (300062) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 156 FPSUB 0 FPMUL 3 FPCMPLT 0 FPMIN 0 FPMAX 398 LOAD 40207 INTCONV 0 ATOMIC_INC 17 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 20 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1629 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 48870 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 19 ORI 11216 XORI 0 MULI 9450 LW 0 LWI 141587 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 76 DIV 27 FPUN 0 FPRSUB 57 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8418 --Total thread-cycles: 4057920 --total thread-cycles issued: 3023401 (74.506172%) --iCache conflicts: 113898 (2.806808%) --thread*cycles of FU dependence: 253762 (6.253500%) --thread*cycles of data dependence: 191251 (4.713030%) --iCache cycles*banks: 4057920 (81.901443% used) Issue breakdown: --thread*cycles of issue worked: 3023401 (74.506172%) --thread*cycles of issue failed: 734457 (18.099346%) --thread*cycles of issue NOP/other: 4672115542585090254 (115135725174784.000000%) Number of thread-cycles not ready: 191251 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3323463 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 9 3: 7 4: 6 5: 8 6: 9 7: 6 8: 8 9: 7 10: 7 11: 7 12: 9 13: 8 14: 8 15: 8 16: 8 17: 8 18: 8 19: 7 20: 8 21: 9 22: 8 23: 8 24: 8 25: 6 26: 7 27: 6 28: 6 29: 6 30: 6 31: 6 <=== Core 73 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103308 in-flight CPI 1.3558 -- Total Cycles 140088 ---- Thread 01 ---- PC 5: Stalled ----- 99531 in-flight CPI 1.4073 -- Total Cycles 140088 ---- Thread 02 ---- PC 5: Stalled ----- 100565 in-flight CPI 1.3927 -- Total Cycles 140088 ---- Thread 03 ---- PC 5: Stalled ----- 103219 in-flight CPI 1.3569 -- Total Cycles 140088 ---- Thread 04 ---- PC 5: Stalled ----- 97255 in-flight CPI 1.4402 -- Total Cycles 140088 ---- Thread 05 ---- PC 5: Stalled ----- 98769 in-flight CPI 1.4180 -- Total Cycles 140088 ---- Thread 06 ---- PC 5: Stalled ----- 96578 in-flight CPI 1.4503 -- Total Cycles 140088 ---- Thread 07 ---- PC 5: Stalled ----- 95403 in-flight CPI 1.4681 -- Total Cycles 140088 ---- Thread 08 ---- PC 5: Stalled ----- 99649 in-flight CPI 1.4055 -- Total Cycles 140088 ---- Thread 09 ---- PC 5: Stalled ----- 95370 in-flight CPI 1.4686 -- Total Cycles 140088 ---- Thread 10 ---- PC 5: Stalled ----- 92970 in-flight CPI 1.5065 -- Total Cycles 140088 ---- Thread 11 ---- PC 5: Stalled ----- 93395 in-flight CPI 1.4997 -- Total Cycles 140088 ---- Thread 12 ---- PC 5: Stalled ----- 101086 in-flight CPI 1.3856 -- Total Cycles 140088 ---- Thread 13 ---- PC 5: Stalled ----- 92864 in-flight CPI 1.5083 -- Total Cycles 140088 ---- Thread 14 ---- PC 5: Stalled ----- 96621 in-flight CPI 1.4496 -- Total Cycles 140088 ---- Thread 15 ---- PC 5: Stalled ----- 100931 in-flight CPI 1.3877 -- Total Cycles 140088 ---- Thread 16 ---- PC 5: Stalled ----- 88482 in-flight CPI 1.5830 -- Total Cycles 140088 ---- Thread 17 ---- PC 5: Stalled ----- 96744 in-flight CPI 1.4478 -- Total Cycles 140088 ---- Thread 18 ---- PC 5: Stalled ----- 95073 in-flight CPI 1.4732 -- Total Cycles 140088 ---- Thread 19 ---- PC 5: Stalled ----- 93836 in-flight CPI 1.4926 -- Total Cycles 140088 ---- Thread 20 ---- PC 5: Stalled ----- 94762 in-flight CPI 1.4781 -- Total Cycles 140088 ---- Thread 21 ---- PC 5: Stalled ----- 97844 in-flight CPI 1.4315 -- Total Cycles 140088 ---- Thread 22 ---- PC 5: Stalled ----- 91614 in-flight CPI 1.5288 -- Total Cycles 140088 ---- Thread 23 ---- PC 5: Stalled ----- 90251 in-flight CPI 1.5519 -- Total Cycles 140088 ---- Thread 24 ---- PC 5: Stalled ----- 100919 in-flight CPI 1.3879 -- Total Cycles 140088 ---- Thread 25 ---- PC 5: Stalled ----- 91750 in-flight CPI 1.5266 -- Total Cycles 140088 ---- Thread 26 ---- PC 5: Stalled ----- 95927 in-flight CPI 1.4601 -- Total Cycles 140088 ---- Thread 27 ---- PC 5: Stalled ----- 95158 in-flight CPI 1.4719 -- Total Cycles 140088 ---- Thread 28 ---- PC 5: Stalled ----- 90446 in-flight CPI 1.5485 -- Total Cycles 140088 ---- Thread 29 ---- PC 5: Stalled ----- 85965 in-flight CPI 1.6294 -- Total Cycles 140088 ---- Thread 30 ---- PC 5: Stalled ----- 86116 in-flight CPI 1.6264 -- Total Cycles 140088 ---- Thread 31 ---- PC 5: Stalled ----- 89628 in-flight CPI 1.5627 -- Total Cycles 140088 Total CPI 0.0459 , IPC 21.7904 -- Total Cycles 140088 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8399 (3.825064%) FPSUB: 0 (0.000000%) FPMUL: 32824 (14.948673%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 90507 (41.218613%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4124 (1.878148%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 76122 (34.667408%) DIV: 7349 (3.346874%) FPUN: 0 (0.000000%) FPRSUB: 253 (0.115221%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3354784 total) ADD%: 7.130 (239182) SUB%: 0.000 (0) MUL%: 0.006 (199) BITOR%: 1.512 (50719) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.581 (19487) FPSUB%: 0.000 (0) FPMUL%: 4.866 (163256) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (597) FPMAX%: 0.018 (597) LOAD%: 5.199 (174431) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (231) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (577) FPINV%: 0.000 (0) FPCONV%: 0.019 (629) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.071 (35941) FPLE%: 0.450 (15098) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (597) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.796 (93789) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.754 (25285) CMPU%: 0.000 (0) RSUB%: 0.006 (199) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.654 (525169) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.170 (39238) ORI%: 1.582 (53061) XORI%: 0.000 (0) MULI%: 3.186 (106898) LW%: 1.395 (46792) LWI%: 13.062 (438195) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.285 (9561) SWI%: 4.116 (138099) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.398 (46907) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.309 (10352) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.067 (2256) bned%: 0.000 (0) bneid%: 13.741 (460981) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.716 (24018) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.130 (4346) DIV%: 0.012 (398) FPUN%: 1.464 (49116) FPRSUB%: 4.296 (144106) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (75) FPGT%: 2.925 (98130) FPGE%: 1.014 (34018) SYNC%: 0.000 (0) NOP%: 9.007 (302158) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 21 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 151 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 392 LOAD 41199 INTCONV 0 ATOMIC_INC 14 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1245 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49233 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 11934 XORI 0 MULI 9148 LW 0 LWI 143039 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 68 DIV 23 FPUN 0 FPRSUB 50 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.7906 --Total thread-cycles: 4482816 --total thread-cycles issued: 3052626 (68.096176%) --iCache conflicts: 113295 (2.527318%) --thread*cycles of FU dependence: 256564 (5.723278%) --thread*cycles of data dependence: 219578 (4.898216%) --iCache cycles*banks: 4482816 (74.837242% used) Issue breakdown: --thread*cycles of issue worked: 3052626 (68.096176%) --thread*cycles of issue failed: 1128032 (25.163469%) --thread*cycles of issue NOP/other: 302158 (6.740361%) Number of thread-cycles not ready: 219578 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3354784 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 7 2: 8 3: 8 4: 7 5: 9 6: 7 7: 8 8: 8 9: 7 10: 8 11: 7 12: 6 13: 7 14: 7 15: 8 16: 6 17: 6 18: 7 19: 7 20: 7 21: 8 22: 7 23: 8 24: 6 25: 6 26: 7 27: 8 28: 8 29: 5 30: 7 31: 8 <=== Core 74 ===> ---- Thread 00 ---- PC 5: Stalled ----- 99626 in-flight CPI 1.2701 -- Total Cycles 126563 ---- Thread 01 ---- PC 5: Stalled ----- 100615 in-flight CPI 1.2576 -- Total Cycles 126563 ---- Thread 02 ---- PC 5: Stalled ----- 92771 in-flight CPI 1.3640 -- Total Cycles 126563 ---- Thread 03 ---- PC 5: Stalled ----- 99379 in-flight CPI 1.2733 -- Total Cycles 126563 ---- Thread 04 ---- PC 5: Stalled ----- 95942 in-flight CPI 1.3189 -- Total Cycles 126563 ---- Thread 05 ---- PC 5: Stalled ----- 97690 in-flight CPI 1.2953 -- Total Cycles 126563 ---- Thread 06 ---- PC 5: Stalled ----- 102433 in-flight CPI 1.2353 -- Total Cycles 126563 ---- Thread 07 ---- PC 5: Stalled ----- 99301 in-flight CPI 1.2743 -- Total Cycles 126563 ---- Thread 08 ---- PC 5: Stalled ----- 101621 in-flight CPI 1.2452 -- Total Cycles 126563 ---- Thread 09 ---- PC 5: Stalled ----- 98374 in-flight CPI 1.2863 -- Total Cycles 126563 ---- Thread 10 ---- PC 5: Stalled ----- 98771 in-flight CPI 1.2811 -- Total Cycles 126563 ---- Thread 11 ---- PC 5: Stalled ----- 94397 in-flight CPI 1.3405 -- Total Cycles 126563 ---- Thread 12 ---- PC 5: Stalled ----- 95925 in-flight CPI 1.3191 -- Total Cycles 126563 ---- Thread 13 ---- PC 5: Stalled ----- 95716 in-flight CPI 1.3220 -- Total Cycles 126563 ---- Thread 14 ---- PC 5: Stalled ----- 100598 in-flight CPI 1.2579 -- Total Cycles 126563 ---- Thread 15 ---- PC 5: Stalled ----- 95870 in-flight CPI 1.3199 -- Total Cycles 126563 ---- Thread 16 ---- PC 5: Stalled ----- 95126 in-flight CPI 1.3303 -- Total Cycles 126563 ---- Thread 17 ---- PC 5: Stalled ----- 95859 in-flight CPI 1.3200 -- Total Cycles 126563 ---- Thread 18 ---- PC 5: Stalled ----- 94814 in-flight CPI 1.3346 -- Total Cycles 126563 ---- Thread 19 ---- PC 5: Stalled ----- 95013 in-flight CPI 1.3318 -- Total Cycles 126563 ---- Thread 20 ---- PC 5: Stalled ----- 94817 in-flight CPI 1.3346 -- Total Cycles 126563 ---- Thread 21 ---- PC 5: Stalled ----- 89790 in-flight CPI 1.4093 -- Total Cycles 126563 ---- Thread 22 ---- PC 5: Stalled ----- 93727 in-flight CPI 1.3501 -- Total Cycles 126563 ---- Thread 23 ---- PC 5: Stalled ----- 90510 in-flight CPI 1.3981 -- Total Cycles 126563 ---- Thread 24 ---- PC 5: Stalled ----- 90051 in-flight CPI 1.4052 -- Total Cycles 126563 ---- Thread 25 ---- PC 5: Stalled ----- 88961 in-flight CPI 1.4224 -- Total Cycles 126563 ---- Thread 26 ---- PC 5: Stalled ----- 92201 in-flight CPI 1.3724 -- Total Cycles 126563 ---- Thread 27 ---- PC 5: Stalled ----- 91416 in-flight CPI 1.3842 -- Total Cycles 126563 ---- Thread 28 ---- PC 5: Stalled ----- 91137 in-flight CPI 1.3884 -- Total Cycles 126563 ---- Thread 29 ---- PC 5: Stalled ----- 91267 in-flight CPI 1.3865 -- Total Cycles 126563 ---- Thread 30 ---- PC 5: Stalled ----- 84568 in-flight CPI 1.4963 -- Total Cycles 126563 ---- Thread 31 ---- PC 5: Stalled ----- 85816 in-flight CPI 1.4746 -- Total Cycles 126563 Total CPI 0.0417 , IPC 23.9775 -- Total Cycles 126563 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7140 (3.864745%) FPSUB: 0 (0.000000%) FPMUL: 30552 (16.537210%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 68530 (37.093971%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4232 (2.290700%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 66350 (35.913979%) DIV: 7681 (4.157578%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.141816%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3335787 total) ADD%: 7.254 (241980) SUB%: 0.000 (0) MUL%: 0.006 (208) BITOR%: 1.525 (50882) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.511 (17037) FPSUB%: 0.000 (0) FPMUL%: 4.658 (155393) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (624) FPMAX%: 0.019 (624) LOAD%: 5.094 (169938) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (240) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (596) FPINV%: 0.000 (0) FPCONV%: 0.020 (656) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.050 (35015) FPLE%: 0.457 (15228) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (624) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.831 (94443) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.731 (24382) CMPU%: 0.000 (0) RSUB%: 0.006 (208) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.705 (523895) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.177 (39268) ORI%: 1.532 (51103) XORI%: 0.000 (0) MULI%: 3.231 (107774) LW%: 1.411 (47061) LWI%: 13.180 (439655) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9688) SWI%: 4.169 (139063) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.414 (47177) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10399) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1768) bned%: 0.000 (0) bneid%: 13.832 (461393) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.720 (24026) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.113 (3763) DIV%: 0.012 (416) FPUN%: 1.484 (49516) FPRSUB%: 4.119 (137407) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (66) FPGT%: 2.963 (98826) FPGE%: 1.028 (34288) SYNC%: 0.000 (0) NOP%: 9.025 (301061) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 153 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 403 LOAD 38741 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1506 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 2 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49618 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 6 ORI 10111 XORI 0 MULI 10128 LW 0 LWI 143029 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 62 DIV 25 FPUN 0 FPRSUB 47 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.9778 --Total thread-cycles: 4050016 --total thread-cycles issued: 3034726 (74.931213%) --iCache conflicts: 113435 (2.800853%) --thread*cycles of FU dependence: 253900 (6.269111%) --thread*cycles of data dependence: 184747 (4.561636%) --iCache cycles*banks: 4050016 (82.365578% used) Issue breakdown: --thread*cycles of issue worked: 3034726 (74.931213%) --thread*cycles of issue failed: 714229 (17.635214%) --thread*cycles of issue NOP/other: 301061 (7.433576%) Number of thread-cycles not ready: 184747 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3335787 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 9 2: 7 3: 8 4: 7 5: 8 6: 9 7: 8 8: 8 9: 8 10: 8 11: 7 12: 8 13: 8 14: 8 15: 7 16: 6 17: 9 18: 8 19: 7 20: 6 21: 7 22: 7 23: 7 24: 7 25: 8 26: 7 27: 7 28: 8 29: 7 30: 7 31: 6 <=== Core 75 ===> ---- Thread 00 ---- PC 5: Stalled ----- 103226 in-flight CPI 1.3422 -- Total Cycles 138577 ---- Thread 01 ---- PC 5: Stalled ----- 103535 in-flight CPI 1.3382 -- Total Cycles 138577 ---- Thread 02 ---- PC 5: Stalled ----- 96751 in-flight CPI 1.4321 -- Total Cycles 138577 ---- Thread 03 ---- PC 5: Stalled ----- 99207 in-flight CPI 1.3966 -- Total Cycles 138577 ---- Thread 04 ---- PC 5: Stalled ----- 93943 in-flight CPI 1.4749 -- Total Cycles 138577 ---- Thread 05 ---- PC 5: Stalled ----- 102658 in-flight CPI 1.3496 -- Total Cycles 138577 ---- Thread 06 ---- PC 5: Stalled ----- 94883 in-flight CPI 1.4602 -- Total Cycles 138577 ---- Thread 07 ---- PC 5: Stalled ----- 99856 in-flight CPI 1.3875 -- Total Cycles 138577 ---- Thread 08 ---- PC 5: Stalled ----- 99366 in-flight CPI 1.3943 -- Total Cycles 138577 ---- Thread 09 ---- PC 5: Stalled ----- 93671 in-flight CPI 1.4791 -- Total Cycles 138577 ---- Thread 10 ---- PC 5: Stalled ----- 96101 in-flight CPI 1.4417 -- Total Cycles 138577 ---- Thread 11 ---- PC 5: Stalled ----- 97716 in-flight CPI 1.4178 -- Total Cycles 138577 ---- Thread 12 ---- PC 5: Stalled ----- 94185 in-flight CPI 1.4711 -- Total Cycles 138577 ---- Thread 13 ---- PC 5: Stalled ----- 94729 in-flight CPI 1.4626 -- Total Cycles 138577 ---- Thread 14 ---- PC 5: Stalled ----- 94484 in-flight CPI 1.4664 -- Total Cycles 138577 ---- Thread 15 ---- PC 5: Stalled ----- 100735 in-flight CPI 1.3754 -- Total Cycles 138577 ---- Thread 16 ---- PC 5: Stalled ----- 92683 in-flight CPI 1.4949 -- Total Cycles 138577 ---- Thread 17 ---- PC 5: Stalled ----- 91834 in-flight CPI 1.5088 -- Total Cycles 138577 ---- Thread 18 ---- PC 5: Stalled ----- 92310 in-flight CPI 1.5010 -- Total Cycles 138577 ---- Thread 19 ---- PC 5: Stalled ----- 84717 in-flight CPI 1.6356 -- Total Cycles 138577 ---- Thread 20 ---- PC 5: Stalled ----- 94645 in-flight CPI 1.4639 -- Total Cycles 138577 ---- Thread 21 ---- PC 5: Stalled ----- 96600 in-flight CPI 1.4343 -- Total Cycles 138577 ---- Thread 22 ---- PC 5: Stalled ----- 91110 in-flight CPI 1.5208 -- Total Cycles 138577 ---- Thread 23 ---- PC 5: Stalled ----- 98090 in-flight CPI 1.4124 -- Total Cycles 138577 ---- Thread 24 ---- PC 5: Stalled ----- 88426 in-flight CPI 1.5669 -- Total Cycles 138577 ---- Thread 25 ---- PC 5: Stalled ----- 94553 in-flight CPI 1.4653 -- Total Cycles 138577 ---- Thread 26 ---- PC 5: Stalled ----- 87187 in-flight CPI 1.5892 -- Total Cycles 138577 ---- Thread 27 ---- PC 5: Stalled ----- 91277 in-flight CPI 1.5179 -- Total Cycles 138577 ---- Thread 28 ---- PC 5: Stalled ----- 90515 in-flight CPI 1.5307 -- Total Cycles 138577 ---- Thread 29 ---- PC 5: Stalled ----- 91475 in-flight CPI 1.5147 -- Total Cycles 138577 ---- Thread 30 ---- PC 5: Stalled ----- 88882 in-flight CPI 1.5588 -- Total Cycles 138577 ---- Thread 31 ---- PC 5: Stalled ----- 97382 in-flight CPI 1.4228 -- Total Cycles 138577 Total CPI 0.0456 , IPC 21.9176 -- Total Cycles 138577 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7818 (3.637260%) FPSUB: 0 (0.000000%) FPMUL: 31892 (14.837492%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 91983 (42.794334%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3999 (1.860502%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 71602 (33.312244%) DIV: 7392 (3.439067%) FPUN: 0 (0.000000%) FPRSUB: 256 (0.119102%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3338511 total) ADD%: 7.234 (241502) SUB%: 0.000 (0) MUL%: 0.006 (200) BITOR%: 1.525 (50911) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.552 (18417) FPSUB%: 0.000 (0) FPMUL%: 4.779 (159554) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (600) FPMAX%: 0.018 (600) LOAD%: 5.139 (171568) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (232) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (572) FPINV%: 0.000 (0) FPCONV%: 0.019 (632) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.064 (35531) FPLE%: 0.454 (15163) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (600) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.797 (93365) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.746 (24917) CMPU%: 0.000 (0) RSUB%: 0.006 (200) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.668 (523083) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.171 (39086) ORI%: 1.569 (52374) XORI%: 0.000 (0) MULI%: 3.197 (106746) LW%: 1.394 (46525) LWI%: 13.088 (436957) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.287 (9574) SWI%: 4.127 (137787) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.397 (46633) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10358) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.058 (1952) bned%: 0.000 (0) bneid%: 13.793 (460465) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.719 (23991) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.122 (4079) DIV%: 0.012 (400) FPUN%: 1.476 (49291) FPRSUB%: 4.219 (140864) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (71) FPGT%: 2.945 (98308) FPGE%: 1.022 (34128) SYNC%: 0.000 (0) NOP%: 9.021 (301179) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 18 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 147 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 384 LOAD 40014 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 18 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1357 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 9 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49186 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 12 ORI 11167 XORI 0 MULI 9635 LW 0 LWI 142270 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 63 DIV 16 FPUN 0 FPRSUB 52 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 21.9179 --Total thread-cycles: 4434464 --total thread-cycles issued: 3037332 (68.493774%) --iCache conflicts: 111906 (2.523552%) --thread*cycles of FU dependence: 254368 (5.736161%) --thread*cycles of data dependence: 214942 (4.847079%) --iCache cycles*banks: 4434464 (75.286278% used) Issue breakdown: --thread*cycles of issue worked: 3037332 (68.493774%) --thread*cycles of issue failed: 1095953 (24.714441%) --thread*cycles of issue NOP/other: 301179 (6.791780%) Number of thread-cycles not ready: 214942 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3338511 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 6 3: 8 4: 6 5: 8 6: 8 7: 8 8: 8 9: 7 10: 8 11: 9 12: 7 13: 8 14: 8 15: 8 16: 7 17: 6 18: 5 19: 5 20: 8 21: 7 22: 6 23: 9 24: 7 25: 8 26: 6 27: 7 28: 8 29: 6 30: 7 31: 6 <=== Core 76 ===> ---- Thread 00 ---- PC 5: Stalled ----- 102075 in-flight CPI 1.2641 -- Total Cycles 129051 ---- Thread 01 ---- PC 5: Stalled ----- 99867 in-flight CPI 1.2919 -- Total Cycles 129051 ---- Thread 02 ---- PC 5: Stalled ----- 98016 in-flight CPI 1.3164 -- Total Cycles 129051 ---- Thread 03 ---- PC 5: Stalled ----- 96090 in-flight CPI 1.3428 -- Total Cycles 129051 ---- Thread 04 ---- PC 5: Stalled ----- 93929 in-flight CPI 1.3737 -- Total Cycles 129051 ---- Thread 05 ---- PC 5: Stalled ----- 99838 in-flight CPI 1.2924 -- Total Cycles 129051 ---- Thread 06 ---- PC 5: Stalled ----- 100992 in-flight CPI 1.2776 -- Total Cycles 129051 ---- Thread 07 ---- PC 5: Stalled ----- 98999 in-flight CPI 1.3033 -- Total Cycles 129051 ---- Thread 08 ---- PC 5: Stalled ----- 99112 in-flight CPI 1.3019 -- Total Cycles 129051 ---- Thread 09 ---- PC 5: Stalled ----- 96328 in-flight CPI 1.3395 -- Total Cycles 129051 ---- Thread 10 ---- PC 5: Stalled ----- 100406 in-flight CPI 1.2851 -- Total Cycles 129051 ---- Thread 11 ---- PC 5: Stalled ----- 96995 in-flight CPI 1.3303 -- Total Cycles 129051 ---- Thread 12 ---- PC 5: Stalled ----- 96862 in-flight CPI 1.3321 -- Total Cycles 129051 ---- Thread 13 ---- PC 5: Stalled ----- 98709 in-flight CPI 1.3071 -- Total Cycles 129051 ---- Thread 14 ---- PC 5: Stalled ----- 96788 in-flight CPI 1.3331 -- Total Cycles 129051 ---- Thread 15 ---- PC 5: Stalled ----- 99584 in-flight CPI 1.2957 -- Total Cycles 129051 ---- Thread 16 ---- PC 5: Stalled ----- 95475 in-flight CPI 1.3514 -- Total Cycles 129051 ---- Thread 17 ---- PC 5: Stalled ----- 94808 in-flight CPI 1.3609 -- Total Cycles 129051 ---- Thread 18 ---- PC 5: Stalled ----- 98021 in-flight CPI 1.3163 -- Total Cycles 129051 ---- Thread 19 ---- PC 5: Stalled ----- 97462 in-flight CPI 1.3239 -- Total Cycles 129051 ---- Thread 20 ---- PC 5: Stalled ----- 94699 in-flight CPI 1.3625 -- Total Cycles 129051 ---- Thread 21 ---- PC 5: Stalled ----- 93800 in-flight CPI 1.3756 -- Total Cycles 129051 ---- Thread 22 ---- PC 5: Stalled ----- 87406 in-flight CPI 1.4762 -- Total Cycles 129051 ---- Thread 23 ---- PC 5: Stalled ----- 95001 in-flight CPI 1.3581 -- Total Cycles 129051 ---- Thread 24 ---- PC 5: Stalled ----- 94652 in-flight CPI 1.3632 -- Total Cycles 129051 ---- Thread 25 ---- PC 5: Stalled ----- 90316 in-flight CPI 1.4287 -- Total Cycles 129051 ---- Thread 26 ---- PC 5: Stalled ----- 90250 in-flight CPI 1.4296 -- Total Cycles 129051 ---- Thread 27 ---- PC 5: Stalled ----- 85860 in-flight CPI 1.5028 -- Total Cycles 129051 ---- Thread 28 ---- PC 5: Stalled ----- 86168 in-flight CPI 1.4975 -- Total Cycles 129051 ---- Thread 29 ---- PC 5: Stalled ----- 94640 in-flight CPI 1.3634 -- Total Cycles 129051 ---- Thread 30 ---- PC 5: Stalled ----- 90210 in-flight CPI 1.4303 -- Total Cycles 129051 ---- Thread 31 ---- PC 5: Stalled ----- 88237 in-flight CPI 1.4622 -- Total Cycles 129051 Total CPI 0.0423 , IPC 23.6507 -- Total Cycles 129051 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7627 (3.899863%) FPSUB: 0 (0.000000%) FPMUL: 31478 (16.095432%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 75278 (38.491394%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4054 (2.072904%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69411 (35.491459%) DIV: 7465 (3.817028%) FPUN: 0 (0.000000%) FPRSUB: 258 (0.131921%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3354958 total) ADD%: 7.111 (238584) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.533 (51447) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.535 (17962) FPSUB%: 0.000 (0) FPMUL%: 4.733 (158791) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.131 (172156) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (579) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.058 (35483) FPLE%: 0.460 (15442) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.819 (94582) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.743 (24925) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.724 (527541) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.179 (39566) ORI%: 1.549 (51954) XORI%: 0.000 (0) MULI%: 3.216 (107906) LW%: 1.405 (47148) LWI%: 13.131 (440537) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.289 (9682) SWI%: 4.147 (139128) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.409 (47259) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.311 (10422) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.053 (1772) bned%: 0.000 (0) bneid%: 13.833 (464088) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24191) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.118 (3947) DIV%: 0.012 (404) FPUN%: 1.485 (49835) FPRSUB%: 4.179 (140187) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (65) FPGT%: 2.952 (99035) FPGE%: 1.025 (34393) SYNC%: 0.000 (0) NOP%: 9.024 (302757) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 9 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 152 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 40172 INTCONV 0 ATOMIC_INC 30 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1177 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 13 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49674 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10791 XORI 0 MULI 9357 LW 0 LWI 143414 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 49 DIV 24 FPUN 0 FPRSUB 49 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.6509 --Total thread-cycles: 4129632 --total thread-cycles issued: 3052201 (73.909760%) --iCache conflicts: 115173 (2.788941%) --thread*cycles of FU dependence: 255368 (6.183795%) --thread*cycles of data dependence: 195571 (4.735797%) --iCache cycles*banks: 4129632 (81.241867% used) Issue breakdown: --thread*cycles of issue worked: 3052201 (73.909760%) --thread*cycles of issue failed: 774674 (18.758911%) --thread*cycles of issue NOP/other: 302757 (7.331331%) Number of thread-cycles not ready: 195571 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3354958 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 7 1: 9 2: 8 3: 8 4: 6 5: 8 6: 8 7: 8 8: 7 9: 7 10: 7 11: 6 12: 6 13: 8 14: 7 15: 8 16: 8 17: 8 18: 8 19: 8 20: 7 21: 7 22: 6 23: 8 24: 7 25: 6 26: 8 27: 7 28: 5 29: 7 30: 8 31: 8 <=== Core 77 ===> ---- Thread 00 ---- PC 5: Stalled ----- 96609 in-flight CPI 1.3258 -- Total Cycles 128112 ---- Thread 01 ---- PC 5: Stalled ----- 100568 in-flight CPI 1.2736 -- Total Cycles 128112 ---- Thread 02 ---- PC 5: Stalled ----- 94899 in-flight CPI 1.3498 -- Total Cycles 128112 ---- Thread 03 ---- PC 5: Stalled ----- 102110 in-flight CPI 1.2544 -- Total Cycles 128112 ---- Thread 04 ---- PC 5: Stalled ----- 97044 in-flight CPI 1.3198 -- Total Cycles 128112 ---- Thread 05 ---- PC 5: Stalled ----- 102859 in-flight CPI 1.2453 -- Total Cycles 128112 ---- Thread 06 ---- PC 5: Stalled ----- 101958 in-flight CPI 1.2563 -- Total Cycles 128112 ---- Thread 07 ---- PC 5: Stalled ----- 96146 in-flight CPI 1.3322 -- Total Cycles 128112 ---- Thread 08 ---- PC 5: Stalled ----- 101651 in-flight CPI 1.2601 -- Total Cycles 128112 ---- Thread 09 ---- PC 5: Stalled ----- 99946 in-flight CPI 1.2816 -- Total Cycles 128112 ---- Thread 10 ---- PC 5: Stalled ----- 95864 in-flight CPI 1.3362 -- Total Cycles 128112 ---- Thread 11 ---- PC 5: Stalled ----- 99048 in-flight CPI 1.2932 -- Total Cycles 128112 ---- Thread 12 ---- PC 5: Stalled ----- 98116 in-flight CPI 1.3055 -- Total Cycles 128112 ---- Thread 13 ---- PC 5: Stalled ----- 90978 in-flight CPI 1.4079 -- Total Cycles 128112 ---- Thread 14 ---- PC 5: Stalled ----- 97530 in-flight CPI 1.3133 -- Total Cycles 128112 ---- Thread 15 ---- PC 5: Stalled ----- 98936 in-flight CPI 1.2946 -- Total Cycles 128112 ---- Thread 16 ---- PC 5: Stalled ----- 93678 in-flight CPI 1.3673 -- Total Cycles 128112 ---- Thread 17 ---- PC 5: Stalled ----- 93531 in-flight CPI 1.3695 -- Total Cycles 128112 ---- Thread 18 ---- PC 5: Stalled ----- 97086 in-flight CPI 1.3193 -- Total Cycles 128112 ---- Thread 19 ---- PC 5: Stalled ----- 98809 in-flight CPI 1.2963 -- Total Cycles 128112 ---- Thread 20 ---- PC 5: Stalled ----- 91953 in-flight CPI 1.3929 -- Total Cycles 128112 ---- Thread 21 ---- PC 5: Stalled ----- 94859 in-flight CPI 1.3503 -- Total Cycles 128112 ---- Thread 22 ---- PC 5: Stalled ----- 95276 in-flight CPI 1.3444 -- Total Cycles 128112 ---- Thread 23 ---- PC 5: Stalled ----- 92202 in-flight CPI 1.3893 -- Total Cycles 128112 ---- Thread 24 ---- PC 5: Stalled ----- 87591 in-flight CPI 1.4623 -- Total Cycles 128112 ---- Thread 25 ---- PC 5: Stalled ----- 88024 in-flight CPI 1.4552 -- Total Cycles 128112 ---- Thread 26 ---- PC 5: Stalled ----- 92732 in-flight CPI 1.3813 -- Total Cycles 128112 ---- Thread 27 ---- PC 5: Stalled ----- 90757 in-flight CPI 1.4113 -- Total Cycles 128112 ---- Thread 28 ---- PC 5: Stalled ----- 93651 in-flight CPI 1.3677 -- Total Cycles 128112 ---- Thread 29 ---- PC 5: Stalled ----- 91763 in-flight CPI 1.3958 -- Total Cycles 128112 ---- Thread 30 ---- PC 5: Stalled ----- 84315 in-flight CPI 1.5192 -- Total Cycles 128112 ---- Thread 31 ---- PC 5: Stalled ----- 88374 in-flight CPI 1.4493 -- Total Cycles 128112 Total CPI 0.0420 , IPC 23.8029 -- Total Cycles 128112 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 8025 (4.234002%) FPSUB: 0 (0.000000%) FPMUL: 32224 (17.001429%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 63692 (33.603992%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4471 (2.358906%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 73110 (38.572945%) DIV: 7749 (4.088384%) FPUN: 0 (0.000000%) FPRSUB: 266 (0.140342%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3351485 total) ADD%: 7.183 (240751) SUB%: 0.000 (0) MUL%: 0.006 (210) BITOR%: 1.526 (51136) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.563 (18881) FPSUB%: 0.000 (0) FPMUL%: 4.804 (161012) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (630) FPMAX%: 0.019 (630) LOAD%: 5.146 (172469) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (242) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (612) FPINV%: 0.000 (0) FPCONV%: 0.020 (662) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.067 (35776) FPLE%: 0.456 (15269) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (630) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.798 (93785) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.749 (25107) CMPU%: 0.000 (0) RSUB%: 0.006 (210) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.671 (525197) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.174 (39361) ORI%: 1.563 (52368) XORI%: 0.000 (0) MULI%: 3.198 (107164) LW%: 1.396 (46786) LWI%: 13.087 (438625) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.286 (9569) SWI%: 4.135 (138570) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.400 (46914) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.308 (10327) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.055 (1854) bned%: 0.000 (0) bneid%: 13.793 (462283) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.713 (23905) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.124 (4159) DIV%: 0.013 (420) FPUN%: 1.474 (49415) FPRSUB%: 4.226 (141647) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (78) FPGT%: 2.942 (98597) FPGE%: 1.019 (34146) SYNC%: 0.000 (0) NOP%: 9.011 (301992) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 25 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 154 FPSUB 0 FPMUL 1 FPCMPLT 0 FPMIN 0 FPMAX 408 LOAD 39733 INTCONV 0 ATOMIC_INC 19 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 20 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1469 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49363 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 14 ORI 11416 XORI 0 MULI 9572 LW 0 LWI 143009 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 73 DIV 21 FPUN 0 FPRSUB 51 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.8032 --Total thread-cycles: 4099584 --total thread-cycles issued: 3049493 (74.385422%) --iCache conflicts: 114229 (2.786356%) --thread*cycles of FU dependence: 255386 (6.229559%) --thread*cycles of data dependence: 189537 (4.623323%) --iCache cycles*banks: 4099584 (81.752609% used) Issue breakdown: --thread*cycles of issue worked: 3049493 (74.385422%) --thread*cycles of issue failed: 748099 (18.248169%) --thread*cycles of issue NOP/other: 301992 (7.366406%) Number of thread-cycles not ready: 189537 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3351485 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 9 1: 8 2: 5 3: 8 4: 10 5: 8 6: 8 7: 7 8: 8 9: 8 10: 7 11: 8 12: 7 13: 7 14: 7 15: 9 16: 7 17: 7 18: 8 19: 9 20: 8 21: 7 22: 7 23: 6 24: 7 25: 7 26: 7 27: 7 28: 8 29: 9 30: 6 31: 8 <=== Core 78 ===> ---- Thread 00 ---- PC 5: Stalled ----- 101852 in-flight CPI 1.4590 -- Total Cycles 148629 ---- Thread 01 ---- PC 5: Stalled ----- 101187 in-flight CPI 1.4686 -- Total Cycles 148629 ---- Thread 02 ---- PC 5: Stalled ----- 97606 in-flight CPI 1.5224 -- Total Cycles 148629 ---- Thread 03 ---- PC 5: Stalled ----- 102011 in-flight CPI 1.4567 -- Total Cycles 148629 ---- Thread 04 ---- PC 5: Stalled ----- 110563 in-flight CPI 1.3441 -- Total Cycles 148629 ---- Thread 05 ---- PC 5: Stalled ----- 100638 in-flight CPI 1.4766 -- Total Cycles 148629 ---- Thread 06 ---- PC 5: Stalled ----- 99526 in-flight CPI 1.4931 -- Total Cycles 148629 ---- Thread 07 ---- PC 5: Stalled ----- 95194 in-flight CPI 1.5610 -- Total Cycles 148629 ---- Thread 08 ---- PC 5: Stalled ----- 95284 in-flight CPI 1.5595 -- Total Cycles 148629 ---- Thread 09 ---- PC 5: Stalled ----- 100590 in-flight CPI 1.4772 -- Total Cycles 148629 ---- Thread 10 ---- PC 5: Stalled ----- 95302 in-flight CPI 1.5593 -- Total Cycles 148629 ---- Thread 11 ---- PC 5: Stalled ----- 99644 in-flight CPI 1.4913 -- Total Cycles 148629 ---- Thread 12 ---- PC 5: Stalled ----- 94182 in-flight CPI 1.5778 -- Total Cycles 148629 ---- Thread 13 ---- PC 5: Stalled ----- 96705 in-flight CPI 1.5366 -- Total Cycles 148629 ---- Thread 14 ---- PC 5: Stalled ----- 93739 in-flight CPI 1.5852 -- Total Cycles 148629 ---- Thread 15 ---- PC 5: Stalled ----- 98760 in-flight CPI 1.5047 -- Total Cycles 148629 ---- Thread 16 ---- PC 5: Stalled ----- 98581 in-flight CPI 1.5074 -- Total Cycles 148629 ---- Thread 17 ---- PC 5: Stalled ----- 95818 in-flight CPI 1.5509 -- Total Cycles 148629 ---- Thread 18 ---- PC 5: Stalled ----- 102784 in-flight CPI 1.4458 -- Total Cycles 148629 ---- Thread 19 ---- PC 5: Stalled ----- 95073 in-flight CPI 1.5630 -- Total Cycles 148629 ---- Thread 20 ---- PC 5: Stalled ----- 93626 in-flight CPI 1.5872 -- Total Cycles 148629 ---- Thread 21 ---- PC 5: Stalled ----- 95963 in-flight CPI 1.5486 -- Total Cycles 148629 ---- Thread 22 ---- PC 5: Stalled ----- 93825 in-flight CPI 1.5838 -- Total Cycles 148629 ---- Thread 23 ---- PC 5: Stalled ----- 86432 in-flight CPI 1.7193 -- Total Cycles 148629 ---- Thread 24 ---- PC 5: Stalled ----- 93824 in-flight CPI 1.5838 -- Total Cycles 148629 ---- Thread 25 ---- PC 5: Stalled ----- 86812 in-flight CPI 1.7118 -- Total Cycles 148629 ---- Thread 26 ---- PC 5: Stalled ----- 90160 in-flight CPI 1.6482 -- Total Cycles 148629 ---- Thread 27 ---- PC 5: Stalled ----- 90574 in-flight CPI 1.6407 -- Total Cycles 148629 ---- Thread 28 ---- PC 5: Stalled ----- 88122 in-flight CPI 1.6864 -- Total Cycles 148629 ---- Thread 29 ---- PC 5: Stalled ----- 92244 in-flight CPI 1.6110 -- Total Cycles 148629 ---- Thread 30 ---- PC 5: Stalled ----- 84709 in-flight CPI 1.7543 -- Total Cycles 148629 ---- Thread 31 ---- PC 5: Stalled ----- 87348 in-flight CPI 1.7013 -- Total Cycles 148629 Total CPI 0.0486 , IPC 20.5830 -- Total Cycles 148629 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7708 (3.694278%) FPSUB: 0 (0.000000%) FPMUL: 31536 (15.114524%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 86922 (41.659840%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 3857 (1.848577%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 70900 (33.980839%) DIV: 7462 (3.576375%) FPUN: 0 (0.000000%) FPRSUB: 262 (0.125571%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3362082 total) ADD%: 7.217 (242645) SUB%: 0.000 (0) MUL%: 0.006 (202) BITOR%: 1.536 (51626) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.539 (18124) FPSUB%: 0.000 (0) FPMUL%: 4.737 (159249) FPCMPLT%: 0.000 (0) FPMIN%: 0.018 (606) FPMAX%: 0.018 (606) LOAD%: 5.150 (173149) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (234) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.017 (568) FPINV%: 0.000 (0) FPCONV%: 0.019 (638) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.056 (35500) FPLE%: 0.460 (15461) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.018 (606) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.812 (94537) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.748 (25165) CMPU%: 0.000 (0) RSUB%: 0.006 (202) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.716 (528369) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.181 (39701) ORI%: 1.554 (52255) XORI%: 0.000 (0) MULI%: 3.201 (107634) LW%: 1.400 (47062) LWI%: 13.076 (439633) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.290 (9741) SWI%: 4.145 (139345) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.403 (47162) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.312 (10495) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.051 (1712) bned%: 0.000 (0) bneid%: 13.809 (464267) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.717 (24097) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.120 (4036) DIV%: 0.012 (404) FPUN%: 1.483 (49861) FPRSUB%: 4.190 (140881) FPSQRT%: 0.000 (0) FPNEG%: 0.002 (64) FPGT%: 2.943 (98951) FPGE%: 1.023 (34400) SYNC%: 0.000 (0) NOP%: 9.006 (302798) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 16 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 156 FPSUB 0 FPMUL 2 FPCMPLT 0 FPMIN 0 FPMAX 396 LOAD 39428 INTCONV 0 ATOMIC_INC 16 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 17 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1411 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 8 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49561 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 17 ORI 10948 XORI 0 MULI 9508 LW 0 LWI 143239 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 70 DIV 22 FPUN 0 FPRSUB 55 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 20.5832 --Total thread-cycles: 4756128 --total thread-cycles issued: 3059284 (64.322998%) --iCache conflicts: 111416 (2.342578%) --thread*cycles of FU dependence: 254898 (5.359359%) --thread*cycles of data dependence: 208647 (4.386909%) --iCache cycles*banks: 4756128 (70.690147% used) Issue breakdown: --thread*cycles of issue worked: 3059284 (64.322998%) --thread*cycles of issue failed: 1394046 (29.310524%) --thread*cycles of issue NOP/other: 302798 (6.366482%) Number of thread-cycles not ready: 208647 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3362082 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 6 5: 8 6: 8 7: 8 8: 9 9: 9 10: 6 11: 8 12: 8 13: 8 14: 8 15: 8 16: 7 17: 7 18: 6 19: 7 20: 7 21: 7 22: 8 23: 7 24: 7 25: 6 26: 7 27: 7 28: 6 29: 7 30: 6 31: 6 <=== Core 79 ===> ---- Thread 00 ---- PC 5: Stalled ----- 98033 in-flight CPI 1.3373 -- Total Cycles 131122 ---- Thread 01 ---- PC 5: Stalled ----- 94908 in-flight CPI 1.3813 -- Total Cycles 131122 ---- Thread 02 ---- PC 5: Stalled ----- 102302 in-flight CPI 1.2815 -- Total Cycles 131122 ---- Thread 03 ---- PC 5: Stalled ----- 101977 in-flight CPI 1.2856 -- Total Cycles 131122 ---- Thread 04 ---- PC 5: Stalled ----- 98618 in-flight CPI 1.3293 -- Total Cycles 131122 ---- Thread 05 ---- PC 5: Stalled ----- 95670 in-flight CPI 1.3703 -- Total Cycles 131122 ---- Thread 06 ---- PC 5: Stalled ----- 102624 in-flight CPI 1.2774 -- Total Cycles 131122 ---- Thread 07 ---- PC 5: Stalled ----- 103633 in-flight CPI 1.2650 -- Total Cycles 131122 ---- Thread 08 ---- PC 5: Stalled ----- 98300 in-flight CPI 1.3337 -- Total Cycles 131122 ---- Thread 09 ---- PC 5: Stalled ----- 98634 in-flight CPI 1.3292 -- Total Cycles 131122 ---- Thread 10 ---- PC 5: Stalled ----- 97241 in-flight CPI 1.3482 -- Total Cycles 131122 ---- Thread 11 ---- PC 5: Stalled ----- 97425 in-flight CPI 1.3456 -- Total Cycles 131122 ---- Thread 12 ---- PC 5: Stalled ----- 95068 in-flight CPI 1.3790 -- Total Cycles 131122 ---- Thread 13 ---- PC 5: Stalled ----- 96128 in-flight CPI 1.3638 -- Total Cycles 131122 ---- Thread 14 ---- PC 5: Stalled ----- 91161 in-flight CPI 1.4381 -- Total Cycles 131122 ---- Thread 15 ---- PC 5: Stalled ----- 97380 in-flight CPI 1.3462 -- Total Cycles 131122 ---- Thread 16 ---- PC 5: Stalled ----- 96779 in-flight CPI 1.3546 -- Total Cycles 131122 ---- Thread 17 ---- PC 5: Stalled ----- 98234 in-flight CPI 1.3346 -- Total Cycles 131122 ---- Thread 18 ---- PC 5: Stalled ----- 96830 in-flight CPI 1.3539 -- Total Cycles 131122 ---- Thread 19 ---- PC 5: Stalled ----- 94441 in-flight CPI 1.3881 -- Total Cycles 131122 ---- Thread 20 ---- PC 5: Stalled ----- 91764 in-flight CPI 1.4287 -- Total Cycles 131122 ---- Thread 21 ---- PC 5: Stalled ----- 92990 in-flight CPI 1.4098 -- Total Cycles 131122 ---- Thread 22 ---- PC 5: Stalled ----- 97001 in-flight CPI 1.3514 -- Total Cycles 131122 ---- Thread 23 ---- PC 5: Stalled ----- 91736 in-flight CPI 1.4291 -- Total Cycles 131122 ---- Thread 24 ---- PC 5: Stalled ----- 93303 in-flight CPI 1.4051 -- Total Cycles 131122 ---- Thread 25 ---- PC 5: Stalled ----- 94505 in-flight CPI 1.3872 -- Total Cycles 131122 ---- Thread 26 ---- PC 5: Stalled ----- 93127 in-flight CPI 1.4077 -- Total Cycles 131122 ---- Thread 27 ---- PC 5: Stalled ----- 94328 in-flight CPI 1.3897 -- Total Cycles 131122 ---- Thread 28 ---- PC 5: Stalled ----- 88194 in-flight CPI 1.4865 -- Total Cycles 131122 ---- Thread 29 ---- PC 5: Stalled ----- 91516 in-flight CPI 1.4325 -- Total Cycles 131122 ---- Thread 30 ---- PC 5: Stalled ----- 90991 in-flight CPI 1.4408 -- Total Cycles 131122 ---- Thread 31 ---- PC 5: Stalled ----- 89989 in-flight CPI 1.4568 -- Total Cycles 131122 Total CPI 0.0428 , IPC 23.3784 -- Total Cycles 131122 kernel thread(called, cycles) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Data dependence stalls (caused by): ADD: 0 (0.000000%) SUB: 0 (0.000000%) MUL: 0 (0.000000%) BITOR: 0 (0.000000%) BITAND: 0 (0.000000%) BITSLEFT: 0 (0.000000%) BITSRIGHT: 0 (0.000000%) FPADD: 7508 (3.898579%) FPSUB: 0 (0.000000%) FPMUL: 31294 (16.249617%) FPCMPLT: 0 (0.000000%) FPMIN: 0 (0.000000%) FPMAX: 0 (0.000000%) LOAD: 71479 (37.115944%) INTCONV: 0 (0.000000%) ATOMIC_INC: 0 (0.000000%) INC_RESET: 0 (0.000000%) BARRIER: 0 (0.000000%) GLOBAL_READ: 0 (0.000000%) ATOMIC_ADD: 0 (0.000000%) ATOMIC_FPADD: 0 (0.000000%) FPINVSQRT: 4485 (2.328866%) FPINV: 0 (0.000000%) FPCONV: 0 (0.000000%) FPEQ: 0 (0.000000%) FPNE: 0 (0.000000%) FPLT: 0 (0.000000%) FPLE: 0 (0.000000%) EQ: 0 (0.000000%) NE: 0 (0.000000%) LT: 0 (0.000000%) LE: 0 (0.000000%) BNZ: 0 (0.000000%) LOADL1: 0 (0.000000%) STORE: 0 (0.000000%) LOADIMM: 0 (0.000000%) SPHERE_TEST: 0 (0.000000%) TRITEST: 0 (0.000000%) MOV: 0 (0.000000%) MOVINDRD: 0 (0.000000%) MOVINDWR: 0 (0.000000%) BLT: 0 (0.000000%) BET: 0 (0.000000%) JMP: 0 (0.000000%) JMPREG: 0 (0.000000%) JAL: 0 (0.000000%) RAND: 0 (0.000000%) COS: 0 (0.000000%) SIN: 0 (0.000000%) ADDC: 0 (0.000000%) ADDK: 0 (0.000000%) ADDKC: 0 (0.000000%) BITXOR: 0 (0.000000%) ANDN: 0 (0.000000%) CMP: 0 (0.000000%) CMPU: 0 (0.000000%) RSUB: 0 (0.000000%) RSUBC: 0 (0.000000%) RSUBK: 0 (0.000000%) RSUBKC: 0 (0.000000%) MULH: 0 (0.000000%) MULHU: 0 (0.000000%) sra: 0 (0.000000%) srl: 0 (0.000000%) ADDI: 0 (0.000000%) ADDIC: 0 (0.000000%) ADDIK: 0 (0.000000%) ADDIKC: 0 (0.000000%) RSUBI: 0 (0.000000%) RSUBIC: 0 (0.000000%) RSUBIK: 0 (0.000000%) RSUBIKC: 0 (0.000000%) ANDNI: 0 (0.000000%) ANDI: 0 (0.000000%) ORI: 0 (0.000000%) XORI: 0 (0.000000%) MULI: 0 (0.000000%) LW: 0 (0.000000%) LWI: 0 (0.000000%) lbu: 0 (0.000000%) lbui: 0 (0.000000%) SW: 0 (0.000000%) SWI: 0 (0.000000%) sb: 0 (0.000000%) sbi: 0 (0.000000%) beqd: 0 (0.000000%) beqid: 0 (0.000000%) bged: 0 (0.000000%) bgeid: 0 (0.000000%) bgtd: 0 (0.000000%) bgtid: 0 (0.000000%) bled: 0 (0.000000%) bleid: 0 (0.000000%) bltd: 0 (0.000000%) bltid: 0 (0.000000%) bned: 0 (0.000000%) bneid: 0 (0.000000%) brd: 0 (0.000000%) brad: 0 (0.000000%) brld: 0 (0.000000%) brald: 0 (0.000000%) brid: 0 (0.000000%) braid: 0 (0.000000%) brlid: 0 (0.000000%) bralid: 0 (0.000000%) brk: 0 (0.000000%) brki: 0 (0.000000%) rtsd: 0 (0.000000%) FPDIV: 69572 (36.125721%) DIV: 7971 (4.138995%) FPUN: 0 (0.000000%) FPRSUB: 274 (0.142276%) FPSQRT: 0 (0.000000%) FPNEG: 0 (0.000000%) FPGT: 0 (0.000000%) FPGE: 0 (0.000000%) SYNC: 0 (0.000000%) NOP: 0 (0.000000%) HALT: 0 (0.000000%) PRINT: 0 (0.000000%) PROF: 0 (0.000000%) Dynamic Instruction Mix: (3369457 total) ADD%: 7.250 (244274) SUB%: 0.000 (0) MUL%: 0.006 (216) BITOR%: 1.525 (51372) BITAND%: 0.000 (0) BITSLEFT%: 0.000 (0) BITSRIGHT%: 0.000 (0) FPADD%: 0.529 (17819) FPSUB%: 0.000 (0) FPMUL%: 4.705 (158528) FPCMPLT%: 0.000 (0) FPMIN%: 0.019 (648) FPMAX%: 0.019 (648) LOAD%: 5.097 (171747) INTCONV%: 0.000 (0) ATOMIC_INC%: 0.007 (248) INC_RESET%: 0.000 (0) BARRIER%: 0.000 (0) GLOBAL_READ%: 0.000 (0) ATOMIC_ADD%: 0.000 (0) ATOMIC_FPADD%: 0.000 (0) FPINVSQRT%: 0.018 (622) FPINV%: 0.000 (0) FPCONV%: 0.020 (680) FPEQ%: 0.000 (0) FPNE%: 0.000 (0) FPLT%: 1.056 (35568) FPLE%: 0.452 (15223) EQ%: 0.000 (0) NE%: 0.000 (0) LT%: 0.000 (0) LE%: 0.000 (0) BNZ%: 0.000 (0) LOADL1%: 0.000 (0) STORE%: 0.019 (648) LOADIMM%: 0.001 (32) SPHERE_TEST%: 0.000 (0) TRITEST%: 0.000 (0) MOV%: 0.000 (0) MOVINDRD%: 0.000 (0) MOVINDWR%: 0.000 (0) BLT%: 0.000 (0) BET%: 0.000 (0) JMP%: 0.000 (0) JMPREG%: 0.000 (0) JAL%: 0.000 (0) RAND%: 0.000 (0) COS%: 0.000 (0) SIN%: 0.000 (0) ADDC%: 0.000 (0) ADDK%: 2.817 (94916) ADDKC%: 0.000 (0) BITXOR%: 0.000 (0) ANDN%: 0.000 (0) CMP%: 0.735 (24755) CMPU%: 0.000 (0) RSUB%: 0.006 (216) RSUBC%: 0.000 (0) RSUBK%: 0.000 (0) RSUBKC%: 0.000 (0) MULH%: 0.000 (0) MULHU%: 0.000 (0) sra%: 0.000 (0) srl%: 0.000 (0) ADDI%: 15.670 (527981) ADDIC%: 0.000 (0) ADDIK%: 0.000 (0) ADDIKC%: 0.000 (0) RSUBI%: 0.000 (0) RSUBIC%: 0.000 (0) RSUBIK%: 0.000 (0) RSUBIKC%: 0.000 (0) ANDNI%: 0.000 (0) ANDI%: 1.173 (39507) ORI%: 1.555 (52383) XORI%: 0.000 (0) MULI%: 3.219 (108470) LW%: 1.405 (47331) LWI%: 13.160 (443407) lbu%: 0.000 (0) lbui%: 0.000 (0) SW%: 0.288 (9705) SWI%: 4.168 (140437) sb%: 0.000 (0) sbi%: 0.000 (0) beqd%: 0.000 (0) beqid%: 1.408 (47457) bged%: 0.000 (0) bgeid%: 0.000 (0) bgtd%: 0.000 (0) bgtid%: 0.310 (10456) bled%: 0.000 (0) bleid%: 0.000 (0) bltd%: 0.000 (0) bltid%: 0.056 (1900) bned%: 0.000 (0) bneid%: 13.810 (465316) brd%: 0.000 (0) brad%: 0.000 (0) brld%: 0.000 (0) brald%: 0.000 (0) brid%: 0.721 (24310) braid%: 0.000 (0) brlid%: 0.001 (32) bralid%: 0.000 (0) brk%: 0.000 (0) brki%: 0.000 (0) rtsd%: 0.001 (32) FPDIV%: 0.117 (3946) DIV%: 0.013 (432) FPUN%: 1.483 (49961) FPRSUB%: 4.151 (139875) FPSQRT%: 0.000 (0) FPNEG%: 0.003 (86) FPGT%: 2.955 (99556) FPGE%: 1.031 (34738) SYNC%: 0.000 (0) NOP%: 9.022 (303979) HALT%: 0.000 (0) PRINT%: 0.000 (0) PROF%: 0.000 (0) Number of thread-cycles contention found when issuing: ADD 0 SUB 0 MUL 17 BITOR 0 BITAND 0 BITSLEFT 0 BITSRIGHT 0 FPADD 159 FPSUB 0 FPMUL 5 FPCMPLT 0 FPMIN 0 FPMAX 418 LOAD 38715 INTCONV 0 ATOMIC_INC 18 INC_RESET 0 BARRIER 0 GLOBAL_READ 0 ATOMIC_ADD 0 ATOMIC_FPADD 0 FPINVSQRT 14 FPINV 0 FPCONV 0 FPEQ 0 FPNE 0 FPLT 0 FPLE 0 EQ 0 NE 0 LT 0 LE 0 BNZ 0 LOADL1 0 STORE 1370 LOADIMM 0 SPHERE_TEST 0 TRITEST 0 MOV 0 MOVINDRD 0 MOVINDWR 0 BLT 0 BET 0 JMP 0 JMPREG 0 JAL 0 RAND 0 COS 0 SIN 0 ADDC 0 ADDK 0 ADDKC 0 BITXOR 0 ANDN 0 CMP 12 CMPU 0 RSUB 0 RSUBC 0 RSUBK 0 RSUBKC 0 MULH 0 MULHU 0 sra 0 srl 0 ADDI 49856 ADDIC 0 ADDIK 0 ADDIKC 0 RSUBI 0 RSUBIC 0 RSUBIK 0 RSUBIKC 0 ANDNI 0 ANDI 13 ORI 10615 XORI 0 MULI 9555 LW 0 LWI 144305 lbu 0 lbui 0 SW 0 SWI 0 sb 0 sbi 0 beqd 0 beqid 0 bged 0 bgeid 0 bgtd 0 bgtid 0 bled 0 bleid 0 bltd 0 bltid 0 bned 0 bneid 0 brd 0 brad 0 brld 0 brald 0 brid 0 braid 0 brlid 0 bralid 0 brk 0 brki 0 rtsd 0 FPDIV 73 DIV 24 FPUN 0 FPRSUB 32 FPSQRT 0 FPNEG 0 FPGT 0 FPGE 0 SYNC 0 NOP 0 HALT 0 PRINT 0 PROF 0 --Average #threads Issuing each cycle: 23.3786 --Total thread-cycles: 4195904 --total thread-cycles issued: 3065478 (73.058823%) --iCache conflicts: 115144 (2.744200%) --thread*cycles of FU dependence: 255224 (6.082694%) --thread*cycles of data dependence: 192583 (4.589786%) --iCache cycles*banks: 4195904 (80.304245% used) Issue breakdown: --thread*cycles of issue worked: 3065478 (73.058823%) --thread*cycles of issue failed: 826447 (19.696518%) --thread*cycles of issue NOP/other: 303979 (7.244661%) Number of thread-cycles not ready: 192583 Number of thread-cycles not fetched: 0 SIMD stalls when issuing: 0 SIMD issues: 3369457 SIMD fetches beyond the first: 0 ATOMIC_INC called by threads: 0: 8 1: 8 2: 8 3: 8 4: 9 5: 8 6: 9 7: 9 8: 7 9: 6 10: 7 11: 8 12: 8 13: 7 14: 7 15: 8 16: 7 17: 7 18: 8 19: 8 20: 7 21: 8 22: 9 23: 7 24: 7 25: 8 26: 8 27: 9 28: 7 29: 8 30: 7 31: 8 ## Core 0 ## Module Utilization FP AddSub: 15.34 FP MinMax: 0.03 FP Compare: 5.61 Int AddSub: 21.22 FP Mul: 15.39 Int Mul: 41.17 FP InvSqrt: 0.45 FP Div: 3.45 Conversion Unit: 0.02 ## Core 1 ## Module Utilization FP AddSub: 13.84 FP MinMax: 0.03 FP Compare: 4.84 Int AddSub: 18.36 FP Mul: 13.78 Int Mul: 35.39 FP InvSqrt: 0.38 FP Div: 3.31 Conversion Unit: 0.01 ## Core 2 ## Module Utilization FP AddSub: 15.12 FP MinMax: 0.03 FP Compare: 5.57 Int AddSub: 21.06 FP Mul: 15.15 Int Mul: 40.94 FP InvSqrt: 0.46 FP Div: 3.42 Conversion Unit: 0.02 ## Core 3 ## Module Utilization FP AddSub: 14.22 FP MinMax: 0.03 FP Compare: 5.03 Int AddSub: 19.12 FP Mul: 14.17 Int Mul: 36.79 FP InvSqrt: 0.39 FP Div: 3.34 Conversion Unit: 0.01 ## Core 4 ## Module Utilization FP AddSub: 15.54 FP MinMax: 0.03 FP Compare: 5.71 Int AddSub: 21.57 FP Mul: 15.62 Int Mul: 42.06 FP InvSqrt: 0.47 FP Div: 3.49 Conversion Unit: 0.02 ## Core 5 ## Module Utilization FP AddSub: 15.10 FP MinMax: 0.03 FP Compare: 5.64 Int AddSub: 21.39 FP Mul: 15.18 Int Mul: 41.66 FP InvSqrt: 0.47 FP Div: 3.33 Conversion Unit: 0.02 ## Core 6 ## Module Utilization FP AddSub: 15.08 FP MinMax: 0.03 FP Compare: 5.66 Int AddSub: 21.41 FP Mul: 15.15 Int Mul: 41.88 FP InvSqrt: 0.47 FP Div: 3.32 Conversion Unit: 0.02 ## Core 7 ## Module Utilization FP AddSub: 15.58 FP MinMax: 0.03 FP Compare: 5.64 Int AddSub: 21.38 FP Mul: 15.61 Int Mul: 41.43 FP InvSqrt: 0.46 FP Div: 3.56 Conversion Unit: 0.02 ## Core 8 ## Module Utilization FP AddSub: 13.96 FP MinMax: 0.03 FP Compare: 5.07 Int AddSub: 19.22 FP Mul: 13.98 Int Mul: 37.28 FP InvSqrt: 0.41 FP Div: 3.18 Conversion Unit: 0.01 ## Core 9 ## Module Utilization FP AddSub: 15.49 FP MinMax: 0.03 FP Compare: 5.63 Int AddSub: 21.29 FP Mul: 15.51 Int Mul: 41.17 FP InvSqrt: 0.45 FP Div: 3.54 Conversion Unit: 0.02 ## Core 10 ## Module Utilization FP AddSub: 14.44 FP MinMax: 0.03 FP Compare: 5.12 Int AddSub: 19.43 FP Mul: 14.42 Int Mul: 37.59 FP InvSqrt: 0.41 FP Div: 3.37 Conversion Unit: 0.01 ## Core 11 ## Module Utilization FP AddSub: 15.24 FP MinMax: 0.03 FP Compare: 5.46 Int AddSub: 20.73 FP Mul: 15.23 Int Mul: 39.95 FP InvSqrt: 0.43 FP Div: 3.54 Conversion Unit: 0.01 ## Core 12 ## Module Utilization FP AddSub: 14.21 FP MinMax: 0.03 FP Compare: 5.08 Int AddSub: 19.27 FP Mul: 14.22 Int Mul: 37.32 FP InvSqrt: 0.41 FP Div: 3.29 Conversion Unit: 0.01 ## Core 13 ## Module Utilization FP AddSub: 14.80 FP MinMax: 0.03 FP Compare: 5.64 Int AddSub: 21.37 FP Mul: 14.91 Int Mul: 41.85 FP InvSqrt: 0.48 FP Div: 3.21 Conversion Unit: 0.02 ## Core 14 ## Module Utilization FP AddSub: 15.30 FP MinMax: 0.03 FP Compare: 5.76 Int AddSub: 21.88 FP Mul: 15.40 Int Mul: 42.71 FP InvSqrt: 0.46 FP Div: 3.31 Conversion Unit: 0.02 ## Core 15 ## Module Utilization FP AddSub: 11.75 FP MinMax: 0.02 FP Compare: 4.29 Int AddSub: 16.30 FP Mul: 11.78 Int Mul: 31.63 FP InvSqrt: 0.36 FP Div: 2.66 Conversion Unit: 0.01 ## Core 16 ## Module Utilization FP AddSub: 15.85 FP MinMax: 0.03 FP Compare: 5.58 Int AddSub: 21.18 FP Mul: 15.80 Int Mul: 41.00 FP InvSqrt: 0.45 FP Div: 3.75 Conversion Unit: 0.02 ## Core 17 ## Module Utilization FP AddSub: 12.38 FP MinMax: 0.02 FP Compare: 4.51 Int AddSub: 17.13 FP Mul: 12.40 Int Mul: 33.23 FP InvSqrt: 0.36 FP Div: 2.80 Conversion Unit: 0.01 ## Core 18 ## Module Utilization FP AddSub: 15.06 FP MinMax: 0.03 FP Compare: 5.73 Int AddSub: 21.71 FP Mul: 15.15 Int Mul: 42.52 FP InvSqrt: 0.47 FP Div: 3.24 Conversion Unit: 0.02 ## Core 19 ## Module Utilization FP AddSub: 15.79 FP MinMax: 0.03 FP Compare: 5.69 Int AddSub: 21.57 FP Mul: 15.81 Int Mul: 41.79 FP InvSqrt: 0.47 FP Div: 3.63 Conversion Unit: 0.02 ## Core 20 ## Module Utilization FP AddSub: 15.38 FP MinMax: 0.03 FP Compare: 5.68 Int AddSub: 21.51 FP Mul: 15.43 Int Mul: 41.99 FP InvSqrt: 0.46 FP Div: 3.41 Conversion Unit: 0.02 ## Core 21 ## Module Utilization FP AddSub: 14.13 FP MinMax: 0.03 FP Compare: 5.06 Int AddSub: 19.15 FP Mul: 14.14 Int Mul: 37.06 FP InvSqrt: 0.41 FP Div: 3.26 Conversion Unit: 0.01 ## Core 22 ## Module Utilization FP AddSub: 12.84 FP MinMax: 0.03 FP Compare: 4.70 Int AddSub: 17.83 FP Mul: 12.86 Int Mul: 34.60 FP InvSqrt: 0.38 FP Div: 2.92 Conversion Unit: 0.01 ## Core 23 ## Module Utilization FP AddSub: 14.48 FP MinMax: 0.03 FP Compare: 5.18 Int AddSub: 19.67 FP Mul: 14.47 Int Mul: 38.02 FP InvSqrt: 0.39 FP Div: 3.33 Conversion Unit: 0.01 ## Core 24 ## Module Utilization FP AddSub: 12.38 FP MinMax: 0.02 FP Compare: 4.48 Int AddSub: 16.99 FP Mul: 12.39 Int Mul: 32.93 FP InvSqrt: 0.35 FP Div: 2.81 Conversion Unit: 0.01 ## Core 25 ## Module Utilization FP AddSub: 15.84 FP MinMax: 0.03 FP Compare: 5.76 Int AddSub: 21.75 FP Mul: 15.90 Int Mul: 42.21 FP InvSqrt: 0.48 FP Div: 3.63 Conversion Unit: 0.02 ## Core 26 ## Module Utilization FP AddSub: 15.31 FP MinMax: 0.03 FP Compare: 5.68 Int AddSub: 21.55 FP Mul: 15.38 Int Mul: 42.05 FP InvSqrt: 0.47 FP Div: 3.39 Conversion Unit: 0.02 ## Core 27 ## Module Utilization FP AddSub: 12.84 FP MinMax: 0.02 FP Compare: 4.44 Int AddSub: 16.92 FP Mul: 12.75 Int Mul: 32.58 FP InvSqrt: 0.35 FP Div: 3.09 Conversion Unit: 0.01 ## Core 28 ## Module Utilization FP AddSub: 15.45 FP MinMax: 0.03 FP Compare: 5.71 Int AddSub: 21.55 FP Mul: 15.52 Int Mul: 42.03 FP InvSqrt: 0.47 FP Div: 3.44 Conversion Unit: 0.02 ## Core 29 ## Module Utilization FP AddSub: 15.58 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.29 FP Mul: 15.62 Int Mul: 41.22 FP InvSqrt: 0.46 FP Div: 3.57 Conversion Unit: 0.02 ## Core 30 ## Module Utilization FP AddSub: 15.58 FP MinMax: 0.03 FP Compare: 5.64 Int AddSub: 21.39 FP Mul: 15.59 Int Mul: 41.42 FP InvSqrt: 0.45 FP Div: 3.55 Conversion Unit: 0.02 ## Core 31 ## Module Utilization FP AddSub: 15.49 FP MinMax: 0.03 FP Compare: 5.73 Int AddSub: 21.66 FP Mul: 15.56 Int Mul: 42.13 FP InvSqrt: 0.47 FP Div: 3.47 Conversion Unit: 0.02 ## Core 32 ## Module Utilization FP AddSub: 15.44 FP MinMax: 0.03 FP Compare: 5.69 Int AddSub: 21.58 FP Mul: 15.51 Int Mul: 42.00 FP InvSqrt: 0.46 FP Div: 3.45 Conversion Unit: 0.02 ## Core 33 ## Module Utilization FP AddSub: 15.55 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.43 FP Mul: 15.57 Int Mul: 41.67 FP InvSqrt: 0.46 FP Div: 3.53 Conversion Unit: 0.02 ## Core 34 ## Module Utilization FP AddSub: 15.38 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.34 FP Mul: 15.43 Int Mul: 41.59 FP InvSqrt: 0.47 FP Div: 3.48 Conversion Unit: 0.02 ## Core 35 ## Module Utilization FP AddSub: 15.44 FP MinMax: 0.03 FP Compare: 5.58 Int AddSub: 21.14 FP Mul: 15.44 Int Mul: 40.98 FP InvSqrt: 0.45 FP Div: 3.53 Conversion Unit: 0.02 ## Core 36 ## Module Utilization FP AddSub: 15.76 FP MinMax: 0.03 FP Compare: 5.61 Int AddSub: 21.25 FP Mul: 15.73 Int Mul: 41.17 FP InvSqrt: 0.44 FP Div: 3.68 Conversion Unit: 0.02 ## Core 37 ## Module Utilization FP AddSub: 15.61 FP MinMax: 0.03 FP Compare: 5.68 Int AddSub: 21.54 FP Mul: 15.62 Int Mul: 41.79 FP InvSqrt: 0.46 FP Div: 3.56 Conversion Unit: 0.02 ## Core 38 ## Module Utilization FP AddSub: 15.42 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.47 FP Mul: 15.47 Int Mul: 41.78 FP InvSqrt: 0.48 FP Div: 3.50 Conversion Unit: 0.02 ## Core 39 ## Module Utilization FP AddSub: 13.99 FP MinMax: 0.03 FP Compare: 5.13 Int AddSub: 19.51 FP Mul: 14.01 Int Mul: 37.82 FP InvSqrt: 0.40 FP Div: 3.13 Conversion Unit: 0.01 ## Core 40 ## Module Utilization FP AddSub: 15.55 FP MinMax: 0.03 FP Compare: 5.61 Int AddSub: 21.28 FP Mul: 15.55 Int Mul: 41.20 FP InvSqrt: 0.45 FP Div: 3.55 Conversion Unit: 0.02 ## Core 41 ## Module Utilization FP AddSub: 15.69 FP MinMax: 0.03 FP Compare: 5.79 Int AddSub: 21.95 FP Mul: 15.72 Int Mul: 42.76 FP InvSqrt: 0.47 FP Div: 3.50 Conversion Unit: 0.02 ## Core 42 ## Module Utilization FP AddSub: 15.48 FP MinMax: 0.03 FP Compare: 5.39 Int AddSub: 20.49 FP Mul: 15.41 Int Mul: 39.38 FP InvSqrt: 0.42 FP Div: 3.71 Conversion Unit: 0.01 ## Core 43 ## Module Utilization FP AddSub: 13.88 FP MinMax: 0.03 FP Compare: 5.12 Int AddSub: 19.44 FP Mul: 13.93 Int Mul: 37.77 FP InvSqrt: 0.42 FP Div: 3.10 Conversion Unit: 0.01 ## Core 44 ## Module Utilization FP AddSub: 15.51 FP MinMax: 0.03 FP Compare: 5.69 Int AddSub: 21.53 FP Mul: 15.54 Int Mul: 41.84 FP InvSqrt: 0.45 FP Div: 3.47 Conversion Unit: 0.02 ## Core 45 ## Module Utilization FP AddSub: 14.22 FP MinMax: 0.03 FP Compare: 5.17 Int AddSub: 19.64 FP Mul: 14.22 Int Mul: 38.18 FP InvSqrt: 0.42 FP Div: 3.24 Conversion Unit: 0.01 ## Core 46 ## Module Utilization FP AddSub: 15.54 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.44 FP Mul: 15.55 Int Mul: 41.68 FP InvSqrt: 0.46 FP Div: 3.53 Conversion Unit: 0.02 ## Core 47 ## Module Utilization FP AddSub: 15.23 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.61 FP Mul: 15.28 Int Mul: 42.07 FP InvSqrt: 0.44 FP Div: 3.31 Conversion Unit: 0.02 ## Core 48 ## Module Utilization FP AddSub: 15.29 FP MinMax: 0.03 FP Compare: 5.57 Int AddSub: 21.21 FP Mul: 15.32 Int Mul: 41.19 FP InvSqrt: 0.46 FP Div: 3.46 Conversion Unit: 0.02 ## Core 49 ## Module Utilization FP AddSub: 15.05 FP MinMax: 0.03 FP Compare: 5.66 Int AddSub: 21.46 FP Mul: 15.16 Int Mul: 41.90 FP InvSqrt: 0.49 FP Div: 3.32 Conversion Unit: 0.02 ## Core 50 ## Module Utilization FP AddSub: 15.08 FP MinMax: 0.03 FP Compare: 5.60 Int AddSub: 21.26 FP Mul: 15.14 Int Mul: 41.52 FP InvSqrt: 0.45 FP Div: 3.30 Conversion Unit: 0.02 ## Core 51 ## Module Utilization FP AddSub: 15.41 FP MinMax: 0.03 FP Compare: 5.70 Int AddSub: 21.58 FP Mul: 15.47 Int Mul: 41.93 FP InvSqrt: 0.46 FP Div: 3.44 Conversion Unit: 0.02 ## Core 52 ## Module Utilization FP AddSub: 15.88 FP MinMax: 0.03 FP Compare: 5.73 Int AddSub: 21.80 FP Mul: 15.89 Int Mul: 42.11 FP InvSqrt: 0.46 FP Div: 3.65 Conversion Unit: 0.02 ## Core 53 ## Module Utilization FP AddSub: 15.25 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.51 FP Mul: 15.31 Int Mul: 41.87 FP InvSqrt: 0.46 FP Div: 3.36 Conversion Unit: 0.02 ## Core 54 ## Module Utilization FP AddSub: 14.06 FP MinMax: 0.03 FP Compare: 5.16 Int AddSub: 19.61 FP Mul: 14.11 Int Mul: 38.12 FP InvSqrt: 0.42 FP Div: 3.15 Conversion Unit: 0.01 ## Core 55 ## Module Utilization FP AddSub: 15.58 FP MinMax: 0.03 FP Compare: 5.60 Int AddSub: 21.29 FP Mul: 15.56 Int Mul: 41.22 FP InvSqrt: 0.44 FP Div: 3.58 Conversion Unit: 0.02 ## Core 56 ## Module Utilization FP AddSub: 15.60 FP MinMax: 0.03 FP Compare: 5.64 Int AddSub: 21.31 FP Mul: 15.63 Int Mul: 41.26 FP InvSqrt: 0.46 FP Div: 3.59 Conversion Unit: 0.02 ## Core 57 ## Module Utilization FP AddSub: 15.38 FP MinMax: 0.03 FP Compare: 5.64 Int AddSub: 21.39 FP Mul: 15.43 Int Mul: 41.48 FP InvSqrt: 0.45 FP Div: 3.44 Conversion Unit: 0.02 ## Core 58 ## Module Utilization FP AddSub: 15.50 FP MinMax: 0.03 FP Compare: 5.72 Int AddSub: 21.72 FP Mul: 15.57 Int Mul: 42.28 FP InvSqrt: 0.48 FP Div: 3.48 Conversion Unit: 0.02 ## Core 59 ## Module Utilization FP AddSub: 13.55 FP MinMax: 0.03 FP Compare: 4.79 Int AddSub: 18.19 FP Mul: 13.52 Int Mul: 35.06 FP InvSqrt: 0.38 FP Div: 3.21 Conversion Unit: 0.01 ## Core 60 ## Module Utilization FP AddSub: 15.26 FP MinMax: 0.03 FP Compare: 5.62 Int AddSub: 21.30 FP Mul: 15.31 Int Mul: 41.55 FP InvSqrt: 0.46 FP Div: 3.41 Conversion Unit: 0.02 ## Core 61 ## Module Utilization FP AddSub: 15.52 FP MinMax: 0.03 FP Compare: 5.57 Int AddSub: 21.12 FP Mul: 15.51 Int Mul: 40.98 FP InvSqrt: 0.46 FP Div: 3.61 Conversion Unit: 0.02 ## Core 62 ## Module Utilization FP AddSub: 15.22 FP MinMax: 0.03 FP Compare: 5.66 Int AddSub: 21.51 FP Mul: 15.30 Int Mul: 41.87 FP InvSqrt: 0.46 FP Div: 3.35 Conversion Unit: 0.02 ## Core 63 ## Module Utilization FP AddSub: 14.25 FP MinMax: 0.03 FP Compare: 5.23 Int AddSub: 19.86 FP Mul: 14.29 Int Mul: 38.68 FP InvSqrt: 0.43 FP Div: 3.21 Conversion Unit: 0.01 ## Core 64 ## Module Utilization FP AddSub: 15.37 FP MinMax: 0.03 FP Compare: 5.47 Int AddSub: 20.77 FP Mul: 15.34 Int Mul: 40.23 FP InvSqrt: 0.44 FP Div: 3.58 Conversion Unit: 0.02 ## Core 65 ## Module Utilization FP AddSub: 14.25 FP MinMax: 0.03 FP Compare: 5.07 Int AddSub: 19.32 FP Mul: 14.25 Int Mul: 37.31 FP InvSqrt: 0.41 FP Div: 3.30 Conversion Unit: 0.01 ## Core 66 ## Module Utilization FP AddSub: 15.09 FP MinMax: 0.03 FP Compare: 5.65 Int AddSub: 21.41 FP Mul: 15.17 Int Mul: 41.84 FP InvSqrt: 0.47 FP Div: 3.31 Conversion Unit: 0.02 ## Core 67 ## Module Utilization FP AddSub: 15.61 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.50 FP Mul: 15.62 Int Mul: 41.68 FP InvSqrt: 0.46 FP Div: 3.57 Conversion Unit: 0.02 ## Core 68 ## Module Utilization FP AddSub: 13.40 FP MinMax: 0.02 FP Compare: 4.71 Int AddSub: 17.88 FP Mul: 13.36 Int Mul: 34.42 FP InvSqrt: 0.39 FP Div: 3.19 Conversion Unit: 0.01 ## Core 69 ## Module Utilization FP AddSub: 14.76 FP MinMax: 0.03 FP Compare: 5.35 Int AddSub: 20.25 FP Mul: 14.79 Int Mul: 39.37 FP InvSqrt: 0.42 FP Div: 3.35 Conversion Unit: 0.01 ## Core 70 ## Module Utilization FP AddSub: 15.64 FP MinMax: 0.03 FP Compare: 5.61 Int AddSub: 21.30 FP Mul: 15.67 Int Mul: 41.30 FP InvSqrt: 0.46 FP Div: 3.60 Conversion Unit: 0.02 ## Core 71 ## Module Utilization FP AddSub: 16.16 FP MinMax: 0.03 FP Compare: 5.73 Int AddSub: 21.67 FP Mul: 16.13 Int Mul: 41.93 FP InvSqrt: 0.45 FP Div: 3.80 Conversion Unit: 0.02 ## Core 72 ## Module Utilization FP AddSub: 15.63 FP MinMax: 0.03 FP Compare: 5.72 Int AddSub: 21.61 FP Mul: 15.68 Int Mul: 41.95 FP InvSqrt: 0.46 FP Div: 3.55 Conversion Unit: 0.02 ## Core 73 ## Module Utilization FP AddSub: 14.60 FP MinMax: 0.03 FP Compare: 5.18 Int AddSub: 19.71 FP Mul: 14.57 Int Mul: 38.22 FP InvSqrt: 0.41 FP Div: 3.39 Conversion Unit: 0.01 ## Core 74 ## Module Utilization FP AddSub: 15.25 FP MinMax: 0.03 FP Compare: 5.75 Int AddSub: 21.85 FP Mul: 15.35 Int Mul: 42.66 FP InvSqrt: 0.47 FP Div: 3.30 Conversion Unit: 0.02 ## Core 75 ## Module Utilization FP AddSub: 14.37 FP MinMax: 0.03 FP Compare: 5.24 Int AddSub: 19.91 FP Mul: 14.39 Int Mul: 38.59 FP InvSqrt: 0.41 FP Div: 3.23 Conversion Unit: 0.01 ## Core 76 ## Module Utilization FP AddSub: 15.32 FP MinMax: 0.03 FP Compare: 5.67 Int AddSub: 21.45 FP Mul: 15.38 Int Mul: 41.89 FP InvSqrt: 0.45 FP Div: 3.37 Conversion Unit: 0.02 ## Core 77 ## Module Utilization FP AddSub: 15.66 FP MinMax: 0.03 FP Compare: 5.69 Int AddSub: 21.59 FP Mul: 15.71 Int Mul: 41.91 FP InvSqrt: 0.48 FP Div: 3.57 Conversion Unit: 0.02 ## Core 78 ## Module Utilization FP AddSub: 13.37 FP MinMax: 0.03 FP Compare: 4.92 Int AddSub: 18.73 FP Mul: 13.39 Int Mul: 36.28 FP InvSqrt: 0.38 FP Div: 2.99 Conversion Unit: 0.01 ## Core 79 ## Module Utilization FP AddSub: 15.03 FP MinMax: 0.03 FP Compare: 5.60 Int AddSub: 21.26 FP Mul: 15.11 Int Mul: 41.44 FP InvSqrt: 0.47 FP Div: 3.34 Conversion Unit: 0.02 L1 accesses: 13882201 L1 hits: 13146998 L1 misses: 735203 L1 bank conflicts: 2991026 L1 stores: 49152 L1 near hit: 0 L1 hit rate: 0.947040 -= L2 #0 =- L2 accesses: 182698 L2 hits: 158175 L2 misses: 24523 L2 stores: 12348 L2 bank conflicts: 22273 L2 hit rate: 0.865773 L2 memory faults: 462 L2 bandwidth limited stalls: 31406 -= L2 #1 =- L2 accesses: 181579 L2 hits: 157630 L2 misses: 23949 L2 stores: 12228 L2 bank conflicts: 21823 L2 hit rate: 0.868107 L2 memory faults: 456 L2 bandwidth limited stalls: 32147 -= L2 #2 =- L2 accesses: 182400 L2 hits: 158278 L2 misses: 24122 L2 stores: 12273 L2 bank conflicts: 22385 L2 hit rate: 0.867752 L2 memory faults: 391 L2 bandwidth limited stalls: 32095 -= L2 #3 =- L2 accesses: 186747 L2 hits: 162139 L2 misses: 24608 L2 stores: 12303 L2 bank conflicts: 23564 L2 hit rate: 0.868228 L2 memory faults: 470 L2 bandwidth limited stalls: 35428 Bandwidth numbers for 1000MHz clock: register to L1 bandwidth: 324456611840.000000 L1 to L2 bandwidth: 548533796864.000000 L2 to memory bandwidth: 72698167296.000000 Core size: 0.9818 L2 size: 0.0000 4-L2 size: 0.0000 80-core chip size: 78.5458 FPS Statistics: FPS assuming 1000MHz clock: 5843.0328