![neoload yaml nlp neoload yaml nlp](https://www.neotys.com/wp-content/uploads/2020/03/Screen-Shot-2020-03-04-at-10.08.42-AM.png)
There's no symbol associated with that call target, so I guess you didn't compile with debug info enabled. There is a a function call inside the timed region, callq 403c0b (as well as the _kmpc_end_serialized_parallel stuff).
![neoload yaml nlp neoload yaml nlp](https://blog.paperspace.com/content/images/size/w1050/2019/12/yaml.png)
See FLOPS per cycle for sandy-bridge and haswell SSE2/AVX/AVX2 Your Skylake-derived CPU can actually do 2x 4-wide SIMD double-precision FMA operations per core per clock, and each FMA counts as two FLOPs, so theoretical max = 16 double-precision FLOPs per core clock, so 24 * 16 = 384 GFLOP/S.
![neoload yaml nlp neoload yaml nlp](https://d28h099uturm62.cloudfront.net/wp-content/uploads/2019/12/SwaggerEditorwoutbrowser-2.png)
_tag_value_Z12do_timed_runRKmRd.281:ġ FP operation per core clock cycle would be pathetic for a modern superscalar CPU. _tag_value_Z12do_timed_runRKmRd.279:Ĭall _kmpc_end_serialized_parallel #123.5