I am performing time evolution through fitApplyMPO, however, the process is very slow compared to the DMRG algorithm.
In an attempt to speed up the calculation, I have tried adding compile flags -march=native and -O2 when compiling my own program through CMake, but doing this actually slows down the whole program. The README mentions linking the options.mk file, but I don't know how to achieve this in CMake.
Which flags/setting should I use in my CMakeList in order for my program to utilize ITensor to its fullest?
Note, I have compiled ITensor with Intel MKL.