+1 vote
asked by (180 points)
edited by

This is a question about the Julia ITensors package (https://github.com/ITensor/ITensors.jl), which I am using to run DMRG on various Hamiltonians to find the ground state and low-lying excited states.

When you create a Hamiltonian on which to use DMRG, you need to create an AutoMPO object, add terms to it, and then run the command "H = MPO(ampo,sites)". However, I have observed confusing discrepancies in runtime and memory usage across different Hamiltonians.

For example, when I run the following code to create the Hamiltonian for the J1-J2 model (https://en.wikipedia.org/wiki/J1_J2_model) on N = 10^4 (resulting in 60,000 MPO terms), the code takes only 1 minute to run on my desktop.

using ITensors

J1 = 1
J2 = 0.5
N = 10^4
sites = siteinds("S=1/2",N)
ampo = AutoMPO()
for j=1:N-1
  add!(ampo,J1,"Sx",j,"Sx",j+1)
  add!(ampo,J1,"Sy",j,"Sy",j+1)
  add!(ampo,J1,"Sz",j,"Sz",j+1)
end
for j=1:N-2
  add!(ampo,J2,"Sx",j,"Sx",j+2)
  add!(ampo,J2,"Sy",j,"Sy",j+2)
  add!(ampo,J2,"Sz",j,"Sz",j+2)
end
add!(ampo,J1,"Sx",N,"Sx",1)
add!(ampo,J1,"Sy",N,"Sy",1)
add!(ampo,J1,"Sz",N,"Sz",1)
add!(ampo,J2,"Sx",N-1,"Sx",1)
add!(ampo,J2,"Sy",N-1,"Sy",1)
add!(ampo,J2,"Sz",N-1,"Sz",1)
add!(ampo,J2,"Sx",N,"Sx",2)
add!(ampo,J2,"Sy",N,"Sy",2)
add!(ampo,J2,"Sz",N,"Sz",2)
H = MPO(ampo,sites)

However, when I run the following code to create the Hamiltonian for the lattice formulation of the Schwinger model (equation 2.6 on page 4 of https://arxiv.org/abs/1305.3765v2) on N = 60 (resulting in 1,948 MPO terms), the code takes 2 hours and 20 minutes to run on my desktop. Again, this is just to create the Hamiltonian, not even to run DMRG.

using ITensors

m_over_g = 0 # m, g are Schwinger model parameters
x = 25 # x = 1/(g^2*a^2), a = lattice spacing
μ = 2 * m_over_g * sqrt(x) # μ = 2m/(g^2*a) = 2*(m/g)*sqrt(x)
N = 60
sites = siteinds("S=1/2",N)
ampo_H = AutoMPO()
trace = N*μ/2 + N^2/8
add!(ampo_H,2*2 * trace,"Sz*Sz",1) # cheat designed to add trace term to Hamiltonian
for j = 1:N
  if j != N
    add!(ampo_H,x,"S+",j,"S-",j+1) # S+ and S- are the same as σ+ and σ-
    add!(ampo_H,x,"S-",j,"S+",j+1)
    # S operators are 1/2 of Pauli matrices, so I have put factors of 2 to compensate
    add!(ampo_H,2 * trunc((N-j+1)/2)/2,"Sz",j)
    for i=1:j-1
      add!(ampo_H,2*2 * (N-j)/2,"Sz",i,"Sz",j)
    end
  end
  add!(ampo_H,2 * μ*(-1)^(j-1)/2,"Sz",j)
end
H = MPO(ampo_H, sites)

It is worth mentioning that this used to take a lot less time. Up until a week ago, this would take perhaps 5-10 minutes, but ever since I downloaded one of the more recent versions of the package last Thursday, it has been running much slower.

In addition, if I change N = 60 to N = 100 in the above code (resulting in 5248 terms), the code eventually throws an OutOfMemoryError. This also seems weird to me, since there are still far fewer MPO terms, and each individual term is not any more complicated than the terms in the J1-J2 model Hamiltonian.

The obvious difference between these two is that the Schwinger model Hamiltonian is intensely non-local, having an interaction between almost every pair of spins. However, I thought this only significantly affected the speed of DMRG, not the memory usage or speed of the creation of the Hamiltonian itself. Any insights into what factors govern the speed and memory usage of the creation of the Hamiltonian would be greatly appreciated.

1 Answer

0 votes
answered by (41.2k points)

Hi, thanks for this question. Also congrats on asking what I think is the first Julia-related question on this forum!

Your question is a good one, about what sort of inputs work well with AutoMPO or not. Without going far into the details of the algorithm, the main steps of it are:
(1) collect and sort terms (meaning sort each term in site order)
(2a) create an uncompressed, sparse version of the MPO
(2b) create a matrix at each bond which contains the coefficients weighting each operator string to the left and right of that bond (note that these matrices are highly redundant, and are not MPO matrices)
(3) SVD all of the matrices in (2b), and truncate singular values which are zero. Use the truncated U and V^\dagger matrices to compress the "big", uncompressed sparse MPO from (2a)
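As a rough illustration of what step (3) does (this is just a sketch of the idea in plain Julia, not the actual AutoMPO internals; the matrix C here is made up), the coefficient matrices typically have much lower rank than their dimensions suggest, and an SVD exposes that:

    using LinearAlgebra

    # Hypothetical coefficient matrix for one bond: rows label operator
    # strings on the left of the bond, columns label strings on the right.
    # Constructed to have rank 8 even though it is 50x60.
    C = randn(50, 8) * randn(8, 60)

    F = svd(C)
    # drop singular values that are numerically zero
    keep = findall(s -> s > 1e-12 * F.S[1], F.S)
    U, S, Vt = F.U[:, keep], F.S[keep], F.Vt[keep, :]

    # The truncated U and Vt are what compress the sparse MPO on that bond
    @assert length(keep) == 8
    @assert U * Diagonal(S) * Vt ≈ C

The compressed bond dimension is the kept rank (8 here) rather than the number of distinct operator strings (50 or 60), which is where the savings come from.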

So I believe the bottleneck you are hitting comes from either (2a) or (3). I'd have to do profiling to see. There are two hypothetical improvements we could do:
(2a+) don't make the full uncompressed MPO all at once, even in a sparse form
(3+) don't make uncompressed matrices with all of the coefficients
What I mean here loosely is that there's a different way of coding the AutoMPO algorithm which builds the (2b) matrices on some bond j already in the compressed basis of bond j-1.

Anyway, you can see that it's technical, and right now I can't guarantee that I know how to do the above improvements in a general way and that they will work.

Unfortunately, this means your input is currently outside the design scope of AutoMPO. But I appreciate you filing an issue, because if there was some change in recent versions that caused the slowdown, we'll see if we can find it and revert it to at least alleviate your issue.

But I think your best bet is actually this: for highly unusual and non-local Hamiltonians, it's really best to invest in hand-crafting an MPO. A few years ago I was working on quantum chemistry DMRG calculations, which also involve highly non-local terms, and had to make custom MPOs quite often. So it's necessary for such challenging cases until we can make AutoMPO an even more general solution. (Though I'm sure people will still continue to find cases it can't quite handle!)

It's great that you are using ITensor to study interesting systems outside of the usual condensed matter setting. Again, we'll keep your issue open and try to at least alleviate the memory usage and ultimately later on devise a better-scaling algorithm, though it's sort of an open area of algorithm research for us still.

Best,
Miles

commented by (180 points)
Thank you so much! Sorry for the late reply--I thought I had notifications on for this question, so I expected to get an email if anyone replied.

I'm going to continue to look into the earlier versions of the Julia code to see whether one of the older versions indeed worked better for this Hamiltonian. I had never thought about the idea of hand-crafting an MPO, so thank you for that suggestion! I guess I have a lot to think about.
commented by (41.2k points)
Hi Sujay,
I took a bit more time to think about your specific Hamiltonian and I’m fairly convinced about two things:

(1) this is a tough case for AutoMPO, the way it currently works, so I’m not surprised it eats up a lot of time and memory
(2) the good news is: I’m fairly sure your Hamiltonian has a very compact MPO representation
As far as I can see from a brief reading of the code, the only term which is troublesome and non-local is the one where the site values are named “i” and “j” and where i < j but can otherwise take any value between 1 and j-1.

For a term like that, there ought to be a simple way to construct an MPO, by using the “finite state automaton” picture of how MPOs work. (If you don’t know this picture, I could send you some references.) In your case, the automaton would have the following internal states (this is just for making that one type of term, not the others):

we haven’t placed any operators yet
we have placed the “i” operator now, but not the “j” one yet
we have now placed “j”, and are done
For the off-diagonal elements in the MPO that do the transitions, the ones to take the most care with are the elements that do the transition from state 2 to state 3, since their coefficients go like (N-j). But since the MPO has “knowledge” of what site it’s on (in the sense that each MPO tensor can be different for a finite system), you can just set the coefficient to be proportional to (N-j) by hand.
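To make this concrete, here is a schematic check of that automaton for the long-range term alone. This is not ITensor code, just plain-Julia block matrices of 2x2 operators contracted by hand for a tiny N; the coefficient 2(N-j) comes from the 2*2*(N-j)/2 factor in the question's code:

    using LinearAlgebra

    Sz = [0.5 0.0; 0.0 -0.5]
    Id = [1.0 0.0; 0.0 1.0]
    Z  = zeros(2, 2)

    N = 4
    c(j) = 2.0 * (N - j)   # coefficient of the "closing" Sz at site j

    # Automaton states: (1) nothing placed, (2) "i" operator placed, (3) done.
    # W[left_state, right_state] is the operator applied at this site.
    function Wj(j)
        W = fill(Z, 3, 3)
        W[1,1] = Id; W[1,2] = Sz            # wait, or start the string
        W[2,2] = Id; W[2,3] = c(j) * Sz     # carry the string, or close it
        W[3,3] = Id                          # done
        return W
    end

    # Block-matrix product where scalar multiplication is replaced by kron
    bprod(A, B) = [sum(kron(A[i,l], B[l,j]) for l in 1:size(A,2))
                   for i in 1:size(A,1), j in 1:size(B,2)]

    M = reduce(bprod, [Wj(j) for j in 1:N])
    H_mpo = M[1, 3]   # enter in state 1 on the left, exit in state 3 on the right

    # Direct construction for comparison: sum over i < j of c(j) * Sz_i * Sz_j
    site_op(ops) = reduce(kron, [get(ops, n, Id) for n in 1:N])
    H_direct = sum(c(j) * site_op(Dict(i => Sz, j => Sz)) for j in 2:N for i in 1:j-1)

    @assert H_mpo ≈ H_direct   # the 3-state automaton reproduces every (i,j) term

So the bond dimension needed for this entire non-local sum is just 3 (plus whatever the local terms need), versus the very large dimensions the uncompressed AutoMPO intermediates go through.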

So I’m pretty convinced you could make a nice, custom MPO for your case if you research it a bit. You could also still use AutoMPO to make an MPO for the rest of your local terms, and use ITensor’s feature which lets you pass multiple MPOs to the dmrg function and which “lazily” treats them as summed together in an efficient way (without actually ever summing the MPOs together).
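Schematically, that combination might look like the following. (Hedged: the exact dmrg and sweeps API varies between ITensors.jl versions, and make_long_range_mpo is a hypothetical placeholder for the hand-crafted MPO you would write yourself.)

    using ITensors

    N = 60
    sites = siteinds("S=1/2", N)

    # Local terms still go through AutoMPO, e.g. the staggered mass term
    ampo = AutoMPO()
    for j in 1:N
        add!(ampo, (-1)^(j-1), "Sz", j)
    end
    H_local = MPO(ampo, sites)

    # Hand-crafted MPO for the long-range Sz-Sz piece (to be written by
    # hand using the automaton picture; hypothetical helper name):
    # H_long = make_long_range_mpo(sites)

    psi0 = randomMPS(sites, 10)
    sweeps = Sweeps(5)
    maxdim!(sweeps, 10, 20, 100, 200)
    cutoff!(sweeps, 1e-10)

    # Passing a vector of MPOs makes dmrg treat them as an implicit sum:
    # energy, psi = dmrg([H_local, H_long], psi0, sweeps)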

Let me know if you have any questions -

Miles
commented by (41.2k points)
For others reading this discussion, Sujay and I had some follow-up discussions about constructing MPOs manually, and I wanted to re-post some of the resources I recommended here too:
Regarding resources for designing MPOs manually, the most detailed discussion of the "automaton" picture of MPS and MPOs is, I believe, in this article: https://arxiv.org/abs/0708.1221

Another nice article that briefly discusses some examples of MPOs and how they work is this one:
https://arxiv.org/abs/0804.2509

Finally, I have some talk slides that go over such constructions very briefly, but the visual aids may help you:
http://tensornetwork.org/reviews_resources.html
(See the "Tensor Networks and Applications" talk slides by me which are linked there.)
Specific slide deck discussing MPOs: https://itensor.org/miles/BrazilLectures/TNAndApplications03.pdf