I’ve got a new build of psi4 running on a CentOS linux cluster that continues to fail for somewhat large jobs where I correlate nearly all the electrons. The only error shows up in the standard error file:
Error in PSIO_WT_TOCLEN()!
nothing posts to the output file. The file system has just over 5 TB of space and the largest psi4 file (file 103) is only 143 GB. The case is only a triatomic molecule (ClO2) in Cs symmetry with 351 active MOS. I’m attempting to do a BCCD calculation, but it always dies after the integral transformation but before the first CCSD iteration. Here is the last few lines:
Size of irrep 0 of tIjAb amplitudes: 9.311 (MW) / 74.484 (MB)
Size of irrep 1 of tIjAb amplitudes: 4.442 (MW) / 35.533 (MB)
Total: 13.752 (MW) / 110.018 (MB)
Any ideas? This is reproducible for different molecules and on different compute nodes. Input is pretty simple:
ClO2 test BCCD
memory 5 gb
molecule = {
0 2
Cl
O 1 R1
O 2 R2 1 A
R1=2.03230554
R2=1.20810395
A=115.36876732
}
set {
reference rohf
basis aug-cc-pwcvqz
print_MOs true
print 2
scf_type pk
guess sad
freeze_core -2
}
Thanks Holger for taking a look. Yes, the same input works for awcvtz. The same problem also occurs for a frozen-core calculation with av5z. So definitely a size thing.
I’m fighting too many other issues to deal with this one, but it’s worth adding to the issue tracker.
One technical correction: this specific error message happens before Psi tries to write any files. It happens when Psi tries to move to the start of a file (SYSTEM_LSEEK). If you can reproduce this stably, the first thing I’d do is add a “file exists” check to the function.
Thanks Jonathon. I’ve reproduced it with CCSD as well and am checking a closed-shell calculation now. Up until it dies it’s certainly writing to a lot of files (100 GB or more), so I guess the first thing is to figure out which new file is it getting ready to write to when it dies.
I’m testing a bit more, but setting cachelevel to 0 seems to have fixed it. Might be worth a note in the docs that this doesn’t just affect memory issues but can show up as IO errors as well (for some reason)