Is someone willing to run the following calculation on his/hers system on more 
than one node? If it works then I know the problem is not in my input file for 
QE, but rather a memory, parallelization... problem in my script. This would 
help me to narrow down the possible solutions I can try.


 &control
    calculation = 'relax'
    restart_mode = 'from_scratch'
    prefix = 'cti'
    tstress = .false.
    tprnfor = .true.
    pseudo_dir = '../pseudo'
    outdir = './TMP/'
    wf_collect = .true.
    verbosity='high'
    nstep = 100
 /
&SYSTEM
  ibrav = 0
  A =    10.2670500
  nat = 79
  ntyp = 4
  ecutwfc = 60
  ecutrho = 600
  spline_ps = .true.
/
 &electrons
    electron_maxstep = 400
    diagonalization = 'david'
    conv_thr =  1.0d-8
    diago_thr_init = 1e-4
    mixing_mode = 'local-TF'
    mixing_beta = 0.10
/
&ions
ion_dynamics = 'bfgs'
ion_positions = 'default'
/
ATOMIC_SPECIES
   O   15.99900   O.wc-n-kjpaw_psl.1.0.0.UPF
  Ti   47.86700  Ti.wc-spn-kjpaw_psl.1.0.0.UPF

K_POINTS  automatic
3 3 1 1 1 0

ATOMIC_POSITIONS {angstrom}
Ti      0.4186429       0.0000000       1.0555597 0 0 0
Ti      0.4186429       3.7851500       1.0555597 0 0 0
Ti      13.4976069      0.0000000       8.2645045
Ti      13.4976069      3.7851500       8.2645045
Ti      11.9877668      3.7851500       4.6274892
Ti      11.9877668      0.0000000       4.6274892
Ti      6.8542418       1.8925750       4.6274892
Ti      6.8542418       5.6777250       4.6274892
Ti      5.5521679       1.8925750       1.0555597 0 0 0
Ti      5.5521679       5.6777250       1.0555597 0 0 0
Ti      8.3640819       5.6777250       8.2645045
Ti      8.3640819       1.8925750       8.2645045
Ti      8.4678023       1.8925750       1.9351998 0 0 0
Ti      8.4678023       5.6777250       1.9351998 0 0 0
Ti      11.1525460      1.8925750       8.8260622
Ti      11.1525460      5.6777250       8.8260622
Ti      9.7252141       5.6777250       5.4033466
Ti      9.7252141       1.8925750       5.4033466
Ti      4.5917059       0.0000000       5.4033889
Ti      4.5917059       3.7851500       5.4033889
Ti      3.3343024       0.0000000       1.9352631 0 0 0
Ti      3.3343024       3.7851500       1.9352631 0 0 0
Ti      6.0190191       3.7851500       8.8260833
Ti      6.0190191       0.0000000       8.8260833
O       8.8461469       0.0000000       1.7851414 0 0 0
O       8.8461469       3.7851500       1.7851414 0 0 0
O       3.7126116       1.8925750       1.7851414 0 0 0
O       3.7126116       5.6777250       1.7851414 0 0 0
O       11.6648815      0.0000000       9.0367308
O       11.6648815      3.7851500       9.0367308
O       10.1335967      3.7851500       5.3061929
O       10.1335967      0.0000000       5.3061929
O       5.0000903       1.8925750       5.3062141
O       5.0000903       5.6777250       5.3062141
O       6.5313733       5.6777250       9.0367731
O       6.5313733       1.8925750       9.0367731
O       6.4920242       1.8925750       2.6665971
O       6.4920242       5.6777250       2.6665971
O       9.5524047       1.8925750       9.6887500
O       9.5524047       5.6777250       9.6887500
O       7.9643783       5.6777250       6.2261554
O       7.9643783       1.8925750       6.2261554
O       2.8308616       0.0000000       6.2261765
O       2.8308616       3.7851500       6.2261765
O       1.3585262       0.0000000       2.6666393
O       1.3585262       3.7851500       2.6666393
O       4.4188778       3.7851500       9.6887712
O       4.4188778       0.0000000       9.6887712
O       6.4875556       0.0000000       4.7368663
O       6.4875556       3.7851500       4.7368663
O       5.1733648       0.0000000       1.2058080 0 0 0
O       5.1733648       3.7851500       1.2058080 0 0 0
O       10.3068917      1.8925750       1.2057869 0 0 0
O       10.3068917      5.6777250       1.2057869 0 0 0
O       7.9865405       3.7851500       8.2942713
O       7.9865405       0.0000000       8.2942713
O       13.1200655      1.8925750       8.2942713
O       13.1200655      5.6777250       8.2942713
O       11.6210871      5.6777250       4.7369085
O       11.6210871      1.8925750       4.7369085
O       8.6769466       1.8925750       3.8566351
O       8.6769466       5.6777250       3.8566351
O       7.3912657       1.8925750       0.3261679 0 0 0
O       7.3912657       5.6777250       0.3261679 0 0 0
O       10.2145938      5.6777250       7.3147541
O       10.2145938      1.8925750       7.3147541
O       3.5434486       0.0000000       3.8566773
O       3.5434486       3.7851500       3.8566773
O       2.2577304       0.0000000       0.3261679 0 0 0
O       2.2577304       3.7851500       0.3261679 0 0 0
O       5.0810874       3.7851500       7.3147752
O       5.0810874       0.0000000       7.3147752
C       11.8672904      1.1988342       12.0377009
H       10.8972048      1.5651949       12.4327181
H       12.6146411      1.4156404       12.8602406
H       11.7754366      0.0930383       12.0212237
C       12.1489342      1.7490971       10.6690556
H       12.2444435      2.8549331       10.7299854
H       13.1383125      1.3828863       10.3158727
CELL_PARAMETERS {alat}
  1.0000000   0.0000000   0.0000000
  0.0000000   0.7373394   0.0000000
  0.8155077   0.0000000   2.0562083



Thanks in advance,

Laurens Siemons


________________________________
Van: [email protected] <[email protected]> namens Paolo 
Giannozzi <[email protected]>
Verzonden: dinsdag 27 februari 2018 11:28
Aan: PWSCF Forum
Onderwerp: Re: [Pw_forum] Problems computing cholensky

On Mon, Feb 26, 2018 at 10:26 AM, Laurens Siemons 
<[email protected]<mailto:[email protected]>> wrote:

 Does anybody have a suggestion why it does work on 1 node with 20 cores, but 
fails when I try to increase my nodes?

The final results do not depend upon the number of processors (modulo minor 
numerical differences, within the convergence threshold) but intermediate 
results may depend, due to a different starting point and to small numerical 
differences. This may unfortunately lead some calculations that are "on the 
brink" to fail. I have no evidence that this is related to a bug or to any 
other easily solvable problem.

The "Cholesky" problem signals that the overlap matrix is not positive definite 
(has a zero or negative eigenvalue). When it happens, it is invariably due 
either to a badly wrong structure (not the case here) or to USPP/PAW with small 
negative values of augmentation charges. It is a known problem and there is no 
simple solution available.

Paolo


Thanks in advance,

Laurens Siemons


________________________________
Van: [email protected]<mailto:[email protected]> 
<[email protected]<mailto:[email protected]>> namens Paolo 
Giannozzi <[email protected]<mailto:[email protected]>>
Verzonden: zondag 25 februari 2018 8:33
Aan: PWSCF Forum
Onderwerp: Re: [Pw_forum] Problems computing cholensky

You can use cell_parameters together with A or celldm(1).

On Sat, Feb 24, 2018 at 8:53 PM, Manu Hegde 
<[email protected]<mailto:[email protected]>> wrote:
Hi,
I do not know much about your system but looking quickly into the crystal 
structure there is something that might be causing problem. Looks like you have 
set ibrav=0, in that case you have to use card cell_parameters. A=xx not 
required. Also double check ypur system with xcrysden before starting the 
calculations.
Manu
(SFU)

On Fri, Feb 23, 2018 at 10:19 AM, <[email protected]<mailto:[email protected]>> 
wrote:
Hello Laurens Siemons,

Although I am not one of the experts, I had the same problem in one of
the scf runs I was doing for a GW calculation. Because of the high
number of bands and ecutwfc that I needed to use and in order to get
any results, I had to run the simulation on 100 cores. The strange
thing for me also was that the first one I tried run, but then nothing
again. After a few weeks of trying I was notified by our cluster
services that I should not use more than 60 cores as the I/O
operations that are done by QE were too high and the disk could not
cope. I gave up GW since then, but if there is a solution to this
problem, I would like to hear it too :S


Eleni


Quoting Laurens Siemons 
<[email protected]<mailto:[email protected]>>:

> Dear all,
>
>
> I'm a master student chemistry and I'm using QE (v. 6.1) for a relax
> calculation of a rutile 101 slab with a vacuum above it.
>
> I'm getting the famous error:
>
>
>
> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
>      Error in routine  cdiaghg (161):
>       problems computing cholesky
>
> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
>
> I've read almost every related topic on the forum that I could find
> and I tried a lot already to overcome this, like:
> - changing values for ecutwfc and ecutrho
> - changing mixing_beta
> - changing functionals
> - Tried to run the calcualtion with other input files (anatase 101, 001...)
> - Changed diagonalization to 'cg' (In this case it calculates some
> itterations but then crashes with the error: 'Error in routine
> c_bands (1): >> too many bands are not converged')
>
> Nothing seems to help and I'm out of options... I even tried to run
> a calculation of my predecessor (that has succeeded in the past) but
> this also failed (he used an older version of QE though...).
>
> I'm postig my input file at the end here and I really hope somebody
> can help me.
>
> Kind Regards,
> Laurens Siemons
>
> &CONTROL
>   calculation = 'relax'
>   restart_mode = "from_scratch",
>   prefix       = "testen",
>   pseudo_dir =
> '/data/antwerpen/204/vsc20442/pseudo/pslibrary.1.0.0/wc/PSEUDOPOTENTIALS'
>   outdir = '/data/antwerpen/204/vsc20442'
>   nstep = 100
> /
> &SYSTEM
>   ibrav = 0
>   A =    4.59631
>   nat = 36
>   ntyp = 2
>   ecutwfc = 60
>   ecutrho = 600
> /
> &ELECTRONS
>   electron_maxstep = 300
>   mixing_beta = 0.10
>   conv_thr =  1.0d-8
>   mixing_mode = 'local-TF'
>   diago_thr_init = 1e-4
> /
> &IONS
>   ion_dynamics = 'bfgs'
>   ion_positions = 'default'
> /
> CELL_PARAMETERS {alat}
>   1.000000000000000   0.000000000000000   0.640859733133753
>   0.000000000000000   2.000000000000000   0.000000000000000
>   0.000000000000000   0.000000000000000   3.845158398802518
> ATOMIC_SPECIES
>    O   15.99900   O.wc-n-kjpaw_psl.1.0.0.UPF
>   Ti   47.86700   Ti.wc-spn-kjpaw_psl.1.0.0.UPF
> ATOMIC_POSITIONS {crystal}
> Ti  -0.000000000000000  -0.000000000000000   0.075000000000000 0 0 0
> Ti  -0.000000000000000  -0.000000000000000   0.408333333333333
> Ti  -0.000000000000000   0.500000000000000   0.241666666666667
> Ti  -0.000000000000000  -0.000000000000000   0.241666666666667
> Ti  -0.000000000000000   0.500000000000000   0.075000000000000 0 0 0
> Ti  -0.000000000000000   0.500000000000000   0.408333333333333
> Ti   0.500000000000000   0.250000000000000   0.075000000000000 0 0 0
> Ti   0.500000000000000   0.250000000000000   0.408333333333333
> Ti   0.500000000000000   0.750000000000000   0.241666666666667
> Ti   0.500000000000000   0.250000000000000   0.241666666666667
> Ti   0.500000000000000   0.750000000000000   0.075000000000000 0 0 0
> Ti   0.500000000000000   0.750000000000000   0.408333333333333
>  O   0.304303000000000   0.152151500000000   0.024282833333333 0 0 0
>  O   0.304303000000000   0.152151500000000   0.357616166666667
>  O   0.304303000000000   0.652151500000000   0.190949500000000
>  O   0.304303000000000   0.152151500000000   0.190949500000000
>  O   0.304303000000000   0.652151500000000   0.024282833333333 0 0 0
>  O   0.304303000000000   0.652151500000000   0.357616166666667
>  O   0.695697000000000   0.347848500000000   0.459050500000000
>  O   0.695697000000000   0.347848500000000   0.292383833333333
>  O   0.695697000000000   0.847848500000000   0.125717166666667 0 0 0
>  O   0.695697000000000   0.347848500000000   0.125717166666667 0 0 0
>  O   0.695697000000000   0.847848500000000   0.459050500000000
>  O   0.695697000000000   0.847848500000000   0.292383833333333
>  O   0.804303000000000   0.097848500000000   0.024282833333333 0 0 0
>  O   0.804303000000000   0.097848500000000   0.357616166666667
>  O   0.804303000000000   0.597848500000000   0.190949500000000
>  O   0.804303000000000   0.097848500000000   0.190949500000000
>  O   0.804303000000000   0.597848500000000   0.024282833333333 0 0 0
>  O   0.804303000000000   0.597848500000000   0.357616166666667
>  O   0.195697000000000   0.402151500000000   0.125717166666667 0 0 0
>  O   0.195697000000000   0.402151500000000   0.459050500000000
>  O   0.195697000000000   0.902151500000000   0.292383833333333
>  O   0.195697000000000   0.402151500000000   0.292383833333333
>  O   0.195697000000000   0.902151500000000   0.125717166666667 0 0 0
>  O   0.195697000000000   0.902151500000000   0.459050500000000
> K_POINTS {automatic}
> 4 4 6 1 1 1



--
Dr. Eleni Chatzikyriakou
Computational Physics lab
Aristotle University of Thessaloniki
[email protected]<mailto:[email protected]> - tel:+30 2310 
998109<tel:%2B30%202310%20998109>

_______________________________________________
Pw_forum mailing list
[email protected]<mailto:[email protected]>
http://pwscf.org/mailman/listinfo/pw_forum


_______________________________________________
Pw_forum mailing list
[email protected]<mailto:[email protected]>
http://pwscf.org/mailman/listinfo/pw_forum



--
Paolo Giannozzi, Dip. Scienze Matematiche Informatiche e Fisiche,
Univ. Udine, via delle Scienze 208, 33100 Udine, Italy
Phone +39-0432-558216<tel:+39%200432%20558216>, fax 
+39-0432-558222<tel:+39%200432%20558222>


_______________________________________________
Pw_forum mailing list
[email protected]<mailto:[email protected]>
http://pwscf.org/mailman/listinfo/pw_forum


--
Paolo Giannozzi, Dip. Scienze Matematiche Informatiche e Fisiche,
Univ. Udine, via delle Scienze 208, 33100 Udine, Italy
Phone +39-0432-558216<tel:+39%200432%20558216>, fax 
+39-0432-558222<tel:+39%200432%20558222>

_______________________________________________
Pw_forum mailing list
[email protected]
http://pwscf.org/mailman/listinfo/pw_forum

Reply via email to