
Comments (5)

YenochQin commented on August 26, 2024

OK, thank you for your advice. ; )

from grasp.

jongrumer commented on August 26, 2024

Hi Yenoch,

A Google search on "mpirun signal 9" leads me to guess that you are simply running out of memory (on node qy-PC, if you are using a multi-node cluster/supercluster). Use top or htop to monitor your run, and check whether data is being written to your swap drive. If it is, you have hit your computational wall, memory-wise. It could be something else, but with the limited information you've provided it is hard for me to guide you any further. A general piece of advice is to include correlation in careful steps, as effectively as possible, rather than adding a bunch of core correlation that does not necessarily improve the physical quantities you are targeting.

Cheers,
Jon
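As a complement to watching top or htop by hand, the swap check suggested above can be sketched in Python. This is a minimal illustration, assuming a Linux node where `/proc/meminfo` is available; the function names `parse_meminfo`, `memory_report`, and `read_node_memory` are hypothetical helpers, not part of GRASP or MPI.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of integer kB values."""
    info = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        if not sep:
            continue  # skip lines that are not "Key: value" pairs
        fields = rest.split()
        if fields and fields[0].isdigit():
            info[key.strip()] = int(fields[0])
    return info


def memory_report(info):
    """Return (available RAM in GiB, swap in use in GiB) from parsed values."""
    kib_per_gib = 1024 ** 2
    available = info.get("MemAvailable", 0) / kib_per_gib
    swap_used = (info.get("SwapTotal", 0) - info.get("SwapFree", 0)) / kib_per_gib
    return available, swap_used


def read_node_memory(path="/proc/meminfo"):
    """Read the current node's memory state (Linux only; assumed path)."""
    with open(path) as handle:
        return memory_report(parse_meminfo(handle.read()))
```

Calling `read_node_memory()` periodically while the job runs gives the same signal as htop: if available RAM drops toward zero and swap in use starts to grow, the run is probably about to be killed.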


YenochQin commented on August 26, 2024

Hi Jon,

Thank you very much for your reply. I will try to enlarge my desktop's swap drive. However, the same problem also occurs on our group's HPC system, which has 98 GB of memory and 16 GB of swap. Is that enough for the calculation?

Have a nice day,
Yenoch


jongrumer commented on August 26, 2024

No, don't enlarge the swap; that won't do anything, and the calculations will crash anyway. Swapping should generally be avoided at all times. The amount of RAM required is not straightforward to predict and depends on the system you're calculating, how dense the interaction matrix is, and so on. As I said, you need to monitor your calculations with e.g. top or htop. A calculation can easily be expanded to require hundreds of GB of RAM, so 98 GB might be too little in your case. I recommend that you study e.g. the MCHF book (Fischer, Brage and Jönsson) on how to design an efficient correlation model, or some other reference. Also consider discussing HPC with someone knowledgeable, like a senior researcher or, maybe even better, a suitable sysadmin.

Cheers!
Jon
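To illustrate how quickly the requirement can grow with the size and density of the interaction matrix, here is a rough back-of-envelope sketch. It assumes a symmetric matrix stored as the nonzero elements of its lower triangle, with one 8-byte value and one 4-byte index per element. That storage layout, the function name `sparse_matrix_gib`, and the numbers used below are illustrative assumptions, not a description of how GRASP actually stores or distributes the matrix.

```python
def sparse_matrix_gib(n_csf, density, bytes_per_value=8, bytes_per_index=4):
    """Rough memory (GiB) for the nonzeros of a symmetric matrix's lower triangle.

    n_csf   -- matrix dimension (number of basis functions), hypothetical input
    density -- assumed fraction of lower-triangle elements that are nonzero
    """
    lower_triangle = n_csf * (n_csf + 1) // 2
    nonzeros = lower_triangle * density
    return nonzeros * (bytes_per_value + bytes_per_index) / 1024 ** 3
```

Under these assumptions, a hypothetical expansion of one million basis functions with 2% nonzero elements already needs more than 100 GiB, which is above the 98 GB mentioned earlier in the thread. The quadratic dependence on the dimension is the main point: doubling the expansion at fixed density roughly quadruples the memory.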


jongrumer commented on August 26, 2024

Oh, and don't forget to also monitor the convergence of the energy levels (and whatever physical properties you are after) after each layer of MCDHF; this is crucial. But you know this, I'm sure :)

Jon
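The layer-by-layer convergence check mentioned above could be tracked with something like the following sketch. It assumes you have already collected one energy per correlation layer for a given level; the function names and the tolerance are hypothetical, and the criterion (change between the last two layers) is just one simple choice.

```python
def layer_changes(energies):
    """Absolute change in energy between consecutive correlation layers."""
    return [abs(later - earlier) for earlier, later in zip(energies, energies[1:])]


def is_converged(energies, tolerance):
    """True if the change between the last two layers is below the tolerance.

    Returns False when fewer than two layers are available.
    """
    changes = layer_changes(energies)
    return bool(changes) and changes[-1] < tolerance
```

If the changes stop shrinking from one layer to the next, adding further layers is likely costing memory without improving the target quantity, which ties back to the advice on designing the correlation model carefully.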

