Comments (9)
Could be machine-specific -- I always get failures on our cluster because the ROMIO components of the MPI system haven't been compiled properly. Could also be a serious issue in Lemon itself, so I'll have a look.
from tmlqcd.
Hmm, ok. This was with OpenMPI on my laptop. If it works for you, the problem is probably on my side.
from tmlqcd.
I'm not even sure I have everything properly installed myself, so I'll have to test a couple of things...
from tmlqcd.
Hey there, today David reported the same bug on another machine. I am going to have a look at this now. If you have any additional information, please let me know.
from tmlqcd.
Sorry, can't tell you anything beyond what it says above. Here's the config.log for the lemon library I was using: https://gist.github.com/2170297
from tmlqcd.
The problem was introduced by the last commit; the previous version works fine. So the quick fix for now is to roll back the last commit, so that the head is in a stable state. Then we can fix the things that were added in that commit and re-apply them once fixed.
An easy test is the lemon_benchmark application that is also installed. It fails as well, so an easy check is something like:
mpirun -np 4 lemon_benchmark 4 1
So, I'll file this issue against Lemon, then ask Albert to roll that commit back. When that is done, the issue here on tmLQCD can be closed, I think. But please verify that this works for you with commit 5306a90a4bc1ed47268e156ea1253e96ee03799a.
from tmlqcd.
Ok, will try to get this done ASAP, but I will have to figure out all the parameters of the build that I did back then because I don't have that particular build directory anymore.
from tmlqcd.
This was a little careless on my side, but I wasn't aware that this version of lemon was being used! It contains some completely unchecked code -- well, I guess it is somewhat tested now :S. At any rate, I have rolled back all changes having to do with prime factorization in etmc/master. The later commits are in a separate branch, unstable, which should reduce the chances of accidents like this.
In the current head revision, there is a known bug related to integer overflow for large file sizes. It's not a trivial bug to fix, because the error is related to a known flaw in the MPI-IO interface. I had been working on a fix -- hence the broken code -- but haven't found the time to wrap it up yet.
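The thread does not say exactly which flaw is meant. One commonly cited limitation of the classic MPI-IO calls (for example MPI_File_write_at) is that the element count is a C int, so a single call cannot describe more than INT_MAX elements. Assuming that is the issue here, the sketch below shows how a large byte count can be checked and split into pieces that each fit into an int. The helper names are hypothetical and this is plain chunking, not the prime-factorization approach that was rolled back and not Lemon's actual code.

```c
#include <limits.h>
#include <stdint.h>

/* Hypothetical helper: returns 1 if nbytes can be passed as an `int count`
 * argument without overflowing, 0 otherwise. */
static int fits_in_mpi_count(uint64_t nbytes)
{
    return nbytes <= (uint64_t)INT_MAX;
}

/* Hypothetical helper: number of I/O calls needed to move nbytes when each
 * call moves at most max_chunk bytes (max_chunk must be in 1..INT_MAX). */
static uint64_t num_chunks(uint64_t nbytes, uint64_t max_chunk)
{
    return (nbytes + max_chunk - 1) / max_chunk;
}

/* Hypothetical helper: size in bytes of chunk i (0-based). The result always
 * fits in an int because it never exceeds max_chunk. */
static int chunk_size(uint64_t nbytes, uint64_t max_chunk, uint64_t i)
{
    uint64_t offset = i * max_chunk;
    uint64_t left = nbytes - offset;
    return (int)(left < max_chunk ? left : max_chunk);
}
```

A caller would loop over i from 0 to num_chunks(...) - 1 and issue one write per chunk at file offset i * max_chunk, keeping the offset in a 64-bit type such as MPI_Offset.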
from tmlqcd.
Today I am doing clean installs of head revisions of Lemon and tmLQCD with both Tiago (in Groningen on the BG/P) and David (on Huygens in Amsterdam). We have run into several interesting issues because of this. Once we have tracked them down further, we will file them as bugs. I think that this one can be closed now, thanks!
from tmlqcd.