Comments (1)
Note that from the eventlog in question:
0.000s: job.submit {"userid":5588,"urgency":16,"flags":0,"version":1}
0.011s: job.validate
0.022s: job.depend
0.022s: job.priority {"priority":16}
0.022s: job.alloc {"annotations":{"sched":{"resource_summary":"rank0/core[0-5],gpu0"}}}
0.022s: job.exception type=alloc-check severity=0 resources already allocated
0.024s: job.start
0.023s: exec.init
0.023s: exec.starting
0.036s: exec.shell.init {"service":"5588-shell-fMM79KV","leader-rank":0,"size":1}
0.037s: exec.shell.start {"taskmap":{"version":1,"map":[[0,1,1,1]]}}
0.038s: exec.shell.task-exit {"localid":0,"rank":0,"state":"Exited","pid":2005996,"wait_status":0,"signaled":0,"exitcode":0}
0.039s: exec.complete {"status":0}
0.039s: exec.done
0.039s: job.finish {"status":0}
The job did finish with status=0
.
If exception handling was modified such that the job was terminated or never started, then I think the assumptions of flux job attach
and the job-list
module would better handle this case.
I'm not sure what would happen if a fatal exception was raised after the finish
event though -- I'm guessing it would also be ignored by flux job attach
(which reports the jobs finish status if the job ever entered RUN, as probably does flux jobs
).
from flux-core.
Related Issues (20)
- flux fails in rc1 when BASH_ENV includes code that is not -e (exit on error) clean HOT 1
- Issues when testing manual installation HOT 19
- Idea: job-info: add streaming RPC to "watch" R HOT 10
- `/etc/flux/rc1` fails with >1 file in `/etc/flux/rc1.d` HOT 2
- Build failure Python 3.12 HOT 1
- checking for cffi.__version_info__ >= (1,1) in python module cffi... no HOT 9
- LBANN mpi-catch-test hangs in MPI_Finalize with ompi 4.1.2 and simple PMI HOT 10
- default begin-time dependency format in `flux jobs` HOT 2
- flux run segfaults if user is not in password file on compute node HOT 2
- Keep a copy of R in job-manager for use in jobtap plugin callbacks HOT 3
- jobtap history plugin throws errors HOT 6
- src/tcmalloc.cc:333] Attempt to free invalid pointer 0x561bf1b5ecd0
- flux-core build fails on IBM coral system running rhel 7.9 based OS
- flux-uri slurm:jobid does not work for slurm batch jbos HOT 9
- broker stuck at exit in `zmq_ctx_term()` HOT 7
- overly verbose cleanup messages after allocation expired HOT 1
- Python: `flux.job.wait` is overloaded HOT 5
- basic job resource usage accounting
- testsuite: gitlab ci cluster specific tests HOT 2
- python: some JobInfo attributes don't port `to_dict()` HOT 9
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from flux-core.