Comments (6)
Hi @svengiegerich! sumx looks in /Users/svengiegerich/runx
for directories that contain both metrics.csv and hparams.json. It sounds like you've confirmed that the metrics.csv files exist. Do you also see the hparams.json files there too?
from runx.
Ah, no hparams.json
is indeed missing. I run in the interactive mode (python -m runx.sumx config_simple -i
) because I don't have access to a farm. So probably this is the issue?
Is it possible to configure .runx in a way that I can use runx non-interactive but also not on a farm? In other words, can I just use sumx with a farm?
Reading #9, I tried to modify .runx but with no success.
# not working
FARM: fake
fake:
SUBMIT_CMD: na
RESOURCES:
dummy: na
from runx.
So firstoff, I'll plan to release better support for the 'no farm' mode, where you shouldn't have to define the FARM.
But as a hack, the .runx
you show above should actually work. I just confirmed this myself.
What sort of failure are you seeing?
from runx.
Running python -m runx.runx config_simple.yml
, I get:
File "/opt/anaconda3/envs/thesis/lib/python3.7/runpy.py", line 193, in _run_module_as_main
"__main__", mod_spec)
File "/opt/anaconda3/envs/thesis/lib/python3.7/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/opt/anaconda3/envs/thesis/lib/python3.7/site-packages/runx/runx.py", line 394, in <module>
main()
File "/opt/anaconda3/envs/thesis/lib/python3.7/site-packages/runx/runx.py", line 387, in main
run_experiment(args.exp_yml)
File "/opt/anaconda3/envs/thesis/lib/python3.7/site-packages/runx/runx.py", line 380, in run_experiment
run_yaml(experiment_copy, runroot)
File "/opt/anaconda3/envs/thesis/lib/python3.7/site-packages/runx/runx.py", line 330, in run_yaml
cmd = build_farm_cmd(cmd, job_name, resource_copy, logdir)
File "/opt/anaconda3/envs/thesis/lib/python3.7/site-packages/runx/farm.py", line 126, in build_farm_cmd
raise f'Unsupported farm: {cfg.FARM}'
TypeError: exceptions must derive from BaseException
And if I rename the farm to FARM: ngc
, I get:
Traceback (most recent call last):
File "/opt/anaconda3/envs/thesis/lib/python3.7/runpy.py", line 193, in _run_module_as_main
"__main__", mod_spec)
File "/opt/anaconda3/envs/thesis/lib/python3.7/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/opt/anaconda3/envs/thesis/lib/python3.7/site-packages/runx/runx.py", line 394, in <module>
main()
File "/opt/anaconda3/envs/thesis/lib/python3.7/site-packages/runx/runx.py", line 387, in main
run_experiment(args.exp_yml)
File "/opt/anaconda3/envs/thesis/lib/python3.7/site-packages/runx/runx.py", line 361, in run_experiment
experiment = read_config(args.farm, args.exp_yml)
File "/opt/anaconda3/envs/thesis/lib/python3.7/site-packages/runx/utils.py", line 122, in read_config
cfg.NGC_LOGROOT = read_config_item(experiment, 'NGC_LOGROOT')
File "/opt/anaconda3/envs/thesis/lib/python3.7/site-packages/runx/utils.py", line 72, in read_config_item
raise f'can\'t find {key} in config'
TypeError: exceptions must derive from BaseException
Thanks for your time & help!
from runx.
Hi Sven, I appreciate your patience!
I've updated the pypi runx to 0.0.11. Can you please pip install it and try it out. Now your .runx
should only need the LOGROOT defined, and all that fake FARM stuff isn't needed anymore. Please let me know if it works.
I've been trying to improve the examples a little for this case. It could certainly be improved :).
from runx.
Hey, thanks for the update!
Going through the examples again, I found my issue: I didn't include the hparams=vars(args)
argument in logx.initialize()
. Now everything works smoothly. As feedback, it would have helped me as a user if this argument was explained in the README; however, I also may have missed it.
Thanks again for this package!
[Just a side note: the "syntax" of metrics.csv
seems to be inconsistent across rows (also for your example). Right now, the first line is short on two cells as there is probably no validation score. At least for me, a "consistent" format, with the first line containing 7 cells, would simplify analyzing this metrics.csv
's]
from runx.
Related Issues (15)
- farm support HOT 1
- How to write a .runx working with a simple GPU. HOT 3
- A bug for log_dir HOT 2
- Recursive log dirs? HOT 3
- input/output error due to flush
- Run on a single machine HOT 3
- Supporting averaging results of multiple random seed?
- [feature] LOG_ROOT support `~`
- logging HOT 4
- QUES: Framework Usage HOT 1
- Problem of SUBMID_CMD: submit_job in .runx HOT 2
- tensorboard add_scalars
- simple example not working HOT 2
- Is it possible to use runx on NGC? HOT 14
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from runx.