Comments (5)
I got stuck at first on the ImportError: No module named google_compute_engine
error since I couldn't figure out how it ever worked :)
Basically, the clobbering of PATH
to have conda first always breaks gsutil
, so the little gsutil stat
trick trying to fetch notebooks from GCS just never actually worked (but also was untested and apparently unused).
But it's in a conditional so that's actually just a red herring. @nehalecky is correct that #103 should resolve this.
from initialization-actions.
Merged #103 - should work out of the box once again.
from initialization-actions.
@dennishuo, thanks for the details on that ImportError
. Is gsutil
the only component of glcoud sdk that doesn't play nice with conda, or are there others? Any more details on that would be appreciated as we're (@jeffkpayne) currently updating a Docker image that has both. 😄
Also, any more info on the use of Docker containers in Dataproc? :)
Thanks again!
from initialization-actions.
@nehalecky To be honest we don't have much coverage of compatibility constraints between cloud SDK and conda, though at least in my experience gsutil tends to be more picky than base gcloud
commands. A cursory check seems to show gcloud still working okay with conda installed and overridden on PATH
. I'll be happy to hear about your findings if you do discover interesting quirks though!
Re: Docker containers, have you played around with the datalab initialization action which uses a Docker container for the dev environment? It could be a worthwhile pattern to apply to other init actions.
I suppose the part that's less clear is whether there's a role for docker containers on worker nodes.
from initialization-actions.
Yep, that gsutil stat...
trick was definitely not well tested and, for other reasons, the goal that it was supposed to provide was never really utilized by us anyway.
from initialization-actions.
Related Issues (20)
- [livy] update livy init action for 2.1 HOT 1
- [rapids] please update to work with latest dask-rapids v22.12 HOT 2
- [gpu] Driver does not install on 2.2 Rocky/Ubuntu images HOT 1
- [zeppelin] not supported on 2.1+ image versions HOT 1
- Error on wget livy binary naming HOT 5
- [spark-rapids] Drop Spark 2.x support in spark-rapids.sh
- [gpu] apt-get update Init script seeing broken repositories HOT 2
- [bigtable] apt-get update Init script seeing broken repositories
- [cloud-sql-proxy] Running the Cloud SQL Proxy as a persistent service
- Update initialization scripts to install latest RAPIDS `23.12` OR `24.02` HOT 2
- [gpu] Add tests for GPU agent HOT 1
- initialization actions which use apt-get update fail due to purged oldoldstable backports repository HOT 10
- rstudio.sh is unable to get the receive keys. Maybe due to invalid repo key. HOT 1
- Dataproc "apt-get update" failed on ubuntu20 HOT 4
- dataproc Initialization actions for trino and hudi
- [gpu] Driver installation breaking in Dataproc 2.1 image during initialization HOT 15
- MLVM and horovod init action fails for 2.1 and 2.2 HOT 1
- [bigtable] bigtable.sh references deprecated Hortonworks Nexus resources HOT 2
- [hue] mysql installer prompts for user input HOT 1
- [gpu] Rocky kernel-devel is sometimes moved to the vault HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from initialization-actions.