Comments (4)
I like your idea. Thanks for opening this issue.
from beam.
filesystem extensions [gcp],[s3],etc are optional dependencies of beam. #31219 will cause excessive warning raised if user not intended to install these dependencies. Can we improve the error message you referred in the description instead?
Unable to get filesystem from specified path, please use the correct path or ensure the required dependency is installed, e.g., pip install apache-beam[gcp]. Path specified: ...
to, e.g.
Unable to get filesystem of scheme "s3://" from specified path
from beam.
Even better, currently it hints 'e.g., pip install apache-beam[gcp]
. If the failed scheme is gs://, we can hint user to do pip install apache-beam[gcp]
; if the failed scheme is s3://, we can hint user to do pip install apache-beam[aws]
; and so on
from beam.
@Abacn Thank you for reviewing my proposal. 👍
#31219 will cause excessive warning raised if user not intended to install these dependencies...
In my PR, there should not be any warning logs for modules ([gcp], [aws], etc.) that the user has not intended to installed. The import statement for not-installed modules should throw ModuleNotFoundError
, so they should be blocked before the section on ImportError
. (Am I right?)
$ python3
Python 3.10.12 (main, Nov 20 2023, 15:14:05) [GCC 11.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> try:
... import this_is_module_not_found
... except ModuleNotFoundError:
... print("ModuleNotFound!")
... except ImportError:
... print("ImportError!")
...
ModuleNotFound!
>>>
However, on the other hand:
Can we improve the error message you referred to in the description instead?
Even better, currently it hints ...
I think these ideas are excellent!
I was considering this issue under the scope of "when modules (gcp/aws/azure) installed by the user as Filesystem fail to initialize due to for some reason (such as OpenSSL)."
I think the enhancement you suggested(where the user has not installed Filesystem) could either be a separate issue or included in this one. (I would be happy to submit a PR! 👀)
from beam.
Related Issues (20)
- [Bug]: Dataflow runner - Flatten() yields no output HOT 3
- [Failing Test]: PostCommit Java Dataflow V2 - repeat intermittant failures
- [Failing Test]: beam_PostCommit_Java_ValidatesRunner_Dataflow_V2 - repeat intermittant failures
- [Failing Test]: PostCommit Python and PostCommit Python Arm perma red HOT 2
- [Failing Test]: apache_beam.io.requestresponse_test::TestCaller::test_default_throttler possibly flaky HOT 1
- [Bug]: Improve handling of 'not found' BigQuery dataset/table errors with appropriate retry policy
- [Bug]: Python expansion with multiple SqlTransforms is extremely slow HOT 7
- [Bug]: Portable translation or SDK harness appears to have a bug in GBK coders
- [Bug]: Portable Spark 3 Streaming runner appears to have GBK bugs, cannot execute Redistribute composite correctly HOT 2
- [Task]: Upgrade to Pandas 2.2.3 or above once available
- [Task]: Support newer versions of Pyarrow in Beam
- Update ExternalJavaProvider.available method to support Windows systems
- The IcebergIO Unit Tests job is permared HOT 2
- [Failing Test]: Vulnerabilities showing up for many versions
- [Feature request]: Support XGBoost 2.x
- [Failing Test]: Onnx inference unit tests are failing. HOT 1
- Java Apache Beam, allow fake external Clients initialized in @Setup method of DoFn with Constructors variables
- Question : What type should I use to read a numeric from bigquery with the go SDK ?
- [Failing Test]: YAML integration tests are flaky as a result of failing to start expansion service HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from beam.