Comments (3)
Hi @dbeatty10, thank you for the prompt response and attention to the problem.
We tried running the model again on the new Dataproc Serverless release (https://cloud.google.com/dataproc-serverless/docs/release-notes), which was issued on the 18th of December. It is now working fine: we can seamlessly add and remove columns from the Python model without manually deleting the table.
As we changed nothing apart from the Dataproc version, we assume that was the cause (which hopefully means it's not a dbt Cloud bug and the issue will not reappear!)
I have also updated the thread in the dbt community, as people there were experiencing exactly the same problem:
https://discourse.getdbt.com/t/how-to-config-to-allow-new-fields-when-running-pyspark-model-in-gcp-dataproc-serverless/9455/2
Hope it will work for them as well!
from dbt-bigquery.
Thanks for reaching out @Lana-hl!
At first blush, this sounds vaguely similar to #1047.
Could you provide the relevant log output with the exact error message you got?
To allow us to reproduce this issue on our end, could you provide the example code for a foo.py? And also any relevant dbt commands to run?
e.g., we're looking for something similar to this:
models/foo.py

    import ...

    def model(dbt, session):
        my_sql_model_df = dbt.ref("my_sql_model")
        final_df = ...
        return final_df

Commands:
dbt build -s foo
Any other steps that would allow us to reproduce what you are seeing.
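
For illustration only, the template above could be filled in along the following lines. This is a minimal sketch, not the reporter's actual model: the column name `extra_col` is hypothetical, and it assumes `dbt.ref` returns a PySpark DataFrame, as it does when the model runs on Dataproc.

```python
# models/foo.py -- illustrative sketch only, not the reporter's actual model


def model(dbt, session):
    # Materialize the result as a table.
    dbt.config(materialized="table")

    # Upstream SQL model, assumed to arrive as a PySpark DataFrame.
    my_sql_model_df = dbt.ref("my_sql_model")

    # Hypothetical schema change: keep every upstream column and add one
    # literal column. Replacing this line with `final_df = my_sql_model_df`
    # and re-running would exercise the "remove a column" case.
    final_df = my_sql_model_df.selectExpr("*", "1 AS extra_col")

    return final_df
```

Running `dbt build -s foo` once with the extra column and once without it is the kind of sequence that would exercise the schema change described in this thread.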
Glad the new Dataproc version is working for you @katerina-kogan!
And thanks for adding your experience to that Discourse post.
I'm going to close this as resolved outside of dbt-bigquery. Just let us know if this pops up again, and we can either open a new issue or re-open this one.