Comments (8)
Hi @parthspatel-mendel, what version of `pg_lakehouse` are you running?
NOTE: @philippemnoel perhaps a more specific GitHub issue template could be helpful to make sure the environment details are included?
I think the Tokio runtime error could be fixed by #1208, but it would be best if the ParadeDB team could confirm. You may also need to restart the Postgres service to avoid further failures after the crash happens.
As for the GCS configuration, the steps below worked for me. Please check the docs as well, since there may be configuration parameters you want to adapt to your own case: https://docs.paradedb.com/analytics/object_stores/gcs
1. Create the Foreign Data Wrapper
CREATE FOREIGN DATA WRAPPER gcs_fdw
HANDLER gcs_fdw_handler
VALIDATOR gcs_fdw_validator;
2. Create the server using the FDW we just created
In the `CREATE SERVER` statement we will include the Google Storage API endpoint (https://storage.googleapis.com):
CREATE SERVER gcs_server FOREIGN DATA WRAPPER gcs_fdw
OPTIONS (endpoint 'https://storage.googleapis.com', default_storage_class 'STANDARD', predefined_acl 'authenticatedRead');
3. Create the user mapping for authentication, using a Service Account key file that is accessible from the container where Postgres is running
CREATE USER MAPPING FOR postgres
SERVER gcs_server
OPTIONS (
credential '/path/to/service_account_key_file.json'
);
4. Create the foreign table
Note that, unlike in other query engines such as DuckDB, for a partitioned Parquet file you should give the folder path (ending in '/') instead of specifying '/*.parquet':
CREATE FOREIGN TABLE foreign_table_name ("column_1" dtype, "column_2" dtype ...)
SERVER gcs_server
OPTIONS (path 'gs://bucket_name/bucket_folder/bucket_subfolder_where_parquet_files_are/', extension 'parquet');
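Since step 3 depends on the key file being readable from wherever Postgres runs, a quick pre-check can help rule that out. This is only a sketch: `check_key_file` is a hypothetical helper name (not part of ParadeDB), and it relies on Google service account key files being JSON that contains `"type": "service_account"`.

```shell
#!/bin/sh
# check_key_file PATH
# Hypothetical helper: succeeds only if PATH is readable and looks like
# a Google service account key (JSON with "type": "service_account").
check_key_file() {
    key_path="$1"
    if [ ! -r "$key_path" ]; then
        echo "not readable: $key_path" >&2
        return 1
    fi
    if ! grep -q '"type"[[:space:]]*:[[:space:]]*"service_account"' "$key_path"; then
        echo "not a service account key: $key_path" >&2
        return 2
    fi
    echo "ok: $key_path"
}
```

Run it in the same environment as Postgres (for example, inside the container), since the file has to be accessible from there, and pass it the same path used in the `credential` option.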
from paradedb.
@Mmoncadaisla thank you for pitching in to help here, that is much appreciated.
@parthspatel-mendel, #1224 addresses the Tokio executor crash that occurs when there's a connection problem with an object store. I found that a bad configuration (or a bad internet connection, etc.) would crash the executor and cause a `there is no reactor running` error.
It's possible that you were seeing this symptom from a configuration problem. We'll release the executor fix shortly, but in the meantime please try out @Mmoncadaisla's steps to see if it is indeed a GCS config issue.
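To separate a plain connectivity problem from a configuration problem, one option is to check that the endpoint is reachable at all. The sketch below assumes `curl` is installed; `endpoint_reachable` is a hypothetical helper name, and the default URL is the endpoint used in the `CREATE SERVER` step above.

```shell
#!/bin/sh
# endpoint_reachable [URL]
# Hypothetical helper: reports whether an HTTP(S) endpoint can be reached.
# Any HTTP response counts as reachable; curl exits non-zero only on
# DNS, TCP, TLS, or timeout failures (no --fail flag is used here).
endpoint_reachable() {
    endpoint="${1:-https://storage.googleapis.com}"
    if curl -sS -o /dev/null --max-time 10 "$endpoint"; then
        echo "reachable: $endpoint"
    else
        echo "unreachable: $endpoint" >&2
        return 1
    fi
}
```

This only tells you that the network path works from wherever you run it. It says nothing about credentials or bucket permissions, so a "reachable" result still leaves the GCS configuration to be checked.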
@neilyio I was using the docker image from the docs
docker run \
--name paradedb \
-e POSTGRESQL_USERNAME=abcd \
-e POSTGRESQL_PASSWORD=abcd \
-e POSTGRESQL_DATABASE=abcd \
-e POSTGRESQL_POSTGRES_PASSWORD=abcd \
-v paradedb_data:/bitnami/postgresql \
-p 5432:5432 \
-v ${HOME}:${HOME} \
-d \
paradedb/paradedb:latest
@parthspatel-mendel could you try again with v0.7.5 that we've released?
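Because the `docker run` command above uses the `latest` tag, it can be worth confirming which extension version is actually installed. A minimal sketch, assuming `psql` is available on the machine you run it from; `lakehouse_version` is a hypothetical helper name, while `pg_extension` is the standard Postgres catalog of installed extensions.

```shell
#!/bin/sh
# lakehouse_version [psql connection options...]
# Hypothetical helper: prints the installed pg_lakehouse version, or
# nothing if the extension is not installed in the target database.
# -A (unaligned) and -t (tuples only) make psql print just the value.
lakehouse_version() {
    psql "$@" -At -c "SELECT extversion FROM pg_extension WHERE extname = 'pg_lakehouse';"
}
```

With the settings from the `docker run` command above, the call would look like `lakehouse_version -h localhost -p 5432 -U abcd -d abcd`. `psql` prompts for the password unless `PGPASSWORD` is set.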