Giter VIP home page Giter VIP logo

Comments (8)

Mmoncadaisla avatar Mmoncadaisla commented on July 21, 2024

Hi @parthspatel-mendel, what version of pg_lakehouse are you running?

NOTE: @philippemnoel perhaps a more specific GitHub issue template could be helpful to make sure the environment details are included?

I think the Tokio runtime error could be fixed via #1208 but it's best if the ParadeDB team can confirm, it might be required to restart the postgres service to avoid failures after the crash happens.

OTOH regarding configuration for GCS, this worked for me, please see the docs though as there might be some configuration params that you wish to adapt for your own case https://docs.paradedb.com/analytics/object_stores/gcs:

1. Create the Foreign Data Wrapper

CREATE FOREIGN DATA WRAPPER gcs_fdw
HANDLER gcs_fdw_handler
VALIDATOR gcs_fdw_validator;

2. Create the server using the FDW we just created

In the CREATE SERVER statement we will include the Google Storage API endpoint (https://storage.googleapis.com)

CREATE SERVER gcs_server FOREIGN DATA WRAPPER gcs_fdw
OPTIONS (endpoint 'https://storage.googleapis.com', default_storage_class 'STANDARD', predefined_acl 'authenticatedRead');

3. Create the user mapping for authentication using a Service Account key file that is accessible from the container where postgres is running.

CREATE USER MAPPING FOR postgres
SERVER gcs_server
OPTIONS (
  credential '/path/to/service_account_key_file.json'
);

4. Create the foreign table

Note that unlike in other query engines like duckdb, for a partitioned parquet file you should leave the folder path (ending in '/') instead of specifying '/*.parquet'

CREATE FOREIGN TABLE foreign_table_name ("column_1" dtype, "column_2" dtype ...)
SERVER gcs_server
OPTIONS (path 'gs://bucket_name/bucket_folder/bucket_subfolder_where_parquet_files_are/', extension 'parquet');

from paradedb.

neilyio avatar neilyio commented on July 21, 2024

@Mmoncadaisla thank you for pitching in to help here, that is much appreciated.

@parthspatel-mendel, #1224 addresses the tokio executor crash that occurs when there's a connection problem with an object store. I was finding that bad configuration (or bad internet connection etc) would crash the executor and cause there is no reactor running.

It's possible that you were seeing this symptom from a configuration problem. We'll release the executor fix shortly, but in the meantime please try out @Mmoncadaisla's steps to see if it is indeed a GCS config issue.

from paradedb.

parthspatel-mendel avatar parthspatel-mendel commented on July 21, 2024

@neilyio I was using the docker image from the docs

docker run \
  --name paradedb \
  -e POSTGRESQL_USERNAME=abcd \
  -e POSTGRESQL_PASSWORD=abcd \
  -e POSTGRESQL_DATABASE=abcd \
  -e POSTGRESQL_POSTGRES_PASSWORD=abcd \
  -v paradedb_data:/bitnami/postgresql \
  -p 5432:5432 \
  -v ${HOME}:${HOME} \
  -d \
paradedb/paradedb:latest

from paradedb.

philippemnoel avatar philippemnoel commented on July 21, 2024

@neilyio I was using the docker image from the docs

docker run \
  --name paradedb \
  -e POSTGRESQL_USERNAME=abcd \
  -e POSTGRESQL_PASSWORD=abcd \
  -e POSTGRESQL_DATABASE=abcd \
  -e POSTGRESQL_POSTGRES_PASSWORD=abcd \
  -v paradedb_data:/bitnami/postgresql \
  -p 5432:5432 \
  -v ${HOME}:${HOME} \
  -d \
paradedb/paradedb:latest

@parthspatel-mendel could you try again with v0.7.5 that we've released?

from paradedb.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.