PL/Container - GPDB execution sandboxing for Python and R
License: Other
e.g. plcontainer_populate.out
I have not checked carefully, but I assume they are unused now. We should double-check them and remove them accordingly.
It's a test; please ignore it.
We do not seem to need this code now. After removing it we could also close(listen_fd), although this is trivial.
see commit 94aceed317730953476bec490ce0148b2af3c383 upstream
Test if this issue will be synced to tracker.
Dave Cramer has an initial patch:
https://github.com/greenplum-db/plcontainer/pull/2/files/7c5b15bc188accc7b719f1dd8eeb3ed107621fb7
Since the code has changed a lot since then, more work needs to be done. I'm creating the issue here so that the support can be added later, hopefully soon.
Our code does not seem to align with upstream plpython, although I do not understand that part of the code very well. See plpy_spi.c.
+ } else {
+ /* FIXME: Wrong ? */
+ args[j].data.isnull = 1;
+ }
Now that the postgresql extension mechanism is in gpdb, we need to add support for it in plcontainer, since an extension has advantages over a plain language.
Currently, we send metadata, such as the UDF source code, from QE to client on every call.
It's better to send metadata only once.
i.e. tests under stress/
Maybe move them to tests/
I've seen a lot of code similar to this:
send_int32(conn, call->hasChanged);
hasChanged is defined as int.
It would be better to define such fields as fixed-width types, e.g.
s/int hasChanged/int32 hasChanged/
This could avoid potential bugs.
Currently we send the source code from QE to client for every tuple. It is not necessary.
Currently we use log_min_messages to control the log level on the client side. We'd better use a separate GUC. Since plcontainer is dynamically loaded, setting a custom GUC may not work on segments. Perhaps after gpdb supports CREATE EXTENSION we can revisit this problem.
[gpadmin@jwu-vm ~]$ pwd
/home/gpadmin
[gpadmin@jwu-vm ~]$ plcontainer image-add -f plcontainer/daily/plcontainer-python-images-devel.tar.gz
20171211:12:52:24:020741 plcontainer:jwu-vm:gpadmin-[INFO]:-Checking whether docker is installed on all hosts...
20171211:12:52:24:020741 plcontainer:jwu-vm:gpadmin-[INFO]:-Distributing image file plcontainer/daily/plcontainer-python-images-devel.tar.gz to all hosts...
20171211:12:52:25:020741 plcontainer:jwu-vm:gpadmin-[CRITICAL]:-plcontainer failed. (Reason='ExecutionError: 'Error Executing Command: ' occured. Details: '/bin/scp plcontainer/daily/plcontainer-python-images-devel.tar.gz jwu-vm:/usr/local/greenplum-db/./share/postgresql/plcontainer/plcontainer/daily/plcontainer-python-images-devel.tar.gz' cmd had rc=1 completed=True halted=False
stdout=''
stderr='scp: /usr/local/greenplum-db/./share/postgresql/plcontainer/plcontainer/daily/plcontainer-python-images-devel.tar.gz: No such file or directory
'') exiting...
To align with upstream (greenplum and postgres).
We have seen libcurl hang at times during stress testing. It looks like the remote side does not reply. In any case, we should have a timeout setting in our libcurl code.
Note the options below.
https://curl.haxx.se/libcurl/c/CURLOPT_CONNECTTIMEOUT.html
https://curl.haxx.se/libcurl/c/CURLOPT_TIMEOUT.html
CURLOPT_NOSIGNAL
It should not block at accept() forever until the container is deleted.
Detailed error message.
--- /home/gpadmin/plcontainer_src/tests/expected/faultinject_python.out 2017-12-27 23:26:52.654279211 +0000
+++ /home/gpadmin/plcontainer_src/tests/results/faultinject_python.out 2017-12-27 23:26:52.657279189 +0000
@@ -72,7 +72,7 @@
GP_IGNORE:
GP_IGNORE:-- end_ignore
! ssh psql -d ${PL_TESTDB} -c 'select address from gp_segment_configuration where dbid=2' -t -A
docker ps -a </dev/null | wc -l
-1
+2
We really do not know much about the code coverage of our tests. We should have a framework to measure it.
We have both curl and non-curl docker API code. However, the non-curl code has some bugs and limitations: e.g. it does not handle partial TCP reads well, it has no timeout mechanism, and it does not support chunked encoding (so the inspect API code is actually working around this). We had better remove this code and leave the work to a more mature package, i.e. libcurl.
We require libcurl >= 7.40 to use the curl code. I assume that is because unix domain socket support in libcurl starts at 7.40. We should document this requirement in the README and Makefile. In the long run, we might switch to TCP, so that lower libcurl versions are allowed.
See below.
postgres=# DO LANGUAGE plpythonu $$
# container: plc_python_shared
print 1;
$$;
DO
postgres=#
postgres=# DO LANGUAGE plcontainer $$
# container: plc_python_shared
print 1;
$$;
ERROR: language "plcontainer" does not support inline code execution
Currently clients do not have a way to filter logs by level.
We should allow setting the level. Typical solutions include setting the level via a client argument or via a GUC (environment variable and/or message).
Before SPI execute with a plan, it should double-check the plan pointer, which comes from the client code. A typical solution is to save previously SPI-prepared plans and check against them. See the FIXME.
/* FIXME: Sanity-check is needed!
+ * Maybe hash-store plan pointers for quick search?
+ * Or use array since we need to free all plans when backend quits.
+ * Or both?
+ */
+ plc_plan = (plcPlan *) ((char *) msg->pplan - offsetof(plcPlan, plan));
+ if (plc_plan->nargs != msg->nargs) {
+ elog(ERROR, "argument number wrong for execute with plan: "
+ "Saved number (%d) vs transferred number (%d)",
+ plc_plan->nargs, msg->nargs);
The same check is also needed when freeing a plan.
In theory we have multiple kinds of backends (docker, separate process), so we should try to rename "container" to "backend".
See FIXME in plpy_spi.c
} else {
+ /* FIXME: For illegal type & error branch code above, do mem cleanup.
+ * It seems that receive_from_frontend() has this issue also.
+ */
That is to say, for the illegal-type response from plcontainer_channel_receive() or the error-response case, the client program should do cleanup to avoid memory leaks.
I ran the client program after entering the container, and saw this.
[root@16bb7a989b05 share]# ./pyclient
plcontainer log: pythonclient, gpadmin, postgres, 11275, LOG: Client has started execution at Fri Feb 9 02:11:44 2018
plcontainer log: pythonclient, gpadmin, postgres, 11275, ERROR: Socket timeout - no client connected within 20 seconds
Typically we could use a file to implement this easily.
e.g. Add plcontainer meta information to the log. Remove or replace debug_print(). Maybe add a GUC to control the plcontainer log level.
At least
This applies to both R and Python.
Clients do not have a memory context, so we should be careful about memory leaks in the code. It would be better to have software infrastructure to test for them easily.
e.g. In the code below, (int16)out = .......
static int plc_pyobject_as_int2(PyObject *input, char **output, plcPyType *type UNUSED) {
	int res = 0;
	char *out = (char *) malloc(2);
	*output = out;
	if (PyInt_Check(input))
		*((short *) out) = (short) PyInt_AsLong(input);
I am testing GitBot. Please ignore this issue and corresponding story created in the project's tracker (GPDB Procedural Languages).
Currently the readme points to github.com/greenplum-db/trusted-languages-poc
but should point to github.com/greenplum-db/plcontainer
for consistency's sake.
At least:
To help debugging.
Provide a code coverage rate and report for each specific release. Maybe there is no need to run it for every checkin, though.
Recently there have been a lot of changes in plcontainer, so the document is much out of date. We need to update README.md. At least:
When running
plcontainer install -n plc_r_shared -i /usr/local/greenplum-db-devel/share/postgresql/plcontainer/plcontainer-r-images.tar.gz -c pivotaldata/plcontainer_r_shared:devel;
the host path is set to $GPHOME/bin/pyclient, but it should be $GPHOME/bin/rclient.
Currently we create a shared path for the unix domain socket when creating a container. This path is writable, so in theory client code could write under it, which introduces a security concern. Potential solutions include: setuid to a less-privileged user after the client initialization code runs, so that client code has no permission to write under the path; or set a quota/limit for the path/directory in a feasible and simple way.
We also have log files in the new shared path set by the default configuration, but we should be able to resolve this easily, since sophisticated solutions for container logging already exist.
Currently R SPI execute does not support a limit. Upstream does not seem to have this functionality, but why not provide it, given that it takes very little effort since we already have the support in the shared code (for both Python/R).
Currently we generate it dynamically in the pipeline. This is not friendly for test development in one's own dev environment.
We should remove that code from the pipeline and move the default configuration into the existing file plcontainer_configuration_test.xml.
Compile, and add a badge to README.md if OK.
We might want to check coding style also.
We've seen cases where docker fails to start during "medium-concurrency" testing (200-300 docker create/start). From the users' perspective, we had better retry instead of simply reporting a failure.
We need to add some test cases to make sure PL/Container returns correct results.
We can run PL/Container, PL/R, and PL/Python UDFs at the same time with the same code and compare the results in regression tests.
For insert/delete/update, we currently only support the "returning" versions, although gpdb does not currently seem to support them. We should support the "non-returning" versions in this function.
switch (retval) {
case SPI_OK_SELECT:
case SPI_OK_INSERT_RETURNING:
case SPI_OK_DELETE_RETURNING:
case SPI_OK_UPDATE_RETURNING:
/* some data was returned back */
result = (plcMessage*)create_sql_result();
break;
default:
elog(ERROR, "Cannot handle sql ('%s') with fn_readonly (%d) "
"and limit (%lld). Returns %d", msg->statement,
pinfo->fn_readonly, msg->limit, retval);
break;
}