practicingman / bert_serving Goto Github PK

View Code? Open in Web Editor NEW

141.0 5.0 39.0 4.37 MB

export bert model for serving

License: Apache License 2.0

Python 99.23% Shell 0.77%

google-bert bert-serving tensorflow-serving text-classification

bert_serving's People

Contributors

Stargazers

Watchers

bert_serving's Issues

gRPC client issue

I exported the model using the suggested serving function (unique_ids instead of label_id), as below:
saved_model_cli show --all --dir /$model_saved_dire:

signature_def['serving_default']:
The given SavedModel SignatureDef contains the following input(s):
inputs['input_ids'] tensor_info:
dtype: DT_INT32
shape: (-1, 320)
name: input_ids_1:0
inputs['input_mask'] tensor_info:
dtype: DT_INT32
shape: (-1, 320)
name: input_mask_1:0
inputs['segment_ids'] tensor_info:
dtype: DT_INT32
shape: (-1, 320)
name: segment_ids_1:0
inputs['unique_ids'] tensor_info:
dtype: DT_INT32
shape: (-1)
name: unique_ids_1:0
The given SavedModel SignatureDef contains the following output(s):
outputs['end_logits'] tensor_info:
dtype: DT_FLOAT
shape: (-1, 320)
name: unstack:1
outputs['start_logits'] tensor_info:
dtype: DT_FLOAT
shape: (-1, 320)
name: unstack:0
outputs['unique_ids'] tensor_info:
dtype: DT_INT32
shape: (-1)
name: unique_ids_1:0

RPC server started on 8500 successfully, however, when I call through RPC client, I got error:
url_port='0.0.0.0:8500'
channel = grpc.insecure_channel(url_port)
-- #data is prepared following the standard...
...
request.inputs['input_ids'].CopyFrom(tf.contrib.util.make_tensor_proto(data.input_ids, shape=[ 320], dtype=tf.int32))
request.inputs['input_mask'].CopyFrom(tf.contrib.util.make_tensor_proto(data.input_mask, shape=[ 320], dtype=tf.int32))
request.inputs['segment_ids'].CopyFrom(tf.contrib.util.make_tensor_proto(data.segment_ids, shape=[ 320], dtype=tf.int32))
request.inputs['unique_ids'].CopyFrom(tf.contrib.util.make_tensor_proto(data.unique_id, shape=[ 1], dtype=tf.int32))

predict_response = stub.Predict(request, timeout=10.)

ERROR:
raise _Rendezvous(state, None, None, deadline)
_Rendezvous: <_Rendezvous of RPC that terminated with (StatusCode.INVALID_ARGUMENT, unique_ids_1:0 is both fed and fetched.)>

My question:
is this because name ("unique_ids") conflict between input and output? how do I go around it?

where is the tensor name in the training file(run_classifier.py)?

Hi:
where are the tensor names, such as input_ids_1 , input_mask_1 and so on . the test code is as follows:

 I cannot find anywhere in the file "run_classifier.py" . please tell me about it, thanks!

Sample curl?

Hi, I'm wondering how to consume the model and I was wondering if you have any CURL example to consume BERT from Tensorflow Serve

can not export bert model

i do like your demo, but got this error
Traceback (most recent call last):
File "/opt/tiger/sunflowers/bert_test/bert/run_classifier.py", line 937, in
tf.app.run()
File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/platform/app.py", line 125, in run
_sys.exit(main(argv))
File "/opt/tiger/sunflowers/bert_test/bert/run_classifier.py", line 860, in main
estimator.export_savedmodel(FLAGS.export_dir, serving_input_fn)
File "/usr/local/lib/python2.7/dist-packages/tensorflow_estimator/python/estimator/estimator.py", line 639, in export_savedmodel
mode=model_fn_lib.ModeKeys.PREDICT)
File "/usr/local/lib/python2.7/dist-packages/tensorflow_estimator/python/estimator/estimator.py", line 765, in _export_saved_model_for_mode
strip_default_attrs=strip_default_attrs)
File "/usr/local/lib/python2.7/dist-packages/tensorflow_estimator/python/estimator/estimator.py", line 883, in _export_all_saved_models
mode=model_fn_lib.ModeKeys.PREDICT)
TypeError: _add_meta_graph_for_mode() got multiple values for keyword argument 'mode'

export_outputs error

hello, @bigboNed3 I used the bert_serving to produce savedmodel file, but occur one error below. I don't know how to fix it, do you have any good suggestions?

is there a way to accept the raw string rather than input_ids, segment_id, etc when using tf serving

Check whether your GraphDef-interpreting binary is up to date with your GraphDef-generating binary

Anybody knows how to fix this problem：

Traceback (most recent call last):
File "request_queryclassifier_client.py", line 183, in
tf.app.run()
File "/data/anaconda3/lib/python3.6/site-packages/tensorflow/python/platform/app.py", line 125, in run
_sys.exit(main(argv))
File "request_queryclassifier_client.py", line 167, in main
result = stub.Predict(request, 1000.0) # 10 secs timeout
File "/data/anaconda3/lib/python3.6/site-packages/grpc/_channel.py", line 533, in call
return _end_unary_response_blocking(state, call, False, None)
File "/data/anaconda3/lib/python3.6/site-packages/grpc/_channel.py", line 467, in _end_unary_response_blocking
raise _Rendezvous(state, None, None, deadline)
grpc._channel._Rendezvous: <_Rendezvous of RPC that terminated with:
status = StatusCode.INVALID_ARGUMENT
details = "NodeDef mentions attr 'Truncate' not in Op<name=Cast; signature=x:SrcT -> y:DstT; attr=SrcT:type; attr=DstT:type>; NodeDef: bert/encoder/Cast = CastDstT=DT_FLOAT, SrcT=DT_INT32, Truncate=false, _output_shapes=[[?,1,16]], _device="/job:localhost/replica:0/task:0/device:CPU:0". (Check whether your GraphDef-interpreting binary is up to date with your GraphDef-generating binary.).
[[Node: bert/encoder/Cast = CastDstT=DT_FLOAT, SrcT=DT_INT32, Truncate=false, _output_shapes=[[?,1,16]], _device="/job:localhost/replica:0/task:0/device:CPU:0"]]"
debug_error_string = "{"created":"@1553569490.204528585","description":"Error received from peer","file":"src/core/lib/surface/call.cc","file_line":1017,"grpc_message":"NodeDef mentions attr 'Truncate' not in Op<name=Cast; signature=x:SrcT -> y:DstT; attr=SrcT:type; attr=DstT:type>; NodeDef: bert/encoder/Cast = CastDstT=DT_FLOAT, SrcT=DT_INT32, Truncate=false, _output_shapes=[[?,1,16]], _device="/job:localhost/replica:0/task:0/device:CPU:0". (Check whether your GraphDef-interpreting binary is up to date with your GraphDef-generating binary.).\n\t [[Node: bert/encoder/Cast = CastDstT=DT_FLOAT, SrcT=DT_INT32, Truncate=false, _output_shapes=[[?,1,16]], _device="/job:localhost/replica:0/task:0/device:CPU:0"]]","grpc_status":3}"

ps：the versions of tensorflow are both 1.12.0 in exporting model phase and inference phase.

Does anyone try to export an unfine-tuned BERT model?

Hi, I follow this repo to service bert model as language model. So I try to export the original BERT model as SavedModel but failed. The error messages are as follwoed:

Traceback (most recent call last):
File "export_lm_predictor.py", line 136, in
'./exported_model', serving_input_receiver_fn(max_seq_len, 20))
File "/root/anaconda3/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py", line 734, in export_saved_model
strip_default_attrs=True)
File "/root/anaconda3/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py", line 663, in export_savedmodel
mode=model_fn_lib.ModeKeys.PREDICT)
File "/root/anaconda3/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py", line 789, in _export_saved_model_for_mode
strip_default_attrs=strip_default_attrs)
File "/root/anaconda3/lib/python3.6/site-packages/tensorflow/python/estimator/estimator.py", line 878, in _export_all_saved_models
raise ValueError("Couldn't find trained model at %s." % self._model_dir)
ValueError: Couldn't find trained model at ../bert_models/chinese_L-12_H-768_A-12.

I guess that there is no graph.pbtxt and checkpoint files in the original model dir. Does anyone have any ideas? Thanks!

[edit]
I specify the checkpoint_path parameter in export_saved_model function. By the way, I
create the estimator using tf.estimator.Estimator. Then I got a new error:
ValueError: Couldn't find 'checkpoint' file or checkpoints in given directory ../bert_models/chinese_L-12_H-768_A-12
So we must got checkpoint file in the original BERT model directory?

practicingman / bert_serving Goto Github PK

bert_serving's People

Contributors

Stargazers

Watchers

Forkers

bert_serving's Issues

Recommend Projects

Recommend Topics

Recommend Org