Comments (10)
I am running into a similar issue (Segmentation fault) just trying to run the image/mnist example described in the /README.md file. Installed everything directly (not using docker).
tensorflow version: 1.14.1-dev20190412.
g++ version: (Debian 7.3.0-5) 7.3.0 (Only 4.8 is supported)
Instead I did the example with docker, and that works perfectly fine.
from lingvo.
It's strange that you would be getting SEGV without any output. Try setting the environment variable TF_CPP_MIN_LOG_LEVEL
to zero and rerun. Also, please rerun within GDB and tell us where the program crashed.
from lingvo.
Hi, I know how to use gdb normally, but I do not know how to use in this case. Can you help me with that!
from lingvo.
Just run the program with gdb --args
. Then run
at the prompt. Once it crashes, type where
to display all stack locations of all threads.
from lingvo.
Commands I am running:
bazel build -c dbg //lingvo:trainer
gdb --args bazel-bin/lingvo/trainer --run_locally=gpu --mode=sync --model=lm.one_billion_wds.OneBWdsGPipeTransformer --logdir=/tmp/mnist/log --logtostderr --worker_split_size=4
I am facing the following problem while trying to do run in gdb:
"/home/guest/lingvo/bazel-bin/lingvo/trainer": not in executable format: File format not recognized
(gdb) run
Starting program: --run_locally=gpu --mode=sync --model=lm.one_billion_wds.OneBWdsGPipeTransformer --logdir=/tmp/mnist/log --logtostderr --worker_split_size=4
No executable file specified.
Use the "file" or "exec-file" command.
But when I do a "file /home/guest/lingvo/bazel-bin/lingvo/trainer" it shows: Python script, ASCII text executable
from lingvo.
Oh, right, sorry. It's a bit more complicated.
OK, first, let's try to see the logs by running env TF_CPP_MIN_LOG_LEVEL=1 /home/guest/lingvo/bazel-bin/lingvo/trainer
.
from lingvo.
Hi, after running the above command env TF_CPP_MIN_LOG_LEVEL=1 /home/guest/lingvo/bazel-bin/lingvo/trainer I got the following, I am attaching it in a text file:
output_log.txt
Hi, can we have a common time so that we can solve this problem at one go, its taking a lot of time.
from lingvo.
Hi , I went and did gdb python ./core, and did a where in gdb, so this was the output. I did this maybe something will be related to this:
from lingvo.
Hi any update/solution for the above.
from lingvo.
If docker worked fine then it's probably some package version issue. Unfortunately we're not going to try to support all configurations.
Hi , I went and did gdb python ./core, and did a where in gdb, so this was the output. I did this maybe something will be related to this:
Since there's a reference to /usr/include/c++/7 is it possible you are using g++ version 7 (check g++ --version). If so it might be that lingvo does not work with g++7 like #63 (comment) implies.
from lingvo.
Related Issues (20)
- DeepFusion network structure HOT 1
- DeepFusion Learnable Align Impl Details HOT 1
- How can I get the logits for one whole sequence in the asr task? HOT 3
- when will the deepfusion code be released? HOT 1
- Car models seem to be disabled for now
- Bazel build failure
- Learnable Align Attention Implementation HOT 1
- DeepFusion Readme HOT 4
- DeepFusion reproduce HOT 11
- Cannot run trainer.py with --model=car.waymo_deepfusion.DeepFusionCenterPointPed, undefined symbol: _ZNK10tensorflow8OpKernel11TraceStringERKNS_15OpKernelContextEb HOT 4
- Cannot import py_camera_model_ops from waymo_open_dataset.camera.ops HOT 3
- images
- Question about DeepFusion
- question about RandomVectorQuantizer
- cannot import name "hyperparams_pb2' from lingvo.core' how to deal with
- Switch from prebuilt protoc to build from source
- Raw dependency on "//third_party/py/flax/training:checkpoints"
- unreplicate_metrics=True fails on my training
- Feature request: lingvo.jax.asserts.HasShape HOT 1
- RFC: lingvo.jax exception flag mechanism
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from lingvo.