yuxng / da-rnn Goto Github PK
View Code? Open in Web Editor NEWSemantic Mapping with Data Associated Recurrent Neural Networks
License: MIT License
Semantic Mapping with Data Associated Recurrent Neural Networks
License: MIT License
When trying to run the shapenet experiments I get the following erros:
InvalidArgumentError (see above for traceback): No OpKernel was registered to support Op 'ComputeflowGrad' with these attrs. Registreed devices: [CPU], Registered kernels: device = 'GPU'; T in [DT_FLOAT].
Then an output for the node and then ...:
CUDA Runtime Error: no CUDA-capable device is detected
I am using the same versiosn as you suggested for tensorflow, CUDA and all other dependencies. If I execute small test script like:
import tensorflow as tf
sess = tf.Session(config=tf.ConfigProto(log_device_placement=True))
I can see my 2 Titan 1080 GPUs mapped as devices.
I used any kind of index for $GPU_ID, but the error is still the same.
We really appreciate your help and thank you for your help
I have ran successful your code with set is_kfusion=false. Now I want to ran your kinect_fusion.cpp with set this flag to true, but I got error
Have you encoutered same error as me? Could you give some suggestions?
In order to avoid pangolin error, I have comment all pangolin code in kinect_fusion.cpp
Supplement:
I also got python: free(): invalid next size (fast)
if run test_kinect_fusion.sh on native notebook,, and found that code crash down in initMarchingCubesTables() of create_tensors() by std::cout info
Can you give more methods to test kinect_fusion code (like Video_$1.pango dataset in kinect_fusion/run.sh)?
Another try
I have modified kinect_fusion.cpp/main() image-input-interface and use cv2.imread to replace VideoInput as follows
Then direct run main() function by cmd and error shows cuda_error in initMatchingCubes()
It seems that this error is same with running test_kinect_fusion.py. So all this errors caused by cuda? The CUDA version I installed is cuda-8.0
While building and execute sh make.sh, i got
/usr/bin/ld: cannot find -ltensorflow_framework
collect2: error: ld returned 1 exit status
From tensorflow github issues, i found that tensorflow_framework was removed. tensorflow/tensorflow#1569
Anyone knows how to deal with it? Thanks!
Could you help me about the problem with " No package 'nanoflann' found" ?
Could you support to adding module 'nanoflann' ?
Thank you very much!
Hello,
When trying to compile the kinect_fusion with the suggested version from #2 , I found out that all the data code in camera relies on the namespace "picojson", while the version picojson v.0.5 does not provide this namepsace, it is "pangolin::json" instead, like described in here: picojson.h
This change is in the master-branch under Namespace-Change
So, I have the following questions(s): Was Pangolin v.0.5 used or a newer version that uses the new namespaces?
We checkout the tag v0.5 to get the version 0.5 - however this results in the mentioned error, but when we use the newest versin an unresolved symbol error gets thrown from kinect_fusion.cpp, stating that it cannot find some Pangolin functions - which I assume are just in the 0.5 version.
So can you please tell us the correct version / branch / git commit number ?
Thank you really much for your help
zss@zss:~/DA-RNN$ ./experiments/scripts/rgbd_scene_multi_*.sh $GPU_ID
Dear Dr. Xiang @yuxng ,
Thanks for the work. I am working on reproducing your code on my docker image.
Everything is compiled. I can run the test script without kfusion set to True. However, once set it to True, I got the error as follows:
+ set -e
+ export PYTHONUNBUFFERED=True
+ PYTHONUNBUFFERED=True
+ export CUDA_VISIBLE_DEVICES=0
+ CUDA_VISIBLE_DEVICES=0
++ date +%Y-%m-%d_%H-%M-%S
+ LOG=experiments/logs/rgbd_scene_multi_rgbd_test.txt.2018-06-14_23-27-31
+ exec
++ tee -a experiments/logs/rgbd_scene_multi_rgbd_test.txt.2018-06-14_23-27-31
+ echo Logging output to experiments/logs/rgbd_scene_multi_rgbd_test.txt.2018-06-14_23-27-31
Logging output to experiments/logs/rgbd_scene_multi_rgbd_test.txt.2018-06-14_23-27-31
+ '[' -f /home/weizhang/DA-RNN/output/rgbd_scene/rgbd_scene_val/vgg16_fcn_rgbd_multi_frame_rgbd_scene_iter_40000/segmentations.pkl ']'
+ ./tools/test_net.py --gpu 0 --network vgg16 --model data/fcn_models/rgbd_scene/vgg16_fcn_rgbd_multi_frame_rgbd_scene_iter_40000.ckpt --imdb rgbd_scene_val --cfg experiments/cfgs/rgbd_scene_multi_rgbd.yml --rig data/RGBDScene/camera.json --kfusion 1
Traceback (most recent call last):
File "./tools/test_net.py", line 13, in <module>
from fcn.test import test_net
File "/home/weizhang/DA-RNN/tools/../lib/fcn/test.py", line 25, in <module>
from kinect_fusion import kfusion
ImportError: /home/weizhang/DA-RNN/tools/../lib/kinect_fusion/kfusion.so: undefined symbol: _ZN2df12KinectFusionC1ESs
I am not sure if this is caused by compiling kinect fusion improperly. And comments are very appreciated.
And also @kevinkit , did you successfully run the script with kfusion to 1 in a docker?
Thanks for the help!.
After the succesfull compiliation of KinectFusion (Step 3) we are now stuck on 4)
While there was the first error refering to numpy ("numpy/arrayobject.h not found") which could be fixed with a symbolic link a new error occurs which is a little bit harder to tackle:
when running the command
python setup.py build_ext --inplace
The following error occurs:
g++: error: pose_estimation/build/libransac.so: No such file or directory
when looking at the pose_estimation folder i found the CmakeList and so I did the following steps:
At first I installed NLOPT via : sudo apt-get install libnlopt-dev
And then the same procedure:
mkdir build
cd build
cmake ..
make
Which resolved this issue, however is this the correct way?
I've installed all the dependencies and successfully trained and tested this project (which took me a huge effort). However, I wonder how to show the semantic scene like your illustration:
I set the argument --kfusion=True
when I run test_net.py
, It helps noting, and I see the code is:
if args.kfusion:
gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0.2)
sess = tf.Session(config=tf.ConfigProto(allow_soft_placement=True, gpu_options=gpu_options))
else:
sess = tf.Session(config=tf.ConfigProto(allow_soft_placement=True))
How's it works, and how can I get the semantic scene?
@yuxng said he updated the code for latest version of Sophus and Eigen. i tried it once and didnt work. with git pull origin in $ROOT/DA-RNN and complied successfully. for the error: cannot find -ltensorflow_framework. librensorflow, librensorflow_framework are tensorflow dynamic lib which be installed addionally. check http://platanios.org/tensorflow_scala/installation.html#installation-0-dependencies-1 and http://www.rubydoc.info/github/somaticio/tensorflow.rb. dont forget copy the libs to path: /usr/lib/
first run training code come out errors about python version problems. my default version of python is 3.5 and the codes is python2, so i change my default python version and reinstalled tensorflow. then comes some strange errors:
when i try
./experiments/scripts/shapenet_scene_multi_rgbd.sh 0
error : terminate called after throwing an instance of 'std::bad_alloc' what(): std::bad_alloc
i traced it back, the command line: from networks.factory import get_network, is the source for this error. i dont kown how to fix it.
when i try
sudo ./experiments/scripts/shapenet_scene_multi_rgbd_test.sh 0
error: ImportError: libkfusion.so: cannot open shared object file: No such file or directory. but libkfusion.so exits in /kinect_fusion/ build
when i try
./experiments/scripts/shapenet_scene_multi_rgbd_test.sh 0
error: ImportError: ../lib/kinect_fusion/kfusion.so: undefined symbol: _ZN2df12KinectFusion10save_modelESs . kfusion.so exits also in path /kinect_fusion.
wondering is there anyone running all codes successfully without encountering these errors or how u fix with it ?
kinect_fusion.cpp:425
error: no matching function for call to ‘pangolin::OpenGlRenderState::SetModelViewMatrix(Sophus::SE3Base<Sophus::SE3 >::Transformation)
Thank you for sharing the code,
Is there any perspective or milestone in the near future, that this repository will come with a CMakeList that will allow to run this project on a Windows machine ?
Since Ubuntu 16.04 seems to have some issues regarding to pangolin ( #7 ) and @JackHenry1992 successfully got the kinect_fusion code to compile on Ubuntu 14.04 , I am kindly asking to provide the steps needed to get it run on ubuntu 14.04. (see #9 for previous discussions)
We are all looking forward to use this awesome project, since the release of the paper
When trying to build the code we - as a software team - encountered several issues.
However, we could fix one by simply using an other tag (version of Sophus. We used v.0.9.5 , the error was in rigid.h and was gone after this fix.
Now there are many errors coming from Cuda Kernels in relationship with Eigen - so it would be really nice to know which versions / tags etc. were used for the Dependencies, like:
Tensorflow
CUDA
Linux-Version
Sophus
Eigen
Nanoflann
Pangolin
Thank you really much for your help.
Dear Dr. Xiang,
Could you give me some advise with these error?
/usr/lib/gcc/x86_64-linux-gnu/5/include/emmintrin.h(1294): error: expression must have arithmetic, unscoped enum, or pointer type
g++: error: triplet_loss_op.cu.o: No such file or directory
....
g++: error: lifted_structured_loss_op.cu.o No such file or directory
g++: error: computing_flow_op.cu.o: No such file or directory
g++: error: backprojecting_op.cu.o: No such file or directory
g++: error: projecting_op.cu.o: No such file or directory
g++: error: computing_label_op.cu.o: No such file or directory
Sincerely,
Luan
Dear Dr.Xiang,
When I execute the commond line "sh make.sh", I met some errors as follow. Could you give me some advise with these errors?
Assembler messages:
Fatal error: can't create computing_label_op.cu.o: Permission denied
g++: error: computing_label_op.cu.o: No such file or directory
build computing label layer
(DA-Virtualenv) ml@neo-dell:~/DA-RNN-master/lib$ sudo sh make.sh
/usr/local/lib/python3.5/dist-packages/tensorflow/include
make.sh: 10: make.sh: nvcc: not found
g++: error: triplet_loss_op.cu.o: No such file or directory
build triplet loss
make.sh: 20: make.sh: nvcc: not found
g++: error: lifted_structured_loss_op.cu.o: No such file or directory
build lifted structured loss
make.sh: 30: make.sh: nvcc: not found
g++: error: computing_flow_op.cu.o: No such file or directory
build computing flow layer
make.sh: 40: make.sh: nvcc: not found
g++: error: backprojecting_op.cu.o: No such file or directory
build backprojecting layer
make.sh: 50: make.sh: nvcc: not found
g++: error: projecting_op.cu.o: No such file or directory
build projecting layer
make.sh: 60: make.sh: nvcc: not found
g++: error: computing_label_op.cu.o: No such file or directory
build computing label layer
Sincerely,
Felicx
kinect_fusion.cpp:425
error: no matching function for call to ‘pangolin::OpenGlRenderState::SetModelViewMatrix(Sophus::SE3Base<Sophus::SE3 >::Transformation)
I'm sure I have installed correct version of dependencies ,including Eigen,Sophus,Pangolin and so on.I'm not meeting Errors when "sh make.sh",I also succeed in installing KinectFusion ,that part is OK.But When I download and test the trained model Error occurs.It said "Framebuffer with requested attributes not available. Using available framebuffer. You may see visual artifacts.created window" . The details are as follows:
:~/DA-RNN-master$ ./experiments/scripts/rgbd_scene_multi_rgbd_test.sh 0
vgg16
in training{"down":[0,1,0],"forward":[0,0,1],"height":480,"param_names":["fu","fv","u0","v0","k1","k2","k3"],"params":[570.29999999999995,570.29999999999995,320,240,0,0,0],"right":[1,0,0],"serialno":"34178534347","type":"Poly3","width":640}
"Poly3"
params: 570.29999999999995 570.29999999999995 320 240 0 0 0
pose: [
[
1,
0,
0,
0
],
[
0,
1,
0,
0
],
[
0,
0,
1,
0
]
]
T_dc: 1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
Framebuffer with requested attributes not available. Using available framebuffer. You may see visual artifacts.created window
./experiments/scripts/rgbd_scene_multi_rgbd_test.sh: 行 25: 15989 段错误 (核心已转储) ./tools/test_net.py --gpu 0 --network vgg16 --model data/fcn_models/rgbd_scene/vgg16_fcn_rgbd_multi_frame_rgbd_scene_iter_40000.ckpt --imdb rgbd_scene_val --cfg experiments/cfgs/rgbd_scene_multi_rgbd.yml --rig data/RGBDScene/camera.json --kfusion 1
"段错误(核心转存储)" means “segmentation fault (core dumped)”. I check the code and I have found in fact Error is at the line 309 of test.py :
if is_kfusion:
KF = kfusion.PyKinectFusion(rig_filename)
But the KinectFusion has been installed correctly so I don't know what causes the Error.
Probable Reason may be:
1.I use RTX 2080 ,with Cuda 8.0 ,Cudnn 6.0.The device is not compatible.
2.I'm using a new computer,I have not installed OpenCV and other libraries in my computer yet.
@yuxng any suggestions? I'm vert interested in this superb project and thanks a lot .
Hi, guys.
Firstly, thanks for your awsome work. However, when I replayed the repository in my machine, I met some errors while run make
in kinect_fusion/build
.
Here is my errors:
/home/a/DA-RNN/lib/kinect_fusion/include/df/transform/rigid.h(14): error: qualified name is not allowed
/home/a/DA-RNN/lib/kinect_fusion/include/df/transform/rigid.h(14): error: explicit type is missing ("int" assumed)
/home/a/DA-RNN/lib/kinect_fusion/include/df/transform/rigid.h(14): error: expected a ";"
/home/a/DA-RNN/lib/kinect_fusion/include/df/transform/rigid.h(24): error: identifier "Transform" is undefined
/home/a/DA-RNN/lib/kinect_fusion/include/df/transform/rigid.h(28): error: identifier "Transform" is undefined
/home/a/DA-RNN/lib/kinect_fusion/include/df/transform/rigid.h(32): error: identifier "Transform" is undefined
/home/a/DA-RNN/lib/kinect_fusion/include/df/transform/rigid.h(36): error: identifier "Transform" is undefined
/home/a/DA-RNN/lib/kinect_fusion/include/df/transform/rigid.h(52): error: identifier "Transform" is undefined
/home/a/DA-RNN/lib/kinect_fusion/include/df/util/dualQuaternion.h(72): error: explicit type is missing ("int" assumed)
/home/a/DA-RNN/lib/kinect_fusion/include/df/util/dualQuaternion.h(72): error: qualified name is not allowed
/home/a/DA-RNN/lib/kinect_fusion/include/df/util/dualQuaternion.h(72): error: expected a ")"
/home/a/DA-RNN/lib/kinect_fusion/include/df/util/dualQuaternion.h(70): warning: constant "OtherOptions" is not used in declaring the parameter types of function template "df::DualQuaternion<Scalar, Options>::DualQuaternion(int)"
.....................
/home/a/DA-RNN/lib/kinect_fusion/include/df/transform/nonrigidDeviceModule.h(107): error: a value of type "float (df::NonrigidTransformer<float, df::DualQuaternion>::*)() const" cannot be used to initialize an entity of type "const float"
detected during instantiation of "df::NonrigidTransformer<Scalar, TransformT>::DeviceModule::DeviceModule(int) [with Scalar=float, TransformT=df::DualQuaternion]"
/home/a/DA-RNN/lib/kinect_fusion/./src/transform/nonrigid.cu(1069): here
79 errors detected in the compilation of "/tmp/tmpxft_0000744b_00000000-7_nonrigid.cpp1.ii".
CMake Error at kinectFusion_generated_nonrigid.cu.o.cmake:266 (message):
Error generating file
/home/a/DA-RNN/lib/kinect_fusion/build/CMakeFiles/kinectFusion.dir/src/transform/./kinectFusion_generated_nonrigid.cu.o
CMakeFiles/kinectFusion.dir/build.make:1142: recipe for target 'CMakeFiles/kinectFusion.dir/src/transform/kinectFusion_generated_nonrigid.cu.o' failed
make[2]: *** [CMakeFiles/kinectFusion.dir/src/transform/kinectFusion_generated_nonrigid.cu.o] Error 1
CMakeFiles/Makefile2:67: recipe for target 'CMakeFiles/kinectFusion.dir/all' failed
make[1]: *** [CMakeFiles/kinectFusion.dir/all] Error 2
Makefile:83: recipe for target 'all' failed
make: *** [all] Error 2
I have strictly followed the steps mentioned in the issue 10: Steps for ubuntu 14.04 . And I can make sure that the versions of dependencies are:
Tensorflow 1.2.0
CUDA 8.0
Ubuntu 16.04 LTS
Sophus SHA1 ID: 341346e306d657ac8acaf052939ffd85dacd8f82
Eigen 3.2.92
Nanoflann 1.2.2
Pangolin 0.5
as @yuxng mentioned in the issue 2:Need more details for versions.
Can anyone @yuxng @kevinkit @JackHenry1992 help me please? Thanks for your generous help!
Hi,
Q1. when I train and test RNN with rgbd data
./experiments/scripts/rgbd_scene_multi_*.sh 0
I got the error of "ERROR: ld.so: object '/usr/lib/libtcmalloc.so.4' from LD_PRELOAD cannot be preloaded (cannot open shared object file): ignored."
Q2. Encountered this error of "tensorflow.python.framework.errors_impl.NotFoundError: /home/path/DA-RNN-master/tools/../lib/backprojecting_layer/backprojecting.so: undefined symbol: _ZN10tensorflow16KernelDefBuilderD2Ev
"
How you guys cope with this problem? Thanks
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.