Comments (4)
I tried to fix it. Can you try again? Anyway, this faulthandler support is optional, so you can remove that. But patches to fix it are also welcome.
from returnn.
The TF benchmark uses also our native LSTM op, which it tries to compile on-the-fly.
It (the native LSTM op and other native ops) expects that it finds the TF header files. They are usually part of the PIP package of TF. It gets the path via tf.sysconfig.get_include()
. Maybe you can check if that function returns a valid path for you and if you have the header files in there. Otherwise, it's a bit strange that the Windows PIP package does not contain the headers. Maybe you can report that upstream here.
The next thing is, it expects that there is a compiler. Currently there is g++
/nvcc
hardcoded and also the flags are probably partly specific for those. You will find related code in TFUtil.OpCodeCompiler
. You would need to see how you can call the Windows default compiler and the relevant settings. See the TF documentation how to compile a custom op on Windows.
You don't need the native ops, though, if you don't use them. They are faster, esp the LSTM implementation is faster, but you can also use LSTMBlockFused
for example. In that case it should work (or maybe you will hit another bug as Windows is not really tested by us). For example, you can try:
rnn.py demos/demo-tf-contribrnn-lstm.12ax.config
Or:
rnn.py demos/demo-tf-vanilla-lstm.12ax.config
You can also run the TF benchmark without the native op, e.g.:
demos/demo-tf-benchmark.py --selected "BasicLSTM,StandardLSTM,LSTMBlock,LSTMBlockFused,CudnnLSTM"
from returnn.
@albertz Thank you very much for the fast reply. It solves the issue!
However, when runing the demos/demo-tf-benchmark.py
, there is another issue raised,
fatal error: tensorflow/core/framework/op.h: No such file or directory
Windows environment seems not suitable for the open source projects.
Thank you again for providing such awesome codes.
from returnn.
@albertz Great, everything works smooth now. I also created a PR to fix the print in config file to support python 3.5. 😄 #12
Thanks for your patient help.
from returnn.
Related Issues (20)
- Different effective learning rate reported over gpus HOT 11
- CUDA error: initialization error HOT 3
- MultiProcDataset inside PyTorch DataLoader with num_workers>0, multiple issues HOT 4
- RuntimeError: CUDA error: unspecified launch failure HOT 2
- NonDaemonicSpawnProcess hangs at exit HOT 2
- High memory usage with datasets (specifically when multi procs are used)
- Hang at exit in TDL worker in multiprocessing `_run_finalizers`, deadlock in `_wait_for_tstate_lock`? HOT 6
- Hang HOT 2
- Returnn Native after using different apptainer uses old compilation HOT 6
- MetaDataset with sequence list filter file
- HDFDataset (or generic dataset) post processing HOT 15
- Dataset batching like ESPnet support
- torch.nn.functional.conv2d: RuntimeError: GET was unable to find an engine to execute this computation HOT 1
- TensorFlow 2.14 degradation in WER HOT 2
- Updates for recent TensorFlow version
- Hang in dataset iterator HOT 5
- Log GPU device for torch backend HOT 2
- torch.onnx.export requires input_names and output_names to be in order HOT 12
- RF weight dropout HOT 6
- Support for larger scale datasets HOT 33
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from returnn.