Comments (8)
Reinstalling everything solved the issue. Thanks for the support.
from olive.
Would you please double-check the available EPs? From the log, it seems the CUDA EP is not available.
from olive.
Hello, how can I check that? For reference, this is the output of nvidia-smi:
Mon Sep 18 05:50:27 2023
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.86.05              Driver Version: 535.86.05    CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|=========================================+======================+======================|
|   0  Tesla V100S-PCIE-32GB          Off | 00000000:00:06.0 Off |                    0 |
| N/A   31C    P0              26W / 250W |      0MiB / 32768MiB |      0%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+
+---------------------------------------------------------------------------------------+
| Processes:                                                                            |
|  GPU   GI   CI        PID   Type   Process name                            GPU Memory |
|        ID   ID                                                             Usage      |
|=======================================================================================|
|  No running processes found                                                           |
from olive.
Just run onnxruntime.get_available_providers(); it returns the available providers.
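As a rough sketch of that check, the snippet below wraps the call so it also reports when onnxruntime cannot be imported at all. The helper name list_providers is made up for illustration. Note that this list reflects what the installed build reports; a provider that appears here may still fail to load when a session is created if its runtime libraries are missing.

```python
def list_providers():
    """Return the providers reported by onnxruntime, or None if it is not importable."""
    try:
        import onnxruntime
    except ImportError:
        return None
    return onnxruntime.get_available_providers()


providers = list_providers()
if providers is None:
    print("onnxruntime is not installed in this environment")
else:
    # The CUDA EP is reported under the name 'CUDAExecutionProvider'.
    print("CUDA EP listed:", "CUDAExecutionProvider" in providers)
    print(providers)
```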
from olive.
how strange....
onnxruntime.get_available_providers()
['TensorrtExecutionProvider', 'CUDAExecutionProvider', 'AzureExecutionProvider', 'CPUExecutionProvider']
from olive.
And the result of pip freeze | grep onnx && pip freeze | grep ort is:
onnx==1.14.0
onnxruntime-extensions==0.8.0
importlib-resources==6.0.1
ort-nightly-gpu==1.16.0.dev20230903001
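The same listing can be produced from inside Python, which helps confirm that the interpreter running the model sees the same packages as the shell. This is only a sketch using the standard library; find_packages is a hypothetical helper name, and the substring match mirrors the grep above (so unrelated names containing "ort" will also appear).

```python
from importlib import metadata


def find_packages(*needles):
    """Return sorted 'name==version' strings for installed packages whose name contains any needle."""
    found = set()
    for dist in metadata.distributions():
        name = dist.metadata["Name"]
        # Skip distributions with missing metadata.
        if name and any(needle in name.lower() for needle in needles):
            found.add(f"{name}=={dist.version}")
    return sorted(found)


for line in find_packages("onnx", "ort"):
    print(line)
```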
from olive.
@jsoto-gladia, would you please try the CUDA EP with a local ONNX model, like this:
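import onnxruntime

model = "model.onnx"  # placeholder: path to your local ONNX model
# Request the CUDA EP first, with the CPU EP as fallback.
session = onnxruntime.InferenceSession(
    model, providers=['CUDAExecutionProvider', 'CPUExecutionProvider']
)
print(session.get_providers())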
From the current log, it seems the output contains only the CPU EP. If that is the case, some of the CUDA EP settings are not configured correctly.
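One way to make that comparison explicit is to diff the providers that were requested against the ones the session reports. The sketch below is illustrative only: missing_providers is a hypothetical helper, and the active list is a hard-coded example of a CPU-only result. In practice you would pass session.get_providers() instead.

```python
def missing_providers(requested, active):
    """Return the requested providers that the session did not activate."""
    return [p for p in requested if p not in active]


requested = ["CUDAExecutionProvider", "CPUExecutionProvider"]
# Example value only: what a session reports when it runs on the CPU EP alone.
active = ["CPUExecutionProvider"]

missing = missing_providers(requested, active)
if missing:
    print("Requested but not active:", missing)
```

If CUDAExecutionProvider shows up as missing, the session is running on the CPU even though the CUDA EP was requested.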
from olive.
Following the setup procedure above, the Whisper case worked well for me.
Just make sure the code snippet that Mike provided works with the CUDA EP.
from olive.