Comments (4)
@xiaoyu-work, could you please help take a look?
from olive.
That's weird. Mine has correct output model path:
[
{
"rank": 1,
"model_config": {
"type": "ONNXModel",
"config": {
"model_path": "/home/xyz/github/Olive/examples/bert/cache/models/2_OnnxQuantization-1-1431c563dcfda9c9c3bf26c5d61ef58e/output_model/model.onnx",
"onnx_file_name": null,
But what I can improve here is updating this model_path
to the model path in output zip file.
from olive.
I think that's mean the actual path in the zip file. The model_path will not be valid after the zip file is downloaded in another machine.
from olive.
The PR had been merged. Now the model_path
in models_rank.json is the relative path to zip file. Closed.
from olive.
Related Issues (20)
- Whisper model does not work If you add a flag --enable_timestamps HOT 7
- whisper pipeline corrupting the model, unable to run on DML EP HOT 1
- GenAIModelExporter Component - parameter mismatch HOT 3
- Missing dependency: psutil HOT 2
- [FR]: FlashAttention support for Whisper HOT 1
- pydantic.error_wrappers.ValidationError: 7 validation errors for RunConfig HOT 1
- Olive workflow for mistral model optimization does not work HOT 16
- Exception while running SD XL: Not enough memory resources are available to complete this operation HOT 1
- Failed to run symbolic shape inference when doing LLM Optimization with DirectML HOT 8
- Error on the Generate an ONNX model and optimize step HOT 5
- status.IsOK() was false. Tensor shape cannot contain any negative value HOT 1
- Vitis quantization is broken with ORT 1.18 HOT 2
- Enabling openai/whisper-large-v3 using olive-ai-0.6.0 [onnxruntime-gpu: 1.17.1] on Intel CPU/GPU is not supporting HOT 2
- Llava-7b model Conversion to ONNX and Latency Optimization - OOM error (even after setting paging file size) HOT 2
- safetensor model
- onnx
- huggingface_hub.errors.HFValidationError: Repo id must use alphanumeric chars or '-', '_', '.', '--' and '..' are forbidden, '-' and '.'
- "num_images" doesn't work for the example of directml stable_diffusion_xl.
- Get Error while optimizing SDXL of DirectML example
- [Bug]: Optimization of Unet fails - AMD RDNA3.5 Strix Point Processor HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from olive.