Comments (12)
I believe the error should be fixed with the latest code update. Please ensure that the download model path is set to geochat
.
from geochat.
i just noticed that is is the same bug encountered in #10
from geochat.
i change builder.py line 128 :if 'geochat-7b' in model_name.lower():
and geochat_arch.py lines 33 self.vision_tower = build_vision_tower(config, delay_load=False)
,it doesnt work, error like this
RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasLtMatmulAlgoGetHeuristic( ltHandle, computeDesc.descriptor(), Adesc.descriptor(), Bdesc.descriptor(), Cdesc.descriptor(), Cdesc.descriptor(), preference.descriptor(), 1, &heuristicResult, &returnedResult)
. just like yours, adding 2 lines of code, demo worked.
from geochat.
could you be more specific about"i found a workaround by renaming the model downloaded from HF "llava" (instead of geochat-7B) "?
from geochat.
could you be more specific about"i found a workaround by renaming the model downloaded from HF "llava" (instead of geochat-7B) "?
the authors are using the llava repository, so there are harcoded strings into the code that require your weights to be named "llava-XXX". when following the demo installation steps, you are downloading weights from HuggingFace named "geochat-7B". thus, renaming these weights is a straightforward solution.
from geochat.
could you be more specific about"i found a workaround by renaming the model downloaded from HF "llava" (instead of geochat-7B) "?
the authors are using the llava repository, so there are harcoded strings into the code that require your weights to be named "llava-XXX". when following the demo installation steps, you are downloading weights from HuggingFace named "geochat-7B". thus, renaming these weights is a straightforward solution.
Could you please specify the location of that code?
from geochat.
could you be more specific about"i found a workaround by renaming the model downloaded from HF "llava" (instead of geochat-7B) "?
the authors are using the llava repository, so there are harcoded strings into the code that require your weights to be named "llava-XXX". when following the demo installation steps, you are downloading weights from HuggingFace named "geochat-7B". thus, renaming these weights is a straightforward solution.
Could you please specify the location of that code?
https://github.com/mbzuai-oryx/GeoChat?tab=readme-ov-file#geochat-weights-and-demo
from geochat.
could you be more specific about"i found a workaround by renaming the model downloaded from HF "llava" (instead of geochat-7B) "?
the authors are using the llava repository, so there are harcoded strings into the code that require your weights to be named "llava-XXX". when following the demo installation steps, you are downloading weights from HuggingFace named "geochat-7B". thus, renaming these weights is a straightforward solution.
Could you please specify the location of that code?
https://github.com/mbzuai-oryx/GeoChat?tab=readme-ov-file#geochat-weights-and-demo
i download those models from https://huggingface.co/MBZUAI/geochat-7B/tree/mainand in my "models_geo" directory, which files from the image do I need to rename?
from geochat.
from geochat.
Hi @Murkyy, thank you for looking out.
I have updated the code. It should work now without changing the model name. Let me know if you face any other issues.
from geochat.
![Screenshot 2024-04-22 at 11 31 16](https://private-user-images.githubusercontent.com/107103239/324319486-65e34f5c-69f6-49b2-a80e-5d9dc8b548e9.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjMyNjQxMDIsIm5iZiI6MTcyMzI2MzgwMiwicGF0aCI6Ii8xMDcxMDMyMzkvMzI0MzE5NDg2LTY1ZTM0ZjVjLTY5ZjYtNDliMi1hODBlLTVkOWRjOGI1NDhlOS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwODEwJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDgxMFQwNDIzMjJaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0zZDBiNTdjMmU5NGQxZGZmNGEzZGY1NmIyNmJmYWJhZDE0MjNjOGNkOTkyYzIzMzdjZjJhYmY4NjkwNzUxZTAxJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.mCEu0B4knF22T_QVENZWukV40PEdDlbo2icoMN9q__U)
the output is all ,why?
from geochat.
I still have this error: RuntimeError: Internal: could not parse ModelProto from llava/tokenizer.model after following the workarounds mentioned above. Anyone could kindly advise how to resolve this? thanks!
from geochat.
Related Issues (20)
- How to calculate the metrics [email protected], [email protected], ROUGE and METEOR score in table 7, 8, 9? HOT 6
- get_chunk method in batch_geochat_scene.py seems to be undefined HOT 1
- Minimum memory for the training process
- how to run the lora finetuned model? HOT 5
- metrics about region captioning HOT 1
- training data corrupted HOT 1
- is training necessary ?
- Model for visual grounding
- Calculation of metrics
- Evaluation results about Grounding
- The results of MiniGPT in the paper HOT 2
- when training had an error!
- License for Commercial use
- merge lora
- how to finetune on my custom dataset
- training data corrupt
- Using transformers to use geochat directly
- The error encountered when using ZeRO-2 for training.
- Could you describe the procedure of reproduce the GeoChat?
- Multi images HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from geochat.