Comments (6)
Thank you for your answer. After filtering, my training sample iteration count is 1472, which still does not match the over 1600 in the paper.
May I ask how did you filter the data? I used this script to find out how many samples have [refer] or [grounding] keywords:
for sample in data:
conversations = sample['conversations']
if any('[grounding]' in conv['value'] or '[refer]' in conv['value'] for conv in conversations):
num_grounding += 1
num_grounding
is 70464. If we set the batch size to 144, it only has 490 iterations. @zytx121
from geochat.
Hi @zytx121 , we further finetuned our model on just the grounding part of the dataset for some more steps.
from geochat.
Hi @zytx121 , we further finetuned our model on just the grounding part of the dataset for some more steps.
Hi, thanks for your great work.
Would you mind also open-sourcing the specific grounding dataset that you used in stage2? Thanks in advance.
from geochat.
You can filter the data from the geochat_instruct file using the [refer] and [grounding] keywords.
from geochat.
Thank you for your answer. After filtering, my training sample iteration count is 1472, which still does not match the over 1600 in the paper.
from geochat.
You can filter the data from the geochat_instruct file using the [refer] and [grounding] keywords.
Does stage 2 also have a batch size of 144? @KjAeRsTuIsK
from geochat.
Related Issues (20)
- How to calculate the metrics [email protected], [email protected], ROUGE and METEOR score in table 7, 8, 9? HOT 6
- get_chunk method in batch_geochat_scene.py seems to be undefined HOT 1
- Minimum memory for the training process
- how to run the lora finetuned model? HOT 5
- metrics about region captioning HOT 1
- training data corrupted HOT 1
- is training necessary ?
- Model for visual grounding
- Calculation of metrics
- Evaluation results about Grounding
- The results of MiniGPT in the paper HOT 2
- when training had an error!
- License for Commercial use
- merge lora
- how to finetune on my custom dataset
- training data corrupt
- Using transformers to use geochat directly
- The error encountered when using ZeRO-2 for training.
- Could you describe the procedure of reproduce the GeoChat?
- Multi images HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from geochat.