
Comments (7)

yl-1993 commented on May 13, 2024

@kak-to-tak To be more specific, I suspect this line is most related to the issue. A low threshold may lead to very large proposals, while a high threshold may lead to very small proposals. The former are filtered out by maxsz, and the latter by minsz.

The meaning of each hyper-parameter is as follows:

  • minsz: the minimal size of a proposal.
  • maxsz: the maximal size of a proposal.
  • sv_minsz: the minimal number of super vertices, e.g. sv_minsz=2 means that we combine at least two super vertices. It enlarges the receptive field of the proposals.
  • sv_maxsz: the maximal number of super vertices, e.g. sv_maxsz=8 means that we combine at most eight super vertices. It avoids oversized proposals.
  • th_knn: the threshold for cutting edges with low similarities. It makes the affinity graph sparser, which reduces unnecessary neighborhood propagation and increases computational efficiency.
  • step: the step size in the dynamic-threshold algorithm. When generating basic proposals, the threshold is increased by step if a super vertex is too large. step=0.05 works stably in most cases.
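A rough sketch of the dynamic-threshold idea (hypothetical code, not the repository's implementation; the `edges`/`sims` adjacency lists and function name are assumed for illustration):

```python
def grow_super_vertex(seed, edges, sims, th, maxsz, step=0.05):
    """Hypothetical sketch: grow a super vertex from `seed` by traversing
    edges whose similarity is at least `th`; if the component exceeds
    `maxsz`, raise the threshold by `step` and retry."""
    while th <= 1.0:
        comp, frontier = {seed}, [seed]
        while frontier:
            u = frontier.pop()
            for v, s in zip(edges[u], sims[u]):
                if s >= th and v not in comp:
                    comp.add(v)
                    frontier.append(v)
        if len(comp) <= maxsz:
            return comp, th
        th += step  # super vertex too large: tighten the threshold
    return {seed}, th

# Toy graph: vertex 0 is strongly linked to 1 (0.9), weakly to 2 (0.5).
edges = {0: [1, 2], 1: [0], 2: [0]}
sims = {0: [0.9, 0.5], 1: [0.9], 2: [0.5]}
comp, th = grow_super_vertex(0, edges, sims, th=0.6, maxsz=2)  # comp = {0, 1}
```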

Thus, other possible solutions are (depending on whether proposals are too large or too small):
(1) increase maxsz or decrease minsz;
(2) decrease sv_maxsz or increase sv_minsz.
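As a toy illustration of how minsz and maxsz filter proposals (hypothetical parameter values and function names, not the project's actual config format):

```python
# Hypothetical hyper-parameter settings for proposal generation.
params = dict(
    minsz=3,      # drop proposals smaller than this
    maxsz=300,    # drop proposals larger than this
    sv_minsz=2,   # combine at least 2 super vertices
    sv_maxsz=8,   # combine at most 8 super vertices
    th_knn=0.6,   # prune kNN edges below this similarity
    step=0.05,    # dynamic-threshold increment
)

def filter_proposals(proposals, minsz, maxsz, **_):
    """Keep only proposals whose size lies in [minsz, maxsz]."""
    return [p for p in proposals if minsz <= len(p) <= maxsz]

proposals = [[1], [2, 3, 4], list(range(400))]
kept = filter_proposals(proposals, **params)
# only [2, 3, 4] survives: [1] falls below minsz, the 400-node one above maxsz
```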

from learn-to-cluster.

yl-1993 commented on May 13, 2024

@kak-to-tak Thanks for checking out our project.

Yes. I think the error means that no valid super vertices exist. One possible reason lies in the threshold: if the threshold is too high, each vertex is taken as its own cluster and then filtered out by minsz, which leads to an empty idx2lb. Would you mind trying a lower threshold for super-vertex generation?
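A minimal sketch of this failure mode (`build_idx2lb` is a hypothetical helper mirroring the described behavior, not the repository's code):

```python
def build_idx2lb(clusters, minsz):
    """Hypothetical sketch: map vertex index -> cluster label, skipping
    clusters smaller than minsz."""
    idx2lb = {}
    for lb, cluster in enumerate(clusters):
        if len(cluster) < minsz:
            continue  # filtered out by minsz
        for idx in cluster:
            idx2lb[idx] = lb
    return idx2lb

# With a too-high threshold, every vertex becomes its own cluster,
# so every cluster is filtered and idx2lb ends up empty.
singletons = [[0], [1], [2], [3]]
idx2lb = build_idx2lb(singletons, minsz=2)  # {}
```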


completelyboofyblitzed commented on May 13, 2024

@yl-1993 To what extent is it reasonable to lower the threshold? Do I understand correctly that the default threshold is 0.4? Isn't that already quite low? Could you please dwell a bit on the hyperparameters of your algorithm? Excuse me if it's all in the paper, but I can't seem to find comprehensive information on this. Thank you.


yl-1993 commented on May 13, 2024

@kak-to-tak Thanks for the question. Since applying a threshold to a kNN graph is a widely used technique, we do not emphasize it much in our paper. The threshold 0.4 is for super vertices at iter=1, which usually differs from the threshold for super vertices at iter=0.

The value of the threshold mainly depends on the feature manifold. In practice, it can be determined by constructing a small validation set that contains both positive and negative pairs: compute the similarity scores for all pairs, then find the threshold that maximizes accuracy on the set.
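This validation-set procedure can be sketched as follows (made-up similarity scores and labels; `best_threshold` is a hypothetical helper, not part of the project):

```python
def best_threshold(sims, labels, candidates):
    """Pick the threshold whose predictions (sim >= th -> positive pair)
    maximize pairwise accuracy on the validation set."""
    best_th, best_acc = None, -1.0
    for th in candidates:
        acc = sum((s >= th) == bool(y) for s, y in zip(sims, labels)) / len(sims)
        if acc > best_acc:
            best_th, best_acc = th, acc
    return best_th, best_acc

sims   = [0.9, 0.8, 0.75, 0.3, 0.45, 0.2]   # pair similarity scores
labels = [1,   1,   1,    0,   0,    0]     # 1 = same identity
th, acc = best_threshold(sims, labels, [i / 20 for i in range(21)])
# any th in (0.45, 0.75] separates the pairs perfectly here
```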

Another simple way to set the threshold on the training set is to start with a middle value, e.g., 0.5. It can then be adjusted according to some indicators, e.g., the number of proposals or the average size of the proposals. In this case, you may want to nudge the value in both directions, say to 0.3 and 0.5, to get a quick feeling for what each leads to.
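A toy sweep illustrating how those indicators move with the threshold (hypothetical code using naive connected components over a small similarity matrix, not the project's proposal generator):

```python
def connected_components(sim, th):
    """Group vertices whose pairwise similarity reaches `th` (toy proposals)."""
    n = len(sim)
    seen, comps = set(), []
    for s in range(n):
        if s in seen:
            continue
        seen.add(s)
        comp, stack = [], [s]
        while stack:
            u = stack.pop()
            comp.append(u)
            for v in range(n):
                if v not in seen and sim[u][v] >= th:
                    seen.add(v)
                    stack.append(v)
        comps.append(comp)
    return comps

sim = [
    [1.0, 0.6, 0.2, 0.1],
    [0.6, 1.0, 0.2, 0.1],
    [0.2, 0.2, 1.0, 0.4],
    [0.1, 0.1, 0.4, 1.0],
]
# Lower threshold -> fewer, larger proposals; higher -> many small ones.
for th in (0.3, 0.5, 0.7):
    comps = connected_components(sim, th)
    sizes = [len(c) for c in comps]
    stats = (th, len(comps), sum(sizes) / len(sizes))
```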


completelyboofyblitzed commented on May 13, 2024

@yl-1993 Thank you so much for the quick reply and the details!
I have another question about the number of epochs. After I lowered the threshold to 0.3, it managed to work for two iterations but failed with the same error on the third. What is the approach for choosing the optimal number of epochs?


yl-1993 commented on May 13, 2024

@kak-to-tak The high-level idea is similar to generating super pixels in 2D images. If the initial super pixels are large, the number of iterations is likely to be small. Conversely, if the initial super pixels are small, we may need more iterations. Therefore, the number of iterations is related to the size of the initial super pixels.

Another perspective is from the selection of desired proposals:

  • If the size is too small, the proposal is a conservative formation. In this case, the precision is high but the recall is low. For example, the proposal contains only a leg of a person.
  • If the size is too large, it is probable that no class dominates the proposal. In this case, the precision is low but the recall is high. For example, the proposal contains two nearby people.
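This trade-off can be made concrete with a small helper (hypothetical names; assumes ground-truth labels are known, purely for illustration):

```python
from collections import Counter

def proposal_precision_recall(proposal_labels, cluster_size):
    """Precision: fraction of the proposal belonging to its dominant class.
    Recall: fraction of that class's full cluster captured by the proposal."""
    dominant, count = Counter(proposal_labels).most_common(1)[0]
    precision = count / len(proposal_labels)
    recall = count / cluster_size
    return precision, recall

# Small proposal: pure but partial ("a leg of a person").
p, r = proposal_precision_recall(["a", "a"], cluster_size=10)         # 1.0, 0.2
# Large proposal: mixes two classes ("two nearby people").
p2, r2 = proposal_precision_recall(["a"] * 10 + ["b"] * 8, cluster_size=10)
```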

Overall, it is hard to give a criterion for the optimal number of iterations, but our empirical results show a performance gain when the number of iterations is 2 or 3. Adding more iterations increases the recall of the clustering results but may impair the precision.

One possible future direction is to produce proposals via a learnable network, similar to RPN in object detection.


completelyboofyblitzed commented on May 13, 2024

@yl-1993 Thank you!

