The currently released lightlda doesn't support asymmetric Dirichlet prior optimization.


Support asymmetric Dirichlet prior optimization about lightlda HOT 4 OPEN

microsoft commented on July 28, 2024

Support asymmetric Dirichlet prior optimization

from lightlda.

Comments (4)

hiyijian commented on July 28, 2024

Hi, guys. Thank you for your amazing work on large scale LDA.
That said, I think model quality is as important as scalability, so I am very interested in improving it. It is exciting to learn that an asymmetric Dirichlet prior could help. Would you please share some of your experience with this? I will try my best to contribute.


hiyijian commented on July 28, 2024

Hi, guys,
I have finished a first attempt at adding this feature in PR#22.
This PR supports an asymmetric alpha through the following steps:

1. Add two extra tables to Multiverso. One is the topic frequency table, a matrix that counts each topic's frequency. The other is the doc length table, a row that counts how many documents have length k.
2. Initialize the two extra tables from the randomly initialized documents.
3. Learn the alpha distribution from the two extra tables every 5 iterations.
4. Build an alias table for the learned alpha distribution.
5. Sample topics with the learned alpha distribution and the alias table. Meanwhile, update the counts in the topic frequency table when necessary.
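
To make the learning, alias-table, and sampling steps above more concrete, here is a minimal Python sketch. It is not the code in PR#22. It assumes the two tables are used in the histogram form of Minka's fixed-point update for a Dirichlet-multinomial, which needs exactly these two kinds of counts, and it uses Vose's method for the alias table. The names `update_alpha`, `build_alias`, `sample_alias`, `topic_hist`, and `doc_len_hist` are hypothetical, as is the small positive floor on alpha.

```python
import random


def update_alpha(alpha, topic_hist, doc_len_hist, num_iters=5, floor=1e-6):
    """Histogram form of Minka's fixed-point update (assumed, not taken from PR#22).

    topic_hist[k][n] = number of documents in which topic k occurs exactly n times
    doc_len_hist[n]  = number of documents that contain exactly n tokens
    """
    alpha = list(alpha)
    for _ in range(num_iters):
        alpha_sum = sum(alpha)
        # Denominator: sum over docs of digamma(N_d + alpha_sum) - digamma(alpha_sum),
        # using the identity digamma(n + a) - digamma(a) = sum_{i=0}^{n-1} 1 / (a + i).
        denom, running = 0.0, 0.0
        for n in range(1, len(doc_len_hist)):
            running += 1.0 / (n - 1 + alpha_sum)
            denom += doc_len_hist[n] * running
        for k, hist in enumerate(topic_hist):
            # Numerator: the same identity applied to the per-topic counts.
            numer, running = 0.0, 0.0
            for n in range(1, len(hist)):
                running += 1.0 / (n - 1 + alpha[k])
                numer += hist[n] * running
            # The floor (an assumption) keeps unused topics away from alpha = 0.
            alpha[k] = max(alpha[k] * numer / denom, floor)
    return alpha


def build_alias(weights):
    """Vose's alias method: O(K) construction, O(1) per sample."""
    k = len(weights)
    total = sum(weights)
    scaled = [w * k / total for w in weights]
    prob, alias = [0.0] * k, [0] * k
    small = [i for i, s in enumerate(scaled) if s < 1.0]
    large = [i for i, s in enumerate(scaled) if s >= 1.0]
    while small and large:
        s, l = small.pop(), large.pop()
        prob[s], alias[s] = scaled[s], l
        scaled[l] -= 1.0 - scaled[s]  # l donates mass to fill bucket s
        (small if scaled[l] < 1.0 else large).append(l)
    for i in small + large:  # leftovers are full buckets up to rounding error
        prob[i], alias[i] = 1.0, i
    return prob, alias


def sample_alias(prob, alias, rng=random):
    """Draw one topic index proportionally to the weights given to build_alias."""
    i = rng.randrange(len(prob))
    return i if rng.random() < prob[i] else alias[i]
```

With these helpers, a learned vector would be passed as `build_alias(update_alpha(alpha, topic_hist, doc_len_hist))`, and `sample_alias` then draws from the alpha part of the proposal in constant time.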

To use this feature, just run with the extra option "-num_alpha_iterations".

Please note that two TODOs remain: evaluation in asymmetric prior mode, and inference with an asymmetric prior.
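
As a rough idea of what the evaluation TODO might involve, the document-topic part of the likelihood is a Dirichlet-multinomial term in which a single shared alpha is replaced by a per-topic `alpha[k]`. The sketch below shows only that standard formula. The function name `doc_topic_log_likelihood` is hypothetical, and how lightlda's evaluation code would actually be changed is an assumption.

```python
from math import lgamma


def doc_topic_log_likelihood(doc_topic_counts, alpha):
    """log p(z_d | alpha) for one document under an asymmetric Dirichlet prior.

    doc_topic_counts[k] = number of tokens in the document assigned to topic k
    alpha[k]            = prior weight of topic k (need not be equal across topics)
    """
    alpha_sum = sum(alpha)
    doc_len = sum(doc_topic_counts)
    ll = lgamma(alpha_sum) - lgamma(doc_len + alpha_sum)
    for n_dk, a_k in zip(doc_topic_counts, alpha):
        if n_dk > 0:  # topics with zero count contribute lgamma(a) - lgamma(a) = 0
            ll += lgamma(n_dk + a_k) - lgamma(a_k)
    return ll
```

With a symmetric prior every `alpha[k]` is equal and the expression reduces to the usual symmetric form, so the same routine could cover both modes.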


feiga commented on July 28, 2024

Thanks, Jianyi! I will review the code.


hiyijian commented on July 28, 2024

@feiga, I am sorry, I made a mistake when updating the topic frequency table. I have fixed it and pushed the commit to PR#22.

