
power's People

Contributors

tobiasuhmann

power's Issues

Attention Metrics

Run the current attention-based classifier (that concatenates the sentence mixes) and compare its metrics to the baseline.

Baseline

Recently, OWER was compared to zero-rule baselines. In the past, it was also compared to a reasonable baseline, but that baseline no longer exists in the current code.

Re-create a reasonable baseline and test OWER against it on the current dataset.

Shuffle sentences

Shuffle sentences during training so that each linear layer sees all sentences during training.
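A minimal sketch of the shuffling step, assuming sentence embeddings are batched as (batch, num_sentences, dim); shapes and values are illustrative, not the real OWER dimensions:

```python
import torch

torch.manual_seed(0)

# Toy batch of sentence embeddings: (batch, num_sentences, dim).
sents = torch.arange(2 * 3 * 4, dtype=torch.float).reshape(2, 3, 4)

# Shuffle the sentence order independently per entity, so no linear
# layer always sees the same sentence position during training.
shuffled = torch.stack([s[torch.randperm(s.size(0))] for s in sents])
```

Because only the order changes, the multiset of sentences per entity stays the same; each per-position linear layer just sees a different sentence each epoch.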

Einsum

Use torch’s einsum() to implement the multi-linear layer.
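Assuming the multi-linear layer means one weight vector (and bias) per class, each applied to that class's own sentence mix, `torch.einsum` can batch all per-class linear layers into one call. Shapes here are hypothetical stand-ins:

```python
import torch

# Hypothetical shapes: B = batch size, C = classes, D = embedding dim.
B, C, D = 4, 3, 8

# One sentence mix per class for each entity in the batch.
mixes = torch.randn(B, C, D)

# Multi-linear layer: a separate weight vector and bias per class.
weights = torch.randn(C, D)
bias = torch.randn(C)

# einsum applies each class's linear layer to its own sentence mix:
# logits[b, c] = mixes[b, c] . weights[c] + bias[c]
logits = torch.einsum('bcd,cd->bc', mixes, weights) + bias

# Loop-based reference implementation for comparison.
ref = torch.stack([mixes[:, c] @ weights[c] + bias[c] for c in range(C)], dim=1)
```

The einsum version avoids the Python loop over classes and lets autograd and the GPU handle all classes in a single batched contraction.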

More classes

Create datasets with more classes, train on them, and inspect the results on rare classes. Does this affect the performance on frequent classes?

Refactor original classifier

Refactor the original classifier to take whole texts as input (instead of pre-processed texts) so that it can be used for inference.

Notebook classifier

Enhance the notebook to define a classifier and use it in a train/valid loop, as in the code base. Also, plot the loss curve.

No attention all data

The no-attention model performs well on the minimal test data. Run it on the ower-fb-3 dataset and check the loss curve.

Mine Rules

Create FB/CoDEx datasets for AnyBURL and browse the resulting rules. Verify that the rules can be parsed and converted to Cypher.

Extract experiments

The experiments in the thesis-tools repo bloat the repo and are hard to run because the source root has to be changed every time. Also, they simply do not belong in the repo. Extract the experiments into individual repos in the same GitHub group.

Zero Rule Baseline

The multi-linear classifier performs better than the base classifier and the concat classifier, but it's not clear that it performs better than a zero-rule classifier that simply predicts the most common classes in the training data (i.e., predicts 0 for every class in the OWER dataset).

The base classifier and the concat classifier probably perform slightly better than the zero-rule baseline, but this needs to be verified. In the worst case, the zero-rule baseline would even perform better than the multi-linear classifier.
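A zero-rule baseline of this kind could be sketched as follows (toy multi-label matrix, not the real OWER data):

```python
import torch

# Toy multi-label training matrix: rows = entities, columns = classes.
train_labels = torch.tensor([
    [0, 1, 0],
    [0, 1, 1],
    [0, 0, 0],
    [0, 1, 0],
], dtype=torch.float)

# Zero rule: for each class, always predict its majority value in the
# training data (0 if the class is true for fewer than half the entities).
majority = (train_labels.mean(dim=0) >= 0.5).float()

# Apply the constant prediction to every validation entity.
n_valid = 5
zero_rule_preds = majority.expand(n_valid, -1)
```

Any learned classifier should at least beat the metrics of these constant predictions; if it doesn't, it has effectively learned nothing beyond the class priors.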

Visualize sentence attention

Visualize how well the class embeddings attend to words and sentences. The expected result is that the “married” class embedding, for example, attends strongly to words and sentences related to marriage, such as “married”, “husband”, or “wife”.
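The quantities to visualize could be computed like this, assuming dot-product attention between a class embedding and the word embeddings (random vectors and toy tokens stand in for learned weights):

```python
import torch

torch.manual_seed(0)

class_emb = torch.randn(8)     # stand-in for the "married" class embedding
word_embs = torch.randn(5, 8)  # embeddings of 5 words in a sentence
words = ["she", "married", "her", "husband", "today"]  # toy tokens

# Attention weights: one score per word, normalized with softmax.
scores = word_embs @ class_emb
attn = torch.softmax(scores, dim=0)

# Pair each word with its attention weight, highest first, for inspection.
ranked = sorted(zip(words, attn.tolist()), key=lambda p: -p[1])
```

With learned weights, the top-ranked words for the “married” embedding should be the marriage-related ones; with random weights, as here, the ranking is arbitrary.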

Create positive case

Create a notebook that demonstrates a positive case (assume learned weights) of a simple example (entity with short sentences, minimal vocabulary). Verify that the embeddings are not unlearned.

Implement aggregator

Implement the aggregator that combines the predictions from texter and ruler. Check that the predictions for CW valid entities are reasonable.

Streamlit App

Create a Streamlit app that allows

  • browsing a Power dataset
  • making predictions (explaining the model's decisions by listing the applied rules and the sentence prioritization)

New OWER dataset

Build OWER datasets with more sentences. In fact, include all Ryn sentences in the OWER dataset and limit the number of sentences just before training. The same could be done for classes.

Also, include the classes, debug information such as entity names, and any other information from the Ryn dataset that is required later.

Show examples during training

The loss curve indicates that training works, but it is not illustrative. Print some concrete examples in the training and validation loops to see how well training actually performs.

Visualize attentions

Visualize the class-word attentions to see whether the class embeddings are learned as intended. Ideally, show the attentions at the end of each epoch in Tensorboard, something like this:


Build Ruler

Build the ruler that

  • reads the rules created by AnyBURL
  • sorts them by confidence and filters out low-confidence rules
  • predicts facts and stores them for the respective tail entity together with the rule and confidence
  • stores the result

Also, delete unused dev projects along the way.
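Assuming the rules arrive as (rule string, confidence) pairs, the sort-and-filter step could look like this; the rules and the threshold below are toy examples, not real AnyBURL output:

```python
MIN_CONFIDENCE = 0.5  # hypothetical confidence threshold

rules = [
    # (rule string, confidence) -- toy examples, not real AnyBURL rules
    ("married(X, Y) <= spouse(Y, X)", 0.92),
    ("citizen(X, usa) <= born_in(X, usa)", 0.71),
    ("actor(X) <= award(X, oscar)", 0.31),
]

# Sort by confidence (descending) and drop low-confidence rules.
kept = sorted(
    (r for r in rules if r[1] >= MIN_CONFIDENCE),
    key=lambda r: r[1],
    reverse=True,
)
```

Keeping the confidence alongside each surviving rule makes it cheap to attach rule and confidence to every predicted fact later.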

Create new OWER dataset

It has turned out that the validation split of the current ower-fb-3 dataset is not useful, as all of the ground-truth classes are false for the first 300 entities.

The script for building the OWER dataset needs to be examined and corrected. As a rough estimate, the most frequent classes should be true for at least every 100th entity.

Set up graph DBs

Set up Neo4j graphs for Freebase and CoDEx and run some Cypher queries for AnyBURL rules.

Tensorboard

Log train/valid loss and visualize it, e.g. with Tensorboard.
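A minimal sketch of the logging, using PyTorch's bundled `SummaryWriter`; the loss values per epoch are made up:

```python
import tempfile

from torch.utils.tensorboard import SummaryWriter

logdir = tempfile.mkdtemp()
writer = SummaryWriter(log_dir=logdir)

for epoch in range(3):
    train_loss = 1.0 / (epoch + 1)  # stand-in for the real train loss
    valid_loss = 1.2 / (epoch + 1)  # stand-in for the real valid loss
    writer.add_scalar("loss/train", train_loss, epoch)
    writer.add_scalar("loss/valid", valid_loss, epoch)

writer.close()
```

Running `tensorboard --logdir <logdir>` then shows both curves; using the shared `loss/` prefix groups them in one section of the dashboard.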

No concat

The current model concatenates the sentence mixes before feeding them into the linear layer. This is wrong as it does not assign a class embedding to a single output class.

Instead, each sentence mix should be passed through the linear layer individually.

Furthermore, it would be interesting to compare the one-linear-layer-for-all approach to a one-linear-layer-per-class approach.

Overfit

Overfit a single sample on a randomly initialized classifier.
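The standard sanity check can be sketched as below; the tiny linear model and the shapes are placeholders for the real classifier, and the loss on the single sample should drop to near zero:

```python
import torch
from torch import nn

torch.manual_seed(0)

# One input and one multi-label target; shapes are illustrative only.
x = torch.randn(1, 16)
y = torch.tensor([[1.0, 0.0, 1.0]])

model = nn.Linear(16, 3)  # stand-in for the real classifier
opt = torch.optim.Adam(model.parameters(), lr=0.1)
loss_fn = nn.BCEWithLogitsLoss()

# Repeatedly fit the same sample; the loss should approach zero.
for _ in range(200):
    opt.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    opt.step()

final_loss = loss.item()
```

If the loss does not collapse on a single sample, the bug is in the model or the training loop, not in the data.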

Min/Max Pooling

Contrary to expectations, the class embeddings do not fit the embeddings of semantically similar words very well. The reason might be that the words' discriminative features get lost among the many other words in the sentence.

Try min/max pooling for getting the sentence embedding and compare the results with the usual mean pooling.
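The difference between the two pooling variants can be seen on a toy sentence, where one word carries a strong feature that mean pooling dilutes:

```python
import torch

# Toy word embeddings for one sentence: (num_words, dim).
words = torch.tensor([
    [0.1, 2.0],
    [0.2, -1.0],
    [3.0, 0.0],
])

# Mean pooling averages all words, diluting rare discriminative features.
mean_emb = words.mean(dim=0)

# Max pooling keeps, per dimension, the strongest activation of any word,
# so a single discriminative word can dominate the sentence embedding.
max_emb = words.max(dim=0).values
```

Min pooling works the same way with `words.min(dim=0).values`, capturing strongly negative activations instead.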

No attention

For some reason, the valid loss does not decrease significantly on the attention model. Compare it with a baseline that does not include the attention mechanism.

Metrics

Measure and plot precision, recall, and F1.
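For the multi-label setting, micro-averaged metrics can be computed directly from the prediction and label matrices; the matrices below are toy values:

```python
import torch

# Hypothetical multi-label predictions and ground truth (1 = class applies).
preds  = torch.tensor([[1, 0, 1],
                       [0, 1, 0]])
labels = torch.tensor([[1, 1, 1],
                       [0, 1, 1]])

# Micro-averaging: count true/false positives/negatives over all cells.
tp = ((preds == 1) & (labels == 1)).sum().item()
fp = ((preds == 1) & (labels == 0)).sum().item()
fn = ((preds == 0) & (labels == 1)).sum().item()

precision = tp / (tp + fp) if tp + fp else 0.0
recall = tp / (tp + fn) if tp + fn else 0.0
f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
```

Macro-averaging (computing the metrics per class and then averaging) is the variant to use when rare classes should weigh as much as frequent ones.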

Refactor multi-linear classifier

Remove the two redundant outputs of the linear layers, making the one-hot-encoding obsolete.

Also, update the code base to the multi-linear version.

Pre-trained word embeddings

Use pre-trained word embeddings. Otherwise, the sentence embeddings might not capture the meaning of the sentences.
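Loading pre-trained vectors into the embedding layer could look like this; the 4x3 matrix is a stand-in for real vectors such as GloVe:

```python
import torch
from torch import nn

# Stand-in for a pre-trained embedding matrix (rows = vocabulary entries).
pretrained = torch.tensor([
    [0.0, 0.0, 0.0],   # <pad>
    [0.1, 0.3, -0.2],  # "married"
    [0.2, 0.4, -0.1],  # "husband"
    [1.5, -0.9, 0.7],  # "tensor"
])

# freeze=False lets the vectors be fine-tuned during training.
embedding = nn.Embedding.from_pretrained(pretrained, freeze=False, padding_idx=0)

sentence = torch.tensor([1, 2, 0])  # token ids, padded with id 0
vectors = embedding(sentence)
```

With `freeze=True` the pre-trained vectors stay fixed, which can help when the training set is too small to learn good embeddings from scratch.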
