
Attentive-Neural-Process

Description

  • A PyTorch implementation of the Attentive Neural Process (ANP).
  • Simple code for generating samples with an ANP.
  • Super-resolution experiments will be added soon.

Requirements

  • Install Python 3
  • Install PyTorch 0.4.0

File description

  • preprocess.py contains all preprocessing code used when loading data.
  • module.py contains the building-block modules, including attention and linear layers.
  • network.py defines the overall network architecture.
  • train.py trains the ANP model.
  • generate.ipynb generates samples from a trained model.

Results

Test samples after 50 epochs of training with random context selection (a minimal sketch of the context-selection step follows the list below).

  • original
  • 10 contexts
  • 50 contexts
  • 100 contexts
  • half contexts
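The context points are drawn at random for each batch. A minimal sketch of what such random context selection might look like (the function name and tensor shapes are hypothetical, not taken from the repository):

```python
import torch

def random_context(x, y, num_context):
    """Pick a random subset of points as the context set.

    x: (batch, num_points, x_dim) inputs, y: (batch, num_points, y_dim) outputs.
    Returns context inputs/outputs of shape (batch, num_context, ...).
    """
    idx = torch.randperm(x.size(1))[:num_context]  # same subset for the whole batch
    return x[:, idx], y[:, idx]
```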

Reference

  • Hyunjik Kim et al., "Attentive Neural Processes", ICLR 2019 (arXiv:1901.05761).

Comments

  • Any comments on the code are always welcome.


Issues

Some clarifications about attention used

Thank you for sharing the code.
According to the paper (Appendix A, 2nd paragraph), dropout is not used for attention.

In line 205, the residual and the attention result are concatenated, but I think they should be added elementwise and then passed through a layer norm (Figure 8 of the ANP paper). I wonder if there is a reason for this modification.

Thanks,
Deep Pandey
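For reference, a minimal sketch of the formulation the paper describes (elementwise addition followed by layer normalization), written with the modern torch.nn API rather than the repository's own modules; the class name and hyperparameters are illustrative:

```python
import torch.nn as nn

class PostAttention(nn.Module):
    """Residual as in Figure 8 of the ANP paper: add elementwise, then layer-norm."""
    def __init__(self, dim, num_heads=8):
        super().__init__()
        # nn.MultiheadAttention requires a recent PyTorch (batch_first needs >= 1.9)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, query, key, value):
        result, _ = self.attn(query, key, value)
        # Elementwise residual addition keeps the feature dimension fixed,
        # unlike torch.cat([residual, result], dim=-1), which doubles it.
        return self.norm(query + result)
```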

Mean or sum reduction

In network.py, BCELoss uses the default settings, which (in both PyTorch 1.1 and PyTorch 0.4) apply mean reduction. However, the KL-divergence function (also in network.py) seems to use sum reduction. Intuitively, both should use the same reduction, either sum or mean. Is this a bug, or is it intentional?

Thanks!
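One way to make the two terms consistent (a hypothetical fix using the reduction keyword from more recent PyTorch, not the repository's code) is to sum both over dimensions and average both over the batch:

```python
import torch
import torch.nn.functional as F

def elbo_loss(y_pred, y_true, posterior, prior):
    """Reconstruction + KL with one reduction convention for both terms."""
    batch = y_true.size(0)
    # Sum over all output dims, then average over the batch
    recon = F.binary_cross_entropy(y_pred, y_true, reduction='sum') / batch
    # Same convention for the KL term: sum over latent dims, mean over batch
    kl = torch.distributions.kl_divergence(posterior, prior).sum(-1).mean()
    return recon + kl
```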

confused about residual in attention

Hi,
Thanks for your implementation!
I am a little confused about result = t.cat([residual, result], dim=-1) in line 205, which you mentioned is very important. Why do you need to concatenate the residual with the attention result?
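For context, the concatenation in question doubles the feature dimension, so the following linear layer has to map 2*dim back to dim, whereas the paper's elementwise addition keeps the dimension fixed. A small shape illustration (the shapes are hypothetical):

```python
import torch

residual = torch.randn(4, 10, 128)  # (batch, num_points, dim), hypothetical
result = torch.randn(4, 10, 128)    # attention output, same shape

concat = torch.cat([residual, result], dim=-1)  # repository's choice: (4, 10, 256)
added = residual + result                       # paper's Figure 8:    (4, 10, 128)
```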
