Hi, thanks for the great tutorial. I have trouble understanding the math. What is the reason to pass `encode3` to `logsd` before the nonlinearity is applied? Why not give `encode3neur` to both `mu` and `logsd`? I would ask if it's a typo, but running the reference prototxt, I can make it converge.

I have combined the VAE layers with convolution and deconvolution layers, and am having trouble training on MNIST with this new architecture. (I'm using Sigmoid neurons instead of ReLU, if that matters.)
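To make sure I understand the wiring I'm asking about, here is a minimal numpy sketch of the two alternatives. All layer sizes and weight names here are made up for illustration; only the layer names `encode3`, `encode3neur`, `mu`, and `logsd` come from the tutorial, and I'm assuming the nonlinearity is a plain ReLU:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for this illustration.
d_in, d_hid, d_lat = 8, 16, 4
x = rng.normal(size=(1, d_in))

W3 = rng.normal(size=(d_in, d_hid)) * 0.1
W_mu = rng.normal(size=(d_hid, d_lat)) * 0.1
W_sd = rng.normal(size=(d_hid, d_lat)) * 0.1

encode3 = x @ W3                       # linear output, before the nonlinearity
encode3neur = np.maximum(encode3, 0)   # after the nonlinearity (ReLU assumed)

# Wiring as I read the tutorial: mu sees the nonlinearity,
# logsd sees the raw linear output.
mu = encode3neur @ W_mu
logsd = encode3 @ W_sd

# Alternative wiring I'm asking about: both heads from encode3neur.
logsd_alt = encode3neur @ W_sd

# Either way, sampling uses the reparameterization trick: z = mu + sigma * eps.
eps = rng.normal(size=mu.shape)
z = mu + np.exp(logsd) * eps
```

Both wirings produce a valid `logsd` (it can be negative either way, since `W_sd` has no sign constraint), so I don't see why the pre-nonlinearity input is needed.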