
Add support for CNN and attention-lstm? #28

Closed
xinsuinizhuan opened this issue Aug 20, 2019 · 26 comments

@xinsuinizhuan

An attention-LSTM performs better than a plain LSTM, and CNN support is needed so the forecasting network can combine a CNN with an attention-LSTM.

@xinsuinizhuan (Author)

[image: network structure diagram]

An attention-LSTM combined with the network from your old project.

@josephjaspers (Owner)

I am not sure what an attention-LSTM is; could you send a link?
I will start working on the CNN.

I need to implement (or find) a good implementation of convolution to use. (I have written one myself before, but it was very slow.)
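
For context on why hand-rolled convolution tends to be slow: the direct form is four nested loops over output pixels and kernel taps, with little data reuse. A minimal single-channel sketch (an illustration only, not this library's code):

	#include <vector>

	// Direct "valid" convolution (stride 1, no padding) of a single-channel
	// ih x iw image with a kh x kw kernel; output is (ih-kh+1) x (iw-kw+1).
	std::vector<float> conv2d_naive(const std::vector<float>& img, int ih, int iw,
	                                const std::vector<float>& k, int kh, int kw) {
		int oh = ih - kh + 1, ow = iw - kw + 1;
		std::vector<float> out(oh * ow, 0.0f);
		for (int y = 0; y < oh; ++y)
			for (int x = 0; x < ow; ++x)
				for (int ky = 0; ky < kh; ++ky)
					for (int kx = 0; kx < kw; ++kx)
						out[y * ow + x] += img[(y + ky) * iw + (x + kx)] * k[ky * kw + kx];
		return out;
	}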

@josephjaspers (Owner)

josephjaspers commented Sep 8, 2019

Added CNN as of:

bbc1a2a

It is very slow, and the user must calculate the output shape themselves.
It is "experimental" currently, but I will complete it within the next week
(improving performance, auto-calculating the shape, etc.).

You can modify the example with this to test:
(I will add an example/tests etc soon).
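
(With a 7x7 kernel at stride 1 and no padding, each 28x28 input plane shrinks to 28 - 7 + 1 = 22 per side, which gives the 22, 22, 3 shape below; the 3 corresponds to the last Convolution argument, presumably the filter count.)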

	auto network = neuralnetwork(
		BC::nn::Convolution<System, double>(28, 28, 1, 7, 7, 3),  // 28x28x1 input, 7x7 kernels, 3 filters
		BC::nn::flatten(system_tag, BC::shape(22, 22, 3)),        // flatten the 22x22x3 conv output
		BC::nn::logistic(system_tag, 22*22*3),
		BC::nn::feedforward(system_tag, 22*22*3, 256),
		BC::nn::logistic(system_tag, 256),
		BC::nn::feedforward(system_tag, 256, 10),
		BC::nn::softmax(system_tag, 10),
		BC::nn::logging_output_layer(system_tag, 10, BC::nn::RMSE).skip_every(100/5)
	);

	std::cout << " training..." << std::endl;
	auto start = std::chrono::system_clock::now();
	for (int i = 0; i < epochs; ++i){
		std::cout << " current epoch: " << i << std::endl;
		for (int j = 0; j < samples/batch_size; ++j) {
			// reshape the flat input batch to 28x28x1 images for the convolution layer
			network.forward_propagation(BC::reshape(inputs[j], BC::shape(28, 28, 1, batch_size)));
			network.back_propagation(outputs[j]);
			network.update_weights();
		}
	}

@josephjaspers (Owner)

TODO:

- Improve the implementation of CNN (most likely switch to an im2col implementation; a sketch follows this list).
- Add max-pooling.
- Add GPU support.
- Add auto-deduction of the output shape.
- Add attention-LSTM.
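
For reference, im2col copies every kernel-sized input patch into the column of a matrix, so the whole convolution collapses into a single matrix multiply that a BLAS GEMM can run efficiently. A minimal single-channel sketch, assuming stride 1 and no padding (an illustration, not this library's API):

	#include <vector>

	// Unroll all kh x kw patches of an ih x iw image into a
	// (kh*kw) x (oh*ow) matrix stored row-major; convolution then
	// becomes a GEMM of the (filters x kh*kw) weight matrix with it.
	std::vector<float> im2col(const std::vector<float>& img, int ih, int iw,
	                          int kh, int kw) {
		int oh = ih - kh + 1, ow = iw - kw + 1;
		std::vector<float> cols(kh * kw * oh * ow);
		for (int y = 0; y < oh; ++y)
			for (int x = 0; x < ow; ++x)
				for (int ky = 0; ky < kh; ++ky)
					for (int kx = 0; kx < kw; ++kx)
						cols[(ky * kw + kx) * (oh * ow) + (y * ow + x)] =
							img[(y + ky) * iw + (x + kx)];
		return cols;
	}

The trade-off is memory: every input pixel is duplicated up to kh*kw times, which is consistent with the higher memory use reported later in this thread.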

@xinsuinizhuan (Author)

You are so great. I am really looking forward to it.

@xinsuinizhuan (Author)

xinsuinizhuan commented Sep 30, 2019

What about max-pooling? When using the CNN, should max-pooling be added? And when I want to try this network structure, what should I do?

[image: network structure diagram]

@josephjaspers (Owner)

I have not yet implemented max-pooling; I want to optimize the CNN first. Max-pooling isn't particularly difficult to implement, though, so I will see if I can do it quickly.

@xinsuinizhuan (Author)

> What about max-pooling? When using the CNN, should max-pooling be added? And when I want to try this network structure, what should I do?
>
> [image: network structure diagram]

What about this network structure? Can it be implemented now?

@josephjaspers (Owner)

Yes, I can work on that soon.
I am also working on optimizing the LSTM and the convolution layer.

For convolution and max-pooling I may borrow Caffe's implementation.

@josephjaspers (Owner)

You can see that I have started working on max-pooling here: https://github.com/josephjaspers/blackcat_tensors/blob/master/include/neural_networks/functions/Max_Pooling.h
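
The forward pass of max-pooling just takes the maximum of each window, remembering the winning index so backpropagation can route the gradient to it. A minimal 2x2, stride-2, single-channel sketch (an illustration, not the contents of Max_Pooling.h):

	#include <vector>

	// 2x2 max-pooling with stride 2 over a single-channel ih x iw image
	// (ih and iw assumed even). argmax records each window's winner so
	// the backward pass can route the gradient to that input element.
	void max_pool_2x2(const std::vector<float>& in, int ih, int iw,
	                  std::vector<float>& out, std::vector<int>& argmax) {
		int oh = ih / 2, ow = iw / 2;
		out.assign(oh * ow, 0.0f);
		argmax.assign(oh * ow, 0);
		for (int y = 0; y < oh; ++y)
			for (int x = 0; x < ow; ++x) {
				int best = (2 * y) * iw + (2 * x);  // start at the window's top-left
				for (int dy = 0; dy < 2; ++dy)
					for (int dx = 0; dx < 2; ++dx) {
						int idx = (2 * y + dy) * iw + (2 * x + dx);
						if (in[idx] > in[best]) best = idx;
					}
				out[y * ow + x] = in[best];
				argmax[y * ow + x] = best;
			}
	}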

@xinsuinizhuan (Author)

Yes. You are so great. I had seen Max_Pooling.h, which is why I asked. We should first focus on LSTM single_predict (same input and output), and then implement convolution and max-pooling. I am looking forward to it.

@xinsuinizhuan (Author)

I ask because I saw a paper in which this network structure was effective for the kind of forecast I am making. With only LSTM layers the forecast result is not so good, so we should try another network structure or use the attention-LSTM. But that is the next thing to implement and test.

@josephjaspers (Owner)

josephjaspers commented Nov 1, 2019

The order of things I will work on is:

  1. Optimizing convolution (it is too slow)
  2. Max-pooling
  3. Optimizing LSTM (it can be much faster than the current implementation)
  4. Attention-LSTM

I have found Caffe's implementation of convolution/max-pooling, so I will most likely import their implementation into this project.

@xinsuinizhuan (Author)

xinsuinizhuan commented Nov 3, 2019 via email

@josephjaspers (Owner)

I am working on improving convolution; hopefully I will have it finished today or tomorrow.
I will try to fix single_predict soon after.

@josephjaspers (Owner)

josephjaspers commented Nov 3, 2019

Hi, I just added a new version of convolution. (It still needs testing and does not currently support single-predict.)

https://github.com/josephjaspers/blackcat_tensors/blob/master/include/neural_networks/Layers/Convolution_Experimental.h

However, it should be much faster than the current version.
I will look into the single_predict function now.

Then:

- Testing/adding single_predict to the faster convolution method
- Max-pooling
- Optimizing LSTM
- Adding attention-LSTM

@josephjaspers (Owner)

> I ask because I saw a paper in which this network structure was effective for the kind of forecast I am making. With only LSTM layers the forecast result is not so good, so we should try another network structure or use the attention-LSTM. But that is the next thing to implement and test.

I also just fixed a bug where set_learning_rate wouldn't actually set the learning rate of the layer, so perhaps re-running will improve performance.
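
After the fix, a call like the following should actually reach each layer's learning rate (0.03 is just an illustrative value, and I am assuming the setter is invoked on the network object):

	network.set_learning_rate(0.03);  // previously this silently failed to update the layers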

@xinsuinizhuan (Author)

You are so great. I am glad to see the update; I will test the new code right away. Let us begin our work as you planned.

@xinsuinizhuan (Author)

> I also just fixed a bug where set_learning_rate wouldn't actually set the learning rate of the layer, so perhaps re-running will improve performance.

Yes, it performs better, but it needs more epochs to train:

- Previous version: the BC::nn::RMSE loss starts at 0.156; with epoch == 1028 it drops to 0.12, with many fluctuations in the data.
- New version: the BC::nn::RMSE loss starts at 0.256; with epoch == 5000 it drops to 0.07, and the data is relatively stable. With epoch == 1024 it only drops to 0.166.

@josephjaspers (Owner)

Eventually I would like to add optimizers (like momentum and Adam), though I haven't begun to work on them yet.
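
For context, plain SGD updates each weight as w -= lr * grad, while momentum keeps a running velocity that smooths the updates. A minimal sketch of the momentum step (an illustration of the idea, not an API in this library):

	#include <vector>

	// SGD with momentum: v = mu * v - lr * grad; w += v.
	// mu is the momentum coefficient (commonly around 0.9).
	void momentum_step(std::vector<float>& w, std::vector<float>& v,
	                   const std::vector<float>& grad, float lr, float mu) {
		for (std::size_t i = 0; i < w.size(); ++i) {
			v[i] = mu * v[i] - lr * grad[i];
			w[i] += v[i];
		}
	}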

@xinsuinizhuan (Author)

xinsuinizhuan commented Nov 8, 2019

> Eventually I would like to add optimizers (like momentum and Adam), though I haven't begun to work on them yet.

I am glad to see your reply, and I am really looking forward to it. I want to use this network in practice, so please speed things up when you are free, including single_predict and the features above.

@josephjaspers (Owner)

Convolution (the experimental version) is now the standard version.
I have tested it (on Linux, not Windows).
It is considerably faster than the previous version; however, it consumes a lot of memory.

ff33dff

@josephjaspers (Owner)

Max-pooling branch (not complete):

5e6602f

@josephjaspers (Owner)

Convolution and max-pooling have been added!
The attention-LSTM ticket has been moved to:
#48
