SliceOut Layer - enhanced dropout #2145
Comments
/cc @dynamicwebpaige @tanzhenyu is this in your internal roadmap?
This seems like a generic & experimental technique which might be best hosted in Addons (so it's not specific to CV or NLP).
@tanzhenyu And so I suppose also not in standalone Keras/tf.keras, right?
That is correct. If this becomes successful, we should help move it from Addons to tf.keras.
@bhack, @tanzhenyu Thank you for the response. Can I start implementing it in TensorFlow Addons and testing its performance?
@g0lemXIV Is there any reference impl?
@bhack I couldn't find any... It seems the authors didn't share an implementation in any framework.
I think that we need to wait for a sponsor to review and co-maintain this feature. /cc @seanpmorgan
Yeah, I would agree to co-maintain this feature. We will want to benchmark it for performance / accuracy vs. dropout, as is done in the paper. Please proceed with a PR @g0lemXIV
Hello, sorry that I didn't respond sooner. I've tried to read and implement the paper, but I ran into many errors during the implementation. I think it will be hard to implement their structure in TensorFlow because the graph structure changes dynamically. Therefore, I must leave this feature request. Sorry again.
@g0lemXIV you can paste the errors here. Maybe we can help out with that?
It would also be helpful to see your current progress.
Looks to be quite simple for the Dense layer:

import tensorflow as tf
from tensorflow.keras import backend as K
from tensorflow.keras.layers import Dense
# Private TF internals that the stock Dense layer itself used in the TF 2.x era.
from tensorflow.python.eager import context
from tensorflow.python.keras.layers.ops import core as core_ops


class DenseSliceOut(Dense):
    """Dense layer with SliceOut: keeps a random contiguous slice of units during training."""

    def __init__(self, units, dropout, **kwargs):
        super().__init__(units, **kwargs)
        # Number of units actually kept in each training step.
        self.slice_size = int(units * (1 - dropout))

    def call(self, inputs, training=None):
        if training is None:
            training = K.learning_phase()
        if not training:
            # At inference time, behave exactly like a regular Dense layer.
            return super().call(inputs)
        outputs_shape = self.compute_output_shape(inputs.shape)
        # Random start index of the contiguous block of units to keep.
        begin = tf.random.uniform(
            [], maxval=self.units - self.slice_size + 1, dtype=tf.int32)
        # Matmul only against the sliced kernel columns and bias entries.
        outputs = core_ops.dense(
            inputs,
            tf.slice(self.kernel, [0, begin], [self.kernel.shape[0], self.slice_size]),
            tf.slice(self.bias, [begin], [self.slice_size]),
            self.activation,
            dtype=self._compute_dtype_object)
        # Upscale to preserve the expected activation magnitude, as with inverted dropout.
        outputs = outputs * (self.units / self.slice_size)
        # Pad back to the full width so downstream layers see an unchanged shape.
        outputs = tf.pad(
            outputs,
            [[0, 0]] * (len(outputs_shape) - 1)
            + [[begin, self.units - self.slice_size - begin]])
        if not context.executing_eagerly():
            outputs.set_shape(outputs_shape)
        return outputs

If stacked, inputs could be sliced too. Not sure if complementary things such as tf.pad will hurt performance anyway.
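For context, a minimal usage sketch of the class above (assuming eager TF 2.x; the unit counts and shapes here are arbitrary):

layer = DenseSliceOut(units=128, dropout=0.5, activation="relu")
x = tf.random.uniform([32, 64])

y_train = layer(x, training=True)   # a random contiguous block of 64 units is active, padded back to width 128
y_infer = layer(x, training=False)  # plain Dense forward pass over all 128 units
print(y_train.shape, y_infer.shape)  # both (32, 128)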
TensorFlow Addons is transitioning to a minimal maintenance and release mode. New features will not be added to this repository. For more information, please see our public messaging on this decision. Please consider sending feature requests / contributions to other repositories in the TF community with similar charters to TFA.
Describe the feature and the current behavior/state.
SliceOut is a regularization technique for speedups and memory reduction that drops contiguous sets of units at random. The method preserves the regularization properties of dropout while allowing for a more efficient low-level implementation: training speedups come from fast memory access and matrix multiplication of smaller tensors, and memory savings come from not allocating memory to zeroed units in weight gradients and activations. Despite its simplicity, the method is highly effective.
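For intuition, a rough sketch of the core idea (purely illustrative, not from the paper; the sizes are arbitrary): standard dropout multiplies activations by a random binary mask and keeps the full-width tensor, whereas SliceOut keeps a single random contiguous block of units, so subsequent matrix multiplications can run on a physically smaller tensor.

import tensorflow as tf

units, keep = 8, 6                      # drop rate 0.25: keep 6 of 8 units
h = tf.random.uniform([4, units])       # a batch of activations, shape (4, 8)

# Standard dropout: zero random units; the tensor keeps its full width of 8.
mask = tf.cast(tf.random.uniform([units]) >= 0.25, h.dtype)
dropped = h * mask / 0.75               # inverted-dropout rescaling

# SliceOut-style: keep one random contiguous block; downstream ops see a (4, 6) tensor.
begin = tf.random.uniform([], maxval=units - keep + 1, dtype=tf.int32)
sliced = h[:, begin:begin + keep]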
Relevant information
Which API type would this fall under (layer, metric, optimizer, etc.)
Who will benefit from this feature?
Any other info.