
Migrate gelu to core #550

Closed
WindQAQ opened this issue Sep 30, 2019 · 20 comments

@WindQAQ
Member

WindQAQ commented Sep 30, 2019

Describe the feature and the current behavior/state.

tfa.activations.gelu would be moved to tf.keras.activations.gelu, along with the C++ (CC) kernels.
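For reference, a minimal sketch of what the op computes, following Hendrycks & Gimpel (arXiv:1606.08415); the exact form evaluates the Gaussian CDF via the error function (tfa also provides a tanh approximation):

```python
import math

import tensorflow as tf


def gelu_exact(x):
    # GELU(x) = x * P(X <= x) for X ~ N(0, 1), i.e. x times the Gaussian CDF.
    return 0.5 * x * (1.0 + tf.math.erf(x / math.sqrt(2.0)))
```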

Relevant information

  • Are you willing to contribute it (yes/no): yes
  • Are you willing to maintain it going forward? (yes/no): yes
  • Is there a relevant academic paper? (if so, where): yes, Gaussian Error Linear Units (GELUs), arXiv:1606.08415
  • Is there already an implementation in another framework? (if so, where): yes, addons
  • Was it part of tf.contrib? (if so, where): no

Which API type would this fall under (layer, metric, optimizer, etc.)
activations

Who will benefit from this feature?
Whole community

Any other info.
As per tensorflow/tensorflow#32783, it seems that Keras is looking for a gelu activation and will accept a PR.

@WindQAQ
Member Author

WindQAQ commented Sep 30, 2019

cc @seanpmorgan and @facaiy.

@facaiy
Member

facaiy commented Sep 30, 2019

+@karmel for visibility

@karmel
Contributor

karmel commented Oct 1, 2019

Metacomment: @ewilderj / @seanpmorgan -- I wonder if we should start tracking these moves? Seems like it's one interesting metric indicating the utility of Addons.

@ewilderj
Contributor

ewilderj commented Oct 1, 2019

+1 I'd be strongly in favor of at least something like a GRADUATED.md doc that kept a list of these with the date they moved. You're right, it's great bragging rights, and also a good piece of documentation for users.

@AakashKumarNain
Member

I am happy that GELU is finally moving to core. @WindQAQ, let me know if you need any help in any way.

@WindQAQ
Member Author

WindQAQ commented Oct 6, 2019

Hi all, my minor concern about this migration is that tfa's gelu is based on a C++/CUDA kernel rather than a python function. So I am afraid the PR will affect not only tf.keras.* but also some cc files in tensorflow/core/* as well as tf.nn.*. Is it still okay for us to issue a single PR to cover both the core and keras parts? Thank you all in advance!

@facaiy
Member

facaiy commented Oct 8, 2019

Is it still okay for us to issue a single PR to cover both the core and keras parts?

We usually add both C++ and python code when implementing a new op, so I think the answer is yes. But I'm a little curious why we write a C++ kernel implementation for activation ops. Those ops look quite simple, and it seems that they could be composed from tf python ops easily, if I'm not wrong.
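For illustration, a minimal sketch of the composite version described here, using the tanh approximation (whether that matches tfa's default form is an assumption on my part):

```python
import math

import tensorflow as tf


def gelu_composite(x):
    # tanh approximation of GELU built entirely from existing tf python ops;
    # autodiff supplies the gradient, so no hand-written C++ grad is needed.
    coeff = math.sqrt(2.0 / math.pi)
    return 0.5 * x * (1.0 + tf.tanh(coeff * (x + 0.044715 * tf.pow(x, 3))))
```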

@WindQAQ
Member Author

WindQAQ commented Oct 8, 2019

Is it still okay for us to issue a single PR to cover both the core and keras parts?

We usually add both C++ and python code when implementing a new op, so I think the answer is yes. But I'm a little curious why we write a C++ kernel implementation for activation ops. Those ops look quite simple, and it seems that they could be composed from tf python ops easily, if I'm not wrong.

Speed. On my local machine, the C++ kernel is about 3x faster in the forward pass and 10x faster in the backward pass compared with tf python ops. I know that XLA can fuse multiple operations into a single kernel to some extent, but not all users can use XLA on their own machines.

See also: a related talk presented by NVIDIA, gelu's performance test on Colab, and a similar issue requesting a C++ kernel for prelu in core TF (tensorflow/tensorflow#31883).
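For concreteness, a rough micro-benchmark sketch of the kind of comparison described above; the shapes, iteration count, and composite baseline are illustrative assumptions, with tfa.activations.gelu as the fused-kernel path:

```python
import math
import time

import tensorflow as tf
import tensorflow_addons as tfa


def gelu_composite(x):
    # Composite python-op baseline to compare against the fused kernel.
    return 0.5 * x * (1.0 + tf.math.erf(x / math.sqrt(2.0)))


@tf.function
def fwd_bwd(fn, x):
    # One forward + backward pass through the given activation.
    with tf.GradientTape() as tape:
        tape.watch(x)
        y = fn(x)
    return tape.gradient(y, x)


x = tf.random.normal([4096, 4096])
for name, fn in [("composite", gelu_composite), ("fused", tfa.activations.gelu)]:
    fwd_bwd(fn, x).numpy()  # warm up / trace; .numpy() forces execution
    start = time.perf_counter()
    for _ in range(100):
        fwd_bwd(fn, x).numpy()
    print(f"{name}: {time.perf_counter() - start:.3f}s for 100 fwd+bwd passes")
```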

WindQAQ self-assigned this Oct 11, 2019
@facaiy
Member

facaiy commented Oct 18, 2019

I agree that a C++ implementation should run faster than a python implementation, Tzu-Wei. But as the op is so small, I'm not sure whether it is worth the cost of maintaining a chunk of code. Moreover, the new op we add might not work well with graph optimization toolkits (say, op fusion as you mentioned, or TensorRT, etc.) because those tools might only focus on tf-core. I'm not against a C++ implementation, just a little reminder: perhaps we could write a C++ implementation or create a fused op later if performance suffers in mainstream models 😄

@WindQAQ
Member Author

WindQAQ commented Oct 18, 2019

Thanks for pointing that out! So what should we migrate this time: the tf python implementation only, or the kernel version? Both are fine with me, but I'd like to hear your thoughts 😄

@hendrycks

hendrycks commented Oct 18, 2019 via email

@WindQAQ
Member Author

WindQAQ commented Nov 4, 2019

Hi all, the PR has been created tensorflow/tensorflow#33945.

@AakashKumarNain
Member

Hi all, the PR has been created tensorflow/tensorflow#33945.

@WindQAQ Are we going with the Python implementation or with the C++ one?

@WindQAQ
Member Author

WindQAQ commented Nov 5, 2019

Hi all, the PR has been created tensorflow/tensorflow#33945.

@WindQAQ Are we going with the Python implementation or with the C++ one?

I went with the C++ implementation :D

@bhack
Contributor

bhack commented Nov 8, 2019

+1 I'd be strongly in favor of at least something like a GRADUATED.md doc that kept a list of these with the date they moved. You're right, it's great bragging rights, and also a good piece of documentation for users.

Yes, but the main issue is that we always need to check in multiple places:
keras-team/keras#11834
keras-team/keras#12020
keras-team/keras#11835
keras-team/keras#11839

We need to find a solution for the Keras repo other than contrib vs addons:
keras-team/keras-contrib#519

@seanpmorgan
Member

During the monthly call it was determined that the best way to move this forward is to create a Keras RFC per our new procedure:
https://github.com/tensorflow/addons/blob/master/MIGRATION_TO_CORE.md

The goal is to propose the custom op along with the accompanying performance tests. In the "alternatives considered" we can propose the python composite op for discussion.

@WindQAQ When time allows, would you be able to author this RFC? I can help out as well.

@WindQAQ
Member Author

WindQAQ commented Jan 14, 2020

The goal is to propose the custom op along with the accompanying performance tests. In the "alternatives considered" we can propose the python composite op for discussion.

@WindQAQ When time allows, would you be able to author this RFC? I can help out as well.

Of course!

@AakashKumarNain
Member

Lemme know if you guys need any help. I'd be more than happy to pitch in.

@bhack
Contributor

bhack commented Mar 7, 2020

@seanpmorgan

The goal is to propose the custom op along with the accompanying performance tests. In the "alternatives considered" we can propose the python composite op for discussion.
Also, there is probably an impact from the evolving MLIR compiler stack ecosystem, regarding the codegen path versus C++ implementations in custom ops.

@seanpmorgan
Member

Closing as this will be finished up in #2005
