[ Tensor ] Refactor blas/math related files into cpu backend considering arch-dep @open sesame 10/02 13:19 #2549
base: main
📝 TAOS-CI Version: 1.5.20200925. Thank you for submitting PR #2549. Please follow the 1commit/1PR (one commit per PR) policy to get comments quickly from reviewers. Your PR must pass all verification processes of cibot before the review process starts. If you are new to this project, please read the manuals in the documentation folder and the wiki page. To monitor the progress of your PR in more detail, visit http://ci.nnstreamer.ai/.
cibot: @skykongkong8, A builder checker could not be completed because one of the checkers is not completed. In order to find out a reason, please go to http://ci.nnstreamer.ai/nntrainer/ci/repo-workers/pr-checker/2549-202404181618220.29115009307861-c94059c0dfb6b54f2021eb42c4aa56adbc256ee3/.
c94059c to 9928557
cibot: @skykongkong8, nntrainer/tensor/cpu_backend/x86/x86_compute_library.cpp does not include Doxygen tags such as @file @brief @author @bug. You must include the Doxygen tags in the source code. Please refer to a Doxygen manual at http://github.com/nnstreamer/TAOS-CI/blob/main/ci/doc/doxygen-documentation.md
cibot: @skykongkong8, The last line of a text file must have a newline character. Please append a new line at the end of the line in nntrainer/tensor/cpu_backend/fallback/cblas_fallback.h.
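For reference, a file header carrying the four tags the checker names (`@file`, `@brief`, `@author`, `@bug`) might look roughly like the sketch below. The file name, author line, descriptions, and the `scale_copy` helper are all placeholders invented for illustration; they are not taken from the PR. The file also ends with a newline character, which is what the second check asks for.

```cpp
/**
 * @file   example_compute_library.cpp
 * @brief  Placeholder description of what this source file provides
 * @author Placeholder Name <placeholder@example.com>
 * @bug    No known bugs
 */

#include <cstddef>

/**
 * @brief Hypothetical helper (illustration only): y[i] = alpha * x[i]
 * @param[in]  n     number of elements
 * @param[in]  alpha scaling factor
 * @param[in]  x     input array of length n
 * @param[out] y     output array of length n
 */
void scale_copy(std::size_t n, float alpha, const float *x, float *y) {
  for (std::size_t i = 0; i < n; ++i)
    y[i] = alpha * x[i];
}
```

Which tags are mandatory, and in what order, is decided by the project's checker configuration, so the linked Doxygen manual remains the authority.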
8e786e1 to 117e75b
@skykongkong8, 💯 All CI checkers are successfully verified. Thanks.
117e75b to 1c79d6e
cibot: @skykongkong8, nntrainer/tensor/cpu_backend/arm/neon_impl_fp16.cpp includes bug(s). Please fix incorrect coding constructs in your commit before entering a review process.
1c79d6e to 807936a
cibot: @skykongkong8, A builder checker could not be completed because one of the checkers is not completed. In order to find out a reason, please go to http://ci.nnstreamer.ai/nntrainer/ci/repo-workers/pr-checker/2549-202408200905540.24510407447815-807936a1f92ba590705155408e7bca6ddaac3bc7/.
807936a to a67ace2
@skykongkong8, 💯 All CI checkers are successfully verified. Thanks.
a67ace2 to 58bb76d
@skykongkong8, 💯 All CI checkers are successfully verified. Thanks.
58bb76d to b42d031
@skykongkong8, 💯 All CI checkers are successfully verified. Thanks.
b42d031 to b88301b
@skykongkong8, 💯 All CI checkers are successfully verified. Thanks.
7d6c2fc to 738bec4
@skykongkong8, 💯 All CI checkers are successfully verified. Thanks.
738bec4 to 4a02d1c
1. Replace `blas_interface.h` with `cpu_backend.h`, which has virtual functions of `blas_interface.h`.
2. Actual implementations live in `arm_compute_backend`, `x86_compute_backend`, and `fallback`, and they are included according to the target CPU architecture. `cblas.h` is used for both of them for fp32 computation.
3. There are some differences (unsupported intrinsics or data types) across versions, and they are managed under each `arm` or `x86` directory.

**Self evaluation:**
1. Build test: [X]Passed [ ]Failed [ ]Skipped
2. Run test: [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: skykongkong8 <[email protected]>
4a02d1c to 058221f
@skykongkong8, 💯 All CI checkers are successfully verified. Thanks.
While implementing additional features in NEON, I found myself writing unnecessary code blocks.
This is a draft proposal for refactoring the current blas/math related files.
DONE
- `cpu_backend.h` itself is done by now.
- `TensorV2`: [Tensor] Refactorize Tensor Class to TensorV2 @open sesame 03/26 12:26 #2500
- `sgemv`, `sgemm` (StorageOrder, data addr, ... )
- `tensor.cpp` in this PR, while intentionally eliminating ALL `TensorV2` related files
- `#include <blas_interface.h>` to `#include <cpu_backend.h>`
- `#include <blas_neon.h>` to `#include <neon_impl.h>` and manage fp16 functions in `.cpp` files

Final form of this PR would be like:

and removing `blas_interface.h`
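In short,

1. Replace `blas_interface.h` with `cpu_backend.h`, which has virtual functions of `blas_interface.h`.
2. Actual implementations live in `arm_compute_backend`, `x86_compute_backend`, and `fallback`, and they are included according to the target CPU architecture. `cblas.h` is used for both of them for fp32 computation.
3. There are some differences (unsupported intrinsics or data types) across versions, and they are managed under each `arm` or `x86` directory.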
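
To make the three points above concrete, here is a minimal sketch of architecture-dependent backend selection. It is not the PR's code: the namespaces `fallback`, `arm_backend`, `x86_backend`, `selected_backend`, and `cpu_backend_sketch` are hypothetical, and the sketch assumes the choice is made at compile time with common predefined compiler macros, whereas the PR may select sources through the build system instead. The per-architecture functions simply forward to the portable fallback so the example builds everywhere; a real backend would call NEON/AVX kernels or `cblas` there.

```cpp
#include <cstddef>

// All names in this file are hypothetical stand-ins, not the PR's symbols.

namespace fallback {
// Portable reference sgemv: y = alpha * A * x + beta * y,
// where A is a row-major (rows x cols) matrix.
inline void sgemv(std::size_t rows, std::size_t cols, float alpha,
                  const float *A, const float *x, float beta, float *y) {
  for (std::size_t i = 0; i < rows; ++i) {
    float acc = 0.0f;
    for (std::size_t j = 0; j < cols; ++j)
      acc += A[i * cols + j] * x[j];
    y[i] = alpha * acc + beta * y[i];
  }
}
} // namespace fallback

#if defined(__aarch64__) || defined(__arm__)
namespace arm_backend {
// Placeholder for an ARM-specific kernel; forwards to the fallback here.
inline void sgemv(std::size_t rows, std::size_t cols, float alpha,
                  const float *A, const float *x, float beta, float *y) {
  fallback::sgemv(rows, cols, alpha, A, x, beta, y);
}
} // namespace arm_backend
namespace selected_backend = arm_backend;
#elif defined(__x86_64__) || defined(__i386__) || defined(_M_X64) || \
  defined(_M_IX86)
namespace x86_backend {
// Placeholder for an x86-specific kernel; forwards to the fallback here.
inline void sgemv(std::size_t rows, std::size_t cols, float alpha,
                  const float *A, const float *x, float beta, float *y) {
  fallback::sgemv(rows, cols, alpha, A, x, beta, y);
}
} // namespace x86_backend
namespace selected_backend = x86_backend;
#else
// Unknown architecture: use the portable implementation directly.
namespace selected_backend = fallback;
#endif

namespace cpu_backend_sketch {
// Single entry point that callers (e.g. tensor code) would include and use,
// independent of which architecture-specific backend was selected.
inline void sgemv(std::size_t rows, std::size_t cols, float alpha,
                  const float *A, const float *x, float beta, float *y) {
  selected_backend::sgemv(rows, cols, alpha, A, x, beta, y);
}
} // namespace cpu_backend_sketch
```

With this layout, calling code only ever refers to `cpu_backend_sketch::sgemv`, so adding or removing an architecture-specific kernel does not touch the callers.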