- Change mace header interfaces, only including necessary methods.
- Return status instead of abort when allocate failed
- support
float
data_type when running in gpu
- Change interface that report error type
- Improve cpu performace
- Merge cpu/gpu engine to on
- Change build and run tools
- Handle runtime failure