Skip to content

Latest commit

 

History

History
 
 

published_results

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 

Results

Pre-trained embeddings

Data set and vocabulary

Dataset Files LLVM IR lines Vocabulary size XFG stmt pairs
Tensorflow 2,492 16,943,893 220,554 260,250,973
AMD APP SDK 123 1,304,669 4,146 45,081,359
BLAS 300 280,782 566 283,856
NAS 268 572,521 1,793 1,701,968
Parboil 151 118,575 2,175 151,916
PolybenchGPU 40 33,601 577 40,975
Rodinia 92 103,296 3,861 266,354
SHOC 112 399,287 3,381 12,096,508
COSMO 161 152,127 2,344 2,338,153
Linux kernel 1,988 2,544,245 136,545 5,271,179
OpenCV 442 1,908,683 39,920 10,313,451
NVIDIA samples 60 43,563 2,467 74,915
Synthetic 17,801 26,045,547 113,763 303,054,685

Note: the "synthetic" data set is made up of:

Dataset Files LLVM IR lines XFG stmt pairs
eigen 1'301 19'796'291 254'997'306
gemm_eigen_sample 500 3'208'180 46'027'593
gemm_simple_sample 3'200 711'839 671'033
stencil_1d_sample 3'200 395'056 233'552
stencil_2d_sample 3'200 600'133 389'008
stencil_3d_sample 3'200 728'849 433'985
stencil_mc4d_sample 3'200 605'199 302'208

Skip-Gram parameters

Parameter Value
Context width x
x x

Training parameters

Parameter Value
Number epochs x
x x

Pre-trained task models

Algorithm classification here

Prediction Accuracy [%]

Computing Platform Grewe et al. DeepTune ncc / inst2vec
AMD Tahiti 7970 73.38 83.68 82.79
NVIDIA GTX 970  72.94  80.29 81.76

Speedups

Computing Platform Grewe et al. DeepTune ncc / inst2vec
AMD Tahiti 7970 2.91 3.34  3.42
NVIDIA GTX 970  1.26 1.41   1.39

Optimal thread coarsening factor prediction here

Speedups

Computing Platform Magni et al.  DeepTune  DeepTune-TL ncc / inst2vec
AMD Radeon HD 5900  1.21  1.10 1.17  1.25
AMD Tahiti 7970  1.01 1.05 1.23 1.07
NVIDIA GTX 480 0.86 1.10 1.14 1.02
NVIDIA Tesla K20c 0.94 0.99 0.93 1.03