Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SyCL intel build broken? #542

Open
paoletto opened this issue Jan 1, 2025 · 4 comments
Open

SyCL intel build broken? #542

paoletto opened this issue Jan 1, 2025 · 4 comments

Comments

@paoletto
Copy link

paoletto commented Jan 1, 2025

 LD_LIBRARY_PATH=/opt/intel/oneapi/2025.0/lib/ /home/paolo/AI/tools/stable-diffusion.cpp/SyCL/bin/sd --model /home/paolo/AI/models/StableDiffusion/CP/photon_v1.safetensors -p "a lovely cat"
[SYCL] call ggml_check_sycl
ggml_check_sycl: GGML_SYCL_DEBUG: 0
ggml_check_sycl: GGML_SYCL_F16: yes
found 1 SYCL devices:
|  |                   |                                       |       |Max    |        |Max  |Global |                     |
|  |                   |                                       |       |compute|Max work|sub  |mem    |                     |
|ID|        Device Type|                                   Name|Version|units  |group   |group|size   |       Driver version|
|--|-------------------|---------------------------------------|-------|-------|--------|-----|-------|---------------------|
| 0|     [opencl:gpu:0]|                 Intel Iris Xe Graphics|    3.0|     96|     512|   32| 53645M|       23.17.26241.33|
ggml_sycl_init: GGML_SYCL_FORCE_MMQ:   no
ggml_sycl_init: SYCL_USE_XMX: yes
ggml_sycl_init: found 1 SYCL devices:
[INFO ] stable-diffusion.cpp:195  - loading model from '/home/paolo/AI/models/StableDiffusion/CP/photon_v1.safetensors'
[INFO ] model.cpp:888  - load /home/paolo/AI/models/StableDiffusion/CP/photon_v1.safetensors using safetensors format
[INFO ] stable-diffusion.cpp:242  - Version: SD 1.x 
[INFO ] stable-diffusion.cpp:275  - Weight type:                 f16
[INFO ] stable-diffusion.cpp:276  - Conditioner weight type:     f16
[INFO ] stable-diffusion.cpp:277  - Diffusion model weight type: f16
[INFO ] stable-diffusion.cpp:278  - VAE weight type:             f16
  |==================================================| 1130/1130 - 500.00it/s
[INFO ] stable-diffusion.cpp:516  - total params memory size = 1969.78MB (VRAM 1969.78MB, RAM 0.00MB): clip 235.06MB(VRAM), unet 1640.25MB(VRAM), vae 94.47MB(VRAM), controlnet 0.00MB(VRAM), pmid 0.00MB(VRAM)
[INFO ] stable-diffusion.cpp:520  - loading model from '/home/paolo/AI/models/StableDiffusion/CP/photon_v1.safetensors' completed, taking 2.17s
[INFO ] stable-diffusion.cpp:550  - running in eps-prediction mode
[INFO ] stable-diffusion.cpp:682  - Attempting to apply 0 LoRAs
[INFO ] stable-diffusion.cpp:1235 - apply_loras completed, taking 0.00s
No kernel named _ZTSZL19get_rows_sycl_floatIN4sycl3_V16detail9half_impl4halfEEvR25ggml_backend_sycl_contextPK11ggml_tensorS9_PS7_PKT_PKiPfPNS1_5queueEEUlNS1_7nd_itemILi3EEEE_ was foundException caught at file:/home/paolo/AI/tools/stable-diffusion.cpp/ggml/src/ggml-sycl/common.cpp, line:102

Notes:

  • same error if i try to run sd.cpp through python bindings.
  • Up until a few weeks ago it used to work
  • i removed debian libmkl packages in favor of intel ones (or else mix and matching was messing up everything), which are supposed to be picked up through LD_LIBRARY_PATH=/opt/intel/oneapi/2025.0/lib/
$ dpkg -l | grep onea
ii  intel-oneapi-advisor                             2025.0.0-797                               amd64        Intel® Advisor
ii  intel-oneapi-base-toolkit                        2025.0.1-45                                amd64        Intel® oneAPI Base Toolkit
ii  intel-oneapi-base-toolkit-env-2025.0             2025.0.1-45                                all          Intel® oneAPI Base Toolkit
ii  intel-oneapi-base-toolkit-getting-started-2025.0 2025.0.1-45                                all          Intel® oneAPI Base Toolkit
ii  intel-oneapi-ccl-2021.14                         2021.14.0-505                              amd64        Intel® oneAPI Collective Communications Library Runtime Environment
ii  intel-oneapi-ccl-devel                           2021.14.0-505                              amd64        Intel® oneAPI Collective Communications Library
ii  intel-oneapi-ccl-devel-2021.14                   2021.14.0-505                              amd64        Intel® oneAPI Collective Communications Library
ii  intel-oneapi-common-licensing                    2025.0.1-15                                all          oneAPI Common License
ii  intel-oneapi-common-licensing-2024.1             2024.1.0-579                               all          oneAPI Common License
ii  intel-oneapi-common-licensing-2025.0             2025.0.1-15                                all          oneAPI Common License
ii  intel-oneapi-common-oneapi-vars                  2025.0.1-15                                all          oneAPI Common Toolkit Environment Script
ii  intel-oneapi-common-oneapi-vars-2024.1           2024.1.0-579                               all          oneAPI Common Toolkit Environment Script
ii  intel-oneapi-common-oneapi-vars-2025.0           2025.0.1-15                                all          oneAPI Common Toolkit Environment Script
ii  intel-oneapi-common-vars                         2025.0.1-15                                all          oneAPI Common Environment Scripts
ii  intel-oneapi-compiler-cpp-eclipse-cfg-2025.0     2025.0.4-1519                              all          Intel® oneAPI DPC++/C++ Compiler 2025.0.4 for Linux* eclipse integration configuration file (C++)
ii  intel-oneapi-compiler-dpcpp-cpp                  2025.0.4-1519                              amd64        Intel® oneAPI DPC++/C++ Compiler
ii  intel-oneapi-compiler-dpcpp-cpp-2025.0           2025.0.4-1519                              amd64        Intel® oneAPI DPC++/C++ Compiler
ii  intel-oneapi-compiler-dpcpp-cpp-common-2025.0    2025.0.4-1519                              all          Intel® oneAPI DPC++/C++ Compiler 2025.0.4 for Linux*
ii  intel-oneapi-compiler-dpcpp-cpp-runtime-2024.1   2024.1.2-995                               amd64        Intel® oneAPI DPC++/C++ Compiler 2024.1.2 for Linux* runtime package for Intel(R) 64
ii  intel-oneapi-compiler-dpcpp-cpp-runtime-2025.0   2025.0.4-1519                              amd64        Intel® oneAPI DPC++/C++ Compiler 2025.0.4 for Linux* runtime package for Intel(R) 64
ii  intel-oneapi-compiler-dpcpp-eclipse-cfg-2025.0   2025.0.4-1519                              all          Intel® oneAPI DPC++/C++ Compiler 2025.0.4 for Linux* eclipse integration configuration file (DPC++)
ii  intel-oneapi-compiler-fortran-2025.0             2025.0.4-1519                              amd64        Intel® Fortran Compiler 2025.0.4 for Linux* for Intel(R) 64
ii  intel-oneapi-compiler-fortran-common-2025.0      2025.0.4-1519                              all          Intel® Fortran Compiler 2025.0.4 for Linux*
ii  intel-oneapi-compiler-fortran-runtime-2025.0     2025.0.4-1519                              amd64        Intel® Fortran Compiler 2025.0.4 for Linux* runtime package for Intel(R) 64
ii  intel-oneapi-compiler-shared-2025.0              2025.0.4-1519                              amd64        Intel(R) Compiler Shared Files
ii  intel-oneapi-compiler-shared-common-2025.0       2025.0.4-1519                              all          Intel(R) Compiler Shared Files
ii  intel-oneapi-compiler-shared-runtime-2024.1      2024.1.2-995                               amd64        Intel(R) Compiler Shared Files runtime contents
ii  intel-oneapi-compiler-shared-runtime-2025.0      2025.0.4-1519                              amd64        Intel(R) Compiler Shared Files runtime contents
ii  intel-oneapi-dal-2025.0                          2025.0.1-9                                 amd64        Intel® oneAPI Data Analytics Library
ii  intel-oneapi-dal-common-2025.0                   2025.0.1-9                                 all          Intel® oneAPI Data Analytics Library common
ii  intel-oneapi-dal-common-devel-2025.0             2025.0.1-9                                 all          Intel® oneAPI Data Analytics Library common
ii  intel-oneapi-dal-devel                           2025.0.1-9                                 amd64        Intel® oneAPI Data Analytics Library Development Package
ii  intel-oneapi-dal-devel-2025.0                    2025.0.1-9                                 amd64        Intel® oneAPI Data Analytics Library Development Package
ii  intel-oneapi-dev-utilities                       2025.0.0-599                               amd64        Dev Utilities
ii  intel-oneapi-dev-utilities-2025.0                2025.0.0-599                               amd64        Dev Utilities
ii  intel-oneapi-dev-utilities-eclipse-cfg-2025.0    2025.0.0-599                               all          intel-oneapi-dev-utilities-eclipse-cfg
ii  intel-oneapi-dnnl                                2025.0.1-6                                 amd64        Intel® oneAPI Deep Neural Network Library
ii  intel-oneapi-dnnl-2025.0                         2025.0.1-6                                 amd64        Intel® oneAPI Deep Neural Network Library
ii  intel-oneapi-dnnl-devel                          2025.0.1-6                                 amd64        Intel® oneAPI Deep Neural Network Library Development Package
ii  intel-oneapi-dnnl-devel-2025.0                   2025.0.1-6                                 amd64        Intel® oneAPI Deep Neural Network Library Development Package
ii  intel-oneapi-dpcpp-cpp-2025.0                    2025.0.4-1519                              amd64        Intel® oneAPI DPC++/C++ Compiler 2025.0.4 for Linux* for Intel(R) 64
ii  intel-oneapi-dpcpp-ct                            2025.0.1-17                                amd64        Intel® DPC++ Compatibility Tool
ii  intel-oneapi-dpcpp-ct-2025.0                     2025.0.1-17                                amd64        Intel® DPC++ Compatibility Tool
ii  intel-oneapi-dpcpp-ct-eclipse-cfg-2025.0         2025.0.1-17                                all          Intel® DPC++ Compatibility Tool 2025.0.1 for Linux* eclipse integration configuration file
ii  intel-oneapi-dpcpp-debugger-2025.0               2025.0.0-663                               amd64        Intel® Distribution for GDB*
ii  intel-oneapi-embree                              4.3.1-706                                  amd64        Intel® Embree
ii  intel-oneapi-embree-4.3                          4.3.1-706                                  amd64        Intel® Embree
ii  intel-oneapi-hpc-toolkit                         2025.0.1-46                                amd64        Intel® oneAPI HPC Toolkit
ii  intel-oneapi-hpc-toolkit-env-2025.0              2025.0.1-46                                all          Intel® oneAPI HPC Toolkit
ii  intel-oneapi-hpc-toolkit-getting-started-2025.0  2025.0.1-46                                all          Intel® oneAPI HPC Toolkit
ii  intel-oneapi-icc-eclipse-plugin-cpp-2025.0       2025.0.4-1519                              all          Standards driven high performance cross architecture DPC++/C++ compiler
ii  intel-oneapi-ipp-2022.0                          2022.0.0-808                               amd64        Intel® Integrated Performance Primitives
ii  intel-oneapi-ipp-devel                           2022.0.0-808                               amd64        Intel® Integrated Performance Primitives Development Package
ii  intel-oneapi-ipp-devel-2022.0                    2022.0.0-808                               amd64        Intel® Integrated Performance Primitives Development Package
ii  intel-oneapi-ippcp-2025.0                        2025.0.0-615                               amd64        Intel® Cryptography Primitives Library
ii  intel-oneapi-ippcp-devel                         2025.0.0-615                               amd64        Intel® Cryptography Primitives Library Development Package
ii  intel-oneapi-ippcp-devel-2025.0                  2025.0.0-615                               amd64        Intel® Cryptography Primitives Library Development Package
ii  intel-oneapi-ispc                                1.23.0-721                                 amd64        Intel® Implicit SPMD Program Compiler
ii  intel-oneapi-ispc-1.23                           1.23.0-721                                 amd64        Intel® Implicit SPMD Program Compiler
ii  intel-oneapi-libdpstd-devel-2022.7               2022.7.1-14                                amd64        Intel® oneAPI DPC++ Library 2022.7.1 for Linux*
ii  intel-oneapi-mkl-classic-devel-2025.0            2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* classic development package for Intel(R) 64
ii  intel-oneapi-mkl-classic-include-2025.0          2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* classic include package for Intel(R) 64
ii  intel-oneapi-mkl-cluster-2025.0                  2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* cluster package for Intel(R) 64
ii  intel-oneapi-mkl-cluster-devel-2025.0            2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* cluster development package for Intel(R) 64
ii  intel-oneapi-mkl-core-2025.0                     2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* core package for Intel(R) 64
ii  intel-oneapi-mkl-core-devel-2025.0               2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* core development package for Intel(R) 64
ii  intel-oneapi-mkl-devel                           2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* development package for Intel(R) 64
ii  intel-oneapi-mkl-devel-2025.0                    2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* development package for Intel(R) 64
ii  intel-oneapi-mkl-sycl                            2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* package for Intel(R) 64
ii  intel-oneapi-mkl-sycl-2025.0                     2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* package for Intel(R) 64
ii  intel-oneapi-mkl-sycl-blas                       2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* BLAS
ii  intel-oneapi-mkl-sycl-blas-2025.0                2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* BLAS
ii  intel-oneapi-mkl-sycl-data-fitting-2025.0        2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* data fitting
ii  intel-oneapi-mkl-sycl-devel                      2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* development package for Intel(R) 64
ii  intel-oneapi-mkl-sycl-devel-2025.0               2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* development package for Intel(R) 64
ii  intel-oneapi-mkl-sycl-dft-2025.0                 2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* DFT
ii  intel-oneapi-mkl-sycl-include-2025.0             2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* include package for Intel(R) 64
ii  intel-oneapi-mkl-sycl-lapack-2025.0              2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* LAPACK
ii  intel-oneapi-mkl-sycl-rng-2025.0                 2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* random number generator functions (RNG)
ii  intel-oneapi-mkl-sycl-sparse-2025.0              2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* sparse solvers
ii  intel-oneapi-mkl-sycl-stats-2025.0               2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* summary statistics
ii  intel-oneapi-mkl-sycl-vm-2025.0                  2025.0.1-14                                amd64        Intel® oneAPI Math Kernel Library 2025.0.1 for Linux* SYCL* vector math
ii  intel-oneapi-mpi-2021.12                         2021.12.1-5                                amd64        Intel® MPI Library Runtime Environment
ii  intel-oneapi-mpi-2021.14                         2021.14.1-5                                amd64        Intel® MPI Library Runtime Environment
ii  intel-oneapi-mpi-devel                           2021.14.1-5                                amd64        Intel® MPI Library
ii  intel-oneapi-mpi-devel-2021.14                   2021.14.1-5                                amd64        Intel® MPI Library
ii  intel-oneapi-oidn-2.2                            2.2.0-691                                  amd64        Intel® Open Image Denoise
ii  intel-oneapi-openmp                              2025.0.4-1519                              amd64        Intel® OpenMP* Runtime Library 2025.0.4 for Linux* for Intel(R) 64
ii  intel-oneapi-openmp-2024.1                       2024.1.2-995                               amd64        Intel® OpenMP* Runtime Library 2024.1.2 for Linux* for Intel(R) 64
ii  intel-oneapi-openmp-2025.0                       2025.0.4-1519                              amd64        Intel® OpenMP* Runtime Library 2025.0.4 for Linux* for Intel(R) 64
ii  intel-oneapi-openmp-common-2024.1                2024.1.2-995                               all          Intel® OpenMP* Runtime Library 2024.1.2 for Linux*
ii  intel-oneapi-openmp-common-2025.0                2025.0.4-1519                              all          Intel® OpenMP* Runtime Library 2025.0.4 for Linux*
ii  intel-oneapi-openpgl                             0.6.0-722                                  amd64        Intel® Open Path Guiding Library
ii  intel-oneapi-openpgl-0.6                         0.6.0-722                                  amd64        Intel® Open Path Guiding Library
ii  intel-oneapi-openvkl                             2.0.1-725                                  amd64        Intel® Open Volume Kernel Library
ii  intel-oneapi-openvkl-2.0                         2.0.1-725                                  amd64        Intel® Open Volume Kernel Library
ii  intel-oneapi-ospray-3.1                          3.1.0-731                                  amd64        Intel® OSPRay
ii  intel-oneapi-ospray-studio                       1.0.0-702                                  amd64        Intel® OSPRay Studio
ii  intel-oneapi-ospray-studio-1.0                   1.0.0-702                                  amd64        Intel® OSPRay Studio
ii  intel-oneapi-rkcommon-1.13                       1.13.0-702                                 amd64        Rendering toolkit common
ii  intel-oneapi-rkutil                              1.10.0-504                                 amd64        Rendering Toolkit Utilities
ii  intel-oneapi-rkutil-1.10                         1.10.0-504                                 amd64        Rendering Toolkit Utilities
ii  intel-oneapi-runtime-compilers                   2025.0.4-1519                              amd64        Intel® oneAPI DPC++/C++ Compiler runtime common files
ii  intel-oneapi-runtime-compilers-common            2025.0.4-1519                              all          Intel® oneAPI DPC++/C++ Compiler runtime common files
ii  intel-oneapi-runtime-dnnl                        2025.0.1-6                                 amd64        Intel® oneAPI Deep Neural Network Library runtime
ii  intel-oneapi-runtime-dpcpp-cpp                   2025.0.4-1519                              amd64        Intel® oneAPI DPC++/C++ Compiler runtime
ii  intel-oneapi-runtime-dpcpp-cpp-common            2025.0.4-1519                              all          Intel® oneAPI DPC++/C++ Compiler runtime
ii  intel-oneapi-runtime-dpcpp-sycl-core             2025.0.4-1519                              all          Intel® oneAPI DPC++/C++ Compiler SYCL* Runtime Core
ii  intel-oneapi-runtime-dpcpp-sycl-gpu-rt           2025.0.4-1519                              all          Intel® oneAPI DPC++/C++ Compiler SYCL* GPU Runtime
ii  intel-oneapi-runtime-dpcpp-sycl-opencl-cpu       2025.0.4-1519                              amd64        Intel® CPU Runtime for OpenCL(TM) Applications runtime
ii  intel-oneapi-runtime-opencl                      2025.0.4-1519                              amd64        Intel® CPU Runtime for OpenCL(TM) Applications runtime
ii  intel-oneapi-runtime-openmp                      2025.0.4-1519                              amd64        Intel® OpenMP* Runtime Library runtime
ii  intel-oneapi-runtime-openmp-opencl-shared        2025.0.4-1519                              amd64        Intel(R) OpenMP and OpenCL shared files for runtime package
ii  intel-oneapi-runtime-tbb                         2022.0.0-402                               amd64        Intel® oneAPI Threading Building Blocks runtime
ii  intel-oneapi-runtime-tbb-common                  2021.10.0-49541                            all          Intel® oneAPI Threading Building Blocks runtime common
ii  intel-oneapi-runtime-tcm                         1.2.0-589                                  amd64        Thread Composability Manager
ii  intel-oneapi-tbb                                 2022.0.0-402                               amd64        Intel® oneAPI Threading Building Blocks
ii  intel-oneapi-tbb-2021.12                         2021.12.0-495                              amd64        Intel® oneAPI Threading Building Blocks
ii  intel-oneapi-tbb-2022.0                          2022.0.0-402                               amd64        Intel® oneAPI Threading Building Blocks
ii  intel-oneapi-tbb-common-2021.12                  2021.12.0-495                              all          Intel® oneAPI Threading Building Blocks common
ii  intel-oneapi-tbb-common-devel-2021.12            2021.12.0-495                              all          Intel® oneAPI Threading Building Blocks common
ii  intel-oneapi-tbb-devel                           2022.0.0-402                               amd64        Intel® oneAPI Threading Building Blocks Development Package
ii  intel-oneapi-tbb-devel-2021.12                   2021.12.0-495                              amd64        Intel® oneAPI Threading Building Blocks Development Package
ii  intel-oneapi-tbb-devel-2022.0                    2022.0.0-402                               amd64        Intel® oneAPI Threading Building Blocks Development Package
ii  intel-oneapi-tcm-1.0                             1.0.1-175                                  amd64        Thread Composability Manager
ii  intel-oneapi-tcm-1.2                             1.2.0-589                                  amd64        Thread Composability Manager
ii  intel-oneapi-tlt                                 2025.0.0-550                               amd64        Toolkit Linking Tool
ii  intel-oneapi-tlt-2024.1                          2024.1.0-447                               amd64        Toolkit Linking Tool
ii  intel-oneapi-tlt-2025.0                          2025.0.0-550                               amd64        Toolkit Linking Tool
ii  intel-oneapi-umf-0.9                             0.9.1-6                                    amd64        Unified Memory Framework
ii  intel-oneapi-vtune                               2025.0.1-14                                amd64        Intel® VTune(TM) Profiler

@HeyItsBATMAN
Copy link

Similar issue on Arch Linux, also with intel-oneapi-basekit 2025.0.1.

GPU is Battlemage B580

./build/bin/sd --diffusion-model /mnt/hdd1/sd-models/flux1-dev-q4_k.gguf --vae /mnt/hdd1/sd-models/ae.safetensors --clip_l /mnt/hdd1/sd-models/clip_l.safetensors --t5xxl /mnt/hdd1/sd-models/t5xxl_fp16.safetensors -p "a lovely cat holding a sign says 'Battlemage B580'" --cfg-scale 1.0 --sampling-method euler_a -v
Option: 
    n_threads:         8
    mode:              txt2img
    model_path:        
    wtype:             unspecified
    clip_l_path:       /mnt/hdd1/sd-models/clip_l.safetensors
    clip_g_path:       
    t5xxl_path:        /mnt/hdd1/sd-models/t5xxl_fp16.safetensors
    diffusion_model_path:   /mnt/hdd1/sd-models/flux1-dev-q4_k.gguf
    vae_path:          /mnt/hdd1/sd-models/ae.safetensors
    taesd_path:        
    esrgan_path:       
    controlnet_path:   
    embeddings_path:   
    stacked_id_embeddings_path:   
    input_id_images_path:   
    style ratio:       20.00
    normalize input image :  false
    output_path:       output.png
    init_img:          
    mask_img:          
    control_image:     
    clip on cpu:       false
    controlnet cpu:    false
    vae decoder on cpu:false
    diffusion flash attention:false
    strength(control): 0.90
    prompt:            a lovely cat holding a sign says 'Battlemage B580'
    negative_prompt:   
    min_cfg:           1.00
    cfg_scale:         1.00
    slg_scale:         0.00
    guidance:          3.50
    clip_skip:         -1
    width:             512
    height:            512
    sample_method:     euler_a
    schedule:          default
    sample_steps:      20
    strength(img2img): 0.75
    rng:               cuda
    seed:              42
    batch_count:       1
    vae_tiling:        false
    upscale_repeats:   1
System Info: 
    SSE3 = 1
    AVX = 1
    AVX2 = 1
    AVX512 = 0
    AVX512_VBMI = 0
    AVX512_VNNI = 0
    FMA = 1
    NEON = 0
    ARM_FMA = 0
    F16C = 1
    FP16_VA = 0
    WASM_SIMD = 0
    VSX = 0
[DEBUG] stable-diffusion.cpp:181  - Using SYCL backend
[SYCL] call ggml_check_sycl
ggml_check_sycl: GGML_SYCL_DEBUG: 0
ggml_check_sycl: GGML_SYCL_F16: no
found 1 SYCL devices:
|  |                   |                                       |       |Max    |        |Max  |Global |                     |
|  |                   |                                       |       |compute|Max work|sub  |mem    |                     |
|ID|        Device Type|                                   Name|Version|units  |group   |group|size   |       Driver version|
|--|-------------------|---------------------------------------|-------|-------|--------|-----|-------|---------------------|
| 0| [level_zero:gpu:0]|                Intel Graphics [0xe20b]|   20.1|    160|    1024|   32| 12168M|            1.6.31907|
ggml_sycl_init: GGML_SYCL_FORCE_MMQ:   no
ggml_sycl_init: SYCL_USE_XMX: yes
ggml_sycl_init: found 1 SYCL devices:
[INFO ] stable-diffusion.cpp:202  - loading clip_l from '/mnt/hdd1/sd-models/clip_l.safetensors'
[INFO ] model.cpp:888  - load /mnt/hdd1/sd-models/clip_l.safetensors using safetensors format
[DEBUG] model.cpp:959  - init from '/mnt/hdd1/sd-models/clip_l.safetensors'
[INFO ] stable-diffusion.cpp:216  - loading t5xxl from '/mnt/hdd1/sd-models/t5xxl_fp16.safetensors'
[INFO ] model.cpp:888  - load /mnt/hdd1/sd-models/t5xxl_fp16.safetensors using safetensors format
[DEBUG] model.cpp:959  - init from '/mnt/hdd1/sd-models/t5xxl_fp16.safetensors'
[INFO ] stable-diffusion.cpp:223  - loading diffusion model from '/mnt/hdd1/sd-models/flux1-dev-q4_k.gguf'
[INFO ] model.cpp:885  - load /mnt/hdd1/sd-models/flux1-dev-q4_k.gguf using gguf format
[DEBUG] model.cpp:902  - init from '/mnt/hdd1/sd-models/flux1-dev-q4_k.gguf'
[INFO ] stable-diffusion.cpp:230  - loading vae from '/mnt/hdd1/sd-models/ae.safetensors'
[INFO ] model.cpp:888  - load /mnt/hdd1/sd-models/ae.safetensors using safetensors format
[DEBUG] model.cpp:959  - init from '/mnt/hdd1/sd-models/ae.safetensors'
[INFO ] stable-diffusion.cpp:242  - Version: Flux 
[INFO ] stable-diffusion.cpp:275  - Weight type:                 f16
[INFO ] stable-diffusion.cpp:276  - Conditioner weight type:     f16
[INFO ] stable-diffusion.cpp:277  - Diffusion model weight type: q4_K
[INFO ] stable-diffusion.cpp:278  - VAE weight type:             f32
[DEBUG] stable-diffusion.cpp:280  - ggml tensor size = 400 bytes
[INFO ] stable-diffusion.cpp:319  - set clip_on_cpu to true
[INFO ] stable-diffusion.cpp:322  - CLIP: Using CPU backend
[DEBUG] clip.hpp:171  - vocab size: 49408
[DEBUG] clip.hpp:182  -  trigger word img already in vocab
[INFO ] flux.hpp:889  - Flux blocks: 19 double, 38 single
[DEBUG] ggml_extend.hpp:1111 - clip params backend buffer size =  235.06 MB(RAM) (196 tensors)
[DEBUG] ggml_extend.hpp:1111 - t5 params backend buffer size =  9083.77 MB(RAM) (219 tensors)
[DEBUG] ggml_extend.hpp:1111 - flux params backend buffer size =  6604.64 MB(VRAM) (780 tensors)
[DEBUG] ggml_extend.hpp:1111 - vae params backend buffer size =  94.57 MB(VRAM) (138 tensors)
[DEBUG] stable-diffusion.cpp:417  - loading weights
[DEBUG] model.cpp:1698 - loading tensors from /mnt/hdd1/sd-models/clip_l.safetensors
  |======>                                           | 196/1440 - 0.00it/s[DEBUG] model.cpp:1698 - loading tensors from /mnt/hdd1/sd-models/t5xxl_fp16.safetensors
  |==============>                                   | 413/1440 - 0.00it/s[INFO ] model.cpp:1868 - unknown tensor 'text_encoders.t5xxl.transformer.encoder.embed_tokens.weight | f16 | 2 [4096, 32128, 1, 1, 1]' in model file
  |==============>                                   | 416/1440 - 9.43it/s[DEBUG] model.cpp:1698 - loading tensors from /mnt/hdd1/sd-models/flux1-dev-q4_k.gguf
  |=========================================>        | 1196/1440 - 50.00it/s[DEBUG] model.cpp:1698 - loading tensors from /mnt/hdd1/sd-models/ae.safetensors
  |==============================================>   | 1334/1440 - 200.00it/s[INFO ] stable-diffusion.cpp:516  - total params memory size = 16018.05MB (VRAM 6699.22MB, RAM 9318.83MB): clip 9318.83MB(RAM), unet 6604.64MB(VRAM), vae 94.57MB(VRAM), controlnet 0.00MB(VRAM), pmid 0.00MB(RAM)
[INFO ] stable-diffusion.cpp:520  - loading model from '' completed, taking 6.22s
[INFO ] stable-diffusion.cpp:537  - running in Flux FLOW mode
[DEBUG] stable-diffusion.cpp:594  - finished loaded file
[DEBUG] stable-diffusion.cpp:1535 - txt2img 512x512
[DEBUG] stable-diffusion.cpp:1230 - prompt after extract and remove lora: "a lovely cat holding a sign says 'Battlemage B580'"
[INFO ] stable-diffusion.cpp:682  - Attempting to apply 0 LoRAs
[INFO ] stable-diffusion.cpp:1235 - apply_loras completed, taking 0.00s
[DEBUG] conditioner.hpp:1027 - parse 'a lovely cat holding a sign says 'Battlemage B580'' to [['a lovely cat holding a sign says 'Battlemage B580'', 1], ]
[DEBUG] clip.hpp:311  - token length: 77
[DEBUG] t5.hpp:397  - token length: 256
[DEBUG] clip.hpp:736  - Missing text_projection matrix, assuming identity...
[DEBUG] ggml_extend.hpp:1062 - clip compute buffer size: 1.40 MB(RAM)
[DEBUG] clip.hpp:736  - Missing text_projection matrix, assuming identity...
[DEBUG] ggml_extend.hpp:1062 - t5 compute buffer size: 68.25 MB(RAM)
[DEBUG] conditioner.hpp:1142 - computing condition graph completed, taking 5611 ms
[INFO ] stable-diffusion.cpp:1368 - get_learned_condition completed, taking 5613 ms
[INFO ] stable-diffusion.cpp:1391 - sampling using Euler A method
[INFO ] stable-diffusion.cpp:1428 - generating image: 1/1 - seed 42
[DEBUG] stable-diffusion.cpp:798  - Sample
[DEBUG] ggml_extend.hpp:1062 - flux compute buffer size: 398.50 MB(VRAM)
No kernel named _ZTSZN14bin_bcast_syclIXadL_ZL6op_addffEEEclIfffEEvR25ggml_backend_sycl_contextPK11ggml_tensorS6_PS4_PKT_PKT0_PT1_PN4sycl3_V15queueEEUlNSH_7nd_itemILi3EEEE0_ was foundException caught at file:/mnt/hdd1/ipex/stable-diffusion.cpp/ggml/src/ggml-sycl/common.cpp, line:102

@paoletto
Copy link
Author

only version currently working for me is 0.2.1 from stable-diffusion-cpp-python

@DarwinAnim8or
Copy link

only version currently working for me is 0.2.1 from stable-diffusion-cpp-python

Thanks for the heads-up! This one actually works :D

@sss123next
Copy link

this looks like a regression somewhere, 1c168d9 works

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants