Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

symbol lookup error: /kubeshare/library/libgemhook.so.1 #26

Open
fung-hei-man opened this issue Jul 9, 2023 · 0 comments
Open

symbol lookup error: /kubeshare/library/libgemhook.so.1 #26

fung-hei-man opened this issue Jul 9, 2023 · 0 comments

Comments

@fung-hei-man
Copy link

Hello!

I have installed KubeShare to my K8s cluster, tested with a simple pod specification.

apiVersion: v1
kind: Pod
metadata:
  name: test-kubeshare
	namespace: core
  labels:
    "sharedgpu/gpu_request": "0.1"
    "sharedgpu/gpu_limit": "0.2"
spec:
  schedulerName: kubeshare-scheduler
  containers:
    - name: pytorch
      image:  riyazhu/mnist:20220420
      imagePullPolicy: IfNotPresent

The result is positive.

GPU 0: NVIDIA GeForce RTX 3050 Ti Laptop GPU (UUID: GPU-xxx)

However, when I tried to deploy my own Docker image, following error occurs:

/bin/bash: symbol lookup error: /kubeshare/library/libgemhook.so.1: undefined symbol: __libc_dlopen_mode, version GLIBC_PRIVATE

Not sure if this information helps but my base image is nvidia/cuda:12.1.1-cudnn8-devel-ubuntu22.04 and I am compiling OpenCV for object detection using GPU. The image works fine when using the default-schduler.

Any suggestions on this issue? Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant