Here's the embedding code:
```python
from optimum.onnxruntime import ORTModelForFeatureExtraction
from transformers import AutoModel, AutoTokenizer
import numpy as np

model_ort = ORTModelForFeatureExtraction.from_pretrained(
    'BAAI/bge-small-en-v1.5', file_name="onnx/model.onnx")
tokenizer = AutoTokenizer.from_pretrained('BAAI/bge-small-en-v1.5')
model = AutoModel.from_pretrained('BAAI/bge-small-en-v1.5')
...
inputs = tokenizer(documents, padding=True, truncation=True,
                   return_tensors='pt', max_length=512)
embeddings = model(**inputs)[0][:, 0].detach().numpy()
```
It works, but only on CPU. When I tried `.to("mps")`, it didn't work.
How can I use MPS for this scenario?
Thanks
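For reference, a minimal sketch of the plain-PyTorch path on MPS (assuming a torch build where `torch.backends.mps.is_available()` is `True`). Both the model and the tokenized inputs have to be moved to the device; note that the `ORTModelForFeatureExtraction` wrapper is a separate code path and is likely what rejected `.to("mps")`:

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Fall back to CPU when MPS is not available in this torch build.
device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")

tokenizer = AutoTokenizer.from_pretrained('BAAI/bge-small-en-v1.5')
model = AutoModel.from_pretrained('BAAI/bge-small-en-v1.5').to(device)

documents = ["example sentence"]  # placeholder input
# Move the tokenized batch to the same device as the model.
inputs = tokenizer(documents, padding=True, truncation=True,
                   return_tensors='pt', max_length=512).to(device)
with torch.no_grad():
    # CLS pooling, as in the snippet above; copy back to CPU before numpy.
    embeddings = model(**inputs)[0][:, 0].cpu().numpy()
```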
Use the official onnxruntime; this repo is outdated and can be archived.
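For anyone landing here later, a sketch of that suggestion using the onnxruntime package directly. Assumptions: `CoreMLExecutionProvider` (onnxruntime's Apple-accelerator backend) is compiled into your onnxruntime build (check `ort.get_available_providers()`), and the exported graph takes the standard BERT-style inputs, which is why the feed is built from `session.get_inputs()` rather than hard-coded names:

```python
import onnxruntime as ort
from huggingface_hub import hf_hub_download
from transformers import AutoTokenizer

# Fetch the same ONNX file the snippet above loads through optimum.
model_path = hf_hub_download('BAAI/bge-small-en-v1.5', 'onnx/model.onnx')

# onnxruntime falls back to the next provider in the list if CoreML is unavailable.
session = ort.InferenceSession(
    model_path,
    providers=["CoreMLExecutionProvider", "CPUExecutionProvider"],
)

tokenizer = AutoTokenizer.from_pretrained('BAAI/bge-small-en-v1.5')
documents = ["example sentence"]  # placeholder input
inputs = tokenizer(documents, padding=True, truncation=True,
                   max_length=512, return_tensors='np')

# Feed only the inputs the exported graph actually declares.
feed = {i.name: inputs[i.name] for i in session.get_inputs()}
last_hidden_state = session.run(None, feed)[0]
embeddings = last_hidden_state[:, 0]  # CLS pooling, as in the original snippet
```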
@henryruhs I see, thanks