[Feature] combine audio2text with asr service #905

lianhao · 2024-11-15T08:41:59Z

Priority

Undecided

OS type

Ubuntu

Hardware type

Xeon-GNR

Running nodes

Single Node

Description

The audio2text service is almost the same as the asr service, except for the returned data object type. We should combine both of them to minimize maintenance effort

Spycsh · 2024-11-20T09:07:08Z

Hi @lianhao due to a legacy issue huggingface/tokenizers#187, the first implementation of these ASR/TTS code is to split the microservice start and the model initialization to different places. Basically it is because our microservice are initialized in the other process using register_microservice and the huggingface tokenizer just fail with that case.

It is a bit annoying before I finally get some solutions to avoid such situations. I fully agree we should combine these two services together and remove the wrapper, same to tts service. I am starting to look into this together with the OpenAI format compatible feature.

Spycsh self-assigned this Nov 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] combine audio2text with asr service #905

[Feature] combine audio2text with asr service #905

lianhao commented Nov 15, 2024

Spycsh commented Nov 20, 2024

[Feature] combine audio2text with asr service #905

[Feature] combine audio2text with asr service #905

Comments

lianhao commented Nov 15, 2024

Priority

OS type

Hardware type

Running nodes

Description

Spycsh commented Nov 20, 2024