FunASR Version
1.3.5
Python Version
3.10.12
Device
cuda (GPU)
Bug Description
使用python demo_vllm.py --input zh.mp3 --language 中文 --tensor-parallel-size 1 --model-dir /app/models/Fun-ASR-Nano-2512 --dtype fp16 --gpu-memory-utilization 0.5执行结果为空
Steps to Reproduce
python demo_vllm.py --input zh.mp3 --language 中文 --tensor-parallel-size 1 --model-dir /app/models/Fun-ASR-Nano-2512 --dtype fp16 --gpu-memory-utilization 0.5
gpu:
Driver Version: 570.158.01
Tesla V100-SXM2-32GB
python虚拟环境:
vllm==0.12.0
funasr==1.3.5
Package Version
--------------------------------- -------------
aiohappyeyeballs 2.6.2
aiohttp 3.13.5
aiosignal 1.4.0
aliyun-python-sdk-core 2.16.0
aliyun-python-sdk-kms 2.16.5
annotated-doc 0.0.4
annotated-types 0.7.0
anthropic 0.71.0
antlr4-python3-runtime 4.9.3
anyio 4.13.0
apache-tvm-ffi 0.1.11
astor 0.8.1
async-timeout 5.0.1
attrs 26.1.0
audioread 3.1.0
backports.strenum 1.3.1
blake3 1.0.8
cachetools 7.1.4
cbor2 6.1.1
certifi 2026.5.20
cffi 2.0.0
charset-normalizer 3.4.7
click 8.4.1
cloudpickle 3.1.2
compressed-tensors 0.12.2
crcmod 1.7
cryptography 48.0.0
cuda-bindings 13.3.1
cuda-core 1.0.1
cuda-pathfinder 1.5.5
cuda-python 13.3.1
cupy-cuda12x 14.1.1
decorator 5.3.1
depyf 0.20.0
detect-installer 0.1.0
dill 0.4.1
diskcache 5.6.3
distro 1.9.0
dnspython 2.8.0
docstring_parser 0.18.0
editdistance 0.8.1
einops 0.8.2
email-validator 2.3.0
exceptiongroup 1.3.1
fastapi 0.136.3
fastapi-cli 0.0.24
fastapi-cloud-cli 0.19.0
fastar 0.11.0
filelock 3.29.0
flashinfer-python 0.5.3
frozenlist 1.8.0
fsspec 2026.4.0
funasr 1.3.5
gguf 0.19.0
h11 0.16.0
hf-xet 1.5.0
httpcore 1.0.9
httptools 0.8.0
httpx 0.28.1
huggingface_hub 0.36.2
hydra-core 1.3.2
idna 3.17
interegular 0.3.3
jaconv 0.5.0
jamo 0.4.1
jieba 0.42.1
Jinja2 3.1.6
jiter 0.15.0
jmespath 0.10.0
joblib 1.5.3
jsonschema 4.26.0
jsonschema-specifications 2025.9.1
kaldiio 2.18.1
lark 1.2.2
lazy-loader 0.5
librosa 0.11.0
llguidance 1.3.0
llvmlite 0.44.0
lm-format-enforcer 0.11.3
loguru 0.7.3
markdown-it-py 4.2.0
MarkupSafe 3.0.3
mdurl 0.1.2
mistral_common 1.11.2
model-hosting-container-standards 0.1.15
modelscope 1.37.1
mpmath 1.3.0
msgpack 1.1.2
msgspec 0.21.1
multidict 6.7.1
networkx 3.4.2
ninja 1.13.0
numba 0.61.2
numpy 2.2.6
nvidia-cublas-cu12 12.8.4.1
nvidia-cuda-cupti-cu12 12.8.90
nvidia-cuda-nvrtc-cu12 12.8.93
nvidia-cuda-runtime-cu12 12.8.90
nvidia-cudnn-cu12 9.10.2.21
nvidia-cudnn-frontend 1.24.0
nvidia-cufft-cu12 11.3.3.83
nvidia-cufile-cu12 1.13.1.3
nvidia-curand-cu12 10.3.9.90
nvidia-cusolver-cu12 11.7.3.90
nvidia-cusparse-cu12 12.5.8.93
nvidia-cusparselt-cu12 0.7.1
nvidia-cutlass-dsl 4.5.2
nvidia-cutlass-dsl-libs-base 4.5.2
nvidia-ml-py 13.595.45
nvidia-nccl-cu12 2.27.5
nvidia-nvjitlink-cu12 12.8.93
nvidia-nvshmem-cu12 3.3.20
nvidia-nvtx-cu12 12.8.90
omegaconf 2.3.0
openai 2.38.0
openai-harmony 0.0.8
opencv-python-headless 4.13.0.92
oss2 2.19.1
outlines_core 0.2.11
packaging 26.2
partial-json-parser 0.2.1.1.post7
pillow 12.2.0
pip 22.0.2
platformdirs 4.10.0
pooch 1.9.0
prometheus_client 0.25.0
prometheus-fastapi-instrumentator 8.0.0
propcache 0.5.2
protobuf 7.35.0
psutil 7.2.2
py-cpuinfo 9.0.0
pybase64 1.4.3
pycountry 26.2.16
pycparser 3.0
pycryptodome 3.23.0
pydantic 2.13.4
pydantic_core 2.46.4
pydantic-extra-types 2.11.1
pydantic-settings 2.14.1
Pygments 2.20.0
pynndescent 0.6.0
python-dotenv 1.2.2
python-json-logger 4.1.0
python-multipart 0.0.30
pytorch-wpe 0.0.1
PyYAML 6.0.3
pyzmq 27.1.0
ray 2.55.1
referencing 0.37.0
regex 2026.5.9
requests 2.34.2
rich 15.0.0
rich-toolkit 0.19.10
rignore 0.7.6
rpds-py 0.30.0
safetensors 0.7.0
scikit-learn 1.7.2
scipy 1.15.3
sentencepiece 0.2.1
sentry-sdk 2.61.1
setproctitle 1.3.7
setuptools 59.6.0
shellingham 1.5.4
six 1.17.0
sniffio 1.3.1
soundfile 0.13.1
soxr 1.1.0
starlette 1.2.1
supervisor 4.3.0
sympy 1.14.0
tabulate 0.10.0
tensorboardX 2.6.5
threadpoolctl 3.6.0
tiktoken 0.13.0
tokenizers 0.22.2
tomli 2.4.1
torch 2.9.0
torch-complex 0.4.4
torchaudio 2.9.0
torchvision 0.24.0
tqdm 4.67.3
transformers 4.57.6
triton 3.5.0
typer 0.26.4
typing_extensions 4.15.0
typing-inspection 0.4.2
umap-learn 0.5.12
urllib3 2.7.0
uvicorn 0.48.0
uvloop 0.22.1
vllm 0.12.0
watchfiles 1.2.0
websockets 16.0
xgrammar 0.1.27
yarl 1.24.2
Error Message / Traceback
FunASR Version
1.3.5
Python Version
3.10.12
Device
cuda (GPU)
Bug Description
使用python demo_vllm.py --input zh.mp3 --language 中文 --tensor-parallel-size 1 --model-dir /app/models/Fun-ASR-Nano-2512 --dtype fp16 --gpu-memory-utilization 0.5执行结果为空
Steps to Reproduce
Error Message / Traceback