|
|
发表于 2026-5-9 15:59:38
|
显示全部楼层
大佬,我想咨询下,我使用heygem做实时对口型功能,音频特征提取和推理容易报错如下;第二个问题:我又启动了indextts2进行语音生成,当语音生成时,推理的速度骤降,这个是显卡性能有问题吗?我是4060TI显卡。
ffmpeg-info: encoder : Lavc60.16.100 rawvideo
E:\AI\heygem-win-50\wenet\compute_ctc_att_bnf.py:116: FutureWarning: `torch.cuda.amp.autocast(args...)` is deprecated. Please use `torch.amp.autocast('cuda', args...)` instead.
with autocast(enabled=(fp16 and device == 'cuda')):
Exception in thread Thread-13 (_script_loop):
Traceback (most recent call last):
File "E:\AI\heygem-win-50\env\Lib\threading.py", line 1016, in _bootstrap_inner
self.run()
File "E:\AI\heygem-win-50\env\Lib\threading.py", line 953, in run
self._target(*self._args, **self._kwargs)
File "E:\AI\heygem-win-50\service\live_manager.py", line 257, in _script_loop
self.video_streamer.add_audio_to_queue(script)
File "E:\AI\heygem-win-50\service\trans_dh_service.py", line 268, in add_audio_to_queue
audio_wenet_feature = get_aud_feat1(wav_fragment=audio_path, fps=fps, wenet_model=self.wenet_model)
File "E:\AI\heygem-win-50\service\trans_dh_service.py", line 88, in get_aud_feat1
return feature_extraction_wenet(wav_fragment, fps, wenet_model)
File "E:\AI\heygem-win-50\service\trans_dh_service.py", line 51, in feature_extraction_wenet
f_wenet_all = get_weget(audio_file, wenet_model, section) # shape (T, D)
File "E:\AI\heygem-win-50\wenet\compute_ctc_att_bnf.py", line 172, in get_weget
return _compute_internal(wav_arr, wenet_model_and_configs, section, fp16)
File "E:\AI\heygem-win-50\wenet\compute_ctc_att_bnf.py", line 142, in _compute_internal
bnf = ppg_model(wav_tensor, wav_length)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\module.py", line 1739, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\module.py", line 1750, in _call_impl
return forward_call(*args, **kwargs)
File "E:\AI\heygem-win-50\wenet\compute_ctc_att_bnf.py", line 57, in forward
encoder_out, _ = self.encoder(feats, feats_lengths)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\module.py", line 1739, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\module.py", line 1750, in _call_impl
return forward_call(*args, **kwargs)
File "E:\AI\heygem-win-50\wenet\transformer\encoder.py", line 118, in forward
(xs, pos_emb, masks) = self.embed(xs, masks)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\module.py", line 1739, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\module.py", line 1750, in _call_impl
return forward_call(*args, **kwargs)
File "E:\work\heygem-win-50\wenet\transformer\subsampling.py", line 118, in forward
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\module.py", line 1739, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\module.py", line 1750, in _call_impl
return forward_call(*args, **kwargs)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\container.py", line 250, in forward
input = module(input)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\module.py", line 1739, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\module.py", line 1750, in _call_impl
return forward_call(*args, **kwargs)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\conv.py", line 554, in forward
return self._conv_forward(input, self.weight, self.bias)
File "E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\modules\conv.py", line 549, in _conv_forward
return F.conv2d(
RuntimeError: CUDA error: operation not permitted when stream is capturing
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
E:\work\heygem-win-50\landmark2face_wy\digitalhuman_interface.py:295: UserWarning: Creating a tensor from a list of numpy.ndarrays is extremely slow. Please consider converting the list to a single numpy.ndarray with numpy.array() before converting to a tensor. (Triggered internally at C:\actions-runner\_work\pytorch\pytorch\pytorch\torch\csrc\utils\tensor_new.cpp:257.)
E:\AI\heygem-win-50\env\Lib\site-packages\torch\nn\functional.py:5015: UserWarning: Default grid_sample and affine_grid behavior has changed to align_corners=False since 1.3.0. Please specify align_corners=True if the old behavior is desired. See the documentation of grid_sample for details.
warnings.warn(
[3e482e33-378a-44bd-90e8-77d28eb94e59] 显示帧率: 1 fps
|
|