-
Notifications
You must be signed in to change notification settings - Fork 92
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: GPU_HUNG when both encoder and decoder #288
Comments
Kernel version 5.15 is too old for 12th Gen. https://github.com/intel/media-driver#known-issues-and-limitations |
I used another cpu(12th Gen Intel(R) Core(TM) i9-12900H) cmd: /opt/intel/media/share/vpl/samples/_bin/sample_decode h265 -i v3_1080i5994.h265 -o /dev/null -timeout 10000 Just decoding the H265 file encoded by Amba H2(v3_1080i5994.h265) platform will show the error(no encode at the same time) MFX_ERR_DEVICE_FAILED(-17). please see the log Decoding started Also I can decode H265 files normally when I use intel msdk encoding(v2_500k_1080i5994.h265) |
No issue with ffmpeg qsv decoder (built with onevpl). I think it should be a sample_decode issue.
|
Using the bitstream filter
|
Hi @nyanmisaka Thanks for your help. I did some test and have more infomation about this issue. |
Hi @nyanmisaka I saw your libva version is libva info: Found init function __vaDriverInit_1_19 |
Libva version is not related to this issue. I'm testing the latest tag intel-onevpl-23.3.0. I'm not from intel and probably can't help you fix this. |
Hi Dave. |
Q1:Decode v3_1080i5994.h265 + Encode cnn.yuv |
Hi @chenhao5-Intel |
You mean you can reproduce the encode hang issue: "[ERROR], sts=MFX_ERR_GPU_HANG(-21), SynchronizeFirstTask, SyncOperation fail or timeout at /opt/src/vpl-dispatcher_src/tools/legacy/Sample_encode/src/pipeline_encode.cpp:178" on both 12900H and 12700? |
Hi @DaveHu-TVU @nyanmisaka There are two issue scenarios: (On both i7-12700 and i9-12900H)
Driver log shows no related errors reported and VPL log shows cm_mem_copy.cpp[Line: 3115]CopyVideoToSys: returns MFX_ERR_GPU_HANG. Analysis WIP.
For encode: Analyzed log and found that LibVA will report: [LIBVA]:CRITICAL - StatusReport:261: Something unexpected happened in HW, return error to application As for MFX_ERR_DEVICE_FAILED(-17), it may be a duplicate issue of GPU_HANG. If you have any question, please let me know. Thanks. BRs, |
Hi @chenhao5-Intel [amba_720p5994.zip](https://github.com/oneapi-src/ |
Hi @DaveHu-TVU and all, We have root-caused this issue. We have updated the codes and will open source it soon. To check this at your side, please test it on i9-12900H, run "export INTEL_MEDIA_RESET_WATCHDOG=0" first and then run sample app commands. There should be no issues. For Linux i7-12700, please refer to this known issue: https://community.intel.com/t5/Media-Intel-oneAPI-Video/GPU-hangs-when-decoding-2-HEVC-UHD-streams-444-10-bits-Y410/td-p/1431771 |
OK, Thanks for your help, @chenhao5-Intel |
Which component impacted?
Decode, Encode
Is it regression? Good in old configuration?
Yes, it's good in old version
What happened?
CPU: 12th Gen Intel(R) Core(TM) i7-12700
kernel: Linux tvu-desktop 5.15.0-69-generic #76~20.04.1-Ubuntu SMP Mon Mar 20 15:54:19 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
vpl: 2023Q1(https://github.com/oneapi-src/oneVPL-intel-gpu/releases/tag/intel-onevpl-23.1.5)
Reproduction steps:
console1:
/opt/intel/media/share/vpl/samples/_bin/sample_decode h265 -i v3_1080i5994.h265 -o /dev/null -timeout 10000
console2:/opt/intel/media/share/vpl/samples/_bin/sample_encode h264 -i cnn.yuv -o /dev/null -w 1920 -h 1080 -timeout 10000 -nv12
[ERROR], sts=MFX_ERR_GPU_HANG(-21), SynchronizeFirstTask, SyncOperation fail or timeout at /opt/src/vpl-dispatcher_src/tools/legacy/Sample_encode/src/pipeline_encode.cpp:178
[ERROR], sts=MFX_ERR_GPU_HANG(-21), GetFreeTask, m_TaskPool.SynchronizeFirstTask failed at /opt/src/vpl-dispatcher_src/tools/legacy/Sample_encode/src/pipeline_encode.cpp:2239
[ERROR], sts=MFX_ERR_GPU_HANG(-21), Run, m_pmfxENC->EncodeFrameAsync failed at /opt/src/vpl-dispatcher_src/tools/legacy/Sample_encode/src/pipeline_encode.cpp:2487
[ERROR], sts=MFX_ERR_GPU_HANG(-21), main, pPipeline->Run failed at /opt/src/vpl-dispatcher_src/tools/legacy/Sample_encode/src/Sample_encode.cpp:1970
What's the usage scenario when you are seeing the problem?
Immersive Media
What impacted?
After testing, we found that:
When decoding H264/H265 encoded by intel msdk or vpl and encoding at the same time, it can work;
When decoding H265 encoded by our other platform (Amba H2) and encoding at the same time, it is easy to have GPU_HUNG
v3_1080i5994.zip
Debug Information
Do you want to contribute a patch to fix the issue?
Yes, I'm glad to submit a patch to fix it
The text was updated successfully, but these errors were encountered: