Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Live DAI DASH streaming freezing with multi periods and no closed caption Ad. #1863

Closed
1 task done
linhai326 opened this issue Nov 7, 2024 · 7 comments
Closed
1 task done
Assignees

Comments

@linhai326
Copy link

linhai326 commented Nov 7, 2024

Version

Media3 1.4.0 and beyond.

More version details

Issue doesn't happen in 1.3.1. and before

Devices that reproduce the issue

issue can be reproduced on all Android devices we tested: amazon FireTV stick 4K, for example.

Devices that do not reproduce the issue

No response

Reproducible in the demo app?

Yes

Reproduction steps

  • Launch exoplayer demo app
  • choose the test content (sent email to android-media-github@google.com)
  • Turn on CC: Click "SELECT TRACKS" --> TEXT --> "Unknown"
  • Streaming for a couple of minutes

Expected result

Stream should play without issue

Actual result

streaming will be freezing in a couple of minutes.

Media

will send content in email.

Bug Report

@linhai326
Copy link
Author

linhai326 commented Nov 7, 2024

We believe this bug was introduced in 1.4.0 when removing "IsDecodeOnly" commits:

0f42dd4
6e0f8e3

Particularly, it seems this is the bug, in CeaDecoder.java, line 88, in queueInputBuffer():
https://github.com/androidx/media/blob/release/libraries/extractor/src/main/java/androidx/media3/extractor/text/cea/CeaDecoder.java
Screenshot 2024-11-07 at 12 54 09 PM

there seems should be a check for ceaInputBuffer.timeUs != C.TIME_END_OF_SOURCE when discarding the ceaInputBuffer.
Without this check, the playback could be stuck due to the text render is keeping waiting for end of source.

@linhai326
Copy link
Author

any update?

@icbaker icbaker assigned icbaker and unassigned marcbaechinger Nov 12, 2024
@icbaker
Copy link
Collaborator

icbaker commented Nov 12, 2024

I think I was able to repro the issue using the media links provided over email. I then added a single log line and went to retry, and now the content is consistently 503ing.

Could you please check to make sure the content is still accessible?

@icbaker
Copy link
Collaborator

icbaker commented Nov 12, 2024

The content seems to be working again now.

I've tried with the following code in CeaDecoder and playback no longer seems to hang:

if (ceaInputBuffer.timeUs != C.TIME_END_OF_SOURCE
    && outputStartTimeUs != C.TIME_UNSET
    && ceaInputBuffer.timeUs < outputStartTimeUs) {
  // We can start decoding anywhere in CEA formats, so discarding on the input side is fine.
  releaseInputBuffer(ceaInputBuffer);
} else {

Thanks for reporting! I'll work on getting this fix reviewed internally and merged.

I've also noticed that the subtitles in your content are mixed up/garbled on main (and likely will be in 1.5.0-rc01 too though I haven't checked) - I'm looking into this as well, as it's likely related to some changes after 1.4.1 (we've seen and fixed similar symptoms recently, but this must be a case we haven't fixed yet).

copybara-service bot pushed a commit that referenced this issue Nov 12, 2024
The behaviour was changed in 1.4.0 with 0f42dd4,
so that the buffer timestamp is compared to `outputStartTimeUs` when
deciding whether to discard a "decode only" buffer before decoding
(instead of the deprecated/removed `isDecodeOnly` property). This breaks
when the buffer timestamp is `TIME_END_OF_SOURCE` (which is
`Long.MIN_VALUE`), because `TIME_END_OF_SOURCE < outputStartTimeUs` is
always true, so the end-of-stream buffer is never passed to the decoder
and on to `TextRenderer` where it is used to
[set `inputStreamEnded = true`](https://github.com/androidx/media/blob/40f187e4b47c445af5049a5a49ee4633ce4c1a76/libraries/exoplayer/src/main/java/androidx/media3/exoplayer/text/TextRenderer.java#L434-L436)
and so playback hangs.

#cherrypick

Issue: #1863
PiperOrigin-RevId: 695767247
@icbaker
Copy link
Collaborator

icbaker commented Nov 12, 2024

The fix for the CeaDecoder is submitted and linked above.

I also investigated the garbled captions. The problem is due to your MP4 container marking every video sample as a keyframe, meaning that we flush our reordering queue of CEA-608 samples here on every sample (which means it can't re-order anything):

if (sampleIsKeyFrameOrEndOfStream) {
reorderingSeiMessageQueue.flush();
}

This MP4 metadata is inconsistent with the H.264 bytes contained in each sample (not every H.264 sample is a keyframe).

If I change the if condition to just (trackBundle.getCurrentSampleFlags() & C.BUFFER_FLAG_END_OF_STREAM) != 0 then the subtitles become un-garbled (because the queue is able to correctly re-order the out-of-order samples). However, I'm not sure this is a correct change in general (we should be able to rely on flushing our queue on each keyframe, and we should be able to rely on the MP4 keyframe metadata being correct).

This seems to be an issue with your content muxing, rather than with ExoPlayer's logic - I suggest you report it to whoever is producing your content.

@linhai326
Copy link
Author

@icbaker
Thanks a lot for looking into this issue.
We will check with our content provider!!

copybara-service bot pushed a commit that referenced this issue Nov 14, 2024
The content in Issue: #1863 has every sample incorrectly marked as a
sync sample in the MP4 metadata, which results in flushing the
re-ordering queue on every sample, so nothing gets re-ordered, so the
subtitles are garbled.

There are currently two "uses" for this call on every keyframe:
1. It offers a safety valve if we don't read a `maxNumReorderSamples`
value from the video. Without this, the queue will just keep growing
and end up swallowing all subtitle data (similar to the bug fixed by
39c7349).

2. When we do read (or infer) a `maxNumReorderSamples` it means we can
emit samples from the queue slightly earlier - but this is pretty
marginal, given i think the max possible value for
`maxNumReorderSamples` is 16, so the most benefit we would get is 16
frames (~0.53s at 30fps) - in most cases we will have more than 0.5s
of buffer ahead of the playback position, so the subtitles will still
get shown at the right time with no problem.

(1) is resolved in this change by setting the queue size to zero (no
reordering) if we don't have a value for `maxNumReorderSamples`.

(2) has minimal impact, so we just accept it.

We may be able to inspect the NAL unit to determine IDR vs non-IDR
instead - we will consider this as a follow-up change, but given the
minimal impact of (2) we may not pursue this.

#cherrypick

PiperOrigin-RevId: 696583702
@icbaker
Copy link
Collaborator

icbaker commented Nov 15, 2024

We discussed as a team and decided the benefit of relying on the MP4 sync sample metadata for flushing is marginal, and the result of poor behaviour on malformed media like this is quite catastrophic - so we decided to remove the assumption that MP4 sync sample metadata is correct for this use-case (re-ordering SEI CEA-608 samples) - see the change linked above.

@icbaker icbaker closed this as completed Nov 15, 2024
icbaker added a commit that referenced this issue Nov 15, 2024
The behaviour was changed in 1.4.0 with 0f42dd4,
so that the buffer timestamp is compared to `outputStartTimeUs` when
deciding whether to discard a "decode only" buffer before decoding
(instead of the deprecated/removed `isDecodeOnly` property). This breaks
when the buffer timestamp is `TIME_END_OF_SOURCE` (which is
`Long.MIN_VALUE`), because `TIME_END_OF_SOURCE < outputStartTimeUs` is
always true, so the end-of-stream buffer is never passed to the decoder
and on to `TextRenderer` where it is used to
[set `inputStreamEnded = true`](https://github.com/androidx/media/blob/40f187e4b47c445af5049a5a49ee4633ce4c1a76/libraries/exoplayer/src/main/java/androidx/media3/exoplayer/text/TextRenderer.java#L434-L436)
and so playback hangs.

Issue: #1863
PiperOrigin-RevId: 695767247
(cherry picked from commit 19b38c8)
icbaker added a commit that referenced this issue Nov 15, 2024
The content in Issue: #1863 has every sample incorrectly marked as a
sync sample in the MP4 metadata, which results in flushing the
re-ordering queue on every sample, so nothing gets re-ordered, so the
subtitles are garbled.

There are currently two "uses" for this call on every keyframe:
1. It offers a safety valve if we don't read a `maxNumReorderSamples`
value from the video. Without this, the queue will just keep growing
and end up swallowing all subtitle data (similar to the bug fixed by
39c7349).

2. When we do read (or infer) a `maxNumReorderSamples` it means we can
emit samples from the queue slightly earlier - but this is pretty
marginal, given i think the max possible value for
`maxNumReorderSamples` is 16, so the most benefit we would get is 16
frames (~0.53s at 30fps) - in most cases we will have more than 0.5s
of buffer ahead of the playback position, so the subtitles will still
get shown at the right time with no problem.

(1) is resolved in this change by setting the queue size to zero (no
reordering) if we don't have a value for `maxNumReorderSamples`.

(2) has minimal impact, so we just accept it.

We may be able to inspect the NAL unit to determine IDR vs non-IDR
instead - we will consider this as a follow-up change, but given the
minimal impact of (2) we may not pursue this.

PiperOrigin-RevId: 696583702
(cherry picked from commit e6448f3)
icbaker added a commit that referenced this issue Nov 16, 2024
The content in Issue: #1863 has every sample incorrectly marked as a
sync sample in the MP4 metadata, which results in flushing the
re-ordering queue on every sample, so nothing gets re-ordered, so the
subtitles are garbled.

There are currently two "uses" for this call on every keyframe:
1. It offers a safety valve if we don't read a `maxNumReorderSamples`
value from the video. Without this, the queue will just keep growing
and end up swallowing all subtitle data (similar to the bug fixed by
39c7349).

2. When we do read (or infer) a `maxNumReorderSamples` it means we can
emit samples from the queue slightly earlier - but this is pretty
marginal, given i think the max possible value for
`maxNumReorderSamples` is 16, so the most benefit we would get is 16
frames (~0.53s at 30fps) - in most cases we will have more than 0.5s
of buffer ahead of the playback position, so the subtitles will still
get shown at the right time with no problem.

(1) is resolved in this change by setting the queue size to zero (no
reordering) if we don't have a value for `maxNumReorderSamples`.

(2) has minimal impact, so we just accept it.

We may be able to inspect the NAL unit to determine IDR vs non-IDR
instead - we will consider this as a follow-up change, but given the
minimal impact of (2) we may not pursue this.

PiperOrigin-RevId: 696583702
(cherry picked from commit e6448f3)
icbaker added a commit that referenced this issue Nov 19, 2024
The behaviour was changed in 1.4.0 with 0f42dd4,
so that the buffer timestamp is compared to `outputStartTimeUs` when
deciding whether to discard a "decode only" buffer before decoding
(instead of the deprecated/removed `isDecodeOnly` property). This breaks
when the buffer timestamp is `TIME_END_OF_SOURCE` (which is
`Long.MIN_VALUE`), because `TIME_END_OF_SOURCE < outputStartTimeUs` is
always true, so the end-of-stream buffer is never passed to the decoder
and on to `TextRenderer` where it is used to
[set `inputStreamEnded = true`](https://github.com/androidx/media/blob/40f187e4b47c445af5049a5a49ee4633ce4c1a76/libraries/exoplayer/src/main/java/androidx/media3/exoplayer/text/TextRenderer.java#L434-L436)
and so playback hangs.

Issue: #1863
PiperOrigin-RevId: 695767247
(cherry picked from commit 19b38c8)
icbaker added a commit that referenced this issue Nov 19, 2024
The content in Issue: #1863 has every sample incorrectly marked as a
sync sample in the MP4 metadata, which results in flushing the
re-ordering queue on every sample, so nothing gets re-ordered, so the
subtitles are garbled.

There are currently two "uses" for this call on every keyframe:
1. It offers a safety valve if we don't read a `maxNumReorderSamples`
value from the video. Without this, the queue will just keep growing
and end up swallowing all subtitle data (similar to the bug fixed by
39c7349).

2. When we do read (or infer) a `maxNumReorderSamples` it means we can
emit samples from the queue slightly earlier - but this is pretty
marginal, given i think the max possible value for
`maxNumReorderSamples` is 16, so the most benefit we would get is 16
frames (~0.53s at 30fps) - in most cases we will have more than 0.5s
of buffer ahead of the playback position, so the subtitles will still
get shown at the right time with no problem.

(1) is resolved in this change by setting the queue size to zero (no
reordering) if we don't have a value for `maxNumReorderSamples`.

(2) has minimal impact, so we just accept it.

We may be able to inspect the NAL unit to determine IDR vs non-IDR
instead - we will consider this as a follow-up change, but given the
minimal impact of (2) we may not pursue this.

PiperOrigin-RevId: 696583702
(cherry picked from commit e6448f3)
@androidx androidx locked and limited conversation to collaborators Jan 15, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

3 participants