Improve handling of ffmpeg output #87

gingerbreadassassin · 2025-03-18T18:00:39Z

Fixes:

If "speed" should be anything but 0, let's discuss.

BotBlake · 2025-03-18T20:36:51Z

Hi there @gingerbreadassassin .
First of all I want to thank you for using and contributing to jellybench!

Now regarding the topic about the speed value not being reported by ffmpeg in some cases.
In my personal oppinion, there is no point in continuing the benchmark, or at least the run, when ffmpeg is not delivering usefull output.
Just "faking" output with the speed or zero seems like a hacky approach.
How would you like the idea of marking the run as failed when any ffmpeg process doesnt report its speed.
Also I would suggest to add a warning in the logger whenever the output cannot be parsed properly.

In general the script heavily relies on ffmpeg reporting the speed to calculate results and scale up the worker count accordingly.
Therefore overwriting that value with anything might have unexpected consequences.

I look forward to your thoughts on these suggestions.
Kind regards.

gingerbreadassassin · 2025-03-19T04:27:04Z

seems like a hacky approach

💯 it is

marking the run as failed when any ffmpeg process doesnt report its speed

Agree the proper way to handle this would be a try/except clause. I almost started working on that, but my main objective was just to get the benchmark to complete, not to read through the whole script to figure out where best to put the error handling.

In general the script heavily relies on ffmpeg reporting the speed to calculate results and scale up the worker count accordingly.
Therefore overwriting that value with anything might have unexpected consequences.

This is the understanding of the benchmarking tool that I lacked, and why I opened the PR with what I had "working" for discussion. I wasn't sure how much this would affect the outcome of the benchmark. If I have time between work tomorrow, I'll give the more appropriate approach a shot.

…nto develop

BotBlake

I understand the need for a fix, however its not an option to just "ignore" that REQUIRED VALUES are sometimes straight up missing.

Additionaly this implementation is not ideal, as it introduces redundant value checking.
Also if the speed value is missing entirely, the issue will just continue to show up!

BotBlake · 2025-04-01T12:03:38Z

jellybench_py/core.py

@@ -356,8 +356,8 @@ def benchmark(ffmpeg_cmd: str, debug_flag: bool, prog_bar, limit=0) -> tuple:
        result = {
            "max_streams": max_pass,
            "failure_reasons": failure_reason,
-            "single_worker_speed": max_pass_run_data["speed"],
-            "single_worker_rss_kb": max_pass_run_data["rss_kb"],
+            "single_worker_speed": max_pass_run_data.get("speed", 0),


Sets speed to zero - The Client may try to downscale the ammount of workers, or not mark this run instantly as failed.
Additionaly, why is this access protected in core.py? Shouldn't it be ALWAYS written to in worker.py in the first place?

In fact, this PR already introduces the handling of this value in worker.py - this is just duplicated code.

This should not be done this way.
Specifically the lack of this value indicates a greater issue.
The run should be marked as failed.

BotBlake · 2025-04-01T12:05:33Z

jellybench_py/core.py

-            "single_worker_speed": max_pass_run_data["speed"],
-            "single_worker_rss_kb": max_pass_run_data["rss_kb"],
+            "single_worker_speed": max_pass_run_data.get("speed", 0),
+            "single_worker_rss_kb": max_pass_run_data.get("rss_kb", 0),


Sets lasts run rss_kb to zero if not available - This has no effect on the runtime, but on the results.
Its just FALSE that the single worker_rss was zero.
Additionaly, why is this access protected in core.py? Shouldn't it be ALWAYS written to in worker.py in the first place?

Even though this does not introduce greater issues, the lack of this value indicates there are other issues to be solved.

BotBlake · 2025-04-01T12:10:15Z

jellybench_py/worker.py

@@ -122,6 +122,8 @@ def workMan(worker_count: int, ffmpeg_cmd: str, passed_logger: Logger) -> tuple:
                    workrss = float(
                        rssline[1].split("=")[-1].replace("kB", "").replace("KiB", "")
                    )  # maxrss
+                else:
+                    workrss = 0


Sets lasts run rss_kb to zero if not available - This has no effect on the runtime, but on the results.
Its just FALSE that the single worker_rss was zero - sure, WE know it, but the server does not when it tries to interpret the data.

Even though this does not introduce greater issues, the lack of this value indicates there are other issues to be solved.

BotBlake · 2025-04-01T12:13:45Z

jellybench_py/worker.py

+                speed = new_line[6].split("=")[-1].replace("x", "")
+                if speed == "N/A":
+                    speed = 0
+                speeds.append(float(speed))


What if the speed value doesnt exist at all instead of being written to with "N/A"?
Then converting to a float would not be possible.
Thats specifically the issue here.

Also its not clear what causes ffmpeg to not report this value.
Imo it points to other issues!

Anyways_
The results of ANY run that wasnt abled to gather valid result data from ffmpeg should NEVER be marked as healthy!
Having valid result data would indicate this!

Instead the run should be marked as unhealthy, by returning the failure.

gingerbreadassassin · 2025-04-01T20:37:34Z

I understand the need for a fix, however[...]

My sincerest apologies if my last commit re-submitted a request for review; I should've marked this PR as a draft after our previous discussion

gingerbreadassassin added 3 commits March 17, 2025 15:20

fix BotBlake#70

58ae63a

fix KeyError: 'speed' when ffmpeg does not print 'speed'

2eedd22

Fix KeyError: 'rss_kb'

d7611d2

gingerbreadassassin added 2 commits March 18, 2025 23:36

Merge branch 'develop' of https://github.com/BotBlake/jellybench_py i…

1f702bf

…nto develop

hack missing workrss

35b4d43

BotBlake requested changes Apr 1, 2025

View reviewed changes

gingerbreadassassin marked this pull request as draft April 1, 2025 20:34

BotBlake mentioned this pull request May 21, 2025

Fix benchmark function to safely access max_pass_run_data and no successful runs #91

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve handling of ffmpeg output #87

Improve handling of ffmpeg output #87

Uh oh!

gingerbreadassassin commented Mar 18, 2025

Uh oh!

BotBlake commented Mar 18, 2025

Uh oh!

gingerbreadassassin commented Mar 19, 2025 •

edited

Loading

Uh oh!

BotBlake left a comment

Uh oh!

BotBlake Apr 1, 2025

Uh oh!

BotBlake Apr 1, 2025

Uh oh!

BotBlake Apr 1, 2025

Uh oh!

BotBlake Apr 1, 2025

Uh oh!

gingerbreadassassin commented Apr 1, 2025

Uh oh!

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Improve handling of ffmpeg output #87

Are you sure you want to change the base?

Improve handling of ffmpeg output #87

Uh oh!

Conversation

gingerbreadassassin commented Mar 18, 2025

Uh oh!

BotBlake commented Mar 18, 2025

Uh oh!

gingerbreadassassin commented Mar 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

BotBlake left a comment

Choose a reason for hiding this comment

Uh oh!

BotBlake Apr 1, 2025

Choose a reason for hiding this comment

Uh oh!

BotBlake Apr 1, 2025

Choose a reason for hiding this comment

Uh oh!

BotBlake Apr 1, 2025

Choose a reason for hiding this comment

Uh oh!

BotBlake Apr 1, 2025

Choose a reason for hiding this comment

Uh oh!

gingerbreadassassin commented Apr 1, 2025

Uh oh!

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

gingerbreadassassin commented Mar 19, 2025 •

edited

Loading