Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

JenkinsQueueJob: Job progress tracking failed to read job progress #13014

Closed
Nekto89 opened this issue May 27, 2020 · 17 comments
Closed

JenkinsQueueJob: Job progress tracking failed to read job progress #13014

Nekto89 opened this issue May 27, 2020 · 17 comments
Assignees
Labels
Area: ABTT Akvelon Build Tasks Team area of work bug Task: JenkinsQueueJob

Comments

@Nekto89
Copy link

Nekto89 commented May 27, 2020

Required Information

Entering this information will route you directly to the right team and expedite traction.

Question, Bug, or Feature?
Type: Bug

Enter Task Name: JenkinsQueueJob

Environment

  • Server - Azure Pipelines or TFS on-premises?

    • TFS on-premise Version Dev17.M153.5
  • Agent - Hosted or Private:

    • If using private agent, provide the OS of the machine running the agent and the agent version:
      Windows 10.0.18363, Agent.Version 2.153.1

Issue Description

Sometimes task loses connection to Jenkins server temporarily. But it continues to wait until job finishes on Jenkins side.
image
Task fails, build fails. At the same time info is available that Jenkins job has finished successfully on Summary tab.
image

Task logs

In the log there may be errors like this in the middle:
"
##[error]Job progress tracking failed to read job progress
Job progress tracking failed to read job progress
HttpResponse.statusCode=500
HttpResponse.statusMessage=Server Error
##[error]Job progress tracking failed to read job progress
"

@ericstrongDevOps
Copy link

any update on this issue? TIA

@walliski
Copy link

I am also still seeing this issue now and then, very inconvenient.

@anatolybolshakov anatolybolshakov added Area: ABTT Akvelon Build Tasks Team area of work Task: JenkinsQueueJob labels May 31, 2021
@anatolybolshakov
Copy link
Contributor

Hi @Nekto89 @ericstrongDevOps @walliski could you please share the logs with this error - for us to understand better how we could fix this? We could add retry logic - but we need to determine where exactly this issue happens.

@walliski
Copy link

walliski commented Jun 2, 2021

@anatolybolshakov

I managed to reproduce the problem with debug on, by queuing a job in Jenkins that simply had a sleep step, and following the (non-existent) console output.

This is what I got in the logs in AzDo:

...

##[debug](2:Test Stuff:167) debug: Tracking progress of job URL: https://jenkins.int/job/Test%2520Stuff/167/logText/progressiveText/?start=3827
##[debug]streamConsole().requestCallback()

##[debug](2:Test Stuff:167) debug: Tracking progress of job URL: https://jenkins.int/job/Test%2520Stuff/167/logText/progressiveText/?start=3827
Job progress tracking failed to read job progress

##[debug]streamConsole().requestCallback()
HttpResponse.statusCode=500

##[debug]task result: Failed
HttpResponse.statusMessage=Server Error

##[error]Job progress tracking failed to read job progress
##[debug]Processed: ##vso[task.issue type=error;]Job progress tracking failed to read job progress
##[debug]Processed: ##vso[task.complete result=Failed;]Job progress tracking failed to read job progress

##[debug](2:Test Stuff:167) debug: Tracking progress of job URL: https://jenkins.int/job/Test%2520Stuff/167/logText/progressiveText/?start=3827
##[debug]streamConsole().requestCallback()

...

Before and after this was a long list of debug messages for the tracking. So it seems like even if it sets the build as failed, it still continues to poll and run the task in Jenkins (which finished successfully). Build ran for 60 minutes, and there was only one failed poll attempt during that time.

The reason why Jenkins threw a random 500 was a "Socket timeout" for the authentication plugin that we are using.

@anatolybolshakov
Copy link
Contributor

@Nekto89 @walliski thanks! Let us take a look - probably it make sense to add some retry logic for these requests - to avoid intermittent issues.

@cmwiebe24
Copy link

Any update on this issue? Or is there a work around?

@walliski
Copy link

walliski commented Sep 3, 2021

@anatolybolshakov Sorry to be annoying, but any progress on this? This is an issue that we keep seeing almost daily.

@kuleshovilya kuleshovilya assigned kuleshovilya and unassigned leantk Sep 6, 2021
@kuleshovilya
Copy link
Contributor

Hello @walliski We've already implemented this feature, and awaiting for deployment now, it will take few weeks, we will notify you when it's done

@kuleshovilya kuleshovilya added the awaiting deployment Related changes are waiting for deployment to be completed label Sep 6, 2021
@kuleshovilya
Copy link
Contributor

Feature has been merged, you can check this out now, closing the issue

@walliski
Copy link

@kuleshovilya

I just got this again, same error message:

Job progress tracking failed to read job progress
HttpResponse.statusCode=500
HttpResponse.statusMessage=Server Error
##[error]Job progress tracking failed to read job progress

But this message in the end:

Finished: SUCCESS

On this version:

==============================================================================
Task         : Jenkins queue job
Description  : Queue a job on a Jenkins server
Version      : 2.192.0
Author       : Microsoft
Help         : https://docs.microsoft.com/azure/devops/pipelines/tasks/build/jenkins-queue-job
==============================================================================

@kuleshovilya kuleshovilya reopened this Oct 13, 2021
@kuleshovilya
Copy link
Contributor

@walliski Could you please send a full logs for this job for the analysis? You can send it in private to v-ikuleshov@microsoft.com

@walliski
Copy link

The run did not run in debug mode, so what I posted above is the only log output I can see that would be outside of our own scripts output, unless there is something that you are clearly missing? I could try to reproduce the issue again in debug mode though and see if that would bring some more clarity. IIRC I saw some 500 error before in Jenkins logs, but I cannot seem to find that now, so not really sure either what's going on...

@kuleshovilya
Copy link
Contributor

@walliski Yes, can you please try to reproduce and send logs?

@kuleshovilya kuleshovilya removed the awaiting deployment Related changes are waiting for deployment to be completed label Oct 18, 2021
@kuleshovilya
Copy link
Contributor

@walliski Any update?

@walliski
Copy link

walliski commented Oct 25, 2021

I sent an email to the email you mentioned two weeks ago. There was however no interesting things in the logs, the place where it fails looks the same as before. Or what did you mean?

@anatolybolshakov
Copy link
Contributor

Hi @walliski sorry for the delay - we've added new inputs for JenkinsQueueJob task v2.192.1 - 'retryCount' and 'delayBetweenRetries' (you can find more info in docs) in PR - could you please check it now? As I see there is a 2.192.0 task version in the last logs - which does not have a fix.

@walliski
Copy link

walliski commented Nov 10, 2021

Will take a look. I saw that the task was updated compared to previous jobs, so I assumed it would have had the changes in it, but apparently not. Now I see that we are currently on 2.193.0, so I guess that should be good to go. I'll get back to you if I see more issues, thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Area: ABTT Akvelon Build Tasks Team area of work bug Task: JenkinsQueueJob
Projects
None yet
Development

No branches or pull requests

9 participants