FlightPath / FLPATH-1206

[M2K] Workflow stuck in active state if m2k pod is deleted during workflow execution


      Description of the problem:
      If the m2k pod of a move2kube workflow with a running instance is deleted before the Q&A questions are answered, the questions can still be answered once the pod is fully back up, but the workflow never finishes executing and remains in the Active state. No notification is given to indicate whether the workflow failed or succeeded.

      How reproducible: 100%

      Steps to reproduce:

      1. Deploy Orchestrator 0.2.11 and move2kube 0.2.18.

      2. Start a Move2kube instance with correct input, and wait for the Waiting for Q&A notification.

      3. After the notification pops up, delete the m2k pod. 

      4. After the m2k pod is fully back up, click the Q&A link and answer the questions.
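For anyone retrying step 3 repeatedly, the pod deletion and the wait for the replacement pod could be scripted roughly as follows. This is only a sketch: the namespace `my-namespace` and the label selector `app=m2k` are hypothetical placeholders that are not taken from this report, and must be replaced with the values used in the actual deployment. The `kubectl` subcommands and flags themselves are standard.

```python
import subprocess


def repro_commands(namespace, selector, timeout="300s"):
    """Build the kubectl invocations for step 3 (delete pod, wait for replacement).

    namespace and selector are supplied by the caller; this report does not
    state the real values.
    """
    return [
        # Delete the running m2k pod; its controller is expected to recreate it.
        ["kubectl", "delete", "pod", "-n", namespace, "-l", selector],
        # Block until a pod matching the selector reports Ready. If the
        # replacement pod has not been created yet, kubectl may report that
        # no resources matched, in which case the wait should be retried.
        ["kubectl", "wait", "--for=condition=Ready", "pod",
         "-n", namespace, "-l", selector, f"--timeout={timeout}"],
    ]


def run(commands, dry_run=True):
    """Print the commands (dry run) or execute them, stopping on the first failure."""
    for argv in commands:
        if dry_run:
            print(" ".join(argv))
        else:
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    # Hypothetical namespace and label; adjust before running for real.
    run(repro_commands("my-namespace", "app=m2k"), dry_run=True)
```

With `dry_run=True` the script only prints the commands, so they can be reviewed before anything is deleted.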

      Actual results:
      The M2K workflow instance stays in the Active state and never completes. No notification is given.

      Expected results:
      The workflow completes execution normally and a notification reports the result.

      Additional information:

      From move2kube logs:

      time="2024-04-04T12:14:22Z" level=info msg="Project: 42ab28ca-e08a-4ad4-92c0-5196a3a2b456; Output:0dc3a81a-93ab-490e-a92e-0455f834b669; level=info msg=\"Transformation done\""
      time="2024-04-04T12:14:22Z" level=info msg="Project: 42ab28ca-e08a-4ad4-92c0-5196a3a2b456; Output:0dc3a81a-93ab-490e-a92e-0455f834b669; level=info msg=\"Transformed target artifacts can be found at [/move2kube-api/data/projects/42ab28ca-e08a-4ad4-92c0-5196a3a2b456/outputs/0dc3a81a-93ab-490e-a92e-0455f834b669/output/mta].\""
      time="2024-04-04T12:14:22Z" level=info msg="failed to fetch the stderr of move2kube transform. Error: \"EOF\""
      time="2024-04-04T12:14:22Z" level=error msg="failed to get the next question. Error: \"failed to send a GET request to the URL http://localhost:42335/problems/current . Error: \\\"Get \\\\\\\"http://localhost:42335/problems/current\\\\\\\": dial tcp [::1]:42335: connect: connection refused\\\"\"" request-id=8688b38e-d84f-4718-af4c-20d01851868a
      time="2024-04-04T12:14:26Z" level=info msg="GET /api/v1/workspaces/352a2798-88d5-468c-a3d0-f16d6254009f/projects/42ab28ca-e08a-4ad4-92c0-5196a3a2b456/outputs/0dc3a81a-93ab-490e-a92e-0455f834b669/problems/current" request-id=0cc5b607-3f17-4b55-a6bb-f1b3a60308e3
      time="2024-04-04T12:14:26Z" level=info msg="GET /api/v1/workspaces/352a2798-88d5-468c-a3d0-f16d6254009f/projects/42ab28ca-e08a-4ad4-92c0-5196a3a2b456" request-id=39a86199-4132-4ed9-a574-f6ed3a7f3c14
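When triaging logs like the ones above, it may help to pull out the failure lines automatically. The sketch below assumes the logs keep the `key=value` layout shown here (quoted values with backslash escapes); the function names `parse_line` and `find_failures` are illustrative and not part of move2kube. Note that one of the failure messages above is logged at `level=info`, so the filter also matches on messages that start with "failed" rather than relying on the level alone.

```python
import re

# key=value pairs; a value is either a double-quoted string (with backslash
# escapes) or a bare token without whitespace.
_PAIR = re.compile(r'([\w-]+)=("(?:[^"\\]|\\.)*"|\S+)')


def parse_line(line):
    """Parse one log line into a dict of its top-level fields."""
    fields = {}
    for key, raw in _PAIR.findall(line):
        if len(raw) >= 2 and raw.startswith('"') and raw.endswith('"'):
            # Strip the outer quotes and remove one level of escaping.
            raw = re.sub(r'\\(.)', r'\1', raw[1:-1])
        fields[key] = raw
    return fields


def find_failures(lines):
    """Return parsed entries that are errors or whose message starts with 'failed'."""
    failures = []
    for line in lines:
        fields = parse_line(line.strip())
        msg = fields.get("msg", "")
        if fields.get("level") == "error" or msg.startswith("failed"):
            failures.append(fields)
    return failures


# Two lines copied from the log excerpt above.
SAMPLE = [
    r'time="2024-04-04T12:14:22Z" level=info msg="Project: 42ab28ca-e08a-4ad4-92c0-5196a3a2b456; Output:0dc3a81a-93ab-490e-a92e-0455f834b669; level=info msg=\"Transformation done\""',
    r'time="2024-04-04T12:14:22Z" level=info msg="failed to fetch the stderr of move2kube transform. Error: \"EOF\""',
]

if __name__ == "__main__":
    for entry in find_failures(SAMPLE):
        print(entry["time"], entry["level"], entry["msg"])
```

Nested `level=` and `msg=` text inside a quoted message is kept as part of that message, so only the outer fields decide whether a line is reported.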
       

            rh-ee-gfarache Gabriel Farache
            yfirst Yona First