Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Agent remains stuck in Outdated policy state for over 10 minutes on updating spaces used by the policy. #6454

Closed
amolnater-qasource opened this issue Dec 31, 2024 · 7 comments · Fixed by elastic/kibana#207381
Assignees
Labels
bug Something isn't working impact:high Short-term priority; add to current release, or definitely next. QA:Ready For Testing Code is merged and ready for QA to validate Team:Elastic-Agent-Control-Plane Label for the Agent Control Plane team Team:Fleet Label for the Fleet team

Comments

@amolnater-qasource
Copy link

Kibana Build details:

VERSION: 8.18.0 SNAPSHOT
BUILD: 81372
COMMIT: 2b68edac50a036040cbdaf0f14030d39d39da677

Preconditions:

  1. 8.18.0-SNAPSHOT Kibana cloud environment should be available.
  2. A few agents should be installed.
  3. A few spaces should be used by the agent policy.

Steps to reproduce:

  1. Navigate to Fleet>Agents tab.
  2. Update spaces for any agent policy used by Agents.
  3. Observe Agent remains stuck in Outdated policy state for over 10 minutes.
  4. Observe Agent policy remain stuck in updating under agent activity.
  5. Collect logs for the agent.[Before this logs were not collected]
  6. Observe error: Rate limit exceeded.
  7. Wait for sometime and observe agent gets recovered from outdated state and logs are collected.

NOTE:

  • Issue is not observed for agent policy without Agent installed.
    Image

Expected Result:
Agent should get updated on updating spaces used by the policy.

Screen Recording:

Agents.-.Fleet.-.Elastic.-.Google.Chrome.2024-12-31.14-47-29.mp4

Feature:
https://github.com/elastic/ingest-dev/issues/1664

Logs:
elastic-agent-diagnostics-2024-12-31T09-24-56Z-00.zip

JSON:
EC2AMAZ-SN9KQPI-agent-details.zip

@amolnater-qasource amolnater-qasource added bug Something isn't working impact:high Short-term priority; add to current release, or definitely next. Team:Elastic-Agent-Control-Plane Label for the Agent Control Plane team Team:Fleet Label for the Fleet team labels Dec 31, 2024
@elasticmachine
Copy link
Contributor

Pinging @elastic/fleet (Team:Fleet)

@elasticmachine
Copy link
Contributor

Pinging @elastic/elastic-agent-control-plane (Team:Elastic-Agent-Control-Plane)

@amolnater-qasource
Copy link
Author

@muskangulati-qasource Please review.

@muskangulati-qasource
Copy link

Secondary review is Done for this ticket!

@jlind23
Copy link
Contributor

jlind23 commented Dec 31, 2024

FYI @nchaulet adding this one to our current sprint
cc @kpollich

@nchaulet nchaulet self-assigned this Jan 17, 2025
@nchaulet
Copy link
Member

It seems it's an issue when updating previous agent action it trigger fleet server to distribute them again. Introduced by elastic/kibana#203683

I am not sure what will be the best way to solve that one,
@kpollich @nimarezainia will it be an acceptable behaviour to not update previous agent actions?

@nchaulet
Copy link
Member

How to reproduce the issue

with space awareness enabled, enroll an agent and request a diagnostics, than assign that agent to multiple space.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working impact:high Short-term priority; add to current release, or definitely next. QA:Ready For Testing Code is merged and ready for QA to validate Team:Elastic-Agent-Control-Plane Label for the Agent Control Plane team Team:Fleet Label for the Fleet team
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants