feat: agent job distribution #712
Imported from GitHub.
Original GitHub issue: #2237
Original author: @mfreeman451
Original URL: https://github.com/carverauto/serviceradar/issues/2237
Original created: 2026-01-10T00:11:41Z
Is your feature request related to a problem?
When a tenant has multiple agents in the same partition, and jobs are configured to run (for example) a network scan/sweep over a group of devices in that partition, exactly one agent needs to pick up each job. No more than one agent may pick up the same job, and the agent must release the job when it is finished.
Agents learn about work through configs: we create a config for an agent that controls some of its functionality (do a ping check, port check, process check, or call out to an external checker to perform a custom check). Agents then use gRPC to ask the agent-gateway whether any config or config update is available. If there is, the agent is expected to download the config and perform the work at the prescribed interval.
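A minimal sketch of that poll-for-config flow follows. All names here are assumptions for illustration: `ConfigSource` stands in for the gRPC client to the agent-gateway, and `AgentConfig` with a `Version` field is a hypothetical payload, not the real message type.

```go
package main

import "fmt"

// AgentConfig is a hypothetical config payload; the real schema is not shown in this issue.
type AgentConfig struct {
	Version int
	Checks  []string // e.g. "ping", "port", "process"
}

// ConfigSource stands in for the gRPC call to the agent-gateway (assumed interface).
type ConfigSource interface {
	// Fetch returns a config newer than haveVersion; ok is false when there is none.
	Fetch(haveVersion int) (cfg AgentConfig, ok bool)
}

// Agent holds the config it is currently running.
type Agent struct {
	current AgentConfig
}

// Poll asks the source once and applies any update. It reports whether the config changed.
func (a *Agent) Poll(src ConfigSource) bool {
	cfg, ok := src.Fetch(a.current.Version)
	if !ok {
		return false
	}
	a.current = cfg
	return true
}

// staticSource is a fake gateway that always offers the same latest config.
type staticSource struct {
	latest AgentConfig
}

func (s staticSource) Fetch(haveVersion int) (AgentConfig, bool) {
	if s.latest.Version > haveVersion {
		return s.latest, true
	}
	return AgentConfig{}, false
}

func main() {
	gateway := staticSource{latest: AgentConfig{Version: 2, Checks: []string{"ping", "port"}}}
	agent := &Agent{}
	fmt.Println(agent.Poll(gateway), agent.current.Version) // true 2
	fmt.Println(agent.Poll(gateway), agent.current.Version) // false 2
}
```

In a real agent, `Poll` would run on a timer, and the downloaded checks would be scheduled at their prescribed intervals.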
What we are missing:
External Checkers are broken --
Describe the solution you'd like
We need to completely rework serviceradar-agent so it becomes a gRPC listener, and we need to start updating our gRPC-based external checkers so that instead of being polled, they: