Previous bug: only code:* and review labels were checked, so agent:document,
agent:tests, etc. were never filtered, and any agent could pick up any task.
Now: labels with the agent: prefix are matched against the agent's
capabilities. Other labels are treated as metadata. Includes a regression test.
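
A minimal sketch of the matching rule described above, not the actual
implementation. The names agent_can_take and required_capabilities are
hypothetical, and two details are assumptions: that an agent's capabilities
are plain strings matching the text after the agent: prefix, and that a task
carrying several agent: labels requires all of them.

```python
AGENT_PREFIX = "agent:"


def required_capabilities(labels):
    """Collect the capability names demanded by agent:* labels.

    Labels without the agent: prefix are metadata and are ignored here.
    """
    return {
        label[len(AGENT_PREFIX):]
        for label in labels
        if label.startswith(AGENT_PREFIX)
    }


def agent_can_take(labels, capabilities):
    """Return True if the agent has every capability the task asks for.

    Assumption: a task with no agent:* labels is open to any agent.
    """
    return required_capabilities(labels) <= set(capabilities)


# Regression-style checks for the bug described above.
docs_agent = {"document"}
print(agent_can_take(["agent:document", "priority:high"], docs_agent))
print(agent_can_take(["agent:tests"], docs_agent))
print(agent_can_take(["priority:high"], docs_agent))
```

Under these assumptions, the first and third calls accept the task and the
second rejects it, because the agent lacks the tests capability.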