What counts as Test & Improve AI Systems?

A role counts when the person is hired to evaluate AI outputs or behavior and feed those judgments back into product quality, safety, reliability, or retrieval improvement.

Are data labeling jobs included?

Only when the work includes professional evaluation, domain judgment, QA standards, rubric ownership, or a clear system improvement loop.

How is this different from Organize Knowledge for AI?

Test & Improve AI Systems judges outputs and behavior. Organize Knowledge for AI improves the source material, taxonomy, documentation, or retrieval layer the system depends on.

Test & Improve AI Systems

45 live matches30 remotes4 hybrids8 on-sites18 comp disclosed