› Conversation design role writing prompts, optimizing chatbot flows, and QA-testing LLM experiences for customer journeys.
comp not disclosed1d agoapplyAI_EVALUATION_/_QAApplied AI Evaluation ScientistJumpRemote (U.S.)› Evaluation role owning RAG and agent quality frameworks, with research-grade Python instead of production infrastructure.
$180–270k1d agoapplyAI_EVALUATION_/_QAAI Quality OperatorNeon HealthSan Francisco, CA (USA)› Healthcare QA role reviewing AI agent calls, catching errors, labeling issues, and improving real workflows.
$594k1d agoapplyAI_EVALUATION_/_QAOperations Specialist, AI EnablementBumble Inc.Austin, TX / London / Remote› QA role reviewing AI support conversations for accuracy, policy fit, tone, and recurring failure patterns.
comp not disclosed1d agoapplyAI_EVALUATION_/_QAAdversarial Prompt ExpertReinforce LabsRemote› Red-team role finding jailbreaks, ranking model failures, and documenting attack paths so safety teams can patch them.
comp not disclosed1d agoapplyAI_EVALUATION_/_QAPrompt EngineerCantinaLos Angeles; San Francisco› Expert prompt role owning AI character behavior, personality systems, and evaluation frameworks for social generative AI experiences.
$150–180k3d agoapplyAI_EVALUATION_/_QAPrompt Engineer - AI Innovation Team - USSitusAMCUS - Remote› Prompt-focused AI role owning use-case translation, agent behavior oversight, and quality testing for commercial real estate workflows.
$50k3d agoapplyAI_EVALUATION_/_QAAI Content Reviewer (Video)Crossing HurdlesRemote› High-signal role for evaluating the next generation of AI video models.
$25–34k5d agoapplyAI_EVALUATION_/_QASenior AI Evaluation Specialist — IP Guardrails and Agentic WorkflowsAdobeNew York, NY› Recently posted role for Adobe seeking an AI Eval Specialist.
$155–281k5d agoapplyAI_EVALUATION_/_QAAI Agent Architect, Customer ExperienceAirtableRemote - US› Strong fit for Airtable - focus on workflow designer.
$196–278k5d agoapplyAI_EVALUATION_/_QAAI Operations SpecialistBretton AISan Francisco, CA› An interesting operational role for someone interested in the evaluation and quality side of AI agent deployments.
$90–105k5d agoapply