| earlier 45 |
| Test AI Systems | Red Teaming | Generative AI AnalystRed teaming seat — you create prompts to break AI models and document where they fail against safety taxonomies; light on dev, heavy on judgment work. | WGWelo Global | 🇺🇸California, ...Remote | — | 1w ago |
| Test AI Systems | [Remote] AI Prompt Engineer & Evaluator | $50/hr RemotePrompt and eval work shaping e-commerce AI training data — you build realistic shopping simulations and score model outputs, not ship production code. | CAcareersprint | 🇺🇸USARemote | — | 1w ago |
| Build ProductsAutomate WorkflowsTest AI Systems | Creative TechnologistCreative technologists design and maintain generative AI video workflows in a film production setting — heavy on workflow building, moderate Python, and heavy AI fluency. | TRTrueShort | 🇺🇸Los Angeles, ...Remote | $120–180k | 2w ago |
| Test AI SystemsAutomate Workflows | Automated Customer Experience / AI Quality SpecialistQuality-specialist seat evaluating AI agent behavior at scale — you're watching for hallucinations, looping, and escalation failures, then feeding that back to improve how agents perform. | SUSuper.com | 🇺🇸USA or CanadaRemote | $55–81k | 2w ago |
| Test AI Systems | AI Success ManagerCustomer-facing AI adoption role — you translate AI capabilities into working systems for Fortune 500 clients, with hands-on work in prompt design and workflow configuration, not backend engineering. | CRCresta | 🇺🇸United StatesRemote | $160–215k | 2w ago |
| Test AI Systems | AI Red Teamer, CyberAdversarial AI testing role — you probe AI systems for safety and security weaknesses through prompt injection, attack chains, and adversarial inputs. | 1L10a Labs | 🇺🇸Washington DCRemote | — | 2w ago |
| Test AI Systems | AI Quality & Validation SpecialistJunior AI quality and validation role — you test AI outputs, document findings, and flag issues before release; minimal coding, heavy on structured QA work. | PIPingWind | 🇺🇸Annandale, VA, ...Hybrid | — | 2w ago |
| Test AI Systems | AI Tutor - ArabicAudio annotation and linguistic QA role for Grok's voice capabilities — you judge speech quality and cultural nuance, not build AI systems or workflows. | XAxAI | 🌏WorldwideRemote | — | 2w ago |
| Test AI Systems | AI Red Teamer, CybersecurityRed team role that stress-tests frontier AI models for dangerous outputs — you bring offensive security craft, not engineering infrastructure, to a structured evaluation program. | HAHandshake | 🇺🇸San Francisco, ...Remote | — | 3w ago |
| Test AI SystemsAutomate Workflows | Red Teaming | Generative AI Analyst - USARed teaming seat evaluating AI model safety — you construct prompts and document where models fail, not write code or ship features. | WGWelo Global | 🇺🇸United StatesRemote | — | 3w ago |
| Test AI Systems | AI Innovation Quality Assurance - Prompt TeamQA lead role defining and executing quality standards for AI prompt outputs — you own the evaluation lifecycle, not the underlying models or infrastructure. | SISitusAMC | 🇺🇸USRemote | — | 3w ago |
| Build AI AgentsBuild ProductsTest AI Systems | Product ManagerPM role owning agent fleet product and chat flow evals — prototype fast with AI tools, run 15-20 user sessions weekly, decide what ships. | SASauna | 🇺🇸San Francisco, ...On-site | — | 3w ago |
| Automate WorkflowsGrow MarketingTest AI Systems | Operations AI SpecialistPart-time contractor seat contributing senior ops expertise to AI training data — you design real-world tasks and evaluate how AI handles operations scenarios. | WAWeekday AI | 🇺🇸United StatesRemote | — | 1mo ago |
| Test AI SystemsImplement AIAutomate Workflows | AI Product ManagerOwn the systems and metrics that determine whether AI models actually work in the real world—golden sets, eval pipelines, accuracy SLAs, and automated QA. | NANavigate AI | 🇺🇸San Francisco ...Remote | — | 1mo ago |
| Help Adopt AITest AI SystemsAutomate Workflows | Senior AI Enablement SpecialistEnablement role scaling AI adoption across an edtech company — you build the playbooks, prompts, and training, not the platform. Light on dev, heavy on making teams actually use AI day-to-day. | CACurriculum Associates | 🇺🇸USRemote | $93–166k | 1mo ago |
| Build AI AgentsOrganize KnowledgeTest AI Systems | Senior Product Manager — Agentic AI ExperiencesPM seat owning agent behavior end-to-end — you define how AI agents reason, interpret intent, and execute workflows, then measure whether they actually work. | JOJobgether | 🇺🇸United StatesRemote | — | 1mo ago |
| Help Adopt AIGrow MarketingTest AI Systems | Director, AI EnablementStrategic AI enablement seat — you own the roadmap, governance, and champion networks that make the rest of Headspace AI-capable, not the AI products themselves. | HEHeadspace | 🇺🇸United StatesRemote | — | 1mo ago |
| Automate WorkflowsBuild AI AgentsTest AI Systems | AI Operations AnalystSupervise live AI agents in a call center context — step in when agents stumble, catch failure patterns, and feed what you learn back to the team. Light on dev, heavy on judgment and real-time oversight. | CACalltree | 🇺🇸San Francisco, ...On-site | $70–100k | 1mo ago |
| Build ProductsGrow MarketingTest AI Systems | Product Manager (AI Adoption)Builder-first PM role prototyping and shipping AI adoption products — vibe coding with Claude Code/Cursor is non-negotiable, but you're shipping features, not production engineering. | MUMultiverse | 🇬🇧LondonRemote | — | 1mo ago |
| Automate WorkflowsBuild AI AgentsTest AI Systems | Senior Analyst, AI Workflows & AutomationTrust & Safety role building and running AI workflows that flag content, route reports, and handle escalations — you're measured on automation rate and handle time, not on shipping features. | RORoblox | 🌏San Mateo, CAOn-site | — | 1mo ago |
| Build AI AgentsGrow MarketingTest AI Systems | AI Solutions ArchitectOperations role configuring and improving Mercury’s customer-facing chatbot and email agents — you monitor performance, expand bot scope, and optimize workflows rather than build the underlying platform. | MEMercury | 🌏San Francisco, ...Remote | $143–169k | 1mo ago |
| Test AI SystemsImplement AICreate Content | AI Data ExpertContract eval/annotation role for AI output quality — you judge whether AI-generated content meets accuracy, cultural, and brand standards, with flexible hours and no coding required. | LILILT | 🌏Remote; ...Remote | — | 1mo ago |
| Test AI SystemsImplement AIGrow Marketing | AI Operations Specialist | Housing (New Grads 2025-2026)Entry-level AI ops seat watching dashboards and logs to catch AI failures before they escalate — you flag and coordinate, engineers do the fixing. | ELEliseAI | 🌏New York CityOn-site | — | 1mo ago |
| Automate WorkflowsGrow MarketingTest AI Systems | Revenue Systems LeadRevenue systems lead role that builds and runs AI-powered automations with Clay, Zapier, and n8n — you're owning the GTM tech stack and its workflows, not just advising on it. | EVEve | 🌏Remote - USRemote | — | 1mo ago |
| Test AI SystemsCreate ContentAutomate Workflows | AI Conversation DesignerConversation design role writing prompts, optimizing chatbot flows, and QA-testing LLM experiences for customer journeys. | PEPearl | 🌏Remote, USRemote | — | 1mo ago |
| Test AI SystemsBuild AI AgentsOrganize Knowledge | Applied AI Evaluation ScientistEvaluation role owning RAG and agent quality frameworks, with research-grade Python instead of production infrastructure. | JUJump | 🌏Remote (U.S.)Remote | $180–270k | 1mo ago |
| Test AI SystemsBuild AI AgentsGrow Marketing | AI Quality OperatorHealthcare QA role reviewing AI agent calls, catching errors, labeling issues, and improving real workflows. | NHNeon Health | 🌏San Francisco, ...On-site | $59–136k | 1mo ago |
| Automate WorkflowsBuild AI AgentsTest AI Systems | AI OperatorOperator seat embedded in revenue, platform, or research work, using frontier AI to tighten execution rhythms. | DADistyl AI | 🌏()On-site | $140–200k | 1mo ago |
| Test AI SystemsBuild AI AgentsOrganize Knowledge | Operations Specialist, AI EnablementQA role reviewing AI support conversations for accuracy, policy fit, tone, and recurring failure patterns. | BIBumble Inc. | 🌏Austin, TX / ...Remote | — | 1mo ago |
| Test AI SystemsCreate Content | Adversarial Prompt ExpertRed-team role finding jailbreaks, ranking model failures, and documenting attack paths so safety teams can patch them. | RLReinforce Labs | 🌏RemoteRemote | — | 1mo ago |
| Test AI SystemsBuild AI AgentsCreate Content | Prompt EngineerExpert prompt role owning AI character behavior, personality systems, and evaluation frameworks for social generative AI experiences. | CACantina | 🌏Los Angeles; ...Hybrid | $150–180k | 1mo ago |
| Test AI SystemsBuild AI AgentsAutomate Workflows | Prompt Engineer - AI Innovation Team - USPrompt-focused AI role owning use-case translation, agent behavior oversight, and quality testing for commercial real estate workflows. | SISitusAMC | 🌏USRemote | $50–80k | 1mo ago |
| Build AI AgentsTest AI SystemsAutomate Workflows | Product Manager, Agent DevelopmentFound on Sierra career page. | SISierra | 🌏San Francisco, ...Hybrid | — | 1mo ago |
| Build AI AgentsTest AI SystemsAutomate Workflows | Product Manager, AgentsAxion is looking for a PM specifically for 'Agents', requiring prior experience in agentic or automation-heavy products. | AXAxion | 🌏San Francisco, ...On-site | — | 1mo ago |
| Build AI AgentsTest AI SystemsAutomate Workflows | Senior Product Manager, Agentic AIJerry.ai's 'Agentic AI' PM role is a clear fit for the mission, focusing on LLM systems within a personal assistant context. | JEJerry.ai | 🌏Raleigh, NCOn-site | — | 1mo ago |
| Test AI SystemsBuild AI AgentsOrganize Knowledge | AI Content Reviewer (Video)High-signal role for evaluating the next generation of AI video models. | CHCrossing Hurdles | 🌏RemoteRemote | — | 1mo ago |
| Build AI AgentsOrganize KnowledgeTest AI Systems | Staff Product Manager, Enterprise AI AgentsEnterprise-focused AI role applying agents and RAG to business process automation. | WEWeedmaps | 🌏New York City, ...Hybrid | $194–215k | 1mo ago |
| Test AI SystemsBuild AI AgentsOrganize Knowledge | Senior AI Evaluation Specialist — IP Guardrails and Agentic WorkflowsRecently posted role for Adobe seeking an AI Eval Specialist. | ADAdobe | 🌏New York, NYOn-site | $155–281k | 1mo ago |
| Build AI AgentsGrow MarketingTest AI Systems | AI Agent Product ManagerHamilton AI is a pure vibe-coding role. Applied Labs is a hands-on AI agent product role. | ALApplied Labs | 🌏New YorkOn-site | $150–200k | 1mo ago |
| Build AI AgentsOrganize KnowledgeTest AI Systems | Senior Product Manager, AI Agents and PlatformStrong fit for Jerry.ai - focus on agent pm. | JEJerry.ai | 🌏San Francisco ...Remote | $150–200k | 1mo ago |
| Help Adopt AIGrow MarketingTest AI Systems | AI Adoption Manager, Strategic & Enterprise Accounts | SoutheastStrong fit for EvenUp - focus on ai enablement/adoption. | EVEvenUp | 🌏Remote-US; ...Remote | — | 1mo ago |
| Test AI SystemsBuild AI AgentsOrganize Knowledge | AI Agent Architect, Customer ExperienceStrong fit for Airtable - focus on workflow designer. | AIAirtable | 🌏Remote - USRemote | $196–278k | 1mo ago |
| Test AI SystemsBuild AI Agents | AI Operations SpecialistAn interesting operational role for someone interested in the evaluation and quality side of AI agent deployments. | BABretton AI | 🌏San Francisco, ...On-site | $90–105k | 1mo ago |
| Build ProductsBuild AI AgentsTest AI Systems | Voice Ai Prompt EngineerSelected for focus on Conversation Designer and AI fluency. | LALexson Ai | 🌏RemoteRemote | $3k | 1mo ago |
| Automate WorkflowsTest AI SystemsHelp Adopt AI | Senior Operations Manager - AISelected for focus on Ops and AI fluency. | STStepful | 🌏Austin / ...Remote | — | 1mo ago |