Global opportunities

Remote job listings

Find your next remote opportunity from thousands of listings across the globe.

76 remote jobs found
Red Team / Safety Evaluator Contractor Short-term ↻ Reposted
Data Annotation Technical Writing Cybersecurity +12 LLM Evaluation English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Marathi Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 11, 2026

Mercor seeks bilingual AI Safety Experts fluent in English and Marathi to perform red teaming and adversarial testing on AI systems, identifying vulnerabilities and generating safety data to improve AI robustness. This remote contract role involves structured evaluation, documentation, and collaboration with leading AI researchers.

Bilingual LLM Evaluator Contractor Short-term
Proofreading Fact-Checking Research +12 Critical Thinking Data Annotation LLM Evaluation Dutch English Editing Quality Assurance Transcription Annotation Accuracy Instruction Following Analytical Reasoning
  • United Kingdom, United States, Singapore
  • $50/hr
  • Jun 11, 2026

Remote opportunity for Dutch and English bilingual professionals to perform transcription, annotation, audio evaluation, rubric development, and AI model benchmarking for leading AI research projects.

Bilingual LLM Evaluator Contractor · Part-time ↻ Reposted
Transcription Localization Editing +11 Proofreading Fact-Checking Quality Assurance LLM Evaluation Research English Argentinian Spanish Annotation Audio Evaluation Linguistics Rubric Development
  • United Kingdom, United States
  • $50/hr
  • Jun 11, 2026

Remote AI evaluation opportunity for bilingual Argentinian Spanish and English speakers to perform transcription, annotation, audio evaluation, rubric development, and language model benchmarking for leading AI research projects.

Agent System Evaluator Contractor · Full-time
AI Training Engineering +12 Teaching Flexible Organization Reasoning Testing Python JavaScript TypeScript Java C++ Go Rust
  • Remote (Global) (US: WA)
  • $80 – $120/hr
  • Jun 11, 2026

Senior Software Engineer, Agentic Coding (AI Training). About the role: What if your software engineering expertise could define how the next generation of AI writes, debugs, and ships code on its own?

Agent System Evaluator Contractor · Part-time ↘ +45 regions
English AI Testing +12 Infrastructure History Quality Assurance Engineering Writing Python FastAPI JavaScript TypeScript React Docker PostgreSQL
  • Ireland, Belgium, Denmark, Finland, Norway, Sweden
  • $50
  • Jun 10, 2026

Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems.

Red Team / Safety Evaluator Contractor Short-term
Cybersecurity LLM Evaluation Technical Writing +11 English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Odia Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 6, 2026

Help leading AI labs identify vulnerabilities and improve AI safety through adversarial testing, red teaming, prompt injection analysis, and multilingual evaluation in English and Odia. Mercor seeks bilingual AI Safety Experts to strengthen frontier AI systems' safety by generating high-quality safety data and documenting findings.

Red Team / Safety Evaluator Contractor Short-term
LLM Evaluation Data Annotation Technical Writing +12 Cybersecurity English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Gujarati Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 6, 2026

Mercor seeks bilingual AI Safety Experts fluent in English and Gujarati to perform red teaming and adversarial testing on AI systems, identifying vulnerabilities and generating safety data to improve AI robustness and trustworthiness. This fully remote contract role offers flexible scheduling and weekly payments.

Red Team / Safety Evaluator Contractor Short-term
LLM Evaluation Data Annotation Technical Writing +12 Cybersecurity English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Assamese Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 6, 2026

Mercor seeks bilingual AI Safety Experts fluent in English and Assamese to identify vulnerabilities and improve AI safety through adversarial testing, red teaming, prompt injection analysis, and multilingual evaluation. This fully remote contract role offers flexible scheduling and weekly payments, focusing on enhancing AI robustness and trustworthiness.

Red Team / Safety Evaluator Contractor Short-term
Cybersecurity Data Annotation Technical Writing +12 LLM Evaluation English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Punjabi Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 6, 2026

Help leading AI labs identify vulnerabilities and improve AI safety through adversarial testing, red teaming, prompt injection analysis, and multilingual evaluation in English and Punjabi. This contract role offers flexible remote work with weekly payments and the opportunity to collaborate with top AI researchers.

Red Team / Safety Evaluator Contractor Short-term
LLM Evaluation Data Annotation Technical Writing +12 Cybersecurity English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Malayalam Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 6, 2026

Mercor seeks bilingual AI Safety Experts fluent in English and Malayalam to perform red teaming and adversarial testing on AI systems, identifying vulnerabilities and generating safety data to improve AI robustness and trustworthiness. This fully remote contract role involves structured evaluation, documentation, and collaboration with leading AI researchers.

Red Team / Safety Evaluator Contractor Short-term
Cybersecurity Data Annotation Technical Writing +12 LLM Evaluation English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Telugu Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 6, 2026

Mercor seeks bilingual AI Safety Experts fluent in English and Telugu to perform red teaming and adversarial testing on AI models, identifying vulnerabilities and generating safety data to improve AI robustness. This fully remote contract role offers flexible scheduling and collaboration with leading AI researchers.

Red Team / Safety Evaluator Contractor Short-term
LLM Evaluation Data Annotation Technical Writing +12 Cybersecurity English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Tamil Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 6, 2026

Mercor seeks bilingual AI Safety Experts fluent in English and Tamil to identify vulnerabilities and improve AI safety through adversarial testing, red teaming, prompt injection analysis, and multilingual evaluation. This fully remote contract role offers flexible scheduling and focuses on generating high-quality safety data to enhance AI robustness.

Red Team / Safety Evaluator Contractor Short-term
LLM Evaluation Data Annotation Technical Writing +12 Cybersecurity English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Kannada Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 6, 2026

Mercor seeks bilingual AI Safety Experts fluent in English and Kannada to perform red teaming and adversarial testing on AI systems, identifying vulnerabilities and generating safety data to improve AI robustness and trustworthiness. This fully remote contract role offers flexible scheduling and weekly payments.

Red Team / Safety Evaluator Contractor Short-term
LLM Evaluation Data Annotation Technical Writing +12 Cybersecurity English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Urdu Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 6, 2026

Mercor seeks bilingual AI Safety Experts fluent in English and Urdu to perform red teaming and adversarial testing on AI systems, identifying vulnerabilities and generating safety data to improve AI robustness. This fully remote contract role offers flexible scheduling and collaboration with leading AI researchers.

Red Team / Safety Evaluator Contractor Short-term
LLM Evaluation Data Annotation Technical Writing +12 Cybersecurity English AI Safety Prompt Injection Jailbreaking Adversarial Testing Vulnerability Analysis Risk Management Safety Benchmarking Bengali Annotation Security Research
  • Remote (Global)
  • $20 – $22/hr
  • Jun 6, 2026

Help leading AI labs identify vulnerabilities and improve AI safety through adversarial testing, red teaming, prompt injection analysis, and multilingual evaluation in English and Bengali. Mercor seeks bilingual AI Safety Experts to strengthen frontier AI systems by generating safety data and evaluating misuse scenarios remotely on a flexible contract basis.

Bilingual LLM Evaluator Freelancer · Part-time
Translation English German +3 Microsoft Word Make English Proficiency
  • Germany
  • $32.9 – $43.8/hr
  • Jun 5, 2026

Join the CrowdGen team as an Independent Contractor for Project Vistula! We are currently looking for Independent Contractors who are native German speakers with strong English proficiency and a strong background in language expertise.

LLM Evaluator (English) Contractor · Part-time
LLM Evaluation Editing Proofreading +7 Critical Thinking Communication Attention to Detail Writing Content Development English Grammar Content Review
  • New Zealand
  • $25 – $30/hr
  • Jun 4, 2026

Remote AI evaluation opportunity for Secondary Education Teachers in New Zealand. Evaluate AI-generated content, complete writing and editing tasks, and help improve the quality and reasoning capabilities of AI systems.

Agent System Evaluator Contractor · Full-time
LLM Evaluation Quality Assurance Technical Writing +10 Process Improvement Attention to Detail Rubric Development Structured Observation Research Analysis Documentation SaaS Tools Evaluation Frameworks English Communication Reporting
  • Remote (Global)
  • $20 – $35/hr
  • May 29, 2026

Remote contract opportunity for professionals with strong analytical writing and evaluation skills to design AI assessment tasks, create scoring rubrics, and evaluate AI performance across real-world workflows.

Bilingual LLM Evaluator Contractor · Part-time
Transcription Balinese English +12 Linguistic Annotation Cultural Analysis context analysis Written communication Verbal communication Attention to Detail Organizational Skills Grammatical analysis Emotional tone analysis Remote Collaboration Timestamping Independent Work
  • Remote (Global)
  • $15 – $95/hr
  • May 27, 2026

Help train next-generation AI systems as a Balinese Bilingual Expert, applying your expertise to high-quality, real-world input. No prior AI experience required.

Data Entry Attention to Detail Problem Solving +11 Adaptability English Communication Multitasking Typing Remote Collaboration Organizational Skills Workflow Management Documentation Fast-Paced Work Communication Skills General Operations
  • Remote (Global)
  • $15 – $25/hr
  • May 19, 2026

Remote contractor opportunity for adaptable generalists to contribute real-world input, communication, multitasking, and operational support toward training and improving AI systems.

LLM & Agent Evaluation Contractor Short-term
Analytical Reasoning Financial Analysis Rubric Evaluation +10 LLM Evaluation Strategic Analysis Decision Making Quantitative Reasoning Qualitative Assessment Written communication Enterprise Operations Critical Thinking Operational Analysis Attention to Detail
  • Remote (Global)
  • $60 – $85/hr
  • May 17, 2026

Remote contract opportunity for experienced enterprise professionals to evaluate AI-generated reasoning, business analysis, and operational decision-making using structured scoring rubrics and evaluation frameworks.

Acting Role-Play Improvisation +11 Emotional Communication Performing Arts Voice Acting Character Performance Conversational Roleplay Emotional Intelligence Communication Skills Drama Theater Performance Empathy AI Safety Testing
  • Remote (Global)
  • $40/deliverable
  • May 16, 2026

Remote freelance opportunity for actors, performers, and emotionally skilled communicators to participate in AI safety role-play conversations focused on emotional distress and sensitive conversational evaluation.

LLM & Agent Evaluation Intern · Full-time
Multimodal AI LLM Evaluation Benchmark Design +10 Machine Learning Natural Language Processing Computer Vision PyTorch Hugging Face Transformers Data Annotation Statistical Analysis Research Methods Python Dataset Curation
  • United States
  • $40/hr
  • May 16, 2026

Remote research internship opportunity focused on multimodal LLM benchmarking, AI evaluation, dataset curation, and multimodal foundation model analysis across text, image, audio, and video systems.

LLM & Agent Evaluation Contractor · Part-time
Data Annotation LLM Evaluation LLM Prompt Engineering +10 Content Evaluation Relevance Ranking Summarization Transcription Translation Generative AI Large Language Models Critical Thinking Communication Skills AI Training Data
  • United States
  • $15/hr
  • May 16, 2026

Remote part-time opportunity for contributors interested in evaluating, labeling, summarizing, ranking, and improving generative AI and large language model systems through flexible project-based work.

Critical Thinking Analytical Reasoning Spatial Reasoning +11 Visual Understanding Multi-Modal Evaluation Problem Solving LLM Evaluation Real-World Reasoning Contextual Analysis Logical Reasoning Attention to Detail Written communication Ambiguity Handling Research Skills
  • Remote (Global)
  • $34 – $40/hr
  • May 14, 2026

Remote contract opportunity for analytically minded generalists to evaluate AI systems on real-world reasoning, visual understanding, and multi-modal problem-solving challenges.

LLM Evaluator (English) Contractor · Full-time Short-term
LLM Evaluation AI Writing Evaluation LLM Prompt Engineering +12 Content Evaluation Business Communication Academic Writing Critical Thinking Analytical Reasoning Quality Assurance Structured Feedback English Writing Editing Research Analysis Attention to Detail Evaluation Rubrics
  • United States, Canada
  • $20 – $23/hr
  • May 10, 2026

Evaluate AI-generated writing across business and academic domains to improve leading LLM systems. Remote retainer-based role paying $20–$23/hr.

Italian LLM Evaluation AI Annotation +12 Data Annotation Prompt Evaluation Content Review Localization Translation Quality Assurance Critical Thinking Reading Comprehension Attention to Detail Written communication AI Tools Structured Labeling
  • Italy
  • $10 – $14/hr
  • May 10, 2026

Review and evaluate Italian AI-generated responses to help improve large language models. Remote AI annotation role paying $10–$14/hr.

LLM Evaluator (English) Contractor Ongoing
Content Writing Editing Proofreading +12 Grammar Brand Voice AI Training LLM Evaluation Generative AI Content Review Writing Journalism Communications Attention to Detail English Prompt Evaluation
  • United States (US: CA, GA, IL +5)
  • From $25/hr
  • May 7, 2026

Evaluate and improve AI-generated writing through editing, grammar review, and content quality analysis in this flexible remote role.

LLM Evaluation German English +11 Data Annotation AI Training Research Analytical thinking Attention to Detail Fact-Checking Prompt Evaluation Content Review Written communication Reasoning Generative AI
  • Germany
  • $35 – $40/hr
  • May 7, 2026

Evaluate and rank AI-generated responses in English and German while helping improve next-generation AI systems remotely from Germany.

Tips for finding remote jobs

  • Set up job alerts on multiple platforms to never miss an opportunity.
  • Highlight your remote work experience and self-management skills.
  • Prepare for video interviews and remote work assessments.
  • Customize your resume and cover letter for each remote position.
  • Build a strong online presence on LinkedIn and professional networks.
