Who we are At Twilio, we’re shaping the future of communications, all from the comfort of our homes. We deliver innovative solutions to hundreds of thousands of businesses and empower millions of developers worldwide to craft personalized customer experiences.
Remote job listings
Find your next remote opportunity from thousands of listings across the globe.
Who we are At Twilio, we’re shaping the future of communications, all from the comfort of our homes. We deliver innovative solutions to hundreds of thousands of businesses and empower millions of developers worldwide to craft personalized customer experiences.
Remote opportunity for experienced State & Local Tax (SALT) professionals to evaluate AI-generated tax analysis, review complex multi-state tax scenarios, create tax research benchmarks, and improve AI reasoning across corporate, sales and use tax, nexus, apportionment, and compliance topics.
Evaluate AI-generated sales and go-to-market deliverables including reports, spreadsheets, and presentations while providing expert commercial, revenue, and GTM quality assessments.
Remote opportunity for experienced process engineers to create visual reasoning benchmark problems using real-world industrial engineering artifacts such as PFDs, P&IDs, control-loop diagrams, and plant layouts for frontier AI evaluation.
Evaluate AI-generated clinical, biomedical, and pharmaceutical deliverables including reports, spreadsheets, and presentations while providing expert scientific and healthcare quality assessments.
Evaluate AI-generated government and public administration deliverables including reports, spreadsheets, and presentations while providing expert policy and public-sector quality assessments.
Who we are At Twilio, we’re shaping the future of communications, all from the comfort of our homes. We deliver innovative solutions to hundreds of thousands of businesses and empower millions of developers worldwide to craft personalized customer experiences.
Design graduate-level computational problems using PyMC, PyStan, FEniCS, FEniCSx, GUDHI, and related tools across Bayesian statistics, numerical PDEs, and computational topology — calibrating tasks against frontier AI models to build AI reasoning benchmarks.
Masters/PhD biologists to design and solve challenging Biology problems for LLM evaluation — from undergraduate to PhD-level topics in Biology, Biotechnology, and Biochemistry. 4+ hours/day with PST overlap.
US-based academic dermatologists (Assistant Professor+) to lead clinical review on complex dermatology cases for frontier AI model data curation. Board-certified, active faculty, 3+ years post-residency.
Mathematicians (BS to PhD) to develop advanced mathematical models and evaluation benchmarks for frontier LLMs — covering algebra, number theory, topology, analysis, probability, and applied mathematics. 4+ hours/day with PST overlap.
PhD/Postdoc physicists to design and solve challenging Physics problems that evaluate the limitations of large language models — from undergraduate to advanced PhD-level topics. At least 4 hours/day, up to 40 hours/week, with 4-hour PST overlap.
What We're Researching We're running a paid study on the investment behaviors and strategies of high-net-worth individuals. The goal is to understand how experienced investors navigate shifting markets, allocate their assets, and make long-term financial decisions.
Design graduate-level computational problems using scikit-fem or similar finite element libraries for beam analysis, elasticity problems, and computational mechanics — calibrating tasks against frontier AI models to build advanced AI reasoning benchmarks.
Design graduate-level computational problems using PySCF for quantum chemistry calculations including Hartree-Fock, DFT, TDDFT, CASSCF, and post-HF methods — calibrating tasks against frontier AI models to build advanced AI reasoning benchmarks.
Design graduate-level computational problems using scanpy, scvelo, squidpy, and gudhi for single-cell RNA-seq analysis, trajectory inference, and spatial transcriptomics — calibrating tasks against frontier AI models to build advanced AI reasoning benchmarks.
Design graduate-level computational problems using astropy and related tools for cosmological calculations, angular power spectra, and galaxy survey analysis — calibrating tasks against frontier AI models to build advanced AI reasoning benchmarks.
Design graduate-level computational problems using libRoadRunner, Tellurium, or SBML-based tools for compartmental PK/PD modeling, enzyme kinetics, and systems biology simulations — calibrating tasks against frontier AI models to build advanced AI reasoning benchmarks.
Design graduate-level computational problems using scikit-hep and related HEP Python tools for particle physics data analysis, cross-section computations, and perturbative QCD — calibrating tasks against frontier AI models to build advanced AI reasoning benchmarks.
Design graduate-level computational problems using scikit-rf and ngspice for RF/microwave network analysis, S-parameter characterization, circuit simulation, and frequency response — calibrating tasks against frontier AI models to build advanced AI reasoning benchmarks.
Design graduate-level computational problems using ObsPy or SPECFEM for seismic waveform analysis, travel-time tomography, moment tensor inversion, and synthetic seismogram generation — calibrating tasks against frontier AI models to build advanced AI reasoning benchmarks.
About Solace Healthcare in the U.S. is fundamentally broken. The system is so complex that 88% of U.S. adults do not have the health literacy necessary to navigate it without help.
Mercor is recruiting UK-based top business school graduates and faculty to create and review business case study scenarios for AI training at a leading foundational model lab. 20–70/hr, fully remote.
Evaluate AI-generated engineering, manufacturing, and technical operations deliverables including reports, spreadsheets, and presentations while providing expert technical and operational quality assessments.
Evaluate AI-generated branding, creative direction, and marketing collateral deliverables including reports, spreadsheets, and presentations while providing expert creative and strategic quality assessments.
Evaluate AI-generated customer success and support operations deliverables including reports, spreadsheets, and presentations while providing expert customer experience and service quality assessments.
Evaluate AI-generated legal contracts, diligence reviews, and redline analyses including reports, spreadsheets, and presentations while providing expert legal and risk-focused quality assessments.
Evaluate AI-generated humanities, arts, and culture deliverables including reports, spreadsheets, and presentations while providing expert scholarly, cultural, and analytical quality assessments.
Evaluate AI-generated media, journalism, and communications deliverables including reports, spreadsheets, and presentations while providing expert editorial and communications quality assessments.
Evaluate AI-generated nonprofit, philanthropy, and community program deliverables including reports, spreadsheets, and presentations while providing expert social impact and program quality assessments.
Evaluate AI-generated incident management, reliability engineering, and SRE deliverables including reports, spreadsheets, and presentations while providing expert operational and technical quality assessments.
Evaluate AI-generated real estate, hospitality, and event management deliverables including reports, spreadsheets, and presentations while providing expert industry-focused quality assessments.
Evaluate AI-generated training, onboarding, and learning & development deliverables including reports, spreadsheets, and presentations while providing expert instructional design and workforce development quality assessments.
Evaluate AI-generated financial-services compliance, regulatory response, and AI governance deliverables including reports, spreadsheets, and presentations while providing expert regulatory and risk-focused quality assessments.
Evaluate AI-generated privacy and regulatory compliance deliverables including reports, spreadsheets, and presentations while providing expert governance, risk, and compliance quality assessments.
Evaluate AI-generated biology and environmental science deliverables and provide expert scientific quality assessments and feedback. Apply real-world scientific expertise to determine whether outputs meet professional standards.
Evaluate AI-generated legal and compliance deliverables and provide expert regulatory, governance, and legal quality assessments. Apply real-world legal and compliance expertise to determine whether outputs meet professional standards.
Evaluate AI-generated healthcare and clinical deliverables, providing expert medical and healthcare-focused quality assessments. Apply real-world expertise to determine whether outputs meet professional standards.
Evaluate AI-generated pricing, ROI, and revenue economics deliverables and provide expert business-focused feedback and quality assessments. Apply real-world business and economic expertise to determine whether outputs meet professional standards.
As the pioneer of the Agentic Web Marketing Platform, we're redefining how teams Build, Manage, and Optimize for the web — combining visual development, powerful content management systems, AI-driven personalization, seamless hosting, and end-to-end analytics in a single,…
Who we are At Twilio, we’re shaping the future of communications, all from the comfort of our homes. We deliver innovative solutions to hundreds of thousands of businesses and empower millions of developers worldwide to craft personalized customer experiences.
Director of Revenue Accounting About the Role Hopper is hiring a Director of Revenue Accounting to own our end-to-end revenue recognition and partner economics function — building it from the ground up.
Director of Revenue Accounting About the Role Hopper is hiring a Director of Revenue Accounting to own our end-to-end revenue recognition and partner economics function — building it from the ground up.
Directeur(trice) de la comptabilité des revenus À propos du poste Hopper est à la recherche d'un(e) directeur(trice) de la comptabilité des revenus pour prendre en charge notre fonction de reconnaissance des revenus et d'économie des partenaires, de bout en bout — et ce, à…
About Equip Equip is the leading virtual, evidence-based eating disorder treatment program on a mission to ensure that everyone with an eating disorder can access treatment that works.
Sprinto is an AI-native GRC platform that helps organisations manage risks, audits, vendor oversight, and continuous monitoring from a single connected platform.
Job Req: SLSQ427R270 Locations: San Francisco/ Seattle The Cloud & AI Partnerships team is responsible for establishing durable long-term alliances with the world's most important technology companies in the Data & AI ecosystem.We are looking for a Sr.
Tips for finding remote jobs
- Set up job alerts on multiple platforms to never miss an opportunity.
- Highlight your remote work experience and self-management skills.
- Prepare for video interviews and remote work assessments.
- Customize your resume and cover letter for each remote position.
- Build a strong online presence on LinkedIn and professional networks.