Remote | Bilingual Spanish Generalist Evaluator Expert — Up to $37/hr

San Francisco, California, United States
Full-Time
Remote

Job Description:

We are sharing a specialized remote opportunity for bilingual Spanish-language experts from the United States, Spain, Chile, or Mexico who have strong writing skills and deep familiarity with regional linguistic and cultural contexts. This role supports a leading AI research lab by contributing high-quality multilingual training data used to improve advanced large language models.

Experts will create Spanish–English prompt and answer pairs, evaluate AI outputs, and help ensure that AI systems produce culturally accurate, fluent, and contextually appropriate responses for Spanish-speaking users across multiple regions.

Key Responsibilities

Create detailed prompts in Spanish and/or English that reflect real-world usage in the United States, Spain, Chile, and Mexico
Design multilingual prompt–response pairs that train and evaluate advanced AI models
Evaluate AI-generated responses for linguistic accuracy, tone, and cultural alignment
Develop evaluation rubrics that capture Spanish linguistic nuance and regional conventions
Test AI outputs and assess model performance across Spanish and English contexts
Contribute to quality assurance processes for Spanish-language benchmarks

Ideal Profile

Strong candidates may have:

Native-level proficiency in Spanish as used in the United States, Spain, Chile, or Mexico
Deep familiarity with regional language tone, cultural context, and communication conventions
Strong reading and writing ability in English
Bachelor's degree (completed or in progress) from a reputable institution
Strong analytical, writing, and critical thinking skills
Ability to work independently and meet deadlines

Preferred experience:

Teaching, research, editing, or academic writing
Experience developing evaluation rubrics or grading frameworks
Familiarity with large language models, prompting, or model evaluation

Why This Opportunity

Contribute to frontier AI research focused on multilingual language understanding
Help shape how AI systems interpret and generate Spanish-language content across multiple regions
Collaborate with a leading AI research lab on high-impact projects
Flexible remote work with competitive compensation

Contract Details

Independent contractor role
Fully remote with flexible scheduling
Experts typically contribute around 20 hours per week
Project duration is approximately 2–4 months
Weekly payments via Stripe or Wise
Projects may be extended or adjusted depending on performance and demand

About the Platform

This opportunity is available through a leading AI-driven work platform.