Remote | Bilingual Italian Generalist Evaluator Expert — Up to $30/hr

  • San Francisco, California, United States
  • Full-Time
  • Remote

Job Description:

We are sharing a specialised remote opportunity for bilingual Italian language experts from Switzerland or Italy who have strong writing skills and deep familiarity with local linguistic and cultural context. This role supports a leading AI research lab by contributing high-quality multilingual training data used to improve advanced large language models.

Experts will create Italian–English prompt and answer pairs, evaluate AI outputs, and help ensure that AI systems produce culturally accurate, fluent, and contextually appropriate responses for Italian-speaking users.

Key Responsibilities

Create detailed prompts in Italian and/or English that reflect real-world usage in Switzerland and Italy
Design multilingual prompt–response pairs that train and evaluate advanced AI models
Evaluate AI-generated responses for linguistic accuracy, tone, and cultural alignment
Develop evaluation rubrics that capture Italian linguistic nuance and regional conventions
Test AI outputs and assess model performance across Italian and English contexts
Contribute to quality assurance processes for Italian-language benchmarks

Ideal Profile

Strong candidates may have:

Native-level Italian proficiency specific to Switzerland or Italy usage
Deep familiarity with local language tone, cultural context, and regional linguistic conventions (including Swiss Italian)
Strong reading and writing ability in English
Bachelors degree (completed or in progress) from a reputable institution
Strong analytical, writing, and critical thinking skills
Ability to work independently and meet deadlines

Preferred experience:

Teaching, research, editing, or academic writing
Experience developing evaluation rubrics or grading frameworks
Familiarity with large language models, prompting, or model evaluation

Why This Opportunity

Contribute to frontier AI research focused on multilingual language understanding
Help shape how AI systems interpret and generate Italian-language content
Collaborate with a leading AI research lab on high-impact projects
Flexible remote work with competitive compensation

Contract Details

Independent contractor role
Fully remote with flexible scheduling
Experts typically contribute around 20 hours per week
Project duration is approximately 2–4 months
Weekly payments via Stripe or Wise
Projects may be extended or adjusted depending on performance and demand

About the Platform

This opportunity is available through a leading AI-driven work platform.