Remote | Professional Domain Expert (Government/Non-Profit) — $25–$35/hour

San Francisco, California, United States
-
Remote

Job Description:

We are sharing a specialised part-time consulting opportunity for experienced government and non-profit professionals with strong backgrounds in public sector operations, mission-driven advisory work, policy-sensitive decision-making, and practical real-world domain judgment.

This role supports an exciting collaboration with a leading frontier AI research laboratory focused on improving the quality, accuracy, and safety of AI-generated responses across subjects related to government and non-profit work.

Selected professionals will assess AI-generated responses across government and non-profit topics, helping improve the reliability of advanced AI systems in high-stakes contexts where inaccurate guidance can create serious practical risk. This opportunity is especially well-suited to professionals with real-world practitioner or advisory experience who can evaluate factual accuracy, regulatory correctness, reasoning quality, and practical usefulness with strong domain judgment.

Key Responsibilities

Professionals in this role may contribute to:

Prompt Writing & Real-World Scenario Design
Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance across government and non-profit contexts
Help ensure that evaluation tasks reflect practical, real-world usage scenarios
Contribute domain-informed examples that support high-quality model assessment workflows

AI Response Evaluation & Quality Review
Evaluate AI-generated responses for factual accuracy, regulatory correctness, and practical usefulness
Identify fabricated claims, incorrect references, or misleading reasoning across model outputs
Help maintain strong standards for quality and reliability in government and non-profit related AI evaluation tasks

Structured Scoring & Written Justification
Score and rank multiple model responses using structured rubrics across defined dimensions
Provide written justifications with specific evidence for each evaluation
Apply professional judgment consistently across high-stakes domain review workflows

Ideal Profile

Strong candidates may have:
Professional experience applying government or non-profit domain expertise in a practitioner or advisory capacity
Familiarity with industry-specific standards, regulations, policy frameworks, or operational expectations
Strong written communication and critical reasoning skills
Comfort evaluating both factual correctness and practical usefulness in domain-specific responses
The ability to apply structured judgment across high-quality evaluation tasks

Preferred qualifications

Experience with public sector operations, non-profit program management, grants, compliance, stakeholder coordination, or mission-driven service delivery
Familiarity with policy-sensitive workflows, regulatory environments, and public-interest decision-making contexts
Ability to assess nuanced domain responses for both accuracy and practical reliability
Comfort working within structured rubrics and written evaluation workflows

Why This Opportunity

Contribute specialised government and non-profit expertise to a cutting-edge AI collaboration
Help improve how advanced AI systems respond to real-world questions in public sector and mission-driven contexts
Work on high-impact evaluation tasks with strong practical relevance and clear real-world value
Flexible remote work with structured expectations and competitive hourly compensation

Contract Details

Independent contractor role
Fully remote with flexible scheduling
Hourly compensation of $25–$35 per hour
Expected commitment of approximately 20 hours per week
Application process includes resume submission and a Model Response Evaluation assessment
Projects may be extended, shortened, or concluded early depending on project needs and performance
Weekly payments via Stripe or Wise
Work will not involve access to confidential or proprietary information from any employer, client, or institution
Please note: We are unable to support H1-B or STEM OPT candidates at this time
Start date: Immediate

About the Platform

This opportunity is available through a leading AI-driven work platform that connects domain experts with frontier AI research projects.

Experts contribute to improving advanced AI systems by providing specialised expertise across real-world workflows, structured evaluation, model training support, and domain-specific content validation.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy: https://www.24-mag.com/privacy-policy