Browse 1,020 jobs
Open roles across 12 specializations at Global AI Workforce.
Results
85 resultsPage 1 of 4
GAW-00936
Agent Trajectory Reviewer
Helix AI · Remote, Global
EvaluationCritical ReasoningTest Design
$25 – $42 / hr1d ago
GAW-00937
Senior Code Execution Validator
Prism AI · New York, NY
Critical ReasoningTest DesignEvaluation
$136k – $188k / yr10d ago
GAW-00938
Senior Eval Set Author
Polaris ML · Remote, Global
Test DesignRubricsEvaluationCritical Reasoning
$45 – $63 / hr14d ago
GAW-00939
Tool-use Evaluator
Prism AI · San Francisco, CA
RubricsEvaluation
$33 – $56 / hr22d ago
GAW-00940
Benchmark Designer
Prism AI · Dubai, UAE
Critical ReasoningTest Design
$25 – $38 / hr16d ago
GAW-00941
Expert Code Execution Validator
Vector Foundry · Remote, Global
EvaluationRubricsCritical Reasoning
$176k – $273k / yr22d ago
GAW-00942
Bias Auditor
Prism AI · Remote, Global
Critical ReasoningEvaluationRubrics
$4,155 – $7,128 / project7d ago
GAW-00943
Long-context QA
Helix AI · Remote, Global
Test DesignCritical Reasoning
$4,257 – $7,233 / project27d ago
GAW-00944
Expert Tool-use Evaluator
Beacon Research · Toronto, CA
EvaluationCritical Reasoning
$63 – $99 / hr25d ago
GAW-00945
Senior Eval Set Author
Quanta Models · Amsterdam, NL
EvaluationRubricsTest Design
$45 – $59 / hr3d ago
GAW-00946
Expert Code Execution Validator
Nimbus Labs · Remote, Global
RubricsTest DesignEvaluation
$63 – $110 / hr8d ago
GAW-00947
Long-context QA
Foundry-7 · Remote, Global
Critical ReasoningTest Design
$25 – $36 / hr27d ago
GAW-00948
Math Reasoning Grader
Quanta Models · Remote, Global
RubricsTest Design
$25 – $33 / hr29d ago
GAW-00949
Expert Math Reasoning Grader
Polaris ML · Bengaluru, IN
EvaluationCritical ReasoningRubricsTest Design
$176k – $262k / yr23d ago
GAW-00950
Expert Math Reasoning Grader
Vector Foundry · Bengaluru, IN
RubricsCritical Reasoning
$63 – $103 / hr29d ago
GAW-00951
Bias Auditor
Apex Cognition · Paris, FR
Critical ReasoningEvaluationTest Design
$1,836 – $4,338 / project19d ago
GAW-00952
Agent Trajectory Reviewer
Lattice Intelligence · Remote, Global
RubricsCritical ReasoningTest DesignEvaluation
$33 – $52 / hr9d ago
GAW-00953
Benchmark Designer
Helix AI · London, UK
EvaluationTest DesignRubrics
$80k – $123k / yr9d ago
GAW-00954
Expert Bias Auditor
Continuum AI · Remote, Global
EvaluationCritical ReasoningTest Design
$63 – $88 / hr11d ago
GAW-00955
Safety Researcher
Apex Cognition · Remote, Global
Test DesignCritical ReasoningRubrics
$80k – $125k / yr22d ago
GAW-00956
Bias Auditor
Polaris ML · Tokyo, JP
EvaluationCritical Reasoning
$1,911 – $6,506 / project1d ago
GAW-00957
Long-context QA
Polaris ML · Berlin, DE
RubricsCritical Reasoning
$3,773 – $9,931 / project17d ago
GAW-00958
Senior Hallucination Reviewer
Continuum AI · Remote, Global
RubricsEvaluationTest Design
$4,422 – $10,804 / project17d ago
GAW-00959
Hallucination Reviewer
Cascade Labs · New York, NY
EvaluationCritical ReasoningRubrics
$3,163 – $5,996 / project13d ago
1 / 4