About the job Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo , Larry Summers , and Jack Dorsey . Position: Bilingual Italian Generalist Evaluator Expert Type: Contract Compensation: $25–$30/hour Location: Remote Duration: 2–4 months Commitment: 20+ hours/week Role Responsibilities Author Italian/English prompt-golden answer pairs to train and evaluate advanced language models. Create detailed prompts in Italian and/or English, ensuring natural phrasing and real-world relevance for Italian-speaking users in Switzerland and Italy contexts. Establish high-level expectations for correct responses and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions. Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Italian, comparing results against English where needed. Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor, maintaining consistency and reliability across Italian-language benchmarks. Qualifications Must-Have Native-level fluency in Italian (written), specific to Switzerland or Italy usage, with strong reading/writing ability in English. Must be native to Switzerland or Italy and have lived in or spent significant time in-country , with deep cultural and linguistic familiarity. BS or BA from a reputable institution (completed or in progress). Strong writing and critical thinking skills. Ability to work independently and meet deadlines. Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests. Based in Switzerland or Italy (or able to reliably produce Switzerland- or Italy-specific, culturally accurate Italian). Preferred Experience in teaching, research, editing, or academic writing. Experience creating evaluation criteria, rubrics, or grading guidelines. Familiarity with LLMs , prompting, or model evaluation. Application Process (Takes 20–30 mins to complete) Complete an AI-led interview (about 15 minutes). If approved, complete a paid assessment focused on writing and rubric creation. Then, if selected, you will be invited to work on the project. Resources & Support For details about the interview process and platform information, please check: For any help or support, reach out to: PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity. #J-18808-Ljbffr

Italian Evaluator Expert - Fully Remote

MERCOR

Lavori simili

Addetto/A Al Magazzino Amazon A Jesi (Ancona)

JOBTOME

Addetto/A Al Magazzino Amazon A Jesi (Ancona)

JOBTOME

Consulente Di Vendita

JOBTOME

Addetto/A Al Magazzino Amazon A Jesi (Ancona)

ADECCO

Field Service & Survey Engineer.

LHH

Field Service & Survey Engineer.

LHH

Field Service & Survey Engineer.

LHH

Ricevi lavori simili via email