Remote | Marathi-English AI Safety Red Team Evaluator — $20–$30/hour
name
8h ago
No Phone Required$20 - $30OtherUnited Stateshimalayas
AI-SafetyRed-Team-TestingAdversarial-MLTrust-and-SafetyAI-EvaluationRemote-AI-EvaluatorFreelance-AI-Red-Team-SpecialistFreelance-AI-EvaluatorAI-Security-EvaluatorMid-level
Job Description
We are sharing a specialised part-time consulting opportunity for Marathi-English bilingual professionals experienced in AI safety evaluation, red team testing, adversarial review, vulnerability classification, and structured feedback on sensitive text-based AI outputs.This role supports current and upcoming remote consulting opportunities focused on AI safety evaluation, bilingual red team testing, conversational model assessment, misuse-risk review, vulnerability annotation, and high-quality project execution. Selected professionals will test AI systems using structured adversarial scenarios, identify safety weaknesses, classify risks, and produce clear English-language evaluation artifacts across English and Marathi contexts.Key ResponsibilitiesProfessionals in this role may contribute to:Bilingual AI Safety & Red Team TestingReview English and Marathi AI outputs for safety, reliability, bias, misinformation, and harmful-behavior risksStress-test conversational AI models and agents using structured adversarial scenariosEvaluate model behavior across multi-turn conversations, sensitive topics, and edge-case promptsIdentify vulnerabilities that require stronger safety controls, clearer refusals, or improved response qualityVulnerability Classification & Risk ReviewAnnotate failures, classify vulnerabilities, and flag recurring safety patternsApply taxonomies, benchmarks, and project-specific playbooks to keep testing consistentAssess misuse cases, bias exploitation, prompt-injection scenarios, and socio-technical risk patterns at a high levelGenerate high-quality human evaluation data through careful review and structured judgmentReproducible Documentation & Evaluation ArtifactsProduce clear reports, datasets, test cases, and written summaries that support model improvementDocument findings reproducibly so results can be reviewed, compared, and acted uponExplain risks clearly for both technical and non-technical audiencesMaintain accuracy, consistency, and strong attention to detail across submitted evaluationsIdeal ProfileStrong candidates may have:Native-level fluency in both English and MarathiPrior experience in AI red teaming, adversarial testing, cybersecurity, trust and safety, socio-technical risk review, or conversational AI evaluationAbility to think adversarially while staying structured, careful, and methodicalExperience using frameworks, benchmarks, or rubrics rather than unstructured testing aloneStrong written communication skills and ability to explain safety findings clearlyComfort reviewing text-based content involving sensitive topics under clear guidelinesAdaptability across project types, safety categories, and evaluation workflowsEducational BackgroundFormal degree requirements may vary based on project needsBackgrounds in AI safety, cybersecurity, linguistics, policy, trust and safety, social science, psychology, writing, data evaluation, or technical analysis may be highly relevantPractical experience in red team testing, model evaluation, content risk analysis, or structured review work may also be valuableNice to HaveExperience with adversarial ML concepts, jailbreak datasets, prompt injection, RLHF/DPO attack patterns, or model behavior testingCybersecurity experience such as penetration testing, exploit analysis, reverse engineering, or security assessmentSocio-technical risk experience involving harassment, misinformation, abuse analysis, bias testing, or conversational AI safetyCreative probing background, including psychology, acting, writing, role-play design, or unconventional adversarial thinkingExperience producing reproducible reports, labeled datasets, structured risk notes, or benchmark-style evaluation artifactsWhy This OpportunityApply Marathi-English bilingual expertise to structured AI safety and red team evaluation workContribute to stronger, safer, and more reliable AI systems through careful adversarial testingWork on flexible assignments aligned with language skills, safety judgment, and structured analysisBuild experience in human data-driven AI safety evaluation and bilingual risk reviewRemote structure with competitive hourly compensationContract DetailsIndependent contractor roleFully remote with flexible schedulingEligible professionals may be based in approved project locations depending on project needsNative-level English and Marathi fluency are required for project workWork is text-based and may involve sensitive topics such as bias, misinformation, harassment, or harmful-behavior risksTopic areas will be communicated before exposure to content, and participation in higher-sensitivity projects may depend on candidate comfort and project fitPart-time commitment depending on project availabilityCompetitive rates between $20–$30 per hour depending on expertise and project scopeWeekly payments via Stripe or WiseProjects may be extended, shortened, or adjusted depending on scope and performanceWork will not involve access to confidential or proprietary info
