Quality & Evaluation Manager - Generative AI

Other Jobs To Apply

Job Description

Quality & Evaluation Manager – Generative AI Job Title: Quality & Evaluation Manager – Generative AI Location: Remote/Hybrid (Flexible) Position Type: Full-time iMerit is a leading data services company powering the next generation of artificial intelligence. Our platform enables AI development teams to accelerate model training, deployment, and refinement through high-quality, scalable, and secure data pipelines. We partner with organizations across industries to ensure their AI initiatives are grounded in robust, ethical, and performant data solutions. We are looking for an experienced Quality & Evaluation Manager to take ownership of the quality management function across our portfolio of Generative AI projects. This role is responsible for ensuring evaluation integrity, reviewer consistency, scalable audit operations, and high-quality model training data across complex GenAI workflows.

The ideal candidate combines operational rigor, analytical thinking, strong judgment, and the ability to navigate ambiguity in rapidly evolving AI environments. You will be the key quality leader: managing audit teams, resolving complex edge cases, designing standardization processes, and acting as the primary quality liaison with clients. What you’ll do • Quality Leadership & Standardization: %CB; Lead, mentor, and align reviewer, auditor, and evaluation teams across to ensure strict adherence to quality standards and client requirements. %CB; Design and oversee robust quality audit frameworks, tracking key metrics like accuracy, redo percentage, coverage, and turnaround time. %CB; Use quality analytics, audit insights, and operational telemetry to identify trends, predict quality risks, and drive targeted improvement initiatives. %CB; Serve as the final escalation point for complex/ambiguous cases, creating comprehensive documentation to standardize decision-making across team members. • Client & Stakeholder Alignment: %CB; Act as the primary Quality Point-of-Contact (POC) for client stakeholders, delivering regular performance reports, actionable insights, and strategic quality plans. %CB; Coordinate and lead both internal and client-facing calibration sessions to ensure absolute consistency in data interpretation and output quality. %CB; Manage constantly evolving guidelines and requirements, keeping up to date with changes and communicating them to stakeholders. • Evaluation Governance & Quality Intelligence %CB; Define and continuously refine evaluation rubrics, calibration standards, and audit methodologies for GenAI workflows. %CB; Monitor evaluator consistency through inter-rater agreement analysis and targeted calibration interventions %CB; Identify systemic quality risks, ambiguous guideline interpretations, and edge-case trends to improve evaluation integrity. %CB; Partner with DataOps, Training, and Product teams to improve measurable quality outcomes at scale. • Process Improvement: %CB; Drive continuous improvement initiatives focused on increasing overall quality, minimizing errors and enabling processes to scale effectively. %CB; Collaborate cross-functionally with Operations, Training, and Product/Tech teams to proactively resolve recurring quality challenges and implement systemic solutions. • Communication %CB; Communicate complicated, technical concepts to non-technical audiences. %CB; Coach evaluators and reviewers to improve reasoning quality, guideline interpretation, consistency, and decision-making accuracy. What you’ll need • Experience: 5–8 years in Quality Operations, QA, or Quality Leadership roles, including at least 2–3 years in AI/ML, Generative AI, data annotation, model evaluation, or related human-in-the-loop workflows. • Domain Knowledge: Strong, practical knowledge of QA processes, including audits, calibrations, feedback, and reporting. • AI Familiarity: Familiarity with the use cases and quality challenges specific to Generative AI (e.g., reasoning, classification, summarization, red-teaming, multimodal tasks). • Skills: Strong analytical and data interpretation skills to drive quality decisions, and the ability to clearly communicate decisions and recommendations with stakeholders and clients. • Work Environment: Proven success working remotely or in hybrid team environments. • Platform/Tooling Exposure: Familiarity with annotation/evaluation platforms, quality dashboards, workflow tooling, and data-driven operational reporting. • Ambiguity Navigation: Ability to operate effectively in fast-changing, ambiguity-heavy environments with evolving quality standards and workflows. Preferred Experience: • Bachelor’s or Master’s degree in a relevant field (Linguistics, Social Sciences, Philosophy, or related discipline). • Proven track record of successfully leading quality operations on large-scale, global AI data annotation projects. • Demonstrable experience in client relationship management and effective team mentorship. What We Offer: • A flexible hybrid working model tailored for optimal work-life balance. • Competitive compensation and a comprehensive benefits package. • A collaborative, innovative, and inclusive environment. • Continuous professional growth and learning opportunities. Join us to lead groundbreaking projects in Generative AI, making a tangible impact in the evolving field of artificial intelligence.

Back to blog

Common Interview Questions And Answers

1. HOW DO YOU PLAN YOUR DAY?

This is what this question poses: When do you focus and start working seriously? What are the hours you work optimally? Are you a night owl? A morning bird? Remote teams can be made up of people working on different shifts and around the world, so you won't necessarily be stuck in the 9-5 schedule if it's not for you...

2. HOW DO YOU USE THE DIFFERENT COMMUNICATION TOOLS IN DIFFERENT SITUATIONS?

When you're working on a remote team, there's no way to chat in the hallway between meetings or catch up on the latest project during an office carpool. Therefore, virtual communication will be absolutely essential to get your work done...

3. WHAT IS "WORKING REMOTE" REALLY FOR YOU?

Many people want to work remotely because of the flexibility it allows. You can work anywhere and at any time of the day...

4. WHAT DO YOU NEED IN YOUR PHYSICAL WORKSPACE TO SUCCEED IN YOUR WORK?

With this question, companies are looking to see what equipment they may need to provide you with and to verify how aware you are of what remote working could mean for you physically and logistically...

5. HOW DO YOU PROCESS INFORMATION?

Several years ago, I was working in a team to plan a big event. My supervisor made us all work as a team before the big day. One of our activities has been to find out how each of us processes information...

6. HOW DO YOU MANAGE THE CALENDAR AND THE PROGRAM? WHICH APPLICATIONS / SYSTEM DO YOU USE?

Or you may receive even more specific questions, such as: What's on your calendar? Do you plan blocks of time to do certain types of work? Do you have an open calendar that everyone can see?...

7. HOW DO YOU ORGANIZE FILES, LINKS, AND TABS ON YOUR COMPUTER?

Just like your schedule, how you track files and other information is very important. After all, everything is digital!...

8. HOW TO PRIORITIZE WORK?

The day I watched Marie Forleo's film separating the important from the urgent, my life changed. Not all remote jobs start fast, but most of them are...

9. HOW DO YOU PREPARE FOR A MEETING AND PREPARE A MEETING? WHAT DO YOU SEE HAPPENING DURING THE MEETING?

Just as communication is essential when working remotely, so is organization. Because you won't have those opportunities in the elevator or a casual conversation in the lunchroom, you should take advantage of the little time you have in a video or phone conference...

10. HOW DO YOU USE TECHNOLOGY ON A DAILY BASIS, IN YOUR WORK AND FOR YOUR PLEASURE?

This is a great question because it shows your comfort level with technology, which is very important for a remote worker because you will be working with technology over time...