AI Research Evaluator

New

Skills

AI evaluation AI reasoning analytical skills collaboration critical thinking data annotation English fluency error identification NLP STEM expertise

We are seeking detail-oriented reviewers with strong STEM backgrounds to support quality control on AI reasoning datasets. This remote role requires professional fluency in English, as you will be reviewing and validating technical content in the English language.

Key Responsibilities
  • Review and validate AI-generated responses in English for accuracy and reasoning quality.
  • Identify errors, inconsistencies, and areas for improvement in STEM-related datasets.
  • Provide detailed feedback to enhance model performance.
  • Collaborate with the team in English for effective project coordination.
Required Skills & Qualifications
  • Master's or PhD in a STEM field (Science, Technology, Engineering, or Mathematics).
  • Professional fluency in English (reading and writing required).
  • Strong English communication skills.
  • Strong analytical and critical thinking abilities.
  • Attention to detail.

No forms. Your profile is generated instantly.

Job Type: Remote

Salary: Not Disclosed

Experience: Entry

Duration: Months

Share this job:

Similar Jobs

AI Engineer/Data Scientist

Posted 64 days ago

Enhance AI chatbots using real data.

Clean and organize text data for model improvement.

AI Agents Chatbot Platforms Client Communication Conversational AI

AI Research Evaluator

Posted 64 days ago

Review AI-generated technical content for accuracy.

Validate reasoning quality in AI datasets.

AI Evaluation Analytical Skills Attention to Detail Critical Thinking

Conversational AI Engineer

Posted 63 days ago

Design and deploy AI chat solutions.

Build conversational flows using Dialogflow CX.

Agent Assist API Integrations Dialogflow CX Google CCAI

AI Research Evaluator

Posted 63 days ago

Review AI-generated responses for accuracy.

Identify errors in STEM datasets.

AI evaluation Analytical skills Attention to detail Collaboration

Developer - JaneX

Posted 62 days ago

Build and ship features for practitioners.

Collaborate with cross-functional teams.

AI integration Collaborative tools Continuous learning Embeddings

AI Research Evaluator Role

Posted 62 days ago

Review and validate AI-generated responses.

Identify errors and inconsistencies in datasets.

AI evaluation Analytical skills Attention to detail Critical thinking

AI Research Evaluator

Posted 62 days ago

To find reviewers for AI reasoning datasets.

To ensure accuracy in AI-generated responses.

AI evaluation Analytical skills Attention to detail Critical thinking

AI Research Evaluator

Posted 61 days ago

Review AI-generated responses for accuracy.

Identify errors in STEM-related datasets.

AI evaluation Analytical skills Attention to detail Critical thinking

AI Research Evaluator

Posted 61 days ago

Review AI-generated responses for accuracy.

Validate reasoning quality in datasets.

AI evaluation Analytical skills Attention to detail Critical thinking

Territory Account Executive

Posted 60 days ago

Support new business acquisition.

Convert inbound demand into sales.

Contract negotiation Customer onboarding English fluency Entrepreneurial mindset

Intelligent Automation Senior Manager

Posted 53 days ago

Lead a team of automation engineers.

Drive design and development of automation solutions.

AI/ML components Decision automation Document understanding Governance frameworks

Customer Success Engineering Manager

Posted 49 days ago

Lead and manage a customer success engineering team.

Ensure 24x7 support for strategic customers.

AI/ML Expertise Customer Success Management Databases Generative AI

Search Generalist Role

Posted 48 days ago

Evaluate advanced AI systems on search tasks.

Improve AI model outputs for factuality and helpfulness.

AI evaluation Collaboration Data analysis Familiarity with AI systems

eIDAS2 QES Consultant

Posted 47 days ago

Specializing in eIDAS2 compliance

Reviewing AI-generated outputs

AI compliance analytical skills communication skills consulting

AI Engineer Position

Posted 41 days ago

Develop generative AI for automation.

Train and deploy LLMs on proprietary data.

Android AWS Generative AI Google Cloud Platform

Machine Learning Engineer

Posted 27 days ago

Build and deploy ML/AI services.

Design production systems with LLMs and APIs.

API Development CI/CD Data-Driven Decision Making Distributed Systems

Senior Product Designer

Posted 21 days ago

Lead the design of props workflows.

Define interactions for new content types.

collaboration Figma interaction design product design

Machine Learning Engineer

Posted 15 days ago

Build and deploy ML/AI services.

Own ML systems from design to deployment.

APIs CI/CD Collaboration Deployment

Senior Engineering Leader

Posted 8 days ago

This exciting opportunity is for a Senior Engineering Leader to shape the future of discovery experiences through innovative search UX and AI integration. You will lead a talented engineering team, drive technical vision, and collaborate with product and design to elevate user experiences. If you are passionate about search and AI, and eager to tackle meaningful challenges that have a global impact, this role offers a competitive package and a chance to work with top-tier professionals in an inclusive environment.

Lead and grow the discovery engineering team.

AI Integration Cross-functional Collaboration LLMs Machine Learning

Machine Learning Engineer Role

Posted 6 days ago

This exciting opportunity as a Senior Machine Learning Engineer in AI Research involves designing, training, and evaluating advanced ML models for both research and practical applications. You will engage in innovative experiments to enhance model performance while collaborating with a diverse team to transform cutting-edge research into production-ready systems. With a focus on rigorous experimentation and optimization, this role offers a dynamic environment where your skills in machine learning can significantly impact the company's AI initiatives.

Conduct experiments for model improvements.

Computer Vision Hyperparameter Tuning Machine Learning MLOps