New
As a Research Engineer focusing on Evaluations, you will be responsible for overseeing the evaluation process of models, ensuring they meet accuracy, latency, and feature-specific metrics. This role involves building and maintaining benchmarking pipelines, designing experiments, and collaborating with various teams to translate customer feedback into actionable evaluation criteria.
Posted 6 days ago
Conduct comprehensive model evaluations.
Establish and maintain benchmarking pipelines.
New
Conduct comprehensive model evaluations.
Develop and maintain benchmarking pipelines.
New
Lead end-to-end model evaluation processes.
Develop and maintain benchmarking pipelines.
New
Conduct comprehensive model evaluations.
Build and maintain benchmarking pipelines.
Posted 9 days ago
Architect alliances with hardware partners.
Identify decision-makers within partner organizations.
New
Implement and maintain cloud infrastructure with IaC.
Improve CI/CD pipelines for applications and ML workloads.
Posted 7 days ago
Evaluate models across accuracy and latency.
Build benchmarking pipelines for competitive analysis.
Posted 7 days ago
Support delivery of data center programs.
Manage timelines and project scope.
Posted 6 days ago
Conduct comprehensive model evaluations.
Establish and maintain benchmarking pipelines.
Posted 6 days ago
Partner with engineering leaders for sourcing plans.
Lead sourcing across infrastructure and AI technology.
Posted 6 days ago
Unify technology strategy and enhance decision-making.
Oversee cross-functional initiatives from start to finish.
Posted 3 days ago
Develop and maintain ML platform infrastructure.
Provide shared components for deployment and API design.
Posted 3 days ago
Build automation tools for resource delivery.
Collaborate with engineering teams for quality product delivery.
Posted 3 days ago
Lead strategic partnerships with key industry players.
Develop go-to-market strategies for AI and GPU deployments.
New
Ensure user privacy across data handling.
Develop tools for privacy enhancement.
New
Lead security and infrastructure strategy.
Manage and develop security teams.
New
Conduct comprehensive model evaluations.
Develop and maintain benchmarking pipelines.
New
Lead end-to-end model evaluation processes.
Develop and maintain benchmarking pipelines.
New
Conduct comprehensive model evaluations.
Build and maintain benchmarking pipelines.
New
Lead end-to-end model evaluation.
Build competitive benchmarking pipelines.
Posted 9 days ago
Serve as the primary contact for Aviation accounts.
Manage onboarding and account tasks post-signature.
Posted 7 days ago
Hiring for a remote Product Manager position.
Position is full-time and has no geographical restrictions.
New
Conduct comprehensive model evaluations.
Build and maintain benchmarking pipelines.
New
Support clients with data science methodologies.
Collaborate with Data Science teams.
New
Design optimization solutions for pricing and resource allocation.
Deploy and maintain machine learning models in production.
New
Serve as primary technical contact for clients.
Advise on experimental design and process decisions.
New
Leverage statistical analysis for insights on talent themes.
Support development of data-driven models for talent decisions.
New
Enhance Twitch's growth levers through tactical improvements.
Build a strategic plan for notifications platform evolution.
Posted 18 days ago
Lead research direction for advanced AI systems
Guide the design of cutting-edge RAG systems
Posted 18 days ago
Evaluate LLM-generated responses
Conduct fact-checking on model responses
Posted 18 days ago
Connect chemistry experts to AI projects
Improve AI model reasoning in chemistry
Posted 18 days ago
Support AI model development with expert mathematics input
Evaluate and refine AI-generated mathematical responses
Posted 18 days ago
Collaborate remotely on AI projects
Enhance generative AI with domain expertise
Posted 18 days ago
Enhance AI with civil engineering expertise
Generate and evaluate AI prompts
Posted 18 days ago
Improve conversational AI systems
Assess model-generated responses
Posted 18 days ago
Architect and maintain evaluation suites
Build scalable pipelines for model training
Posted 18 days ago
Define AI/ML agents for reliability
Prototype agent behaviours
Posted 18 days ago
Develop and maintain Python code for data analysis, model evaluation, and AI workflow automation.
Design and refine prompts for LLMs to optimize conversational performance.
Posted 18 days ago
Lead and own the Intelligence Catalog and taxonomy
Drive improvements in noise reduction and precision/recall metrics
New
Conduct comprehensive model evaluations.
Develop and maintain benchmarking pipelines.
New
Lead end-to-end model evaluation processes.
Develop and maintain benchmarking pipelines.
New
Conduct comprehensive model evaluations.
Build and maintain benchmarking pipelines.
New
Lead end-to-end model evaluation.
Build competitive benchmarking pipelines.
Posted 18 days ago
Drive adoption of Ubuntu Pro in enterprise settings
Understand and address customer requirements
Posted 18 days ago
Lead team towards high-impact solutions, Work collaboratively with scientific teams, Stay updated
cutting-edge tools, Develop novel assays, Efficiently allocate team
Posted 18 days ago
Understand user experiences with agentic AI systems
Gather insights from developers and practitioners in the field
Posted 18 days ago
Implement public-key cryptography for client security.
Facilitate device addition and revocation for user accounts.
Posted 18 days ago
Implement public-key cryptography for secure client-server communication.
Enable clients to manage device access through per-device keys.