Who are We? 
 
 
 
 
 Field AI is transforming how robots interact with the real world. We are building risk-aware, reliable, and field-ready AI systems that address the most complex challenges in robotics, unlocking the full potential of embodied intelligence. We go beyond typical data-driven approaches or pure transformer-based architectures, and are charting a new course, with already-globally-deployed solutions delivering real-world results and rapidly improving models through real-field applications. Learn more at https://fieldai.com . 
 
 
 
 
 About the Job 
 
 
 
 
 Our Field Foundation Model (FFM) powers a global fleet of autonomous robots that capture massive streams of multimodal data across diverse, dynamic environments every day. As part of the Insight Team our mission is to transform this raw, multimodal data into actionable insights that empower our customers and engineers to deliver value. Field-insight Foundation Model (FiFM) is at the core of how we transform multimodal data from autonomous robots into actionable insights. As a Senior Machine Learning Platform Engineer , you will own the infrastructure that powers FiFM , from model hosting and distributed training pipelines to data systems, observability, and security.This is a role at the intersection of systems engineering and machine learning. You’ll design and operate large-scale ML platforms , ensure FiFM transitions smoothly from research into production, and optimize for both performance and cost across cloud and edge. In addition to building core infrastructure, you’ll play a leadership role by mentoring junior engineers, setting technical direction, and raising the engineering bar across the team.
What You’ll Get To Do:
• Design and manage scalable ML infrastructure with IaC tools (Terraform, CloudFormation).
• Develop and optimize cloud-based pipelines for training, evaluation, and inference on multimodal datasets.
• Build and operate data systems for large-scale video ingestion, indexing, and storage.
• Maintain MLOps workflows for versioning, experiment tracking, reproducibility, and CI/CD.
• Ensure reliability and observability with monitoring, logging, and alerting.
• Collaborate with AI/ML Engineers to productionize workflows.
• Optimize infrastructure for performance and cost across cloud and edge.
• Enforce best practices in security, compliance, and maintainability.
• Mentor and manage junior engineers , providing technical guidance and career development.
The Extras That Set You Apart:
• Experience with vector databases (OpenSearch, Pinecone, Weaviate) for indexing and retrieval.
• Familiarity with distributed training frameworks (Horovod, DDP/FSDP, DeepSpeed, Ray).
• Hands-on experience with GPU orchestration and auto-scaling (Karpenter, SageMaker, EKS).
• Experience with agentic AI deployment workflows , orchestration frameworks, and retrieval-augmented generation.
• Strong knowledge of security and compliance in ML and cloud environments.