AI/ML Ops Engineer, Data & AI Team, Global eCommerce Platform. Total Annual Compensation upto $80,000 (Negotiate)

  • Onboarding
  • TECH.PMI
  • Hanoi, Ho Chi Minh

ABOUT THE ROLE

At Crossian, AI is not an experiment but a core production infrastructure powering a $200M data-driven eCommerce platform on its journey toward a $1B Unicorn.


The Data & AI team designs, deploys, and operates large-scale ML and LLM systems that enable personalization, automation, and intelligent decision-making across global markets. As an AI/ML Ops Engineer, you will be responsible for ensuring these AI platforms run reliably, securely, and cost-effectively on cloud infrastructure—bridging the gap between AI development and real-world production by operating LLM systems, RAG pipelines, and ML workflows at scale, where performance, observability, and uptime directly translate into business impact.

WHAT YOU WILL DO

  • Deploy, monitor, and maintain ML/AI systems on AWS infrastructure
  • Operate and optimize RAG systems, knowledge bases, and vector stores
  • Set up LLM monitoring and evaluation pipelines (Langsmith, custom solutions)
  • Manage orchestration workflows using Airflow and n8n
  • Ensure system reliability, cost efficiency, and scalability

WHAT WE ARE LOOKING FOR

Requirements & Skills

  • Bachelor's degree in Computer Science or a related field.
  • 3+ years in MLOps, DevOps, or platform engineering
  • Strong AWS experience: SageMaker, Lambda, ECS/EKS, S3, Bedrock
  • Hands-on with LLM infrastructure: vector DBs, embedding pipelines, RAG deployments
  • Experience with workflow orchestration (Airflow, n8n, or similar)
  • Experience with LLM observability tools (Langsmith, Langfuse, Weights & Biases)


Preferred

  • Proficient in infrastructure-as-code (Terraform, CloudFormation) and CI/CD
  • Background operating production chatbot or generative AI systems
  • Kubernetes and containerization expertise
  • Experience with data visualization tools, such as AWS QuickSight or Grafana.
  • Familiarity with the Agile toolset, such as GitLab, Jira, Confluence, and Slack.