DevOps Consultant (GPU Infrastructure and Observability)

IT Craft invites a DevOps Consultant to join a short-term project in the field of GPU infrastructure and observability.

Type of collaboration: Short-term consulting project with the possibility of further cooperation.

Objective: Support IT Craft in preparing for a call with a potential corporate client by analyzing and refining GPU infrastructure use cases. The consultant will help create an architectural diagram and demonstrate how GPU technologies integrate into our AI/ML solutions.

Expected results:

  • Analysis of existing use cases (GPU provisioning and Kubernetes management for AI/ML workloads).
  • Preparation of an architectural diagram that integrates GPU orchestration, monitoring, and scheduling.
  • Participation in preparatory sessions to align client messaging.
  • Possibility of further collaboration on upcoming AI/ML projects.

Required skills and experience:

  • Experience in DevOps and Kubernetes for AI/ML workloads.
  • Hands-on experience with:
    • GPU infrastructure (DGX/HGX class)
    • ArgoCD / GitOps
    • Run:ai for GPU workload orchestration
    • Monitoring stack (Prometheus, Grafana, Alertmanager, NVIDIA DCGM Exporter)
  • Experience in creating and presenting architectural diagrams.
