IT Craft invites a DevOps Consultant to join a short-term project in the field of GPU infrastructure and observability.
Type of collaboration: Short-term consulting project with the possibility of further cooperation.
Objective: Support IT Craft in preparing for a call with a potential corporate client by analyzing and refining GPU infrastructure use cases. The consultant will help create an architectural diagram and demonstrate how GPU technologies integrate into our AI/ML solutions.
Expected results:
- Analysis of existing use cases (GPU provisioning and Kubernetes management for AI/ML workloads).
- Preparation of an architectural diagram with the integration of GPU orchestration, monitoring, and scheduling.
- Participation in preparatory sessions to align the client messaging.
- Possibility of further collaboration on upcoming AI/ML projects.
Required skills and experience:
- Experience in DevOps and Kubernetes for AI/ML workloads.
- Hands-on experience with:
- GPU infrastructure (DGX/BGX class)
- ArgoCD / GitOps
- Run:ai for GPU workload orchestration
- Monitoring stack (Prometheus, Grafana, Alertmanager, NVIDIA DCGM Exporter)
- Experience in creating and presenting architectural diagrams.