IT Craft is looking for an AI/ML Architect to join a short-term project on enterprise AI solutions as a consultant.
Type of collaboration: Short-term consulting project with the possibility of further cooperation.
Objective: Support IT Craft in shaping proposals for AI/ML solutions (chatbots, AI agents, etc.) that integrate GPU-accelerated frameworks. The consultant will act as the AI/ML architect and help prepare a solution architecture to present to a client.
Expected results:
- Analysis of the AI solutions portfolio (chatbots, AI agents).
- Definition of how GPU acceleration and AI frameworks (LLMs, inference servers, conversational AI platforms) are integrated into these solutions.
- Preparation of a high-level architectural diagram for AI/ML solutions based on the NVIDIA ecosystem.
- Advice on how to position tools and technologies (NVIDIA NeMo, CUDA stack, etc.) for enterprise clients.
- Participation in a client session to present the solution.
- Possibility of further collaboration on client projects.
Required skills and experience:
- Experience in designing AI/ML solutions.
- Hands-on experience with:
  - NVIDIA NeMo (LLMs, conversational AI)
  - Triton Inference Server
  - Riva, CUDA, and GPU frameworks
- Understanding of enterprise AI use cases (chatbots, AI agents, RAG pipelines, NLP/LLM deployments).
- Ability to translate technical capabilities into business value for clients.