Job Description
We are seeking a visionary AI Infrastructure Architect to join Nexus Future Systems. As we prepare for the technological breakthroughs of 2026, you will be at the helm of designing the core infrastructure that supports our next-generation generative AI models. You will bridge the gap between theoretical research and scalable production engineering, ensuring our systems are not just functional, but future-proof.
At Nexus, we don't just build software; we architect the future. You will work with a world-class team of researchers and engineers to build robust, secure, and efficient pipelines that handle petabytes of data in real-time.
Responsibilities
- Design and implement scalable, high-availability infrastructure for AI workloads using Kubernetes and serverless technologies.
- Optimize deep learning model inference to reduce latency and maximize throughput on edge devices.
- Establish and enforce best practices for cloud security, data governance, and compliance in a multi-region environment.
- Collaborate closely with data science teams to streamline the MLOps lifecycle from experimentation to deployment.
- Drive architectural decisions that align with our long-term roadmap for 2026 and beyond.
Qualifications
- 10+ years of experience in software engineering, with at least 5 years in infrastructure or platform engineering.
- Deep expertise in Python, PyTorch, TensorFlow, and modern cloud-native tools.
- Strong proficiency in AWS, Azure, or GCP, with specific experience in AI/ML services.
- Proven track record of managing large-scale distributed systems under high load.
- Experience with containerization technologies (Docker, Kubernetes) and CI/CD pipelines.