Job Description
Are you ready to architect the future of intelligent systems? Nebula Future Systems is seeking a visionary Year 1 AI Systems Architect to lead the foundational infrastructure for our next-generation AI platform. In this pivotal role, you will bridge the gap between theoretical research and production-grade engineering, ensuring our AI models are scalable, secure, and transformative. Join a team of elite engineers and data scientists dedicated to pushing the boundaries of what's possible in 2026 and beyond.
Why Join Us?
- Work on cutting-edge Generative AI and Large Language Model infrastructure.
- Competitive compensation package with equity options.
- Flexible remote-first culture with a hub in the heart of San Francisco.
- Access to state-of-the-art hardware and research facilities.
Responsibilities
- Infrastructure Leadership: Design and implement the Year 1 AI infrastructure roadmap, ensuring scalability and efficiency for next-gen applications.
- Model Deployment: Oversee the architectural integrity of our core AI systems, focusing on low-latency inference and high-availability cloud architecture.
- Team Orchestration: Lead a cross-functional team of data scientists and engineers to optimize deep learning models for production deployment.
- Strategic Roadmapping: Collaborate with product leadership to translate futuristic concepts into tangible technical roadmaps.
- MLOps Excellence: Establish best practices for MLOps, model monitoring, and ethical AI governance.
- Performance Optimization: Continuously benchmark and refine system performance to handle millions of concurrent requests.
Qualifications
- Education: Masterβs or Ph.D. in Computer Science, Artificial Intelligence, or a related technical field.
- Experience: Minimum of 8+ years of experience in systems architecture, with a specific focus on AI/ML infrastructure.
- Technical Skills: Deep expertise in Python, PyTorch, TensorFlow, and distributed computing frameworks (Kubernetes, Docker, Apache Spark).
- Production Expertise: Proven track record of deploying large-scale machine learning models in high-volume production environments.
- AI Specialization: Strong understanding of NLP, Computer Vision, or Generative AI paradigms.
- Problem Solving: Exceptional ability to troubleshoot complex system bottlenecks and architectural challenges.