Job Description
We are building the Artificial General Intelligence (AGI) infrastructure of tomorrow. As a Senior AI Engineer (2026 Focus), you will lead the architectural design of our proprietary LLMs and neural networks, directly shaping the capabilities we deploy in the year 2026 and beyond.
You will operate at the bleeding edge of Deep Learning, working with state-of-the-art architectures to solve complex, unsolved problems in reasoning, multimodal synthesis, and autonomous agent orchestration. This is not just a job; it is a mission to define the technological baseline for the next decade.
Why join us?
- Impact: Your code will power the core intelligence of our next-gen platform.
- Compensation: Competitive salary and equity packages.
- Environment: Top-tier research lab in the heart of Silicon Valley.
Responsibilities
- Architect and optimize large-scale Transformer models for deployment in 2026 infrastructure environments.
- Lead the research and implementation of novel fine-tuning techniques (LoRA, QLoRA, P-Tuning) to enhance model reasoning capabilities.
- Design efficient inference pipelines using C++ and CUDA to reduce latency in real-time AI applications.
- Mentor junior engineers and data scientists on best practices in MLOps and model evaluation.
- Collaborate with product teams to translate advanced AI research into scalable, user-facing features.
- Contribute to the open-source community, publishing papers and libraries that influence the broader AI ecosystem.
Qualifications
- PhD or Masterβs degree in Computer Science, Mathematics, or a related field with a focus on Deep Learning.
- 5+ years of professional experience building production-grade AI systems, specifically with PyTorch or TensorFlow.
- Deep understanding of Natural Language Processing (NLP), specifically Large Language Models (LLMs) and Generative AI.
- Proven track record of optimizing model performance and reducing token generation costs.
- Experience with distributed training frameworks (Ray, Horovod) and cloud infrastructure (AWS, GCP, Azure).
- Strong programming skills in Python, C++, and CUDA.