Job Description
We are building the operating system for the next era of intelligence. As a Senior Generative AI Architect at Neural Horizon Labs, you will lead the design and implementation of our flagship Large Language Model (LLM) infrastructure. You will bridge the gap between cutting-edge research and production-grade systems, ensuring our models are scalable, efficient, and ethically aligned.
Join a team of world-class engineers and researchers pushing the boundaries of what is possible in 2026 and beyond.
Responsibilities
- Architect and deploy scalable, high-throughput inference pipelines for large generative models.
- Optimize model performance through quantization, distillation, and hardware acceleration (GPU/TPU).
- Design robust evaluation frameworks to measure model accuracy, safety, and hallucination rates.
- Lead architectural reviews for ML infrastructure projects, ensuring best practices in code quality and scalability.
- Collaborate with product teams to translate complex user requirements into technical AI solutions.
- Stay at the forefront of the AI landscape, researching and integrating emerging techniques (e.g., Multimodal AI, Agentic workflows).
Qualifications
- Ph.D. or Master's degree in Computer Science, Machine Learning, or a related quantitative field.
- 5+ years of experience in software engineering, with at least 3 years specializing in Deep Learning and NLP.
- Expert-level proficiency in Python, PyTorch, and TensorFlow.
- Deep understanding of transformer architectures, attention mechanisms, and LLM training methodologies.
- Experience with distributed training systems (e.g., Ray, DeepSpeed, Megatron-LM) and high-performance computing.
- Strong background in cloud infrastructure (AWS, GCP, or Azure) and containerization (Docker, Kubernetes).