Job Description
Join Apex Neural Systems, a frontier tech company pioneering the next generation of Artificial General Intelligence. We are on a mission to democratize access to advanced reasoning models by 2026. We are seeking a visionary Senior LLM Engineer to lead the architecture of our flagship Large Language Models.
In this high-impact role, you will bridge the gap between theoretical research and production-grade deployment. You will work directly with our research team to fine-tune state-of-the-art models, optimize inference pipelines, and ensure our AI solutions are robust, scalable, and ethically sound.
Why join us? We offer competitive equity packages, unlimited PTO, and the opportunity to work on projects that define the future of human-computer interaction.
Responsibilities
- Architect and fine-tune Large Language Models (LLMs) using PyTorch and Hugging Face Transformers.
- Develop and optimize Retrieval-Augmented Generation (RAG) pipelines to enhance factual accuracy and reduce hallucinations.
- Implement advanced quantization and distillation techniques to reduce model latency and cost.
- Collaborate with cross-functional teams (Product, Design, Research) to integrate AI capabilities into consumer applications.
- Establish MLOps best practices for model versioning, A/B testing, and continuous deployment.
- Conduct rigorous evaluation and testing to ensure model safety, fairness, and compliance.
Qualifications
- Master's or PhD in Computer Science, Machine Learning, or a related technical field.
- 5+ years of professional experience with Python and either TensorFlow or PyTorch.
- Deep understanding of NLP, Transformer architectures, attention mechanisms, and tokenization strategies.
- Experience deploying machine learning models on cloud infrastructure (AWS, GCP, or Azure).
- Proven track record of optimizing model inference speed and memory efficiency.
- Familiarity with LlamaIndex, LangChain, or similar LLM application frameworks.