Information Technology · Full Time · Verified

Senior LLM Engineer - Generative AI

Apex Neural Systems
Austin
Estimated Salary
USD 160,000 – USD 240,000
Live Update
19 May 2026
Deadline
19 May 2027

Job Description

Join Apex Neural Systems, a frontier tech company pioneering the next generation of Artificial General Intelligence. We are on a mission to democratize access to advanced reasoning models by 2026. We are seeking a visionary Senior LLM Engineer to lead the architecture of our flagship Large Language Models.

In this high-impact role, you will bridge the gap between theoretical research and production-grade deployment. You will work directly with our research team to fine-tune state-of-the-art models, optimize inference pipelines, and ensure our AI solutions are robust, scalable, and ethically sound.

Why join us? We offer competitive equity packages, unlimited PTO, and the opportunity to work on projects that define the future of human-computer interaction.

Responsibilities

  • Architect and fine-tune Large Language Models (LLMs) using PyTorch and Hugging Face Transformers.
  • Develop and optimize Retrieval-Augmented Generation (RAG) pipelines to enhance factual accuracy and reduce hallucinations.
  • Implement advanced quantization and distillation techniques to reduce model latency and cost.
  • Collaborate with cross-functional teams (Product, Design, Research) to integrate AI capabilities into consumer applications.
  • Establish MLOps best practices for model versioning, A/B testing, and continuous deployment.
  • Conduct rigorous evaluation and testing to ensure model safety, fairness, and compliance.

Qualifications

  • Master’s or PhD in Computer Science, Machine Learning, or a related technical field.
  • 5+ years of professional experience with Python and a deep learning framework (TensorFlow or PyTorch).
  • Deep understanding of NLP, Transformers, Attention Mechanisms, and Tokenization strategies.
  • Experience deploying machine learning models on cloud infrastructure (AWS, GCP, or Azure).
  • Proven track record of optimizing model inference speed and memory efficiency.
  • Familiarity with LlamaIndex, LangChain, or similar LLM application frameworks.

Required Skills

Python, PyTorch, TensorFlow, NLP, LLMs, MLOps, RAG, Deep Learning, Machine Learning, Cloud Computing, AWS, GCP

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.
