Job Description
About Us: Nexus AI Solutions is pioneering the next generation of generative intelligence. We are looking for a visionary Senior AI/LLM Engineer to join our elite team in San Francisco. In this role, you will architect and deploy state-of-the-art models that define the landscape of 2026 and beyond.
Why Join Us? We offer a competitive compensation package, equity packages, and the opportunity to work on cutting-edge technology that impacts millions.
Responsibilities
- Architect and implement scalable Machine Learning pipelines for Large Language Models (LLMs) and Generative AI applications.
- Optimize model inference latency and cost-effectiveness using techniques like quantization and pruning.
- Collaborate closely with product managers and data scientists to define AI roadmap and technical specifications.
- Research and integrate the latest advancements in NLP, Transformer architectures, and RAG (Retrieval-Augmented Generation).
- Ensure the security, privacy, and ethical use of AI models within production environments.
- Mentor junior engineers and conduct code reviews to maintain high engineering standards.
Qualifications
- Masterβs or PhD in Computer Science, Machine Learning, or a related field.
- 5+ years of professional experience in AI/ML, with at least 2 years specializing in LLMs and NLP.
- Proficiency in Python, PyTorch, or TensorFlow.
- Experience with model serving frameworks (e.g., Kubernetes, TorchServe, TGI).
- Familiarity with vector databases (e.g., Pinecone, Milvus) and RAG architectures.
- Strong understanding of distributed systems and cloud infrastructure (AWS, GCP, or Azure).