CAREERS @ GENERIA

LLM Deployment & Optimization Engineer

Shawinigan, Québec

Hybrid

Your mission: translate innovative LLM research and training into efficient, scalable, and robust production systems, with a focus on pre-deployment optimization (including quantization and compression techniques).

GenerIA is a deep tech company with strong technical and ethical ambitions. We already set eco-responsibility standards with the world's first carbon-neutral AIs, which are also fully secure by design. Our agenda for this year includes developing a new series of even more frugal generative models with SOTA performance and debuting a photonic quantum model together with industry leaders in the domain.

This certainly sounds serious, but our commitment comes with principles. GenerIA is first and foremost a human adventure, whose success is ultimately measured by the personal and professional well-being of each and every one of us. If you can help us achieve our mission, if you recognize yourself in our Values, and if you like both challenges and fun, let's talk.

Key Responsibilities

  • Model optimization & quantization: develop and implement strategies for model optimization, including quantization and compression techniques, to reduce inference latency and resource usage
  • Pre-deployment testing & validation: oversee rigorous testing to ensure models perform reliably in real-world environments
  • Infrastructure & scalability: build and manage deployment infrastructures using containerization, orchestration, and cloud-native technologies
  • Collaboration & feedback loop: work with training and research teams to integrate feedback and refine the model throughout its lifecycle.

Requirements

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related discipline, with strong experience in deploying ML systems
  • Expertise in software development, API design, and production-level code quality
  • Familiarity with exploratory data analysis (EDA) and data mining techniques to aid in pre-deployment assessments
  • Sound understanding of linear algebra, calculus, and basic statistics, useful for both EDA and performance tuning
  • Strong collaboration skills to work effectively across engineering, research, and operational teams.

Benefits

  • Competitive salary and equity package
  • Flexible work arrangements
  • Health and dental insurance
  • Professional development budget
  • Regular team events and activities.

Apply for this opening

Your privacy is actively protected.

Our in-house application system keeps your application from being exposed.
