Intern Assistant Engineer - AI Inference Performance

  • Waterloo, Ontario
  • wms6e

Job description

Our team has an immediate 12-month internship opening for an Assistant Engineer.

Responsibilities:
  • Assist in developing and maintaining performance monitoring tools.
  • Support profiling and analyzing inference workloads to identify performance bottlenecks.
  • Under the guidance of senior engineers, apply optimization techniques such as quantization, kernel fusion, and pruning to improve inference performance.
  • Help optimize AI workloads across multiple hardware platforms (e.g., GPUs, edge devices).
  • Collaborate with senior engineers, research teams, and AI infrastructure teams to integrate optimizations into AI inference pipelines.
  • Learn to use profiling tools such as TensorBoard, PyTorch Profiler, and NVIDIA Nsight to surface key performance insights.

Job requirements

What you’ll bring to the team:

  • Currently pursuing, or recently graduated with, a Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, AI/ML, or a related field.
  • Familiarity with programming languages like Python or C++.
  • Basic knowledge of deep learning frameworks (e.g., TensorFlow, PyTorch) and AI inference.
  • Strong background in profiling and performance analysis tools.
  • Strong analytical and problem-solving skills with an eagerness to learn.
