Job Description
Job Title: Vision Language Model / Computer Vision Engineer (Co-op)
Posting Start Date: 4/20/26
Job Description: 

As a Vision Language Model (VLM)/Computer Vision Engineer Co-op, you will contribute to the design, development, and deployment of local, edge-hosted language and vision models within an internet-isolated Operational Technology (OT) environment. This role focuses on integrating Retrieval-Augmented Generation (RAG) systems, VLMs, and intelligent agents to enhance decision-making, process monitoring, and operational insights.

This Co-op is a one-year term starting in September.

What You'll Do:

  • Research, select, and configure publicly available large language and vision models for OT-specific use cases
  • Develop specialized agents to perform tasks such as sensor data analysis, process monitoring, insight generation, and optimization
  • Implement lightweight vision models for technical document processing, summarization, and visual data interpretation
  • Optimize models for edge deployment in resource-constrained environments

What You'll Bring:

  • Current enrollment in an undergraduate or graduate degree program in Engineering, Computer Science, Physics, or Mathematics
  • Demonstrated experience deploying self-hosted language/vision models and building AI-enabled applications
  • Proficiency in Python and strong familiarity with Unix/Linux system administration
  • Knowledge of edge AI deployment strategies, model optimization, and model integration in isolated networks
  • Ability to work collaboratively with multidisciplinary teams and adapt to evolving technology landscapes