Vision Language Model / Computer Vision Engineer (Co-op) Job Details

Job Title: Vision Language Model / Computer Vision Engineer (Co-op)

Posting Start Date: 4/20/26

Job Description:

As a Vision Language Model (VLM) / Computer Vision Engineer (Co-op), you will contribute to the design, development, and deployment of local, edge-hosted language and vision models within an internet-isolated Operational Technology (OT) environment. This role focuses on integrating Retrieval-Augmented Generation (RAG) systems, VLMs, and intelligent agents to enhance decision-making, process monitoring, and operational insights.

This co-op is a one-year term starting in September.

What You'll Do:

Researching, selecting, and configuring publicly available large language and vision models for OT-specific use cases
Developing specialized agents to perform tasks such as sensor data analysis, process monitoring, insight generation, and optimization
Implementing lightweight vision models for technical document processing, summarization, and visual data interpretation
Ensuring models are optimized for edge deployment in resource-constrained environments

What You'll Bring:

Currently pursuing an undergraduate or graduate degree in Engineering, Computer Science, Physics, or Mathematics
Demonstrated experience deploying self-hosted language/vision models and building AI-enabled applications
Proficiency in Python and strong familiarity with Unix/Linux system administration
Knowledge of edge AI deployment strategies, model optimization, and integration within isolated networks
Ability to work collaboratively with multidisciplinary teams and adapt to evolving technology landscapes