Job Description
Job Title:
Vision Language Model / Computer Vision Engineer (Co-op)
Posting Start Date:
4/20/26
Job Description:
As a Vision Language Model (VLM)/Computer Vision Engineer Intern, you will contribute to the design, development, and deployment of local, edge-hosted language and vision models within an internet-isolated Operational Technology (OT) environment. This role focuses on integrating Retrieval-Augmented Generation (RAG) systems, Vision Language Models (VLMs), and intelligent agents to enhance decision-making, process monitoring, and operational insights.
This Co-op is a 1 year term starting in September.
What You'll Do:
- Researching, selecting, and configuring publicly available large language and vision models for OT-specific use cases
- Developing specialized agents to perform tasks such as sensor data analysis, process monitoring, insight generation, and optimization
- Implementing lightweight vision models for technical document processing, summarization, and visual data interpretation
- Ensuring models are optimized for edge deployment in resource-constrained environments
What You'll Bring:
- Currently pursuing an undergraduate or graduate degree in Engineering, Computer Science, Physics, or Mathematics
- Demonstrated experience with deploying self-hosted language/vision models and building AI-enabled applications
- Proficiency in Python programming with strong familiarity in Unix/Linux system administration
- Knowledge of edge AI deployment strategies, model optimization, and integration in isolated networks
- Ability to work collaboratively with multidisciplinary teams and adapt to evolving technology landscapes