We are looking for a skilled technical consultant with expertise in cinematic AI and LLM and VLM to help identify key technical keywords from images and train corresponding models. The ideal candidate will have a deep understanding of AI technologies and image processing, along with experience in training models to enhance performance. You will collaborate with our development team to optimize our existing frameworks and ensure accurate keyword extraction. If you are passionate about pushing the boundaries of AI in cinematic applications, we would love to hear from you.
What We Need Built
A pipeline or API that takes an image and outputs technical terms
Use of multimodal LLMs, vision models, or a combined approach
A fixed vocabulary/taxonomy to ensure consistent output
Confidence scoring + batch inference support
Ability to run on our cloud GPU
Output response through an API
Documentation for all prompts, models, and pipeline steps
Apply Now
Apply Now