Leverage the power of multimodal LLMs to automate your smart home
Endless Possibilities for Automations
Stay in the loop
Analyzes camera feeds and Frigate events in real time to keep you updated.
Just ask
LLM Vision builds a detailed timeline of events around your home, so you know what happened and when. You can even ask it about a specific event.
Image Pipeline
Advanced computer vision algorithms pick the most relevant images to analyze and optimize them for inference. The image pipeline is designed to take advantage of hardware acceleration and reduce latency.
Powerful Integrations
Integrate any AI provider you already use, or host your own LLM for maximum privacy.
Easy to use
Get started with LLM Vision with just a few simple lines of code.
LLM Vision is a Home Assistant integration that can analyze images, videos, and
camera streams using multimodal LLMs.
It exposes four actions that are simple
to use, yet enable powerful automations.
Analyzes one or multiple images from different camera or image entities or image files.
Extracts and analyzes frames from a video file or Frigate event.
Captures and analyzes frames from camera entities.
Analyzes and updates a Home Assistant entity with data extracted from an image based on your prompt.
Discover What's Possible with LLM Vision
Explore community-built automations, scripts, and more—powered by LLM Vision.
Visit GalleryLLM Vision can remember events so you can ask about them later. A conversation agent such as Extended OpenAI Conversation is required.
Person seen
08:52 AM
White SUV seen
08:55 AM
Courier seen
10:11 AM
LLM Vision can extract data from images and update Home Assistant entities.
number
and
input_number
text
and
input_text
select
and
input_select
input_boolean
Main Gate
Open
Providers that implement an OpenAI compatible API endpoint can also be used. Additional setup
may be required.
LLM Vision is not endorsed by
or
associated
with any of the listed providers. All trademarks are property of their respective
owners.
Preview Card
The new Preview Card lets you see the most recent event at a glance. Tap it to see even all the details. Includes all the same advanced filters from the Timeline Card.
Timeline Card
A detailed timeline of events, with smart categories and filters. Tap an event for more details.
Speaks your language
Currently supports English, German, Dutch, French, Spanish, Portuguese, Italian, Polish, and Swedish. More languages are added frequently.
LLM Vision is a Home Assistant integration that can analyze images, videos, live camera feeds and frigate events using the vision capabilities of multimodal LLMs.
You can install LLM Vision via HACS using the GitHub repository URL. For more details, visit the LLM Vision documentation.
While the integration itself is free, depending on your provider you'll pay per query. For more information on pricing, visit Provider Comparison.
Get Started