Vision

Configuration

Image File: Upload an image file.
Image File Reference: Reference an image from a previous block.
Image URL (alternative): Or enter a publicly accessible image URL.
Vision Model: Select the vision model to use.
Prompt (required): Enter the prompt for image analysis.
API Key (required): API key for the selected model provider.

Tools

`vision_tool`

Process and analyze images using advanced vision models. The tool can understand image content, extract text, identify objects, and provide detailed visual descriptions.

Input

Parameter	Type	Required	Description
`apiKey`	string	Yes	API key for the selected model provider
`imageUrl`	string	No	Publicly accessible image URL
`imageFile`	file	No	Image file to analyze
`model`	string	No	Vision model to use (e.g. `gpt-4o`, `claude-3-opus-20240229`)
`prompt`	string	No	Custom prompt for image analysis
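
As an illustration of the input table, the sketch below assembles the parameters into a plain dictionary. The helper name `build_vision_input` is hypothetical and not part of the tool. The check that at least one image source is present is an assumption, since the table marks both `imageUrl` and `imageFile` as optional. How the resulting dictionary is passed to `vision_tool` depends on your workflow setup and is not shown here.

```python
from typing import Optional


def build_vision_input(
    api_key: str,
    image_url: Optional[str] = None,
    image_file: Optional[bytes] = None,
    model: Optional[str] = None,
    prompt: Optional[str] = None,
) -> dict:
    """Assemble a vision_tool input dict, keeping only the parameters that were set."""
    # apiKey is the only parameter the input table marks as required.
    if not api_key:
        raise ValueError("apiKey is required")
    # Assumption: a call without any image source is not meaningful.
    if image_url is None and image_file is None:
        raise ValueError("provide imageUrl or imageFile")
    params = {
        "apiKey": api_key,
        "imageUrl": image_url,
        "imageFile": image_file,
        "model": model,
        "prompt": prompt,
    }
    # Drop optional parameters that were left unset.
    return {key: value for key, value in params.items() if value is not None}


# Example usage with placeholder values.
payload = build_vision_input(
    api_key="YOUR_API_KEY",
    image_url="https://example.com/receipt.png",
    model="gpt-4o",
    prompt="Extract the total amount from this receipt.",
)
print(sorted(payload))
```

Unset optional parameters are omitted rather than sent as `None`, so the tool can fall back to whatever defaults it defines.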

Output

Parameter	Type	Description
`content`	string	The analyzed content and description of the image
`model`	string	The vision model that was used for analysis
`tokens`	number	Total tokens used for the analysis
`input_tokens`	number	Tokens used for input processing
`output_tokens`	number	Tokens used for response generation
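
The output fields can be read into a small typed structure, as in the minimal sketch below. It assumes the output arrives as a dictionary keyed by the parameter names in the table. `VisionResult` and `parse_vision_output` are hypothetical names, and the sample values are made up for illustration rather than taken from a real run.

```python
from dataclasses import dataclass


@dataclass
class VisionResult:
    content: str
    model: str
    tokens: int
    input_tokens: int
    output_tokens: int


def parse_vision_output(raw: dict) -> VisionResult:
    """Convert a raw output dict into a typed VisionResult."""
    return VisionResult(
        content=str(raw["content"]),
        model=str(raw["model"]),
        tokens=int(raw["tokens"]),
        input_tokens=int(raw["input_tokens"]),
        output_tokens=int(raw["output_tokens"]),
    )


# Illustrative values only, not real tool output.
sample = {
    "content": "A receipt showing a total of $42.10.",
    "model": "gpt-4o",
    "tokens": 150,
    "input_tokens": 120,
    "output_tokens": 30,
}
result = parse_vision_output(sample)
print(f"{result.model}: {result.tokens} tokens")
```

A missing field raises `KeyError` here, which surfaces schema mismatches early instead of silently defaulting.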

Usage Instructions

Integrate Vision into your workflow to analyze images with vision models.

Notes

Category: tools
Type: vision