Vision

Analyze images with vision models

Configuration

Image File
Upload an image file
Image File Reference
Reference an image from previous blocks
Image URL (alternative)
Or enter publicly accessible image URL
Vision Model
Select...
Prompt*
Enter prompt for image analysis
API Key*
••••••••

Tools

vision_tool

Process and analyze images using advanced vision models. Capable of understanding image content, extracting text, identifying objects, and providing detailed visual descriptions.

Input

ParameterTypeRequiredDescription
apiKeystringYesAPI key for the selected model provider
imageUrlstringNoPublicly accessible image URL
imageFilefileNoImage file to analyze
modelstringNoVision model to use (gpt-4o, claude-3-opus-20240229, etc)
promptstringNoCustom prompt for image analysis

Output

ParameterTypeDescription
contentstringThe analyzed content and description of the image
modelstringThe vision model that was used for analysis
tokensnumberTotal tokens used for the analysis
input_tokensnumberTokens used for input processing
output_tokensnumberTokens used for response generation

Usage Instructions

Integrate Vision into the workflow. Can analyze images with vision models.

Notes

  • Category: tools
  • Type: vision