AutomationFlowsAI & RAG › Gemini 1.5 Pro Image Captioning Workflow

Gemini 1.5 Pro Image Captioning Workflow

Original n8n title: Easy Image Captioning with Gemini 1.5 Pro (lm Chat Google Gemini)

ByJimleuk @jimleuk on n8n.io

This n8n workflow demonstrates how to automate image captioning tasks using Gemini 1.5 Pro - a multimodal LLM which can accept and analyse images. This is a really simple example of how easy it is to build and leverage powerful AI models in your repetitive tasks. For this demo,…

Event trigger★★★★☆ complexityAI-powered16 nodesGoogle Gemini ChatOutput Parser StructuredEdit ImageHTTP RequestChain Llm
AI & RAG Trigger: Event Nodes: 16 Complexity: ★★★★☆ AI nodes: yes Added:

This workflow corresponds to n8n.io template #2418 — we link there as the canonical source.

This workflow follows the Chainllm → HTTP Request recipe pattern — see all workflows that pair these two integrations.

The workflow JSON

Copy or download the full n8n JSON below. Paste it into a new n8n workflow, add your credentials, activate. Full import guide →

Download .json

  

Credentials you'll need

Each integration node will prompt for credentials when you import. We strip credential IDs before publishing — you'll add your own.

Pro

For the full experience including quality scoring and batch install features for each workflow upgrade to Pro

About this workflow

This n8n workflow demonstrates how to automate image captioning tasks using Gemini 1.5 Pro - a multimodal LLM which can accept and analyse images. This is a really simple example of how easy it is to build and leverage powerful AI models in your repetitive tasks. For this demo,…

Source: https://n8n.io/workflows/2418/ — original creator credit. Request a take-down →

More AI & RAG workflows → · Browse all categories →

Related workflows

Workflows that share integrations, category, or trigger type with this one. All free to copy and import.

AI & RAG

My workflow 53. Uses formTrigger, httpRequest, lmChatOpenAi, form. Event-driven trigger; 74 nodes.

Form Trigger, HTTP Request, OpenAI Chat +15
AI & RAG

This n8n template demonstrates how to automatically generate authentic User-Generated Content (UGC) style marketing videos for eCommerce products using AI. Simply upload a product image, and the workf

Form Trigger, OpenAI, Chain Llm +5
AI & RAG

The Recap AI - eCommerce UGC Video Generator. Uses formTrigger, openAi, chainLlm, outputParserStructured. Event-driven trigger; 24 nodes.

Form Trigger, OpenAI, Chain Llm +5
AI & RAG

Easy Image Captioning With Gemini 1 5 Pro. Uses manualTrigger, lmChatGoogleGemini, outputParserStructured, editImage. Event-driven trigger; 16 nodes.

Google Gemini Chat, Output Parser Structured, Edit Image +2
AI & RAG

Easy Image Captioning With Gemini 1.5 Pro. Uses manualTrigger, lmChatGoogleGemini, outputParserStructured, editImage. Event-driven trigger; 16 nodes.

Google Gemini Chat, Output Parser Structured, Edit Image +2