AutomationFlowsAI & RAG › Convert Tour Pdfs to Vector Database Using Google Drive, Langchain & Openai

Convert Tour Pdfs to Vector Database Using Google Drive, Langchain & Openai

ByMohan Gopal @mohan on n8n.io

Process Tour PDF from Google Drive to Pinecone Vector DB with OpenAI Embeddings

Event trigger★★★★☆ complexityAI-powered10 nodesGoogle DrivePinecone Vector StoreOpenAI EmbeddingsDocument Default Data LoaderText Splitter Recursive Character Text Splitter
AI & RAG Trigger: Event Nodes: 10 Complexity: ★★★★☆ AI nodes: yes Added:

This workflow corresponds to n8n.io template #5085 — we link there as the canonical source.

This workflow follows the Documentdefaultdataloader → OpenAI Embeddings recipe pattern — see all workflows that pair these two integrations.

The workflow JSON

Copy or download the full n8n JSON below. Paste it into a new n8n workflow, add your credentials, activate. Full import guide →

Download .json
{
  "id": "9wzQXONqaSbZWXqh",
  "meta": {
    "templateCredsSetupCompleted": true
  },
  "name": "Tour Package Recomendation Database - Process Tour PDF",
  "tags": [
    {
      "id": "oKGIn6U0wpeHShTN",
      "name": "working flow",
      "createdAt": "2025-06-02T06:27:44.762Z",
      "updatedAt": "2025-06-02T06:27:44.762Z"
    }
  ],
  "nodes": [
    {
      "id": "ff4ada8f-0e63-4035-afb2-9f459cd7f158",
      "name": "When clicking \u2018Test workflow\u2019",
      "type": "n8n-nodes-base.manualTrigger",
      "position": [
        -920,
        1060
      ],
      "parameters": {},
      "typeVersion": 1
    },
    {
      "id": "b17ec584-be56-49c2-ad9d-a85bc98285c2",
      "name": "PDF Tour Package Folder",
      "type": "n8n-nodes-base.googleDrive",
      "position": [
        -680,
        1060
      ],
      "parameters": {
        "filter": {
          "folderId": {
            "__rl": true,
            "mode": "list",
            "value": "1ANNp1YrfmcNhYm2ZqUGtxxLrvPenBlhh",
            "cachedResultUrl": "https://drive.google.com/drive/folders/1ANNp1YrfmcNhYm2ZqUGtxxLrvPenBlhh",
            "cachedResultName": "TestPackages"
          }
        },
        "options": {},
        "resource": "fileFolder",
        "returnAll": true
      },
      "credentials": {
        "googleDriveOAuth2Api": {
          "name": "<your credential>"
        }
      },
      "typeVersion": 3
    },
    {
      "id": "5c4e9e98-4f1f-4024-ab88-6183f4583c00",
      "name": "Download Package Files",
      "type": "n8n-nodes-base.googleDrive",
      "position": [
        -440,
        1060
      ],
      "parameters": {
        "fileId": {
          "__rl": true,
          "mode": "id",
          "value": "={{ $json.id }}"
        },
        "options": {},
        "operation": "download"
      },
      "credentials": {
        "googleDriveOAuth2Api": {
          "name": "<your credential>"
        }
      },
      "typeVersion": 3
    },
    {
      "id": "ad967561-1968-4bdb-9c36-17200f291593",
      "name": "Loop Over each PDF file",
      "type": "n8n-nodes-base.splitInBatches",
      "position": [
        -180,
        1060
      ],
      "parameters": {
        "options": {}
      },
      "typeVersion": 3
    },
    {
      "id": "6408c50e-99fe-46f1-995b-0d9b99e42ca1",
      "name": "Pinecone Vector Store - Store Vector Data",
      "type": "@n8n/n8n-nodes-langchain.vectorStorePinecone",
      "position": [
        160,
        1100
      ],
      "parameters": {
        "mode": "insert",
        "options": {},
        "pineconeIndex": {
          "__rl": true,
          "mode": "list",
          "value": "package1536",
          "cachedResultName": "package1536"
        }
      },
      "credentials": {
        "pineconeApi": {
          "name": "<your credential>"
        }
      },
      "typeVersion": 1.1
    },
    {
      "id": "687269c8-74ca-44fb-8f37-a860ae82f162",
      "name": "Embeddings OpenAI",
      "type": "@n8n/n8n-nodes-langchain.embeddingsOpenAi",
      "position": [
        60,
        1360
      ],
      "parameters": {
        "options": {}
      },
      "credentials": {
        "openAiApi": {
          "name": "<your credential>"
        }
      },
      "typeVersion": 1.2
    },
    {
      "id": "068cde17-e656-440d-abb8-2e498cee82b1",
      "name": "Data Loader",
      "type": "@n8n/n8n-nodes-langchain.documentDefaultDataLoader",
      "position": [
        220,
        1360
      ],
      "parameters": {
        "options": {},
        "dataType": "binary"
      },
      "typeVersion": 1
    },
    {
      "id": "adaeaaf3-1bfc-4b27-878b-5d5ddb1fc9a5",
      "name": "Recursive Character Text Splitter",
      "type": "@n8n/n8n-nodes-langchain.textSplitterRecursiveCharacterTextSplitter",
      "position": [
        300,
        1540
      ],
      "parameters": {
        "options": {},
        "chunkOverlap": 50
      },
      "typeVersion": 1
    },
    {
      "id": "4184166f-f2ea-448e-9876-a432fd85a8c3",
      "name": "Sticky Note",
      "type": "n8n-nodes-base.stickyNote",
      "position": [
        -740,
        920
      ],
      "parameters": {
        "width": 460,
        "height": 320,
        "content": "## Extract PDF file from Google Drive\n**Connect & Download PDF file**"
      },
      "typeVersion": 1
    },
    {
      "id": "cfcef192-f80d-43dc-beac-0b7bfab7d9ba",
      "name": "Sticky Note1",
      "type": "n8n-nodes-base.stickyNote",
      "position": [
        -260,
        920
      ],
      "parameters": {
        "width": 360,
        "height": 320,
        "content": "## Loop over each PDF file\n**Extract each tour package content from the pdf file and vectorise the data in the pinecone database. "
      },
      "typeVersion": 1
    }
  ],
  "active": false,
  "settings": {
    "callerPolicy": "workflowsFromSameOwner",
    "executionOrder": "v1"
  },
  "versionId": "f3d6347b-1f02-4869-8cc0-b4f45029d1f0",
  "connections": {
    "Data Loader": {
      "ai_document": [
        [
          {
            "node": "Pinecone Vector Store - Store Vector Data",
            "type": "ai_document",
            "index": 0
          }
        ]
      ]
    },
    "Embeddings OpenAI": {
      "ai_embedding": [
        [
          {
            "node": "Pinecone Vector Store - Store Vector Data",
            "type": "ai_embedding",
            "index": 0
          }
        ]
      ]
    },
    "Download Package Files": {
      "main": [
        [
          {
            "node": "Loop Over each PDF file",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "Loop Over each PDF file": {
      "main": [
        [],
        [
          {
            "node": "Pinecone Vector Store - Store Vector Data",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "PDF Tour Package Folder": {
      "main": [
        [
          {
            "node": "Download Package Files",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "Recursive Character Text Splitter": {
      "ai_textSplitter": [
        [
          {
            "node": "Data Loader",
            "type": "ai_textSplitter",
            "index": 0
          }
        ]
      ]
    },
    "When clicking \u2018Test workflow\u2019": {
      "main": [
        [
          {
            "node": "PDF Tour Package Folder",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "Pinecone Vector Store - Store Vector Data": {
      "main": [
        [
          {
            "node": "Loop Over each PDF file",
            "type": "main",
            "index": 0
          }
        ]
      ]
    }
  }
}

Credentials you'll need

Each integration node will prompt for credentials when you import. We strip credential IDs before publishing — you'll add your own.

Pro

For the full experience including quality scoring and batch install features for each workflow upgrade to Pro

About this workflow

Process Tour PDF from Google Drive to Pinecone Vector DB with OpenAI Embeddings

Source: https://n8n.io/workflows/5085/ — original creator credit. Request a take-down →

More AI & RAG workflows → · Browse all categories →

Related workflows

Workflows that share integrations, category, or trigger type with this one. All free to copy and import.

AI & RAG

This n8n template automatically classifies incoming emails (Sales, Support, Internal, Finance, Promotions) and routes them to a dedicated OpenAI LLM Agent for processing. Depending on the category, th

OpenAI, Gmail, Text Classifier +16
AI & RAG

Automate Outreach Prospect automates finding, enriching, and messaging potential partners (like restaurants, malls, and bars) using Apify Google Maps scraping, Perplexity enrichment, OpenAI LLMs, Goog

@Devlikeapro/N8N Nodes Waha, Google Drive Trigger, @Apify/N8N Nodes Apify +14
AI & RAG

This workflow automates the early-stage job application process using AI.

Pinecone Vector Store, Document Default Data Loader, Google Drive +9
AI & RAG

Every company has documents sitting in Google Drive that nobody reads. HR policies, sales playbooks, product FAQs, financial guidelines — all written once, never found again. This workflow turns all o

Google Sheets, Gmail, Google Drive +5
AI & RAG

This n8n workflow implements a fully automated Retrieval-Augmented Generation (RAG) pipeline powered by Google Drive, OpenAI embeddings, and Pinecone.

Pinecone Vector Store, Document Default Data Loader, Text Splitter Recursive Character Text Splitter +10