The workflow JSON

Copy or download the full n8n JSON below. Paste it into a new n8n workflow, add your credentials, activate. Full import guide →

Download .json

{
  "name": "AI Data Extractor from Unstructured Text (Ollama)",
  "nodes": [
    {
      "parameters": {},
      "id": "webhook-1",
      "name": "Receive Data",
      "type": "n8n-nodes-base.webhook",
      "typeVersion": 1.1,
      "position": [
        240,
        300
      ]
    },
    {
      "parameters": {
        "assignments": {
          "assignments": [
            {
              "id": "text",
              "name": "text",
              "value": "={{ $json.body.text || '' }}",
              "type": "string"
            },
            {
              "id": "schema",
              "name": "schema",
              "value": "={{ $json.body.schema || '{\"name\": \"string\", \"email\": \"string\", \"phone\": \"string\", \"company\": \"string\", \"role\": \"string\"}' }}",
              "type": "string"
            },
            {
              "id": "context",
              "name": "context",
              "value": "={{ $json.body.context || 'Extract structured data from the text' }}",
              "type": "string"
            }
          ]
        }
      },
      "id": "set-1",
      "name": "Prepare Input",
      "type": "n8n-nodes-base.set",
      "typeVersion": 3.3,
      "position": [
        460,
        300
      ]
    },
    {
      "parameters": {
        "method": "POST",
        "url": "http://localhost:11434/api/generate",
        "sendBody": true,
        "specifyBody": "json",
        "jsonBody": "={{ JSON.stringify({ model: 'llama3:8b', prompt: `You are a data extraction specialist. Extract structured data from unstructured text.\\n\\nContext: ${$json.context}\\n\\nTarget schema (extract these fields):\\n${$json.schema}\\n\\nText to extract from:\\n${$json.text}\\n\\nRules:\\n1. Return ONLY a valid JSON array of objects matching the schema\\n2. If a field cannot be found, use null\\n3. Extract ALL matching entities (there may be multiple)\\n4. Be precise \u2014 only extract what is clearly stated\\n5. Do not invent or hallucinate data\\n\\nReturn ONLY the JSON array, no explanation.`, stream: false, options: { temperature: 0.1, num_predict: 4000 } }) }}",
        "options": {
          "timeout": 120000
        }
      },
      "id": "ollama-1",
      "name": "Extract Data (Ollama)",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.1,
      "position": [
        680,
        300
      ]
    },
    {
      "parameters": {
        "jsCode": "const response = $input.first().json.response;\n\nlet extracted;\ntry {\n  // Find JSON array in response\n  const arrayMatch = response.match(/\\[[\\s\\S]*\\]/);\n  if (arrayMatch) {\n    extracted = JSON.parse(arrayMatch[0]);\n  } else {\n    // Try single object\n    const objMatch = response.match(/\\{[\\s\\S]*\\}/);\n    extracted = objMatch ? [JSON.parse(objMatch[0])] : [];\n  }\n} catch (e) {\n  extracted = [{ raw_response: response, parse_error: e.message }];\n}\n\nreturn [{ json: { extracted_count: extracted.length, data: extracted } }];"
      },
      "id": "code-1",
      "name": "Parse & Validate",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [
        900,
        300
      ]
    },
    {
      "parameters": {
        "method": "POST",
        "url": "http://localhost:11434/api/generate",
        "sendBody": true,
        "specifyBody": "json",
        "jsonBody": "={{ JSON.stringify({ model: 'llama3:8b', prompt: `Verify this extracted data for accuracy. Check each field against common sense and flag any suspicious values.\\n\\nExtracted data:\\n${JSON.stringify($json.data)}\\n\\nFor each record, return a JSON object with:\\n- record_index: number\\n- confidence: \"high\", \"medium\", or \"low\"\\n- flags: array of any concerns (empty if none)\\n\\nReturn a JSON array of verification results.`, stream: false, options: { temperature: 0.1 } }) }}",
        "options": {
          "timeout": 120000
        }
      },
      "id": "ollama-2",
      "name": "Verify Extraction (Ollama)",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.1,
      "position": [
        1120,
        300
      ]
    },
    {
      "parameters": {
        "respondWith": "json",
        "responseBody": "={{ JSON.stringify({ extracted_count: $('Parse & Validate').first().json.extracted_count, data: $('Parse & Validate').first().json.data, verification: $json.response }) }}"
      },
      "id": "respond-1",
      "name": "Return Results",
      "type": "n8n-nodes-base.respondToWebhook",
      "typeVersion": 1.1,
      "position": [
        1340,
        300
      ]
    }
  ],
  "connections": {
    "Receive Data": {
      "main": [
        [
          {
            "node": "Prepare Input",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "Prepare Input": {
      "main": [
        [
          {
            "node": "Extract Data (Ollama)",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "Extract Data (Ollama)": {
      "main": [
        [
          {
            "node": "Parse & Validate",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "Parse & Validate": {
      "main": [
        [
          {
            "node": "Verify Extraction (Ollama)",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "Verify Extraction (Ollama)": {
      "main": [
        [
          {
            "node": "Return Results",
            "type": "main",
            "index": 0
          }
        ]
      ]
    }
  },
  "settings": {
    "executionOrder": "v1"
  },
  "staticData": null,
  "tags": [],
  "triggerCount": 0,
  "updatedAt": "2026-03-24T00:00:00.000Z",
  "versionId": "1"
}

Pro

For the full experience including quality scoring and batch install features for each workflow upgrade to Pro

About this workflow

AI Data Extractor from Unstructured Text (Ollama). Uses httpRequest. Webhook trigger; 6 nodes.

Source: https://github.com/Vanillapfalz374/n8n-ai-workflows/blob/main/samples/ai-data-extractor.json — original creator credit. Request a take-down →

More AI & RAG workflows → · Browse all categories →

Related workflows

Workflows that share integrations, category, or trigger type with this one. All free to copy and import.

AI & RAG

Classify & Extract Data From Floorplans with Mistral AI OCR & Jigsawstack

<section> <h2>🌊 What it Does</h2> <p> This workflow <strong>automatically classifies uploaded files</strong> (PDFs or images) as <span>floorplans</span> or <span>non‑floorplans</span>. It filters out

HTTP Request

AI & RAG

Transcribe Meetings and Log Action Items to Notion with Assemblyai and Gemini

This workflow accepts a meeting recording URL via webhook, transcribes the audio with AssemblyAI, uses Google Gemini to extract a summary, action items, decisions, and next steps, then creates a struc

HTTP Request

AI & RAG

Image-based Data Extraction API Using Gemini AI (http Request)

This n8n workflow provides a ready-to-use API endpoint for extracting structured data from images. It processes an image URL using an AI-powered OCR model and returns the extracted details in a struct

HTTP Request

AI & RAG

AI Keyword & Entity Extractor (ollama)

AI Keyword & Entity Extractor (Ollama). Uses httpRequest. Webhook trigger; 7 nodes.

HTTP Request

AI & RAG

Extract Embedded Images From Google Drive Documents with Vlm Run Agent

This workflow automates the process of extracting images from uploaded documents in Google Drive using the VLM Run Execute Agent, then downloads and saves those extracted images into a designated Driv

Google Drive, @Vlm Run/N8N Nodes Vlmrun, HTTP Request +1

AI Data Extractor From Unstructured Text (ollama)

The workflow JSON

About this workflow

Related workflows