This n8n workflow illustrates how to convert PDF files into text with the PDF Toolkit from www.customjs.space.

What this workflow does:

– Convert the requested HTML to a PDF.
– Use a Code node to handle URLs that point to PDF files.
– Extract the text from the PDF.
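
The workflow's Code node logic is not reproduced in this description. As a rough illustration of what "handling URLs that point to PDF files" could look like, here is a minimal Python sketch. It assumes each item carries its link in a `url` field under `json` (a hypothetical field name) and that a PDF link can be recognised by a `.pdf` file extension; the function names are illustrative and not part of n8n or CustomJS.

```python
from urllib.parse import urlparse


def is_pdf_url(url) -> bool:
    """Return True when url is an http(s) URL whose path ends in .pdf."""
    if not isinstance(url, str):
        return False
    parsed = urlparse(url)
    # Check only the path, so query strings and fragments do not matter.
    return parsed.scheme in ("http", "https") and parsed.path.lower().endswith(".pdf")


def keep_pdf_items(items: list) -> list:
    """Keep only items whose json.url field (assumed name) points to a PDF."""
    return [item for item in items if is_pdf_url(item.get("json", {}).get("url"))]


if __name__ == "__main__":
    sample = [
        {"json": {"url": "https://example.com/report.PDF?download=1"}},
        {"json": {"url": "https://example.com/index.html"}},
    ]
    print(keep_pdf_items(sample))
```

Checking the extension is only a heuristic: a server can deliver a PDF from a URL without a `.pdf` suffix, so a stricter check would inspect the response's content type after download.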

Requirements:

– A self-hosted n8n instance.
– A CustomJS API key for converting PDF to text.
– HTML data to convert into a PDF file.
– A Code node to handle URLs that point to PDF files.

Workflow Steps:

1. Manual Trigger:
Starts the workflow when the user runs it manually.

2. HTML to PDF:
Request HTML Data
Convert HTML to PDF

3. Convert PDF to Text:
Convert the generated PDF to text

Usage:

1. Get an API key from CustomJS:
– Sign up for the CustomJS platform.
– Navigate to your profile page.
– Press the “Show” button to reveal your API key.

2. Set up the CustomJS API credentials in n8n:
Copy your API key from CustomJS and paste it into the credential.

3. Design the workflow:
– A Manual Trigger to start the workflow.
– HTTP Request nodes to download the PDF files.
– A Code node to handle URLs that point to PDF files.
– A Convert PDF to Text node.

You can replace the logic for triggering the workflow and returning its results. For example, you can trigger this workflow by calling a webhook and receive the result in the webhook response. To do so, replace the Manual Trigger and Write to Disk nodes.
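
If the Manual Trigger is swapped for a webhook, a client could call the workflow roughly as sketched below. This is an illustration under assumptions, not the workflow's defined interface: the webhook URL, the `url` payload field, and the plain-text response are all hypothetical and depend on how you configure the webhook in your own n8n instance.

```python
import json
import urllib.request


def build_webhook_request(webhook_url: str, pdf_url: str) -> urllib.request.Request:
    """Build a POST request that passes the PDF link as JSON (assumed payload shape)."""
    body = json.dumps({"url": pdf_url}).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def trigger_workflow(webhook_url: str, pdf_url: str, timeout: float = 30.0) -> str:
    """Send the request and return the response body as text."""
    request = build_webhook_request(webhook_url, pdf_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

A call such as `trigger_workflow("https://n8n.example.com/webhook/pdf-to-text", "https://example.com/report.pdf")` would then return whatever the workflow sends back in the webhook response; both URLs here are placeholders.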