This n8n workflow illustrates how to convert PDF files into text with the PDF Toolkit from www.customjs.space.
What this workflow does:
– Convert the requested HTML to a PDF.
– Extract text from the PDF.
– Use a Code node to handle URLs that point to PDF files.
– Convert the PDF to text.
Requirements:
– Self-hosted n8n instance.
– CustomJS API key for converting PDF to text.
– HTML data to convert into a PDF file.
– A Code node for handling URLs that point to PDF files.
Workflow Steps:
1. Manual Trigger:
Starts the workflow when the user runs it manually.
2. HTML to PDF:
Request the HTML data.
Convert the HTML to PDF.
3. Convert PDF to Text:
Convert the generated PDF to text.
Usage:
1. Get an API key from CustomJS:
– Sign up for the CustomJS platform.
– Navigate to your profile page.
– Press the “Show” button to reveal your API key.
2. Set Credentials for CustomJS API in n8n:
Copy the API key generated by CustomJS and paste it into the credential.
3. Design the workflow:
– A Manual Trigger for starting the workflow.
– HTTP Request nodes for downloading PDF files.
– A Code node for handling URLs that point to PDF files.
– Convert PDF to Text for extracting the text from the PDF.
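The Code node in the list above could look roughly like the following sketch. It assumes each incoming item carries a `url` field and that a PDF URL is one whose path ends in `.pdf`. The helper names `isPdfUrl` and `keepPdfItems` are hypothetical and not part of n8n or CustomJS.

```javascript
// Hypothetical helper: decides whether a URL looks like it points to a PDF file.
// Assumption: the URL path ends in ".pdf"; query strings and fragments are ignored.
function isPdfUrl(url) {
  try {
    const { pathname } = new URL(url);
    return pathname.toLowerCase().endsWith(".pdf");
  } catch {
    return false; // not a valid absolute URL
  }
}

// Keeps only the items whose (assumed) `url` field points to a PDF.
// Items use n8n's { json: { ... } } shape.
function keepPdfItems(items) {
  return items.filter((item) => isPdfUrl(item.json?.url));
}

// Inside an n8n Code node ("Run Once for All Items") the last line would be:
//   return keepPdfItems($input.all());
const sample = [
  { json: { url: "https://example.com/docs/report.pdf?download=1" } },
  { json: { url: "https://example.com/index.html" } },
  { json: { url: "not a url" } },
];
console.log(keepPdfItems(sample).map((item) => item.json.url));
```

Checking the parsed path rather than the raw string means a trailing query string such as `?download=1` does not hide the `.pdf` extension. A server can also serve a PDF from a URL without that extension, so a stricter check would inspect the response's content type instead.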
You can replace the logic for triggering the workflow and returning results. For example, you can trigger this workflow by calling a webhook and receive the result in the webhook response. Simply replace the Manual Trigger and Write to Disk nodes.
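As a rough illustration of the webhook variant, the sketch below builds the request a client might send once the Manual Trigger has been replaced by a Webhook node. The host `n8n.example.com`, the webhook path `pdf-to-text`, and the `url` body field are assumptions chosen for the example. The helper `buildWebhookRequest` is hypothetical.

```javascript
// Builds the request for a hypothetical n8n Webhook node.
// Assumptions: the webhook accepts POST with a JSON body holding a `url` field,
// and is registered under the given path on a self-hosted instance.
function buildWebhookRequest(baseUrl, webhookPath, pdfUrl) {
  const endpoint = new URL(`/webhook/${webhookPath}`, baseUrl).toString();
  return {
    endpoint,
    options: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ url: pdfUrl }),
    },
  };
}

const request = buildWebhookRequest(
  "https://n8n.example.com",
  "pdf-to-text",
  "https://example.com/report.pdf"
);
console.log(request.endpoint);

// Sending it (Node 18+ has a global fetch):
//   const response = await fetch(request.endpoint, request.options);
//   const text = await response.text();
```

For the caller to get the extracted text back in the same HTTP response, the Webhook node has to be configured to respond after the workflow finishes rather than immediately.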