Extract Pages as Images/Separate Attachments from PDF

Feature: When a PDF URL is passed to the Extract from File node, could each page be returned as a separate object in its output, either as an image or as a single-page PDF? Ideally this would be an option in the node's configuration.

Why: Older documents are often scanned page images collated into a single PDF and may need OCR. Splitting them page by page would make it possible to run each page through OCR or a multimodal LLM for text transcription and extraction.
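
To make the request concrete, here is a rough sketch in Python of the behaviour being asked for. It is not how the node is implemented. It assumes the third-party pypdf package is installed, and the function names `split_pdf_pages` and `pages_to_items`, as well as the item shape (a `json` part plus a base64 `binary` part), are hypothetical and chosen only for illustration.

```python
import base64
import io


def split_pdf_pages(pdf_bytes: bytes) -> list[bytes]:
    """Return each page of the input PDF as its own single-page PDF."""
    # Assumption: the third-party pypdf package is available (pip install pypdf).
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(io.BytesIO(pdf_bytes))
    single_page_pdfs = []
    for page in reader.pages:
        writer = PdfWriter()
        writer.add_page(page)          # copy just this one page
        buffer = io.BytesIO()
        writer.write(buffer)           # serialize the one-page document
        single_page_pdfs.append(buffer.getvalue())
    return single_page_pdfs


def pages_to_items(pages, mime_type: str = "application/pdf") -> list[dict]:
    """Wrap each page blob in a hypothetical per-page output item."""
    pages = list(pages)
    return [
        {
            # Page position so downstream steps can restore the order.
            "json": {"pageNumber": number, "pageCount": len(pages)},
            # Hypothetical binary payload: base64 text plus its MIME type.
            "binary": {
                "data": {
                    "mimeType": mime_type,
                    "data": base64.b64encode(page).decode("ascii"),
                }
            },
        }
        for number, page in enumerate(pages, start=1)
    ]
```

With something like this, `pages_to_items(split_pdf_pages(pdf_bytes))` would yield one item per page, and each item could then be sent to an OCR step or a multimodal LLM on its own. Rendering pages as images instead of PDFs would need a separate rasterization library, which this sketch does not cover.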

Status: Completed

Board: 💡 Feature Requests

Date: 7 months ago

Author: Dhruv Pamneja
