Pdfinfo inside unix shell

6/17/2023

Note: djvused is part of the djvulibre-bin package and may be installed with sudo apt-get install djvulibre-bin. Note: pdfinfo is part of poppler-utils and should come preinstalled on Ubuntu. Run a cell with the following command first:apt-get install poppler-utils Heres a complete example notebook that installs deps, downloads an example PDF, and then uses pdf2image to convert it to an image for display. PDF pdfinfo sample.pdf | grep -oP '(?<=Pages: )*' ODT unzip -p sample.odt meta.xml | grep -oP '(?<=page-count=")*' In other words, users will require some knowledge of command line usage in order to be able to. The (-n) when used in conjunction with (p)rint will avoid repetition of line printing. Sed - matches values found in 'Count' strings. Note: wvSummary (case-sensitive!) is part of the wv package. PDFInfo is a command-line application that will allow you to view a PDF documents information. strings - grabs all strings from PDF binary. WvSummary sample.ppt | grep -oP '(?<=of Slides = )*'

Fetching the following informations and make them easy accessible: Producer Creation date Modified date Tagged Form Pages Encrypted Page size Width as points Height as points Format Rotated Degrees Box (Media, Crop, Bleed, Trim, Art) X coordinate as points Y. Note: unzip can be installed with sudo apt-get install unzip.ĭOC/PPT wvSummary sample.doc | grep -oP '(?<=of Pages = )*' A little PHP wrapper around the Xpdf cli tool: pdfinfo. With your help I was able to compile a list of commands that can extract the page count from almost all relevant office documents:ĭOCX/PPTX unzip -p 'sample.docx' docProps/app.xml | grep -oP '(?).*(?=\)'

0 Comments

Pdfinfo inside unix shell

Leave a Reply.

Author

Archives

Categories