Document new uses of -process-struct-trees

2025-06-05 22:09:39 +02:00 · 2025-03-05 13:32:15 +00:00
parent fa535db035
commit a5bfacaeac
2 changed files with 7 additions and 3 deletions
--- a/cpdfmanual.pdf
+++ b/cpdfmanual.pdf
--- a/cpdfmanual.tex
+++ b/cpdfmanual.tex
@ -1,4 +1,3 @@
-%Document new places one can use -process-struct-trees
 %Document %objnum in -extract-images
 %\DocumentMetadata{lang=en, pdfversion=2.0, pdfstandard=ua-2, pdfstandard=a-4f, testphase={phase-III, title, table, math, firstaid}}
 \documentclass{book}
@ -841,6 +840,8 @@ at least five pages.

  \end{framed}

+\noindent If the file has a structure tree (a.k.a Tagged PDF), it will be preserved whole. To trim the structure tree to only include the output pages, and so save space, add \texttt{-process-struct-trees} to the command line.
+
 \index{decryption}
  \section{Working with Encrypted Documents}
 \index{owner password}
@ -2162,7 +2163,8 @@ The option \texttt{-squeeze-no-pagedata} avoids the reprocessing of page data, w

  \vspace{1.5mm}
  \small\noindent\verb!cpdf -table-of-contents [-toc-title] [-toc-no-bookmark] [-toc-dot-leaders]!\\
-  \small\noindent\verb!     [-font <font>] [-font-size <size>] in.pdf -o out.pdf!
+  \small\noindent\verb!     [-font <font>] [-font-size <size>] [-process-struct-trees]!\\
+  \small\noindent\verb!     in.pdf -o out.pdf!

  \end{framed}

@ -2332,6 +2334,8 @@ Cpdf can automatically generate a table of contents from existing bookmarks, add
   \noindent\small\verb!cpdf -table-of-contents -toc-no-bookmark in.pdf -o out.pdf!
  \end{framed}

+\noindent To create a structure tree for the table of contents, and merge it with the existing one (for example, when adding a table of contents to a PDF/UA file), add \texttt{-process-struct-trees} to the command.
+
 \ \ \

 \clearpage\pagestyle{empty}
@ -4024,7 +4028,7 @@ Other images are written as PNGs, processed with either ImageMagick's ``magick''

 \noindent The \texttt{-im} or \texttt{-p2p} option is used to give the path to the external tool, one of which must be installed (unless \texttt{-raw} is added, which outputs instead just JPEG or plain \texttt{.pnm} files).

-The output specifier, e.g \verb!-o output/%%%! gives the number format for numbering the images. Output files are named serially from 0, and include the page number too. For example, output files might be called \texttt{output/000-p1.jpg}, \texttt{output/001-p1.png}, \texttt{output/002-p3.jpg} etc. Here is an example invocation:
+The output specifier, e.g \verb!-o output/%%%! gives the number format for numbering the images. Output files are named serially from 0, and include the page number too. For example, output files might be called \texttt{output/000-p1.jpg}, \texttt{output/001-p1.png}, \texttt{output/002-p3.jpg} etc. The specification \texttt{\%objnum} may also be used to insert the object number of the image. Here is an example invocation:

  \begin{framed}
  \noindent\small\verb@cpdf -extract-images in.pdf -im magick -o output/%%%@