OpenSystemsDevelopment / qpdf

15 Aug, 2025

1 commit

Add error handling for missing or invalid Resources and invalid or duplicate annotations in page objects

- Repair invalid or missing Resources in page object trees with warnings
- Remove invalid Annots arrays with warnings
- Warn about duplicate annotations
- Update test cases and output to reflect new error handling.
- Improve robustness for annotation and resource validation.

authored

2025-08-15 16:33:17 +0100


14 Aug, 2025

3 commits

Remove temporary file accidentally included in #1518
ccc3f98b

m-holger authored
2025-08-14 14:49:51 +0100
Revert temporary renaming of `qpdf_pages_fuzzer`. ...
36979440
```
Fuzzer was temporarily renamed in #1466 in order to allow a (fixed) time-out to age-out.
```
m-holger authored
2025-08-14 13:00:55 +0100
Refactor `QPDFWriter`: remove unused `stream_decode_level` check in conditional logic in call to `initializeSpecialStreams`.
1f7a6be4
m-holger authored
2025-08-14 12:00:26 +0100

07 Aug, 2025

1 commit

Disallow `--deterministic-id` with encrypted output and improve error handling for deterministic ID generation (fixes #1235).
b46d4b98
m-holger authored
2025-08-07 19:09:16 +0100

02 Aug, 2025

1 commit

Improve linearization logic to handle invalid thumbnails gracefully and fix thumbnail processing loop to ensure consistency.
3471c7c7
m-holger authored
2025-08-02 14:48:13 +0100

31 Jul, 2025

1 commit

Enhance `QPDF_objects` to ignore excessively large object stream IDs in xref streams, improving robustness against damaged PDFs.
4b70fc3c
m-holger authored
2025-07-31 18:30:08 +0100

30 Jun, 2025

1 commit

Extend xref reconstruction sanity checks ...

37b32f3d

After xref reconstruction treat the input file as suspect and apply sanity checks to all subsequent object reads.

Remove `in_xref_reconstruction` flag and update references to use `reconstructed_xref` for simplified state management during xref processing. Adjust warnings for invalid dictionary keys in test output.

authored

2025-06-30 18:23:03 +0100


13 Jun, 2025

1 commit

Improve sanity checks and error handling in `QPDFParser` ...

5b443012

Enhanced handling of unexpected tokens during xref table reconstruction. Adjusted logic for invalid tokens, making the parsing of corrupt PDF files more robust.

authored

2025-06-13 17:33:24 +0100


10 Jun, 2025

1 commit

Enhance error handling for unexpected tokens during sanity checks ...

d68f45e0

Implemented stricter sanity checks to handle unexpected tokens like array/dictionary close and endobj/endstream more effectively. Improved warning messages and handling of corrupt objects to enhance PDF parsing robustness.

authored

2025-06-10 20:10:31 +0100


11 May, 2025

1 commit

Add unit tests for compression handling on empty streams
3470c5f1

TinyServal authored
2025-05-11 13:45:32 +0100

27 Apr, 2025

1 commit

Fix QPDFFormFieldObjectHelper::getChoices (fixes #1433) ...

c46cfae7

Return the display value if the choices entry is an array of strings rather
than a single string.

Test file is need-appearances.pdf modified to contain one array entry.

authored

2025-04-27 10:54:06 +0100


06 Apr, 2025

1 commit

Add --jpeg-quality-level flag (fixes #488) ...
021edd02
```
Thanks to github user @cdosborn for the basic enhancement.
```
Jay Berkenbilt authored
2025-04-06 08:42:45 -0400

05 Apr, 2025

2 commits

Fix logic around cleartext metadata (fixes #1368) ...
8720065c
```
Only top-level XMP metadata is supposed to be left unencrypted. All
other metadata is not treated specially.
```
Jay Berkenbilt authored
2025-04-05 18:06:19 -0400
Allow rotate as array in job JSON (fixes #1401)
a160bd4e

Jay Berkenbilt authored
2025-04-05 09:35:00 -0400

30 Mar, 2025

2 commits

Fix offsets in QPDF::resolveObjectsInStream warnings ...
e3b77e43
```
As discussed in #1396.
```
m-holger authored
2025-03-30 13:22:11 +0100
Enhance --rotate usage message (fixes #1410) ...
249427ea
```
Also, silently fix any angle that is a multiple of 90.
```
m-holger authored
2025-03-30 11:37:15 +0100
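The note on 249427ea says that any angle that is a multiple of 90 is silently fixed. A minimal Python sketch of one way to read that, assuming "fix" means folding the angle into the range 0 to 270; `normalize_rotation` is a hypothetical helper for illustration and not part of qpdf:

```python
def normalize_rotation(angle: int) -> int:
    """Fold a rotation angle into one of 0, 90, 180 or 270.

    Assumption (not confirmed by the commit message): angles that are not
    multiples of 90 are rejected, and all others are reduced modulo 360.
    """
    if angle % 90 != 0:
        raise ValueError(f"rotation angle must be a multiple of 90, got {angle}")
    # Python's % always returns a non-negative result for a positive modulus,
    # so negative angles such as -90 map to 270.
    return angle % 360
```

Under these assumptions, `normalize_rotation(450)` gives 90 and `normalize_rotation(-90)` gives 270.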

26 Mar, 2025

1 commit

Add new CLI option --remove-structure ...
464d94af
```
... to remove the /Root /StructTreeRoot and /MarkInfo entries.
```
m-holger authored
2025-03-26 23:30:44 +0000

25 Mar, 2025

1 commit

Fix parsing of object streams ...

b37fc717

... containing objects with no white-space between them.

To enforce the rule that objects end at the start-offset of the next
object, each object is parsed in its own object stream.

To facilitate this, a new private API input source, is::OffsetBuffer, has
been added, which contains only the object but reports offsets relative to
the start of the object stream. This is adapted from OffsetInputSource by
changing the direction of the offset, endowing it with its own
BufferInputSource and stripping out checks duplicated in BufferInputSource.

Fixes the expected failure in the test case added in #1266.

authored

2025-03-25 10:40:01 +0000

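The rule stated in b37fc717, that each object ends at the start offset of the next one, can be illustrated with a short Python sketch. This is not qpdf's C++ implementation; `split_objects` is a hypothetical helper, and it assumes the offsets are relative to the start of the object data and that the last object runs to the end of the data:

```python
from typing import List


def split_objects(body: bytes, offsets: List[int]) -> List[bytes]:
    """Slice object-stream data into one buffer per object.

    Each object is taken to end exactly where the next one starts, so
    objects with no white-space between them are still separated.
    """
    # Sort the offsets and add the end of the data as the final boundary.
    bounds = sorted(offsets) + [len(body)]
    return [body[start:end] for start, end in zip(bounds, bounds[1:])]
```

For example, `split_objects(b"<</A 1>>[1 2](x)", [0, 8, 13])` yields three separate buffers even though the objects are not separated by white-space.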

24 Mar, 2025

1 commit

Fix object stream error/warning messages reporting wrong object id ...

1bce5c4f

This was due to the use of last_object_description, which is not set for
the object stream itself.

Also, modify the messages introduced in #1391 and #1392 to report the supposed
offset of the objects.

authored

2025-03-24 21:57:16 +0000


10 Mar, 2025

2 commits

Refine recovery from missing startxref (fixes #1335) ...
7927241d
```
If startxref cannot be found in the last 1024 bytes, try finding it in the
whole file and check whether it is valid.
```
m-holger authored
2025-03-10 18:26:14 +0000
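The recovery order described in 7927241d (tail of the file first, then the whole file with a validity check) can be sketched in Python as follows. This is an illustration only, not qpdf's code: `find_startxref` is a hypothetical name, and the validity check used here (the stated offset must lie inside the file) is an assumption, since the commit message does not say what the check is.

```python
from typing import Optional

KEYWORD = b"startxref"


def find_startxref(data: bytes, tail_size: int = 1024) -> Optional[int]:
    """Return the xref offset that follows a usable startxref keyword."""

    def parse_at(pos: int) -> Optional[int]:
        # The keyword is followed by white-space and a decimal byte offset.
        rest = data[pos + len(KEYWORD):].split(None, 1)
        if not rest or not rest[0].isdigit():
            return None
        offset = int(rest[0])
        # Assumed validity check: the offset must point inside the file.
        return offset if offset < len(data) else None

    # First attempt: look only at the last `tail_size` bytes.
    pos = data.rfind(KEYWORD, max(0, len(data) - tail_size))
    if pos != -1:
        return parse_at(pos)

    # Fallback: walk backwards through the whole file and take the first
    # occurrence that passes the validity check.
    end = len(data)
    while True:
        pos = data.rfind(KEYWORD, 0, end)
        if pos == -1:
            return None
        offset = parse_at(pos)
        if offset is not None:
            return offset
        end = pos
```

A file with more than 1024 bytes of trailing garbage after `%%EOF` takes the fallback path in this sketch.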

Refactor xref table reconstruction (Fixes #1362) ...

649709a8

Split reconstruction into three passes - scanning of input for objects and
trailer, insertion of objects into the xref table, and loading the trailer.

This allows insertion to take place in the usual reverse order and removes
the need for a separate insertReconstructedXrefEntry method.

It also allows trailers to be tried from most recent to oldest.

Ignore any found trailers without a /Root entry.

authored

2025-03-10 15:12:28 +0000


07 Mar, 2025

1 commit

Enhance error checking of object stream object ids and offsets ...

f06209ca

The original test file contains multiple entries with id 0 and offset 0.
One entry has been modified such that the id is valid (6).

Object streams with invalid offsets are a source of unreproducible
oss-fuzz time-outs.

authored

2025-03-07 20:27:54 +0000


15 Feb, 2025

1 commit

Exclude cygwin from fix-qdf pipe test ...
da42078d
```
Also add debugging information so we can save time if $^O used in
GitHub Actions changes again.
```
Jay Berkenbilt authored
2025-02-15 10:52:32 -0500

07 Feb, 2025

1 commit

Refine QPDFParser error handling ...

8df3de5c

Reduce the container size for which a single bad token will cause a failure
from 100,000 to 5,000.

Count missing dictionary keys as errors.

authored

2025-02-07 23:41:56 +0000


04 Feb, 2025

1 commit

Add zopfli support (fixes #1323) ...
133da3b6
```
This requires a special build option.
```
Jay Berkenbilt authored
2025-02-04 06:17:34 -0500

03 Feb, 2025

1 commit

fix-qdf: accept optional output file (fixes #1330)
a2fc5b52

Jay Berkenbilt authored
2025-02-03 06:42:22 -0500

02 Feb, 2025

2 commits

Refine xref reconstruction (fixes #1335) ...

ca3ea2e3

When recovering XRef streams, start with the stream with the largest
/Size rather than the largest offset.

Also, if reconstruction fails to find a trailer with a valid /Root entry,
search for a root object.

authored

2025-02-02 21:14:08 +0000


Merge pull request #1340 from m-holger/i1286 ...
aa583f29
```
Change QPDFWriter stream_decode_level default to qpdf_dl_generalized (fixes #1286)
```
m-holger authored
2025-02-02 21:03:04 +0000

01 Feb, 2025

1 commit

CLI reject flags with parameters (fixes #1329)
985cdf91

m-holger authored
2025-02-01 12:34:57 +0000

31 Jan, 2025

3 commits

Add new object stream test case ...
c026b511
```
Exercise stream containing objects with no white-space between them.
```
m-holger authored
2025-01-31 19:22:06 +0000
Change QPDFWriter stream_decode_level default to qpdf_dl_generalized ...
718b1400
```
Also, fix disabling of preserve_encryption to ignore
stream_decode_level, but disable preserve_encryption if compress_streams is
false.

Fixes #1286
```
m-holger authored
2025-01-31 16:09:07 +0000
In QPDFWriter::willFilterStream, on runtime error on first attempt, retry without filtering
ff0affd8
m-holger authored
2025-01-31 15:34:02 +0000

28 Jan, 2025

4 commits

Add copy annotation test ...
73af7567
```
Test fixing /P entry.
```
m-holger authored
2025-01-28 16:26:21 +0000
Merge pull request #1307 from m-holger/pages ...
bde5a446
```
Fix QPDF::getAllPagesInternal warning
```
m-holger authored
2025-01-28 15:59:52 +0000
Fix QPDF::copyForeignObject warning ...
cc95f473
```
Provide correct obj_gen and offset.
```
m-holger authored
2025-01-28 11:01:18 +0000
Fix QPDF::getAllPagesInternal warning ...
b7bf9f3d
```
Provide correct obj_gen.
```
m-holger authored
2025-01-28 10:15:15 +0000

16 Jan, 2025

2 commits

Revert "Merge pull request #1272 from m-holger/xref_table" ...
0d5c57c1
```
This reverts commit ff2a78f579ebdd06b417e34260a17dba06e71137, reversing
changes made to 8f54319f7a6514110f4b05cbbf1cb1c9fc8cb6a0.
```
m-holger authored
2025-01-16 16:40:08 +0000
Revert "Merge pull request #1289 from m-holger/fuzz" ...
f1800410
```
This reverts commit 0e92cf6bf399249c603c3d0212e898fd29e71fcd, reversing
changes made to 7d34b89a69e8e89c098dd373442f7df809c28eff.
```
m-holger authored
2025-01-16 16:36:48 +0000

05 Jan, 2025

1 commit

Tweak test files to work around fixed ghostscript bug ...

531f6877

Ghostscript 10.0.2 failed to handle the files changed in this commit,
but ghostscript 10.0.4 handles them fine as do earlier versions. These
files all have hybrid xref in the form of a file with an xref table
appended with a section that has an xref stream. They all have
/PageLabels pointing to 107 0 R in the original file, with 107 higher
than the highest object. The spec says that this should be treated as
null, which results in /PageLabels null, which results in ghostscript
errors in that version. While ghostscript 10.0.2 may be handling the
file incorrectly, the file does something that's not really kosher,
and it's easier to fix the files, which had not been changed since the
very first open source release of qpdf, than to try to work around the
issue.

This was discovered when the GitHub Actions runner was bumped to
Ubuntu 24.04, which contains the buggy version of ghostscript. I was
not able to find a specific ghostscript issue that addressed this, but
the problem went away in either 10.0.3 or 10.0.4.

Commenting out /PageLabels without changing offsets was a pragmatic
move to avoid having to regenerate the xref tables manually. I just
had to manually edit the binary xref stream to change the offset of
one item (the new object 1), which I put at the end to avoid breaking
other things.

authored

2025-01-05 17:29:03 -0500
