-
- Introduce `limits_error` method in `QPDFParser` for centralized limit-related error handling. - Enhance warnings and error messages with detailed limit identifiers (e.g., `parser-max-nesting`). - Refactor limit checks to improve maintainability and ensure uniformity in error reporting. - Update tests and output to reflect adjusted error handling approach.
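The centralized error path described above can be sketched as follows. This is a hypothetical illustration only; the class and method names here (other than the `parser-max-nesting` identifier quoted in the entry) are assumptions, not qpdf's actual API.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical sketch: one helper formats every limit-related parser
// error uniformly, tagging it with a stable identifier such as
// "parser-max-nesting". Names are illustrative, not qpdf's real API.
class ParserLimits
{
  public:
    static std::string
    limits_error(std::string const& limit_id, std::string const& detail)
    {
        // Every limit violation goes through this single code path so
        // the message format and identifier are consistent everywhere.
        return "limit exceeded (" + limit_id + "): " + detail;
    }
};
```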
-
…ted tests, documentation, and references across the codebase.
-
Enhance the `global` namespace by introducing `limit_errors` for tracking the number of exceeded limits. Update related tests and documentation to ensure functionality and clarity.
-
…sting return values for uninitialized objects, and cleaning up error handling logic.
-
…ndles on invalid input.
-
…eplacing redundant logic with centralized functions, and streamlining bad token handling for improved readability and maintainability.
-
…raints, replacing hardcoded values for nesting, container size, and error limits.
-
…oved shared pointer handling, remove deprecated object methods, and update all references.
-
Relocate `Objects` to `QPDF::Doc` for improved encapsulation of object-related logic. Adjust all relevant methods and references to use the new placement.
-
Relocate `reconstructed_xref` to `QPDF::Doc` for improved encapsulation of cross-reference reconstruction state. Adjust all references to use the updated placement.
-
Relocate `ParseGuard` to `QPDF::Doc` for better encapsulation of parsing logic. Adjust references in `QPDFParser` accordingly to use the new placement.
-
…std::string_view`, improving performance and code clarity.
-
Enhanced handling of unexpected tokens during xref table reconstruction. Adjusted logic for invalid tokens, ensuring better robustness when parsing corrupt PDF files.
-
Implemented stricter sanity checks to handle unexpected tokens like array/dictionary close and endobj/endstream more effectively. Improved warning messages and handling of corrupt objects to enhance PDF parsing robustness.
-
Converted multiple occurrences of `count()` to `contains()` throughout the codebase where the goal was to check key existence in containers. This improves code readability and aligns with modern C++ practices, particularly with C++20, making the intent more explicit and potentially aiding performance.
-
Reject objects containing arrays or dictionaries with more than 5000 elements. We are by definition dealing with damaged files, and such objects are extremely likely to be invalid or malicious.
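The sanity check described above amounts to a simple size test once the parser is in damage-recovery mode. A minimal sketch, assuming an illustrative constant and function name (only the 5000 limit comes from the entry):

```cpp
#include <cstddef>

// Hypothetical sketch of the container-size sanity check: when
// recovering from a damaged file, any array or dictionary with more
// than 5000 elements is rejected as almost certainly invalid or
// malicious. Constant and function names are illustrative.
constexpr std::size_t max_damaged_container_size = 5000;

inline bool
container_too_large(std::size_t n_elements)
{
    return n_elements > max_damaged_container_size;
}
```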
-
... containing objects with no white-space between them. To enforce the rule that objects end at the start-offset of the next object, each object is parsed in its own object stream. To facilitate this, a new private API input source `is::OffsetBuffer` has been added which only contains the object but reports offsets relative to the start of the object stream. This is adapted from OffsetInputSource by changing the direction of the offset, endowing it with its own BufferInputSource and stripping out checks duplicated in BufferInputSource. Fixes the expected failure in the test case added in #1266.
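The offset translation described above can be sketched as a buffer that holds only one object's bytes but reports positions relative to the start of the containing object stream. This is a hypothetical illustration under assumed names, not the actual `is::OffsetBuffer` implementation.

```cpp
#include <cstddef>

// Hypothetical sketch: the buffer holds only one object's bytes, but
// positions reported to callers are relative to the start of the
// containing object stream. Names and members are illustrative.
class OffsetBufferSketch
{
  public:
    OffsetBufferSketch(std::size_t object_offset, std::size_t object_size) :
        object_offset_(object_offset),
        object_size_(object_size)
    {
    }

    // tell() adds the object's offset within the stream -- the reverse
    // direction from an OffsetInputSource-style wrapper, which
    // subtracts a global offset from the underlying position.
    std::size_t
    tell() const
    {
        return object_offset_ + pos_;
    }

    bool
    advance(std::size_t n)
    {
        if (pos_ + n > object_size_) {
            return false; // cannot read past the end of the object
        }
        pos_ += n;
        return true;
    }

  private:
    std::size_t object_offset_;
    std::size_t object_size_;
    std::size_t pos_{0};
};
```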
-
Add static parse methods. Make all external access to QPDFParser through static methods. Make all non-static methods including constructors private.
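The access pattern described above looks roughly like this. The class and member names are illustrative, not QPDFParser's actual interface:

```cpp
#include <string>
#include <utility>

// Sketch of the pattern: the constructor is private, so the static
// parse method is the only public entry point; it owns the parser
// instance for the duration of the call. Names are illustrative.
class Parser
{
  public:
    static std::string
    parse(std::string const& input)
    {
        Parser p(input);
        return p.run();
    }

  private:
    explicit Parser(std::string input) :
        input_(std::move(input))
    {
    }

    std::string
    run()
    {
        return "parsed:" + input_;
    }

    std::string input_;
};
```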
-
Only build strings when needed.
-
#1349 introduced a limit on the maximum size of arrays and dictionaries contained in objects that generate errors during parsing, and #1354 reduced that limit to 5000 objects. However, the limit was only imposed once a further error was encountered. Stop adding objects to containers once the limit is reached. Fixes oss-fuzz issue 398060137
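The shape of the fix can be sketched as a guard on insertion rather than a check deferred until the next error. Everything below except the 5000 limit is an assumed, illustrative name:

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical sketch of the fix: instead of testing the container
// size only when a later error occurs, refuse to append once the
// limit is reached, so a damaged file cannot grow a container
// without bound. Names are illustrative.
constexpr std::size_t max_container_size = 5000;

inline bool
try_append(std::vector<std::string>& items, std::string value)
{
    if (items.size() >= max_container_size) {
        // Drop the element rather than growing past the limit.
        return false;
    }
    items.push_back(std::move(value));
    return true;
}
```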
-
This improves indentation of long strings. This commit also fixes some trailing whitespace in ChangeLog.
-
Reduce the container size for which a single bad token will cause a failure from 100,000 to 5,000. Count missing dictionary keys as errors.
-
Fail if a bad token is encountered while parsing an array or dictionary with more than 100,000 elements. Fixes oss-fuzz case 388571629.
-
Currently, QPDFParser gives up attempting to parse an object if 5 near-consecutive bad tokens are encountered. Add a limit of a total of 15 bad tokens in a single object before giving up.
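The two thresholds combine into a small two-counter policy, sketched below. The class name and reset rule are assumptions for illustration; only the limits of 5 near-consecutive and 15 total bad tokens come from the entry.

```cpp
#include <cstddef>

// Sketch of the two-counter policy: give up after 5 near-consecutive
// bad tokens or 15 bad tokens in total within one object. Names and
// the reset rule are illustrative.
class BadTokenBudget
{
  public:
    // Returns true when parsing should stop.
    bool
    record_bad_token()
    {
        ++consecutive_;
        ++total_;
        return consecutive_ >= 5 || total_ >= 15;
    }

    void
    record_good_token()
    {
        // A good token resets the consecutive run but not the total.
        consecutive_ = 0;
    }

  private:
    std::size_t consecutive_{0};
    std::size_t total_{0};
};
```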
-
Create unresolved objects only for objects in the xref table (except during parsing of the xref table). Do not add indirect nulls into the object cache as the result of a cache miss during a call to `getObject` except during parsing or creation/updating from JSON. To support this behaviour, add new private methods `getObjectForParser` and `getObjectForJSON`. As a result of this change, dangling references are treated as direct nulls rather than indirect nulls.
-
Also, don't search for /Contents name unless the result is used.
-
Also, fix test cases.