OpenSystemsDevelopment / qpdf

23 Apr, 2025

1 commit

During xref reconstruction reject unreasonably large objects ...

Reject objects containing arrays or dictionaries with more than 5000
elements. We are by definition dealing with damaged files, and such
objects are extremely likely to be invalid or malicious.
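
The check described above can be pictured with a minimal sketch. `NodeSketch`, `max_elements` and `is_reasonable` are hypothetical names chosen for illustration and are not part of qpdf; only the figure of 5000 comes from the commit message.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a parsed array or dictionary; not a qpdf type.
struct NodeSketch
{
    std::vector<NodeSketch> children;
};

// Limit quoted in the commit message.
constexpr std::size_t max_elements = 5000;

// Returns false if this container, or any container nested inside it,
// holds more than max_elements entries.
bool
is_reasonable(NodeSketch const& node)
{
    if (node.children.size() > max_elements) {
        return false;
    }
    for (auto const& child: node.children) {
        if (!is_reasonable(child)) {
            return false;
        }
    }
    return true;
}
```

In this sketch a container with exactly 5000 elements is still accepted, and an oversized container causes every object that contains it to be rejected.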

authored

2025-04-23 16:10:52 +0100


25 Mar, 2025

3 commits

Fix parsing of object streams ...

b37fc717

... containing objects with no white-space between them.

To enforce the rule that objects end at the start-offset of the next
object, each object is parsed in its own object stream.

To facilitate this, a new private API input source, is::OffsetBuffer, has
been added. It contains only the object but reports offsets relative to
the start of the object stream. It is adapted from OffsetInputSource by
changing the direction of the offset, endowing it with its own
BufferInputSource and stripping out checks duplicated in BufferInputSource.

Fixes the expected failure in the test case added in #1266.
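
A rough sketch of the idea, under stated assumptions: `OffsetBufferSketch` and its members are hypothetical and do not reproduce the real is::OffsetBuffer or InputSource interfaces. It holds only one object's bytes, yet reports positions relative to the start of the enclosing object stream.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <utility>

// Hypothetical illustration; not the actual is::OffsetBuffer class.
class OffsetBufferSketch
{
  public:
    OffsetBufferSketch(std::string object_bytes, std::size_t start_in_stream) :
        data_(std::move(object_bytes)),
        start_(start_in_stream)
    {
    }

    // Current position, expressed relative to the start of the object stream.
    std::size_t
    tell() const
    {
        return start_ + pos_;
    }

    // Seek to an object-stream offset, clamped to this object's own bytes.
    void
    seek(std::size_t stream_offset)
    {
        std::size_t local = stream_offset > start_ ? stream_offset - start_ : 0;
        pos_ = std::min(local, data_.size());
    }

    // Read up to n bytes. The buffer ends where the next object starts,
    // so a read can never run on into the following object.
    std::string
    read(std::size_t n)
    {
        std::string out = data_.substr(pos_, n);
        pos_ += out.size();
        return out;
    }

  private:
    std::string data_;
    std::size_t start_;
    std::size_t pos_ = 0;
};
```

Because the buffer contains a single object, two objects written with no white-space between them cannot be confused, while any offset used in a warning still refers to the object stream as a whole.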

authored

2025-03-25 10:40:01 +0000


Refactor calls to QPDFParser::parse ...

8b0eaaf7

Add static parse methods. Make all external access to QPDFParser through
static methods.

Make all non-static methods including constructors private.
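
The pattern described above (static entry points, everything else private) might look roughly like this. `ParserSketch` is a hypothetical toy that merely counts white-space separated tokens; it does not mirror the real QPDFParser signatures.

```cpp
#include <cstddef>
#include <sstream>
#include <string>

// Hypothetical illustration of the pattern; not the QPDFParser API.
class ParserSketch
{
  public:
    // The only way in: callers never construct a parser themselves.
    static std::size_t
    parse(std::string const& input)
    {
        return ParserSketch(input).run();
    }

  private:
    explicit ParserSketch(std::string const& input) :
        input_(input)
    {
    }

    // Toy "parse": count white-space separated tokens.
    std::size_t
    run() const
    {
        std::istringstream in(input_);
        std::string token;
        std::size_t count = 0;
        while (in >> token) {
            ++count;
        }
        return count;
    }

    std::string const& input_;
};
```

Keeping the constructor private means the parser object never outlives a single call, so its internal state can be changed later without affecting callers.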

authored

2025-03-25 02:36:03 +0000


Refactor object stream warnings and object descriptions ...
626d5061
```
Only build strings when needed.
```
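
One common way to build strings only when needed is to pass a callable that formats the message on demand. The sketch below assumes that approach; `WarningSinkSketch` is a hypothetical name, and the commit may well use a different mechanism.

```cpp
#include <functional>
#include <string>
#include <vector>

// Hypothetical illustration; not taken from qpdf.
struct WarningSinkSketch
{
    bool enabled = true;
    std::vector<std::string> messages;

    // The message arrives as a callable, so the (possibly expensive)
    // formatting runs only if the warning is actually recorded.
    void
    warn(std::function<std::string()> const& build)
    {
        if (enabled) {
            messages.push_back(build());
        }
    }
};
```

When `enabled` is false the callable is never invoked, so no description string is created at all.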
m-holger authored
2025-03-25 02:04:37 +0000

06 Mar, 2025

3 commits

Deprecate QPDFObjectHandle::parse overload and undeprecate isInitialized
cc11285e

m-holger authored
2025-03-06 16:33:28 +0000
Add new method QPDFParser::make_description ...
8b379756
```
Avoid creating new identical descriptions for each content stream token.
```
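
A minimal sketch of sharing one description across many tokens, assuming a shared pointer is used for the sharing. `make_description_sketch`, `TokenSketch` and `tokenize_sketch` are hypothetical and do not reproduce the signature of QPDFParser::make_description.

```cpp
#include <memory>
#include <string>
#include <vector>

// Hypothetical illustration; names and signatures are not qpdf's.
using DescriptionSketch = std::shared_ptr<std::string const>;

DescriptionSketch
make_description_sketch(std::string const& stream_name)
{
    return std::make_shared<std::string>("content stream " + stream_name);
}

struct TokenSketch
{
    std::string value;
    DescriptionSketch description;
};

// The description is created once and every token points at the same copy.
std::vector<TokenSketch>
tokenize_sketch(std::vector<std::string> const& raw, std::string const& stream_name)
{
    DescriptionSketch shared = make_description_sketch(stream_name);
    std::vector<TokenSketch> out;
    for (auto const& value: raw) {
        out.push_back({value, shared});
    }
    return out;
}
```

Each token then costs one pointer copy rather than a newly allocated, identical string.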
m-holger authored
2025-03-06 15:48:25 +0000
Use Tokenizer instead of QPDFTokenizer internally in qpdf ...
0518d585
```
Also remove some shared pointers and use std::string instead of Pl_Buffer
in Pl_QPDFTokenizer.
```
m-holger authored
2025-03-06 15:45:52 +0000

04 Mar, 2025

1 commit

In QPDFParser access qpdf::Tokenizer directly ...
00b59979
```
Remove remaining QPDFTokenizer private methods.
Remove QPDFTokenizer privileged access to Tokenizer.
```
m-holger authored
2025-03-04 10:18:53 +0000

02 Mar, 2025

2 commits

Change Array to use std::vector<QPDFObjectHandle> for storage
eb629671

m-holger authored
2025-03-02 20:51:32 +0000
Refactor QPDFObject to use std::variant instead of std::shared_ptr
8d7ed764

m-holger authored
2025-03-02 20:45:49 +0000
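
For readers unfamiliar with the technique named in the commit title above, here is a minimal sketch of holding several value types in one std::variant. All type names (`NullSketch`, `IntegerSketch`, `NameSketch`, `ValueSketch`) are hypothetical and say nothing about how QPDFObject is actually laid out.

```cpp
#include <string>
#include <variant>

// Hypothetical value types; not the real qpdf object classes.
struct NullSketch
{
};
struct IntegerSketch
{
    long long value;
};
struct NameSketch
{
    std::string value;
};

// One variant stores whichever alternative is active, in place,
// without a separate heap allocation per value.
using ValueSketch = std::variant<NullSketch, IntegerSketch, NameSketch>;

std::string
type_name(ValueSketch const& v)
{
    struct Visitor
    {
        std::string operator()(NullSketch const&) const { return "null"; }
        std::string operator()(IntegerSketch const&) const { return "integer"; }
        std::string operator()(NameSketch const&) const { return "name"; }
    };
    return std::visit(Visitor{}, v);
}
```

Dispatch goes through `std::visit` rather than virtual calls on a pointed-to object.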

28 Feb, 2025

1 commit

Refine QPDFParser error handling ...

40f601df

#1349 introduced a limit on the maximum size of arrays and dictionaries
contained in objects that generate errors during parsing, and #1354
reduced that limit to 5000 objects. However, the limit was only imposed
once a further error was encountered.

Stop adding objects to containers once the limit is reached.

Fixes oss-fuzz issue 398060137
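
The behaviour change can be sketched as follows. This is a simplified model, not qpdf code: `ContainerBuilderSketch`, `errors_seen` and `add` are hypothetical names, and the only figure taken from the commit message is the limit of 5000.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical illustration; not the QPDFParser implementation.
struct ContainerBuilderSketch
{
    static constexpr std::size_t max_size = 5000;

    bool errors_seen = false; // set once the enclosing object has produced an error
    std::vector<int> items;

    // Returns false, and stores nothing, once the limit has been reached
    // for an object that has already generated errors. The check happens on
    // every add instead of waiting for the next error.
    bool
    add(int item)
    {
        if (errors_seen && items.size() >= max_size) {
            return false;
        }
        items.push_back(item);
        return true;
    }
};
```

In this model an object that has already produced an error can never grow a container beyond 5000 elements, however long the remaining input is.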

authored

2025-02-28 19:42:40 +0000


08 Feb, 2025

1 commit

Bump clang-format to version 20 and reformat ...
38d8cc7f
```
This improves indentation of long strings. This commit also fixes some
trailing whitespace in ChangeLog.
```
Jay Berkenbilt authored
2025-02-08 11:17:57 -0500

19 Sep, 2024

1 commit

In QPDFParser add a limit on total number of errors in one object ...

06a2d955

Currently, QPDFParser gives up attempting to parse an object if 5
near-consecutive bad tokens are encountered. Add a limit of a total of 15
bad tokens in a single object before giving up.
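
The two limits can be modelled with a pair of counters. The sketch makes two loud assumptions: "near-consecutive" is simplified to strictly consecutive (any good token resets the run), and parsing stops on the 5th or 15th bad token respectively. `BadTokenBudgetSketch` is a hypothetical name.

```cpp
// Hypothetical illustration; not the QPDFParser implementation.
struct BadTokenBudgetSketch
{
    static constexpr int max_consecutive = 5; // existing limit
    static constexpr int max_total = 15;      // limit added by this commit

    int consecutive = 0;
    int total = 0;

    // Record one token; returns true while parsing may continue.
    bool
    record(bool token_ok)
    {
        if (token_ok) {
            consecutive = 0; // assumption: a good token ends the bad run
            return true;
        }
        ++consecutive;
        ++total;
        return consecutive < max_consecutive && total < max_total;
    }
};
```

The total limit matters for input that interleaves occasional good tokens with bad ones: the consecutive counter keeps resetting, but the total still reaches 15.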

authored

2024-09-19 17:28:26 +0100


05 Sep, 2024

1 commit

In QPDFParser constructor change input parameter to InputSource&
5d25aac6

m-holger authored
2024-09-05 15:30:32 +0100

06 Aug, 2024

2 commits

Refactor the creation of unresolved objects ...

06001ed2

Create unresolved objects only for objects in the xref table (except during
parsing of the xref table). Do not add indirect nulls into the object
cache as the result of a cache miss during a call to getObject except
during parsing or creation/updating from JSON. To support this behaviour,
add new private methods getObjectForParser and getObjectForJSON.

As a result of this change, dangling references are treated as direct nulls
rather than indirect nulls.

authored

2024-08-06 12:22:09 +0100


In QPDFParser constructor add parameter parse_pdf ...

87ee8ad0

Prepare for treating indirect references differently depending on whether
we are parsing a PDF file (in which case references to objects not in the
xref table are null even if they are in the object cache) or parsing from
user code (in which case an indirect reference can refer to a user-created
object).
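
The distinction can be sketched as a small decision function. Everything here is hypothetical (`RefTargetSketch`, `resolve_sketch`, and the use of plain sets of object ids for the xref table and object cache); only the rule itself is taken from the commit message.

```cpp
#include <set>

// Hypothetical illustration; not the QPDFParser implementation.
enum class RefTargetSketch { null_object, indirect_object };

RefTargetSketch
resolve_sketch(
    int obj_id,
    bool parse_pdf,
    std::set<int> const& xref_ids,
    std::set<int> const& cached_ids)
{
    bool in_xref = xref_ids.count(obj_id) != 0;
    bool in_cache = cached_ids.count(obj_id) != 0;
    if (parse_pdf) {
        // Parsing a PDF file: only the xref table counts, so an object
        // that exists solely in the cache is still treated as null.
        return in_xref ? RefTargetSketch::indirect_object : RefTargetSketch::null_object;
    }
    // Parsing from user code: a user-created object in the cache is a valid target.
    return (in_xref || in_cache) ? RefTargetSketch::indirect_object
                                 : RefTargetSketch::null_object;
}
```

For example, with object 7 present only in the cache, the reference resolves to null when `parse_pdf` is true and to an indirect object when it is false.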

authored

2024-08-06 10:02:07 +0100
