OpenSystemsDevelopment / qpdf

21 Jun, 2019

3 commits

Fix sign and conversion warnings (major) ...

This makes all integer type conversions that have potential data loss
explicit with calls that do range checks and raise an exception. After
this commit, qpdf builds with no warnings when -Wsign-conversion
-Wconversion is used with gcc or clang or when -W3 -Wd4800 is used
with MSVC. This significantly reduces the likelihood of potential
crashes from bogus integer values.

There are some parts of the code that take int when they should take
size_t or an offset. Such places would make qpdf not support files
with more than 2^31 of something that usually wouldn't be so large. In
the event that such a file shows up and is valid, at least qpdf would
raise an error in the right spot so the issue could be legitimately
addressed rather than failing in some weird way because of a silent
overflow condition.

authored

2019-06-21 13:17:21 -0400

Browse File »

Change QPDFObjectHandle::pipeStreamData's encode_flags type ...
da30764b
```
Change from unsigned long to int since we pass enumerated type values
to this field.
```
Jay Berkenbilt authored
2019-06-21 13:17:21 -0400
Browse File »
Add new integer accessors to QPDFObjectHandle
3608afd5

Jay Berkenbilt authored
2019-06-21 13:17:21 -0400
Browse File »

15 Jun, 2019

1 commit

Give up reading objects with too many consecutive errors
cf469d78

Jay Berkenbilt authored
2019-06-15 08:52:19 -0400
Browse File »

20 Apr, 2019

1 commit

Tighten isPageObject (fixes #310)
4ccb2991

Jay Berkenbilt authored
2019-04-20 21:00:43 -0400
Browse File »

31 Jan, 2019

3 commits

Make inline image token exactly contain the image data ...
eb49e07c
```
Do not include the trailing EI, and handle cases where EI is not
preceded by a delimiter. Such cases have been seen in the wild.
```
Jay Berkenbilt authored
2019-01-31 20:28:44 -0500
Browse File »

Refactor QPDFTokenizer's inline image handling ...

ec9e310c

Add a version of expectInlineImage that takes an input source and
searches for EI. This is in preparation for improving the way EI is
found. This commit just refactors the code without changing the
functionality and adds tests to make sure the old and new code behave
identically.

authored

2019-01-31 09:26:37 -0500

Browse File »

Inline image token value ends with EI, not delimiter ...
31372edc
```
The inline image token erroneously included the delimiter that
followed EI. The ObjectHandle created from it was correct.
```
Jay Berkenbilt authored
2019-01-31 09:26:37 -0500
Browse File »

27 Jan, 2019

1 commit

Add QPDFObjectHandle::getUniqueResourceName
8cb24573

Jay Berkenbilt authored
2019-01-27 07:50:30 -0500
Browse File »

25 Jan, 2019

1 commit

Handle inheritable page attributes ...
009767d9
```
Add getAttribute for handling inheritable page attributes, and fix
getPageImages and annotation flattening code to use it.
```
Jay Berkenbilt authored
2019-01-25 22:30:05 -0500
Browse File »

02 Jan, 2019

1 commit

Switch annotation flattening to use the form xobjects ...

f78ea057

Instead of directly putting the contents of the annotation appearance
streams into the page's content stream, add commands to render the
form xobjects directly. This is a more robust way to do it than the
original solution as it works properly with patterns and avoids
problems with resource name clashes between the pages and the form
xobjects.

authored

2019-01-02 21:49:47 -0500

Browse File »

01 Jan, 2019

1 commit

Add QPDFObjectHandle::mergeDictionary()
95d6b17a

Jay Berkenbilt authored
2019-01-01 08:12:56 -0500
Browse File »

31 Dec, 2018

1 commit

Add Matrix class under QPDFObjectHandle
5059ec0d

Jay Berkenbilt authored
2018-12-31 23:02:43 -0500
Browse File »

21 Dec, 2018

1 commit

Add QPDFObjectHandle::getJSON()
30a0c070

Jay Berkenbilt authored
2018-12-21 18:34:56 -0500
Browse File »

18 Dec, 2018

1 commit

Add QPDFObjectHandle::wrapInArray() ...
077d3d45
```
Wrap an object in an array if it is not already an array.
```
Jay Berkenbilt authored
2018-12-18 16:45:48 -0500
Browse File »

22 Jun, 2018

1 commit

Treat content stream parsing errors as an error, not a warning ...

38c9ed23

If parsing content streams is treated as a warning, there is no way
for a caller to know if a parsing operation has failed. This is very
dangerous and will likely result in data loss when token filters are
parser callbacks are in use.

authored

2018-06-22 10:44:08 -0400

Browse File »

21 Jun, 2018

3 commits

Fix QPDFObjectHandle::shallowCopy ...

ddd78c1b

It's not really a shallow copy. It just doesn't cross indirect object
boundaries. The old implementation had a bug that would cause multiple
shallow copies of the same object to share memory, which was not the
intention.

authored

2018-06-21 20:34:45 -0400

Browse File »

Better support for creating Unicode strings
952a665a

Jay Berkenbilt authored
2018-06-21 15:57:13 -0400
Browse File »
Add QPDFObjectHandle::Rectangle type ...
4cded108
```
Provide a convenient way of accessing rectangles.
```
Jay Berkenbilt authored
2018-06-21 15:57:13 -0400
Browse File »

15 Apr, 2018

1 commit

Limit depth of nesting in direct objects (fixes #202) ...
b4d6cf68
```
This fixes CVE-2018-9918.
```
Jay Berkenbilt authored
2018-04-15 16:11:22 -0400
Browse File »

06 Mar, 2018

1 commit

Properly handle pages with no contents (fixes #194) ...

e4e2e26d

Remove calls to assertPageObject(). All cases in the library that
called assertPageObject() work fine if you don't call
assertPageObject() because nothing assumes anything that was being
checked by that call. Removing the calls enables more files to be
successfully processed.

authored

2018-03-06 11:34:07 -0500

Browse File »

18 Feb, 2018

9 commits

More robust handling of type errors ...

d0e99f19

Give objects descriptions and context so it is possible to issue
warnings instead of fatal errors for attempts to access objects of the
wrong type.

authored

2018-02-18 21:06:27 -0500

Browse File »

Push members of QPDFObjectHandle into a Members object ...
21b7481b
```
As in other cases, this is to enable adding new member variables in
the future without breaking ABI compatibility.
```
Jay Berkenbilt authored
2018-02-18 21:06:27 -0500
Browse File »
Simplify TokenFilter interface ...
e410b0fe
```
Expose Pl_QPDFTokenizer, and have it do more of the work of managing
the token filter's pipeline.
```
Jay Berkenbilt authored
2018-02-18 21:05:47 -0500
Browse File »
Add additional interface for filtering page contents
5708b5d0

Jay Berkenbilt authored
2018-02-18 21:05:47 -0500
Browse File »

Implement TokenFilter and refactor Pl_QPDFTokenizer ...

99101044

Implement a TokenFilter class and refactor Pl_QPDFTokenizer to use a
TokenFilter class called ContentNormalizer. Pl_QPDFTokenizer is now a
general filter that passes data through a TokenFilter.

authored

2018-02-18 21:05:46 -0500

Browse File »

Add coalesce contents capability
b8723e97

Jay Berkenbilt authored
2018-02-18 21:05:46 -0500
Browse File »
Refactor parseContentStream
fcd611b6

Jay Berkenbilt authored
2018-02-18 21:05:46 -0500
Browse File »

Remove redundant method ...

05ff619b

Remove a redundant method that was equal to another one with
additional arguments. This breaks binary compatibility, but there are
other ABI breaking changes in the upcoming release, so now is the time
to do it.

authored

2018-02-18 21:05:46 -0500

Browse File »

Use inline image token in content parser
55ee5539

Jay Berkenbilt authored
2018-02-18 21:05:46 -0500
Browse File »

12 Sep, 2017

1 commit

Improve message for stream decoding error ...
d31a7b76
```
Tweak the message so that we inform the user that we are mitigating
data loss.
```
Jay Berkenbilt authored
2017-09-12 16:03:48 -0400
Browse File »

26 Aug, 2017

1 commit

Fix error caught by clang
728dc9e6

Jay Berkenbilt authored
2017-08-26 21:51:17 -0400
Browse File »

25 Aug, 2017

1 commit

Parse iteratively to avoid stack overflow (fixes #146)
ad527a64

Jay Berkenbilt authored
2017-08-25 21:56:45 -0400
Browse File »

22 Aug, 2017

1 commit

Spell check
e452d9dc

Jay Berkenbilt authored
2017-08-22 14:22:20 -0400
Browse File »

21 Aug, 2017

1 commit

Enable finer grained control of stream decoding ...

9744414c

This commit adds several API methods that enable control over which
types of filters QPDF will attempt to decode. It also adds support for
/RunLengthDecode and /DCTDecode filters for both encoding and
decoding.

authored

2017-08-21 17:44:22 -0400

Browse File »

12 Aug, 2017

1 commit

Add page rotation (fixes #132)
cfa2eb97

Jay Berkenbilt authored
2017-08-12 22:57:38 -0400
Browse File »

29 Jul, 2017

1 commit

Better handle split content streams (fixes #73) ...
b389268f
```
When parsing content streams, allow content to be split arbitrarily
across stream boundaries.
```
Jay Berkenbilt authored
2017-07-29 12:19:04 -0400
Browse File »

27 Jul, 2017

2 commits

Add precheck streams capability ...

7f889252

When requested, QPDFWriter will do more aggress prechecking of streams
to make sure it can actually succeed in decoding them before
attempting to do so. This will allow preservation of raw data even
when the raw data is corrupted relative to the specified filters.

authored

2017-07-27 23:42:27 -0400

Browse File »

Convert object parsing errors to warnings ...

40f00122

QPDFObjectHandle::parseInternal now issues warnings instead of
throwing exceptions for all error conditions that it finds (except
internal logic errors) and has stronger recovery for things like
invalid tokens and malformed dictionaries. This should improve qpdf's
ability to recover from a wide range of broken files that currently
cause it to fail.

authored

2017-07-27 18:20:31 -0400

Browse File »

26 Jul, 2017

1 commit

Don't interpret word tokens in content streams (fixes #82)
12db0989

Jay Berkenbilt authored
2017-07-26 06:24:07 -0400
Browse File »