OpenSystemsDevelopment / qpdf

31 Jan, 2019

3 commits

Make inline image token exactly contain the image data ...
eb49e07c
```
Do not include the trailing EI, and handle cases where EI is not
preceded by a delimiter. Such cases have been seen in the wild.
```
Jay Berkenbilt authored
2019-01-31 20:28:44 -0500
Browse File »

Refactor QPDFTokenizer's inline image handling ...

Add a version of expectInlineImage that takes an input source and
searches for EI. This is in preparation for improving the way EI is
found. This commit just refactors the code without changing the
functionality and adds tests to make sure the old and new code behave
identically.

authored

2019-01-31 09:26:37 -0500

Browse File »

Inline image token value ends with EI, not delimiter ...
31372edc
```
The inline image token erroneously included the delimiter that
followed EI. The ObjectHandle created from it was correct.
```
Jay Berkenbilt authored
2019-01-31 09:26:37 -0500
Browse File »

27 Jan, 2019

1 commit

Add QPDFObjectHandle::getUniqueResourceName
8cb24573

Jay Berkenbilt authored
2019-01-27 07:50:30 -0500
Browse File »

25 Jan, 2019

1 commit

Handle inheritable page attributes ...
009767d9
```
Add getAttribute for handling inheritable page attributes, and fix
getPageImages and annotation flattening code to use it.
```
Jay Berkenbilt authored
2019-01-25 22:30:05 -0500
Browse File »

02 Jan, 2019

1 commit

Switch annotation flattening to use the form xobjects ...

f78ea057

Instead of directly putting the contents of the annotation appearance
streams into the page's content stream, add commands to render the
form xobjects directly. This is a more robust way to do it than the
original solution as it works properly with patterns and avoids
problems with resource name clashes between the pages and the form
xobjects.

authored

2019-01-02 21:49:47 -0500

Browse File »

01 Jan, 2019

1 commit

Add QPDFObjectHandle::mergeDictionary()
95d6b17a

Jay Berkenbilt authored
2019-01-01 08:12:56 -0500
Browse File »

31 Dec, 2018

1 commit

Add Matrix class under QPDFObjectHandle
5059ec0d

Jay Berkenbilt authored
2018-12-31 23:02:43 -0500
Browse File »

21 Dec, 2018

1 commit

Add QPDFObjectHandle::getJSON()
30a0c070

Jay Berkenbilt authored
2018-12-21 18:34:56 -0500
Browse File »

18 Dec, 2018

1 commit

Add QPDFObjectHandle::wrapInArray() ...
077d3d45
```
Wrap an object in an array if it is not already an array.
```
Jay Berkenbilt authored
2018-12-18 16:45:48 -0500
Browse File »

22 Jun, 2018

1 commit

Treat content stream parsing errors as an error, not a warning ...

38c9ed23

If parsing content streams is treated as a warning, there is no way
for a caller to know if a parsing operation has failed. This is very
dangerous and will likely result in data loss when token filters are
parser callbacks are in use.

authored

2018-06-22 10:44:08 -0400

Browse File »

21 Jun, 2018

3 commits

Fix QPDFObjectHandle::shallowCopy ...

ddd78c1b

It's not really a shallow copy. It just doesn't cross indirect object
boundaries. The old implementation had a bug that would cause multiple
shallow copies of the same object to share memory, which was not the
intention.

authored

2018-06-21 20:34:45 -0400

Browse File »

Better support for creating Unicode strings
952a665a

Jay Berkenbilt authored
2018-06-21 15:57:13 -0400
Browse File »
Add QPDFObjectHandle::Rectangle type ...
4cded108
```
Provide a convenient way of accessing rectangles.
```
Jay Berkenbilt authored
2018-06-21 15:57:13 -0400
Browse File »

15 Apr, 2018

1 commit

Limit depth of nesting in direct objects (fixes #202) ...
b4d6cf68
```
This fixes CVE-2018-9918.
```
Jay Berkenbilt authored
2018-04-15 16:11:22 -0400
Browse File »

06 Mar, 2018

1 commit

Properly handle pages with no contents (fixes #194) ...

e4e2e26d

Remove calls to assertPageObject(). All cases in the library that
called assertPageObject() work fine if you don't call
assertPageObject() because nothing assumes anything that was being
checked by that call. Removing the calls enables more files to be
successfully processed.

authored

2018-03-06 11:34:07 -0500

Browse File »

18 Feb, 2018

9 commits

More robust handling of type errors ...

d0e99f19

Give objects descriptions and context so it is possible to issue
warnings instead of fatal errors for attempts to access objects of the
wrong type.

authored

2018-02-18 21:06:27 -0500

Browse File »

Push members of QPDFObjectHandle into a Members object ...
21b7481b
```
As in other cases, this is to enable adding new member variables in
the future without breaking ABI compatibility.
```
Jay Berkenbilt authored
2018-02-18 21:06:27 -0500
Browse File »
Simplify TokenFilter interface ...
e410b0fe
```
Expose Pl_QPDFTokenizer, and have it do more of the work of managing
the token filter's pipeline.
```
Jay Berkenbilt authored
2018-02-18 21:05:47 -0500
Browse File »
Add additional interface for filtering page contents
5708b5d0

Jay Berkenbilt authored
2018-02-18 21:05:47 -0500
Browse File »

Implement TokenFilter and refactor Pl_QPDFTokenizer ...

99101044

Implement a TokenFilter class and refactor Pl_QPDFTokenizer to use a
TokenFilter class called ContentNormalizer. Pl_QPDFTokenizer is now a
general filter that passes data through a TokenFilter.

authored

2018-02-18 21:05:46 -0500

Browse File »

Add coalesce contents capability
b8723e97

Jay Berkenbilt authored
2018-02-18 21:05:46 -0500
Browse File »
Refactor parseContentStream
fcd611b6

Jay Berkenbilt authored
2018-02-18 21:05:46 -0500
Browse File »

Remove redundant method ...

05ff619b

Remove a redundant method that was equal to another one with
additional arguments. This breaks binary compatibility, but there are
other ABI breaking changes in the upcoming release, so now is the time
to do it.

authored

2018-02-18 21:05:46 -0500

Browse File »

Use inline image token in content parser
55ee5539

Jay Berkenbilt authored
2018-02-18 21:05:46 -0500
Browse File »

12 Sep, 2017

1 commit

Improve message for stream decoding error ...
d31a7b76
```
Tweak the message so that we inform the user that we are mitigating
data loss.
```
Jay Berkenbilt authored
2017-09-12 16:03:48 -0400
Browse File »

26 Aug, 2017

1 commit

Fix error caught by clang
728dc9e6

Jay Berkenbilt authored
2017-08-26 21:51:17 -0400
Browse File »

25 Aug, 2017

1 commit

Parse iteratively to avoid stack overflow (fixes #146)
ad527a64

Jay Berkenbilt authored
2017-08-25 21:56:45 -0400
Browse File »

22 Aug, 2017

1 commit

Spell check
e452d9dc

Jay Berkenbilt authored
2017-08-22 14:22:20 -0400
Browse File »

21 Aug, 2017

1 commit

Enable finer grained control of stream decoding ...

9744414c

This commit adds several API methods that enable control over which
types of filters QPDF will attempt to decode. It also adds support for
/RunLengthDecode and /DCTDecode filters for both encoding and
decoding.

authored

2017-08-21 17:44:22 -0400

Browse File »

12 Aug, 2017

1 commit

Add page rotation (fixes #132)
cfa2eb97

Jay Berkenbilt authored
2017-08-12 22:57:38 -0400
Browse File »

29 Jul, 2017

1 commit

Better handle split content streams (fixes #73) ...
b389268f
```
When parsing content streams, allow content to be split arbitrarily
across stream boundaries.
```
Jay Berkenbilt authored
2017-07-29 12:19:04 -0400
Browse File »

27 Jul, 2017

2 commits

Add precheck streams capability ...

7f889252

When requested, QPDFWriter will do more aggress prechecking of streams
to make sure it can actually succeed in decoding them before
attempting to do so. This will allow preservation of raw data even
when the raw data is corrupted relative to the specified filters.

authored

2017-07-27 23:42:27 -0400

Browse File »

Convert object parsing errors to warnings ...

40f00122

QPDFObjectHandle::parseInternal now issues warnings instead of
throwing exceptions for all error conditions that it finds (except
internal logic errors) and has stronger recovery for things like
invalid tokens and malformed dictionaries. This should improve qpdf's
ability to recover from a wide range of broken files that currently
cause it to fail.

authored

2017-07-27 18:20:31 -0400

Browse File »

26 Jul, 2017

3 commits

Don't interpret word tokens in content streams (fixes #82)
12db0989

Jay Berkenbilt authored
2017-07-26 06:24:07 -0400
Browse File »

Handle object ID 0 (fixes #99) ...

afe0242b

This is CVE-2017-9208.

The QPDF library uses object ID 0 internally as a sentinel to
represent a direct object, but prior to this fix, was not blocking
handling of 0 0 obj or 0 0 R as a special case. Creating an object in
the file with 0 0 obj could cause various infinite loops. The PDF spec
doesn't allow for object 0. Having qpdf handle object 0 might be a
better fix, but changing all the places in the code that assumes objid
== 0 means direct would be risky.

authored

2017-07-26 06:24:07 -0400

Browse File »

Fix infinite loop while reporting an error (fixes #101) ...

603f2223

This is CVE-2017-9210.

The description string for an error message included unparsing an
object, which is too complex of a thing to try to do while throwing an
exception. There was only one example of this in the entire codebase,
so it is not a pervasive problem. Fixing this eliminated one class of
infinite loop errors.

authored

2017-07-26 06:24:07 -0400

Browse File »

21 Feb, 2015

1 commit

Avoid resolving arguments to R ...

c729e07d

When checking two objects preceding R while parsing, ensure that the
objects are direct. This avoids stuff like 1 0 obj containing 1 0 R 0 R
from causing an infinite loop in object resolution.

authored

2015-02-21 17:51:08 -0500

Browse File »

14 Nov, 2014

1 commit

Handle pages with no /Contents from getPageContents() ...
caab1b0e
```
The spec allows /Contents to be omitted for pages that are blank, but
QPDFObjectHandle::getPageContents() was throwing an exception in this
case.
```
Jay Berkenbilt authored
2014-11-14 13:43:34 -0500
Browse File »

18 Oct, 2013

1 commit

Security: replace operator[] with at ...
ac9c1f0d
```
For std::string and std::vector, replace operator[] with at.  This was
done using an automated process.  See README.hardening for details.
```
Jay Berkenbilt authored
2013-10-18 10:45:14 -0400
Browse File »