OpenSystemsDevelopment / qpdf

21 May, 2022

1 commit

Add json to large file test
905f47a5

Jay Berkenbilt authored
2022-05-21 09:43:45 -0400
Browse Dir »

20 May, 2022

13 commits

Exercise object description in tests
9b2eb01e

Jay Berkenbilt authored
2022-05-20 14:23:32 -0400
Browse Dir »
Add test for bad data and bad datafile
6c2fb5b8

Jay Berkenbilt authored
2022-05-20 13:33:30 -0400
Browse Dir »
Test --update-from-json
d0650980

Jay Berkenbilt authored
2022-05-20 11:10:12 -0400
Browse Dir »
Test (and fix) handling of dangling references
6d4e3ba8

Jay Berkenbilt authored
2022-05-20 09:16:25 -0400
Browse Dir »
Explicitly test ignoring unknown keys in JSON input
35b1e1c4

Jay Berkenbilt authored
2022-05-20 09:16:25 -0400
Browse Dir »
Make version default to latest for --json-output (like --json)
dc8df962

Jay Berkenbilt authored
2022-05-20 09:16:25 -0400
Browse Dir »
Round-trip tests with --json-stream-data=file
907df2c8

Jay Berkenbilt authored
2022-05-20 09:16:25 -0400
Browse Dir »
Tests with manually constructed qpdf json
a83b7b06

Jay Berkenbilt authored
2022-05-20 09:16:25 -0400
Browse Dir »
Add tests for --json-input
7f8c4b18

Jay Berkenbilt authored
2022-05-20 09:16:25 -0400
Browse Dir »

Add more names and strings in good13 ...

* native UTF-8 strings
* names whose PDF and canonical syntax differ in both dictionary key
  positions and other positions

For json, names are converted both as names and directly when used as
dictionary keys.

authored

2022-05-20 09:16:25 -0400

Browse Dir »

Rename all test files: _ to -
6c5e5906

Jay Berkenbilt authored
2022-05-20 09:16:25 -0400
Browse Dir »

Major rework -- see long comments ...

6f43bf8d

* Replace --create-from-json=file with --json-input, which causes the
  regular input to be treated as json.
* Eliminate --to-json
* In --json=2, bring back "objects" and eliminate "objectinfo". Stream
  data is never present.
* In --json-output=2, write "qpdf-v2" with "objects" and include
  stream data.

authored

2022-05-20 09:16:25 -0400

Browse Dir »

Parse objects; stream data is not yet handled
7e7a9c43

Jay Berkenbilt authored
2022-05-20 09:16:25 -0400
Browse Dir »

16 May, 2022

2 commits

Implement top-level qpdf json parsing
7fa5d177

Jay Berkenbilt authored
2022-05-16 13:41:40 -0400
Browse Dir »
Remove offset from missing /Root error ...
9a0e9a1a
```
The last offset is irrelevant to not being able to find /Root.
```
Jay Berkenbilt authored
2022-05-16 13:39:26 -0400
Browse Dir »

14 May, 2022

1 commit

Split qpdf.test into multiple test suites ...
173b944e
```
This makes it a lot easier to run parts of the test suite.
```
Jay Berkenbilt authored
2022-05-14 17:35:06 -0400
Browse Dir »

08 May, 2022

10 commits

Add maxobjectid to JSON
2a2f7f1b

Jay Berkenbilt authored
2022-05-08 13:45:20 -0400
Browse Dir »
Add --to-json option
e9390aea

Jay Berkenbilt authored
2022-05-08 13:45:20 -0400
Browse Dir »
Test inline stream data with different decode levels
2e87d593

Jay Berkenbilt authored
2022-05-08 13:45:20 -0400
Browse Dir »
Test json v2 with invalid stream data
f08f3989

Jay Berkenbilt authored
2022-05-08 13:45:20 -0400
Browse Dir »
Implement JSON v2 output
c76536dd

Jay Berkenbilt authored
2022-05-08 13:45:20 -0400
Browse Dir »

Apply script across future v2 test files ...

bdfc4da5

There is one unexpected pass in this commit. This script was applied
to the files changed in this commit:

----------
#!/usr/bin/env python3
import json
import sys

def json_dumps(data):
    return json.dumps(data, ensure_ascii=False,
                      indent=2, separators=(',', ': '))

for filename in sys.argv[1:]:
    with open(filename, 'r') as f:
        data = json.loads(f.read())
    data['version'] = 2
    objectinfo = {}
    if 'objectinfo' in data:
        objectinfo = data['objectinfo']
        del data['objectinfo']
    if 'objects' not in data:
        continue
    qpdf = {'jsonversion': 2, 'pdfversion': '1.3', 'objects': {}}
    for k, v in data['objects'].items():
        is_stream = objectinfo.get(k, {}).get('stream', {}).get('is', False)
        if k.endswith(' R'):
            k = 'obj:' + k
        if is_stream:
            v = {'stream': {'dict': v}}
        else:
            v = {'value': v}
        qpdf['objects'][k] = v
    data['qpdf'] = qpdf
    del data['objects']
print(json_dumps(data))
----------

authored

2022-05-08 13:45:20 -0400

Browse Dir »

Prepare test suite for json v2
8d348974

Jay Berkenbilt authored
2022-05-08 13:45:20 -0400
Browse Dir »

Fix typo in json output key name ...

15272662

moddify -> modify. Also carefully spell checked all remaining keys by
splitting them into words and running a spell checker, not just
relying on visual proofreading. That was the only one.

authored

2022-05-08 13:45:20 -0400

Browse Dir »

Implement JSON v2 for Stream ...
1bc8abfd
```
Not fully exercised in this commit
```
Jay Berkenbilt authored
2022-05-08 13:45:20 -0400
Browse Dir »
Implement JSON v2 for String ...
3246923c
```
Also refine the herustic for deciding whether to use hexadecimal
notation for a string.
```
Jay Berkenbilt authored
2022-05-08 13:45:20 -0400
Browse Dir »

07 May, 2022

5 commits

Prepare code for JSON v2 ...
16f4f94c
```
Update getJSON() methods and calls to them
```
Jay Berkenbilt authored
2022-05-07 11:12:01 -0400
Browse Dir »

Objectinfo json: write incrementally and in numeric order ...

a9fbbd5d

This script was used on test data:

----------
#!/usr/bin/env python3
import json
import sys
import re

def json_dumps(data):
    return json.dumps(data, ensure_ascii=False,
                      indent=2, separators=(',', ': '))

for filename in sys.argv[1:]:
    with open(filename, 'r') as f:
        data = json.loads(f.read())
    if 'objectinfo' not in data:
        continue
    trailer = None
    to_sort = []
    for k, v in data['objectinfo'].items():
        if k == 'trailer':
            trailer = v
        else:
            m = re.match(r'^(\d+) \d+ R', k)
            if m:
                to_sort.append([int(m.group(1)), k, v])
    newobjectinfo = {x[1]: x[2] for x in sorted(to_sort)}
    if trailer is not None:
        newobjectinfo['trailer'] = trailer
    data['objectinfo'] = newobjectinfo
print(json_dumps(data))
----------

authored

2022-05-07 08:26:31 -0400

Browse Dir »

Objects json: write incrementally and in numeric order ...

948de609

The following script was used to adjust test data:

----------
#!/usr/bin/env python3
import json
import sys
import re

def json_dumps(data):
    return json.dumps(data, ensure_ascii=False,
                      indent=2, separators=(',', ': '))

for filename in sys.argv[1:]:
    with open(filename, 'r') as f:
        data = json.loads(f.read())
    if 'objects' not in data:
        continue
    trailer = None
    to_sort = []
    for k, v in data['objects'].items():
        if k == 'trailer':
            trailer = v
        else:
            m = re.match(r'^(\d+) \d+ R', k)
            if m:
                to_sort.append([int(m.group(1)), k, v])
    newobjects = {x[1]: x[2] for x in sorted(to_sort)}
    if trailer is not None:
        newobjects['trailer'] = trailer
    data['objects'] = newobjects
print(json_dumps(data))
----------

authored

2022-05-07 08:26:31 -0400

Browse Dir »

Top-level json: write incrementally ...

dc9b7287

This commit just changes the order in which fields are written to the
json without changing their content. All the json files in the test
suite were modified with this script to ensure that we didn't get any
changes other than ordering.

----------
#!/usr/bin/env python3
import json
import sys

def json_dumps(data):
    return json.dumps(data, ensure_ascii=False,
                      indent=2, separators=(',', ': '))

for filename in sys.argv[1:]:
    with open(filename, 'r') as f:
        data = json.loads(f.read())
    newdata = {}
    for i in ('version', 'parameters', 'pages', 'pagelabels',
              'acroform', 'attachments', 'encrypt', 'outlines',
              'objects', 'objectinfo'):
        if i in data:
            newdata[i] = data[i]
print(json_dumps(newdata))
----------

authored

2022-05-07 08:26:31 -0400

Browse Dir »

Test json against schema only on demand ...
7f65a5c2
```
Testing json against schema requires an in-memory copy, so do it only
when requested by the test suite.
```
Jay Berkenbilt authored
2022-05-07 08:26:31 -0400
Browse Dir »

04 May, 2022

1 commit

Make "objects" and "pages" consistent in JSON output
8b25de24

Jay Berkenbilt authored
2022-05-04 08:32:44 -0400
Browse Dir »

30 Apr, 2022

1 commit

Using insecure crytpo from the CLI is now an error by default
cff26040

Jay Berkenbilt authored
2022-04-30 17:23:58 -0400
Browse Dir »

29 Apr, 2022

1 commit

Add new QPDFObjectHandle methods for more fluent programming
e80fad86

Jay Berkenbilt authored
2022-04-29 20:09:10 -0400
Browse Dir »

24 Apr, 2022

3 commits

QPDFJob json: make removeAttachment take an array (fixes #693)
d0b7cc8a

Jay Berkenbilt authored
2022-04-24 13:06:19 -0400
Browse Dir »
Fix some bugs around null values in dictionaries ...
08ba21cf
```
Make it so that a key with a null value is always treated as not being
present. This was inconsistent before.
```
Jay Berkenbilt authored
2022-04-24 10:08:32 -0400
Browse Dir »
Deprecate replaceOrRemoveKey -- it's the same as replaceKey
4be2f360

Jay Berkenbilt authored
2022-04-24 09:31:32 -0400
Browse Dir »

23 Apr, 2022

2 commits

Add new QPDF::warn that takes most of QPDFExc's arguments
68e72198

Jay Berkenbilt authored
2022-04-23 18:25:43 -0400
Browse Dir »
Expose QUtil::get_next_utf8_codepoint
22b35c49

Jay Berkenbilt authored
2022-04-23 18:25:43 -0400
Browse Dir »