PDF support #97

jonmmease · 2023-09-09T15:22:18Z

Closes #91

Overview

This PR adds dependency-free PDF export support to VlConvert. It's been a journey to get to this point, but I'm really happy with the end result.

How it works

This PR uses VlConvert's SVG export path and then converts the resulting SVG image to a PDF. The bulk of the work is done by the wonderful svg2pdf crate. svg2pdf relies on usvg to convert the original SVG image to a simplified collection of paths, and then converts these paths to PDF.

Text

It's possible to render text using svg2pdf by using usvg to convert text to paths before the SVG tree is passed to svg2pdf. But this approach is suboptimal as the resulting text cannot be selected or searched in a PDF viewer like Adobe Acrobat. I opened an svg2pdf issue in January to talk about embedding text. The typst team (who developed svg2pdf, and the pdf-writer crate it depends on) have been really helpful through this process.

It turned out to be possible to accomplish text embedding on top of svg2pdf without changes to the core library. This PR uses pdf-writer to construct a new PDF document and then uses svg2pdf to convert everything in an SVG file except text to a PDF XObject. Then it traverses the SVG tree again and overlays PDF text on top of the XObject.
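The two-pass idea can be sketched with a toy tree. This is an illustration only: the `Node` enum, `strip_text`, and `collect_text` are hypothetical stand-ins, not the usvg, svg2pdf, or pdf-writer APIs, and the real code emits PDF objects rather than plain values.

```rust
/// Toy stand-in for an SVG tree node (hypothetical; not the usvg types).
#[derive(Debug, Clone, PartialEq)]
enum Node {
    Group(Vec<Node>),
    Path { id: String },
    Text { content: String, x: f64, y: f64 },
}

/// Pass 1: copy the tree with every text node dropped. In the PR, a
/// text-free tree like this is what svg2pdf turns into the XObject.
fn strip_text(node: &Node) -> Option<Node> {
    match node {
        Node::Text { .. } => None,
        Node::Path { .. } => Some(node.clone()),
        Node::Group(children) => Some(Node::Group(
            children.iter().filter_map(strip_text).collect(),
        )),
    }
}

/// Pass 2: walk the original tree again and collect only the text nodes,
/// which would then be drawn as real PDF text on top of the XObject.
fn collect_text<'a>(node: &'a Node, out: &mut Vec<(&'a str, f64, f64)>) {
    match node {
        Node::Text { content, x, y } => out.push((content.as_str(), *x, *y)),
        Node::Path { .. } => {}
        Node::Group(children) => {
            for child in children {
                collect_text(child, out);
            }
        }
    }
}

fn main() {
    let tree = Node::Group(vec![
        Node::Path { id: "axis".into() },
        Node::Text { content: "Title".into(), x: 10.0, y: 20.0 },
        Node::Group(vec![Node::Text {
            content: "x label".into(),
            x: 50.0,
            y: 95.0,
        }]),
    ]);

    let shapes = strip_text(&tree).expect("the root is a group, so it is kept");
    let mut labels = Vec::new();
    collect_text(&tree, &mut labels);

    println!("shapes only: {:?}", shapes);
    println!("text overlay: {:?}", labels);
}
```

Keeping the two passes separate means the vector content never needs to know about fonts, and the text pass never needs to know how paths are drawn.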

The logic for using pdf-writer to embed fonts in the resulting PDF file was taken from the typst project repository. It would be nice to eventually find a way to avoid duplicating this logic, but the duplication is worth it for the time being.

Testing

This logic is tested from Python using pdfium2 to convert the PDF to a PNG image and comparing to our existing PNG baselines. The comparison tolerance needs to be a little larger due to the slight differences in text rendering between pdfium and resvg, but they still match really well!
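A tolerance-based comparison of this kind can be sketched as follows. This is a simplified stand-in, not the metric the test suite uses: `mean_abs_diff`, `images_match`, and the threshold values are hypothetical, and the buffers are assumed to be raw 8-bit channel data of equal size.

```rust
/// Mean absolute per-channel difference between two 8-bit image buffers,
/// normalized to 0.0..=1.0. Returns None when the buffers are empty or
/// differ in length (for example, when the image sizes do not match).
fn mean_abs_diff(a: &[u8], b: &[u8]) -> Option<f64> {
    if a.len() != b.len() || a.is_empty() {
        return None;
    }
    let total: u64 = a
        .iter()
        .zip(b)
        .map(|(&p, &q)| u64::from(p.abs_diff(q)))
        .sum();
    Some(total as f64 / (a.len() as f64 * 255.0))
}

/// True when the two buffers are comparable and differ by at most `tolerance`.
fn images_match(a: &[u8], b: &[u8], tolerance: f64) -> bool {
    matches!(mean_abs_diff(a, b), Some(d) if d <= tolerance)
}

fn main() {
    // Hypothetical baseline pixel and a slightly different rendering of it.
    let baseline = [0u8, 128, 255, 255];
    let rendered = [0u8, 130, 251, 255];

    println!("difference: {:?}", mean_abs_diff(&baseline, &rendered));
    println!("loose tolerance: {}", images_match(&baseline, &rendered, 0.01));
    println!("strict tolerance: {}", images_match(&baseline, &rendered, 0.001));
}
```

A looser threshold absorbs small anti-aliasing differences in glyph edges between two renderers while still failing on misplaced or missing marks.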

TODO

Update to svg2pdf 0.7 once it is released


jonmmease · 2023-09-09T15:26:15Z

vl-convert/tests/test_cli.rs

@@ -274,7 +274,7 @@ mod test_vl2png {
        let output_png = dssim::load_image(&Dssim::new(), &output).unwrap();

        let attr = Dssim::new();
-        let (diff, _) = attr.compare(&expected_png, &output_png);
+        let (diff, _) = attr.compare(&expected_png, output_png);


clippy --fix caught this
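Why both forms compile can be illustrated with a generic parameter bounded by `Borrow`. This is an assumption about the general shape of such a signature, not a copy of the dssim API; `Image` and `compare` below are hypothetical.

```rust
use std::borrow::Borrow;

/// Hypothetical image type standing in for the real one.
struct Image {
    pixels: Vec<u8>,
}

/// Hypothetical comparison whose second argument is generic over
/// `Borrow<Image>`, so callers may pass either an `Image` or a `&Image`.
/// Returns the number of positions where the two pixel buffers differ.
fn compare<M: Borrow<Image>>(expected: &Image, actual: M) -> usize {
    let actual: &Image = actual.borrow();
    expected
        .pixels
        .iter()
        .zip(&actual.pixels)
        .filter(|(a, b)| a != b)
        .count()
}

fn main() {
    let expected = Image { pixels: vec![1, 2, 3] };
    let output = Image { pixels: vec![1, 9, 3] };

    // Passing a reference works...
    let by_ref = compare(&expected, &output);
    // ...and so does passing the value itself when it is not needed afterwards.
    let by_value = compare(&expected, output);

    assert_eq!(by_ref, by_value);
    println!("differing pixels: {}", by_value);
}
```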

domoritz · 2023-09-17T08:38:26Z

Very cool.

jonmmease added 22 commits August 30, 2023 07:45

Add WIP vl-convert-pdf crate that builds on svg2pdf to add text overlays

45979b4

Add PDF functions to converter, add vl2pdf and vg2pdf CLI subcommands

4cc8586

Compute metrics on base PDF fonts, choose and scale

1d14236

Use proper encoding and convert unsupported unicode characters into ASCII approximations

7f5eb35

fmt

ab8853b

Python PDF conversion functions

357babf

Refactor to use PdfContext

f992f84

Don't excape slashes and parens (pdf-writer must do this itself).

a768e4b

Fix svg_id calculation to make sure svg2pdf has room to allocate unique IDs

782e31f

skip text without fill

1a11061

Update branch

360657a

WIP

d9e3048

cleanup

1ec5b78

Handle multiple spans within single text chunk

f6a9a06

Comments, add basic right-to-left support

22ef675

Add pdf scale to CLI

a346333

fix scale

4876229

Add Python PDF tests

9a9a475

Remove unused deps

dc1d671

clippy fix

479b54e

clippy fixex

5dad2ad

Add VlConvert as PDF creator

69cc8dc

jonmmease commented Sep 9, 2023


jonmmease added 4 commits September 9, 2023 11:29

fmt with rust 1.72

7df7913

Update thirdparty_rust.yaml

c5df7ca

Merge remote-tracking branch 'origin/main' into jonmmease/pdf-embed-2

ed3023b

Add show-warnings for vl2pdf

6f2dedf

jonmmease marked this pull request as draft September 9, 2023 15:38

jonmmease added 2 commits September 9, 2023 11:47

Update README.md

0b34d76

skip PDF tests on windows due to pdfium2 issue on CI

cfd8e44

jonmmease mentioned this pull request Sep 9, 2023

PDF support #91

Closed

jonmmease added 8 commits September 13, 2023 07:04

Merge remote-tracking branch 'origin/main' into jonmmease/pdf-embed-2

30323b2

update lockfile

045d432

update to svg2pdf 0.7.0

0ce5788

Fix chunk coordinate conversion

819c4a8

Add README and example for vl-convert-pdf

06bc65e

Update licenses

fffa14f

fmt

3a5e33a

Cargo.lock

dac310f

jonmmease marked this pull request as ready for review September 13, 2023 12:33

jonmmease mentioned this pull request Sep 13, 2023

Embed text rather than converting text to paths typst/svg2pdf#21

Closed

jonmmease added 2 commits September 13, 2023 09:15

misc review updates

1b26834

clippy fix

8a0f2fa

jonmmease merged commit 416b592 into main Sep 13, 2023

jonmmease deleted the jonmmease/pdf-embed-2 branch September 13, 2023 15:04

jonmmease mentioned this pull request Sep 18, 2023

Saving from the interface is different from Chart.save vega/altair#3192

Closed
