Skip to content

Initial PDFBox HTML support #748

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 14 commits into from
Jul 24, 2023
Merged

Initial PDFBox HTML support #748

merged 14 commits into from
Jul 24, 2023

Conversation

tomas-sexenian
Copy link
Contributor

Issue: 103392

@genexusbot
Copy link
Collaborator

Manual cherry pick to beta success

@@ -122,6 +122,11 @@
<version>2.0.27</version>
</dependency>
<dependency>
<groupId>org.jsoup</groupId>
<artifactId>jsoup</artifactId>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@tomas-sexenian as we added a new required dependency, do you know how mucho size this JAR takes?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Jsoup 1.16 takes 433 KB of space which I agree is not optimal. Advantages are that it is open source, free for comercial use, has no known vulnerabilities and its complete enough to handle more complex HTML structures if, in the future, we wanted to expand HTML rendering for PDFBox generated reports.

I can try rewriting what I have now but using another HTML parsing library if it is necessary

@ggallotti
Copy link
Member

In the future, I think we should consider to refactor all PDF implementation into a new Module.

@tomas-sexenian
Copy link
Contributor Author

I agree that we should do something with all the files involved in PDF generation. Modules are a great option but we could also evaluate if, for example, the report being generated is rendering HTML content and if its not, then Jsoup its not necessary. Same goes to barcodes and QR codes rendering

ggallotti
ggallotti previously approved these changes Jul 19, 2023
Copy link
Member

@ggallotti ggallotti left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This report functionality should be moved to an external module in the Future. And only be included if the Developer is using PDF functionality.

@genexusbot
Copy link
Collaborator

Cherry pick to beta success

@tomas-sexenian tomas-sexenian merged commit 9cf9bf8 into master Jul 24, 2023
@tomas-sexenian tomas-sexenian deleted the PDFBoxHTMLSupport branch July 24, 2023 14:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants