"View as HTML" page should be generated from censored binary documents #934
Labels
easier-admin
Make issues easier to resolve
f:redaction
improvement
Improves existing functionality (UI tweaks, refactoring, performance, etc)
x:volunteer
Currently, PDF attachments are first converted to HTML, then the PDF and the HTML versions are censored separately. Sometimes PDFs need custom censor rules based on the internal representation of the PDF. So it's also necessary to add a "plain text" rule to catch the HTML version and it's easy to forget to do this.
There's also a small risk that future changes to the HTML extraction would defeat the censor rule - the closer to the original source the censoring is done, the better.
The text was updated successfully, but these errors were encountered: