Take potentially dangerous PDFs, office documents, or images and convert them to safe PDFs
Find a file
deeplow d28aa5a25b
Remove PDFtk dependency (replace w/ pdftoppm)
PDFtk actually isn't needed. It was being used for breaking a PDF
into pages but this is something that be replaced by the already present
'pdftoppm'. Furthermore, by removing this dependency we contribute to
reproducible builds and overall supply chain security because it was
obtained from gitlab with no signature verification or version pinning.

The replacement 'pdftoppm' enabled us to do a shortcut:
 - before: PDF -> PDF pages -> PNG images -> RGB images
 - after:  PDF -> PPM images -> RGB images

And this last conversion step is trivial since the RGB format we were
using is just a PPM file without the metadata in its header.
2023-01-23 14:00:57 +00:00
.circleci Unpin PIP in CI; replace w/ --no-ansi fix same bug 2023-01-20 09:52:39 +00:00
assets Update README screenshots for 0.4.0 release 2022-12-02 11:26:21 +00:00
container Remove PDFtk dependency (replace w/ pdftoppm) 2023-01-23 14:00:57 +00:00
dangerzone CLI: prefix non-INFO logs with log type 2023-01-16 14:58:13 +00:00
dev_scripts Fix qa.py following BUILD.md update in 3b2544a 2023-01-20 09:58:37 +00:00
install install: Fail early when image build fails 2023-01-16 18:48:09 +02:00
share GUI: Add version to header bar 2023-01-16 14:39:27 +00:00
tests Add unit test for --version 2023-01-16 14:39:25 +00:00
.gitignore migrate to pytest & test_docs -> tests/test_docs 2022-09-13 13:07:58 +01:00
BUILD.md Add comment about poetry install keyring prompt 2023-01-18 14:17:59 +00:00
CHANGELOG.md Changelog: add exit confirmation feature 2022-12-01 15:24:19 +00:00
INSTALL.md ci: Remove Fedora 35 support 2023-01-16 18:48:09 +02:00
LICENSE Remove useless files from dangerzone-converter, and move files out of separate scripts directory 2021-11-24 12:47:39 -08:00
Makefile Fix exclusion of dev_scripts/envs from isort 2023-01-19 17:27:11 +02:00
poetry.lock Narrow down installed system packages 2023-01-16 18:48:09 +02:00
pyproject.toml Narrow down installed system packages 2023-01-16 18:48:09 +02:00
README.md README: make screenshots smaller and side-by-side 2022-12-07 10:51:04 +00:00
RELEASE.md Merge pull request #280 from freedomofpress/prepare-0.4.0 2022-12-01 16:50:56 -08:00
setup-windows.py Windows: fix "Open with" dialog showing dz description 2023-01-16 11:38:08 +00:00
setup.py Replace references to github.com/firstlookmedia 2022-12-01 22:31:42 +02:00
stdeb.cfg ci: Fix failing build-debian-bookworm step 2022-12-15 18:30:19 +02:00

Dangerzone

Take potentially dangerous PDFs, office documents, or images and convert them to a safe PDF.

Settings Converting

Dangerzone works like this: You give it a document that you don't know if you can trust (for example, an email attachment). Inside of a sandbox, Dangerzone converts the document to a PDF (if it isn't already one), and then converts the PDF into raw pixel data: a huge list of RGB color values for each page. Then, in a separate sandbox, Dangerzone takes this pixel data and converts it back into a PDF.

Read more about Dangerzone in the blog post Dangerzone: Working With Suspicious Documents Without Getting Hacked.

Getting started

You can also install Dangerzone for Mac using Homebrew: brew install --cask dangerzone

Some features

  • Sandboxes don't have network access, so if a malicious document can compromise one, it can't phone home
  • Dangerzone can optionally OCR the safe PDFs it creates, so it will have a text layer again
  • Dangerzone compresses the safe PDF to reduce file size
  • After converting, Dangerzone lets you open the safe PDF in the PDF viewer of your choice, which allows you to open PDFs and office docs in Dangerzone by default so you never accidentally open a dangerous document

Dangerzone can convert these types of document into safe PDFs:

  • PDF (.pdf)
  • Microsoft Word (.docx, .doc)
  • Microsoft Excel (.xlsx, .xls)
  • Microsoft PowerPoint (.pptx, .ppt)
  • ODF Text (.odt)
  • ODF Spreadsheet (.ods)
  • ODF Presentation (.odp)
  • ODF Graphics (.odg)
  • Jpeg (.jpg, .jpeg)
  • GIF (.gif)
  • PNG (.png)

Dangerzone was inspired by Qubes trusted PDF, but it works in non-Qubes operating systems. It uses containers as sandboxes instead of virtual machines (using Docker for macOS, Windows, and Debian/Ubuntu, and podman for Fedora).

Set up a development environment by following these instructions.