dangerzone

mirror of https://github.com/freedomofpress/dangerzone.git synced 2025-04-28 18:02:38 +02:00

Author	SHA1	Message	Date
Alex Pyrgiotis	d6410652cb	Kill the process group when conversion terminates Instead of killing just the invoked Podman/Docker/qrexec process, kill the whole process group, to make sure that other components that have been spawned die as well. In the case of Podman, conmon is one of the processes that lingers, so that's one way to kill it.	2024-10-07 17:37:39 +03:00
Alex Pyrgiotis	b9a3dd63ad	Always start conversion process in new session Start the conversion process in a new session, so that we can later on kill the process group, without killing the controlling script (i.e., the Dangezone UI). This should not affect the conversion process in any other way.	2024-10-07 17:27:38 +03:00
Alexis Métaireau	3e434d08d1	Always use our own seccomp policy as a default. As per Etienne Perot's comment on #908: > Then it seems to me like it would be easy to simply apply this seccomp profile under all container runtimes (since there's no reason why the same image and the same command-line would call different syscalls under different container runtimes).	2024-10-02 14:12:48 +02:00
amnak613	9b9e265b11	Added try excepts for unhandled exceptions Fixes #776	2024-09-17 16:26:46 +03:00
Alexis Métaireau	0c9f426b68	Do not throw on malformed Desktop Entries on Linux. This just skips the malformed entry when it's found. Fixes #899	2024-09-10 15:25:45 +02:00
Alex Pyrgiotis	cfb5e75be9	tests: Do not let LibreOffice hang on the large test set Some of the files in our large test set can make LibreOffice hang. We do not have a proper solution for this yet, but we can at least make the tests timeout quickly, so that they can finish at some point. Refs #878	2024-08-09 14:32:19 +03:00
Alex Pyrgiotis	756945931f	container: Handle case where `docker kill` hangs We have encountered several conversions where the `docker kill` command hangs. Handle this case by specifying a timeout to this command. If the timeout expires, log a warning and proceed with the rest of the termination logic (i.e., kill the conversion process). Fixes #854	2024-07-01 17:56:21 +03:00
Alex Pyrgiotis	4ea0650f42	tests: Skip a test for missing OCR files on Qubes We have a container-specific test that deals with missing OCR files in the container image. This test _can_ be run under Qubes, and it may fail since it requires Podman. Make the pytest guard more strict and don't allow running this test on Qubes. Also, fix a typo in the word "omission".	2024-06-27 22:11:50 +03:00
Alex Pyrgiotis	c89ef580e0	tests: Properly skip tests for isolation providers The platform where we run our tests directly affects the isolation providers we can choose. For instance, we cannot run Qubes tests on a Windows/macOS platform, nor can we spawn containers in a Qubes platform, if the `QUBES_CONVERSION` envvar has been specified. This platform incompatibility was never an issue before, because Dangerzone is capable of selecting the proper isolation provider under the hood. However, with the addition of tests that target specific isolation providers, it's possible that we may run by mistake a test that does not apply to our platform. To counter this, we employed `pytest.skipif()` guards around classes, but we may omit those by mistake. Case in point, the `TestContainer` class does not have such a guard, which means that we attempt to run this test case on Qubes and it fails. Add module-level guards in our isolation provider tests using pytest's `pytest.skip("...", allow_module_level=True)` function, so that we make such restrictions more explicit, and less easy to forget when we add a new class.	2024-06-27 22:11:37 +03:00
deeplow	d0e1df5546	Add drag and drop support for document selection	2024-06-27 11:51:41 +02:00
Ro	54ab9ce98f	Order list of PDF viewers and return default application first (Linux).	2024-06-12 22:41:04 +02:00
Etienne Perot	f03bc71855	Sandbox all Dangerzone document processing within gVisor. This wraps the existing container image inside a gVisor-based sandbox. gVisor is an open-source OCI-compliant container runtime. It is a userspace reimplementation of the Linux kernel in a memory-safe language. It works by creating a sandboxed environment in which regular Linux applications run, but their system calls are intercepted by gVisor. gVisor then redirects these system calls and reinterprets them in its own kernel. This means the host Linux kernel is isolated from the sandboxed application, thereby providing protection against Linux container escape attacks. It also uses `seccomp-bpf` to provide a secondary layer of defense against container escapes. Even if its userspace kernel gets compromised, attackers would have to additionally have a Linux container escape vector, and that exploit would have to fit within the restricted `seccomp-bpf` rules that gVisor adds on itself. Fixes #126 Fixes #224 Fixes #225 Fixes #228	2024-06-12 13:40:04 +03:00
Alexis Métaireau	81ad3a65c2	tests: use `qt_updater` fixture rather than `updater` I'm actually ensure how the previous version was working, but since we are now loading the pytest fixtures automatically, it uncovered a misuse in the tests. The `updater` fixture sets `updater.dangerzone.app` to a magic mock instance, whereas `qt_updater` returns the real QT app, which is what we want in our tests.	2024-06-05 17:13:31 +02:00
Alexis Métaireau	9bad001c04	chore: remove fixture imports in the tests They ideally should find their way by themselves. > You don’t need to import the fixture you want to use in a test, > it automatically gets discovered by pytest. The discovery of fixture > functions starts at test classes, then test modules, then conftest.py > files and finally builtin and third party plugins.> > > — [pytest docs](https://docs.pytest.org/en/4.6.x/fixture.html#conftest-py-sharing-fixture-functions)	2024-06-05 15:56:09 +02:00
Alexis Métaireau	697b1e0d03	chore: mark some lines as unreachable for mypy	2024-06-05 14:19:31 +02:00
Alexis Métaireau	65a8827daa	chore: minor linting A few minor changes about when to use `==` and when to use `is`. Basically, this uses `is` for booleans, and `==` for other values. With a few other changes about coding style which was enforced by `ruff`.	2024-06-05 14:19:31 +02:00
Alexis Métaireau	cbbd6afcc1	chore: remove unused code This commit removes code that's not being used, it can be exceptions with the `as e` where the exception itself is not used, the same with `with` statements, and some other parts where there were duplicated code.	2024-06-05 14:19:31 +02:00
Alexis Métaireau	99f1e15fd2	chore: Do not use fstrings without placeholders > f-strings are a convenient way to format strings, but they are not > necessary if there are no placeholder expressions to format. In this > case, a regular string should be used instead, as an f-string without > placeholders can be confusing for readers, who may expect such a > placeholder to be present. > > — [ruff docs](https://docs.astral.sh/ruff/rules/f-string-missing-placeholders/)	2024-06-05 14:19:31 +02:00
Alexis Métaireau	5aa4863b52	chore(imports): remove useless imports As detected by [ruff](https://github.com/astral-sh/ruff) Related to #254, although it doesn't provide the command to lint the codebase itself.	2024-06-05 14:19:30 +02:00
Alex Pyrgiotis	0b45360384	Keep newlines when reading debug logs In `d632908a44` we improved our `replace_control_chars()` function, by replacing every control or invalid Unicode character with a placeholder one. This change, however, made our debug logs harder to read, since newlines were not preserved. There are indeed various cases in which replacing newlines is wise (e.g., in filenames), so we should keep this behavior by default. However, specifically for reading debug logs, we add an option to keep newlines to improve readability, at no expense to security.	2024-05-09 15:57:42 +03:00
Alex Pyrgiotis	e11aaec3ac	Always use sys.exit when exiting the application The `exit()` [1] function is not necessarily present in every Python environment, as it's added by the `site` module. Also, this function is "[...] useful for the interactive interpreter shell and should not be used in programs" For this reason, we replace all such occurrences with `sys.exit()` [2], which is the canonical function to exit Python programs. [1]: https://docs.python.org/3/library/constants.html#exit [2]: https://docs.python.org/3/library/sys.html#sys.exit	2024-05-09 15:57:42 +03:00
Alex Pyrgiotis	1c70ee6771	Fix archiving the same doc twice on Windows On Windows, if we somehow attempt to archive the same document twice (e.g, because it got archived once, and then we copy it back), we will get an error, because Windows does not overwrite the target path, if it already exists. Fix this issue by always removing the previously archived version, when performing the next archival action, and update our tests.	2024-05-09 15:57:42 +03:00
Alex Pyrgiotis	63b12abbdf	tests: Fix text sanitization tests on Windows Fix a failing test case on Windows, due to a character that cannot exist in a filename.	2024-05-09 15:57:42 +03:00
Naglis Jonaitis	5b6cc861d8	Don't use pytest-mock mocker.patch as context manager Quote from `pytest-mock` docs [1]: > The purpose of this plugin is to make the use of context managers and > function decorators for mocking unnecessary, so it will emit a warning > when used as such. Thus using it as a context manager currently produces a warning during test runs in CI which is extra noise that could make new (possibly more important) warnings harder to spot. [1]: https://pytest-mock.readthedocs.io/en/latest/usage.html#usage-as-context-manager	2024-05-08 15:40:14 +03:00
Alex Pyrgiotis	307ecd495c	tests: Ignore a lint error found by mypy 1.9.0 Ignore a lint error that has started showing up since mypy 1.9.0. The official docs show that the `.instance()` method does not accept a `cls` argument [1], so either the stubs or mypy are wrong here. [1]: https://doc.qt.io/qtforpython-6.5/PySide6/QtCore/QCoreApplication.html#PySide6.QtCore.PySide6.QtCore.QCoreApplication.instance	2024-04-25 16:23:39 +03:00
Naglis Jonaitis	d632908a44	Fix printing of filenames with surrogate escapes On Unix systems a filename can be a sequence of bytes that is not valid UTF-8. Python uses[1] surrogate escapes to allow to decode such filenames to Unicode (bytes that cannot be decoded are replaced by a surrogate; upon encoding the surrogate is converted to the original byte). From `click` docs[2]: > Invalid bytes or surrogate escapes will raise an error when written > to a stream with `errors="strict"`. This will typically happen with > `stdout` when the locale is something like `en_GB.UTF-8`. To fix that, we use `utils.replace_control_chars()` before printing the filenames to `stdout` so that surrogate escapes are replaced by �. Fixes #768	2024-04-25 14:11:25 +03:00
Naglis Jonaitis	52ced04507	Relax the restrictions of util.replace_control_chars The `util.replace_control_chars()` function was overly strict, and would replace every non-ASCII character with "_". This included both control characters, as well as normal characters in a non-English alphabet. Relax these restrictions by checking each character and deciding if it's a Unicode control character, using the `unicodedata` Python package. With this change, emojis and non-English letters are now allowed.	2024-04-25 14:11:16 +03:00
Alex Pyrgiotis	2fa592eb69	tests: Add a fixture for uncommon filenames Add a pytest fixture that crafts a filename with Unicode characters that are not considered common for this use. By default, this fixture uses an invalid Unicode character as well, but we strip it in case of macOS (APFS) since filenames must be UTF-8 encoded. [1]: https://en.wikipedia.org/wiki/Filename#Comparison_of_filename_limitations	2024-04-25 13:23:22 +03:00
Alex Pyrgiotis	d4974b1229	tests: Add termination tests for Dummy provider Add termination tests for the Dummy provider, so that we can have cross-platform coverage in our Windows/macOS CI runners, which can't use the Container isolation providers.	2024-04-24 15:06:01 +03:00
Alex Pyrgiotis	abc66840a8	tests: Add termination tests for Qubes Add termination-related tests for Qubes. To achieve this, we need to make a change to the Qubes isolation provider. More specifically, we need to make the isolation provider yield control to the caller only when the disposable qube is up and running. Qubes does not provide us a solid guarantee to do so, but we've found a hacky workaround, whereby we wait until the `qrexec-client-vm` process opens a `/dev/xen` character device. This should happen, in theory, once the disposable qube is ready, and has sent a `MSG_SERVICE_CONNECT` RPC message to the caller.	2024-04-24 14:39:15 +03:00
Alex Pyrgiotis	875d49fe10	tests: Add termination tests for containers Add termination-related tests for containers. To achieve this, we need to make a change to the container isolation provider. More specifically, we need to make the isolation provider yield control to the caller only when the container is up and running. Failure to do so may lead to lingering processes.	2024-04-24 14:39:15 +03:00
Alex Pyrgiotis	fec7609547	tests: Add some termination-related test cases Add some test cases in the isolation provider tests, that check how it behaves when a process completes successfully, lingers, or cannot terminate. These tests cannot run yet, since they must be imported by a concrete isolation provider test class. In subsequent commits, we will start enabling them.	2024-04-24 14:39:15 +03:00
Alex Pyrgiotis	6850d31edc	isolation_provider: Pass doc when creating doc-to-pixels proc Pass the Document instance that will be converted to the `IsolationProvider.start_doc_to_pixels_proc()` method. Concrete classes can then associate this name with the started process, so that they can later on kill it.	2024-04-24 14:33:33 +03:00
deeplow	9bb1993e77	Create tests/test_settings.py with extra coverage Previously settings was implicitly tested on tests/gui/test_updater.py. However this was concerned with updater-related tests only, which incidentally covered almost all of settings.py. However, a few tests were missing. This commit increases the test coverage, but also tests additional test conditions. The goal is to help us increase the test coverage of the previous scenario, which tested for the persistence of user data (settings). This way we can drop the requirement to test this on linux hosts, which is slightly harder (more cumbersome) to do.	2024-04-01 18:18:41 +03:00
Alex Pyrgiotis	d376e1da00	tests: Adapt Qubes tests Adapt Qubes tests to the addition of the conversion process in doc_to_pixels() call.	2024-02-20 15:58:42 +02:00
Alex Pyrgiotis	634523dac9	Get underlying error when conversion fails When we get an early EOF from the converter process, we should immediately get the exit code of that process, to find out the actual underlying error. Currently, the exception we raise masks the underlying error. Raise a ConverterProcException, that in turns makes our error handling code read the exit code of the spawned process, and converts it to a helpful error message. Fixes #714	2024-02-20 15:55:45 +02:00
deeplow	69c2a02d81	Remove timeouts Remove timeouts due to several reasons: 1. Lost purpose: after implementing the containers page streaming the only subprocess we have left is LibreOffice. So don't have such a big risk of commands hanging (the original reason for timeouts). 2. Little benefit: predicting execution time is generically unsolvable computer science problem. Ultimately we were guessing an arbitrary time based on the number of pages and the document size. As a guess we made it pretty lax (30s per page or MB). A document hanging for this long will probably lead to user frustration in any case and the user may be compelled to abort the conversion. 3. Technical Challenges with non-blocking timeout: there have been several technical challenges in keeping timeouts that we've made effort to accommodate. A significant one was having to do non-blocking read to ensure we could timeout when reading conversion stream (and then used here) Fixes #687	2024-02-06 20:11:43 +00:00
deeplow	f31374e33c	Revert "Add non-blocking read utility" This reverts commit `fea193e935`. This is part of the purge of timeout-related code since we no longer need it [1]. Non-blocking reads were introduced in the reverted commit in order to be able to cut a stream mid-way due to a timeout. This is no longer needed now that we're getting rid of timeouts. [1]: https://github.com/freedomofpress/dangerzone/issues/687	2024-02-06 19:42:41 +00:00
deeplow	1835756b45	Allow each conversion to have its own proc If we increased the number of parallel conversions, we'd run into an issue where the streams were getting mixed together. This was because the Converter.proc was a single attribute. This breaks it down into a local variable such that this mixup doesn't happen.	2024-02-06 19:42:41 +00:00
deeplow	943bab2def	Move Qubes-specific tests also to containers Now that Qubes and Containers essentially share the same code, we can have both run the same tests.	2024-02-06 19:42:41 +00:00
deeplow	61e7a3c107	Fix isolation provider tests Conversions methods had changed and that was part of the reason why the tests were failing. Furthermore, due to the `provider.proc`, which stores the associated qrexec / container process, "server" exceptions raise a IterruptedConversion error (now ConverterProcException), which then requires interpretation of the process exit code to obtain the "real" exception.	2024-02-06 19:42:41 +00:00
deeplow	550786adfe	Remove untrusted progress parsing (stderr instead) Now that only the second container can send JSON-encoded progress information, we can the untrusted JSON parsing. The parse_progress was also renamed to `parse_progress_trusted` to ensure future developers don't mistake this as a safe method. The old methods for sending untrusted JSON were repurposed to send the progress instead to stderr for troubleshooting in development mode. Fixes #456	2024-02-06 19:42:40 +00:00
deeplow	0a099540c8	Stream pages in containers: merge isolation providers Merge Qubes and Containers isolation providers core code into the class parent IsolationProviders abstract class. This is done by streaming pages in containers for exclusively in first conversion process. The commit is rather large due to the multiple interdependencies of the code, making it difficult to split into various commits. The main conversion method (_convert) now in the superclass simply calls two methods: - doc_to_pixels() - pixels_to_pdf() Critically, doc_to_pixels is implemented in the superclass, diverging only in a specialized method called "start_doc_to_pixels_proc()". This method obtains the process responsible that communicates with the isolation provider (container / disp VM) via `podman/docker` and qrexec on Containers and Qubes respectively. Known regressions: - progress reports stopped working on containers Fixes #443	2024-02-06 19:42:33 +00:00
deeplow	cd99122385	Adds file formats: epub svg bmp pnm bpm ppm Partially fix for #660. Missing some files due to limitations [1]: - PSD - only available from PyMuPDF>=1.23.0 (qubes-fedora is lower) - TXT - only available from PyMuPDF>=1.23.7 (qubes-fedora is lower) - JXR - PyMuPDF was refusing to due to missing codec [1] - JPX - Generated test file was rejected by PyMuPDF [2] - FB2 - Most often cannot be detected by mime type alone [3] - CBZ - (idem) - XPS - (idem) - MOBI - (idem) - PAM - General version of other file format already included, so I decided not to include this extension [0] New test files were generated locally: - epub - generated with calibre's convert-ebook from another sample file - svg - generated with inkscape from a mix of a default template (hexagons) and a logo's PNG file - bmp, pnm, bpm, ppm - generated with ImageMagick's 'convert' from tests/test_docs/sample-png.png [0]: https://github.com/freedomofpress/dangerzone/issues/660#issuecomment-1914681487 [1]: https://github.com/freedomofpress/dangerzone/issues/660#issuecomment-1916803201 [2]: https://github.com/freedomofpress/dangerzone/issues/660#issuecomment-1916870347 [3]: https://github.com/freedomofpress/dangerzone/issues/688	2024-01-31 19:58:48 +00:00
deeplow	f676891482	Remove Dockerfile dependencies replaced by PyMuPDF PyMuPDF replaced the need for almost all dependencies, which this commit now removes. We are also removing tesseract-ocr as a dependency since (to our surprise) PyMuPDF ships directly with tesseract binaries [1]. However, now that tesseract-ocr is not available directly as a binary tool, the `test_ocr.py` needed to be changed. Fixes #658 [1]: https://github.com/freedomofpress/dangerzone/issues/658#issuecomment-1861033149	2024-01-03 12:58:36 +00:00
Moon Sungjoon	63aea4cb45	Enable HWP conversion on MacOS (Apple silicon CPU) This PR reverts the patch that disables HWP / HWPX conversion on MacOS M1. It does not fix conversion on Qubes OS (#494). Previously, HWP / HWPX conversion didn't work on MacOS (Apple silicon CPU) (#498) because libreoffice wasn't built with Java support on Alpine Linux for ARM (aarch64). Gratefully, the Alpine team has enabled Java support on the aarch64 system [1], so we can enable it again for ARM architectures. And this patch is included in Alpine 3.19 This commit was included in #541 and reverted on #562 due to a stability issue. Fixes #498 [1]: `74d443f479`	2023-12-13 12:57:22 +02:00
Garrett Robinson	53115b3ffa	Use more descriptive button labels in update check prompt	2023-10-31 12:52:34 +00:00
Alex Pyrgiotis	ba5adb33c0	Fix a bug in "Change Selection" Fix a bug in the "Change Selection" action, whereby changing your selection and picking files from another directory results in: "Dangerzone does not support adding documents from multiple locations. The newly added documents were ignored." To fix this, change the output directory when we change selection as well.	2023-10-13 22:45:11 +03:00
Alex Pyrgiotis	89a36efe89	tests: Fix typo	2023-10-03 11:32:37 +03:00
Alex Pyrgiotis	2016965c84	Revert "Enable HWP conversion on MacOS M1" This reverts commit `214ce9720d`. The rationale is that we want to wait until the LibreOffice package that allows HWP conversion in Alpine Linux lands in `alpine:latest`. For more info, read https://github.com/freedomofpress/dangerzone/issues/498#issuecomment-1739894100	2023-10-02 14:22:47 +03:00

1 2 3

117 commits