dangerzone

mirror of https://github.com/freedomofpress/dangerzone.git synced 2025-05-04 12:41:50 +02:00

Author	SHA1	Message	Date
Alexis Métaireau	5bd51575fe	Display the `{podman,docker} pull` progress when installing a new image The progressbars we see when using this same commands on the command line doesn't seem to be passed to the python process here, unfortunately.	2025-03-03 12:59:36 +01:00
Alexis Métaireau	a540fc5b08	(WIP) Add tests	2025-02-12 18:23:12 +01:00
Alexis Métaireau	835970b541	fixup! (WIP) Check for container updates rather than using `image-id.txt`	2025-02-12 12:05:20 +01:00
Alexis Métaireau	60674ea6b4	fixup! (WIP) Check for container updates rather than using `image-id.txt`	2025-02-12 11:53:36 +01:00
Alexis Métaireau	5c9a38d370	(WIP) Check for container updates rather than using `image-id.txt`	2025-02-11 19:24:59 +01:00
Alexis Métaireau	3d5cacfffb	Warn users if the minimum version of Docker Desktop is not met This only happens on Windows and macOS. Fixes #693	2025-01-21 10:21:24 +01:00
Alexis Métaireau	acf20ef700	Add a `--debug` flag to the CLI to help retrieve more logs When the flag is set, the `RUNSC_DEBUG=1` environment variable is added to the outer container, and stderr is captured in a separate thread, before printing its output.	2025-01-16 11:35:06 +01:00
Alex Pyrgiotis	ec9f8835e0	Move container security arg to proper place Now that #748 has been merged, we can move the `--userns nomap` argument to the list with the rest of our security arguments.	2024-12-10 11:31:39 +02:00
Alex Pyrgiotis	0383081394	Factor out container utilities to separate module	2024-12-10 11:31:39 +02:00
Alex Pyrgiotis	25fba42022	Extend the interface of the isolation provider Add the following two methods in the isolation provider: 1. `.is_available()`: Mainly used for the Container isolation provider, it specifies whether the container runtime is up and running. May be used in the future by other similar providers. 2. `.should_wait_install()`: Whether the isolation provider takes a while to be installed. Should be `True` only for the Container isolation provider, for the time being.	2024-12-10 11:29:00 +02:00
Alex Pyrgiotis	e22c795cb7	container: Revamp container image installation Revamp the container image installation process in a way that does not involve using image IDs. We don't want to rely on image IDs anymore, since they are brittle (see https://github.com/freedomofpress/dangerzone/issues/933). Instead, we use image tags, as provided in the `image-id.txt` file. This allows us to check fast if an image is up to date, and we no longer need to maintain multiple image IDs from various container runtimes. Refs #933 Refs #988 Fixes #1020	2024-12-10 11:29:00 +02:00
Alex Pyrgiotis	20152fac13	container: Factor out loading an image tarball	2024-12-10 11:18:23 +02:00
Alex Pyrgiotis	6b51d56e9f	container: Manipulate Dangerzone image tags Add the following methods that allow the `Container` isolation provider to work with tags for the Dangerzone image: * `list_image_tag()` * `delete_image_tag()` * `add_image_tag()`	2024-12-10 11:18:23 +02:00
Alex Pyrgiotis	50627d375c	Fix a small typo	2024-10-22 19:07:09 +03:00
Alexis Métaireau	a95b612e78	Catch installation errors and display them. Fixes #193	2024-10-17 16:20:56 +02:00
Alex Pyrgiotis	7ea7c8a0cc	Remove dead code	2024-10-17 15:50:12 +03:00
Alex Pyrgiotis	d6410652cb	Kill the process group when conversion terminates Instead of killing just the invoked Podman/Docker/qrexec process, kill the whole process group, to make sure that other components that have been spawned die as well. In the case of Podman, conmon is one of the processes that lingers, so that's one way to kill it.	2024-10-07 17:37:39 +03:00
Alex Pyrgiotis	b9a3dd63ad	Always start conversion process in new session Start the conversion process in a new session, so that we can later on kill the process group, without killing the controlling script (i.e., the Dangezone UI). This should not affect the conversion process in any other way.	2024-10-07 17:27:38 +03:00
Alexis Métaireau	3e434d08d1	Always use our own seccomp policy as a default. As per Etienne Perot's comment on #908: > Then it seems to me like it would be easy to simply apply this seccomp profile under all container runtimes (since there's no reason why the same image and the same command-line would call different syscalls under different container runtimes).	2024-10-02 14:12:48 +02:00
Alexis Métaireau	eb10082a62	Merge branch 'hotfix-0.7.1' into main	2024-10-01 15:16:25 +02:00
Alex Pyrgiotis	4423fc6232	Handle multiple image IDs in the `image-ids.txt` file. Docker Desktop 4.30.0 uses the containerd image store by default, which generates different IDs for the images, and as a result breaks the logic we are using when verifying the images IDs are present. Now, multiple IDs can be stored in the `image-id.txt` file. Fixes #933	2024-09-30 12:34:34 +02:00
Alex Pyrgiotis	27d201a95b	container: Avoid pop-ups on Windows Avoid window pop-ups on Windows systems, by using the `startupinfo` argument of `subprocess.run`.	2024-09-27 12:55:46 +03:00
Alexis Métaireau	c3c7fbbc20	Fix wrong container-runtime detection on Linux Use "podman" when on Linux, and "docker" otherwise. This commit also adds a text widget to the interface, showing the actual content fo the error that happened, to help debug further if needed. Fixes #212	2024-09-18 15:04:57 +02:00
Alex Pyrgiotis	0a181a3342	container: Set `container_engine_t` SELinux label Set the `container_engine_t` SELinux on the outer Podman container, so that gVisor does not break on systems where SELinux is enforcing. This label is provided for container engines running within a container, which fits our `runsc` within `crun` situation. We have considered using the more permissive `label=disable` option, to disable SELinux labels altogether, but we want to take advantage of as many SELinux protections as we can, even for the outer container. Cherry-picked from `e1e63d14f8` Fixes #880	2024-07-30 16:41:13 +03:00
Alex Pyrgiotis	e1e63d14f8	container: Set `container_engine_t` SELinux label Set the `container_engine_t` SELinux on the outer Podman container, so that gVisor does not break on systems where SELinux is enforcing. This label is provided for container engines running within a container, which fits our `runsc` within `crun` situation. We have considered using the more permissive `label=disable` option, to disable SELinux labels altogether, but we want to take advantage of as many SELinux protections as we can, even for the outer container. Fixes #880	2024-07-26 16:34:19 +03:00
Alex Pyrgiotis	b6f399be6e	container: Avoid pop-ups on Windows Avoid window pop-ups on Windows systems, by using the `startupinfo` argument of `subprocess.run`.	2024-07-02 20:41:58 +03:00
Alex Pyrgiotis	756945931f	container: Handle case where `docker kill` hangs We have encountered several conversions where the `docker kill` command hangs. Handle this case by specifying a timeout to this command. If the timeout expires, log a warning and proceed with the rest of the termination logic (i.e., kill the conversion process). Fixes #854	2024-07-01 17:56:21 +03:00
Alex Pyrgiotis	e7e3430ca1	Use a custom seccomp policy for older Docker Desktop releases We are aware that some Docker Desktop releases before 25.0.0 ship with a seccomp policy which disables the `ptrace(2)` system call. In such cases, we opt to use our own seccomp policy which allows this system call. This seccomp policy is the default one in the latest releases of Podman, and we use it in Linux distributions where Podman version is < 4.0. Fixes #846	2024-06-26 18:49:03 +03:00
Etienne Perot	f03bc71855	Sandbox all Dangerzone document processing within gVisor. This wraps the existing container image inside a gVisor-based sandbox. gVisor is an open-source OCI-compliant container runtime. It is a userspace reimplementation of the Linux kernel in a memory-safe language. It works by creating a sandboxed environment in which regular Linux applications run, but their system calls are intercepted by gVisor. gVisor then redirects these system calls and reinterprets them in its own kernel. This means the host Linux kernel is isolated from the sandboxed application, thereby providing protection against Linux container escape attacks. It also uses `seccomp-bpf` to provide a secondary layer of defense against container escapes. Even if its userspace kernel gets compromised, attackers would have to additionally have a Linux container escape vector, and that exploit would have to fit within the restricted `seccomp-bpf` rules that gVisor adds on itself. Fixes #126 Fixes #224 Fixes #225 Fixes #228	2024-06-12 13:40:04 +03:00
Alex Pyrgiotis	7179d6f734	Get container runtime version Get the (major, minor) parts of the Docker/Podman version, to check if some specific features can be used, or if we need a fallback. These features are related with the upcoming gVisor integration, and will be added in subsequent commits.	2024-06-12 13:40:04 +03:00
Alexis Métaireau	d9d9ab91a3	docs: document why `get_tmp_dir` is required in the imports	2024-06-05 14:19:32 +02:00
Alexis Métaireau	eba30f3c17	fix: do not catch bare exceptions Bare excepts will catch keyboard-exit exceptions, system-exit etc. which is probably not what we want.	2024-06-05 14:19:31 +02:00
Alexis Métaireau	5aa4863b52	chore(imports): remove useless imports As detected by [ruff](https://github.com/astral-sh/ruff) Related to #254, although it doesn't provide the command to lint the codebase itself.	2024-06-05 14:19:30 +02:00
Alex Pyrgiotis	ff25fa3045	Fix stuck conversion processes Gracefully terminate certain conversion processes that may get stuck when writing lots of data to stdout. Also, handle a race condition when a conversion process terminates slightly after the associated container. Fixes #791	2024-05-09 16:46:15 +03:00
Alex Pyrgiotis	d6202cd028	Invoke external command on Windows properly On Windows, if we don't use the `startupinfo=` argument of subprocess.Popen, then a terminal window will flash while running the command. Use `startupinfo=` when killing a container, as we do for every other command.	2024-05-09 15:57:42 +03:00
Alex Pyrgiotis	171a7eca52	isolation_provider: Terminate doc-to-pixels proc Extend the IsolationProvider class with a `terminate_doc_to_pixels_proc()` method, which must be implemented by the Qubes/Container providers and gracefully terminate a process started for the doc to pixels phase. Refs #563	2024-04-24 14:36:14 +03:00
Alex Pyrgiotis	a63f4b85eb	isolation_provider: Set a unique name for spawned containers Set a unique name for spawned containers, based on the ID of the provided document. This ID is not globally unique, as it has few bits of entropy. However, since we only want to avoid collisions within a single Dangerzone invocation, and since we can't support multiple containers running in parallel, this ID will suffice.	2024-04-24 14:33:33 +03:00
Alex Pyrgiotis	6850d31edc	isolation_provider: Pass doc when creating doc-to-pixels proc Pass the Document instance that will be converted to the `IsolationProvider.start_doc_to_pixels_proc()` method. Concrete classes can then associate this name with the started process, so that they can later on kill it.	2024-04-24 14:33:33 +03:00
Alex Pyrgiotis	a31f3370d0	Capture missing logs in second-stage conversion For a while now, we didn't get logs for the second-stage conversion when using containers. Extend the code to log any captured output from the second stage conversion, only if we run Dangerzone via our dev entrypoint. Note that the Qubes isolation provider was always logging output from the second stage of the conversion.	2024-03-13 20:59:50 +02:00
deeplow	879fca6f9f	Remove uneeded TESSDATA_PREFIX setting in container The container image does not need the TESSDATA_PREFIX env variable since its PyMuPDF version is new enough to support `tessdata` as an argument when calling the PyMuPDF tesseract method.	2024-02-07 13:14:08 +00:00
deeplow	69c2a02d81	Remove timeouts Remove timeouts due to several reasons: 1. Lost purpose: after implementing the containers page streaming the only subprocess we have left is LibreOffice. So don't have such a big risk of commands hanging (the original reason for timeouts). 2. Little benefit: predicting execution time is generically unsolvable computer science problem. Ultimately we were guessing an arbitrary time based on the number of pages and the document size. As a guess we made it pretty lax (30s per page or MB). A document hanging for this long will probably lead to user frustration in any case and the user may be compelled to abort the conversion. 3. Technical Challenges with non-blocking timeout: there have been several technical challenges in keeping timeouts that we've made effort to accommodate. A significant one was having to do non-blocking read to ensure we could timeout when reading conversion stream (and then used here) Fixes #687	2024-02-06 20:11:43 +00:00
deeplow	07dd54cd13	Fix hanging: disable container logging The conversion was hanging arbitrarily [1] on some systems. Sometimes it would send the full page other times stop half-way. Originally found by @apyrgio. Co-authored-by: @apyrgio [1]: https://github.com/freedomofpress/dangerzone/pull/627#issuecomment-1892491968	2024-02-06 19:42:41 +00:00
deeplow	61e7a3c107	Fix isolation provider tests Conversions methods had changed and that was part of the reason why the tests were failing. Furthermore, due to the `provider.proc`, which stores the associated qrexec / container process, "server" exceptions raise a IterruptedConversion error (now ConverterProcException), which then requires interpretation of the process exit code to obtain the "real" exception.	2024-02-06 19:42:41 +00:00
deeplow	550786adfe	Remove untrusted progress parsing (stderr instead) Now that only the second container can send JSON-encoded progress information, we can the untrusted JSON parsing. The parse_progress was also renamed to `parse_progress_trusted` to ensure future developers don't mistake this as a safe method. The old methods for sending untrusted JSON were repurposed to send the progress instead to stderr for troubleshooting in development mode. Fixes #456	2024-02-06 19:42:40 +00:00
deeplow	0a099540c8	Stream pages in containers: merge isolation providers Merge Qubes and Containers isolation providers core code into the class parent IsolationProviders abstract class. This is done by streaming pages in containers for exclusively in first conversion process. The commit is rather large due to the multiple interdependencies of the code, making it difficult to split into various commits. The main conversion method (_convert) now in the superclass simply calls two methods: - doc_to_pixels() - pixels_to_pdf() Critically, doc_to_pixels is implemented in the superclass, diverging only in a specialized method called "start_doc_to_pixels_proc()". This method obtains the process responsible that communicates with the isolation provider (container / disp VM) via `podman/docker` and qrexec on Containers and Qubes respectively. Known regressions: - progress reports stopped working on containers Fixes #443	2024-02-06 19:42:33 +00:00
deeplow	331b6514e8	Containers: remove debug messages (via files) Remove container_log messages ahead of debug info being sent over standard streams.	2024-02-06 18:54:39 +00:00
deeplow	77d5ea5940	Add PyMuPDF in pixels_to_pdf replacing old logic Adding PyMuPDF essentially make the code much simpler since it can do everything that we'd need multiple programs for. It also includes tesseract-OCR integration, which this commit makes use of.	2024-01-03 12:56:33 +00:00
Alex Pyrgiotis	6232062146	Add missing newline char	2023-10-02 15:41:29 +03:00
deeplow	b4c3e07d36	Remove attacker-controlled error messages Creates exceptions in the server code to be shared with the client via an identifying exit code. These exceptions are then reconstructed in the client. Refs #456 but does not completely fix it. Unexpected exceptions and progress descriptions are still passed in Containers.	2023-09-19 15:33:20 +01:00
deeplow	9ec9cc5f87	Replace armor guards that indicate isolated output	2023-08-22 16:11:41 +01:00

1 2

69 commits