dangerzone

mirror of https://github.com/freedomofpress/dangerzone.git synced 2025-04-28 18:02:38 +02:00

Author	SHA1	Message	Date
deeplow	69c2a02d81	Remove timeouts Remove timeouts due to several reasons: 1. Lost purpose: after implementing the containers page streaming the only subprocess we have left is LibreOffice. So don't have such a big risk of commands hanging (the original reason for timeouts). 2. Little benefit: predicting execution time is generically unsolvable computer science problem. Ultimately we were guessing an arbitrary time based on the number of pages and the document size. As a guess we made it pretty lax (30s per page or MB). A document hanging for this long will probably lead to user frustration in any case and the user may be compelled to abort the conversion. 3. Technical Challenges with non-blocking timeout: there have been several technical challenges in keeping timeouts that we've made effort to accommodate. A significant one was having to do non-blocking read to ensure we could timeout when reading conversion stream (and then used here) Fixes #687	2024-02-06 20:11:43 +00:00
deeplow	f31374e33c	Revert "Add non-blocking read utility" This reverts commit `fea193e935`. This is part of the purge of timeout-related code since we no longer need it [1]. Non-blocking reads were introduced in the reverted commit in order to be able to cut a stream mid-way due to a timeout. This is no longer needed now that we're getting rid of timeouts. [1]: https://github.com/freedomofpress/dangerzone/issues/687	2024-02-06 19:42:41 +00:00
deeplow	1835756b45	Allow each conversion to have its own proc If we increased the number of parallel conversions, we'd run into an issue where the streams were getting mixed together. This was because the Converter.proc was a single attribute. This breaks it down into a local variable such that this mixup doesn't happen.	2024-02-06 19:42:41 +00:00
deeplow	943bab2def	Move Qubes-specific tests also to containers Now that Qubes and Containers essentially share the same code, we can have both run the same tests.	2024-02-06 19:42:41 +00:00
deeplow	61e7a3c107	Fix isolation provider tests Conversions methods had changed and that was part of the reason why the tests were failing. Furthermore, due to the `provider.proc`, which stores the associated qrexec / container process, "server" exceptions raise a IterruptedConversion error (now ConverterProcException), which then requires interpretation of the process exit code to obtain the "real" exception.	2024-02-06 19:42:41 +00:00
deeplow	550786adfe	Remove untrusted progress parsing (stderr instead) Now that only the second container can send JSON-encoded progress information, we can the untrusted JSON parsing. The parse_progress was also renamed to `parse_progress_trusted` to ensure future developers don't mistake this as a safe method. The old methods for sending untrusted JSON were repurposed to send the progress instead to stderr for troubleshooting in development mode. Fixes #456	2024-02-06 19:42:40 +00:00
deeplow	0a099540c8	Stream pages in containers: merge isolation providers Merge Qubes and Containers isolation providers core code into the class parent IsolationProviders abstract class. This is done by streaming pages in containers for exclusively in first conversion process. The commit is rather large due to the multiple interdependencies of the code, making it difficult to split into various commits. The main conversion method (_convert) now in the superclass simply calls two methods: - doc_to_pixels() - pixels_to_pdf() Critically, doc_to_pixels is implemented in the superclass, diverging only in a specialized method called "start_doc_to_pixels_proc()". This method obtains the process responsible that communicates with the isolation provider (container / disp VM) via `podman/docker` and qrexec on Containers and Qubes respectively. Known regressions: - progress reports stopped working on containers Fixes #443	2024-02-06 19:42:33 +00:00
deeplow	cd99122385	Adds file formats: epub svg bmp pnm bpm ppm Partially fix for #660. Missing some files due to limitations [1]: - PSD - only available from PyMuPDF>=1.23.0 (qubes-fedora is lower) - TXT - only available from PyMuPDF>=1.23.7 (qubes-fedora is lower) - JXR - PyMuPDF was refusing to due to missing codec [1] - JPX - Generated test file was rejected by PyMuPDF [2] - FB2 - Most often cannot be detected by mime type alone [3] - CBZ - (idem) - XPS - (idem) - MOBI - (idem) - PAM - General version of other file format already included, so I decided not to include this extension [0] New test files were generated locally: - epub - generated with calibre's convert-ebook from another sample file - svg - generated with inkscape from a mix of a default template (hexagons) and a logo's PNG file - bmp, pnm, bpm, ppm - generated with ImageMagick's 'convert' from tests/test_docs/sample-png.png [0]: https://github.com/freedomofpress/dangerzone/issues/660#issuecomment-1914681487 [1]: https://github.com/freedomofpress/dangerzone/issues/660#issuecomment-1916803201 [2]: https://github.com/freedomofpress/dangerzone/issues/660#issuecomment-1916870347 [3]: https://github.com/freedomofpress/dangerzone/issues/688	2024-01-31 19:58:48 +00:00
deeplow	f676891482	Remove Dockerfile dependencies replaced by PyMuPDF PyMuPDF replaced the need for almost all dependencies, which this commit now removes. We are also removing tesseract-ocr as a dependency since (to our surprise) PyMuPDF ships directly with tesseract binaries [1]. However, now that tesseract-ocr is not available directly as a binary tool, the `test_ocr.py` needed to be changed. Fixes #658 [1]: https://github.com/freedomofpress/dangerzone/issues/658#issuecomment-1861033149	2024-01-03 12:58:36 +00:00
Moon Sungjoon	63aea4cb45	Enable HWP conversion on MacOS (Apple silicon CPU) This PR reverts the patch that disables HWP / HWPX conversion on MacOS M1. It does not fix conversion on Qubes OS (#494). Previously, HWP / HWPX conversion didn't work on MacOS (Apple silicon CPU) (#498) because libreoffice wasn't built with Java support on Alpine Linux for ARM (aarch64). Gratefully, the Alpine team has enabled Java support on the aarch64 system [1], so we can enable it again for ARM architectures. And this patch is included in Alpine 3.19 This commit was included in #541 and reverted on #562 due to a stability issue. Fixes #498 [1]: `74d443f479`	2023-12-13 12:57:22 +02:00
Garrett Robinson	53115b3ffa	Use more descriptive button labels in update check prompt	2023-10-31 12:52:34 +00:00
Alex Pyrgiotis	ba5adb33c0	Fix a bug in "Change Selection" Fix a bug in the "Change Selection" action, whereby changing your selection and picking files from another directory results in: "Dangerzone does not support adding documents from multiple locations. The newly added documents were ignored." To fix this, change the output directory when we change selection as well.	2023-10-13 22:45:11 +03:00
Alex Pyrgiotis	89a36efe89	tests: Fix typo	2023-10-03 11:32:37 +03:00
Alex Pyrgiotis	2016965c84	Revert "Enable HWP conversion on MacOS M1" This reverts commit `214ce9720d`. The rationale is that we want to wait until the LibreOffice package that allows HWP conversion in Alpine Linux lands in `alpine:latest`. For more info, read https://github.com/freedomofpress/dangerzone/issues/498#issuecomment-1739894100	2023-10-02 14:22:47 +03:00
deeplow	dabdf6c286	FIXUP: rename to QubesQrexecFailed instead	2023-10-02 12:06:18 +01:00
deeplow	23bee23d81	Disable isolation_provider tests on dummy conversion Windows and macOS in CI (which don't support nested virtualization) and thus Docker aren't really candidates for isolation_provider tests.	2023-09-28 11:08:53 +01:00
deeplow	0a6b33ebed	Qubes: detect qube failing to start (missing RAM) In Qubes OS it's often the case that the user doesn't have enough RAM to start the conversion. In this case it raises BrokenPipeException and exits with code 126. It didn't seem possible to distinguish this kind of failure to one where the user has misconfigured qrexec policies. NOTE: this approach is not ideal UX-wise. After the first doc failing the next one will also try and fail. Upon first failure we should inform the user that they need to close some programs or qubes.	2023-09-28 11:08:50 +01:00
deeplow	63f03d5bcd	Add limit and test to max width and height of docs	2023-09-28 11:08:47 +01:00
deeplow	6f26fc6303	Qubes: add test if MAX_PAGES is enforced in client Because the server also checks the MAX_PAGES limit, the test in base would hide the fact that the client is not enforcing the limit. This ensures that's not the case. When the pages in containers are streamed (#443), then this test should be in base.py.	2023-09-28 11:06:36 +01:00
deeplow	54b8ffbf96	Add page limit of 10000 Theoretically the max pages would be 65536 (2byte unsigned int. However this limit is much higher than practical documents have and larger ones can lead to unforseen problems, for example RAM limitations. We thus opted to use a lower limit of 10K. The limit must be detected client-side, given that the server is distrusted. However we also check it in the server, just as a fail-early mechanism.	2023-09-28 11:01:14 +01:00
deeplow	afba362d22	Tests: split isolation provider tests per provider Isolation provider tests done in tests/test_base.py and had pytest.mark.parameterize() for each isolation provider. This logic would not work well when we had test that diverge. We could have marked each one as compatible with one provider or another, but in the end it turned out to be better to have the common ones in a base class and the divergent ones in each. NOTE: this has a strange side-effect: inherited test classes need to have imports for all of the fixtures even if they are not explictly used	2023-09-28 09:53:29 +01:00
Alex Pyrgiotis	fea193e935	Add non-blocking read utility Add a function that can read data from non-blocking fds, which we will used later on to read from standard streams with a timeout.	2023-09-20 17:14:24 +03:00
Moon Sungjoon	214ce9720d	Enable HWP conversion on MacOS M1 This PR reverts the patch that disables HWP / HWPX conversion on MacOS M1. It does not fix conversion on Qubes OS (#494) Previously, HWP / HWPX conversion didn't work on MacOS M1 systems (#498) because libreoffice wasn't built with Java support on Alpine Linux for ARM (aarch64). Gratefully, the Alpine team has enabled Java support on the aarch64 system [1], so we can enable it again for ARM architectures. Fixes #498 [1]: `74d443f479`	2023-09-06 13:10:18 +03:00
deeplow	8221a56c7d	Revert "Propagate "update check" prompt to UI checkbox" This reverts commit 3915a86642502b673aa0e47931823acbe66f1043.	2023-08-23 16:46:44 +01:00
deeplow	1695cc7a6c	Propagate "update check" prompt to UI checkbox The "check for updates" button wasn't showing up immediately as checked as soon as the user is prompted for checking updates. This fixes that. Fixes #513	2023-08-23 16:46:33 +01:00
deeplow	9ec9cc5f87	Replace armor guards that indicate isolated output	2023-08-22 16:11:41 +01:00
deeplow	75369cf621	Adapt code so it works for reporting script Reporting script now parses JunitXML instead of a series of ".container_log" files. The script in in changed submodule. Additionally it makes failed tests actually fail so that this is recorded in the JunitXML report.	2023-08-22 16:11:36 +01:00
deeplow	b73ce5bf6a	Add large test logic and documentation Adds a large pool of document that can and should be used prior to a release to understand effects of the new release over a real-world scenario. Documents are stored in an external git LFS repo under `tests/test_docs_large` and currently it's about 11K documents gathered from multiple PDF readers and office suite's test sets. Documentation on how to run the tests is under `docs/developer/TESTING.md`	2023-08-22 16:11:31 +01:00
Alex Pyrgiotis	e3a8a651f1	Disable HWP / HWPX conversion on MacOS M1 / Qubes The HWP / HWPX conversion feature does not work on the following platforms: * MacOS with Apple Silicon CPU * Native Qubes OS For this reason, we need to: 1. Disable it on the GUI side, by not allowing the user to select these files. 2. Throw an error on the isolation provider side, in case the user directly attempts to convert the file (either through CLI or via "Open With"). Refs #494 Refs #498	2023-08-05 16:50:49 +01:00
Moon Sungjoon	075475c306	Add test files for hwp/hwpx (base64 encoded) Add extra files and base64 encode externally contributed docs. This prevents the accidental opening of such documents, since they couldn't be rebuit by the Dangerzone developers to ensure their safety.	2023-08-01 14:37:14 +01:00
deeplow	72536a05ac	container: Improve parsing of progress reports Improve the `parse_progress()` method of the container isolation provider in the following ways: 1. Make sure that the fields of the progress report have the expected type. 2. In case of a JSON parsing error, sanitize the invalid string so that it doesn't contain escape sequences, or the user considers it as trusted.	2023-08-01 14:43:49 +03:00
Alex Pyrgiotis	9410b68c1d	Sanitize progress reports in a provider-agnostic way Update the common `print_progress()` method in the base `IsolationProvider` class, with two extra features: 1. Always sanitize the provided text argument. 2. Mark the sanitized text argument as untrusted. This is default behavior from now on, since this function is commonly used to parse progress reports from the conversion sandbox.	2023-08-01 14:43:48 +03:00
deeplow	3788139d26	Add utility for sanitizing strings Add `replace_control_chars()` function in `util.py`, which can be used to sanitize strings from ANSI escape sequences or weird Unicode symbols.	2023-08-01 14:43:48 +03:00
Alex Pyrgiotis	81811e0aac	Add collapsible dialog for errors Move the error message from a text browser to a collapsible widget.	2023-08-01 14:29:27 +03:00
deeplow	53ec1cad63	Add update error red dot to hamburger menu	2023-08-01 14:29:11 +03:00
Alex Pyrgiotis	c9eac42855	Improve updater messages Improve the wording of updater messages for better UX.	2023-08-01 14:29:10 +03:00
Alex Pyrgiotis	bc4bba4fa1	tests: Add full test coverage for updater checks Fully test the update check logic, by introducing several Qt tests. Also, improve the `UpdaterThread.get_letest_info()` method, that gets the latest version and changelog from GitHub, with several checks. These checks are also tested in our newly added tests.	2023-07-28 12:18:59 +03:00
Alex Pyrgiotis	fdc53efc35	tests: Test our own custom QApplication By default, `pytest-qt` initializes the default QApplication class that PySide offers. Dangerzone, however, defines its own QApplication subclass. Create a `qapp_cls` fixture that will force `pytest-qt` to use this subclass. For more info, see: https://pytest-qt.readthedocs.io/en/latest/qapplication.html#testing-custom-qapplications	2023-07-28 12:18:58 +03:00
Alex Pyrgiotis	24ba914cc8	updater: Differentiate between "X" and "Cancel" We want to differentiate between the user clicking on "Cancel" and clicking on "X", since in the second case, we want to remind them again on the next run.	2023-07-28 11:50:44 +03:00
deeplow	8254844724	Pass sample_pdf as fixture instead of via class Now that sample_doc was renamed to sample_pdf it could cause some confusion the fact that that the TestBase class had an attribute called sample_doc which referenced the sample PDF. By removing this attribute and passing the fixture instead we are following a more pytest-native approach of passing arguments explicitly.	2023-07-25 15:00:32 +01:00
deeplow	6216761058	remove number from test_doc2 variable create a pytest fixture for a .doc file and .pdf file	2023-07-25 15:00:31 +01:00
deeplow	9ca27fd6fe	Add unit test to document change button Fixes #428	2023-07-25 15:00:29 +01:00
Alex Pyrgiotis	47b337143c	tests: Add Qt test for updates Add a very rudimentary test for GUI update logic. Refs #290	2023-07-24 16:54:16 +03:00
Alex Pyrgiotis	47171a2722	tests: Add tests for update logic	2023-07-24 14:22:27 +03:00
Alex Pyrgiotis	f6e945805e	tests: Add a Pytest fixture for Updater Add a Pytest fixture that returns an UpdaterThread instance which has its own unique settings directory. Note that the UpdaterThread instance needs to be slightly nerfed, so that it doesn't rely on Qt functionality or any isolation providers.	2023-07-24 14:22:27 +03:00
Alex Pyrgiotis	641aa131c9	ci: Add test for OCR languages Test that the languages that we provide to users for OCR match the languages that are installed in the container image Fixes #417	2023-05-24 13:43:29 +03:00
Alex Pyrgiotis	33f81f5064	tests: Add sample files for extra MIME types In PR #378 ("container: Allow converting more document formats"), we added support for the following MIME types: * application/zip * application/octet-stream * application/x-ole-storage * application/vnd.oasis.opendocument.spreadsheet-template * application/vnd.oasis.opendocument.text-template However, we forgot to add some tests for these MIME types in the repo. In this commit, we add a file for each of these MIME types, to make sure we have no regressions in the future.	2023-04-03 18:58:56 +03:00
Alex Pyrgiotis	8b846820d2	Update typing hints for Mypy 1.1.1 Due to a bump in our Python dependencies, we now install Mypy 1.1.1 instead of 0.982. This change triggered the following errors: * Incompatible default for argument <a> (default has type None, argument has type <t>): Mypy further explains here that PEP 484 prohibits implicit Optional, so we need to make these types explicit Optional. * Unused "type: ignore" comment, use narrower [method-assign] instead of [assignment]: Mypy has specialized some of its lints, meaning that we should switch to the newer variants. Also, it detected several other small inconsistencies. We fix all of these errors in this commit.	2023-03-27 15:19:43 +03:00
Alex Pyrgiotis	b94d0712c8	Minor corrections in test code	2023-02-17 01:15:08 +02:00
Alex Pyrgiotis	d733890ca0	container: Do not leave stale temporary dirs Do not leave stale temporary directories when conversion fails unexpectedly. Instead, wrap the conversion operation in a context manager that wipes the temporary dir afterwards. Fixes #317	2023-02-17 01:15:08 +02:00

1 2

81 commits