QUOTE(Spunktimmy @ Aug 12 2024, 19:47)

Hey everyone, I'm really going crazy here. My tech illiterate brain has still been unable to figure out how to extract protected pdf files from dlsite. I tried using the dlsite viewer, but it can't read pdf files. I tried using pdfimages.exe, but it says "Permission Error: Copying of images from this document is not allowed.". Not sure what I should do, but any help is appreciated. I did ask 6 months ago, but I never got a reply. Is there someone I can dm for help?
if you can view the pdf normally in a normal pdf viewer it's totally an arbitrary restriction. I think I've removed that "protection" using pdftk (
the non-java version)
EDIT!!! You want the java version.
[
gitlab.com]
https://gitlab.com/pdftk-java/pdftk/-/releases[
gitlab.com]
https://gitlab.com/api/v4/projects/5024297/...3/pdftk-all.jarCODE
java -jar pdftk.jar input_file.pdf output output_file.pdf
(that's the literal word "output" as the second argument to pdftk).
If it worked, you'll see something like (in my test just now):
CODE
WARNING: The creator of the input PDF:
1973_01.pdf
has set an owner password (which is not required to handle this PDF).
You did not supply this password. Please respect any copyright.
After doing that, maybe pdfimages will work on your outputted PDF file. It did for me.
If there's a "user password" too, and you know it, you can add
CODE
user_pw my-password-goes-here
after the output filename in the above command.
If you want to send it to me I can try to extract it for you, too. If it's a standard PDF, that is.
I did this unprotection trick while I was removing watermarks from a bunch of "National Lampoon" magazine PDF files. I've also done it on a textbook for linux certification for a colleague at my last workplace. PDF hacking can be kind of neat.
This post has been edited by Moonlight Rambler: Aug 13 2024, 06:07