Offline libraries like Tessaract suck ass.
Your best bet will be using either Microsoft's or Google's online APIs. Last I checked, both were free with rate limits that were more than sufficient for personal use (I've scanned ~1k pages in a day without issue).
I work mostly on Korean stuff but I imagine they'd perform mostly the same for JP stuff. In particular, Google's thingy is noticeably more accurate and better with spacing, but their API is dogshit (hard to install / hard to navigate docs / clunky and barebone response).
----
I mostly use it for generating a script template like this: [
docs.google.com]
https://docs.google.com/spreadsheets/d/1FRQ...tpOQ/edit#gid=0 (click top-left cell for the raws)
As for the python code I used to generate that... it's messy as hell so it's kind of a pain to share. But I don't mind sharing specific bits of it / answering any questions though.
Also heads-up that if you want the bubble texts sorted in reading order, you'll probably want do some kind of contour detection / ML model training for the panels. I was too lazy for that so just sorted them by bbox centers and manually corrected lol.
This post has been edited by 프레이: Feb 8 2021, 08:08