Reviews for OCR + Translate
OCR + Translate by Crivella
3 reviews
- Rated 5 out of 5by Firefox user 18707959, 3 months ago
- Rated 5 out of 5by greenlime, 10 months agothis work great for japanese to english.
but i can't make it work with chinese to english. is there any specific combination? i try to use tesseract and helsinki NLP. it is failed when running the OCR. i will try with other combination tomorrowDeveloper response
posted 10 months agoThanks for the review!
PS: I see that you report that tesseract is failing when running the OCR. Have you installed tesseract itself, i do not (not even sure if i can) ship that with my tool.
In case you have feel free to open an issue on github (either the main tool or the tesseract plugin and i will try to work it out)
As of now the models that i was able to find that are freely available are not great for OCRing chinese.
I started developing a plugin for using PaddleOCR that gives much better results, but i was getting inconsistent behavior between Linux and Windows (crashes inside the code of PaddleOCR) with different number of version working on one but not the other. Have yet to decide if letting users just install it themselves (using a version that works for them) is a better solution or try to fix this/wait for an update.
Contributions even in the form of good models that people finds on HuggingFace, are welcome ;P
Haven't had much time to work on it lately
- PaddleOCR main repo https://github.com/PaddlePaddle/PaddleOCR
- WIP plugin for who want to try and install it manually https://github.com/Crivella/ocr_translate-paddle - Rated 5 out of 5by Firefox user 18271254, a year agoExcellent, can be improved if the option to change font color can be provided, red is not legible when night light mode is ON in Windows.
EDIT: I tried the render options as suggested and it works beautifully, thank you. I didn't realize there was a pre-compiled .exe available in the released section on Github until today, you've actually made it super easy to setup for non-tech savvy folks as well, kudos! For users who have come this far, these are the settings I use for Japanese to English translations and it's fast.
Box model - easyocr
OCR Model - kha-white/manga-ocr-base
Translation model - staka/fugumt-ja-enDeveloper response
posted a year agoThanks for the review!
The font size and color (plus other rendering options) can be controlled by clicking on the "Render options" menu at the bottom of the popup window and changing the values.
I left it as numeric RGB values (but could be improved) because unfortunately i could not make the default HTML color picker work with the popup menu (opening the color picker would close the popup) and did not want to introduce another 3rd party JS dependency.