OCR + Translate 的评价
OCR + Translate 作者: Crivella
greenlime 的评价
评分 5 / 5
来自 greenlime,7 个月前this work great for japanese to english.
but i can't make it work with chinese to english. is there any specific combination? i try to use tesseract and helsinki NLP. it is failed when running the OCR. i will try with other combination tomorrow
but i can't make it work with chinese to english. is there any specific combination? i try to use tesseract and helsinki NLP. it is failed when running the OCR. i will try with other combination tomorrow
开发者回应
发布于 7 个月前Thanks for the review!
PS: I see that you report that tesseract is failing when running the OCR. Have you installed tesseract itself, i do not (not even sure if i can) ship that with my tool.
In case you have feel free to open an issue on github (either the main tool or the tesseract plugin and i will try to work it out)
As of now the models that i was able to find that are freely available are not great for OCRing chinese.
I started developing a plugin for using PaddleOCR that gives much better results, but i was getting inconsistent behavior between Linux and Windows (crashes inside the code of PaddleOCR) with different number of version working on one but not the other. Have yet to decide if letting users just install it themselves (using a version that works for them) is a better solution or try to fix this/wait for an update.
Contributions even in the form of good models that people finds on HuggingFace, are welcome ;P
Haven't had much time to work on it lately
- PaddleOCR main repo https://github.com/PaddlePaddle/PaddleOCR
- WIP plugin for who want to try and install it manually https://github.com/Crivella/ocr_translate-paddle
PS: I see that you report that tesseract is failing when running the OCR. Have you installed tesseract itself, i do not (not even sure if i can) ship that with my tool.
In case you have feel free to open an issue on github (either the main tool or the tesseract plugin and i will try to work it out)
As of now the models that i was able to find that are freely available are not great for OCRing chinese.
I started developing a plugin for using PaddleOCR that gives much better results, but i was getting inconsistent behavior between Linux and Windows (crashes inside the code of PaddleOCR) with different number of version working on one but not the other. Have yet to decide if letting users just install it themselves (using a version that works for them) is a better solution or try to fix this/wait for an update.
Contributions even in the form of good models that people finds on HuggingFace, are welcome ;P
Haven't had much time to work on it lately
- PaddleOCR main repo https://github.com/PaddlePaddle/PaddleOCR
- WIP plugin for who want to try and install it manually https://github.com/Crivella/ocr_translate-paddle
4 条评价
开发者回应
发布于 2 个月前From the addon page there are link to to the HOMEPAGE (addon github page) which will have some demos for the extension and links to the server github page with explanations both in the README and in a dedicated documentation page.
I've added those links in a more visible way in the addon page- 评分 4 / 5来自 Devil304,2 个月前No OCR model and no BOX model to be selected.
Translation models can be installed from plugins panel.
Windows 11, python 3.11.9
Thank you for clearing this.
Yes, pls add information about models to documentation, I tried to install google trans but it showed error, then I installed ollama and I didn't know I needed to install easy ocr and hugging face.开发者回应
发布于 2 个月前Thanks for trying the tool.
Would you mind opening an issue on the github repo (see the homepage for the addon as i cannot add links here) to see if we can get it fixed (the reviews for the addon ins hardly the proper venue for it)?
On the top of my head the only thing that comes to mind is which plugins have been installed. Keep in mind that specific plugins add specific type of models (EG for having OCR and BOX models you might want to have huggingface, and possibly also easyocr). Will try to make it clearer in the doc which plugins adds which kind of models.
There is also the option of using the older v0.5.1 release for the server which does not have the plugin manager but ships with all the deps pre-installed/packaged (there will be some features missing though) - 评分 5 / 5来自 Firefox 用户 18271254,8 个月前Excellent, can be improved if the option to change font color can be provided, red is not legible when night light mode is ON in Windows.
EDIT: I tried the render options as suggested and it works beautifully, thank you. I didn't realize there was a pre-compiled .exe available in the released section on Github until today, you've actually made it super easy to setup for non-tech savvy folks as well, kudos! For users who have come this far, these are the settings I use for Japanese to English translations and it's fast.
Box model - easyocr
OCR Model - kha-white/manga-ocr-base
Translation model - staka/fugumt-ja-en开发者回应
发布于 8 个月前Thanks for the review!
The font size and color (plus other rendering options) can be controlled by clicking on the "Render options" menu at the bottom of the popup window and changing the values.
I left it as numeric RGB values (but could be improved) because unfortunately i could not make the default HTML color picker work with the popup menu (opening the color picker would close the popup) and did not want to introduce another 3rd party JS dependency.