Speech Recognition Polyfill 作者: apersongithub
Allows speech recognition in websites such as Google Translate, Duolingo, etc. using OpenAI's Whisper API locally.
實驗中實驗中
擴充套件後設資料
畫面擷圖
關於此擴充套件
On first install this extension will open the options page, the default model language is English but this is changeable. This extension allows per-site customization and a multitude of different models to decipher language. Keep in mind that this is not a complete solution and the API doesn't have full support. Speech Detection is not as instantaneous like in Google Chrome's API which is sent to a server but it is manageable. The extension icon color/indicator changes depending on the process so pin it to your menu to verify the extension is working as intended. A red mic/error icon does not necessarily mean your mic isn't working but rather the speech may have been cancelled by user input or that it is unintelligible (usually its the latter).
Make sure you are using the correct mic and speak loud, slow, and clear otherwise your voice may not be detected or unintelligble. Change the default model to the larger ones if you experience problems with voice recognition (this WILL impair performance). The extension does not support continuous recording.
If you're using Duolingo or similar and are trying to do the speaking practice of the language that you are learning, set the language in the extension to the one you are learning (navigate to the site -> click extension icon -> set language then click "save for site"). This will help with speech detection since the model now knows what language you are trying to speak. (This isn't necessary for every site, one example is google translate which has the auto-language implementation) Look at the images for more help.
The extension will take ~1GB of ram on normal use. Remember it is using a local AI Model so, I've implemented decent memory management to compensate.
~~~~~~~~~~~~~~~~~~~~
❗ General Recommendations:
• 8GB of RAM is a minimum requirement since it could easily take up to 3GB when utilizing larger models.
• A modern CPU is recommended.
• An internet connection. Even though the model runs locally, the extension re-downloads it either when idle or after closing the tab/opening a new one that utilizes the extension (for memory preservation purposes). This is ultimately better than packaging the large models within the extension for the time being and for most models, the download speed will be near instant for the general population.
Make sure you are using the correct mic and speak loud, slow, and clear otherwise your voice may not be detected or unintelligble. Change the default model to the larger ones if you experience problems with voice recognition (this WILL impair performance). The extension does not support continuous recording.
If you're using Duolingo or similar and are trying to do the speaking practice of the language that you are learning, set the language in the extension to the one you are learning (navigate to the site -> click extension icon -> set language then click "save for site"). This will help with speech detection since the model now knows what language you are trying to speak. (This isn't necessary for every site, one example is google translate which has the auto-language implementation) Look at the images for more help.
The extension will take ~1GB of ram on normal use. Remember it is using a local AI Model so, I've implemented decent memory management to compensate.
~~~~~~~~~~~~~~~~~~~~
❗ General Recommendations:
• 8GB of RAM is a minimum requirement since it could easily take up to 3GB when utilizing larger models.
• A modern CPU is recommended.
• An internet connection. Even though the model runs locally, the extension re-downloads it either when idle or after closing the tab/opening a new one that utilizes the extension (for memory preservation purposes). This is ultimately better than packaging the large models within the extension for the time being and for most models, the download speed will be near instant for the general population.
由 1 位評論者給出 0 分
權限與資料
更多資訊