Speech Recognition Polyfill version history - 14 versions
Speech Recognition Polyfill by apersongithub
Be careful with old versions! These versions are displayed for testing and reference purposes. You should always use the latest version of an add-on.
Latest version
Version 1.5.0
Released Feb 14, 2026 - 121.67 KB
Works with Firefox 58.0 and higher
🎊 v1.5 Final Release:
- Continuous Speech (Long awaited)
- Realtime Audio AssemblyAI Streaming (Server, very fast)
- Realtime Audio Vosk Streaming (On device, very fast)
- Less cramped developer options and other options in general
- Customizable grace window per site and a few extra options
- Ability to disable processing time out
- A lot more
Source code released under Mozilla Public License 2.0
Older versions
Version 1.4.0
Released Feb 10, 2026 - 97.91 KB
Works with Firefox 58.0 and higher
New:
- Removed speech/word limit
- Fixed WASM being used intermittently despite having WebGPU supported
- Fixed issues with indicators on the microphone icon for the most part
- Added the ability to cancel speech (double tap) while it's being processed
- Made it easier to restart your speech recording
- Google Docs/Slides now supports your custom keybind!
- Added scrollbar to overrides list
- Several New Developer Options (All can be exported)
- Customizable Mic Gain
- Customizable Silence Sensitivity
- Ability to hide warning message
- Separated debug mode and toast notifications
Source code released under Mozilla Public License 2.0
Version 1.3.3
Released Feb 9, 2026 - 96.61 KB
Works with Firefox 58.0 and higher
- Fixed speech recognition desync by waiting for MediaRecorder.onstart
- Example: the microphone would sometimes NOT activate during Duolingo speaking practice
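The idea behind this fix can be sketched roughly as follows. This is a minimal illustration and not the extension's actual code: `startRecorderThenNotify` and `notifyStarted` are hypothetical names, and the only standard pieces assumed are `MediaRecorder.start()` and the recorder's `start` event.

```javascript
// Hypothetical helper (not from the extension's source): tell the page that
// recognition has started only after the recorder reports it is capturing.
function startRecorderThenNotify(recorder, notifyStarted) {
  // Register the listener before calling start() so the event cannot be missed.
  // { once: true } removes the listener after the first "start" event.
  recorder.addEventListener("start", () => notifyStarted(), { once: true });
  recorder.start();
}
```

Notifying the page right after calling `start()`, without waiting for the event, is presumably what allowed the page and the recorder to fall out of sync.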
Source code released under Mozilla Public License 2.0
Version 1.3.2
Released Feb 9, 2026 - 95.44 KB
Works with Firefox 58.0 and higher
- Fixed duplicated speech on sites like speechnotes.co, voicetotext.org, etc.
Source code released under Mozilla Public License 2.0
Version 1.3.1
Released Feb 8, 2026 - 94.43 KB
Works with Firefox 58.0 and higher
- Fixed visual glitch with dev options
Source code released under Mozilla Public License 2.0
Version 1.3
Released Feb 8, 2026 - 94.55 KB
Works with Firefox 58.0 and higher
- Added WebGPU support for faster transcription (default)
- Added a pipeline indicator
Source code released under Mozilla Public License 2.0
Version 1.2
Released Feb 8, 2026 - 86.57 KB
Works with Firefox 58.0 and higher
- Fixed invalid links
Source code released under Mozilla Public License 2.0
Version 1.1
Released Feb 8, 2026 - 86.49 KB
Works with Firefox 58.0 and higher
- Fixed typos
- Added back opening on install
Source code released under Mozilla Public License 2.0
Version 1.0
Released Feb 8, 2026 - 86.42 KB
Works with Firefox 58.0 and higher
❗ FULL RELEASE
- fixed Google Docs glitch for good
- fixed stale session
- fixed not being able to say single words
- rewrote some parts of background.js
- Made base-multilingual the default model
- moved distil-medium out of experimental
- fixed compatibility on most other sites
- a few new developer mode options (such as microphone gain!)
Note: You may see the words Provider and Engine. They are interchangeable.
Source code released under Mozilla Public License 2.0
Version 0.3.3
Released Jan 18, 2026 - 78.21 KB
Works with Firefox 58.0 and higher
Fixes:
Inability to type in Google Docs or similar canvas-based sites within the Google domain.
Known Issues:
1. Using Google Docs with the Speech to Text keybind currently does not work (mitigation: use Google's built-in voice typing instead). This will be fixed in the next build.
2. A stale session (or a constant unintelligible/red mic icon) sometimes occurs after a model is switched. This can be mitigated by reloading the site and will also be fixed in the next build.
✅ Guaranteed: Future Roadmap
1. Merge the keybind that opens Google Docs' built-in voice typing with the extension's keybind.
1.1. I might just add an option to merge the keybind yourself for any website, so it doesn't have to be done and updated manually by me each time, but I'll add keybind merges for the major sites like Google Docs and Microsoft Word as defaults in the extension.
2. Cancel/stop all voice session ids if the model is changed. This should fix red mic/stale session issues without having to reload the site.
3. Add an option to disable the extension for a specific site.
4. Move distil-medium out of the experimental category since it's decent now.
5. Make base-multilingual the default model (tiny multilingual is poor at English transcription, and since the majority of users will be English speakers, they may think it's not working).
6. All of this should come at the same time with the full release of v1.0
❓ Not Guaranteed: Future Roadmap
1. Adding continuous speech recognition (probably unlikely, since it would be very hacky and annoying to deal with).
1.1. Currently, you can just click the mic again after speaking, and the extension already records your audio until you finish talking anyway, so I don't see the point. It would probably only work well with the cloud API.
2. Fix the extension not detecting voice if the user says only a single word, and make it work properly in Mozilla's Web Speech color test.
3. If none of the above ships with the v1.0 full release, it probably won't ship at all and was likely cancelled to maintain the current stability of the extension.
Source code released under Mozilla Public License 2.0
Version 0.3.2
Released Jan 18, 2026 - 77.49 KB
Works with Firefox 58.0 and higher
Added the ability to export and import settings.
Added the ability to edit sites by clicking on them and clearing inputs for the sites lists.
Added auto-updating of site lists on options page.
Added a drop-down for site lists on the options page.
Source code released under Mozilla Public License 2.0
Version 0.3.1
Released Jan 18, 2026 - 73.96 KB
Works with Firefox 58.0 and higher
TLDR: less memory bloat, better per-tab status, fewer stuck transcriptions, a hotkey, and a more reliable “Enter/submit” after dictation.
Source code released under Mozilla Public License 2.0
Version 0.3.0
Released Jan 17, 2026 - 58.54 KB
Works with Firefox 58.0 and higher
Enhancements
- Dual engine support: choose on-device Whisper or cloud AssemblyAI; store API key; optional prefetch/cache of default model.
- Adaptive VAD and stop logic: smarter noise floor, early “no speech” stop, optional 5s hard cap toggle, grace-window toggle.
- Per-site overrides include engine selection (plus model/language/timeout); popup auto-hides model when cloud engine is chosen and auto-saves changes.
- Options UI: engine card, dev toggles (hide model sections, hide favicons, hard cap, grace, cache model), “Save all” + factory reset; options auto-unload when idle.
- Action icon UX: download/cache/done/cancel badges; cached badge on reuse; brief held error state.
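As a rough illustration of the adaptive VAD and stop logic listed above, a stop decision could be sketched like this. It is not the extension's implementation: the class name, every threshold, and the reading of "grace window" as the amount of trailing silence tolerated after speech are assumptions made for the example; only the 5 s hard cap comes from the notes.

```javascript
// Illustrative sketch only; names and default values are assumptions.
class AdaptiveVad {
  constructor({
    ratio = 3,          // a frame counts as speech if louder than floor * ratio
    alpha = 0.05,       // smoothing factor for the noise-floor estimate
    minFloor = 0.001,   // keeps the threshold from collapsing to zero
    noSpeechMs = 3000,  // early stop if nothing was said by then
    graceMs = 800,      // assumed meaning: trailing silence allowed after speech
    hardCapMs = 5000,   // the optional 5 s hard cap
    useHardCap = true,
  } = {}) {
    Object.assign(this, { ratio, alpha, minFloor, noSpeechMs, graceMs, hardCapMs, useHardCap });
    this.noiseFloor = null;
    this.elapsedMs = 0;
    this.silenceMs = 0;
    this.heardSpeech = false;
  }

  // Feed one frame's RMS level and duration; returns "continue" or a stop reason.
  push(rms, frameMs) {
    this.elapsedMs += frameMs;
    if (this.noiseFloor === null) this.noiseFloor = rms;
    const threshold = Math.max(this.noiseFloor, this.minFloor) * this.ratio;
    if (rms > threshold) {
      this.heardSpeech = true;
      this.silenceMs = 0;
    } else {
      // Only quiet frames move the floor, so speech does not raise it.
      this.noiseFloor += this.alpha * (rms - this.noiseFloor);
      this.silenceMs += frameMs;
    }
    if (this.useHardCap && this.elapsedMs >= this.hardCapMs) return "hard-cap";
    if (!this.heardSpeech && this.elapsedMs >= this.noSpeechMs) return "no-speech";
    if (this.heardSpeech && this.silenceMs >= this.graceMs) return "end-of-speech";
    return "continue";
  }
}
```

A caller would compute the RMS of each audio frame, pass it to `push`, and stop recording as soon as the result is anything other than `"continue"`.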
Fixes
- Better cancel/stale-session handling and tighter processing timeout to avoid hangs.
- Stricter handling of silent/too-short/pathological audio as “no audio/unintelligible,” reducing bad results.
- Processing fallback timer and GC/reload behavior to prevent long-running or stuck states.
- Basically, it's less likely to break.
Source code released under Mozilla Public License 2.0
Version 0.2.3
Released Jan 13, 2026 - 44.94 KB
Works with Firefox 58.0 and higher
Updated Garbage Collector (GC):
Idle memory usage is now ~4 MB (after the cache is cleared, see below), down from the original 1 GB.
The extension clears its cache after 30 seconds of idle time, and also after the tab is closed or navigates to a different site.
Source code released under Mozilla Public License 2.0
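The 30-second idle clearing described for version 0.2.3 could look roughly like the sketch below. It is an assumption-laden illustration rather than the extension's code: `IdleCache` and its methods are hypothetical names, and the timer functions are injected only so the behaviour can be exercised without waiting.

```javascript
// Hypothetical sketch: a cache that empties itself after a period of inactivity.
class IdleCache {
  constructor(
    idleMs = 30000, // 30 seconds, as stated in the release notes
    timers = {
      set: (fn, ms) => setTimeout(fn, ms),
      clear: (id) => clearTimeout(id),
    }
  ) {
    this.idleMs = idleMs;
    this.timers = timers;
    this.entries = new Map();
    this.timerId = null;
  }

  set(key, value) {
    this.entries.set(key, value);
    this.touch();
  }

  get(key) {
    this.touch();
    return this.entries.get(key);
  }

  // Any activity restarts the idle countdown.
  touch() {
    if (this.timerId !== null) this.timers.clear(this.timerId);
    this.timerId = this.timers.set(() => this.clear(), this.idleMs);
  }

  // Drops everything; could also be called when a tab closes or navigates.
  clear() {
    if (this.timerId !== null) this.timers.clear(this.timerId);
    this.timerId = null;
    this.entries.clear();
  }
}
```

In a WebExtension, the same `clear()` call could plausibly be wired to events such as `tabs.onRemoved` to cover the tab-close case; whether the extension does it this way is not stated in the notes.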