Releases · k2-fsa/sherpa-onnx

22 Oct 04:16

csukuangfj

v1.12.15

b68e69d

v1.12.15 Latest

Latest

What's Changed

exposing online punctuation model support in node-addon-api by @colourmebrad in #2609
Fix building wheels by @csukuangfj in #2619
Export one more Piper Arabic TTS model by @csukuangfj in #2623
fix: hot update language for sencevoice by @o-u-p in #2627
develop zipvoice c-api by @BarryKCL in #2628
Add CI tests for Zipvoice Go API by @csukuangfj in #2630
Remove hardcoded dithering value in NeMo transducer recognizer by @vsd-vector in #2639
Reduce verbose output about reading lexicon for TTS by @csukuangfj in #2648
Add Parakeet TDT model for generating subtitles by @csukuangfj in #2649
Add more Piper TTS models by @csukuangfj in #2651
Add CXX API for audio tagging by @csukuangfj in #2652
Add C# API for audio tagging by @csukuangfj in #2653
Support KWS + RKNN. by @csukuangfj in #2190
Support https://github.com/ASLP-lab/WenetSpeech-Chuan by @csukuangfj in #2656
Fix building for android by @csukuangfj in #2657
Fix ios build script by @vovasty in #2645
Update kaldi-native-fbank by @csukuangfj in #2659
Add missing python class definitions for builds without TTS support by @ius in #2660
Refactor JNI by @csukuangfj in #2661
Remove jieba from kokoro and matcha tts. by @csukuangfj in #2662
add flet_sherpa_onnx in readme by @SamYuan1990 in #2663
Remove cppjieba by @csukuangfj in #2664
Add phrase matcher to merge words into phrases for TTS. by @csukuangfj in #2668
Limit number of tokens per sentence in MatchaTTS. by @csukuangfj in #2671
Update README to include a ROS2 project using sherpa-onnx by @csukuangfj in #2672
Fix building Flutter APPs by @csukuangfj in #2673
Export Paraformer to RKNN by @csukuangfj in #2689
Update README.md add achatbot-go Projects using sherpa-onnx link by @weedge in #2691
Add CI to export Paraformer to RKNN by @csukuangfj in #2692
Support MatchTTS with English and Chinese by @csukuangfj in #2695
Export Paraformer ASR models from FunASR to Ascend NPU 910B by @csukuangfj in #2697
Update README to include Ascend NPU by @csukuangfj in #2698
Fix WASM (JS) after adding zipvoice. by @csukuangfj in #2702
Export SenseVoice ASR models to Ascend NPU 910B by @csukuangfj in #2707
Fix building for various language bindings after adding zipvoice by @csukuangfj in #2709
Release v1.12.15 by @csukuangfj in #2710

New Contributors

@colourmebrad made their first contribution in #2609
@o-u-p made their first contribution in #2627
@BarryKCL made their first contribution in #2628
@vovasty made their first contribution in #2645
@ius made their first contribution in #2660
@SamYuan1990 made their first contribution in #2663
@weedge made their first contribution in #2691

Full Changelog: v1.12.14...v1.12.15

Contributors

vovasty, ius, and 7 other contributors

Assets 84

checksum.txt

sha256:635203184a72492288e7424492f491ed047bb3f8354e7d2683dd6b3c21a5543b

9.27 KB 2025-10-27T01:34:13Z
sherpa-onnx-1.12.15-rknn.aar

sha256:d658d076384e22e8e601e796935892664161db745ce6c79e504790d156cc1eb9

20.2 MB 2025-10-22T08:49:39Z
sherpa-onnx-1.12.15.aar

sha256:6994c53380846107b0363fee02fa9db6176c61c746a2315893c6b01e7f6d4c1d

37 MB 2025-10-22T10:41:36Z
sherpa-onnx-native-lib-linux-aarch64-v1.12.15.jar

sha256:b86174d137d52b111cdba144e6f467e668cdb25672d0a2d7ad4d51a7db9dabae

6.34 MB 2025-10-24T03:05:13Z
sherpa-onnx-native-lib-linux-x64-v1.12.15.jar

sha256:2395e52a7b208ef6f4539568e8c83c0f247cca77feaabfe7aab3ba64bbf75cc9

7.23 MB 2025-10-24T03:05:20Z
sherpa-onnx-native-lib-osx-aarch64-v1.12.15.jar

sha256:9511ef77288e52c7f308ab84b36ab1431b6ee58de50e4dc76b4ecbe55940d8da

7.4 MB 2025-10-24T03:06:31Z
sherpa-onnx-native-lib-osx-x64-v1.12.15.jar

sha256:f4f3d6d1f1c1db1a94153e900f42d8eb6cb2e9a1e4c6661318ecdf06ffee0654

8.31 MB 2025-10-24T03:05:52Z
sherpa-onnx-native-lib-win-x64-v1.12.15.jar

sha256:320576af96bcf3cb799b02810ac1fb3f2f9a1949a16101a4d5fe05caa532068d

5.46 MB 2025-10-24T03:05:45Z
sherpa-onnx-non-streaming-asr-x64-v1.12.15.exe

sha256:ac442ae6c5f3b8ee1a870d2a876040ea81aa73086cf3a521da1327b106a43b14

17.5 MB 2025-10-24T04:36:20Z
sherpa-onnx-non-streaming-asr-x86-v1.12.15.exe

sha256:f2b65089cbad5095d5338a9f585d46c2ad0d52fb88986882735f42f990d20ef1

14.9 MB 2025-10-24T04:39:40Z
Source code (zip)

2025-10-22T04:11:25Z
Source code (tar.gz)

2025-10-22T04:11:25Z

18 Sep 07:13

github-actions

v1.12.14

26aa2fa

v1.12.14

What's Changed

Fix setting rknn core mask by @csukuangfj in #2594
Add Dart API for spoken language identification by @kibsoft in #2596
Add CI tests for dart spoken language identifcation example by @csukuangfj in #2598
Provide pre-compiled shepra-onnx libs/binaries for CUDA 12.x + onnxruntime 1.22.0 by @csukuangfj in #2599
Provide pre-compiled whls for cuda 12.x on Linux x64 and Windows x64 by @csukuangfj in #2601
Fix TDT decoding for NeMo TDT transducers by @csukuangfj in #2606
Add a C++ example for simulated streaming ASR by @csukuangfj in #2607
Release v1.12.14 by @csukuangfj in #2608

New Contributors

@kibsoft made their first contribution in #2596

Full Changelog: v1.12.13...v1.12.14

Contributors

kibsoft and csukuangfj

Assets 72

12 Sep 08:10

github-actions

v1.12.13

32c248b

v1.12.13

What's Changed

Fix initializing symbol table for OnlineRecognizer. by @csukuangfj in #2590
Support RK NPU for SenseVoice non-streaming ASR models by @csukuangfj in #2589
Upload RKNN models for sense-voice by @csukuangfj in #2592
Release v1.12.13 by @csukuangfj in #2593

Full Changelog: v1.12.12...v1.12.13

Contributors

csukuangfj

Assets 76

10 Sep 14:59

github-actions

v1.12.12

04a98ca

v1.12.12

What's Changed

Fix building for risc-v by @csukuangfj in #2549
Fix using sherpa-onnx as a cmake sub-project. by @csukuangfj in #2550
Update kaldifst and kaldi-decoder by @csukuangfj in #2551
Bug fix-修复armv8l架构下的Android系统调用java-api时启动异常 by @fengin in #2556
Disable loading libs from jar on Android. by @csukuangfj in #2557
Fix cantonese vits tts by @csukuangfj in #2558
Avoid appending blanks for Cantonese vits tts. by @csukuangfj in #2559
Add hint for loading model files from SD card on Android. by @csukuangfj in #2564
Update README to include https://github.com/Mentra-Community/MentraOS by @csukuangfj in #2565
Export models from https://github.com/voicekit-team/T-one to sherpa-onnx by @csukuangfj in #2571
Add C++ and Python support for T-one streaming Russian ASR models by @csukuangfj in #2575
Add various language bindings for streaming T-one Russian ASR models by @csukuangfj in #2576
Fix the missing online punctuation in android aar by @csukuangfj in #2577
Export KittenTTS mini v0.1 to sherpa-onnx by @csukuangfj in #2578
Upload new sense-voice models by @csukuangfj in #2580
Export ASLP-lab/WSYue-ASR/tree/main/u2pp_conformer_yue to sherpa-onnx by @csukuangfj in #2582
Add various languge bindings for Wenet non-streaming CTC models by @csukuangfj in #2584
Release v1.12.12 by @csukuangfj in #2586

New Contributors

@fengin made their first contribution in #2556

Full Changelog: v1.12.11...v1.12.12

Contributors

csukuangfj and fengin

Assets 80

01 Sep 06:17

github-actions

v1.12.11

a9187d5

v1.12.11

What's Changed

Add two more Piper tts models by @csukuangfj in #2525
Generate tts samples for MatchaTTS (English). by @csukuangfj in #2527
Fix releasing go packages by @csukuangfj in #2529
Add license info about tts models from OpenVoiceOS by @csukuangfj in #2530
Support BPE models with byte fallback. by @csukuangfj in #2531
Simplify the usage of our non-Android Java API by @csukuangfj in #2533
Fix wasm for kws by @csukuangfj in #2535
Add one more German tts model from OpenVoiceOS. by @csukuangfj in #2536
Fix uploading win32 libs to huggingface by @csukuangfj in #2537
Add Zipvoice by @pkufool in #2487
Fix c api by @csukuangfj in #2545
Fix linking by @csukuangfj in #2546
Release 1.12.11 by @csukuangfj in #2547

Full Changelog: v1.12.10...v1.12.11

Contributors

csukuangfj and pkufool

Assets 82

25 Aug 04:06

github-actions

v1.12.10

6b1fbde

v1.12.10

What's Changed

Add VOSK streaming Russian ASR models and Kroko streaming German ASR models by @csukuangfj in #2502
Refactor CI tests by @csukuangfj in #2504
Update APK versions by @csukuangfj in #2505
Export whisper distil-large-v3 and distil-large-v3.5 to sherpa-onnx by @csukuangfj in #2506
Support specifying pronunciations of phrases in Chinese TTS. by @csukuangfj in #2507
fix(flutter): fix unicode problem in windows path by @twmht in #2508
feat: add punctuation C++ API by @Aruxxxi in #2510
fix ctrl+c may lead to coredump by @yangjuncode in #2511
Add kitten tts nano v0.2 by @csukuangfj in #2512
Scripts to generate tts samples by @csukuangfj in #2513
Add tdt duration to APIs by @bradmurray-dt in #2514
Support 16KB page size for Android by @csukuangfj in #2520
Split sherpa-onnx Python package by @csukuangfj in #2521
Fix kokoro tts for punctuations by @csukuangfj in #2522
Release v1.12.10 by @csukuangfj in #2523

New Contributors

@Aruxxxi made their first contribution in #2510
@yangjuncode made their first contribution in #2511
@bradmurray-dt made their first contribution in #2514

Full Changelog: v1.12.9...v1.12.10

Contributors

twmht, csukuangfj, and 3 other contributors

Assets 79

16 Aug 11:23

github-actions

v1.12.9

af3e1d6

v1.12.9

What's Changed

Add more piper tts models by @csukuangfj in #2480
Fix ASR for UE by @PeiloViolet in #2483
push to maven center by @litongjava in #2463
Specify ABIs when building APKs by @csukuangfj in #2488
Add more debug info for vits tts by @csukuangfj in #2491
Add Swift API for computing speaker embeddings by @csukuangfj in #2492
Alex/feat add python example by @AlexWhisper in #2490
Support TDT transducer decoding by @csukuangfj in #2495
Fix java test by @csukuangfj in #2496
Refactor Swift API by @csukuangfj in #2493
add TtsReader app to README.md by @ys-pro-duction in #2498
Export https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3 to sherpa-onnx by @csukuangfj in #2500
Fix building apk by @csukuangfj in #2499
Release v1.12.9 by @csukuangfj in #2501

New Contributors

@PeiloViolet made their first contribution in #2483
@litongjava made their first contribution in #2463
@AlexWhisper made their first contribution in #2490
@ys-pro-duction made their first contribution in #2498

Full Changelog: v1.12.8...v1.12.9

Contributors

csukuangfj, litongjava, and 3 other contributors

Assets 88

08 Aug 12:47

csukuangfj

v1.12.8

d819095

v1.12.8

What's Changed

Expose JNI to compute probability of chunk in VAD by @twmht in #2433
Support models from https://huggingface.co/Banafo/Kroko-ASR by @csukuangfj in #2453
Add APIs for Online NeMo CTC models by @csukuangfj in #2454
Export https://github.com/KittenML/KittenTTS to sherpa-onnx by @csukuangfj in #2456
Fix punctuations in kokoro tts. by @csukuangfj in #2458
Limit number of tokens in fire red asr decoding. by @csukuangfj in #2459
Add C++ runtime for kitten-tts by @csukuangfj in #2460
Add Kotlin and Java API for KittenTTS by @csukuangfj in #2461
Add Android TTS Engine APK for KittenTTS by @csukuangfj in #2465
Add Python API for KittenTTS. by @csukuangfj in #2466
Add C API for KittenTTS by @csukuangfj in #2467
Add CXX API for KittenTTS by @csukuangfj in #2469
Add JavaScript API (node-addon) for KittenTTS by @csukuangfj in #2470
Add JavaScript API (WebAssembly) for KittenTTS by @csukuangfj in #2471
Add Pascal API for KittenTTS by @csukuangfj in #2474
Add Dart API for KittenTTS by @csukuangfj in #2475
Add Swift API for KittenTTS by @csukuangfj in #2476
Add C# API for KittenTTS by @csukuangfj in #2477
Add Go API for KittenTTS by @csukuangfj in #2478
Release v1.12.8 by @csukuangfj in #2479

New Contributors

@twmht made their first contribution in #2433

Full Changelog: v1.12.7...v1.12.8

Contributors

twmht and csukuangfj

Assets 89

27 Jul 17:14

csukuangfj

v1.12.7

847a3e7

v1.12.7

What's Changed

Support Portuguese and German ASR models from NeMo by @csukuangfj in #2394
Support returning the current speech segment for VAD. by @csukuangfj in #2397
Add more piper tts polish models by @csukuangfj in #2403
Support VAD+ASR for WearOS by @csukuangfj in #2404
support test long audio with streaming-model & vad by @pingfengluo in #2405
fix typo in sherpa-onnx-vad-with-online-asr.cc by @pingfengluo in #2407
add tail padding for sherpa-onnx-vad-with-online-asr by @pingfengluo in #2408
Add more French TTS models by @csukuangfj in #2424
Add more piper tts models by @csukuangfj in #2425
Implement max_symbols_per_frame for GigaAM2 by @nshmyrev in #2423
Fix GigaAM transducer encoder output length data type by @csukuangfj in #2426
Add friendly log messages for Android and HarmonyOS TTS users. by @csukuangfj in #2427
fix setgraph in OnlineCtcFstDecoderConfig Java api by @njsyw1997 in #2411
Release v1.12.7 by @csukuangfj in #2428

New Contributors

@njsyw1997 made their first contribution in #2411

Full Changelog: v1.12.6...v1.12.7

Contributors

pingfengluo, nshmyrev, and 2 other contributors

Assets 90

12 Jul 14:15

github-actions

v1.12.6

27098a0

v1.12.6

What's Changed

Support silero-vad v4 exported by k2-fsa by @csukuangfj in #2372
Support ten vad by @csukuangfj in #2377
Fix compile errors for Linux by @csukuangfj in #2378
Add C API for ten-vad by @csukuangfj in #2379
Add CXX API examples for ten-vad. by @csukuangfj in #2380
Add JavaScript (WebAssembly) API for ten-vad by @csukuangfj in #2382
Add JavaScript (node-addon) API for ten-vad by @csukuangfj in #2383
Add Go API for ten-vad by @csukuangfj in #2384
Add C# API for ten-vad by @csukuangfj in #2385
Add Dart API for ten-vad by @csukuangfj in #2386
Add Swift API for ten-vad by @csukuangfj in #2387
Add Pascal API for ten-vad by @csukuangfj in #2388
Add Java/Kotlin API and Android support for ten-vad by @csukuangfj in #2389
Release v1.12.6 by @csukuangfj in #2390

Full Changelog: v1.12.5...v1.12.6

Contributors

csukuangfj

Assets 90

Releases: k2-fsa/sherpa-onnx

v1.12.15

What's Changed

New Contributors

Contributors

Uh oh!

v1.12.14

What's Changed

New Contributors

Contributors

Uh oh!

v1.12.13

What's Changed

Contributors

Uh oh!

v1.12.12

What's Changed

New Contributors

Contributors

Uh oh!

v1.12.11

What's Changed

Contributors

Uh oh!

v1.12.10

What's Changed

New Contributors

Contributors

Uh oh!

v1.12.9

What's Changed

New Contributors

Contributors

Uh oh!

v1.12.8

What's Changed

New Contributors

Contributors

Uh oh!

v1.12.7

What's Changed

New Contributors

Contributors

Uh oh!

v1.12.6

What's Changed

Contributors

Uh oh!