mirror of https://github.com/modelscope/FunASR synced 2025-09-15 14:48:36 +08:00

History

zhifu gao edec2fe85e Update SDK_tutorial_zh.md		2023-07-03 15:08:11 +08:00
..
csharp	add c# assembly for fsmn vad (#650 )	2023-06-21 09:31:59 +08:00
deploy_tools	[funasr-runtime-deploy 0.0.4] bug fix (#698 )	2023-07-02 22:16:53 +08:00
docs	Update SDK_tutorial_zh.md	2023-07-03 15:08:11 +08:00
grpc	fix paraformer-server for new apis	2023-05-10 16:52:46 +08:00
html5	h5 docs	2023-06-29 13:31:21 +08:00
java	Java ws client support (#651 )	2023-06-19 12:24:27 +08:00
onnxruntime	mv run_server.sh	2023-06-30 12:35:34 +08:00
python	docs	2023-07-02 10:56:23 +08:00
ssl_key	update server.key	2023-06-18 15:21:16 +08:00
triton_gpu	update setup (#686 )	2023-06-29 16:30:39 +08:00
websocket	export	2023-06-29 20:18:29 +08:00
__init__.py	export model	2023-02-13 17:43:01 +08:00
readme_cn.md	Update readme_cn.md	2023-07-03 15:06:18 +08:00
readme.md	docs	2023-06-30 12:00:25 +08:00
run_server.sh	Update run_server.sh	2023-06-30 12:53:23 +08:00

readme.md

FunASR runtime-SDK

中文文档（点击此处）

FunASR is a speech recognition framework developed by the Speech Lab of DAMO Academy, which integrates industrial-level models in the fields of speech endpoint detection, speech recognition, punctuation segmentation, and more. It has attracted many developers to participate in experiencing and developing. To solve the last mile of industrial landing and integrate models into business, we have developed the FunASR runtime-SDK. The SDK supports several service deployments, including:

File transcription service, Mandarin, CPU version, done
File transcription service, Mandarin, GPU version, in progress
File transcription service, English, in progress
Streaming speech recognition service, is in progress
and more.

File Transcription Service, Mandarin (CPU)

Currently, the FunASR runtime-SDK-0.0.1 version supports the deployment of file transcription service, Mandarin (CPU version), with a complete speech recognition chain that can transcribe tens of hours of audio into punctuated text, and supports recognition for more than a hundred concurrent streams.

To meet the needs of different users, we have prepared different tutorials with text and images for both novice and advanced developers.

Technical Principles

The technical principles and documentation behind FunASR explain the underlying technology, recognition accuracy, computational efficiency, and core advantages of the framework, including convenience, high precision, high efficiency, and support for long audio chains. For detailed information, please refer to the documentation available by docs.

Deployment Tutorial

The documentation mainly targets novice users who have no need for modifications or customization. It supports downloading model deployments from modelscope and also supports deploying models that users have fine-tuned. For detailed tutorials, please refer to docs.

Advanced Development Guide

The documentation mainly targets advanced developers who require modifications and customization of the service. It supports downloading model deployments from modelscope and also supports deploying models that users have fine-tuned. For detailed information, please refer to the documentation available by docs