Whisper.rn 项目下载及安装教程

2024-12-08 13:17:06作者：裘晴惠Vivianne

1. 项目介绍

Whisper.rn 是一个React Native的绑定库，用于OpenAI的Whisper自动语音识别（ASR）模型的性能推断。这个库可以让你在移动应用中集成Whisper ASR模型，实现音频文件的语音识别功能。

2. 项目下载位置

该项目托管在GitHub上，你可以通过以下地址进行下载和克隆：

https://github.com/mybigday/whisper.rn.git

3. 项目安装环境配置

在开始安装前，请确保你的开发环境已经配置了以下依赖：

Node.js
npm或yarn包管理器
React Native开发环境

以下是一个典型的环境配置流程（以macOS为例）：

# 安装Homebrew（如果尚未安装）
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

# 安装Node.js
brew install node

# 安装React Native CLI
npm install -g react-native-cli

4. 项目安装方式

克隆项目到本地后，进入项目目录进行安装：

# 克隆项目
git clone https://github.com/mybigday/whisper.rn.git

# 进入项目目录
cd whisper.rn

# 安装依赖
npm install

# 如果使用yarn
# yarn install

5. 项目处理脚本

根据官方文档，以下是初始化Whisper ASR模型和一些基本用法的示例脚本：

// 引入初始化函数
import { initWhisper } from 'whisper.rn';

// 初始化Whisper模型
const whisperContext = await initWhisper({
  filePath: 'file:///path/to/ggml-tiny.en.bin'
});

// 转写音频文件
const sampleFilePath = 'file:///path/to/sample.wav';
const options = { language: 'en' };
const [stop, promise] = whisperContext.transcribe(sampleFilePath, options);
const [result] = await promise;
// result: 转写结果

使用实时转写功能时，可以按照以下方式进行：

// 实时转写
const [stop, subscribe] = await whisperContext.transcribeRealtime(options);
subscribe(evt => {
  const [isCapturing, data, processTime, recordingTime] = evt;
  console.log(`Realtime transcribing: ${isCapturing ? 'ON' : 'OFF'}`);
  console.log(`Result: ${data.result}`);
  console.log(`Process time: ${processTime}ms`);
  console.log(`Recording time: ${recordingTime}ms`);
});