# 📹🎙️🦀 Rust Client SDK for LiveKit
Use this SDK to add realtime video, audio and data features to your Rust app. By connecting to LiveKit Cloud or a self-hosted server, you can quickly build applications such as multi-modal AI, live streaming, or video calls with just a few lines of code.
## Features
- Receiving tracks
- Publishing tracks
- Data channels
- Simulcast
- SVC codecs (AV1/VP9)
- Adaptive Streaming
- Dynacast
- Hardware video enc/dec
  - H.264, H.265 using VideoToolbox (macOS/iOS)
  - H.264 on NVIDIA and AMD GPUs (Linux)
  - H.264, H.265 on NVIDIA Jetson (Linux)
- Supported platforms
  - Windows
  - macOS
  - Linux
  - iOS
  - Android
## Crates
- `livekit-api`: Server APIs and auth token generation
- `livekit`: LiveKit real-time SDK
- `livekit-ffi`: Internal crate, used to generate bindings for other languages
- `livekit-protocol`: LiveKit protocol generated code
When adding the SDK as a dependency to your project, make sure to add the necessary rustflags to your Cargo config; otherwise, linking may fail. Also, please refer to the list of supported platform toolkits.
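As a sketch of where those flags go (the values below are placeholders — substitute the rustflags documented for your target platform), they can be set in a `.cargo/config.toml` at the project root:

```toml
# .cargo/config.toml — structure only; the flag value here is a placeholder,
# not one of the actual flags required by the SDK.
[target.x86_64-pc-windows-msvc]
rustflags = ["-C", "link-arg=<platform-specific-flag>"]
```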
## Getting started
Currently, Tokio is required to use this SDK; however, we plan to make the async executor runtime-agnostic.
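A minimal dependency setup reflecting this might look as follows (crate versions are illustrative placeholders — check crates.io for the current releases):

```toml
# Cargo.toml — versions shown are placeholders, not pinned recommendations
[dependencies]
livekit = "0.x"
livekit-api = "0.x"
tokio = { version = "1", features = ["full"] }
```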
## Using Server API
### Generating an access token
```rust
use livekit_api::access_token;
use std::env;

fn create_token() -> Result<String, access_token::AccessTokenError> {
    let api_key = env::var("LIVEKIT_API_KEY").expect("LIVEKIT_API_KEY is not set");
    let api_secret = env::var("LIVEKIT_API_SECRET").expect("LIVEKIT_API_SECRET is not set");
    access_token::AccessToken::with_api_key(&api_key, &api_secret)
        .with_identity("rust-bot")
        .with_name("Rust Bot")
        .with_grants(access_token::VideoGrants {
            room_join: true,
            room: "my-room".to_string(),
            ..Default::default()
        })
        .to_jwt()
}
```
### Creating a room with RoomService API
```rust
use livekit_api::services::room::{CreateRoomOptions, RoomClient};

#[tokio::main]
async fn main() {
    let room_service = RoomClient::new("http://localhost:7880").unwrap();
    let room = room_service
        .create_room("my_room", CreateRoomOptions::default())
        .await
        .unwrap();
    println!("Created room: {:?}", room);
}
```
## Using Real-time SDK
Connect to a Room and listen for events:
```rust
use livekit::prelude::*;

#[tokio::main]
async fn main() -> Result<()> {
    let (room, mut room_events) = Room::connect(&url, &token).await?;

    while let Some(event) = room_events.recv().await {
        match event {
            RoomEvent::TrackSubscribed { track, publication, participant } => {
                // ...
            }
            _ => {}
        }
    }

    Ok(())
}
```
### Receive video frames of a subscribed track
```rust
// ...
use futures::StreamExt; // this trait is required for iterating on audio & video frames
use livekit::prelude::*;

match event {
    RoomEvent::TrackSubscribed { track, publication, participant } => {
        match track {
            RemoteTrack::Audio(audio_track) => {
                let rtc_track = audio_track.rtc_track();
                let mut audio_stream = NativeAudioStream::new(rtc_track);
                tokio::spawn(async move {
                    // Receive the audio frames in a new task
                    while let Some(audio_frame) = audio_stream.next().await {
                        log::info!("received audio frame - {audio_frame:#?}");
                    }
                });
            }
            RemoteTrack::Video(video_track) => {
                let rtc_track = video_track.rtc_track();
                let mut video_stream = NativeVideoStream::new(rtc_track);
                tokio::spawn(async move {
                    // Receive the video frames in a new task
                    while let Some(video_frame) = video_stream.next().await {
                        log::info!("received video frame - {video_frame:#?}");
                    }
                });
            }
        }
    }
    _ => {}
}
```
## Examples
- basic room: simple example connecting to a room.
- wgpu_room: complete example app with video rendering using wgpu and egui.
- mobile: mobile app targeting iOS and Android
- play_from_disk: publish audio from a wav file
- save_to_disk: save received audio to a wav file
## Building
### macOS
When building on macOS, the `-ObjC` linker flag is needed. LiveKit's WebRTC implementation makes use of Objective-C libraries on the Mac. You may get the following error if the app isn't linked with `-ObjC`:
```
*** Terminating app due to uncaught exception 'NSInvalidArgumentException', reason: '-[RTCVideoCodecInfo nativeSdpVideoFormat]: unrecognized selector sent to instance 0x600003bc6660'
```
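One way to pass the flag is through a Cargo target-specific config entry — a sketch only; adjust the target triple to your machine (`aarch64-apple-darwin` for Apple Silicon, `x86_64-apple-darwin` for Intel Macs):

```toml
# .cargo/config.toml — forward -ObjC to the linker so Objective-C
# categories in the WebRTC libraries are loaded correctly.
[target.aarch64-apple-darwin]
rustflags = ["-C", "link-arg=-ObjC"]
```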
## Motivation and Design Goals
LiveKit aims to provide an open source, end-to-end WebRTC stack that works everywhere. We have two goals in mind with this SDK:
- Build a standalone, cross-platform LiveKit client SDK for Rustaceans.
- Build a common core for other platform-specific SDKs (e.g. Unity, Unreal, iOS, Android)
Regarding (2), we've already developed a number of client SDKs for several platforms and encountered a few challenges in the process:
- There's a significant amount of business/control logic in our signaling protocol and WebRTC. Currently, this logic needs to be implemented in every new platform we support.
- Interactions with media devices and encoding/decoding are specific to each platform and framework.
- For multi-platform frameworks (e.g. Unity, Flutter, React Native), the aforementioned tasks proved to be extremely painful.
Thus, we posited that a Rust SDK, something we wanted to build anyway, encapsulating all our business logic and platform-specific APIs into a clean set of abstractions, could also serve as the foundation for our other SDKs!
We'll first use it as a basis for our Unity SDK (under development), but over time, it will power our other SDKs, as well.
| LiveKit Ecosystem | |
|---|---|
| Agents SDKs | Python · Node.js |
| LiveKit SDKs | Browser · Swift · Android · Flutter · React Native · Rust · Node.js · Python · Unity · Unity (WebGL) · ESP32 · C++ |
| Starter Apps | Python Agent · TypeScript Agent · React App · SwiftUI App · Android App · Flutter App · React Native App · Web Embed |
| UI Components | React · Android Compose · SwiftUI · Flutter |
| Server APIs | Node.js · Golang · Ruby · Java/Kotlin · Python · Rust · PHP (community) · .NET (community) |
| Resources | Docs · Docs MCP Server · CLI · LiveKit Cloud |
| LiveKit Server OSS | LiveKit server · Egress · Ingress · SIP |
| Community | Developer Community · Slack · X · YouTube |
