mirror of
https://github.com/twitter/the-algorithm.git
synced 2024-11-05 19:25:09 +01:00
35 lines
2.4 KiB
Markdown
35 lines
2.4 KiB
Markdown
|
# Navi: High-Performance Machine Learning Serving Server in Rust
|
||
|
|
||
|
Navi is a high-performance, versatile machine learning serving server implemented in Rust, tailored for production usage. It's designed to efficiently serve within the Twitter tech stack, offering top-notch performance while focusing on core features.
|
||
|
|
||
|
## Key Features
|
||
|
|
||
|
- **Minimalist Design Optimized for Production Use Cases**: Navi delivers ultra-high performance, stability, and availability, engineered to handle real-world application demands with a streamlined codebase.
|
||
|
- **gRPC API Compatibility with TensorFlow Serving**: Seamless integration with existing TensorFlow Serving clients via its gRPC API, enabling easy integration, smooth deployment, and scaling in production environments.
|
||
|
- **Plugin Architecture for Different Runtimes**: Navi's pluggable architecture supports various machine learning runtimes, providing adaptability and extensibility for diverse use cases. Out-of-the-box support is available for TensorFlow and Onnx Runtime, with PyTorch in an experimental state.
|
||
|
|
||
|
## Current State
|
||
|
|
||
|
While Navi's features may not be as comprehensive as its open-source counterparts, its performance-first mindset makes it highly efficient.
|
||
|
- Navi for TensorFlow is currently the most feature-complete, supporting multiple input tensors of different types (float, int, string, etc.).
|
||
|
- Navi for Onnx primarily supports one input tensor of type string, used in Twitter's home recommendation with a proprietary BatchPredictRequest format.
|
||
|
- Navi for Pytorch is compilable and runnable but not yet production-ready in terms of performance and stability.
|
||
|
|
||
|
## Directory Structure
|
||
|
|
||
|
- `navi`: The main code repository for Navi
|
||
|
- `dr_transform`: Twitter-specific converter that converts BatchPredictionRequest Thrift to ndarray
|
||
|
- `segdense`: Twitter-specific config to specify how to retrieve feature values from BatchPredictionRequest
|
||
|
- `thrift_bpr_adapter`: generated thrift code for BatchPredictionRequest
|
||
|
|
||
|
## Content
|
||
|
We include all *.rs source code that makes up the main navi binaries for you to examine. The test and benchmark code, as well as configuration files are not included due to data security concerns.
|
||
|
|
||
|
## Run
|
||
|
in navi/navi you can run. Note you need to create a models directory and create some versions, preferably using epoch time, e.g., 1679693908377
|
||
|
- scripts/run_tf2.sh
|
||
|
- scripts/run_onnx.sh
|
||
|
|
||
|
## Build
|
||
|
you can adapt the above scripts to build using Cargo
|