inference-server

Star

Here are 40 public repositories matching this topic...

NVIDIA / gpu-rest-engine

Star

A REST API for Caffe using Docker and Go

docker caffe deep-learning gpu inference inference-server

Updated Jul 20, 2018
C++

underneathall / pinferencia

Star

Python + Inference - Model Deployment library in Python. Simplest model inference server ever.

Updated Feb 14, 2023
Python

roboflow / inference

Star

A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.

Updated Jun 7, 2024
Python

BMW-InnovationLab / BMW-YOLOv4-Inference-API-GPU

Star

This is a repository for an nocode object detection inference API using the Yolov3 and Yolov4 Darknet framework.

Updated Jun 28, 2022
Python

BMW-InnovationLab / BMW-YOLOv4-Inference-API-CPU

Star

This is a repository for an nocode object detection inference API using the Yolov4 and Yolov3 Opencv.

Updated Jun 28, 2022
Python

basetenlabs / truss

Star

The simplest way to serve AI/ML models in production

open-source machine-learning packaging artificial-intelligence falcon easy-to-use whisper inference-server model-serving inference-api stable-diffusion wizardlm

Updated Jun 7, 2024
Python

BMW-InnovationLab / BMW-TensorFlow-Inference-API-CPU

Star

This is a repository for an object detection inference API using the Tensorflow framework.

Updated Jun 28, 2022
Python

autodeployai / ai-serving

Star

Serving AI/ML models in the open standard formats PMML and ONNX with both HTTP (REST API) and gRPC endpoints

api machine-learning real-time deep-learning grpc inference pmml inference-server onnx onnx-models ai-serving pmml-model

Updated Mar 13, 2024
Scala

pipeless-ai / pipeless

Star

An open-source computer vision framework to build and deploy apps in minutes

Updated May 8, 2024
Rust

containers / podman-desktop-extension-ai-lab

Star

Work with LLMs on a local environment using containers

ai local containers inference-server podman llms

Updated Jun 7, 2024
TypeScript

notAI-tech / fastDeploy

Star

Deploy DL/ ML inference pipelines with minimal extra code.

Updated Apr 23, 2024
Python

RubixML / Server

Star

A standalone inference server for trained Rubix ML estimators.

api infrastructure php machine-learning microservice json-api rest-api inference http-server inference-server inference-engine model-deployment php-ml ml-infrastructure model-server rubix-ml php-machine-learning rubix-server

Updated Feb 18, 2024
PHP

haicheviet / fullstack-machine-learning-inference

Star

Fullstack machine learning inference template

aws machine-learning cloudformation full-stack infrastructure-as-code twitter-sentiment-analysis inference-server fastapi machine-learning-template machine-learning-infrastructure

Updated Nov 24, 2023
Jupyter Notebook

kf5i / k3ai

Star

K3ai is a lightweight, fully automated, AI infrastructure-in-a-box solution that allows anyone to experiment quickly with Kubeflow pipelines. K3ai is perfect for anything from Edge to laptops.

kubernetes artificial-intelligence edge datascience machinelearning inference-server kubeflow kubeflow-pipelines k3s

Updated Nov 2, 2021
PowerShell

kibae / onnxruntime-server

Star

ONNX Runtime Server: The ONNX Runtime Server is a server that provides TCP and HTTP/HTTPS REST APIs for ONNX inference.

machine-learning ai deep-learning cuda inference-server nueral-networks contributions-welcome onnx onnxruntime

Updated Jun 3, 2024
C++

k9ele7en / Triton-TensorRT-Inference-CRAFT-pytorch

Star

Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> ONNX -> TensorRT, Inference pipelines (TensorRT, Triton server - multi-format). Supported model format for Triton inference: TensorRT engine, Torchscript, ONNX