This is the repository holding code and data for "FrugalML: How to Use ML Prediction APIs More Accurately and Cheaply".
Bringing local LLMs to a Minecraft front-end through commands.
LLM Kit - Python Large Language Model Kit for generating data of your choice
Large Multi-Language Models for News Translation
AccIo - Enterprise LLM: Unifying intelligence at your command!
Python-based WebSocket interface for CLI LLaVA inference.
Effortlessly create and manage your own AI infrastructure with Radiantloom AI. Privacy, security, and flexibility meet ease-of-use in this innovative open-source platform.
Mamba for Vision, Perception and Action
Detailed code explanation of Google's Gemini LLM.
How to stream LLM responses using AWS API Gateway Websockets and Lambda
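As a rough illustration of that pattern (not code from the repo), a Lambda handler on a WebSocket route can push tokens back through the API Gateway management API as they are produced; the event fields used here match the WebSocket request context, but the toy token generator and route setup are assumptions.

```python
import json
import boto3

def handler(event, context):
    """Hypothetical Lambda handler for a WebSocket route that streams tokens back."""
    ctx = event["requestContext"]
    connection_id = ctx["connectionId"]
    # The management API endpoint is derived from the request context.
    endpoint_url = f"https://{ctx['domainName']}/{ctx['stage']}"
    gateway = boto3.client("apigatewaymanagementapi", endpoint_url=endpoint_url)

    prompt = json.loads(event.get("body") or "{}").get("prompt", "")

    # Placeholder for a streaming LLM call; swap in a real streaming client here.
    def generate_tokens(text):
        for word in ("Echo:", *text.split()):
            yield word + " "

    # Forward each token to the connected client as soon as it is available.
    for token in generate_tokens(prompt):
        gateway.post_to_connection(ConnectionId=connection_id, Data=token.encode("utf-8"))

    return {"statusCode": 200}
```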
In this workshop, we demonstrate how to choose the right container and instance types, optimize container parameters, set up the right autoscaling policies, and use APIs to get recommendations with Amazon SageMaker.
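For the recommendations piece specifically, the SageMaker Inference Recommender can be driven from boto3; the sketch below is a hedged example (job name, role ARN, and model package ARN are placeholders, not material from the workshop).

```python
import boto3

sagemaker = boto3.client("sagemaker")

# Ask Inference Recommender to benchmark instance types for a registered model.
sagemaker.create_inference_recommendations_job(
    JobName="llm-endpoint-rightsizing",  # placeholder name
    JobType="Default",
    RoleArn="arn:aws:iam::111122223333:role/SageMakerExecutionRole",  # placeholder
    InputConfig={
        "ModelPackageVersionArn": (
            "arn:aws:sagemaker:us-east-1:111122223333:model-package/my-llm/1"  # placeholder
        )
    },
)

# Each recommendation pairs an endpoint configuration with cost/latency metrics.
job = sagemaker.describe_inference_recommendations_job(JobName="llm-endpoint-rightsizing")
for rec in job.get("InferenceRecommendations", []):
    print(rec["EndpointConfiguration"]["InstanceType"], rec["Metrics"])
```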
Simple chat interface for local AI using llama-cpp-python and llama-cpp-agent
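A minimal sketch of such a chat loop with llama-cpp-python (the model path and generation settings are assumptions, and llama-cpp-agent's higher-level helpers are omitted):

```python
from llama_cpp import Llama

# Path to a local GGUF model is an assumption; point it at any chat-tuned model.
llm = Llama(model_path="models/mistral-7b-instruct.Q4_K_M.gguf", n_ctx=2048)

history = [{"role": "system", "content": "You are a helpful assistant."}]

while True:
    user_input = input("you> ")
    if user_input.strip().lower() in {"exit", "quit"}:
        break
    history.append({"role": "user", "content": user_input})

    # create_chat_completion applies the model's chat template and returns
    # an OpenAI-style response dict.
    reply = llm.create_chat_completion(messages=history, max_tokens=256)
    answer = reply["choices"][0]["message"]["content"]
    history.append({"role": "assistant", "content": answer})
    print("ai>", answer)
```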
Plug-and-play implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models, which elevates model reasoning by at least 70%.
This repository contains a question-answering model exposed as an interface that retrieves answers from a vector database for a given question. Embeddings (tokenised vectors) are computed via OpenAI API calls and inserted into ChromaDB for retrieval-augmented generation (RAG). An OpenAI API key is required to run this service.
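A minimal sketch of that flow, assuming an in-memory ChromaDB collection, OpenAI embeddings, and a gpt-3.5-turbo answering step (the collection name and sample documents are made up for illustration):

```python
import os
import chromadb
from chromadb.utils import embedding_functions
from openai import OpenAI

# OpenAI embeddings are used both when indexing and when querying the collection.
openai_ef = embedding_functions.OpenAIEmbeddingFunction(
    api_key=os.environ["OPENAI_API_KEY"],
    model_name="text-embedding-ada-002",
)

chroma = chromadb.Client()  # in-memory; a real service would use a persistent client
docs = chroma.get_or_create_collection(name="knowledge", embedding_function=openai_ef)
docs.add(
    ids=["doc1", "doc2"],
    documents=[
        "ChromaDB is an open-source embedding database.",
        "RAG augments an LLM prompt with retrieved context.",
    ],
)

def answer(question: str) -> str:
    # Retrieve the most similar passages, then let the LLM answer from them only.
    hits = docs.query(query_texts=[question], n_results=2)
    context = "\n".join(hits["documents"][0])
    client = OpenAI()
    resp = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[
            {"role": "system", "content": f"Answer using only this context:\n{context}"},
            {"role": "user", "content": question},
        ],
    )
    return resp.choices[0].message.content

print(answer("What is RAG?"))
```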
A framework for multiple LLM models to operate in a non-adversarial fashion based on the structure of a bee colony working together to maintain a hive.
Our project addresses the challenge of multi-document summarization with Large Language Models (LLMs), which are constrained by token length limitations. We propose a novel approach that combines the strengths of LLMs and Maximal Marginal Relevance (MMR).
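For reference, MMR greedily selects items that are relevant to the query yet dissimilar to what has already been chosen; a small NumPy sketch of that selection step (the lambda weight and toy embeddings are assumptions, not the project's code):

```python
import numpy as np

def mmr(query_vec, doc_vecs, k=3, lam=0.7):
    """Select k documents balancing query relevance against redundancy (MMR)."""
    def cos(a, b):
        return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

    relevance = [cos(query_vec, d) for d in doc_vecs]
    selected, candidates = [], list(range(len(doc_vecs)))

    while candidates and len(selected) < k:
        def score(i):
            # Penalise similarity to anything already selected.
            redundancy = max((cos(doc_vecs[i], doc_vecs[j]) for j in selected), default=0.0)
            return lam * relevance[i] - (1 - lam) * redundancy
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected

# Toy example: pick 2 of 3 chunk embeddings that are relevant but not redundant.
rng = np.random.default_rng(0)
chunks = rng.normal(size=(3, 8))
query = rng.normal(size=8)
print(mmr(query, chunks, k=2))
```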
A Production-Ready, Scalable RAG-powered LLM-based Context-Aware QA App
GUI for GGML Alpaca models