ModelTC

Model Infra

Pinned

  1. MQBench Public

    Model Quantization Benchmark

    Shell 728 135

  2. United-Perception Public

    United Perception

    Python 424 65

  3. NNLQP Public

    Python 33 3

  4. Dipoorlet Public

    Offline quantization tools for deployment.

    Python 103 13

  5. lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python 1.9k 170

Repositories

Showing 10 of 35 repositories
  • lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python 1,910 Apache-2.0 170 51 5 Updated May 28, 2024
  • llmc Public

    This is the official implementation of "LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models", and it is also an efficient LLM compression tool with various advanced compression methods, supporting multiple inference backends.

    Python 81 Apache-2.0 8 2 0 Updated May 24, 2024
  • general-sam-py Public

    Python bindings for general-sam and some utilities

    Python 1 Apache-2.0 0 0 1 Updated May 20, 2024
  • FCPTS Public template
    Python 1 0 0 0 Updated May 14, 2024
  • msbench Public

    A tool for model sparsification based on torch.fx

    Python 1 Apache-2.0 1 0 0 Updated May 14, 2024
  • TFMQ-DM Public

    [CVPR 2024 Highlight] TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models

    Jupyter Notebook 26 Apache-2.0 3 0 0 Updated May 13, 2024
  • mtc-token-healing Public

    Token healing implementation in Rust

    Rust 1 Apache-2.0 0 0 0 Updated May 13, 2024
  • statecs Public
    Rust 1 Apache-2.0 1 0 0 Updated May 10, 2024
  • Python 10 Apache-2.0 0 1 0 Updated Apr 27, 2024
  • general-sam Public

    A general suffix automaton implementation in Rust with Python bindings

    Rust 2 Apache-2.0 0 0 0 Updated Apr 25, 2024
