Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815 opened Dec 11, 2023 by lvhan028
[Bug] Space is incorrectly removed from start of generated text for /v1/completion endpoint
#1743 opened Jun 8, 2024 by josephrocca
[Docs] Guidance on setting num_tokens_per_iter and max_prefill_iters to optimal values
#1740 opened Jun 8, 2024 by josephrocca
[Bug] detokenize_incrementally: OverflowError: out of range integral type conversion attempted
#1739 opened Jun 7, 2024 by josephrocca
[Bug] CUDA OOM during calibration even with 5x 4090s? Falling back to --device cpu also fails (with different error)
#1729 opened Jun 7, 2024 by josephrocca
High GPU memory for running InternVL-Chat-V1-5-AWQ [awaiting response]
#1728 opened Jun 7, 2024 by tairen99
[Bug] Mini-InternVL1.5-4B does not initialize successfully.
#1721 opened Jun 6, 2024 by cydiachen
[Bug] key_stats.pth cannot be found when using 4-bit KV quantization with the internlm2-chat-1_8b model [awaiting response]
#1720 opened Jun 6, 2024 by jxfruit
[Bug] Why does prefix caching change the generated content?
#1719 opened Jun 5, 2024 by DayDayupupupup
[Feature] lmdeploy can launch a Gradio app from the command line; can this Gradio app offer users a way to customize the UI? [awaiting response]
#1710 opened Jun 5, 2024 by kaiwang0112006