cudaMemcpy takes a long time for large images; is there a way to optimize it? #192

Open
MrWwei opened this issue Apr 19, 2023 · 1 comment

MrWwei commented Apr 19, 2023

I am using the yolo gpuptr version. When a batch of data is added to images_gpu, the host-to-device copy takes a long time. Could you suggest some ways to optimize this?

hopef commented May 24, 2023

You can use pinned memory, that is, allocate the host memory with cudaMallocHost. The improvement is usually quite noticeable.
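
To make the suggestion concrete, here is a minimal sketch that times the same host-to-device `cudaMemcpy` from a pageable buffer (`malloc`) and from a pinned buffer (`cudaMallocHost`). It is not code from this repository: the `CHECK` macro, the `time_h2d_ms` helper, and the batch size (8 images of 3840x2160x3 bytes) are assumptions chosen for illustration, and the actual speedup depends on your GPU, driver, and PCIe link.

```cuda
#include <cuda_runtime.h>
#include <cstdio>
#include <cstdlib>
#include <cstring>

// Abort with a message if a CUDA runtime call fails (hypothetical helper macro).
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,   \
                         cudaGetErrorString(err_));                   \
            std::exit(EXIT_FAILURE);                                  \
        }                                                             \
    } while (0)

// Time one host-to-device copy of `bytes` bytes using CUDA events
// (hypothetical helper, not part of any library).
static float time_h2d_ms(void* device, const void* host, size_t bytes) {
    cudaEvent_t start, stop;
    CHECK(cudaEventCreate(&start));
    CHECK(cudaEventCreate(&stop));
    CHECK(cudaEventRecord(start));
    CHECK(cudaMemcpy(device, host, bytes, cudaMemcpyHostToDevice));
    CHECK(cudaEventRecord(stop));
    CHECK(cudaEventSynchronize(stop));
    float ms = 0.0f;
    CHECK(cudaEventElapsedTime(&ms, start, stop));
    CHECK(cudaEventDestroy(start));
    CHECK(cudaEventDestroy(stop));
    return ms;
}

int main() {
    // Assumed workload: a batch of 8 images, 3840x2160, 3 bytes per pixel.
    const size_t bytes = 8ull * 3840 * 2160 * 3;

    void* device = nullptr;
    CHECK(cudaMalloc(&device, bytes));

    // Pageable host buffer: the driver has to stage it through an internal
    // pinned buffer before the DMA transfer can run.
    unsigned char* pageable = static_cast<unsigned char*>(std::malloc(bytes));
    if (pageable == nullptr) {
        std::fprintf(stderr, "malloc failed\n");
        return EXIT_FAILURE;
    }
    std::memset(pageable, 1, bytes);

    // Pinned (page-locked) host buffer: the GPU can DMA from it directly.
    unsigned char* pinned = nullptr;
    CHECK(cudaMallocHost(reinterpret_cast<void**>(&pinned), bytes));
    std::memset(pinned, 1, bytes);

    // Warm-up copy so one-time initialization cost is not measured.
    CHECK(cudaMemcpy(device, pinned, bytes, cudaMemcpyHostToDevice));

    std::printf("pageable: %.3f ms\n", time_h2d_ms(device, pageable, bytes));
    std::printf("pinned:   %.3f ms\n", time_h2d_ms(device, pinned, bytes));

    CHECK(cudaFreeHost(pinned));  // pinned memory must be released with cudaFreeHost
    std::free(pageable);
    CHECK(cudaFree(device));
    return 0;
}
```

Pinned allocations are comparatively expensive to create and reduce the memory the OS can page, so it is usually better to allocate the pinned buffer once and reuse it for every batch than to allocate it per copy. A pinned source buffer is also what allows `cudaMemcpyAsync` to run asynchronously with respect to the host.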
