trt8 is not supply leaky? #187

Open
tuteming opened this issue Feb 16, 2023 · 3 comments


@tuteming

Does the trt8 version not support leaky?

@tuteming
Author

I have a YOLO-like cfg that uses leaky. It runs fine with your trt7 version,
but it fails with your trt8 version.
The error is in chunk.cu, line 74:
NV_CUDA_CHECK(cudaMemcpyAsync((char*)outputs[0] + b * _n_size_split, (char*)inputs[0] + b * 2 * _n_size_split, _n_size_split, cudaMemcpyDeviceToDevice, stream));

Should I change leaky to mish in my config file?

@tuteming
Author

If I set int _n_size_split = 0 in chunk.h,
it runs, but there are no detection results.
Do you have any idea that could help me?

Thanks
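
A note on why forcing `_n_size_split` to 0 would produce no detections: the quoted `cudaMemcpyAsync` call copies `_n_size_split` bytes per batch, so a size of 0 makes every copy a no-op and the output buffer is never written. The sketch below mimics that per-batch copy on the host with `std::memcpy`. It is only an illustration under assumptions: the function name `chunk_first_half` and the use of `std::vector` are hypothetical and do not come from the repository, and how the real plugin computes `_n_size_split` is not shown in this thread.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Host-side stand-in (hypothetical) for the per-batch copy quoted from chunk.cu.
// Each batch of the input holds 2 * n_size_split bytes; the first n_size_split
// bytes of every batch are copied into the output.
std::vector<char> chunk_first_half(const std::vector<char>& input,
                                   std::size_t batch,
                                   std::size_t n_size_split) {
    // The input must hold two halves per batch.
    assert(input.size() >= batch * 2 * n_size_split);

    std::vector<char> output(batch * n_size_split, 0);
    for (std::size_t b = 0; b < batch; ++b) {
        // With n_size_split == 0 there is nothing to copy, so the
        // output stays empty, mirroring the "runs but detects nothing" case.
        if (n_size_split == 0) continue;
        std::memcpy(output.data() + b * n_size_split,
                    input.data() + b * 2 * n_size_split,
                    n_size_split);
    }
    return output;
}
```

Called with two batches of four bytes each and `n_size_split = 2`, this returns the first two bytes of each batch. Called with `n_size_split = 0`, it returns an empty buffer, so any layer reading that output would see no data.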

@tuteming
Author

In both the trt7 version (works) and the trt8 version (no results),
in yolo.cpp I use
outputTensor.gridSize = (m_InputH / 32) * pow(2, _n_yolo_ind);
outputTensor.grid_h = (m_InputH / 32) * pow(2, _n_yolo_ind);
outputTensor.grid_w = (m_InputW / 32) * pow(2, _n_yolo_ind);

and I deleted
if (m_NetworkType == "yolov4")//pan
{
    outputTensor.gridSize = (m_InputH / 32) * pow(2, 2 - _n_yolo_ind);
    outputTensor.grid_h = (m_InputH / 32) * pow(2, 2 - _n_yolo_ind);
    outputTensor.grid_w = (m_InputW / 32) * pow(2, 2 - _n_yolo_ind);
}

Thanks
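
To make the difference between the two grid-size formulas in this comment concrete, here is a small standalone sketch that evaluates both for each YOLO head. The function name `grid_sizes`, the 416-pixel input, and the three-head layout are assumptions chosen for illustration; they are not taken from the repository or from the cfg in question. The integer shift `1 << exponent` stands in for `pow(2, exponent)` and gives the same value for non-negative exponents.

```cpp
#include <cassert>
#include <vector>

// Grid size per YOLO head for the two formulas quoted in the comment.
//   use_pan_order == false: (input_size / 32) * 2^ind
//   use_pan_order == true : (input_size / 32) * 2^(2 - ind)   (the deleted yolov4 branch)
std::vector<int> grid_sizes(int input_size, int num_heads, bool use_pan_order) {
    std::vector<int> sizes;
    for (int ind = 0; ind < num_heads; ++ind) {
        int exponent = use_pan_order ? (2 - ind) : ind;
        // The second formula only makes sense for at most three heads.
        assert(exponent >= 0);
        sizes.push_back((input_size / 32) * (1 << exponent));
    }
    return sizes;
}
```

For a 416-pixel input and three heads, the first formula yields grids of 13, 26, 52 and the second yields 52, 26, 13. The two orderings only agree for the middle head, so the formula has to match the order in which the cfg declares its yolo layers.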
