training does not start #12644

faridamousa · 2024-05-13T06:07:54Z

Search before asking

I have searched the YOLOv8 issues and discussions and found no similar questions.

Question

I am trying to train a model.
This is my code:
from ultralytics import YOLO

model = YOLO("yolov8n.yaml")  # build a new, untrained model from the YAML config

# Train the model
model.train(data="config.yaml", epochs=30, batch=4, imgsz=256)

metrics = model.val(data="config.yaml")
print(metrics.box)

When I run the file, epoch 1 reaches 4% and then training stops. Why is this happening? It happened with both YOLOv8 and YOLOv9.

Additional

No response

glenn-jocher · 2024-05-13T13:23:29Z

@faridamousa hello! It appears that the training process halts unexpectedly at 4% during the first epoch. Here are a few suggestions that might help to troubleshoot and resolve the issue:

Check the Dataset: Verify that config.yaml is correctly set up with valid paths to your dataset folders and that the images and labels are accessible and properly formatted.
Hardware Resources: Ensure that your hardware resources are not being maxed out. Monitor the CPU and memory usage, and if you are training on GPU, check for any potential issues with the CUDA environment or out-of-memory errors.
Terminal Output/Logs: Look closely at any error messages or warnings in the terminal output or logs generated during the training process. These might give more context on why the training is stopping.
Version Compatibility: Confirm that your YOLOv8 or YOLOv9 environment is set up with compatible versions of dependencies like PyTorch, CUDA, etc.
Simplify Your Configuration: Try reducing batch size or imgsz to see if it has an impact on progressing past the 4% mark.
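
For the dataset check, a minimal sketch along these lines may help. It assumes detection labels in the usual YOLO text format (one `class x_center y_center width height` row per object, coordinates normalized to 0..1) and one `.txt` label per image with the same file stem. The function names `check_label_file` and `check_split` are hypothetical helpers written for this example, not part of the ultralytics package.

```python
from pathlib import Path

IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".bmp"}


def check_label_file(path, num_classes=None):
    """Return a list of problems found in one YOLO detection label file."""
    problems = []
    lines = Path(path).read_text().splitlines()
    for lineno, line in enumerate(lines, start=1):
        if not line.strip():
            continue  # blank lines are harmless
        parts = line.split()
        if len(parts) != 5:
            problems.append(f"{path}:{lineno}: expected 5 fields, got {len(parts)}")
            continue
        try:
            cls = int(parts[0])
            coords = [float(v) for v in parts[1:]]
        except ValueError:
            problems.append(f"{path}:{lineno}: could not parse class id or coordinates")
            continue
        if cls < 0 or (num_classes is not None and cls >= num_classes):
            problems.append(f"{path}:{lineno}: class id {cls} out of range")
        if any(not 0.0 <= v <= 1.0 for v in coords):
            problems.append(f"{path}:{lineno}: coordinates not normalized to 0..1")
    return problems


def check_split(images_dir, labels_dir, num_classes=None):
    """Check that both folders exist and validate the label of every image."""
    images_dir, labels_dir = Path(images_dir), Path(labels_dir)
    problems = []
    for folder in (images_dir, labels_dir):
        if not folder.is_dir():
            problems.append(f"{folder}: directory not found")
    if problems:
        return problems
    for img in sorted(images_dir.iterdir()):
        if img.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        label = labels_dir / (img.stem + ".txt")
        if not label.is_file():
            # May be intentional for background images; reported as a note only.
            problems.append(f"{img.name}: note: no label file found")
            continue
        problems.extend(check_label_file(label, num_classes))
    return problems
```

Calling `check_split` once per split, with the image and label folders that config.yaml points to, and printing the returned list should show whether a path or a malformed label is involved. An empty list means this particular check found nothing.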

If none of these suggestions resolve the issue, it would be helpful to have more details such as terminal output/errors, hardware specifications, and the exact content of config.yaml. This information will help in diagnosing the problem more effectively.
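
One way to collect that terminal output is to launch the training script as a child process and save everything it prints, together with its exit code. The sketch below uses only the Python standard library; `run_and_log` and `describe_exit` are hypothetical helper names, and `train.py` in the usage comment stands in for whatever file holds the training code. On POSIX systems a negative return code means the process was terminated by a signal, which is one possible sign of the operating system killing it for running out of memory, though that is only an assumption until the log confirms it.

```python
import subprocess


def run_and_log(cmd, log_path):
    """Run cmd, write its combined stdout/stderr to log_path, return the exit code."""
    with open(log_path, "w", encoding="utf-8") as log:
        proc = subprocess.run(cmd, stdout=log, stderr=subprocess.STDOUT)
    return proc.returncode


def describe_exit(code):
    """Turn a subprocess return code into a short human-readable summary."""
    if code == 0:
        return "finished normally"
    if code < 0:
        # POSIX only: subprocess reports -N when the child died from signal N.
        return f"killed by signal {-code}"
    return f"exited with error code {code}"


# Example usage (train.py is a placeholder for the actual training script):
# code = run_and_log(["python", "train.py"], "train.log")
# print(describe_exit(code))
```

The resulting log file and the exit summary can then be pasted into the issue along with config.yaml.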

faridamousa added the question Further information is requested label May 13, 2024
