Allowing for user defined transforms #10308

edkazcarlson · 2024-04-25T04:13:13Z

🛠️ PR Summary

_{Made with ❤️ by Ultralytics Actions}

🌟 Summary

Enhancements and Flexibility in Data Handling and Model Configuration

📊 Key Changes

Introduced inputCh configuration to specify the number of input channels for images.
Added flexibility in data augmentation and label transformation with override_label_transforms and append_label_transforms options.
Enhanced model and dataset initialization to support the new configurations.
Improved plot handling for datasets with non-standard (non-3) image channels.

🎯 Purpose & Impact

Custom Input Channels: Allows models to handle images with different numbers of channels (e.g., grayscale or multispectral images), making the library more versatile.
Data Augmentation Customization: The addition of label transformation options provides users with the ability to customize or extend the data preprocessing and augmentation steps. This can lead to better model performance by tailoring the preprocessing steps to specific dataset characteristics.
Better Support for Non-Standard Images: The updates ensure that plotting functions gracefully handle images with non-standard channels, avoiding errors and improving user experience when working with such data.

These changes enhance the library's flexibility, making it more adaptable to various types of data and specific project requirements.

github-actions · 2024-04-25T04:13:30Z

All Contributors have signed the CLA. ✅
_{Posted by the CLA Assistant Lite bot.}

github-actions

👋 Hello @edkazcarlson, thank you for submitting an Ultralytics YOLOv8 🚀 PR! To allow your work to be integrated as seamlessly as possible, we advise you to:

✅ Verify your PR is up-to-date with ultralytics/ultralytics main branch. If your PR is behind you can update your code by clicking the 'Update branch' button or by running git pull and git merge main locally.
✅ Verify all YOLOv8 Continuous Integration (CI) checks are passing.
✅ Update YOLOv8 Docs for any new or updated features.
✅ Reduce changes to the absolute minimum required for your bug fix or feature addition. "It is not daily increase but daily decrease, hack away the unessential. The closer to the source, the less wastage there is." — Bruce Lee

See our Contributing Guide for details and let us know if you have any questions!

codecov · 2024-04-25T04:15:56Z

Codecov Report

Attention: Patch coverage is 77.41935% with 14 lines in your changes missing coverage. Please review.

Project coverage is 70.10%. Comparing base (22dec59) to head (eb5e012).

Files	Patch %	Lines
ultralytics/data/dataset.py	57.89%	8 Missing ⚠️
ultralytics/models/yolo/classify/train.py	70.00%	3 Missing ⚠️
ultralytics/models/yolo/detect/train.py	83.33%	1 Missing ⚠️
ultralytics/models/yolo/detect/val.py	87.50%	1 Missing ⚠️
ultralytics/utils/plotting.py	50.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #10308      +/-   ##
==========================================
- Coverage   70.14%   70.10%   -0.04%     
==========================================
  Files         124      124              
  Lines       15716    15754      +38     
==========================================
+ Hits        11024    11045      +21     
- Misses       4692     4709      +17

Flag	Coverage Δ
Benchmarks	`35.33% <51.61%> (-0.02%)`	⬇️
GPU	`36.56% <51.61%> (-0.57%)`	⬇️
Tests	`66.14% <75.80%> (-0.09%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

edkazcarlson · 2024-04-25T04:17:39Z

I have read the CLA Document and I sign the CLA

…ility

edkazcarlson · 2024-05-29T02:08:53Z

Unsure if this will help with reviewing, but I tested the changes with the following code

def preprocess32(x):
    """
    Convert to float32 and normalize to 0-1
    Returns: np array
    """
    if (type(x) == type(torch.tensor([]))):
        x = x.transpose(0,1).transpose(1,2) # go from channel, height, width to height, width, channel
        x = x.numpy().astype(np.float32)
        if np.max(x) > 1.1:
            x /= 255
    elif type(x) == type(Image.Image()):
        x = np.float32(np.asarray(x))
        x /= 255
    elif type(x) == type(np.array([])):
        x = np.float32(x)
        x /= 255
    else: 
        print(f'In preprocess32 found {type(x)} wanted {type(Image.Image())} or {type(np.array())} or {type(torch.tensor([]))}')
        exit()
        
    assert np.max(x) <= 1.01, f'np.max(x) {np.max(x)}'
    assert np.min(x) >= -.01, f'np.min(x) {np.min(x)}'
    x = np.clip(x, 0, 1)
    return x

def FourChannelTransformMethod(x):
    x = preprocess32(x)
    x = torch.tensor(x, dtype= torch.float)
    x = x.transpose(2,1).transpose(1,0) # hwc -> chw
    x = torch.cat((x, torch.zeros_like(x[0]).unsqueeze(0)), dim=0)
    return x #c h w 

class FourChannelTransform(object):
    """Changes an image from bgr to lrgb.

    Args: normalizeSB: boolean that is true if the saturation and brightness are normalized around 0
    """
    def __init__(self):
        pass
    def __call__(self, labels):
        img = FourChannelTransformMethod(labels['img'])
        labels['img'] = img.to(torch.float16)
        return labels
    
    
class ThreeChannelTransform(object):
    """Changes an image from bgr to lrgb.

    Args: normalizeSB: boolean that is true if the saturation and brightness are normalized around 0
    """
    def __init__(self, dtype):
        self.dtype = dtype
        pass
    def __call__(self, labels):
        labels['img'] = labels['img'].to(self.dtype)
        return labels

class TwoChannelTransform(object):
    def __init__(self):
        pass

    def __call__(self, labels):
        labels['img'] = labels['img'][0:2]
        return labels

def firstTest():
    print('Default DetectionTrainer')
    overrides = {'epochs': 2, 'imgsz': 640, 'data': 'coco.yaml', 'model': f'yolov8n.yaml', 'inputCh': 3, 'batch': 8, 'close_mosaic': 1}
    trainer = DetectionTrainer(overrides=overrides)
    trainer.train()

def secondTest():
    print('3 channel detection trainer with float16')
    overrides = {'epochs': 2, 'imgsz': 640, 'data': 'coco.yaml', 'model': f'yolov8n.yaml', 'inputCh': 3, 'batch': 8, 'close_mosaic': 1}
    trainer = DetectionTrainer(overrides=overrides, append_label_transforms=ThreeChannelTransform(torch.float16))
    trainer.train()

def thirdTest():
    print('3 channel detection trainer with float32')
    overrides = {'epochs': 2, 'imgsz': 640, 'data': 'coco.yaml', 'model': f'yolov8n.yaml', 'inputCh': 3, 'batch': 8, 'close_mosaic': 1}
    trainer = DetectionTrainer(overrides=overrides, append_label_transforms=ThreeChannelTransform(torch.float32))
    trainer.train()
    
def fourthTest():
    print('4 channel detection')
    overrides = {'epochs': 2, 'imgsz': 640, 'data': 'coco.yaml', 'model': f'yolov8n.yaml', 'inputCh': 4, 'batch': 8, 'close_mosaic': 1}
    trainer = DetectionTrainer(overrides=overrides, append_label_transforms=FourChannelTransform())
    trainer.train()
    
def fifthTest():
    print('2 channel detection')
    overrides = {'epochs': 2, 'imgsz': 640, 'data': 'coco.yaml', 'model': f'yolov8n.yaml', 'inputCh': 2, 'batch': 8, 'close_mosaic': 1}
    trainer = DetectionTrainer(overrides=overrides, append_label_transforms=TwoChannelTransform())
    trainer.train()

…ility

glenn-jocher · 2024-05-29T09:50:15Z

Thanks for sharing your testing code! It looks comprehensive and covers a variety of scenarios with different channel configurations and data types. This will definitely help in understanding how the changes perform across different setups. If you encounter any issues or have further suggestions, feel free to share! 🚀

…ility

Burhan-Q · 2024-06-07T00:33:24Z

@edkazcarlson hey, saw your message in the Discord server. This is an interesting PR to be sure, but I wouldn't be the best person to evaluate it. I did have a couple of questions for you to consider:

With the inputCh argument, have you tested all models and modes with additional channels? I think this would warrant a new pytest. Be sure to test segmentation, pose, and OBB models as well.
- Also, I would recommend to change it to input_ch as camelcase isn't the standard for us
I see your tests above, but how would someone utilize these changes if they're using the model.train() method? Not necessarily something you have to reply to me with an answer for, but just wanted to bring it up. A majority of users are going to start training in this way.
I would recommend adding some updates to the Docs. Minimum would be to update the training arguments, but would be good to see an example of how to use it with the normal train() method.

These are all things for you to consider, but I don't need you or expect you to reply. My suggestions are only that, suggestions. I shared them as I believe they are important for the PR to be accepted, but doesn't ensure it will or not. Please be patient, we have so much going on and only a handful of people working on PR reviews and a single person to approve/merge changes. Thanks for the PR 🚀

edkazcarlson added 9 commits April 12, 2024 18:35

stash initial progress

8190638

rename params

6700ca1

fix basedataset and children

9fca15c

validator has append and override

0aa75fb

stash

65c0c6e

stash

e52bb57

update

9b69212

stash

75d4b9a

clean up w comment

8e9898f

Auto-format by https://ultralytics.com/actions

79a1828

github-actions bot reviewed Apr 25, 2024

View reviewed changes

edkazcarlson added 2 commits April 24, 2024 21:20

merge w main

71eb701

fix typo

8e33824

edkazcarlson changed the title ~~User/ecarlson/adding transformation compatability~~ Allowing for user defined transforms Apr 28, 2024

Burhan-Q added the enhancement New feature or request label May 13, 2024

Merge branch 'main' into user/ecarlson/adding-transformation-compatab…

4557f7f

…ility

Merge branch 'main' into user/ecarlson/adding-transformation-compatab…

829694e

…ility

edkazcarlson and others added 4 commits May 29, 2024 18:19

Merge branch 'main' into user/ecarlson/adding-transformation-compatab…

2df3330

…ility

Merge branch 'main' into user/ecarlson/adding-transformation-compatab…

3a889b4

…ility

Auto-format by https://ultralytics.com/actions

00a9907

Merge branch 'main' into user/ecarlson/adding-transformation-compatab…

eb5e012

…ility

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allowing for user defined transforms #10308

Allowing for user defined transforms #10308

edkazcarlson commented Apr 25, 2024 •

edited

github-actions bot commented Apr 25, 2024 •

edited

github-actions bot left a comment

codecov bot commented Apr 25, 2024 •

edited

edkazcarlson commented Apr 25, 2024

edkazcarlson commented May 29, 2024

glenn-jocher commented May 29, 2024

Burhan-Q commented Jun 7, 2024

Allowing for user defined transforms #10308

Are you sure you want to change the base?

Allowing for user defined transforms #10308

Conversation

edkazcarlson commented Apr 25, 2024 • edited

🛠️ PR Summary

🌟 Summary

📊 Key Changes

🎯 Purpose & Impact

github-actions bot commented Apr 25, 2024 • edited

github-actions bot left a comment

Choose a reason for hiding this comment

codecov bot commented Apr 25, 2024 • edited

Codecov Report

edkazcarlson commented Apr 25, 2024

edkazcarlson commented May 29, 2024

glenn-jocher commented May 29, 2024

Burhan-Q commented Jun 7, 2024

edkazcarlson commented Apr 25, 2024 •

edited

github-actions bot commented Apr 25, 2024 •

edited

codecov bot commented Apr 25, 2024 •

edited