
Train on your own 512 x 512 size image #32

Open
zts12 opened this issue Sep 21, 2024 · 12 comments

zts12 commented Sep 21, 2024

For 512×512 images, do I need to modify the config settings to train on my own dataset? (I am using the SpaceNet configuration.) Also, do you have any advice on the training batch? The results after 30 epochs are not ideal. Thank you very much for your work on the SAM-Road project; I look forward to your answer whenever your schedule allows.

htcr (Owner) commented Sep 24, 2024

I think the CityScale setup defaults to a 512×512 patch size. Can you try that? Batch size depends on your GPU memory; I would start with the largest batch size you can fit, then tune the learning rate properly to make sure training converges. It may need some trial and error.
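
(For illustration, a common heuristic when changing batch size is to scale the learning rate linearly with it; the base values below are hypothetical placeholders, not SAM-Road defaults.)

```python
# Linear LR scaling heuristic: grow the learning rate by the same
# factor as the batch size. Base values are assumed placeholders.
BASE_BATCH_SIZE = 16   # batch size the base LR was tuned for (assumed)
BASE_LR = 1e-4         # learning rate at BASE_BATCH_SIZE (assumed)

def scaled_lr(batch_size: int) -> float:
    """Scale the learning rate linearly with the actual batch size."""
    return BASE_LR * batch_size / BASE_BATCH_SIZE

print(scaled_lr(64))  # 4x the batch -> 4x the LR, i.e. 4e-04
```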

zts12 (Author) commented Sep 24, 2024

Thank you very much for your answer. This is the result of my current road extraction on my own 0.5 m-resolution imagery using the SpaceNet configuration, and it is not very good. Do you have any suggestions?
[bj000035_mask_road_mask.png]

zts12 (Author) commented Sep 24, 2024

[bj000036_mask_road_mask.png]
[bj000041_mask_road_mask.png]
[bj000046_mask_road_mask.png]
[bj000049_mask_road_mask.png]
[bj000056_mask_road_mask.png]

htcr (Owner) commented Sep 24, 2024

I think our released model takes 1.0 m/pixel images. Can you try resizing your images to that resolution? Also, have you fine-tuned on your own dataset? How large is your dataset?
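
(A minimal sketch of that resampling with Pillow; the file names are hypothetical. Downsampling a 0.5 m/pixel image by 2× yields 1.0 m/pixel.)

```python
from PIL import Image

# A 0.5 m/pixel tile downsampled 2x becomes 1.0 m/pixel: the same
# ground extent is covered by half as many pixels per side.
img = Image.open("bj000035.png")  # hypothetical 512x512 tile at 0.5 m/pixel
img_1m = img.resize((img.width // 2, img.height // 2), Image.BILINEAR)
img_1m.save("bj000035_1m.png")    # 256x256 at 1.0 m/pixel
```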

htcr (Owner) commented Sep 24, 2024

Also, did you correctly load the pre-trained SAM checkpoints?
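
(For reference, loading an original checkpoint with the upstream segment-anything package looks like the sketch below; the path is a placeholder, and this is the upstream API rather than SAM-Road's own loading code.)

```python
from segment_anything import sam_model_registry

# Load the ViT-B SAM backbone from the official checkpoint (download
# sam_vit_b_01ec64.pth from the segment-anything releases first).
sam = sam_model_registry["vit_b"](checkpoint="checkpoints/sam_vit_b_01ec64.pth")
print(sum(p.numel() for p in sam.parameters()))  # sanity check: roughly 94M
```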

zts12 (Author) commented Sep 24, 2024

Thank you again for your suggestions. I cropped my 0.5 m-resolution imagery into 512×512 tiles, loaded the pre-trained SAM checkpoint, adjusted the learning rate, and re-trained, but the results are far from the good results on the two datasets in the original paper. The dataset has 3,065 images in total: 2,453 for training, 459 for testing, and 153 for validation, split following the SpaceNet proportions; each image is 512×512 and has the corresponding graph data. If I convert the imagery to 1 m resolution, will the final result improve? The test results are as follows, and I am also unsure whether the keypoint and road thresholds should be modified:
======= Finding best thresholds ======
======= keypoint ======
Best threshold 0.01090240478515625, P=0.0 R=0.0 F1=nan
======= road ======
Best threshold 0.0965576171875, P=0.0 R=0.0 F1=nan
======= topo ======
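
(For context, F1 = 2PR/(P+R), so P = R = 0 gives 0/0 = nan. A threshold sweep along the lines of the hypothetical sketch below is what produces lines like these.)

```python
import numpy as np

# Hypothetical best-threshold sweep: scan thresholds over a predicted
# probability map and keep the one with the best F1 against the GT mask.
# When the model predicts nothing, P = R = 0 and F1 = 0/0 = nan, which
# is exactly what the log above shows.
def best_threshold(prob: np.ndarray, gt: np.ndarray, n_steps: int = 64):
    best_t, best_f1 = 0.0, -1.0
    for t in np.linspace(prob.min(), prob.max(), n_steps):
        pred = prob > t
        tp = float(np.logical_and(pred, gt).sum())
        p = tp / pred.sum() if pred.sum() else 0.0
        r = tp / gt.sum() if gt.sum() else 0.0
        f1 = 2 * p * r / (p + r) if (p + r) else float("nan")
        if f1 == f1 and f1 > best_f1:  # f1 == f1 filters out nan
            best_t, best_f1 = t, f1
    return best_t, best_f1
```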

htcr (Owner) commented Sep 24, 2024

Hi, if you are fine-tuning from the original SAM checkpoint (not the ones I released), resolution is less crucial. How do the images look in general? The numbers you show suggest the model did not converge at all. The dataset size sounds reasonable; can you try the following:

  1. Debug the label-generation logic. Do the GT masks look reasonable?
  2. See if the model can overfit just one example (a minimal sketch follows this list). If not, some hyperparameters are probably wrong.
  3. Try different batch sizes / learning rates.
  4. Apply some data augmentation. In the SAM-Road paper, random cropping and rotation were applied.
  5. Try zeroing individual loss terms to find which one is exploding.
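
(A sketch of the overfit-one-example sanity check from item 2; the tiny conv net and random data are placeholders, so swap in your real model and one real training example.)

```python
import torch
import torch.nn as nn

# Overfit a single (image, mask) pair: if the loss does not fall toward
# zero within a few hundred steps, the labels, loss wiring, or LR are
# likely broken. Model and data below are placeholders.
model = nn.Sequential(nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
                      nn.Conv2d(16, 1, 3, padding=1))
image = torch.rand(1, 3, 512, 512)                  # one fixed input tile
mask = (torch.rand(1, 1, 512, 512) > 0.9).float()   # its road GT mask

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.BCEWithLogitsLoss()

for step in range(300):
    optimizer.zero_grad()
    loss = loss_fn(model(image), mask)
    loss.backward()
    optimizer.step()
    if step % 50 == 0:
        print(step, loss.item())  # should steadily decrease toward 0
```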

Good luck with your experiments!

zts12 (Author) commented Sep 24, 2024

Sorry for the late reply, and thank you for your suggestions; I will run the experiments accordingly. I am a graduate student, and my current research direction is road extraction from high-resolution remote sensing imagery. Thank you for the discussion. Could we connect on WeChat? My WeChat ID is 18837621961; I would be honored.

EchoQiHeng commented
I trained SAM on the DeepGlobe dataset and the results were convincing, so I believe SAM is robust. Please carefully check your code.

zts12 (Author) commented Oct 8, 2024

Thank you for sharing your work. I also used DeepGlobe for training and testing: I cropped it into 512×512 images, then trained and tested, but the result is not very good, and even clearly visible roads are extracted incompletely. Could I ask about your config settings and how you split the dataset? Or is there some other modification or configuration work that I have missed? Thanks for your answer.


EchoQiHeng commented

I have shown the visualization results on the DeepGlobe validation set, and I believe the model has converged and is functioning as expected. I did not make any DeepGlobe-specific configuration changes. Of course, modifications to the SatMapDataset were necessary; my process primarily involved cropping and augmentation. Please carefully check your RGB images and the corresponding GT masks.
[iou: IoU curve during training]
[pred: prediction visualization]
Additionally, I have shown the IoU during the training process. Please provide more details and results from your experiments to facilitate further debugging.
[rgb: input RGB image]
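
(For illustration, a paired crop-and-augment step of the kind a SatMapDataset modification might involve; this is an assumed sketch, not EchoQiHeng's actual code.)

```python
import random
import torch
import torchvision.transforms.functional as TF

# Assumed sketch: apply the SAME random crop and right-angle rotation to
# an RGB tile and its road mask so the pair stays pixel-aligned.
def crop_and_augment(image: torch.Tensor, mask: torch.Tensor, size: int = 512):
    # image: (3, H, W) float tensor; mask: (1, H, W) float tensor
    _, h, w = image.shape
    top = random.randint(0, h - size)
    left = random.randint(0, w - size)
    image = TF.crop(image, top, left, size, size)
    mask = TF.crop(mask, top, left, size, size)
    angle = random.choice([0, 90, 180, 270])  # lossless right-angle rotations
    return TF.rotate(image, angle), TF.rotate(mask, angle)
```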

immarshmellow commented

Hi, I would like to ask about the exact steps for training and testing with DeepGlobe. I don't understand much of this yet and would appreciate any help.
