sniklaus / pytorch-hed Goto Github PK

View Code? Open in Web Editor NEW

470.0 470.0 107.0 53 MB

a reimplementation of Holistically-Nested Edge Detection in PyTorch

License: GNU General Public License v3.0

Python 100.00%

cuda deep-learning python pytorch

pytorch-hed's People

Contributors

Stargazers

Watchers

Forkers

israelabebe oxrider mengkunzhao chaoso kafeiyin00 fengpanhe ssgood xychen9459 huahongzhang jacke121 anyuzoey sybil12 evilhamburger jingwanli6666 teeso skadambi20 tzn314 ccfccl pandinosaurus yacobby jinqi-cheng shengzhang90 tqwhdx19 lejeunel yangtong1989 jameswang007 maphysart blandocs tailororrr nywang2019 cqray1990 aghasemi hello-trouble buffaloiron ericelmoznino hanchenn code-by-lk luojie326 chuanqil cv-ip elttaes scnulpc wolverine45825243 pkuwison navidk381 timmo-prog davidelanz kaedenbrinkman peter8640 gangma2610 2929dance gitshohoku yonghoonkwon ccliangxd ml-edu lkampoli randintgenr lilipopololo hanrui56 yuliangguo yueyedeai fu-yanyuan trendingtechnology tarun005 jaehyek thewolftail queenzk520 jiamur pidanchaoren yinmange-mandy annbless jcccking superemperor000 sicilialeco zjchust zoequ jungjaewon parhomesmaeili linyaoge sjtuwangjy fcoclavero yyang181 kopurian camillarhodes taited azr919 jackzhousz xbkaishui mingjunhuang daisybby mohsenumn xvwcreator tranzmatt viettham1998 versua0 paddlejitlab yuyanze tzviaharon johconstantine bakaowo0407

pytorch-hed's Issues

I am trying to run the command "python run.py --model bsds500 --in ./images/sample.png --out ./out.png" and I am getting the error:

Traceback (most recent call last):
File "run.py", line 152, in
tenOutput = estimate(tenInput)
File "run.py", line 135, in estimate
netNetwork = Network().eval()
File "run.py", line 95, in init
self.load_state_dict({ strKey.replace('module', 'net'): tenWeight for strKey, tenWeight in torch.hub.load_state_dict_from_url(url='http://content.sniklaus.com/github/pytorch-hed/network-' + arguments_strModel + '.pytorch', file_name='hed-' + arguments_strModel).items() })
TypeError: load_state_dict_from_url() got an unexpected keyword argument 'file_name'

HED: side-output

Hello! Thanks for your work on re-implement HED from image to image. It's a practical Project! Can you teach me how to get HED side-output(eg. side-output 4)? HED's output is a fuse output ,I want to compare side-output with fuse output.

How to run on CPU?

Is there a way to run this on CPU?

any further step suggested to obtain slim edges

Hi,
Really appreciate your re-implement, do you have any suggestions for obtaining slimmer edges.
Thank you
Best

AttributeError: 'Tensor' object has no attribute 'clip'

Hi there, it seemed that your latest commit(5ef88c3) didn't work well with PyTorch 1.6? After python run.py, I got the following error,

Traceback (most recent call last):
File "run.py", line 154, in
PIL.Image.fromarray((tenOutput.clip(0.0, 1.0).numpy().transpose(1, 2, 0)[:, :, 0] * 255.0).astype(numpy.uint8)).save(arguments_strOut)
AttributeError: 'Tensor' object has no attribute 'clip'

When I changed the clip back to clamp, it worked. Could you be so kind to provide me with any clues on this? Thanks.

May have found a method to get a better ODS score

I'll try to make a pull request soon, but I actually just replaced the use of numpy with torchvision's transforms of PILToTensor() and ToPILImage()

RGB Normalization problem

Hi, I noticed that in the forward function it normalizes the input as

tenBlue = (tenInput[:, 0:1, :, :] * 255.0) - 104.00698793
tenGreen = (tenInput[:, 1:2, :, :] * 255.0) - 116.66876762 
tenRed = (tenInput[:, 2:3, :, :] * 255.0) - 122.67891434

But when I noramlized it by ImageNet RGB mean[0.485, 0.456, 0.406] as follows:

tenBlue = tenInput[:, 0:1, :, :]  - 0.406
tenGreen = tenInput[:, 1:2, :, :] - 0.456
tenRed = tenInput[:, 2:3, :, :] - 0.485

The output become blurry:

I feel very confused about this cause it seems to be same normalization process

about pre-trained model

can you tell me how to open network-bsds500.pytorch（i.e.the file download use download.bash）, i use .txt or .py the file will change to garbled。so，can you provide .pth file,or tell me how to open it?

How can I adjust the level of detail of the output

Hi, thank for your work! When I apply the network to some images, I found that the output is not fine-grained enough due to style gap and low resolution of source image. So how can I generate edge image with more detail? Change some hyperparameters?

BN layer

I have read some source code about many edge detection network, all of the network are without BN layer, why was that?

Using HED as a term of feature-based loss

Hi sniklaus,
Thanks very much for u excellent implementation.
There is a qusetion that I'd like to ask you for advice.
Have you ever used HED's feature map as a supervised term, just similar to VGG loss?
It would be greatly appreciated if you could offer me any suggestions!
Best,
Melon

bsds500 weights unavailable

Hey,

I just wanted to run your re-implementation of the pytorch HED model and the weights url http://content.sniklaus.com/github/pytorch-hed/network-bsds500 seems to point nowhere.

Keep up the good work!

PyTorch HED

Hello!

I really liked the idea of finding the edges of images with Holistically-Nested Edge Detection. But unfortunately, even on the sample.png image example, when I launch code "run.py" I have an error "option -f not recognized". I don't know how to solve this problem.

I also try to use reimplementation Pytorch Holistically-Nested Edge Detection with sample.png image example, I don’t get the same result as in the description, after running the code I get an image in gray shades and unlike contours.

I will be grateful for the answer!

How to test my own images with 512X512 size

thank you for sharing. How to test my own images with 512X512 size using your pretrained model?

Can you share the pretrained model?

Hello, I found you deleted the bash file. Can you share the pretrained model?

problem of the pretrained model

HI ,good morning. So sorry to bother you . There are something wrong with the pretrained network as follows .

deserialized_objects[key]._set_from_file(f, offset, f_should_read_directly)
RuntimeError: unexpected EOF, expected 4922858 more bytes. The file might be corrupted.

How to run without CUDA?

It shows error on my laptop:
AssertionError: Torch not compiled with CUDA enabled

Is it possible to run without CUDA?

Multiple images as input

Hello, @sniklaus Thanks for your contribution! Is there a way to generate edges from multiple images under a specific directory with your run.py? It seemed that your run.py can only process a single image as input so far. Thanks.

Training details

Could you please provide the details of training your pytorch model? Thanks!

model

你好，我还没有运行原代码，我想看一下这个模型的效果。
你能不能分享一下bsds500 的模型，感激不尽。

Where is the model file

F-Score is terrible for individual images

`Hello,

I took images from the BSDS dataset, and individually analyzed the F-Scores. I observed that the result is pretty bad in general, and I am not sure what I'm doing wrong. I am suspecting a bug, but I'm not confident.

Image vs ground truth:

HED output:

And of course, if the value of an individual cell is greater than 0.5, I round it off to 1 before estimating the F1 score. The image after rounding off looks like this:

Then, I calculate the F-Score using the following code:

Here, the gt is the groundtruth and the pred array is the prediction image from HED.

tp = np.sum(pred[np.where(pred == 1)] == gt[np.where(pred == 1)])
fp = np.sum(pred[np.where(pred == 1)] != gt[np.where(pred == 1)])
precision = tp / (tp + fp)

# recall
fn = np.sum(pred[np.where(pred == 0)] != gt[np.where(pred == 0)])
recall = tp / (tp + fn)

# f-value
fvalue = 2 * precision * recall / (precision + recall)

I get a value (F1 score) of 0.1 to 0.2.
The paper and the Github descriptions state much higher values (>0.7). Can anyone guide me regarding what could be going wrong? Since I am using the same metric and the same score, it is a little tricky.

Any help is appreciated.

problem

I want to know if this code is used for training or testing? If it is for training, why would the result be directly output?

RuntimeError: shape '[1, 3, 320, 480]' is invalid for input of size 614400

Traceback (most recent call last): File "run.py", line 153, in <module> tenOutput = estimate(tenInput) File "run.py", line 145, in estimate return netNetwork(tenInput.cuda().view(1, 3, intHeight, intWidth))[0, :, :, :].cpu() RuntimeError: shape '[1, 3, 320, 480]' is invalid for input of size 614400
I tried to test my own image, but I got this error. The difference between sample.png and my own image.png is the bit depth. Mine is 32 bit depth.