Darkflow: How do I evaluate accuracy of the test set?

Created on 5 Mar 2018  ·  34 Comments  ·  Source: thtrieu/darkflow

One way is to calculate mean average precision (mAP),
but I'm not aware of this feature being implemented in darkflow.
Do you have any suggestions on which darknet repo or person has written a script to do this?

All 34 comments

https://github.com/thtrieu/darkflow/blob/master/darkflow/utils/box.py has a basic implementation of what you need for evaluation: computing the overlap (IoU) between two boxes. You can modify it as needed and compute the mean average precision using the product of the class probability and box_iou.
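To make that concrete, here is a minimal, illustrative sketch of a box-IoU computation (not darkflow's actual box.py code), assuming boxes are given as (left, top, right, bottom) corner coordinates:

```python
def box_iou(box_a, box_b):
    """Intersection over union of two boxes given as (left, top, right, bottom)."""
    # Corners of the intersection rectangle
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Clamp to zero so disjoint boxes give no intersection area
    iw = max(0.0, ix2 - ix1)
    ih = max(0.0, iy2 - iy1)
    inter = iw * ih
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

Identical boxes give 1.0, disjoint boxes give 0.0, and everything else falls in between; that ratio is what gets thresholded (typically at 0.5) when deciding whether a detection counts as a true positive.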

Hasn't anyone done this before?
Does no one test their accuracy?
I don't know why such important functionality isn't implemented yet.
If there is none, maybe I'll need to implement this feature myself and commit it for good.

I would also be interested in this feature.

@off99555, I saw this guy working on F1 score.

#377

Hello @off99555, I am also looking for a way to calculate mAP on my own test data.
I think @sandeeprepakula has a good answer; can you explain step by step how to calculate mAP from the box IoU? I'm very new to deep learning and object detection.

Thanks for your help

@andikira I've not done it yet. I'm not sure either.

Hello @off99555, you should look at this repo: https://github.com/Cartucho/mAP. Then see the extra folder; you can convert .xml and .json files there. That repo works perfectly with darkflow.
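For reference, the .xml conversion mentioned above does roughly the following. This is only an illustrative sketch, not the repo's actual script, assuming the mAP repo's ground-truth format of one `<class_name> <left> <top> <right> <bottom>` line per object and standard PASCAL VOC annotation XML:

```python
import xml.etree.ElementTree as ET

def voc_xml_to_map_gt(xml_string):
    """Convert one PASCAL VOC annotation (as an XML string) into the mAP
    repo's ground-truth lines: '<class_name> <left> <top> <right> <bottom>'."""
    root = ET.fromstring(xml_string)
    lines = []
    for obj in root.iter("object"):
        name = obj.find("name").text
        bb = obj.find("bndbox")
        # VOC stores corners as xmin/ymin/xmax/ymax
        coords = [bb.find(k).text for k in ("xmin", "ymin", "xmax", "ymax")]
        lines.append(" ".join([name] + coords))
    return "\n".join(lines)
```

In practice you would run this over every annotation file and write one .txt file per image into the repo's ground-truth folder; for real use, prefer the maintained scripts in the extra folder.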

@andikira @thtrieu I trained the yolo2-voc network on the PASCAL VOC 0712 trainval set and tested on the PASCAL VOC 2007 test set, and the mAP is only around 52% (after quite a lot of epochs, around 200, considering that I initialized it with pretrained weights). That is well below the official result (76.8% mAP). I want to include experimental results using YOLO v2 in my paper. Does anyone know the reason?

I have seen that issue before; please check the other issues, @bareblackfoot.

@andikira The mAP repo works wonderfully. I contributed a script there too. Thank you!

@andikira The mAP repo works wonderfully. I contributed a script there too. Thank you!

Could you please explain more specifically? How does it work? How did you get your results?

@srhtyldz Go to https://github.com/Cartucho/mAP and read the quickstart section.
The idea is that you clone the repo, predict your images using darkflow, and then copy the prediction files into the mAP folder. Also copy the ground-truth files.
Then do everything as the README of the mAP repo suggests, and you should see mAP as a single-number output.

Also, you need to convert your files from darkflow's format to theirs. They provide the conversion scripts here: https://github.com/Cartucho/mAP/tree/master/extra
Read the README.
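A rough sketch of what that conversion involves on the prediction side, assuming darkflow's --json output (a list of objects with label, confidence, topleft, and bottomright keys) and the mAP repo's detection-results format of one `<class> <confidence> <left> <top> <right> <bottom>` line per detection; for real use, rely on the scripts in the extra folder:

```python
import json

def darkflow_to_map_detections(json_text):
    """Convert darkflow --json predictions for one image into the mAP repo's
    detection-results lines: '<class> <confidence> <left> <top> <right> <bottom>'."""
    lines = []
    for det in json.loads(json_text):
        tl, br = det["topleft"], det["bottomright"]
        lines.append("{} {:.6f} {} {} {} {}".format(
            det["label"], det["confidence"],
            tl["x"], tl["y"], br["x"], br["y"]))
    return "\n".join(lines)
```

You would write the returned lines to a .txt file with the same base name as the image, then drop it into the repo's detection-results folder alongside the converted ground truth.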

I found out that darknet has a map option. Did you try that? Is there a difference between them, or is one more accurate?

@srhtyldz I think the algorithm is the same, except that mAP has some parameters you can set. Usually these are set manually by whoever implements the algorithm, so the only time you'll see them report different numbers is when they use different parameters.
I suggest you read an explanation of how mAP works so that you understand its hidden parameters.
I tried them both. I went with darknet in my thesis because darkflow does not have an option to specify the output folder. That was critical for my senior project because I ran the code on FloydHub, a deep learning cloud that only allows you to write to certain folders.

First I used darkflow with the mAP repo, but I later found out that I couldn't specify the output path, so I switched to darknet and used its internal mAP computation.

And most importantly, the owner of the darknet fork (AlexeyAB) is very helpful. He answered all the issues I posted, which helped me finish before the deadline. It was very stressful back then. I am very thankful to him; I also credited his name in my thesis.

First I used darkflow with the mAP repo, but I later found out that I couldn't specify the output path, so I switched to darknet and used its internal mAP computation.

Which variable do you want to specify ?

First I used darkflow with the mAP repo, but I later found out that I couldn't specify the output path, so I switched to darknet and used its internal mAP computation.

My vote is also for darknet. I think the only thing you can adjust when calculating mAP is the threshold, right?

Which variable do you want to specify ?

@srhtyldz Darkflow, about half a year ago when I started this thread, didn't allow me to specify the output folder path. The output folder path is where the prediction images are written.
If I remember correctly, the output folder is set to an out folder inside your dataset folder.
E.g. if you have some images inside img/, the predictions will be inside img/out/.
On FloydHub, you aren't allowed to write output to the dataset folder. So if the dataset folder is img/, you can't create a new folder as img/out, because you will get a permission-denied error.

Maybe darkflow allows you to specify the output path now, but you have to check. I am not sure.

My vote is also for darknet. I think the only thing you can adjust when calculating mAP is the threshold, right?

If I remember correctly, mAP has one parameter that the evaluator needs to define: a threshold from 0 to 1 that decides whether a detection is counted as correct. Usually this is set to 0.5, so if the overlap ratio goes above 0.5, the detection counts as correct.
After that you also have to define how the Average Precision is computed. To find the average, you need to decide how many points to sample the precision at. It turns out you can choose this number too. Usually it is set to 11, so the points line up at 0, 0.1, 0.2, ..., 0.9, 1.0.

I'm not sure about everything I'm saying, because I have already forgotten most of it. You should study it yourself to make sure I'm right. But I'm fairly sure about the 0.5 and 11 numbers; I just don't remember what they are called.
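The "11 points" idea above can be sketched as follows. This is an illustrative PASCAL-VOC-2007-style computation (11-point interpolated average precision), not code from either repo, and it assumes you already have matched (recall, precision) pairs for one class:

```python
def eleven_point_ap(recalls, precisions):
    """11-point interpolated average precision (PASCAL VOC 2007 style).

    For each recall level t in {0.0, 0.1, ..., 1.0}, take the maximum
    precision among all points whose recall is >= t, then average the
    11 sampled values."""
    ap = 0.0
    for t in [i / 10.0 for i in range(11)]:
        candidates = [p for r, p in zip(recalls, precisions) if r >= t]
        ap += max(candidates) if candidates else 0.0
    return ap / 11.0
```

A detector with precision 1.0 at every recall level gets AP 1.0; a detector that reaches only 50% recall at precision 1.0 gets 6/11, since only the recall levels 0.0 through 0.5 find a qualifying point.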

PS. I went back to find my thesis slides (from when I knew what mAP meant); here is how I described it back then:
[thesis slide images]

Thanks a lot. I'll research mAP and IoU. I got the results but I couldn't interpret them. Can I ask one more question? Did you use the 'recall' function? Do mAP and the recall function have the same functionality or not? Do you have any ideas?

@srhtyldz mAP is calculated from precision as a function of recall, so precision and recall are already included. mAP is the popular, conventional evaluation metric for object detection tasks, so you don't have to compute recall or precision individually. Just compute mAP; if it's high, your model performs well.
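To make "precision as a function of recall" concrete, here is a hedged sketch of building that curve from detections sorted by descending confidence, where each detection has already been marked as a true or false positive at some IoU threshold (all names here are illustrative):

```python
def precision_recall_curve(is_tp_sorted, num_gt):
    """Cumulative precision/recall curve for one class.

    is_tp_sorted: detections sorted by descending confidence, True if the
                  detection matched a ground-truth box, False otherwise.
    num_gt:      total number of ground-truth boxes for this class.
    Returns (recalls, precisions), one point per detection."""
    tp = fp = 0
    recalls, precisions = [], []
    for hit in is_tp_sorted:
        if hit:
            tp += 1
        else:
            fp += 1
        precisions.append(tp / (tp + fp))  # fraction of detections so far that are correct
        recalls.append(tp / num_gt)        # fraction of ground truths found so far
    return recalls, precisions
```

Averaging the precision sampled along this curve (e.g. at the 11 recall levels mentioned earlier) gives the AP for one class, and averaging APs over classes gives mAP.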

I got 90.4% mAP. I think it performs well.

@srhtyldz That's very high if you have many classes (popular models on big datasets only reach around 50-60%). But even if you have only two classes, it's still good.
Make sure to check the predicted images with human eyes to verify that the mAP is reasonable.

I have only one class. Also, I saved all images with their bounding boxes.
Did you draw the mAP chart? In Alexey's repo there is an mAP chart, and while training your model you should pass '-map' to get the chart. I didn't pass this flag. Even when you pass it, how do you get the mAP chart? Did you try it? Also, did you try the 'valid' function? Do you know the differences?
I know I've asked many questions, but your answers are very important to my thesis :).

From darknet, I just evaluate mAP in the terminal and take a snapshot of the console.
I don't know about the valid function.
For the mAP chart, it's possible if you use darkflow with the mAP repo above. There is a script which generates the animation and charts.
I think the mAP repo may also work with darknet.

The problem is that I don't know how the animation is run. I tried it while training, but I couldn't get the animation for mAP. Maybe @AlexeyAB can help with this.

I got the mAP of my test results. Out of 56 images, 48 were correct in my testing. I have four classes. How do I make a confusion matrix from that? Can someone help me?
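Neither repo discussed here outputs a confusion matrix directly, but one way to build one is to greedily match each ground-truth box to its highest-IoU detection and count class pairs. This is only an illustrative sketch under that assumption (function and parameter names are made up; a per-class greedy match by confidence would be more rigorous):

```python
def _iou(a, b):
    """IoU of two (left, top, right, bottom) boxes."""
    ix = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union else 0.0

def detection_confusion(gt, dets, classes, iou_thresh=0.5):
    """Confusion matrix for detections on one image (or a whole set).

    gt, dets: lists of (class_name, (l, t, r, b)); dets should already be
    filtered by a confidence threshold. Rows are ground-truth classes,
    columns are predicted classes; the extra last row/column collects
    spurious detections and missed ground truths ('background')."""
    n = len(classes)
    idx = {c: i for i, c in enumerate(classes)}
    m = [[0] * (n + 1) for _ in range(n + 1)]
    used = set()
    for g_cls, g_box in gt:
        # Greedily pick the unused detection with the highest overlap
        best_iou, best_j = 0.0, None
        for j, (_, d_box) in enumerate(dets):
            if j in used:
                continue
            iou = _iou(g_box, d_box)
            if iou > best_iou:
                best_iou, best_j = iou, j
        if best_j is not None and best_iou >= iou_thresh:
            used.add(best_j)
            m[idx[g_cls]][idx[dets[best_j][0]]] += 1
        else:
            m[idx[g_cls]][n] += 1  # missed ground truth
    for j, (d_cls, _) in enumerate(dets):
        if j not in used:
            m[n][idx[d_cls]] += 1  # detection with no matching ground truth
    return m
```

With four classes this yields a 5x5 matrix whose diagonal counts correct detections, off-diagonal cells count class confusions, and the last row/column shows false positives and misses.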

@srhtyldz I used the ./darknet detector map cfg/aerial.data cfg/test_tiny.cfg backup\my_yolov3_tiny_final.weights command, but I could not get any score or graph at the end of training. How did you get the mAP score in darknet?

@mustafabuyuk Pass the -map argument to ./darknet (darknet.exe). It will show up on the graph at around 1000 iterations. I'm not sure if it's required, but I also have a validation set of images.

Does anyone have an idea? YOLOv5 calculates mAP against the validation dataset during training. With which command can we calculate mAP (and class-wise mAP) against an unseen test dataset?

@off99555 The thread seems closed. Did you get the answer?
I have a set of images and YOLO annotation files (in txt format) for validation.

How do I use the -map argument when validating the images to get the mAP score? Will it be possible to derive a confusion matrix from this?

@sandeeprepakula Read my answer directly before I closed the issue; that's how you can calculate mAP.
But I would suggest further: don't use this repo anymore. I haven't worked with object detection for a few months now, so I'm not sure which repo is currently the best, but from a quick look, the last commit of this repo was in Feb 2020. Do you think a repo that hasn't been updated for a year is worth using? If you get stuck on something, no one will help you. Consider the community as well when choosing which repo to use.

@off99555 Thanks for the suggestion
