PASCAL VOC: Pascal voc dataset have a xml file for each image.
YOLO: YOLO dataset have a txt file for each image.
COCO: COCO dataset have a json file for each image.
Currently, the following formats are supported:
| from | to | implemented |
|---|---|---|
| PASCAL VOC | YOLO(TXT files) | Yes |
| YOLO | PASCAL VOC (XML files) | Yes |
| from | to | Issue/PR(if any) |
|---|---|---|
| PASCAL VOC | COCO (JSON files) | No |
| PASCAL VOC | TFRecord (TFRecord files) | No |
| COCO | PASCAL VOC (XML files) | No |
| COCO | YOLO (TXT files) | No |
| COCO | TFRecord (TFRecord files) | No |
| YOLO | COCO (JSON files) | No |
| YOLO | TFRecord (TFRecord files) | No |
git clone https://github.com/codePerfectPlus/dataset-convertor/cd dataset-convertor
python -m venv venvsource venv/bin/activate
pip install requirements.txtpip install dataset-convertorconvert annotations from one format to another format.
dataset formatting example:
- data/pascal_voc/JPEGImages/*.jpg
- data/pascal_voc/Annotations/*.xml
- data/yolo5/JPEGImages/*.jpg
- data/yolo5/labels/*.txt
from convert import Convertor
con = Convertor(input_folder='/home/user/data/pascal_voc', output_folder='/home/user/data/yolo5')
con.voc2yolo()from convert import Convertor
con = Convertor(input_folder='/home/user/data/yolo5', output_folder='/home/user/data/pascal_voc')
con.yolo2voc()create an issue/PR if any format is missing.Open-source contribution is welcome.check the contributing guide for details.
- PASCAL VOC - http://host.robots.ox.ac.uk/pascal/VOC/
- COCO - http://cocodataset.org/
- YOLO9000 - https://arxiv.org/abs/1612.08242
- YOLO4 - https://arxiv.org/abs/2004.10934v1