基于遥感小目标XView数据集来实现一个目标检测任务—— 遥感小目标Xview原始目标检测数据集

张

张建站

2026/6/6 0:48:35

10分钟阅读

基于遥感小目标XView数据集来实现一个目标检测任务——遥感小目标Xview原始目标检测数据集数据集已经包括60个类别数据集是txt和json格式数据集已划分为训练集/验证集yolo可用相关Faster RCNN/Cascade RCNN/SSD/Retinant模型也可以下文章代码仅供参考基于遥感小目标XView数据集来实现一个目标检测任务。为了简化流程选择YOLOv5作为目标检测模型并使用PyTorch框架进行实现。以下是详细的步骤环境准备确保您已经安装了以下软件和库Python 3.8 或更高版本PyTorch 1.9 或更高版本torchvision 0.10 或更高版本OpenCVnumpypandasmatplotlibalbumentations用于数据增强您可以使用以下命令安装所需的Python库pipinstalltorch torchvision opencv-python numpy pandas matplotlib albumentations数据集准备假设您的数据集已经按照YOLO格式组织好并且包含训练集和验证集。如果还没有转换为YOLO格式我们需要先进行数据转换。数据集结构datasets/ ├── xview/ │ ├── images/ │ │ ├── train/ │ │ └── val/ │ └── labels/ │ ├── train/ │ └── val/ └── xview_classes.txtxview_classes.txt文件包含60个类别的名称每行一个类别名称。数据转换脚本如果您的数据集是JSON格式可以使用以下脚本来将其转换为YOLO格式的TXT文件。假设JSON文件中包含边界框信息和类别标签。[titleConvert JSON to YOLO Format]importjsonimportosfrompathlibimportPathimportcv2defconvert_json_to_yolo(json_file,output_dir,image_dir):withopen(json_file,r)asf:datajson.load(f)categories{category[id]:category[name]forcategoryindata[categories]}class_namessorted(categories.values())class_dict{class_name:idxforidx,class_nameinenumerate(class_names)}withopen(output_dir/classes.txt,w)asf:forclass_nameinclass_names:f.write(f{class_name}\n)forannotationindata[annotations]:img_idannotation[image_id]bboxannotation[bbox]category_idannotation[category_id]image_infonext(itemforitemindata[images]ifitem[id]img_id)image_pathimage_dir/image_info[file_name]width,heightimage_info[width],image_info[height]label_pathoutput_dir/labels/(Path(image_info[file_name]).stem.txt)withopen(label_path,a)asf:class_idxclass_dict[categories[category_id]]x_center(bbox[0]bbox[2]/2)/width y_center(bbox[1]bbox[3]/2)/height wbbox[2]/width hbbox[3]/height f.write(f{class_idx}{x_center}{y_center}{w}{h}\n)# Example usagejson_filepath/to/xview/annotations.jsonoutput_dirPath(datasets/xview/)image_dirPath(path/to/xview/images/train/)# Change to val/ for validation setconvert_json_to_yolo(json_file,output_dir,image_dir)模型训练我们将使用YOLOv5进行训练。首先克隆YOLOv5仓库并设置环境。gitclone https://github.com/ultralytics/yolov5.gitcdyolov5 pipinstall-rrequirements.txt准备配置文件创建一个hyp.scratch.yaml文件来定义超参数# Hyperparameters for YOLOv5 training from scratchlr0:0.01# initial learning rate (SGD1E-2, Adam1E-3)lrf:0.1# final OneCycleLR learning rate (lr0 * lrf)momentum:0.937# SGD momentum/Adam beta1weight_decay:0.0005# optimizer weight decay 5e-4warmup_epochs:3.0# warmup epochs (fractions ok)warmup_momentum:0.8# warmup initial momentumwarmup_bias_lr:0.1# warmup initial bias lrbox:0.05# box loss gaincls:0.5# cls loss gaincls_pw:1.0# cls BCELoss positive_weightobj:1.0# obj loss gain (scale with pixels)obj_pw:1.0# obj BCELoss positive_weightiou_t:0.20# IoU training thresholdanchor_t:4.0# anchor-multiple thresholdfl_gamma:0.0# focal loss gamma (efficientDet default gamma1.5)hsv_h:0.015# image HSV-Hue augmentation (fraction)hsv_s:0.7# image HSV-Saturation augmentation (fraction)hsv_v:0.4# image HSV-Value augmentation (fraction)degrees:0.0# image rotation (/- deg)translate:0.1# image translation (/- fraction)scale:0.5# image scale (/- gain)shear:0.0# image shear (/- deg)perspective:0.0# image perspective (/- fraction), range 0-0.001flipud:0.0# image flip up-down (probability)fliplr:0.5# image flip left-right (probability)mosaic:1.0# image mosaic (probability)mixup:0.0# image mixup (probability)copy_paste:0.0# segment copy-paste (probability)paste_in:0.0# segment paste-in (probability)rect:0# rectangular trainingresume:false# resume training from last checkpointnosave:false# only save final checkpointnoval:false# only validate final epochnoautoanchor:true# disable AutoAnchorevolve:false# evolve hyperparametersbucket:# gsutil bucketcache_images:false# cache images for faster trainingimage_weights:false# use weighted image selection for trainingsingle_cls:false# train multi-class data as single-classoptimizer:SGD# optimizer: SGD or AdamWsync_bn:false# use SyncBatchNorm, only available in DDP modeworkers:8# dataloader workers (max is number of CPU cores)freeze:0# freeze first n layersv5_metric:true# assume maximum recall as 1.0 in AP calculationmulti_scale:true# vary input size between 320 and 640 pixelsrect_training:false# rectangular trainingcos_lr:false# cosine LR schedulerclose_mosaic:1000# close mosaic borderscales:[0.5,1.5]# image size scalesaugment:true# augment dataverbose:false# verbose printseed:0# random seed for reproducibilitylocal_rank:-1# ddp device id (-1 for single gpu train)entity:null# WB entityupload_dataset:False# upload dataset as WB artifact tablebbox_interval:-1# WB bounding box logging intervalartifact_alias:latest# version of dataset artifact to useproject:runs/train# save results to project/nameexist_ok:false# existing project/name ok, do not incrementquad:false# quadrilateral anchorslinear_assignment:false# use linear assignment for NMS创建一个data.yaml文件来定义数据集路径和类别train:../datasets/xview/images/train/val:../datasets/xview/images/val/nc:60# number of classesnames:[class1,class2,...,class60]# list of class names将names字段替换为实际的类别名称。训练模型使用以下命令开始训练python train.py--img640--batch16--epochs50--datadata.yaml--cfgyolov5s.yaml--weightsyolov5s.pt--hyphyp.scratch.yaml结果评估训练完成后可以使用以下命令评估模型性能python val.py--datadata.yaml--weightsruns/train/exp/weights/best.pt--tasktest使用说明配置路径确保datasets/xview/目录结构正确。确保data.yaml中的路径和类别名称正确。运行脚本在终端中依次运行数据转换脚本、训练脚本和评估脚本。注意事项根据需要调整超参数和训练设置。可以通过修改data.yaml中的cfg参数来选择不同的YOLOv5模型架构如yolov5m.yaml,yolov5l.yaml,yolov5x.yaml。示例假设您的数据集文件夹结构如下datasets/ └── xview/ ├── annotations.json ├── images/ │ ├── train/ │ └── val/ └── labels/ ├── train/ └── val/并且annotations.json包含所有必要的注释信息。运行上述脚本后您可以查看训练日志和最终的模型权重文件。总结通过上述步骤我们可以构建一个全面的目标检测系统包括数据集准备、数据转换、模型训练和结果评估。以下是所有相关的代码文件数据转换脚本(convert_json_to_yolo.py)YOLOv5超参数配置文件(hyp.scratch.yaml)YOLOv5数据集配置文件(data.yaml)