Deploying CVAT on AWS for Image and Video Annotation
Building a computer vision model starts with labelled data, and that labelling work is where a surprising amount of ML project time disappears. CVAT (Computer Vision Annotation Tool) is one of the ...

Source: DEV Community
Building a computer vision model starts with labelled data, and that labelling work is where a surprising amount of ML project time disappears. CVAT (Computer Vision Annotation Tool) is one of the strongest open-source options for the job. It handles bounding boxes, polygons, segmentation masks, keypoints, and object tracking across images and video. The challenge most teams hit is not CVAT itself but the infrastructure around it. This post covers deploying a pre-configured CVAT environment on AWS EC2 so you can skip the Docker Compose setup and get straight to annotating. What the pre-built AMI includes Multi-format annotation - bounding boxes, polygons, segmentation masks, keypoints, ellipses, cuboids, and video object tracking Export-ready datasets - YOLO (v5 through v11), COCO, Pascal VOC, TFRecord, and LabelMe formats OpenCV-powered assists - semi-automatic annotation, keyframe interpolation on video, and label manipulation utilities S3 storage integration - pre-wired, no manual b