CVAT 是一个用于计算机视觉的交互式视频和图像注释工具。

zaha 59e31b3118 Fix CI-nightly tests and refactoring cypress config (#7908) 1 天之前
.github 59e31b3118 Fix CI-nightly tests and refactoring cypress config (#7908) 1 天之前
.regal ab8674c0d3 Move rego files into their respective apps (#7806) 2 周之前
.vscode f3247fa5a8 Optimized analytics requests to ClickHouse (#7804) 2 周之前
changelog.d f7b47fe8e0 Fixed object count in analytics for skeletons and tracks (#7883) 2 天之前
components 57085e8850 Update the Nuclio version (#7787) 2 周之前
cvat f7b47fe8e0 Fixed object count in analytics for skeletons and tracks (#7883) 2 天之前
cvat-canvas 3e29537995 Fixed: Cannot read properties of undefined (reading 'addClass') (#7834) 2 周之前
cvat-canvas3d 123c394028 Client code refactoring (#7208) 5 月之前
cvat-cli 4ad53f1510 Update develop after v2.13.0 1 周之前
cvat-core dc74a20d9f Fixed vertical polylines difficult to select (#7860) 1 周之前
cvat-data 26750a9afb [GSoC2024] fix warped image loading (#7583) 1 月之前
cvat-sdk 4ad53f1510 Update develop after v2.13.0 1 周之前
cvat-ui 899735d284 Remove tasks by projectId from state after deleting project (#7854) 1 周之前
dev 0f821e4564 Add a GitHub workflow for finalizing a release (#6998) 7 月之前
helm-chart 49ef2a394c Update helm (#7894) 2 天之前
serverless 57085e8850 Update the Nuclio version (#7787) 2 周之前
site 57085e8850 Update the Nuclio version (#7787) 2 周之前
supervisord 6673bb156e Modify backend entrypoint to wait redis (inmem and ondisk) (#7479) 3 月之前
tests 59e31b3118 Fix CI-nightly tests and refactoring cypress config (#7908) 1 天之前
utils e6037afb39 Fix task creation with video file when there are no valid keyframes (#7838) 1 周之前
.bandit 53697ecac5 SDK layer 2 - cover RC1 usecases (#4813) 1 年之前
.codacy.yml 7512fd6883 Reformatted (#2349) 3 年之前
.coveragerc 09a10ca59d Code coverage (#6173) 11 月之前
.dockerignore e1ad07c134 Removed extra data from docker context (#7429) 3 月之前
.editorconfig cf4329af05 Override EditorConfig settings for YAML files (#6980) 7 月之前
.eslintignore 961bc58935 Webpack dev server proxy (#3368) 2 年之前
.eslintrc.cjs 123c394028 Client code refactoring (#7208) 5 月之前
.gitattributes e7585b8ce9 DL models as serverless functions (#1767) 3 年之前
.gitignore ab8674c0d3 Move rego files into their respective apps (#7806) 2 周之前
.gitmodules 9615436ecc Website with documentation (#3039) 3 年之前
.nycrc e76b0ea5fc Cypress. Exclude some files from instrumentation. (#3349) 2 年之前
.prettierignore e309f2f8bf fixed templates 3 年之前
.prettierrc 7512fd6883 Reformatted (#2349) 3 年之前
.pylintrc 5f58a0f7be Add 2nd layer of SDK (#19) 1 年之前
.remarkignore 8df8872a85 Fix predefined sorting for task data (#5083) 11 月之前
.remarkrc.js 25975467ea Fix all remark warnings (#3261) 3 年之前
.stylelintrc.json 5a69e67ad8 Fixed cards on project page, updated stylelint & css loader packages (#6551) 9 月之前
CHANGELOG.md 71a965cd3b Prepare release v2.13.0 1 周之前
CITATION.cff 6c57c0389a Update documentation and repo links after transfer to the cvat-ai organization (#7722) 1 月之前
Dockerfile 0c940fce36 Update server dependencies (#7845) 1 周之前
Dockerfile.ci e631943e6f Upgrade Node 16 => 20 (#7766) 1 月之前
Dockerfile.ui 967a5a6c32 [Snyk] Security upgrade nginx from mainline-alpine to 1.25.4-alpine3.18 (#7625) 1 月之前
LICENSE e269f13b50 Update LICENSE (#7301) 4 月之前
README.md 6c57c0389a Update documentation and repo links after transfer to the cvat-ai organization (#7722) 1 月之前
SECURITY.md c60b200502 Update SECURITY.md 1 年之前
backend_entrypoint.sh 08550f8d5f Support running CVAT with an external database via Docker Compose (#7055) 6 月之前
docker-compose.ci.yml 9fb582d26a Simplify the dev environment setup instructions by reusing Compose files (#7254) 5 月之前
docker-compose.dev.yml 39afcd443f Added ability to call analytics report manually (#7805) 2 周之前
docker-compose.external_db.yml 08550f8d5f Support running CVAT with an external database via Docker Compose (#7055) 6 月之前
docker-compose.https.yml 6ae1cffdab Turn on Traefik access logs (#7109) 6 月之前
docker-compose.yml 9d2018f27e Added logging for `DatasetNotFound` error (#7778) 3 周之前
lint-staged.config.js 123c394028 Client code refactoring (#7208) 5 月之前
manage.py 57c23b08b7 Remove the DJANGO_CONFIGURATION environment variable 8 月之前
package.json 209826cdd9 Remove empty masks (#7295) 4 月之前
rqscheduler.py 9a600f3fa8 Honey pot server (#6204) 11 月之前
wait-for-it.sh eb9fba3685 Release 0.1.0 5 年之前
wait_for_deps.sh 48ab12b6bc Use separate services for storing job queues and cache (#7245) 5 月之前
yarn.lock 7e1fb254de Bump express from 4.18.2 to 4.19.2 (#7680) 1 月之前

README.md

CVAT Platform

Start Annotating Now

Computer Vision Annotation Tool (CVAT)

CI Gitter chat Discord Coverage Status server pulls ui pulls DOI

CVAT is an interactive video and image annotation tool for computer vision. It is used by tens of thousands of users and companies around the world. Our mission is to help developers, companies, and organizations around the world to solve real problems using the Data-centric AI approach.

Start using CVAT online: cvat.ai. You can use it for free, or subscribe to get unlimited data, organizations, autoannotations, and Roboflow and HuggingFace integration.

Or set CVAT up as a self-hosted solution: Self-hosted Installation Guide. We provide Enterprise support for self-hosted installations with premium features: SSO, LDAP, Roboflow and HuggingFace integrations, and advanced analytics (coming soon). We also do trainings and a dedicated support with 24 hour SLA.

Quick start ⚡

Partners ❤️

CVAT is used by teams all over the world. In the list, you can find key companies which help us support the product or an essential part of our ecosystem. If you use us, please drop us a line at contact@cvat.ai.

  • Human Protocol uses CVAT as a way of adding annotation service to the Human Protocol.
  • FiftyOne is an open-source dataset curation and model analysis tool for visualizing, exploring, and improving computer vision datasets and models that are tightly integrated with CVAT for annotation and label refinement.

Public datasets

ATLANTIS, an open-source dataset for semantic segmentation of waterbody images, developed by iWERS group in the Department of Civil and Environmental Engineering at the University of South Carolina is using CVAT.

For developing a semantic segmentation dataset using CVAT, see:

CVAT online: cvat.ai

This is an online version of CVAT. It's free, efficient, and easy to use.

cvat.ai runs the latest version of the tool. You can create up to 10 tasks there and upload up to 500Mb of data to annotate. It will only be visible to you or the people you assign to it.

For now, it does not have analytics features like management and monitoring the data annotation team. It also does not allow exporting images, just the annotations.

We plan to enhance cvat.ai with new powerful features. Stay tuned!

Prebuilt Docker images 🐳

Prebuilt docker images are the easiest way to start using CVAT locally. They are available on Docker Hub:

The images have been downloaded more than 1M times so far.

Screencasts 🎦

Here are some screencasts showing how to use CVAT.

Computer Vision Annotation Course: we introduce our course series designed to help you annotate data faster and better using CVAT. This course is about CVAT deployment and integrations, it includes presentations and covers the following topics:

  • Speeding up your data annotation process: introduction to CVAT and Datumaro. What problems do CVAT and Datumaro solve, and how they can speed up your model training process. Some resources you can use to learn more about how to use them.
  • Deployment and use CVAT. Use the app online at app.cvat.ai. A local deployment. A containerized local deployment with Docker Compose (for regular use), and a local cluster deployment with Kubernetes (for enterprise users). A 2-minute tour of the interface, a breakdown of CVAT’s internals, and a demonstration of how to deploy CVAT using Docker Compose.

Product tour: in this course, we show how to use CVAT, and help to get familiar with CVAT functionality and interfaces. This course does not cover integrations and is dedicated solely to CVAT. It covers the following topics:

  • Pipeline. In this video, we show how to use app.cvat.ai: how to sign up, upload your data, annotate it, and download it.

For feedback, please see Contact us

API

SDK

CLI

Supported annotation formats

CVAT supports multiple annotation formats. You can select the format after clicking the Upload annotation and Dump annotation buttons. Datumaro dataset framework allows additional dataset transformations with its command line tool and Python library.

For more information about the supported formats, see: Annotation Formats.

Annotation format Import Export
CVAT for images ✔️ ✔️
CVAT for a video ✔️ ✔️
Datumaro ✔️ ✔️
PASCAL VOC ✔️ ✔️
Segmentation masks from PASCAL VOC ✔️ ✔️
YOLO ✔️ ✔️
MS COCO Object Detection ✔️ ✔️
MS COCO Keypoints Detection ✔️ ✔️
MOT ✔️ ✔️
MOTS PNG ✔️ ✔️
LabelMe 3.0 ✔️ ✔️
ImageNet ✔️ ✔️
CamVid ✔️ ✔️
WIDER Face ✔️ ✔️
VGGFace2 ✔️ ✔️
Market-1501 ✔️ ✔️
ICDAR13/15 ✔️ ✔️
Open Images V6 ✔️ ✔️
Cityscapes ✔️ ✔️
KITTI ✔️ ✔️
Kitti Raw Format ✔️ ✔️
LFW ✔️ ✔️
Supervisely Point Cloud Format ✔️ ✔️

Deep learning serverless functions for automatic labeling

CVAT supports automatic labeling. It can speed up the annotation process up to 10x. Here is a list of the algorithms we support, and the platforms they can be run on:

Name Type Framework CPU GPU
Segment Anything interactor PyTorch ✔️ ✔️
Deep Extreme Cut interactor OpenVINO ✔️
Faster RCNN detector OpenVINO ✔️
Mask RCNN detector OpenVINO ✔️
YOLO v3 detector OpenVINO ✔️
YOLO v7 detector ONNX ✔️ ✔️
Object reidentification reid OpenVINO ✔️
Semantic segmentation for ADAS detector OpenVINO ✔️
Text detection v4 detector OpenVINO ✔️
SiamMask tracker PyTorch ✔️ ✔️
TransT tracker PyTorch ✔️ ✔️
f-BRS interactor PyTorch ✔️
HRNet interactor PyTorch ✔️
Inside-Outside Guidance interactor PyTorch ✔️
Faster RCNN detector TensorFlow ✔️ ✔️
Mask RCNN detector TensorFlow ✔️ ✔️
RetinaNet detector PyTorch ✔️ ✔️
Face Detection detector OpenVINO ✔️

License

The code is released under the MIT License.

This software uses LGPL-licensed libraries from the FFmpeg project. The exact steps on how FFmpeg was configured and compiled can be found in the Dockerfile.

FFmpeg is an open-source framework licensed under LGPL and GPL. See https://www.ffmpeg.org/legal.html. You are solely responsible for determining if your use of FFmpeg requires any additional licenses. CVAT.ai Corporation is not responsible for obtaining any such licenses, nor liable for any licensing fees due in connection with your use of FFmpeg.

Contact us

Gitter to ask CVAT usage-related questions. Typically questions get answered fast by the core team or community. There you can also browse other common questions.

Discord is the place to also ask questions or discuss any other stuff related to CVAT.

LinkedIn for the company and work-related questions.

YouTube to see screencast and tutorials about the CVAT.

GitHub issues for feature requests or bug reports. If it's a bug, please add the steps to reproduce it.

#cvat tag on StackOverflow is one more way to ask questions and get our support.

contact@cvat.ai to reach out to us if you need commercial support.

Links