
VLLM data labeler by Aasman Bashyal
The "VLLM data labeler" is a browser add-on designed to assist users in creating labeled datasets from YouTube videos. This tool allows users to easily capture a frame from a playing YouTube video.
Extension Metadata
About this extension
The VLLM Data Labeler is a browser add-on that turns YouTube videos into a source of labeled image data for AI development. It integrates into YouTube watch pages and lets users capture specific frames from a video. Each captured frame can then be annotated with custom labels, each giving a description and a location within the image.
Export is flexible: users can save individual frames as JPEG images or export them together with their associated labels in a structured JSON format. This makes the tool suited to building datasets for training Visual Large Language Models (VLLMs) and other computer vision applications. Automatic timestamp-based filenames and persistent local storage keep the workflow organized and uninterrupted. The user interface is draggable and collapsible, so it stays out of the way and lets you annotate without disrupting video playback. The add-on is aimed at researchers and developers who want to streamline the creation of annotated image datasets from video content.
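As a rough illustration of what a timestamp-based filename and a JSON export of labeled frames could look like, here is a minimal TypeScript sketch. The add-on's actual JSON schema and naming scheme are not documented in this listing, so every name below (`Label`, `FrameRecord`, `frameFilename`, `buildRecord`, `exportJson`, and the bounding-box location format) is a hypothetical assumption rather than the extension's real API.

```typescript
// Hypothetical label shape: a description plus an assumed pixel-space bounding box.
interface Label {
  description: string;
  box: { x: number; y: number; width: number; height: number };
}

// Hypothetical record tying one captured frame to its labels.
interface FrameRecord {
  videoId: string;
  timestampSeconds: number;
  imageFile: string;
  labels: Label[];
}

// Zero-pad a non-negative integer to at least two digits.
function pad2(n: number): string {
  return n < 10 ? "0" + n : String(n);
}

// Build a filename from the playback position (assumed scheme: id_HH-MM-SS.jpg).
function frameFilename(videoId: string, seconds: number): string {
  const total = Math.floor(seconds);
  const h = pad2(Math.floor(total / 3600));
  const m = pad2(Math.floor((total % 3600) / 60));
  const s = pad2(total % 60);
  return `${videoId}_${h}-${m}-${s}.jpg`;
}

// Combine a capture position and its labels into one exportable record.
function buildRecord(videoId: string, seconds: number, labels: Label[]): FrameRecord {
  return {
    videoId,
    timestampSeconds: seconds,
    imageFile: frameFilename(videoId, seconds),
    labels,
  };
}

// Serialize all records as pretty-printed JSON.
function exportJson(records: FrameRecord[]): string {
  return JSON.stringify(records, null, 2);
}
```

Under these assumptions, a frame captured 83.4 seconds into a video with the placeholder ID `abc123` would be named `abc123_00-01-23.jpg`, and the exported JSON would reference that filename next to the frame's labels.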
Rating: 0 (1 user)
Permissions and data
Required permissions:
- Access your data for www.youtube.com
More information