Repository navigation
vot
- Website
- Wikipedia
Небольшое расширение, которое добавляет закадровый перевод видео из YaBrowser в другие браузеры
The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user-interfaces (GUIs) by using only natural language. Uses Visualization-of-Thought and Chain-of-Thought reasoning to elicit spatial reasoning and perception, emulates, plans and simulates synthetic HID interactions.
[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention
This is a re-implementation of Siamese-RPN with pytorch, which is CVPR2018 spotlight.
The official VOT Challenge evaluation and analysis toolkit
Discriminative and Robust Online Learning for Siamese Visual Tracking (AAAI 2020)
Official Implementation of Towards Sequence-Level Training for Visual Tracking (ECCV 2022)
An unofficial library for interacting with Yandex VOT API, providing easy-to-use methods and enhanced functionality.
[ECCVW2018] A Memory Model based on the Siamese Network for Long-term Tracking (MMLT)
Modifications to improve single object tracking in 360° equirectangular videos.
Matlab code for several visual tracking algorithms
Automatic Measurement of Voice Onset Time (VOT) using Deep Recurrent Neural Networks (Interspeech 2016)
This repository contains several trackers integrated for VOT toolkit and their evaluation results.
Efficient Visual Tracking with Stacked Channel-Spatial Attention Learning
This work proposes a feature refined end-to-end tracking framework with a balanced performance using a high-level feature refine tracking framework. The feature refine module enhances the target feature representation power that allows the network to capture salient information to locate the target. The attention module is employed inside the feature refine mechanism to improve network discrimination power that augments the network ability to track the target in challenging scenarios.