ActiveEye: enabling continuous and responsive video understanding for smart eyewear systems

Xu, Z; Lu, T; Zhao, Y; Wang, Y; Dong, M; Chang, Y; Lv, Q; Dick, RP; Yang, F; Lu, T; Gu, N; Shang, L

Journal article

ActiveEye: enabling continuous and responsive video understanding for smart eyewear systems

Abstract:: Integrating vision-language models (VLMs) with wearable devices offers great potential for continuous and responsive video understanding, a key capability for applications such as smart eyewear-based conversational assistants. However, achieving this on resource-constrained devices is challenging due to the high energy demands of continuous spatial-temporal sampling and transmission. We propose ActiveEye , a VLM designed for energy-efficient and responsive video understanding. ActiveEye separates visual and motion semantic representations and incorporates an active perception-based feedback path to adaptively adjust spatial-temporal sampling and transmission rates. Implemented as a wearable-mobile-cloud system, ActiveEye is evaluated for energy efficiency, real-time semantic change detection, and video understanding in both laboratory and field studies. Using the EgoSchema dataset, ActiveEye reduces the front-end energy consumption by 49.14%, supporting 8.37 hours of continuous operation on a 2.1 Wh battery. It achieves the highest F1 score (0.80) and the lowest average time difference (1.30 s) compared with heuristic-based event detection algorithms, validating its timely semantic detection. Furthermore, ActiveEye achieves a visual question answering (VQA) accuracy of 61.6%, which is comparable to state-of-the-art VLM agents, despite their reliance on larger language decoders and more computationally intensive frame selection strategies. Two rounds of in-field user evaluations further confirm its effectiveness in real-world settings, demonstrating its practical viability as a continuous and responsive video understanding system, conversational assistant, and wearable companion.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Xu, Z., Lu, T., Zhao, Y., Wang, Y., Dong, M., Chang, Y., Lv, Q., Dick, R. P., Yang, F., Lu, T., Gu, N., & Shang, L. (2025). ActiveEye: enabling continuous and responsive video understanding for smart eyewear systems. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 9(4), 1–33.

MLA Style

Xu, Z, et al. “ActiveEye: Enabling Continuous and Responsive Video Understanding for Smart Eyewear Systems.” Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, vol. 9, no. 4, 2025, pp. 1–33.

Chicago Style

Xu, Z, T Lu, Y Zhao, et al. 2025. “ActiveEye: Enabling Continuous and Responsive Video Understanding for Smart Eyewear Systems.” Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 9 (4): 1–33.
Print

Access Document

Files:: Xu_et_al_2025_ActiveEye_enabling_continuous.pdf

(Preview, Accepted manuscript, pdf, 10.5MB, Terms of use)

Publisher copy:: 10.1145/3770641

Authors

+ Xu, Z More by this author

Role:: Author

+ Lu, T More by this author

Role:: Author

+ Zhao, Y More by this author

Role:: Author

+ Wang, Y More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author
ORCID:: 0000-0002-6220-029X

+ Dong, M More by this author

Role:: Author

More authors...

+ Department for Science, Innovation and Technology More from this funder

Funder identifier:: https://ror.org/028z36n30
Grant:: K250071-101

+ Jiangsu Basic Research Program More from this funder

Grant:: BK20240414

+ Suzhou Industrial Park Leadership Talent Program More from this funder

Grant:: KJQ2024204

Publisher:: Association for Computing Machinery
Journal:: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies More from this journal
Volume:: 9
Issue:: 4
Pages:: 1-33
Article number:: 228
Publication date:: 2025-12-02
Acceptance date:: 2025-09-19
DOI:: 10.1145/3770641
EISSN:: 2474-9567

Language:: English
Keywords:: smart eyewear

video understanding

energy-efficient

responsive
Pubs id:: 2348944
Local pid:: pubs:2348944
Deposit date:: 2026-03-09
ARK identifier:: ark:/29072/ora_0f8b5d718e4443bfb552fc3ba25c876c

Terms of use

Copyright holder:: Xu et al.
Notes:: The author accepted manuscript (AAM) of this paper has been made available under the University of Oxford's Open Access Publications Policy, and a CC BY public copyright licence has been applied.

Licence:: CC Attribution (CC BY)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Journal article

ActiveEye: enabling continuous and responsive video understanding for smart eyewear systems

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Journal article

ActiveEye: enabling continuous and responsive video understanding for smart eyewear systems

Actions

Access Document

Authors

Funding

Bibliographic Details

Item Description

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions