MMTSA: Multi-Modal Temporal Segment Attention Network for Efficient Human Activity Recognition
Efficient multimodal HAR using camera and IMU. Achieves 11% F1-score improvement with lower latency on edge devices.
Efficient multimodal HAR using camera and IMU. Achieves 11% F1-score improvement with lower latency on edge devices.
Remote communication is essential for efficient collaboration among people at different locations. We present ConeSpeech, a virtual reality (VR) based multi-user remote …
Blood glucose measurement is commonly used to screen for and monitor diabetes, a chronic condition characterized by the inability to effectively modulate blood glucose that can …
Voice communication using an air-conduction microphone in noisy environments suffers from the degradation of speech audibility. Bone-conduction microphones (BCM) are robust against …
We present DRG-Keyboard, a gesture keyboard enabled by dual IMU rings, allowing the user to swipe the thumb on the index fingertip to perform word gesture typing as if typing on a …
An avatar mirroring the user's movement is commonly adopted in Virtual Reality(VR). Maintaining the user-avatar movement consistency provides the user a sense of body ownership and …
First mobile personalized rPPG system using dual cameras. Self-supervised learning without gold standard data.
The trend of IoT brings more and more connected smart devices into our daily lives, which can enable a ubiquitous sensing and interaction experience. However, augmenting many …
We present DualRing, a novel ring-form input device that can capture the state and movement of the user's hand and fingers. With two IMU rings attached to the user's thumb and …