SyncNet based on Meta's Perception Encoder Audio-Visual (PE-AV)
pytorch lip-sync lipsync time-synchronization multimodal-learning audio-visual lip active-speaker-detection time-delay-estimation syncnet lip-sync-detection audio-visual-sync
-
Updated
Jan 12, 2026 - Python