What this project does

Synchronises audio and video based on the lip movement of speakers. This is a simplistic implementation and is far from generic. It detects points where speech starts after a period of silence, and at each such point in the video it looks for a matching point nearby in the audio timeline. The first audio-video offset found this way is used to correct the mismatch between audio and video.
MediaPipe is used for detecting the points on the face and lips.
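
The onset-matching idea described above can be sketched roughly as follows. This is a minimal illustration and not the code in main.py: the function names, the per-frame boolean inputs (lips moving, voice active) and the `min_silence` and `max_shift` parameters are all assumptions made for the example.

```python
from typing import List, Optional


def speech_onsets(active: List[bool], min_silence: int = 3) -> List[int]:
    """Return frame indices where activity starts after at least
    `min_silence` consecutive inactive frames (hypothetical helper)."""
    onsets = []
    # Treat the start of the stream as if it were preceded by silence.
    silent_run = min_silence
    for i, is_active in enumerate(active):
        if is_active:
            if silent_run >= min_silence:
                onsets.append(i)
            silent_run = 0
        else:
            silent_run += 1
    return onsets


def first_offset(video_onsets: List[int],
                 audio_onsets: List[int],
                 max_shift: int = 10) -> Optional[int]:
    """Return the offset (audio frame minus video frame) for the first
    video onset that has an audio onset within `max_shift` frames,
    or None if no such pair exists."""
    for v in video_onsets:
        nearby = [a for a in audio_onsets if abs(a - v) <= max_shift]
        if nearby:
            closest = min(nearby, key=lambda a: abs(a - v))
            return closest - v
    return None


# Assumed example data: the voice track lags the lip movement by 2 frames.
lips = [False] * 3 + [True] * 2 + [False] * 4 + [True] * 2
voice = [False] * 5 + [True] * 2 + [False] * 4 + [True] * 2

offset = first_offset(speech_onsets(lips), speech_onsets(voice))
print(offset)  # 2
```

Under this convention a positive offset means the audio lags the video by that many frames, so the audio would need to be shifted earlier by the same amount. Using only the first match mirrors the "first offset found" behaviour described above; a more robust variant might compare several onsets before deciding.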

Running

Install the necessary Python packages, then run python3 main.py.

Install requirements

TODO.

Attribution

The Pause Video: https://www.youtube.com/watch?v=7l1Tom9q8Ic
Voice activity detection: https://github.com/wiseman/py-webrtcvad

