Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pair video clips and captions #11

Open
lininglouis opened this issue Feb 23, 2021 · 0 comments
Open

pair video clips and captions #11

lininglouis opened this issue Feb 23, 2021 · 0 comments

Comments

@lininglouis
Copy link

Hi, thanks for sharing the code!
I have some questions on how you construct the clip-caption from one video.

  1. what if one cation cross multiple clips?
    In your paper, Figure2 shows the clip-caption pairs. The caption "two stiches on two and we'll slip stitch" corresponds to two clips, as your figure shows. Did you segment the video into shots, and assign each caption with its nearest caption? ( Another way is segmenting the video by captions, and for each caption, find its nearest video. I dont think you use this way, since in such way, one caption cannot match two clips.)

2 did you use all the clip-captions within one video?
Since one video might contain lots clip-caption pairs, suppose a video might contain 1000 clip-caption pairs. Did you use all 1000 pairs in the howto100M dataset? Is there any selection work on those pairs?

I would appreciate your reply. Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant